Description

The present invention relates to the field of molecular biology. In particular, it relates to, among other things,
nucleotide sequences of Staphylococcus aureus, contigs, ORFs, fragments, probes, primers and related polynucleotides
thereof, peptides and polypeptides encoded by the sequences, and uses of the polynucleotides and sequences thereof,
such as in fermentation, polypeptide production, assays and pharmaceutical development, among others.

The genus Staphylococcus includes at least 20 distinct species. (For a review see Novick, R. P., The Staphylococcus
as a Molecular Genetic System, Chapter 1, pgs. 1-37 in MOLECULAR BIOLOGY OF THE STAPHYLOCOCCI,
R. Novick, Ed., VCH Publishers, New York (1990)). Species differ from one another by 80% or more, by hybridization
kinetics, whereas strains within a species are at least 90% identical by the same measure.

The species Staphylococcus aureus, a gram-positive, facultatively aerobic, clump-forming coccus, is among the
most important etiological agents of bacterial infection in humans, as discussed briefly below. 

Human Health and S. aureus

Staphylococcus aureus is a ubiquitous pathogen. (See, for instance, Mims et al., MEDICAL MICROBIOLOGY,
Mosby-Year Book Europe Limited, London, UK (1993)). It is an etiological agent of a variety of conditions, ranging in
severity from mild to fatal. A few of the more common conditions caused by S. aureus infection are burns, cellulitis,
eyelid infections, food poisoning, joint infections, neonatal conjunctivitis, osteomyelitis, skin infections, surgical wound
infection, scalded skin syndrome and toxic shock syndrome, some of which are described further below.

Burns

Burn wounds generally are sterile initially. However, they generally compromise physical and immune barriers to
infection, cause loss of fluid and electrolytes and result in local or general physiological dysfunction. After cooling,
contact with viable bacteria results in mixed colonization at the injury site. Infection may be restricted to the non-viable
debris on the burn surface ('eschar'); it may progress into full skin infection and invade viable tissue below the eschar;
or it may reach below the skin, enter the lymphatic and blood circulation and develop into septicaemia. S. aureus is
among the most important pathogens typically found in burn wound infections. It can destroy granulation tissue and
produce severe septicaemia.

Cellulitis 

Cellulitis, an acute infection of the skin that expands from a typically superficial origin to spread below the cutaneous
layer, most commonly is caused by S. aureus in conjunction with S. pyogenes. Cellulitis can lead to systemic infection.
In fact, cellulitis can be one aspect of synergistic bacterial gangrene. This condition typically is caused by a mixture of
S. aureus and microaerophilic streptococci. It causes necrosis and treatment is limited to excision of the necrotic tissue.
The condition often is fatal.

Eyelid infections

S. aureus is the cause of styes and of "sticky eye" in neonates, among other eye infections. Typically such infections
are limited to the surface of the eye, but they may occasionally penetrate the surface with more severe consequences.

Food poisoning

Some strains of S. aureus produce one or more of five serologically distinct, heat and acid stable enterotoxins that
are not destroyed by the digestive processes of the stomach and small intestine (enterotoxins A-E). Ingestion of the toxin,
in sufficient quantities, typically results in severe vomiting, but not diarrhoea. The effect does not require viable bacteria.
Although the toxins are known, their mechanism of action is not understood.

Joint infections 

S. aureus infects bone joints, causing diseases such as osteomyelitis.

Osteomyelitis

S. aureus is the most common causative agent of haematogenous osteomyelitis. The disease tends to occur in 






EP 0 786 519 A2 



children and adolescents more than adults and it is associated with non-penetrating injuries to bones. Infection typically
occurs at the ends of long, growing bones, hence its occurrence in physically immature populations. Most often, infection
is localized in the vicinity of sprouting capillary loops adjacent to epiphysial growth plates in the end of long, growing
bones.

Skin infections 

S. aureus is the most common pathogen of such minor skin infections as abscesses and boils. Such infections
often are resolved by normal host response mechanisms, but they also can develop into severe internal infections.
Recurrent infections of the nasal passages plague nasal carriers of S. aureus.

Surgical Wound Infections 

Surgical wounds often penetrate far into the body. Infection of such wounds thus poses a grave risk to the patient.
S. aureus is the most important causative agent of infections in surgical wounds. S. aureus is unusually adept at
invading surgical wounds; sutured wounds can be infected by far fewer S. aureus cells than are necessary to cause
infection in normal skin. Invasion of a surgical wound can lead to severe S. aureus septicaemia. Invasion of the blood
stream by S. aureus can lead to seeding and infection of internal organs, particularly heart valves and bone, causing
systemic diseases, such as endocarditis and osteomyelitis.

Scalded Skin Syndrome 

S. aureus is responsible for "scalded skin syndrome" (also called toxic epidermal necrosis, Ritter's disease and
Lyell's disease). This disease occurs in older children, typically in outbreaks caused by flowering of S. aureus strains
that produce exfoliatin (also called scalded skin syndrome toxin). Although the bacteria initially may infect only a minor
lesion, the toxin destroys intercellular connections, spreads epidermal layers and allows the infection to penetrate the
outer layer of the skin, producing the desquamation that typifies the disease. Shedding of the outer layer of skin
generally reveals normal skin below, but fluid lost in the process can produce severe injury in young children if it is not
treated properly.

Toxic Shock Syndrome 

Toxic shock syndrome is caused by strains of S. aureus that produce the so-called toxic shock syndrome toxin.
The disease can be caused by S. aureus infection at any site, but it is too often erroneously viewed exclusively as a
disease of women who use tampons. The disease involves toxaemia and septicaemia, and can be fatal.

Nosocomial Infections

In the 1984 National Nosocomial Infection Surveillance Study ("NNIS"), S. aureus was the most prevalent agent
of surgical wound infections in many hospital services, including medicine, surgery, obstetrics, pediatrics and newborns.

Resistance to drugs of S. aureus strains 

Prior to the introduction of penicillin the prognosis for patients seriously infected with S. aureus was unfavorable.
Following the introduction of penicillin in the early 1940s even the worst S. aureus infections generally could be treated
successfully. The emergence of penicillin-resistant strains of S. aureus did not take long, however. Most strains of S.
aureus encountered in hospital infections today do not respond to penicillin, although, fortunately, this is not the case
for S. aureus encountered in community infections.

It is well known now that penicillin-resistant strains of S. aureus produce a lactamase which converts penicillin to
penicilloic acid, and thereby destroys antibiotic activity. Furthermore, the lactamase gene often is propagated
episomally, typically on a plasmid, and often is only one of several genes on an episomal element that, together, confer
multidrug resistance.

Methicillins, introduced in the 1960s, largely overcame the problem of penicillin resistance in S. aureus. These
compounds conserve the portions of penicillin responsible for antibiotic activity and modify or alter other portions that
make penicillin a good substrate for inactivating lactamases. However, methicillin resistance has emerged in S. aureus,
along with resistance to many other antibiotics effective against this organism, including aminoglycosides, tetracycline,
chloramphenicol, macrolides and lincosamides. In fact, methicillin-resistant strains of S. aureus generally are multiply
drug resistant.

The molecular genetics of most types of drug resistance in S. aureus has been elucidated (See Lyon et al.,
Microbiology Reviews 51: 88-134 (1987)). Generally, resistance is mediated by plasmids, as noted above regarding penicillin
resistance; however, several stable forms of drug resistance have been observed that apparently involve integration
of a resistance element into the S. aureus genome itself.

Thus far, each new antibiotic has given rise to resistant strains, strains have emerged that are resistant to multiple
drugs, and increasingly persistent forms of resistance have begun to emerge. Drug resistance of S. aureus infections
already poses significant treatment difficulties, which are likely to get much worse unless new therapeutic agents are
developed.

Molecular Genetics of Staphylococcus aureus

Despite its importance in, among other things, human disease, relatively little is known about the genome of this 
organism. 

Most genetic studies of S. aureus have been carried out using the strain NCTC8325, which contains prophages
φ11, φ12 and φ13, and the UV-cured derivative of this strain, 8325-4 (also referred to as RN450), which is free of
the prophages.

These studies revealed that the S. aureus genome, like that of other staphylococci, consists of one circular,
covalently closed, double-stranded DNA and a collection of so-called variable accessory genetic elements, such as
prophages, plasmids, transposons and the like.

Physical characterization of the genome has not been carried out in any detail. Pattee et al. published a low
resolution and incomplete genetic and physical map of the chromosome of S. aureus strain NCTC 8325. (Pattee et al.,
Genetic and Physical Mapping of Chromosome of Staphylococcus aureus NCTC 8325, Chapter 11, pgs. 163-169 in
MOLECULAR BIOLOGY OF THE STAPHYLOCOCCI, R. P. Novick, Ed., VCH Publishers, New York (1990)). The genetic
map largely was produced by mapping insertions of Tn551 and Tn4001, which, respectively, confer erythromycin and
gentamicin resistance, and by analysis of SmaI-digested DNA by Pulsed Field Gel Electrophoresis ("PFGE").

The map was of low resolution; even estimating the physical size of the genome was difficult, according to the
investigators. The size of the largest SmaI chromosome fragment, for instance, was too large for accurate sizing by
PFGE. To estimate its size, additional restriction sites had to be introduced into the chromosome using a transposon
containing a SmaI recognition sequence.

In sum, most physical characteristics and almost all of the genes of Staphylococcus aureus are unknown. Among
the few genes that have been identified, most have not been physically mapped or characterized in detail. Only a very
few genes of this organism have been sequenced. (See, for instance, Thornsberry, J. Antimicrobial Chemotherapy 21
Suppl C: 9-16 (1986), current versions of GENBANK and other nucleic acid databases, and references that relate to
the genome of S. aureus such as those set out elsewhere herein.)

It is clear that the etiology of diseases mediated or exacerbated by S. aureus infection involves the programmed
expression of S. aureus genes, and that characterizing the genes and their patterns of expression would add
dramatically to our understanding of the organism and its host interactions. Knowledge of S. aureus genes and genomic
organization would dramatically improve understanding of disease etiology and lead to improved and new ways of
preventing, ameliorating, arresting and reversing diseases. Moreover, characterized genes and genomic fragments of
S. aureus would provide reagents for, among other things, detecting, characterizing and controlling S. aureus infections.

There is a need therefore to characterize the genome of S. aureus and for polynucleotides and sequences of this
organism.

The present invention is based on the sequencing of fragments of the Staphylococcus aureus genome. The primary
nucleotide sequences which were generated are provided in SEQ ID NOS:1-5,191.

The present invention provides the nucleotide sequence of several thousand contigs of the Staphylococcus aureus
genome, which are listed in tables below and set out in the Sequence Listing submitted herewith, and representative
fragments thereof, in a form which can be readily used, analyzed, and interpreted by a skilled artisan. In one
embodiment, the present invention is provided as contiguous strings of primary sequence information corresponding to the
nucleotide sequences depicted in SEQ ID NOS:1-5,191.

The present invention further provides nucleotide sequences which are at least 95%, preferably 99% and most
preferably 99.9%, identical to the nucleotide sequences of SEQ ID NOS:1-5,191.

The nucleotide sequence of SEQ ID NOS:1-5,191, a representative fragment thereof, or a nucleotide sequence
which is at least 95%, preferably 99% and most preferably 99.9%, identical to the nucleotide sequence of SEQ ID
NOS:1-5,191 may be provided in a variety of mediums to facilitate its use. In one application of this embodiment, the
sequences of the present invention are recorded on computer readable media. Such media include, but are not limited
to: magnetic storage media, such as floppy discs, hard disc storage medium, and magnetic tape; optical storage media
such as CD-ROM; electrical storage media such as RAM and ROM; and hybrids of these categories such as magnetic/
optical storage media.

The present invention further provides systems, particularly computer-based systems, which contain the sequence
information herein described stored in a data storage means. Such systems are designed to identify commercially
important fragments of the Staphylococcus aureus genome.

Another embodiment of the present invention is directed to fragments, preferably isolated fragments, of the
Staphylococcus aureus genome having particular structural or functional attributes. Such fragments of the Staphylococcus
aureus genome of the present invention include, but are not limited to, fragments which encode peptides, hereinafter
referred to as "open reading frames" or "ORFs," fragments which modulate the expression of an operably linked ORF,
hereinafter referred to as "expression modulating fragments" or "EMFs," and fragments which can be used to diagnose
the presence of Staphylococcus aureus in a sample, hereinafter referred to as "diagnostic fragments" or "DFs."

Each of the ORFs in fragments of the Staphylococcus aureus genome disclosed in Tables 1-3, and the EMFs
found 5' to the ORFs, can be used in numerous ways as polynucleotide reagents. For instance, the sequences can be
used as diagnostic probes or amplification primers for detecting or determining the presence of a specific microbe in
a sample, to selectively control gene expression in a host and in the production of polypeptides, such as polypeptides
encoded by ORFs of the present invention, particularly those polypeptides that have a pharmacological activity.

The present invention further includes recombinant constructs comprising one or more fragments of the
Staphylococcus aureus genome of the present invention. The recombinant constructs of the present invention comprise
vectors, such as a plasmid or viral vector, into which a fragment of the Staphylococcus aureus genome has been inserted.

The present invention further provides host cells containing any of the isolated fragments of the Staphylococcus
aureus genome of the present invention. The host cells can be a higher eukaryotic host cell, such as a mammalian
cell, a lower eukaryotic cell, such as a yeast cell, or a prokaryotic cell such as a bacterial cell.

The present invention is further directed to polypeptides and proteins, preferably isolated polypeptides and
proteins, encoded by ORFs of the present invention. A variety of methods, well known to those of skill in the art, routinely
may be utilized to obtain any of the polypeptides and proteins of the present invention. For instance, polypeptides and
proteins of the present invention having relatively short, simple amino acid sequences readily can be synthesized using
commercially available automated peptide synthesizers. Polypeptides and proteins of the present invention also may
be purified from bacterial cells which naturally produce the protein. Yet another alternative is to purify polypeptides and
proteins of the present invention from cells which have been altered to express them.

The invention further provides polypeptides, preferably isolated polypeptides, comprising Staphylococcus aureus
epitopes and vaccine compositions comprising such polypeptides. Also provided are methods for vaccinating an
individual against Staphylococcus aureus infection.

The invention further provides methods of obtaining homologs of the fragments of the Staphylococcus aureus
genome of the present invention and homologs of the proteins encoded by the ORFs of the present invention.
Specifically, by using the nucleotide and amino acid sequences disclosed herein as a probe or as primers, and techniques
such as PCR cloning and colony/plaque hybridization, one skilled in the art can obtain homologs.

The invention further provides antibodies which selectively bind polypeptides and proteins of the present invention.
Such antibodies include both monoclonal and polyclonal antibodies.

The invention further provides hybridomas which produce the above-described antibodies. A hybridoma is an
immortalized cell line which is capable of secreting a specific monoclonal antibody. 

The present invention further provides methods of identifying test samples derived from cells which express one
of the ORFs of the present invention, or a homolog thereof. Such methods comprise incubating a test sample with one
or more of the antibodies of the present invention, or one or more of the DFs or antigens of the present invention, under
conditions which allow a skilled artisan to determine if the sample contains the ORF or product produced therefrom.

In another embodiment of the present invention, kits are provided which contain the necessary reagents to carry
out the above-described assays.

Specifically, the invention provides a compartmentalized kit to receive, in close confinement, one or more containers
which comprises: (a) a first container comprising one of the antibodies, antigens, or one of the DFs of the present
invention; and (b) one or more other containers comprising one or more of the following: wash reagents, reagents
capable of detecting presence of bound antibodies, antigens or hybridized DFs.

Using the isolated proteins of the present invention, the present invention further provides methods of obtaining
and identifying agents capable of binding to a polypeptide or protein encoded by one of the ORFs of the present
invention. Specifically, such agents include, as further described below, antibodies, peptides, carbohydrates,
pharmaceutical agents and the like. Such methods comprise steps of: (a) contacting an agent with an isolated protein encoded
by one of the ORFs of the present invention; and (b) determining whether the agent binds to said protein.

The present genomic sequences of Staphylococcus aureus will be of great value to all laboratories working with
this organism and for a variety of commercial purposes. Many fragments of the Staphylococcus aureus genome will
be immediately identified by similarity searches against GenBank or protein databases and will be of immediate value
to Staphylococcus aureus researchers and of immediate commercial value for the production of proteins or to control
gene expression.

The methodology and technology for elucidating extensive genomic sequences of bacterial and other genomes
have enhanced, and will greatly enhance, the ability to analyze and understand chromosomal organization. In particular,
sequenced contigs and genomes will provide the models for developing tools for the analysis of chromosome structure
and function, including the ability to identify genes within large segments of genomic DNA, the structure, position, and
spacing of regulatory elements, the identification of genes with potential industrial applications, and the ability to do
comparative genomics and molecular phylogeny.

FIGURE 1 is a block diagram of a computer system (102) that can be used to implement computer-based systems
of the present invention.

FIGURE 2 is a schematic diagram depicting the data flow and computer programs used to collect, assemble, edit
and annotate the contigs of the Staphylococcus aureus genome of the present invention. Both Macintosh and Unix
platforms are used to handle the AB 373 and 377 sequence data files, largely as described in Kerlavage et al.,
Proceedings of the Twenty-Sixth Annual Hawaii International Conference on System Sciences, 585, IEEE Computer
Society Press, Washington D.C. (1993). Factura (AB) is a Macintosh program designed for automatic vector sequence
removal and end-trimming of sequence files. The program Loadis runs on a Macintosh platform and parses the feature
data extracted from the sequence files by Factura to the Unix based Staphylococcus aureus relational database.
Assembly of contigs (and whole genome sequences) is accomplished by retrieving a specific set of sequence files and
their associated features using extrseq, a Unix utility for retrieving sequences from an SQL database. The resulting
sequence file is processed by seq_filter to trim portions of the sequences with more than 2% ambiguous nucleotides.
The sequence files were assembled using TIGR Assembler, an assembly engine designed at The Institute for Genomic
Research ("TIGR") for rapid and accurate assembly of thousands of sequence fragments. The collection of contigs
generated by the assembly step is loaded into the database with the lassie program. Identification of open reading
frames (ORFs) is accomplished by processing contigs with zorf. The ORFs are searched against S. aureus sequences
from GenBank and against all protein sequences using the BLASTN and BLASTP programs, described in Altschul et
al., J. Mol. Biol. 215: 403-410 (1990). Results of the ORF determination and similarity searching steps were loaded
into the database. As described below, some results of the determination and the searches are set out in Tables 1-3.
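
The ambiguity filtering and ORF identification steps described above can be illustrated with a minimal Python sketch. The internals of seq_filter and zorf are not set out here, so the function names (ambiguous_fraction, passes_filter, find_orfs), the whole-read check in place of trimming portions of a read, the ATG start codon, the forward-strand-only scan and the minimum ORF length are all assumptions made purely for illustration; only the 2% threshold comes from the text.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}
UNAMBIGUOUS = set("ACGT")


def ambiguous_fraction(seq):
    """Return the fraction of bases that are not A, C, G or T."""
    seq = seq.upper()
    if not seq:
        return 0.0
    ambiguous = sum(1 for base in seq if base not in UNAMBIGUOUS)
    return ambiguous / len(seq)


def passes_filter(seq, max_ambiguous=0.02):
    """Hypothetical whole-read check against the 2% ambiguity threshold."""
    return ambiguous_fraction(seq) <= max_ambiguous


def find_orfs(seq, min_codons=3):
    """Return (start, end, frame) for ATG...stop ORFs on the forward strand.

    start and end are 0-based and end-exclusive; end includes the stop codon.
    min_codons counts the start codon but not the stop codon.
    """
    seq = seq.upper()
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i  # open a candidate ORF at the first ATG
            elif start is not None and codon in STOP_CODONS:
                if (i - start) // 3 >= min_codons:
                    orfs.append((start, i + 3, frame))
                start = None  # look for the next ATG in this frame
    return orfs
```

A production ORF finder would also scan the reverse complement and might accept alternative bacterial start codons; the sketch omits both to keep the reading-frame logic visible.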

The present invention is based on the sequencing of fragments of the Staphylococcus aureus genome and analysis
of the sequences. The primary nucleotide sequences generated by sequencing the fragments are provided in SEQ ID
NOS:1-5,191. (As used herein, the "primary sequence" refers to the nucleotide sequence represented by the IUPAC
nomenclature system.)
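
As an illustration of what a primary sequence in IUPAC nomenclature looks like to a program, the following Python sketch checks that a string uses only IUPAC nucleotide codes and counts how many unambiguous sequences it could stand for. The code table is the standard IUPAC nucleotide alphabet; the function names and the example string are hypothetical and are not taken from the Sequence Listing.

```python
# Standard IUPAC nucleotide codes mapped to the bases each code may represent.
IUPAC_NUCLEOTIDES = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT",
    "K": "GT", "M": "AC", "B": "CGT", "D": "AGT",
    "H": "ACT", "V": "ACG", "N": "ACGT",
}


def is_iupac_sequence(seq):
    """True if seq is non-empty and every symbol is an IUPAC nucleotide code."""
    return bool(seq) and all(base in IUPAC_NUCLEOTIDES for base in seq.upper())


def count_concrete_sequences(seq):
    """Number of unambiguous A/C/G/T sequences consistent with seq."""
    if not is_iupac_sequence(seq):
        raise ValueError("sequence contains non-IUPAC symbols")
    total = 1
    for base in seq.upper():
        total *= len(IUPAC_NUCLEOTIDES[base])
    return total
```

For example, the hypothetical string "ACRN" is a valid primary sequence under this check and is consistent with eight unambiguous sequences, since R stands for two bases and N for four.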

In addition to the aforementioned Staphylococcus aureus polynucleotide and polynucleotide sequences, the
present invention provides the nucleotide sequences of SEQ ID NOS:1-5,191, or representative fragments thereof, in
a form which can be readily used, analyzed, and interpreted by a skilled artisan.

As used herein, a "representative fragment of the nucleotide sequence depicted in SEQ ID NOS:1-5,191" refers
to any portion of SEQ ID NOS:1-5,191 which is not presently represented within a publicly available database.
Preferred representative fragments of the present invention are Staphylococcus aureus open reading frames ("ORFs"),
expression modulating fragments ("EMFs") and fragments which can be used to diagnose the presence of Staphylococcus
aureus in a sample ("DFs"). A non-limiting identification of preferred representative fragments is provided in Tables
1-3.

As discussed in detail below, the information provided in SEQ ID NOS:1-5,191 and in Tables 1-3 together with
routine cloning, synthesis, sequencing and assay methods will enable those skilled in the art to clone and sequence
all "representative fragments" of interest, including open reading frames encoding a large variety of Staphylococcus
aureus proteins.

While the presently disclosed sequences of SEQ ID NOS:1-5,191 are highly accurate, sequencing techniques are
not perfect and, in relatively rare instances, further investigation of a fragment or sequence of the invention may reveal
a nucleotide sequence error present in a nucleotide sequence disclosed in SEQ ID NOS:1-5,191. However, once the
present invention is made available (i.e., once the information in SEQ ID NOS:1-5,191 and Tables 1-3 has been made
available), resolving a rare sequencing error in SEQ ID NOS:1-5,191 will be well within the skill of the art. The present
disclosure makes available sufficient sequence information to allow any of the described contigs or portions thereof to
be obtained readily by straightforward application of routine techniques. Further sequencing of such polynucleotides
may proceed in like manner using manual and automated sequencing methods which are employed ubiquitously in the
art. Nucleotide sequence editing software is publicly available. For example, Applied Biosystems' (AB) AutoAssembler
can be used as an aid during visual inspection of nucleotide sequences. By employing such routine techniques potential
errors readily may be identified and the correct sequence then may be ascertained by targeting further sequencing
effort, also of a routine nature, to the region containing the potential error.

Even if all of the very rare sequencing errors in SEQ ID NOS:1-5,191 were corrected, the resulting nucleotide
sequences would still be at least 95% identical, nearly all would be at least 99% identical, and the great majority would
be at least 99.9% identical to the nucleotide sequences of SEQ ID NOS:1-5,191.

As discussed elsewhere herein, polynucleotides of the present invention readily may be obtained by routine
application of well known and standard procedures for cloning and sequencing DNA. Detailed methods for obtaining
libraries and for sequencing are provided below, for instance. A wide variety of Staphylococcus aureus strains that can
be used to prepare S. aureus genomic DNA for cloning and for obtaining polynucleotides of the present invention are
available to the public from recognized depository institutions, such as the American Type Culture Collection (ATCC).

The nucleotide sequences of the genomes from different strains of Staphylococcus aureus differ somewhat.
However, the nucleotide sequences of the genomes of all Staphylococcus aureus strains will be at least 95% identical, in
corresponding part, to the nucleotide sequences provided in SEQ ID NOS:1-5,191. Nearly all will be at least 99%
identical and the great majority will be 99.9% identical.

Thus, the present invention further provides nucleotide sequences which are at least 95%, preferably 99% and
most preferably 99.9% identical to the nucleotide sequences of SEQ ID NOS:1-5,191, in a form which can be readily
used, analyzed and interpreted by the skilled artisan.

Methods for determining whether a nucleotide sequence is at least 95%, at least 99% or at least 99.9% identical
to the nucleotide sequences of SEQ ID NOS:1-5,191 are routine and readily available to the skilled artisan. For example,
the well known FASTA algorithm described in Pearson and Lipman, Proc. Natl. Acad. Sci. USA 85: 2444 (1988) can be
used to generate the percent identity of nucleotide sequences. The BLASTN program also can be used to generate
an identity score of polynucleotides compared to one another.
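
The percent identity figures referred to above can be illustrated with a minimal Python sketch. It does not reproduce FASTA or BLASTN, which also perform the alignment itself; it assumes the two sequences have already been aligned to equal length with '-' marking gaps, and it counts a gap opposite a base as a mismatch. The function names percent_identity and identity_class are hypothetical.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent of aligned columns in which both sequences carry the same base.

    Both inputs must already be aligned (equal length, '-' marks a gap).
    Columns where both sequences have a gap are ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    matches = 0
    columns = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" and b == "-":
            continue
        columns += 1
        if a == b:
            matches += 1
    if columns == 0:
        return 0.0
    return 100.0 * matches / columns


def identity_class(pct):
    """Return the highest of the 95%, 99% and 99.9% thresholds that pct meets."""
    for threshold in (99.9, 99.0, 95.0):
        if pct >= threshold:
            return threshold
    return None
```

Under these assumptions, two aligned 20-base sequences that differ at a single position score 95.0 and fall into the "at least 95%" class, while a pair scoring below 95 returns None.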

COMPUTER RELATED EMBODIMENTS 

The nucleotide sequences provided in SEQ ID NOS:1-5,191, a representative fragment thereof, or a nucleotide
sequence at least 95%, preferably at least 99% and most preferably at least 99.9% identical to a polynucleotide
sequence of SEQ ID NOS:1-5,191 may be "provided" in a variety of mediums to facilitate use thereof. As used herein,
"provided" refers to a manufacture, other than an isolated nucleic acid molecule, which contains a nucleotide sequence
of the present invention; i.e., a nucleotide sequence provided in SEQ ID NOS:1-5,191, a representative fragment
thereof, or a nucleotide sequence at least 95%, preferably at least 99% and most preferably at least 99.9% identical
to a polynucleotide of SEQ ID NOS:1-5,191. Such a manufacture provides a large portion of the Staphylococcus aureus
genome and parts thereof (e.g., a Staphylococcus aureus open reading frame (ORF)) in a form which allows a skilled
artisan to examine the manufacture using means not directly applicable to examining the Staphylococcus aureus
genome or a subset thereof as it exists in nature or in purified form.

In one application of this embodiment, a nucleotide sequence of the present invention can be recorded on computer
readable media. As used herein, "computer readable media" refers to any medium which can be read and accessed
directly by a computer. Such media include, but are not limited to: magnetic storage media, such as floppy discs, hard
disc storage medium, and magnetic tape; optical storage media such as CD-ROM; electrical storage media such as
RAM and ROM; and hybrids of these categories, such as magnetic/optical storage media. A skilled artisan can readily
appreciate how any of the presently known computer readable mediums can be used to create a manufacture
comprising computer readable medium having recorded thereon a nucleotide sequence of the present invention. Likewise,
it will be clear to those of skill how additional computer readable media that may be developed also can be used to
create analogous manufactures having recorded thereon a nucleotide sequence of the present invention.

As used herein, "recorded" refers to a process for storing information on computer readable medium. A skilled
artisan can readily adopt any of the presently known methods for recording information on computer readable medium
to generate manufactures comprising the nucleotide sequence information of the present invention.

A variety of data storage structures are available to a skilled artisan for creating a computer readable medium
having recorded thereon a nucleotide sequence of the present invention. The choice of the data storage structure will
generally be based on the means chosen to access the stored information. In addition, a variety of data processor
programs and formats can be used to store the nucleotide sequence information of the present invention on computer
readable medium. The sequence information can be represented in a word processing text file, formatted in
commercially available software such as WordPerfect and Microsoft Word, or represented in the form of an ASCII file, stored
in a database application, such as DB2, Sybase, Oracle, or the like. A skilled artisan can readily adapt any number of
data-processor structuring formats (e.g., text file or database) in order to obtain computer readable medium having
recorded thereon the nucleotide sequence information of the present invention.
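
The two storage formats mentioned above, an ASCII text file and a database table, can be sketched in Python as follows. This is only an illustration under stated assumptions: the FASTA-style file layout, the table and column names, and the example records are hypothetical, and Python's built-in sqlite3 module stands in for a database application such as DB2, Sybase or Oracle.

```python
import os
import sqlite3
import tempfile


def write_fasta(path, records, width=60):
    """Write (identifier, sequence) pairs to a plain ASCII FASTA-style file."""
    with open(path, "w", encoding="ascii") as handle:
        for identifier, sequence in records:
            handle.write(f">{identifier}\n")
            for i in range(0, len(sequence), width):
                handle.write(sequence[i:i + width] + "\n")


def read_fasta(path):
    """Read the file back into a list of (identifier, sequence) pairs."""
    records = []
    identifier, chunks = None, []
    with open(path, encoding="ascii") as handle:
        for line in handle:
            line = line.strip()
            if line.startswith(">"):
                if identifier is not None:
                    records.append((identifier, "".join(chunks)))
                identifier, chunks = line[1:], []
            elif line:
                chunks.append(line)
    if identifier is not None:
        records.append((identifier, "".join(chunks)))
    return records


def store_in_database(connection, records):
    """Store the same records in a relational table (hypothetical schema)."""
    connection.execute(
        "CREATE TABLE IF NOT EXISTS contig (seq_id TEXT PRIMARY KEY, sequence TEXT)"
    )
    connection.executemany("INSERT INTO contig VALUES (?, ?)", records)
    connection.commit()


# Placeholder records; these are not sequences from the Sequence Listing.
example_records = [("example_contig_1", "ACGT" * 40), ("example_contig_2", "GGCCTTAA")]

# Record the sequences in an ASCII file and read them back.
with tempfile.TemporaryDirectory() as tmp:
    fasta_path = os.path.join(tmp, "contigs.fasta")
    write_fasta(fasta_path, example_records)
    round_trip = read_fasta(fasta_path)

# Record the same sequences in an in-memory relational database.
db = sqlite3.connect(":memory:")
store_in_database(db, example_records)
stored = db.execute("SELECT seq_id, sequence FROM contig ORDER BY seq_id").fetchall()
```

Either representation returns the same identifier and sequence pairs; which one is preferable depends, as noted above, on the means chosen to access the stored information.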

Computer software is publicly available which allows a skilled artisan to access sequence information provided in
a computer readable medium. Thus, by providing in computer readable form the nucleotide sequences of SEQ ID
NOS:1-5,191, a representative fragment thereof, or a nucleotide sequence at least 95%, preferably at least 99% and
most preferably at least 99.9% identical to a sequence of SEQ ID NOS:1-5,191, the present invention enables the
skilled artisan routinely to access the provided sequence information for a wide variety of purposes.

The examples which follow demonstrate how software which implements the BLAST (Altschul et al., J. Mol. Biol.
215:403-410 (1990)) and BLAZE (Brutlag et al., Comp. Chem. 17:203-207 (1993)) search algorithms on a Sybase
system was used to identify open reading frames (ORFs) within the Staphylococcus aureus genome which contain
homology to ORFs or proteins from both Staphylococcus aureus and from other organisms. Among the ORFs discussed
herein are protein encoding fragments of the Staphylococcus aureus genome useful in producing commercially
important proteins, such as enzymes used in fermentation reactions and in the production of commercially useful metabolites.

The present invention further provides systems, particularly computer-based systems, which contain the sequence
information described herein. Such systems are designed to identify, among other things, commercially important
fragments of the Staphylococcus aureus genome.

As used herein, "a computer-based system" refers to the hardware means, software means, and data storage
means used to analyze the nucleotide sequence information of the present invention. The minimum hardware means
of the computer-based systems of the present invention comprises a central processing unit (CPU), input means,
output means, and data storage means. A skilled artisan can readily appreciate that any one of the currently available
computer-based systems is suitable for use in the present invention.

As stated above, the computer-based systems of the present invention comprise a data storage means having 
stored therein a nucleotide sequence of the present invention and the necessary hardware means and software means 
for supporting and implementing a search means. 

As used herein, "data storage means" refers to memory which can store nucleotide sequence information of the
present invention, or a memory access means which can access manufactures having recorded thereon the nucleotide
sequence information of the present invention.

As used herein, "search means" refers to one or more programs which are implemented on the computer-based
system to compare a target sequence or target structural motif with the sequence information stored within the data
storage means. Search means are used to identify fragments or regions of the present genomic sequences which
match a particular target sequence or target motif. A variety of known algorithms are disclosed publicly and a variety
of commercially available software for conducting search means are and can be used in the computer-based systems
of the present invention. Examples of such software include, but are not limited to, MacPattern (EMBL), BLASTN and
BLASTX (NCBIA). A skilled artisan can readily recognize that any one of the available algorithms or implementing
software packages for conducting homology searches can be adapted for use in the present computer-based systems.
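
Purely as an illustrative sketch, a very simple search means can be written in a few lines of Python. It is a naive ungapped scan of both strands that tolerates a chosen number of mismatches; it is not BLAST, MacPattern or any other named package, and the function names and example sequences are assumptions introduced for the example.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def find_matches(stored, target, max_mismatches=0):
    """Scan both strands of `stored` for `target`.

    Returns a list of (strand, start, mismatches), where `start` is the
    0-based position of the matching window on the forward strand.
    """
    hits = []
    n = len(target)
    for strand, query in (("+", target), ("-", reverse_complement(target))):
        for start in range(len(stored) - n + 1):
            window = stored[start:start + n]
            mismatches = sum(a != b for a, b in zip(window, query))
            if mismatches <= max_mismatches:
                hits.append((strand, start, mismatches))
    return hits


# Hypothetical stored sequence and targets, for illustration only.
STORED = "TTGACAATGCCCGGGTTTAAA"
forward_hits = find_matches(STORED, "ATGCCC")
reverse_hits = find_matches(STORED, "AAACCC")
```

A scan of this kind runs in time proportional to the product of the two lengths, which is one reason indexed heuristics such as BLAST are used for genome-scale searches.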

As used herein, a "target sequence" can be any DNA or amino acid sequence of six or more nucleotides or two
or more amino acids. A skilled artisan can readily recognize that the longer a target sequence is, the less likely a target
sequence will be present as a random occurrence in the database. The most preferred sequence length of a target
sequence is from about 10 to 100 amino acids or from about 30 to 300 nucleotide residues. However, it is well recognized
that searches for commercially important fragments, such as sequence fragments involved in gene expression and
protein processing, may be of shorter length.
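
The relationship between target length and random occurrence can be made concrete with a short calculation. The following Python sketch assumes, for simplicity, that residues are independent and equally frequent, which real sequences are not; the database size of 2,800,000 bases is an illustrative figure chosen for the example and is not taken from the present disclosure.

```python
def expected_random_hits(target_length, database_length, alphabet_size):
    """Expected number of chance occurrences of one specific target.

    Assumes independent, equally frequent residues, so a given target of
    length n matches at any one position with probability alphabet_size**-n.
    """
    positions = max(database_length - target_length + 1, 0)
    return positions * alphabet_size ** (-target_length)


# Illustrative database size (an assumption made for this example).
DATABASE_LENGTH = 2_800_000

six_base_target = expected_random_hits(6, DATABASE_LENGTH, 4)
thirty_base_target = expected_random_hits(30, DATABASE_LENGTH, 4)
ten_residue_peptide = expected_random_hits(10, DATABASE_LENGTH, 20)
```

Under these assumptions a six-nucleotide target is expected to occur by chance hundreds of times in a database of this size, whereas a 30-nucleotide target is expected to occur by chance far less than once.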

As used herein, "a target structural motif," or "target motif," refers to any rationally selected sequence or
combination of sequences in which the sequence(s) are chosen based on a three-dimensional configuration which is formed
upon the folding of the target motif. There are a variety of target motifs known in the art. Protein target motifs include,
but are not limited to, enzymic active sites and signal sequences. Nucleic acid target motifs include, but are not limited
to, promoter sequences, hairpin structures and inducible expression elements (protein binding sequences).

A variety of structural formats for the input and output means can be used to input and output the information in
the computer-based systems of the present invention. A preferred format for an output means ranks fragments of the
Staphylococcus aureus genomic sequences possessing varying degrees of homology to the target sequence or target
motif. Such presentation provides a skilled artisan with a ranking of sequences which contain various amounts of the
target sequence or target motif and identifies the degree of homology contained in the identified fragment.
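
A ranked output of the kind described above might, for example, be produced as in the following Python sketch. The scoring used here (the best ungapped percent identity over all placements of the target within a fragment) is a deliberately simple stand-in chosen for the example, and the fragment names and sequences are hypothetical.

```python
def best_window_identity(fragment, target):
    """Highest fraction of matching positions over all ungapped
    placements of `target` within `fragment`."""
    n = len(target)
    if len(fragment) < n:
        return 0.0
    best = 0
    for start in range(len(fragment) - n + 1):
        window = fragment[start:start + n]
        matches = sum(a == b for a, b in zip(window, target))
        best = max(best, matches)
    return best / n


def rank_fragments(fragments, target):
    """Return (name, percent identity) pairs, best match first."""
    scored = [
        (name, 100.0 * best_window_identity(seq, target))
        for name, seq in fragments.items()
    ]
    return sorted(scored, key=lambda item: item[1], reverse=True)


# Hypothetical fragments and target, for illustration only.
FRAGMENTS = {
    "frag_c": "TTTTTTTTTT",
    "frag_b": "GGATGACCGG",
    "frag_a": "GGATGCCCGG",
}
ranking = rank_fragments(FRAGMENTS, "ATGCCC")
```

Each entry of the resulting list pairs a fragment with its degree of identity to the target, so that the strongest candidates appear first.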

A variety of comparing means can be used to compare a target sequence or target motif with the data storage
means to identify sequence fragments of the Staphylococcus aureus genome. In the present examples, software
which implements the BLAST and BLAZE algorithms, described in Altschul et al., J. Mol. Biol. 215: 403-410
(1990), was used to identify open reading frames within the Staphylococcus aureus genome. A skilled artisan can
readily recognize that any one of the publicly available homology search programs can be used as the search means
for the computer-based systems of the present invention. Of course, suitable proprietary systems that may be known
to those of skill also may be employed in this regard.

Figure 1 provides a block diagram of a computer system illustrative of embodiments of this aspect of the present
invention. The computer system 102 includes a processor 106 connected to a bus 104. Also connected to the bus 104
are a main memory 108 (preferably implemented as random access memory, RAM) and a variety of secondary storage
devices 110, such as a hard drive 112 and a removable medium storage device 114. The removable medium storage
device 114 may represent, for example, a floppy disk drive, a CD-ROM drive, a magnetic tape drive, etc. A removable
storage medium 116 (such as a floppy disk, a compact disk, a magnetic tape, etc.) containing control logic and/or data
recorded therein may be inserted into the removable medium storage device 114. The computer system 102 includes
appropriate software for reading the control logic and/or the data from the removable medium storage device 114, once
it is inserted into the removable medium storage device 114.

A nucleotide sequence of the present invention may be stored in a well known manner in the main memory 108,
any of the secondary storage devices 110, and/or a removable storage medium 116. During execution, software for
accessing and processing the genomic sequence (such as search tools, comparing tools, etc.) resides in main memory
108, in accordance with the requirements and operating parameters of the operating system, the hardware system
and the software program or programs.



BIOCHEMICAL EMBODIMENTS

Other embodiments of the present invention are directed to fragments of the Staphylococcus aureus genome,
preferably to isolated fragments. The fragments of the Staphylococcus aureus genome of the present invention include,
but are not limited to, fragments which encode peptides, hereinafter open reading frames (ORFs), fragments which
modulate the expression of an operably linked ORF, hereinafter expression modulating fragments (EMFs), and
fragments which can be used to diagnose the presence of Staphylococcus aureus in a sample, hereinafter diagnostic
fragments (DFs).

As used herein, an "isolated nucleic acid molecule" or an "isolated fragment of the Staphylococcus aureus genome"
refers to a nucleic acid molecule possessing a specific nucleotide sequence which has been subjected to purification
means to reduce, from the composition, the number of compounds which are normally associated with the composition.
Particularly, the term refers to the nucleic acid molecules having the sequences set out in SEQ ID NOS:1-5,191, to
representative fragments thereof as described above, to polynucleotides at least 95%, preferably at least 99% and
especially preferably at least 99.9% identical in sequence thereto, also as set out above.

A variety of purification means can be used to generate the isolated fragments of the present invention. These
include, but are not limited to, methods which separate constituents of a solution based on charge, solubility or size.

In one embodiment, Staphylococcus aureus DNA can be mechanically sheared to produce fragments of 15-20 kb
in length. These fragments can then be used to generate a Staphylococcus aureus library by inserting them into
lambda clones as described in the Examples below. Primers flanking, for example, an ORF, such as those enumerated
in Tables 1-3, can then be generated using nucleotide sequence information provided in SEQ ID NOS: 1-5,191. Well
known and routine techniques of PCR cloning then can be used to isolate the ORF from the lambda DNA library of
Staphylococcus aureus genomic DNA. Thus, given the availability of SEQ ID NOS:1-5,191, the information in Tables
1, 2 and 3, and the information that may be obtained readily by analysis of the sequences of SEQ ID NOS:1-5,191
using methods set out above, those of skill will be enabled by the present disclosure to isolate any ORF-containing or
other nucleic acid fragment of the present invention.

The isolated nucleic acid molecules of the present invention include, but are not limited to, single stranded and
double stranded DNA, and single stranded RNA.

As used herein, an "open reading frame," ORF, means a series of triplets coding for amino acids without any
termination codons and is a sequence translatable into protein.
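
The definition above lends itself to a brief sketch. The following Python example reports every run of triplets that is free of termination codons in the three forward reading frames of a contig; it is offered only as an illustration of the definition and is not the GeneMark analysis or any other method used to prepare the Tables. The minimum run length, the restriction to the forward strand, and the example contig are assumptions made for brevity.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_open_frames(contig, min_codons=3):
    """Find stop-free runs of codons in the three forward frames.

    Returns (frame, start, length) tuples, where frame is 1, 2 or 3,
    start is the 1-based position of the first nucleotide of the run,
    and length is given in nucleotides.
    """
    results = []
    for offset in range(3):
        run_start = offset
        pos = offset
        while pos + 3 <= len(contig):
            if contig[pos:pos + 3] in STOP_CODONS:
                # A termination codon closes the current run.
                if (pos - run_start) // 3 >= min_codons:
                    results.append((offset + 1, run_start + 1, pos - run_start))
                run_start = pos + 3
            pos += 3
        # A run may also extend to the end of the contig.
        if (pos - run_start) // 3 >= min_codons:
            results.append((offset + 1, run_start + 1, pos - run_start))
    return results


# Hypothetical 20-base contig, for illustration only.
open_frames = find_open_frames("ATGAAAGGTTTAGCTTAACC")
```

For the example contig, one run of five codons is found in the +1 frame and one run of six codons in the +3 frame; the +2 frame is interrupted by termination codons too often to yield a run of three codons.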

Tables 1, 2 and 3 list ORFs in the Staphylococcus aureus genomic contigs of the present invention that were
identified as putative coding regions by the GeneMark software using organism-specific second-order Markov
probability transition matrices. It will be appreciated that other criteria can be used, in accordance with well known analytical
methods, such as those discussed herein, to generate more inclusive, more restrictive or more selective lists.

Table 1 sets out ORFs in the Staphylococcus aureus contigs of the present invention that are at least 80 amino
acids long and that, over a continuous region of at least 50 bases, are 95% or more identical (by BLAST analysis) to
an S. aureus nucleotide sequence available through Genbank in November 1996.

Table 2 sets out ORFs in the Staphylococcus aureus contigs of the present invention that are not in Table 1 and
match, with a BLASTP probability score of 0.01 or less, a polypeptide sequence available through Genbank by
September 1996.

Table 3 sets out ORFs in the Staphylococcus aureus contigs of the present invention that do not match significantly,
by BLASTP analysis, a polypeptide sequence available through Genbank by September 1996.

In each table, the first and second columns identify the ORF by, respectively, contig number and ORF number 
within the contig; the third column indicates the reading frame, taking the first 5' nucleotide of the contig as the start of 
the +1 frame; the fourth column indicates the first nucleotide of the ORF, counting from the 5' end of the contig strand; 
and the fifth column indicates the length of each ORF in nucleotides. 

In Tables 1 and 2, column six lists the "Reference" for the closest matching sequence available through Genbank.
These reference numbers are the database entry numbers commonly used by those of skill in the art, who will be
familiar with their denominators. Descriptions of the nomenclature are available from the National Center for
Biotechnology Information. Column seven in Tables 1 and 2 provides the "gene name" of the matching sequence; column eight
provides the "BLAST identity" score from the comparison of the ORF and the homologous gene; and column nine
indicates the length in nucleotides of the "highest scoring segment pair" identified by the BLAST identity analysis.

In Table 3, the last column, column six, indicates the length of each ORF in amino acid residues.

The concepts of percent identity and percent similarity of two polypeptide sequences are well understood in the art.
For example, two polypeptides 10 amino acids in length which differ at three amino acid positions (e.g., at positions
1, 3 and 5) are said to have a percent identity of 70%. However, the same two polypeptides would be deemed to have
a percent similarity of 80% if, for example at position 5, the amino acid moieties, although not identical, were "similar"
(i.e., possessed similar biochemical characteristics). Many programs for analysis of nucleotide or amino acid sequence
similarity, such as FASTA and BLAST, specifically list percent identity of a matching region as an output parameter. Thus,
for instance, Tables 1 and 2 herein enumerate the "percent identity" of the "highest scoring segment pair" in each ORF
and its listed relative. Further details concerning the algorithms and criteria used for homology searches are provided
below and are described in the pertinent literature highlighted by the citations provided below.
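
The arithmetic of the foregoing example can be reproduced with the following Python sketch. The two ten-residue sequences are hypothetical and were constructed to differ at positions 1, 3 and 5; the grouping of amino acids into "similar" classes is a coarse assumption made for the example and is not the scoring scheme of FASTA, BLAST or any other program.

```python
# Coarse, assumed classes of biochemically similar amino acids.
SIMILAR_GROUPS = ["AVLIM", "FWY", "ST", "NQ", "DE", "KRH", "G", "P", "C"]
GROUP_OF = {aa: i for i, group in enumerate(SIMILAR_GROUPS) for aa in group}


def percent_identity_and_similarity(seq_a, seq_b):
    """Compare two equal-length, ungapped polypeptide sequences.

    Identity counts positions with the same residue; similarity counts
    those positions plus positions whose residues share a class.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be the same length")
    identical = 0
    similar = 0
    for a, b in zip(seq_a, seq_b):
        if a == b:
            identical += 1
        elif a in GROUP_OF and GROUP_OF.get(a) == GROUP_OF.get(b):
            similar += 1
    length = len(seq_a)
    return 100 * identical / length, 100 * (identical + similar) / length


# Hypothetical polypeptides differing at positions 1, 3 and 5;
# only the substitution at position 5 (E to D) is within one class.
identity, similarity = percent_identity_and_similarity("MKTLEAVGSP", "GKPLDAVGSP")
```

With these inputs the identity is 70% and the similarity is 80%, in agreement with the example given above.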

It will be appreciated that other criteria can be used to generate more inclusive and more exclusive listings of the
types set out in the tables. As those of skill will appreciate, narrow and broad searches both are useful. Thus, a skilled
artisan can readily identify ORFs in contigs of the Staphylococcus aureus genome other than those listed in Tables
1-3, such as ORFs which are overlapping or encoded by the opposite strand of an identified ORF in addition to those
ascertainable using the computer-based systems of the present invention.

As used herein, an "expression modulating fragment," EMF, means a series of nucleotide molecules which
modulates the expression of an operably linked ORF or EMF.

As used herein, a sequence is said to "modulate the expression of an operably linked sequence" when the
expression of the sequence is altered by the presence of the EMF. EMFs include, but are not limited to, promoters, and
promoter modulating sequences (inducible elements). One class of EMFs are fragments which induce the expression
of an operably linked ORF in response to a specific regulatory factor or physiological event.

EMF sequences can be identified within the contigs of the Staphylococcus aureus genome by their proximity to
the ORFs provided in Tables 1-3. An intergenic segment, or a fragment of the intergenic segment, from about 10 to
200 nucleotides in length, taken from any one of the ORFs of Tables 1-3 will modulate the expression of an operably
linked ORF in a fashion similar to that found with the naturally linked ORF sequence. As used herein, an "intergenic
segment" refers to fragments of the Staphylococcus aureus genome which are between two ORF(s) herein described.
EMFs also can be identified using known EMFs as a target sequence or target motif in the computer-based systems
of the present invention. Further, the two methods can be combined and used together.
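
As a hedged illustration of identifying candidate EMFs by proximity, the following Python sketch extracts the segments lying between consecutive ORFs on a contig and keeps those from 10 to 200 nucleotides long. The ORF coordinates are given as a 1-based start and a length in nucleotides; the coordinates, the contig, and the decision to keep whole gaps rather than sub-fragments of longer gaps are assumptions made for the example.

```python
def intergenic_segments(contig, orfs, min_len=10, max_len=200):
    """Return (start, sequence) for gaps between consecutive ORFs.

    `orfs` is a list of (start, length) pairs with 1-based starts.
    Only gaps whose length lies in [min_len, max_len] are returned;
    the returned start is the 1-based position of the first gap base.
    """
    segments = []
    ordered = sorted(orfs)
    for (start_a, len_a), (start_b, _) in zip(ordered, ordered[1:]):
        gap_start = start_a + len_a          # first base after upstream ORF
        gap_len = start_b - gap_start        # bases before downstream ORF
        if min_len <= gap_len <= max_len:
            begin = gap_start - 1            # convert to 0-based index
            segments.append((gap_start, contig[begin:begin + gap_len]))
    return segments


# Hypothetical contig: three 30-base ORFs separated by gaps of 12 and 5 bases.
CONTIG = "A" * 30 + "C" * 12 + "G" * 30 + "T" * 5 + "A" * 30
ORFS = [(1, 30), (43, 30), (78, 30)]
candidate_emfs = intergenic_segments(CONTIG, ORFS)
```

Only the 12-base gap is reported, since the 5-base gap falls below the lower bound; overlapping ORFs give a negative gap length and are likewise skipped.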

The presence and activity of an EMF can be confirmed using an EMF trap vector. An EMF trap vector contains a
cloning site linked to a marker sequence. A marker sequence encodes an identifiable phenotype, such as antibiotic
resistance or a complementing nutrition auxotrophic factor, which can be identified or assayed when the EMF trap
vector is placed within an appropriate host under appropriate conditions. As described above, an EMF will modulate the
expression of an operably linked marker sequence. A more detailed discussion of various marker sequences is provided
below.

A sequence which is suspected as being an EMF is cloned in all three reading frames in one or more restriction
sites upstream from the marker sequence in the EMF trap vector. The vector is then transformed into an appropriate
host using known procedures and the phenotype of the transformed host is examined under appropriate conditions.
As described above, an EMF will modulate the expression of an operably linked marker sequence.

As used herein, a "diagnostic fragment," DF, means a series of nucleotide molecules which selectively hybridize
to Staphylococcus aureus sequences. DFs can be readily identified by identifying unique sequences within contigs of
the Staphylococcus aureus genome, such as by using well-known computer analysis software, and by generating and
testing probes or amplification primers consisting of the DF sequence in an appropriate diagnostic format which
determines amplification or hybridization selectivity.
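
One simple computational way to look for unique sequences, sketched here in Python under stated assumptions, is to list the subsequences of a fixed length that occur in the target contig but on neither strand of any comparison sequence. The subsequence length, the function names and the example sequences are hypothetical, and any candidate found in this way would still have to be tested experimentally for hybridization or amplification selectivity as described above.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def kmers(seq, k):
    """Return the set of all length-k subsequences of `seq`."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def candidate_diagnostic_kmers(target, others, k):
    """List k-mers of `target` absent from both strands of every
    sequence in `others`, in sorted order."""
    excluded = set()
    for other in others:
        excluded |= kmers(other, k)
        excluded |= kmers(reverse_complement(other), k)
    return sorted(kmers(target, k) - excluded)


# Hypothetical target and comparison sequences, for illustration only.
candidates = candidate_diagnostic_kmers("ACGTTGCA", ["ACGTT"], k=4)
```

In practice much longer subsequences and much larger comparison sets would be used; the short values here only keep the example easy to follow.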

The sequences falling within the scope of the present invention are not limited to the specific sequences herein
described, but also include allelic and species variations thereof. Allelic and species variations can be routinely
determined by comparing the sequences provided in SEQ ID NOS:1-5,191, a representative fragment thereof, or a nucleotide
sequence at least 95%, preferably 99% and most preferably 99.9% identical to SEQ ID NOS:1-5,191, with a sequence
from another isolate of the same species.

Furthermore, to accommodate codon variability, the invention includes nucleic acid molecules coding for the same
amino acid sequences as do the nucleic acid sequences mentioned above. In other words, in the coding region of an
ORF, substitution of one codon for another which encodes the same amino acid is expressly contemplated.
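
Whether two coding sequences differ only by such codon substitutions can be checked by translating both and comparing the results, as in the following Python sketch. The sketch uses the standard genetic code written in its usual compact form; the example sequences are hypothetical.

```python
from itertools import product

# Standard genetic code, with codons ordered by base T, C, A, G at each
# position and "*" marking termination codons.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an upper-case DNA coding sequence, ignoring any
    incomplete trailing codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def encode_same_polypeptide(dna_a, dna_b):
    """True if the two coding sequences translate to the same residues."""
    return translate(dna_a) == translate(dna_b)


# Hypothetical sequences: the second uses different codons for K, G and L;
# the third changes G to D.
ORIGINAL = "ATGAAAGGTTTA"
SYNONYMOUS = "ATGAAGGGCCTG"
NONSYNONYMOUS = "ATGAAAGATTTA"
```

Here ORIGINAL and SYNONYMOUS differ at three nucleotide positions yet both encode the residues MKGL, whereas NONSYNONYMOUS encodes MKDL.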

Any specific sequence disclosed herein can be readily screened for errors by resequencing a particular fragment,
such as an ORF, in both directions (i.e., sequence both strands). Alternatively, error screening can be performed by
sequencing corresponding polynucleotides of Staphylococcus aureus origin isolated by using part or all of the fragments
in question as a probe or primer.

Each of the ORFs of the Staphylococcus aureus genome disclosed in Tables 1, 2 and 3, and the EMFs found 5'
to the ORFs, can be used as polynucleotide reagents in numerous ways. For example, the sequences can be used
as diagnostic probes or diagnostic amplification primers to detect the presence of a specific microbe in a sample,
particularly Staphylococcus aureus. Especially preferred in this regard are ORFs such as those of Table 3, which do not
match previously characterized sequences from other organisms and thus are most likely to be highly selective for
Staphylococcus aureus. Also particularly preferred are ORFs that can be used to distinguish between strains of
Staphylococcus aureus, particularly those that distinguish medically important strains, such as drug-resistant strains.

In addition, the fragments of the present invention, as broadly described, can be used to control gene expression
through triple helix formation or antisense DNA or RNA, both of which methods are based on the binding of a
polynucleotide sequence to DNA or RNA. Triple helix formation optimally results in a shut-off of RNA transcription from DNA,
while antisense RNA hybridization blocks translation of an mRNA molecule into polypeptide. Information from the
sequences of the present invention can be used to design antisense and triple helix-forming oligonucleotides.
Polynucleotides suitable for use in these methods are usually 20 to 40 bases in length and are designed to be complementary
to a region of the gene involved in transcription, for triple-helix formation, or to the mRNA itself, for antisense inhibition.
Both techniques have been demonstrated to be effective in model systems, and the requisite techniques are well known
and involve routine procedures. Triple helix techniques are discussed in, for example, Lee et al., Nucl. Acids Res. 6:
3073 (1979); Cooney et al., Science 241: 456 (1988); and Dervan et al., Science 251: 1360 (1991). Antisense
techniques in general are discussed in, for instance, Okano, J. Neurochem. 56: 560 (1991) and
OLIGODEOXYNUCLEOTIDES AS ANTISENSE INHIBITORS OF GENE EXPRESSION, CRC Press, Boca Raton, FL (1988).
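
The complementarity requirement for an antisense oligonucleotide can be illustrated with the following Python sketch, which returns a DNA oligonucleotide, written 5' to 3', complementary to a chosen region of an mRNA. The length limits follow the 20 to 40 base range stated above; the function name and the example mRNA are hypothetical, and considerations such as target accessibility, secondary structure and specificity are not addressed.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def antisense_oligo(mrna, start, length=20):
    """Return a DNA oligonucleotide (5' to 3') complementary to
    mrna[start:start + length], where `start` is 0-based."""
    if not 20 <= length <= 40:
        raise ValueError("length should be from 20 to 40 bases")
    region = mrna[start:start + length]
    if len(region) < length:
        raise ValueError("region runs past the end of the mRNA")
    # Express the RNA region in DNA letters, then reverse-complement it.
    as_dna = region.upper().replace("U", "T")
    return as_dna.translate(COMPLEMENT)[::-1]


# Hypothetical 27-base mRNA, for illustration only.
EXAMPLE_MRNA = "AUGGCUAAAGGUCUUGAAGCUGUUACC"
oligo = antisense_oligo(EXAMPLE_MRNA, 0, 20)
```

Reverse-complementing the oligonucleotide again recovers the DNA form of the targeted region, which provides a convenient check on the design.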

The present invention further provides recombinant constructs comprising one or more fragments of the
Staphylococcus aureus genomic fragments and contigs of the present invention. Certain preferred recombinant constructs of
the present invention comprise a vector, such as a plasmid or viral vector, into which a fragment of the Staphylococcus
aureus genome has been inserted, in a forward or reverse orientation. In the case of a vector comprising one of the
ORFs of the present invention, the vector may further comprise regulatory sequences, including for example, a
promoter, operably linked to the ORF. For vectors comprising the EMFs of the present invention, the vector may further
comprise a marker sequence or heterologous ORF operably linked to the EMF.

Large numbers of suitable vectors and promoters are known to those of skill in the art and are commercially
available for generating the recombinant constructs of the present invention. The following vectors are provided by
way of example. Useful bacterial vectors include phagescript, PsiX174, pBluescript SK and KS (+ and -), pNH8a,
pNH16a, pNH18a, pNH46a (available from Stratagene); pTrc99A, pKK223-3, pKK233-3, pDR540, pRIT5 (available
from Pharmacia). Useful eukaryotic vectors include pWLneo, pSV2cat, pOG44, pXT1, pSG (available from Stratagene);
pSVK3, pBPV, pMSG, pSVL (available from Pharmacia).

Promoter regions can be selected from any desired gene using CAT (chloramphenicol transferase) vectors or other
vectors with selectable markers. Two appropriate vectors are pKK232-8 and pCM7. Particular named bacterial
promoters include lacI, lacZ, T3, T7, gpt, lambda PR, and trc. Eukaryotic promoters include CMV immediate early, HSV
thymidine kinase, early and late SV40, LTRs from retrovirus, and mouse metallothionein-I. Selection of the appropriate
vector and promoter is well within the level of ordinary skill in the art.

The present invention further provides host cells containing any one of the isolated fragments of the Staphylococcus
aureus genomic fragments and contigs of the present invention, wherein the fragment has been introduced into the
host cell using known methods. The host cell can be a higher eukaryotic host cell, such as a mammalian cell, a lower
eukaryotic host cell, such as a yeast cell, or a procaryotic cell, such as a bacterial cell.

A polynucleotide of the present invention, such as a recombinant construct comprising an ORF of the present
invention, may be introduced into the host by a variety of well established techniques that are standard in the art, such
as calcium phosphate transfection, DEAE-dextran mediated transfection and electroporation, which are described in,
for instance, Davis, L. et al., BASIC METHODS IN MOLECULAR BIOLOGY (1986).

A host cell containing one of the fragments of the Staphylococcus aureus genomic fragments and contigs of the
present invention can be used in conventional manners to produce the gene product encoded by the isolated fragment
(in the case of an ORF) or can be used to produce a heterologous protein under the control of the EMF.

The present invention further provides isolated polypeptides encoded by the nucleic acid fragments of the present
invention or by degenerate variants of the nucleic acid fragments of the present invention. By "degenerate variant" is
intended nucleotide fragments which differ from a nucleic acid fragment of the present invention (e.g., an ORF) by
nucleotide sequence but, due to the degeneracy of the Genetic Code, encode an identical polypeptide sequence.

Preferred nucleic acid fragments of the present invention are the ORFs depicted in Tables 2 and 3 which encode 
proteins. 

A variety of methodologies known in the art can be utilized to obtain any one of the isolated polypeptides or proteins
of the present invention. At the simplest level, the amino acid sequence can be synthesized using commercially
available peptide synthesizers. This is particularly useful in producing small peptides and fragments of larger polypeptides.
Such short fragments as may be obtained most readily by synthesis are useful, for example, in generating antibodies
against the native polypeptide, as discussed further below.

In an alternative method, the polypeptide or protein is purified from bacterial cells which naturally produce the
polypeptide or protein. One skilled in the art can readily employ well-known methods for isolating polypeptides and
proteins to isolate and purify polypeptides or proteins of the present invention produced naturally by a bacterial strain,
or by other methods. Methods for isolation and purification that can be employed in this regard include, but are not
limited to, immunochromatography, HPLC, size-exclusion chromatography, ion-exchange chromatography, and
immuno-affinity chromatography.

The polypeptides and proteins of the present invention also can be purified from cells which have been altered to
express the desired polypeptide or protein. As used herein, a cell is said to be altered to express a desired polypeptide
or protein when the cell, through genetic manipulation, is made to produce a polypeptide or protein which it normally
does not produce or which the cell normally produces at a lower level. Those skilled in the art can readily adapt
procedures for introducing and expressing either recombinant or synthetic sequences into eukaryotic or prokaryotic cells
in order to generate a cell which produces one of the polypeptides or proteins of the present invention.

Any host/vector system can be used to express one or more of the ORFs of the present invention. These include,
but are not limited to, eukaryotic hosts such as HeLa cells, CV-1 cells, COS cells, and Sf9 cells, as well as prokaryotic
hosts such as E. coli and B. subtilis. The most preferred cells are those which do not normally express the particular
polypeptide or protein or which express the polypeptide or protein at a low natural level.

"Recombinant," as used herein, means that a polypeptide or protein is derived from recombinant (e.g., microbial
or mammalian) expression systems. "Microbial" refers to recombinant polypeptides or proteins made in bacterial or
fungal (e.g., yeast) expression systems. As a product, "recombinant microbial" defines a polypeptide or protein
essentially free of native endogenous substances and unaccompanied by associated native glycosylation. Polypeptides or
proteins expressed in most bacterial cultures, e.g., E. coli, will be free of glycosylation modifications; polypeptides or
proteins expressed in yeast will have a glycosylation pattern different from that expressed in mammalian cells.

"Nucleotide sequence" refers to a heteropolymer of deoxyribonucleotides. Generally, DNA segments encoding the
polypeptides and proteins provided by this invention are assembled from fragments of the Staphylococcus aureus
genome and short oligonucleotide linkers, or from a series of oligonucleotides, to provide a synthetic gene which is
capable of being expressed in a recombinant transcriptional unit comprising regulatory elements derived from a
microbial or viral operon.

"Recombinant expression vehicle or vector" refers to a plasmid or phage or virus or vector, for expressing a
polypeptide from a DNA (RNA) sequence. The expression vehicle can comprise a transcriptional unit comprising an assembly
of (1) a genetic regulatory element or elements necessary for gene expression in the host, including elements required to initiate
and maintain transcription at a level sufficient for suitable expression of the desired polypeptide, including, for example,
promoters and, where necessary, an enhancer and a polyadenylation signal; (2) a structural or coding sequence
which is transcribed into mRNA and translated into protein; and (3) appropriate signals to initiate translation at the
beginning of the desired coding region and terminate translation at its end. Structural units intended for use in yeast
or eukaryotic expression systems preferably include a leader sequence enabling extracellular secretion of translated
protein by a host cell. Alternatively, where recombinant protein is expressed without a leader or transport sequence,
it may include an N-terminal methionine residue. This residue may or may not be subsequently cleaved from the
expressed recombinant protein to provide a final product.

"Recombinant expression system" means host cells which have stably integrated a recombinant transcriptional
unit into chromosomal DNA or carry the recombinant transcriptional unit extrachromosomally. The cells can be
prokaryotic or eukaryotic. Recombinant expression systems as defined herein will express heterologous polypeptides or
proteins upon induction of the regulatory elements linked to the DNA segment or synthetic gene to be expressed.

Mature proteins can be expressed in mammalian cells, yeast, bacteria, or other cells under the control of
appropriate promoters. Cell-free translation systems can also be employed to produce such proteins using RNAs derived
from the DNA constructs of the present invention. Appropriate cloning and expression vectors for use with prokaryotic
and eukaryotic hosts are described in Sambrook et al., MOLECULAR CLONING: A LABORATORY MANUAL, 2nd
Edition, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York (1989), the disclosure of which is hereby
incorporated by reference in its entirety.

Generally, recombinant expression vectors will include origins of replication and selectable markers permitting
transformation of the host cell, e.g., the ampicillin resistance gene of E. coli and S. cerevisiae TRP1 gene, and a
promoter derived from a highly expressed gene to direct transcription of a downstream structural sequence. Such
promoters can be derived from operons encoding glycolytic enzymes such as 3-phosphoglycerate kinase (PGK),
alpha-factor, acid phosphatase, or heat shock proteins, among others. The heterologous structural sequence is assembled
in appropriate phase with translation initiation and termination sequences, and preferably, a leader sequence capable
of directing secretion of translated protein into the periplasmic space or extracellular medium. Optionally, the
heterologous sequence can encode a fusion protein including an N-terminal identification peptide imparting desired
characteristics, e.g., stabilization or simplified purification of expressed recombinant product.

Useful expression vectors for bacterial use are constructed by inserting a structural DNA sequence encoding a
desired protein together with suitable translation initiation and termination signals in operable reading phase with a
functional promoter. The vector will comprise one or more phenotypic selectable markers and an origin of replication
to ensure maintenance of the vector and, when desirable, provide amplification within the host.

Suitable prokaryotic hosts for transformation include strains of Staphylococcus aureus, E. coli, B. subtilis,
Salmonella typhimurium and various species within the genera Pseudomonas, Streptomyces, and Staphylococcus. Others
may also be employed as a matter of choice.

As a representative but non-limiting example, useful expression vectors for bacterial use can comprise a selectable
marker and bacterial origin of replication derived from commercially available plasmids comprising genetic elements
of the well known cloning vector pBR322 (ATCC 37017). Such commercial vectors include, for example, pKK223-3
(available from Pharmacia Fine Chemicals, Uppsala, Sweden) and GEM 1 (available from Promega Biotec, Madison,
WI, USA). These pBR322 'backbone' sections are combined with an appropriate promoter and the structural sequence
to be expressed.

Following transformation of a suitable host strain and growth of the host strain to an appropriate cell density, the
selected promoter, where it is inducible, is derepressed or induced by appropriate means (e.g., temperature shift or
chemical induction) and cells are cultured for an additional period to provide for expression of the induced gene product.
Thereafter cells are typically harvested, generally by centrifugation, disrupted to release expressed protein, generally
by physical or chemical means, and the resulting crude extract is retained for further purification.

Various mammalian cell culture systems can also be employed to express recombinant protein. Examples of
mammalian expression systems include the COS-7 lines of monkey kidney fibroblasts, described in Gluzman, Cell 22:175
(1981), and other cell lines capable of expressing a compatible vector, for example, the C127, 3T3, CHO, HeLa and
BHK cell lines.

Mammalian expression vectors will comprise an origin of replication, a suitable promoter and enhancer, and also
any necessary ribosome binding sites, polyadenylation site, splice donor and acceptor sites, transcriptional termination
sequences, and 5' flanking nontranscribed sequences. DNA sequences derived from the SV40 viral genome, for
example, SV40 origin, early promoter, enhancer, splice, and polyadenylation sites may be used to provide the required
nontranscribed genetic elements.

Recombinant polypeptides and proteins produced in bacterial culture are usually isolated by initial extraction from
cell pellets, followed by one or more salting-out, aqueous ion exchange or size exclusion chromatography steps.
Microbial cells employed in expression of proteins can be disrupted by any convenient method, including freeze-thaw
cycling, sonication, mechanical disruption, or use of cell lysing agents. Protein refolding steps can be used, as
necessary, in completing configuration of the mature protein. Finally, high performance liquid chromatography (HPLC) can
be employed for final purification steps.

An additional aspect of the invention includes Staphylococcus aureus polypeptides which are useful as
immunodiagnostic antigens and/or immunoprotective vaccines, collectively "immunologically useful polypeptides". Such
immunologically useful polypeptides may be selected from the ORFs disclosed herein based on techniques well known
in the art and described elsewhere herein. The inventors have used the following criteria to select several
immunologically useful polypeptides:

As is known in the art, an amino terminal type I signal sequence directs a nascent protein across the plasma and
outer membranes to the exterior of the bacterial cell. Such outer membrane polypeptides are expected to be
immunologically useful. According to Izard, J. W. et al., Mol. Microbiol. 13, 765-773 (1994), polypeptides containing type I
signal sequences have the following physical attributes: the length of the type I signal sequence is approximately
15 to 25 primarily hydrophobic amino acid residues with a net positive charge in the extreme amino terminus; the
central region of the signal sequence must adopt an alpha-helical conformation in a hydrophobic environment; and the
region surrounding the actual site of cleavage is ideally six residues long, with small side-chain amino acids in the -1
and -3 positions.
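
The type I criteria listed above can be approximated in a few lines of code. The following is a minimal sketch only, not
the inventors' algorithm: the residue sets, the five-residue amino-terminal window and the 60% hydrophobicity cut-off
are assumptions chosen for illustration, and the alpha-helical requirement is not modeled at all.

```python
# Illustrative heuristic only; thresholds and residue sets are assumptions.
HYDROPHOBIC = set("AVLIMFWC")   # assumed hydrophobic residue set
SMALL = set("AGSTC")            # assumed small side-chain residues
BASIC, ACIDIC = set("KR"), set("DE")


def looks_like_type_i_signal(protein: str, signal_length: int) -> bool:
    """Apply the listed type I criteria to the first `signal_length` residues.

    Cleavage is assumed to occur immediately after residue `signal_length`.
    """
    signal = protein[:signal_length].upper()
    # Length criterion: approximately 15 to 25 residues.
    if len(signal) != signal_length or not 15 <= signal_length <= 25:
        return False
    # Net positive charge at the extreme amino terminus (first 5 residues assumed).
    n_region = signal[:5]
    net_charge = sum(r in BASIC for r in n_region) - sum(r in ACIDIC for r in n_region)
    if net_charge <= 0:
        return False
    # Primarily hydrophobic central region (between N-region and cleavage region).
    core = signal[5:-6]
    if sum(r in HYDROPHOBIC for r in core) < 0.6 * len(core):
        return False
    # Small side chains at the -1 and -3 positions relative to the cleavage site.
    return signal[-1] in SMALL and signal[-3] in SMALL
```

For example, `looks_like_type_i_signal("MKKTAIAIAVALAGFATVAQAAPKDNTW", 21)` evaluates the first 21 residues of an
illustrative sequence as a candidate signal peptide.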

Also known in the art is the type IV signal sequence, which is an example of the several types of functional signal
sequences which exist in addition to the type I signal sequence detailed above. Although functionally related, the type
IV signal sequence possesses a unique set of biochemical and physical attributes (Strom, M. S. and Lory, S.,
J. Bacteriol. 174, 7345-7351; 1992). Such sequences typically comprise six to eight amino acids with a net basic charge
followed by an additional sixteen to thirty primarily hydrophobic residues. The cleavage site of a type IV signal
sequence is typically after the initial six to eight amino acids at the extreme amino terminus. In addition, all type IV
signal sequences contain a phenylalanine residue at the +1 site relative to the cleavage site.
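
The type IV attributes described above lend themselves to a simple scan. The sketch below is an illustration under
stated assumptions: the hydrophobic residue set and the 60% cut-off are hypothetical choices, and only the minimum of
sixteen downstream residues is examined.

```python
# Illustrative heuristic only; the residue set and 60% cut-off are assumptions.
from typing import Optional

HYDROPHOBIC = set("AVLIMFWC")
BASIC, ACIDIC = set("KR"), set("DE")


def find_type_iv_cleavage(protein: str) -> Optional[int]:
    """Return the leader length (6 to 8) if the sequence fits the type IV pattern."""
    seq = protein.upper()
    for leader_len in (6, 7, 8):
        # Need at least sixteen residues after the leader to test hydrophobicity.
        if len(seq) < leader_len + 16:
            continue
        leader = seq[:leader_len]
        net_basic = sum(r in BASIC for r in leader) - sum(r in ACIDIC for r in leader)
        if net_basic <= 0:
            continue
        # Phenylalanine is required at +1, the first residue after the cleavage site.
        if seq[leader_len] != "F":
            continue
        stretch = seq[leader_len:leader_len + 16]
        if sum(r in HYDROPHOBIC for r in stretch) >= 0.6 * len(stretch):
            return leader_len
    return None
```

Calling `find_type_iv_cleavage` on a candidate ORF translation returns the assumed leader length, or `None` when no
leader of six to eight residues satisfies all three checks.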

Studies of the cleavage sites of twenty-six bacterial lipoprotein precursors have allowed the definition of a consensus
amino acid sequence for lipoprotein cleavage. Nearly three-fourths of the bacterial lipoprotein precursors examined
contained the sequence L-(A,S)-(G,A)-C at positions -3 to +1, relative to the point of cleavage (Hayashi, S. and Wu,
H. C. Lipoproteins in bacteria. J. Bioenerg. Biomembr. 22, 451-471; 1990).
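
The consensus L-(A,S)-(G,A)-C maps directly onto a regular expression. The following sketch assumes, purely for
illustration, that only the first 40 residues of a translated ORF need to be searched; that window is not taken from
the text.

```python
import re

# L at -3, A or S at -2, G or A at -1, C at +1 (consensus from the text above).
LIPOBOX = re.compile(r"L[AS][GA]C")


def find_lipoprotein_cleavage_sites(protein: str, search_window: int = 40) -> list:
    """Return 0-based indices of the +1 cysteine for each consensus match.

    `search_window` (an assumed value) limits the search to the amino terminus.
    """
    region = protein.upper()[:search_window]
    return [match.start() + 3 for match in LIPOBOX.finditer(region)]
```

Cleavage is taken to occur immediately before the returned cysteine index, so residues up to that index would form
the signal peptide.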

It is well known that most anchored proteins found on the surface of gram-positive bacteria possess a highly
conserved carboxy terminal sequence. More than fifty such proteins from organisms such as S. pyogenes, S. mutans,
E. faecalis, S. pneumoniae, and others, have been identified based on their extracellular location and carboxy terminal
amino acid sequence (Fischetti, V. A. Gram-positive commensal bacteria deliver antigens to elicit mucosal and systemic
immunity. ASM News 62, 405-410; 1996). The conserved region is comprised of six charged amino acids at the extreme
carboxy terminus coupled to 15-20 hydrophobic amino acids presumed to function as a transmembrane domain.
Immediately adjacent to the transmembrane domain is a six amino acid sequence conserved in nearly all proteins
examined. The amino acid sequence of this region is L-P-X-T-G-X, where X is any amino acid.
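
A search for the L-P-X-T-G-X motif and the features that follow it can be sketched as below. This is an illustration
rather than the inventors' method: the 60-residue carboxy terminal window and the residue sets are assumptions, and
the function only reports counts so that the caller can apply whatever thresholds are appropriate.

```python
import re
from typing import NamedTuple, Optional

LPXTGX = re.compile(r"LP.TG.")       # L-P-X-T-G-X, where X is any amino acid
HYDROPHOBIC = set("AVLIMFWC")        # assumed hydrophobic residue set
CHARGED = set("KRDE")                # assumed charged residue set


class SortingSignal(NamedTuple):
    motif_start: int          # 0-based index of the L in L-P-X-T-G-X
    motif: str                # the six matched residues
    hydrophobic_count: int    # hydrophobic residues between motif and last six
    charged_tail_count: int   # charged residues among the last six


def find_sorting_signal(protein: str, tail_window: int = 60) -> Optional[SortingSignal]:
    """Locate the last L-P-X-T-G-X motif within the carboxy terminal window."""
    seq = protein.upper()
    offset = max(0, len(seq) - tail_window)
    hits = list(LPXTGX.finditer(seq, offset))
    if not hits:
        return None
    last = hits[-1]
    downstream = seq[last.end():]
    membrane, tail = downstream[:-6], downstream[-6:]
    return SortingSignal(
        motif_start=last.start(),
        motif=last.group(),
        hydrophobic_count=sum(r in HYDROPHOBIC for r in membrane),
        charged_tail_count=sum(r in CHARGED for r in tail),
    )
```

A candidate would then be retained when the reported counts are consistent with a 15-20 residue hydrophobic domain
followed by a charged carboxy terminus.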

Amino acid sequence similarities to proteins of known function by BLAST enable the assignment of putative
functions to novel amino acid sequences and allow for the selection of proteins thought to function outside the cell
wall. Such proteins are well known in the art and include those annotated as "lipoprotein", "periplasmic", or "antigen".
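
Selection by annotation keyword can be sketched as a simple filter over BLAST hit descriptions. The input format (a
mapping from ORF identifier to the description of its best hit) and the ORF names used in any call are hypothetical;
only the three keywords come from the text.

```python
# Keywords named in the text; the data layout below is an assumption.
KEYWORDS = ("lipoprotein", "periplasmic", "antigen")


def select_by_annotation(best_hits: dict) -> list:
    """Return ORF identifiers whose best-hit description mentions a keyword.

    `best_hits` maps an ORF identifier to the free-text description of its
    best database match; matching is case-insensitive.
    """
    return [
        orf_id
        for orf_id, description in best_hits.items()
        if any(keyword in description.lower() for keyword in KEYWORDS)
    ]
```

For instance, a hit described as "putative periplasmic binding protein" would be selected, whereas one described as
"DNA gyrase subunit A" would not.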

An algorithm for selecting antigenic and immunogenic Staphylococcus aureus polypeptides including the foregoing
criteria was developed by the present inventors. Use of the algorithm by the inventors to select immunologically useful
Staphylococcus aureus polypeptides resulted in the selection of several ORFs which are predicted to be outer
membrane-associated proteins. These proteins are identified in Table 4, below, and shown in the Sequence Listing as SEQ
ID NOS:5,192 to 5,255. Thus the amino acid sequence of each of several antigenic Staphylococcus aureus polypeptides
listed in Table 4 can be determined, for example, by locating the amino acid sequence of the ORF in the Sequence
Listing. Likewise the polynucleotide sequence encoding each ORF can be found by locating the corresponding
polynucleotide SEQ ID in Tables 1, 2, or 3, and finding the corresponding nucleotide sequence in the sequence listing.

As will be appreciated by those of ordinary skill in the art, although a polypeptide representing an entire ORF may
be the closest approximation to a protein found in vivo, it is not always technically practical to express a complete ORF
in vitro. It may be very challenging to express and purify a highly hydrophobic protein by common laboratory methods.
As a result, the immunologically useful polypeptides described herein as SEQ ID NOS:5,192-5,255 may have been
modified slightly to simplify the production of recombinant protein, and are the preferred embodiments. In general,
nucleotide sequences which encode highly hydrophobic domains, such as those found at the amino terminal signal
sequence, are excluded for enhanced in vitro expression of the polypeptides. Furthermore, any highly hydrophobic
amino acid sequences occurring at the carboxy terminus are also excluded. Such truncated polypeptides include, for
example, the mature forms of the polypeptides expected to exist in nature.

Those of ordinary skill in the art can identify soluble portions of the polypeptides identified in Table 4, and in the
case of truncated polypeptide sequences shown as SEQ ID NOS:5,192-5,255, may obtain the complete predicted amino
acid sequence of each polypeptide by translating the corresponding polynucleotide sequences of the corresponding
ORF listed in Tables 1, 2 and 3 and found in the sequence listing.

Accordingly, polypeptides comprising the complete amino acid sequence of an immunologically useful polypeptide
selected from the group of polypeptides encoded by the ORFs identified in Table 4, or an amino acid sequence at least 95%
identical thereto, preferably at least 97% identical thereto, and most preferably at least 99% identical thereto, form an
embodiment of the invention; in addition, polypeptides comprising an amino acid sequence selected from the group of
amino acid sequences shown in the sequence listing as SEQ ID NOS:5,191-5,255, or an amino acid sequence at least
95% identical thereto, preferably at least 97% identical thereto and most preferably at least 99% identical thereto, form
an embodiment of the invention. Polynucleotides encoding the foregoing polypeptides also form part of the present
invention.

In another aspect, the invention provides a peptide or polypeptide comprising an epitope-bearing portion of a
polypeptide of the invention, particularly those epitope-bearing portions (antigenic regions) identified in Table 4. The
epitope-bearing portion is an immunogenic or antigenic epitope of a polypeptide of the invention. An "immunogenic
epitope" is defined as a part of a protein that elicits an antibody response when the whole protein is the immunogen.
On the other hand, a region of a protein molecule to which an antibody can bind is defined as an "antigenic epitope."
The number of immunogenic epitopes of a protein generally is less than the number of antigenic epitopes. See, for
instance, Geysen et al., Proc. Natl. Acad. Sci. USA 81:3998-4002 (1983).

As to the selection of peptides or polypeptides bearing an antigenic epitope (i.e., that contain a region of a protein
molecule to which an antibody can bind), it is well known in the art that relatively short synthetic peptides that mimic
part of a protein sequence are routinely capable of eliciting an antiserum that reacts with the partially mimicked protein.
See, for instance, Sutcliffe, J. G., Shinnick, T. M., Green, N. and Learner, R. A. (1983) "Antibodies that react with
predetermined sites on proteins", Science, 219:660-666. Peptides capable of eliciting protein-reactive sera are
frequently represented in the primary sequence of a protein, can be characterized by a set of simple chemical rules, and
are confined neither to immunodominant regions of intact proteins (i.e., immunogenic epitopes) nor to the amino or
carboxyl terminals. Antigenic epitope-bearing peptides and polypeptides of the invention are therefore useful to raise
antibodies, including monoclonal antibodies, that bind specifically to a polypeptide of the invention. See, for instance,
Wilson et al., Cell 37:767-778 (1984) at 777.

Antigenic epitope-bearing peptides and polypeptides of the invention preferably contain a sequence of at least
seven, more preferably at least nine and most preferably between about 15 to about 30 amino acids contained within
the amino acid sequence of a polypeptide of the invention. Non-limiting examples of antigenic polypeptides or peptides
that can be used to generate S. aureus specific antibodies include a polypeptide comprising peptides shown in Table
4 below. These polypeptide fragments have been determined to bear antigenic epitopes of indicated S. aureus proteins
by the analysis of the Jameson-Wolf antigenic index, a representative sample of which is shown in Figure 3.

The epitope-bearing peptides and polypeptides of the invention may be produced by any conventional means.
See, e.g., Houghten, R. A. (1985) General method for the rapid solid-phase synthesis of large numbers of peptides:
specificity of antigen-antibody interaction at the level of individual amino acids. Proc. Natl. Acad. Sci. USA 82:
5131-5135; this "Simultaneous Multiple Peptide Synthesis (SMPS)" process is further described in U.S. Patent No.
4,631,211 to Houghten et al. (1986). Epitope-bearing peptides and polypeptides of the invention are used to induce
antibodies according to methods well known in the art. See, for instance, Sutcliffe et al., supra; Wilson et al., supra;
Chow, M. et al., Proc. Natl. Acad. Sci. USA 82:910-914; and Bittle, F. J. et al., J. Gen. Virol. 66:2347-2354 (1985).

Immunogenic epitope-bearing peptides of the invention, i.e., those parts of a protein that elicit an antibody response
when the whole protein is the immunogen, are identified according to methods known in the art. See, for instance,
Geysen et al., supra. Further still, U.S. Patent No. 5,194,392 to Geysen (1990) describes a general method of detecting
or determining the sequence of monomers (amino acids or other compounds) which is a topological equivalent of the
epitope (i.e., a "mimotope") which is complementary to a particular paratope (antigen binding site) of an antibody of
interest. More generally, U.S. Patent No. 4,433,092 to Geysen (1989) describes a method of detecting or determining
a sequence of monomers which is a topographical equivalent of a ligand which is complementary to the ligand binding
site of a particular receptor of interest. Similarly, U.S. Patent No. 5,480,971 to Houghten, R. A. et al. (1996) on
Peralkylated Oligopeptide Mixtures discloses linear C1-C7-alkyl peralkylated oligopeptides and sets and libraries of such
peptides, as well as methods for using such oligopeptide sets and libraries for determining the sequence of a
peralkylated oligopeptide that preferentially binds to an acceptor molecule of interest. Thus, non-peptide analogs of the
epitope-bearing peptides of the invention also can be made routinely by these methods.

Table 4 lists immunologically useful polypeptides identified by an algorithm which locates novel Staphylococcus
aureus outer membrane proteins, as is described above. Also listed are epitopes or "antigenic regions" of each of the
identified polypeptides. The antigenic regions, or epitopes, are delineated by two numbers x-y, where x is the number
of the first amino acid in the open reading frame included within the epitope and y is the number of the last amino acid
in the open reading frame included within the epitope. For example, the first epitope in ORF 168-6 is comprised of
amino acids 36 to 45 of SEQ ID NO:5,192, as is described in Table 4. The inventors have identified several epitopes
for each of the antigenic polypeptides identified in Table 4. Accordingly, forming part of the present invention are
polypeptides comprising an amino acid sequence of one or more antigenic regions identified in Table 4. The invention
further provides polynucleotides encoding such polypeptides.
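
Because the x-y notation counts residues from 1 and includes both end points, extracting an antigenic region from an
ORF translation requires a small index adjustment. The helper below is a sketch; its name and the test sequence are
hypothetical, and no actual sequence from the Sequence Listing is reproduced.

```python
def epitope_subsequence(orf_protein: str, region: str) -> str:
    """Return the residues named by an 'x-y' antigenic region.

    x and y are 1-based and inclusive, following the convention used for
    Table 4, so region '36-45' yields ten residues.
    """
    first_text, last_text = region.split("-")
    first, last = int(first_text), int(last_text)
    if not 1 <= first <= last <= len(orf_protein):
        raise ValueError(f"region {region!r} lies outside the ORF translation")
    # Convert the 1-based inclusive range to a 0-based half-open Python slice.
    return orf_protein[first - 1:last]
```

Applied to a 50-residue placeholder sequence, `epitope_subsequence(seq, "36-45")` returns residues 36 through 45.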

The present invention further includes isolated polypeptides, proteins and nucleic acid molecules which are
substantially equivalent to those herein described. As used herein, substantially equivalent can refer both to nucleic acid
and amino acid sequences, for example a mutant sequence, that varies from a reference sequence by one or more
substitutions, deletions, or additions, the net effect of which does not result in an adverse functional dissimilarity
between reference and subject sequences. For purposes of the present invention, sequences having equivalent biological
activity, and equivalent expression characteristics are considered substantially equivalent. For purposes of determining
equivalence, truncation of the mature sequence should be disregarded.

The invention further provides methods of obtaining homologs from other strains of Staphylococcus aureus, of the
fragments of the Staphylococcus aureus genome of the present invention and homologs of the proteins encoded by
the ORFs of the present invention. As used herein, a sequence or protein of Staphylococcus aureus is defined as a
homolog of a fragment of the Staphylococcus aureus fragments or contigs or a protein encoded by one of the ORFs
of the present invention, if it shares significant homology to one of the fragments of the Staphylococcus aureus genome
of the present invention or a protein encoded by one of the ORFs of the present invention. Specifically, by using the
sequence disclosed herein as a probe or as primers, and techniques such as PCR cloning and colony/plaque
hybridization, one skilled in the art can obtain homologs.

As used herein, two nucleic acid molecules or proteins are said to "share significant homology" if the two contain
regions which possess greater than 85% sequence (amino acid or nucleic acid) homology. Preferred homologs in this
regard are those with more than 90% homology. Especially preferred are those with 93% or more homology. Among
especially preferred homologs those with 95% or more homology are particularly preferred. Very particularly preferred
among these are those with 97% and even more particularly preferred among those are homologs with 99% or more
homology. The most preferred homologs among these are those with 99.9% homology or more. It will be understood
that, among measures of homology, identity is particularly preferred in this regard.
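
The graded thresholds above can be expressed as a lookup on percent identity. The sketch below assumes the two
sequences have already been aligned to equal length by some other tool; the tier labels are shorthand invented for
this example, and the 97% tier is read as "97% or more", which the text does not state explicitly.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of aligned columns with identical residues (gap columns never match)."""
    if len(aligned_a) != len(aligned_b) or not aligned_a:
        raise ValueError("sequences must be pre-aligned, non-empty and of equal length")
    pairs = zip(aligned_a.upper(), aligned_b.upper())
    matches = sum(1 for x, y in pairs if x == y and x != "-")
    return 100.0 * matches / len(aligned_a)


def homology_tier(pct: float) -> str:
    """Map a percent identity onto the tiers described in the text."""
    if pct >= 99.9:
        return "most preferred"
    if pct >= 99.0:
        return "even more particularly preferred"
    if pct >= 97.0:          # assumed to mean 97% or more
        return "very particularly preferred"
    if pct >= 95.0:
        return "particularly preferred"
    if pct >= 93.0:
        return "especially preferred"
    if pct > 90.0:
        return "preferred"
    if pct > 85.0:
        return "significant homology"
    return "below threshold"
```

Two aligned 20-residue sequences differing at one position are 95% identical and therefore fall in the
"particularly preferred" tier under these assumptions.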

Region specific primers or probes derived from the nucleotide sequence provided in SEQ ID NOS:1-5,191 or from
a nucleotide sequence at least 95%, particularly at least 99%, especially at least 99.5% identical to a sequence of SEQ
ID NOS:1-5,191 can be used to prime DNA synthesis and PCR amplification, as well as to identify colonies containing
cloned DNA encoding a homolog. Methods suitable to this aspect of the present invention are well known and have
been described in great detail in many publications such as, for example, Innis et al., PCR PROTOCOLS, Academic
Press, San Diego, CA (1990).

When using primers derived from SEQ ID NOS:1-5,191 or from a nucleotide sequence having an aforementioned
identity to a sequence of SEQ ID NOS:1-5,191, one skilled in the art will recognize that by employing high stringency
conditions (e.g., annealing at 50-60°C in 6X SSPC and 50% formamide, and washing at 50-65°C in 0.5X SSPC) only
sequences which are greater than 75% homologous to the primer will be amplified. By employing lower stringency
conditions (e.g., hybridizing at 35-37°C in 5X SSPC and 40-45% formamide, and washing at 42°C in 0.5X SSPC),
sequences which are greater than 40-50% homologous to the primer will also be amplified.

When using DNA probes derived from SEQ ID NOS:1-5,191, or from a nucleotide sequence having an
aforementioned identity to a sequence of SEQ ID NOS:1-5,191, for colony/plaque hybridization, one skilled in the art will
recognize that by employing high stringency conditions (e.g., hybridizing at 50-65°C in 5X SSPC and 50% formamide, and
washing at 50-65°C in 0.5X SSPC), sequences having regions which are greater than 90% homologous to the probe
can be obtained, and that by employing lower stringency conditions (e.g., hybridizing at 35-37°C in 5X SSPC and
40-45% formamide, and washing at 42°C in 0.5X SSPC), sequences having regions which are greater than 35-45%
homologous to the probe will be obtained.

Any organism can be used as the source for homologs of the present invention so long as the organism naturally
expresses such a protein or contains genes encoding the same. The most preferred organisms for isolating homologs
are bacteria which are closely related to Staphylococcus aureus.

ILLUSTRATIVE USES OF COMPOSITIONS OF THE INVENTION 

Each ORF provided in Tables 1 and 2 is identified with a function by homology to a known gene or polypeptide.
As a result, one skilled in the art can use the polypeptides of the present invention for commercial, therapeutic and
industrial purposes consistent with the type of putative identification of the polypeptide. Such identifications permit one
skilled in the art to use the Staphylococcus aureus ORFs in a manner similar to the known type of sequences for which
the identification is made; for example, to ferment a particular sugar source or to produce a particular metabolite. A
variety of reviews illustrative of this aspect of the invention are available, including the following reviews on the industrial
use of enzymes, for example, BIOCHEMICAL ENGINEERING AND BIOTECHNOLOGY HANDBOOK, 2nd Ed.,
Macmillan Publications, Ltd. NY (1991) and BIOCATALYSTS IN ORGANIC SYNTHESES, Tramper et al., Eds., Elsevier
Science Publishers, Amsterdam, The Netherlands (1985). A variety of exemplary uses that illustrate this and similar
aspects of the present invention are discussed below.

1. Biosynthetic Enzymes

Open reading frames encoding proteins involved in mediating the catalytic reactions involved in intermediary and
macromolecular metabolism, the biosynthesis of small molecules, cellular processes and other functions, including
enzymes involved in the degradation of the intermediary products of metabolism, enzymes involved in central intermediary
metabolism, enzymes involved in respiration, both aerobic and anaerobic, enzymes involved in fermentation, enzymes
involved in ATP proton motive force conversion, enzymes involved in broad regulatory function, enzymes involved in
amino acid synthesis, enzymes involved in nucleotide synthesis, and enzymes involved in cofactor and vitamin synthesis,
can be used for industrial biosynthesis.

The various metabolic pathways present in Staphylococcus aureus can be identified based on absolute nutritional
requirements as well as by examining the various enzymes identified in Tables 1-3 and SEQ ID NOS:1-5,191.

Of particular interest are polypeptides involved in the degradation of intermediary metabolites as well as
non-macromolecular metabolism. Such enzymes include amylases, glucose oxidases, and catalase.

Proteolytic enzymes are another class of commercially important enzymes. Proteolytic enzymes find use in a
number of industrial processes including the processing of flax and other vegetable fibers, in the extraction, clarification
and depectinization of fruit juices, in the extraction of vegetable oil and in the maceration of fruits and vegetables to
give unicellular fruits. A detailed review of the proteolytic enzymes used in the food industry is provided in Rombouts
et al., Symbiosis 21:79 (1986) and Voragen et al. in BIOCATALYSTS IN AGRICULTURAL BIOTECHNOLOGY,
Whitaker et al., Eds., American Chemical Society Symposium Series 389:93 (1989).

The metabolism of sugars is an important aspect of the primary metabolism of Staphylococcus aureus. Enzymes
involved in the degradation of sugars, such as, particularly, glucose, galactose, fructose and xylose, can be used in
industrial fermentation. Some of the important sugar transforming enzymes, from a commercial viewpoint, include
sugar isomerases such as glucose isomerase. Other metabolic enzymes have found commercial use, such as glucose
oxidases which produce ketogulonic acid (KGA). KGA is an intermediate in the commercial production of ascorbic
acid using the Reichstein's procedure, as described in Krueger et al., Biotechnology 6(A), Rhine et al., Eds., Verlag
Press, Weinheim, Germany (1984).

Glucose oxidase (GOD) is commercially available and has been used in purified form as well as in an immobilized
form for the deoxygenation of beer. See, for instance, Hartmeir et al., Biotechnology Letters 1:21 (1979). The most
important application of GOD is the industrial scale fermentation of gluconic acid. There is a market for gluconic acids,
which are used in the detergent, textile, leather, photographic, pharmaceutical, food, feed and concrete industries, as
described, for example, in Bigelis et al., beginning on page 357 in GENE MANIPULATIONS AND FUNGI; Benett et al., Eds.,
Academic Press, New York (1985). In addition to industrial applications, GOD has found applications in medicine for
quantitative determination of glucose in body fluids and, recently, in biotechnology for analyzing syrups from starch and
cellulose hydrolysates. This application is described in Owusu et al., Biochem. et Biophysica Acta 872:83 (1986), for
instance.

The main sweetener used in the world today is sugar which comes from sugar beets and sugar cane. In the field
of industrial enzymes, the glucose isomerase process shows the largest expansion in the market today. Initially, soluble
enzymes were used and later immobilized enzymes were developed (Krueger et al., Biotechnology, The Textbook of
Industrial Microbiology, Sinauer Associated Incorporated, Sunderland, Massachusetts (1990)). Today, the use of
glucose-produced high fructose syrups is by far the largest industrial business using immobilized enzymes. A review of
the industrial use of these enzymes is provided by Jorgensen, Starch 40:307 (1988).

Proteinases, such as alkaline serine proteinases, are used as detergent additives and thus represent one of the
largest volumes of microbial enzymes used in the industrial sector. Because of their industrial importance, there is a
large body of published and unpublished information regarding the use of these enzymes in industrial processes. (See
Faultman et al., Acid Proteases Structure Function and Biology, Tang, J., ed., Plenum Press, New York (1977) and
Godfrey et al., Industrial Enzymes, MacMillan Publishers, Surrey, UK (1983) and Hepner et al., Report Industrial
Enzymes by 1990, Hel Hepner & Associates, London (1986)).

Another class of commercially usable proteins of the present invention are the microbial lipases, described by, for
instance, Macrae et al., Philosophical Transactions of the Chiral Society of London 310:227 (1985) and Poserke,
Journal of the American Oil Chemist Society 61:1758 (1984). A major use of lipases is in the fat and oil industry for the
production of neutral glycerides using lipase catalyzed inter-esterification of readily available triglycerides.
Applications of lipases include the use as a detergent additive to facilitate the removal of fats from fabrics in the course
of the washing procedures.

The use of enzymes, and in particular microbial enzymes, as catalysts for key steps in the synthesis of complex
organic molecules is gaining popularity at a great rate. One area of great interest is the preparation of chiral
intermediates. Preparation of chiral intermediates is of interest to a wide range of synthetic chemists, particularly those
scientists involved with the preparation of new pharmaceuticals, agrochemicals, fragrances and flavors. (See Davies et al.,
Recent Advances in the Generation of Chiral Intermediates Using Enzymes, CRC Press, Boca Raton, Florida (1990)).
The following reactions catalyzed by enzymes are of interest to organic chemists: hydrolysis of carboxylic acid esters,
phosphate esters, amides and nitriles, esterification reactions, trans-esterification reactions, synthesis of amides,
reduction of alkanones and oxoalkanates, oxidation of alcohols to carbonyl compounds, oxidation of sulfides to sulfoxides,
and carbon bond forming reactions such as the aldol reaction.

When considering the use of an enzyme encoded by one of the ORFs of the present invention for biotransformation
and organic synthesis it is sometimes necessary to consider the respective advantages and disadvantages of using a
microorganism as opposed to an isolated enzyme. The pros and cons of using a whole cell system on the one hand or an
isolated partially purified enzyme on the other hand have been described in detail by Bud et al., Chemistry in Britain
(1987), p. 127.

Amino transferases, enzymes involved in the biosynthesis and metabolism of amino acids, are useful in the catalytic
production of amino acids. An advantage of using microbial based enzyme systems is that the amino transferase
enzymes catalyze the stereo-selective synthesis of only L-amino acids and generally possess uniformly high catalytic
rates. A description of the use of amino transferases for amino acid production is provided by Roselle-David, Methods
of Enzymology 126:479 (1987).

Another category of useful proteins encoded by the ORFs of the present invention includes enzymes involved in
nucleic acid synthesis, repair, and recombination. A variety of commercially important enzymes have previously been
isolated from members of Staphylococcus aureus. These include Sau3A and Sau96I.

2. Generation of Antibodies

As described here, the proteins of the present invention, as well as homologs thereof, can be used in a variety
of procedures and methods known in the art which are currently applied to other proteins. The proteins of the present
invention can further be used to generate an antibody which selectively binds the protein. Such antibodies can be
either monoclonal or polyclonal antibodies, as well as fragments of these antibodies, and humanized forms.

The invention further provides antibodies which selectively bind to one of the proteins of the present invention and 
hybridomas which produce these antibodies. A hybridoma is an immortalized cell line which is capable of secreting a 
specific monoclonal antibody. 

In general, techniques for preparing polyclonal and monoclonal antibodies as well as hybridomas capable of
producing the desired antibody are well known in the art (Campbell, A. M., MONOCLONAL ANTIBODY TECHNOLOGY:
LABORATORY TECHNIQUES IN BIOCHEMISTRY AND MOLECULAR BIOLOGY, Elsevier Science Publishers,
Amsterdam, The Netherlands (1984); St. Groth et al., J. Immunol. Methods 35:1-21 (1980); Kohler and Milstein, Nature
256:495-497 (1975)) and include the trioma technique and the human B-cell hybridoma technique (Kozbor et al.,
Immunology Today 4:72 (1983); pgs. 77-96 of Cole et al., in MONOCLONAL ANTIBODIES AND CANCER THERAPY,
Alan R. Liss, Inc. (1985)).

Any animal (mouse, rabbit, etc.) which is known to produce antibodies can be immunized with the pseudogene
polypeptide. Methods for immunization are well known in the art. Such methods include subcutaneous or intraperitoneal
injection of the polypeptide. One skilled in the art will recognize that the amount of the protein encoded by the ORF of
the present invention used for immunization will vary based on the animal which is immunized, the antigenicity of the
peptide and the site of injection.

The protein which is used as an immunogen may be modified or administered in an adjuvant in order to increase
the protein's antigenicity. Methods of increasing the antigenicity of a protein are well known in the art and include, but
are not limited to, coupling the antigen with a heterologous protein (such as globulin or galactosidase) or through the
inclusion of an adjuvant during immunization.

For monoclonal antibodies, spleen cells from the immunized animals are removed, fused with myeloma cells, such
as SP2/0-Ag14 myeloma cells, and allowed to become monoclonal antibody-producing hybridoma cells.

Any one of a number of methods well known in the art can be used to identify the hybridoma cell which produces
an antibody with the desired characteristics. These include screening the hybridomas with an ELISA assay, western
blot analysis, or radioimmunoassay (Lutz et al., Exp. Cell Res. 175: 109-124 (1988)).

Hybridomas secreting the desired antibodies are cloned and the class and subclass are determined using procedures
known in the art (Campbell, A. M., Monoclonal Antibody Technology: Laboratory Techniques in Biochemistry and
Molecular Biology, Elsevier Science Publishers, Amsterdam, The Netherlands (1984)).

Techniques described for the production of single chain antibodies (U. S. Patent 4,946,778) can be adapted to
produce single chain antibodies to proteins of the present invention.

For polyclonal antibodies, antibody-containing antiserum is isolated from the immunized animal and is screened for
the presence of antibodies with the desired specificity using one of the above-described procedures.

The present invention further provides the above-described antibodies in detectably labelled form. Antibodies can
be detectably labelled through the use of radioisotopes, affinity labels (such as biotin, avidin, etc.), enzymatic labels
(such as horseradish peroxidase, alkaline phosphatase, etc.), fluorescent labels (such as FITC or rhodamine, etc.),
paramagnetic atoms, etc. Procedures for accomplishing such labelling are well known in the art (for example, see
Sternberger et al., J. Histochem. Cytochem. 18:315 (1970); Bayer, E. A. et al., Meth. Enzym. 62:308 (1979); Engval,
E. et al., Immunol. 109:129 (1972); Goding, J. W., J. Immunol. Meth. 13:215 (1976)).

The labelled antibodies of the present invention can be used for in vitro, in vivo, and in situ assays to identify cells
or tissues in which a fragment of the Staphylococcus aureus genome is expressed.

The present invention further provides the above-described antibodies immobilized on a solid support. Examples
of such solid supports include plastics such as polycarbonate, complex carbohydrates such as agarose and sepharose,
and acrylic resins such as polyacrylamide and latex beads. Techniques for coupling antibodies to such solid supports
are well known in the art (Weir, D. M. et al., "Handbook of Experimental Immunology" 4th Ed., Blackwell Scientific
Publications, Oxford, England, Chapter 10 (1986); Jacoby, W. D. et al., Meth. Enzym. 34, Academic Press, N. Y. (1974)).
The immobilized antibodies of the present invention can be used for in vitro, in vivo, and in situ assays as well as for
immunoaffinity purification of the proteins of the present invention.

3. Diagnostic Assays and Kits

The present invention further provides methods to identify the expression of one of the ORFs of the present
invention, or homolog thereof, in a test sample, using one of the DFs, antigens or antibodies of the present invention.

In detail, such methods comprise incubating a test sample with one or more of the antibodies, or one or more of
the DFs, or one or more antigens of the present invention and assaying for binding of the DFs, antigens or antibodies
to components within the test sample.

Conditions for incubating a DF, antigen or antibody with a test sample vary. Incubation conditions depend on the
format employed in the assay, the detection methods employed, and the type and nature of the DF or antibody used
in the assay. One skilled in the art will recognize that any one of the commonly available hybridization, amplification
or immunological assay formats can readily be adapted to employ the DFs, antigens or antibodies of the present
invention. Examples of such assays can be found in Chard, T., An Introduction to Radioimmunoassay and Related
Techniques, Elsevier Science Publishers, Amsterdam, The Netherlands (1986); Bullock, G. R. et al., Techniques in
Immunocytochemistry, Academic Press, Orlando, FL, Vol. 1 (1982), Vol. 2 (1983), Vol. 3 (1985); Tijssen, P., Practice
and Theory of Enzyme Immunoassays: Laboratory Techniques in Biochemistry and Molecular Biology, Elsevier Science
Publishers, Amsterdam, The Netherlands (1985); and PCT publication WO95/32291, all of which are hereby
incorporated herein by reference.

The test samples of the present invention include cells, protein or membrane extracts of cells, or biological fluids
such as sputum, blood, serum, plasma, or urine. The test sample used in the above-described method will vary based
on the assay format, nature of the detection method and the tissues, cells or extracts used as the sample to be assayed.
Methods for preparing protein extracts or membrane extracts of cells are well known in the art and can readily be
adapted in order to obtain a sample which is compatible with the system utilized.

In another embodiment of the present invention, kits are provided which contain the necessary reagents to carry
out the assays of the present invention.

Specifically, the invention provides a compartmentalized kit to receive, in close confinement, one or more containers,
which comprises: (a) a first container comprising one of the DFs, antigens or antibodies of the present invention; and
(b) one or more other containers comprising one or more of the following: wash reagents, reagents capable of detecting
presence of a bound DF, antigen or antibody.

In detail, a compartmentalized kit includes any kit in which reagents are contained in separate containers. Such
containers include small glass containers, plastic containers or strips of plastic or paper. Such containers allow one
to efficiently transfer reagents from one compartment to another compartment such that the samples and reagents are
not cross-contaminated, and the agents or solutions of each container can be added in a quantitative fashion from one
compartment to another. Such containers will include a container which will accept the test sample, a container which
contains the antibodies used in the assay, containers which contain wash reagents (such as phosphate buffered saline,
Tris-buffers, etc.), and containers which contain the reagents used to detect the bound antibody, antigen or DF.

Types of detection reagents include labelled nucleic acid probes, labelled secondary antibodies, or in the
alternative, if the primary antibody is labelled, the enzymatic or antibody binding reagents which are capable of reacting with
the labelled antibody. One skilled in the art will readily recognize that the disclosed DFs, antigens and antibodies of the
present invention can be readily incorporated into one of the established kit formats which are well known in the art.

4. Screening Assay for Binding Agents 

Using the isolated proteins of the present invention, the present invention further provides methods of obtaining
and identifying agents which bind to a protein encoded by one of the ORFs of the present invention or to one of the
Staphylococcus aureus fragments and contigs herein described.

In general, such methods comprise steps of:

(a) contacting an agent with an isolated protein encoded by one of the ORFs of the present invention, or an isolated
fragment of the Staphylococcus aureus genome; and

(b) determining whether the agent binds to said protein or said fragment.

The agents screened in the above assay can be, but are not limited to, peptides, carbohydrates, vitamin derivatives,
or other pharmaceutical agents. The agents can be selected and screened at random or rationally selected or designed
using protein modeling techniques.

For random screening, agents such as peptides, carbohydrates, pharmaceutical agents and the like are selected 
at random and are assayed for their ability to bind to the protein encoded by the ORF of the present invention. 

Alternatively, agents may be rationally selected or designed. As used herein, an agent is said to be 'rationally
selected or designed' when the agent is chosen based on the configuration of the particular protein. For example, one
skilled in the art can readily adapt currently available procedures to generate peptides, pharmaceutical agents and the
like capable of binding to a specific peptide sequence in order to generate rationally designed antipeptide peptides
(for example, see Hurby et al., "Application of Synthetic Peptides: Antisense Peptides," in Synthetic Peptides, A User's
Guide, W. H. Freeman, NY (1992), pp. 289-307, and Kaspczak et al., Biochemistry 28:9230-8 (1989)), pharmaceutical
agents, or the like.

In addition to the foregoing, one class of agents of the present invention, as broadly described, can be used to
control gene expression through binding to one of the ORFs or EMFs of the present invention. As described above,
such agents can be randomly screened or rationally designed/selected. Targeting the ORF or EMF allows a skilled
artisan to design sequence-specific or element-specific agents, modulating the expression of either a single ORF or
multiple ORFs which rely on the same EMF for expression control.

One class of DNA binding agents are agents which contain base residues which hybridize or form a triple helix by
binding to DNA or RNA. Such agents can be based on the classic phosphodiester, ribonucleic acid backbone, or can
be a variety of sulfhydryl or polymeric derivatives which have base attachment capacity.

Agents suitable for use in these methods usually contain 20 to 40 bases and are designed to be complementary
to a region of the gene involved in transcription (triple helix - see Lee et al., Nucl. Acids Res. 6:3073 (1979); Cooney
et al., Science 241:456 (1988); and Dervan et al., Science 251:1360 (1991)) or to the mRNA itself (antisense - Okano,
J. Neurochem. 56:560 (1991); Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression, CRC Press, Boca
Raton, FL (1988)). Triple helix formation optimally results in a shut-off of RNA transcription from DNA, while antisense
RNA hybridization blocks translation of an mRNA molecule into polypeptide. Both techniques have been demonstrated
to be effective in model systems. Information contained in the sequences of the present invention can be used to design
antisense and triple helix-forming oligonucleotides, and other DNA binding agents.
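
As a rough illustration of the antisense case described above, the following Python sketch derives an oligonucleotide complementary to a 20 to 40 base region of an mRNA. It is only a sketch under stated assumptions: the function name, the length check and the example region are hypothetical and are not taken from the sequences of the invention, and real oligonucleotide design would also weigh factors (target accessibility, specificity, chemistry) that are not modelled here.

```python
# Hypothetical helper: builds the DNA oligonucleotide that is complementary
# (and antiparallel) to a chosen region of an mRNA.
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def antisense_oligo(target_region: str) -> str:
    """Return the reverse complement of an mRNA (or DNA) region as DNA."""
    region = target_region.upper().replace("U", "T")
    # The text states that suitable agents usually contain 20 to 40 bases.
    if not 20 <= len(region) <= 40:
        raise ValueError("target region should contain 20 to 40 bases")
    if set(region) - set(COMPLEMENT):
        raise ValueError("target region contains non-nucleotide characters")
    return "".join(COMPLEMENT[base] for base in reversed(region))


# Made-up 22-base region, used purely for illustration.
hypothetical_region = "AUGGCUAAAGGUGAACUGAUCC"
oligo = antisense_oligo(hypothetical_region)
print(oligo)
```

Applying the function twice returns the original region (written as DNA), which is a quick check that the oligonucleotide is a true reverse complement.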

5. Pharmaceutical Compositions and Vaccines 


The present invention further provides pharmaceutical agents which can be used to modulate the growth or
pathogenicity of Staphylococcus aureus, or another related organism, in vivo or in vitro. As used herein, a "pharmaceutical
agent" is defined as a composition of matter which can be formulated using known techniques to provide a
pharmaceutical composition. As used herein, the 'pharmaceutical agents of the present invention' refers to the pharmaceutical
agents which are derived from the proteins encoded by the ORFs of the present invention or are agents which are
identified using the herein described assays.

As used herein, a pharmaceutical agent is said to 'modulate the growth or pathogenicity of Staphylococcus aureus
or a related organism, in vivo or in vitro,' when the agent reduces the rate of growth, rate of division, or viability of the
organism in question. The pharmaceutical agents of the present invention can modulate the growth or pathogenicity
of an organism in many fashions, although an understanding of the underlying mechanism of action is not needed to
practice the use of the pharmaceutical agents of the present invention. Some agents will modulate the growth or
pathogenicity by binding to an important protein, thus blocking the biological activity of the protein, while other agents may
bind to a component of the outer surface of the organism, blocking attachment or rendering the organism more prone
to attack by the body's natural immune system. Alternatively, the agent may comprise a protein encoded by one of the ORFs
of the present invention and serve as a vaccine. The development and use of vaccines derived from membrane-associated
polypeptides are well known in the art. The inventors have identified particularly preferred immunogenic
Staphylococcus aureus polypeptides for use as vaccines. Such immunogenic polypeptides are described above and
summarized in Table 4, below.

As used herein, a "related organism" is a broad term which refers to any organism whose growth or pathogenicity
can be modulated by one of the pharmaceutical agents of the present invention. In general, such an organism will
contain a homolog of the protein which is the target of the pharmaceutical agent or the protein used as a vaccine. As
such, related organisms do not need to be bacterial but may be fungal or viral pathogens.

The pharmaceutical agents and compositions of the present invention may be administered in a convenient
manner, such as by the oral, topical, intravenous, intraperitoneal, intramuscular, subcutaneous, intranasal or intradermal
routes. The pharmaceutical compositions are administered in an amount which is effective for treatment and/or
prophylaxis of the specific indication. In general, they are administered in an amount of at least about 1 mg/kg body weight
and in most cases they will be administered in an amount not in excess of about 1 g/kg body weight per day. In most
cases, the dosage is from about 0.1 mg/kg to about 10 g/kg body weight daily, taking into account the routes of
administration, symptoms, etc.

The agents of the present invention can be used in native form or can be modified to form a chemical derivative.
As used herein, a molecule is said to be a "chemical derivative" of another molecule when it contains additional chemical
moieties not normally a part of the molecule. Such moieties may improve the molecule's solubility, absorption, biological
half-life, etc. The moieties may alternatively decrease the toxicity of the molecule, eliminate or attenuate any undesirable
side effect of the molecule, etc. Moieties capable of mediating such effects are disclosed in, among other sources,
REMINGTON'S PHARMACEUTICAL SCIENCES (1980) cited elsewhere herein.

For example, such moieties may change an immunological character of the functional derivative, such as affinity
for a given antibody. Such changes in immunomodulation activity are measured by the appropriate assay, such as a
competitive type immunoassay. Modifications of such protein properties as redox or thermal stability, biological
half-life, hydrophobicity, susceptibility to proteolytic degradation or the tendency to aggregate with carriers or into multimers
also may be effected in this way and can be assayed by methods well known to the skilled artisan.

The therapeutic effects of the agents of the present invention may be obtained by providing the agent to a patient
by any suitable means (e.g., inhalation, intravenously, intramuscularly, subcutaneously, enterally, or parenterally). It is
preferred to administer the agent of the present invention so as to achieve an effective concentration within the blood
or tissue in which the growth of the organism is to be controlled. To achieve an effective blood concentration, the
preferred method is to administer the agent by injection. The administration may be by continuous infusion, or by single
or multiple injections.

In providing a patient with one of the agents of the present invention, the dosage of the administered agent will
vary depending upon such factors as the patient's age, weight, height, sex, general medical condition, previous medical
history, etc. In general, it is desirable to provide the recipient with a dosage of agent which is in the range of from about
1 pg/kg to 10 mg/kg (body weight of patient), although a lower or higher dosage may be administered. The
therapeutically effective dose can be lowered by using combinations of the agents of the present invention or another agent.

As used herein, two or more compounds or agents are said to be administered "in combination" with each other
when either (1) the physiological effects of each compound, or (2) the serum concentrations of each compound can
be measured at the same time. The composition of the present invention can be administered concurrently with, prior
to, or following the administration of the other agent.

The agents of the present invention are intended to be provided to recipient subjects in an amount sufficient to
decrease the rate of growth (as defined above) of the target organism.

The administration of the agent(s) of the invention may be for either a "prophylactic" or "therapeutic" purpose.
When provided prophylactically, the agent(s) are provided in advance of any symptoms indicative of the organism's
growth. The prophylactic administration of the agent(s) serves to prevent, attenuate, or decrease the rate of onset of
any subsequent infection. When provided therapeutically, the agent(s) are provided at (or shortly after) the onset of an
indication of infection. The therapeutic administration of the compound(s) serves to attenuate the pathological
symptoms of the infection and to increase the rate of recovery.

The agents of the present invention are administered to a subject, such as a mammal, or a patient, in a
pharmaceutically acceptable form and in a therapeutically effective concentration. A composition is said to be
"pharmacologically acceptable" if its administration can be tolerated by a recipient patient. Such an agent is said to be administered
in a "therapeutically effective amount" if the amount administered is physiologically significant. An agent is
physiologically significant if its presence results in a detectable change in the physiology of a recipient patient.

The agents of the present invention can be formulated according to known methods to prepare pharmaceutically
useful compositions, whereby these materials, or their functional derivatives, are combined in admixture with a
pharmaceutically acceptable carrier vehicle. Suitable vehicles and their formulation, inclusive of other human proteins,
e.g., human serum albumin, are described, for example, in REMINGTON'S PHARMACEUTICAL SCIENCES, 16th Ed.,
Osol, A., Ed., Mack Publishing, Easton PA (1980). In order to form a pharmaceutically acceptable composition suitable
for effective administration, such compositions will contain an effective amount of one or more of the agents of the
present invention, together with a suitable amount of carrier vehicle.

Additional pharmaceutical methods may be employed to control the duration of action. Controlled release preparations
may be achieved through the use of polymers to complex or absorb one or more of the agents of the present invention.
The controlled delivery may be effectuated by a variety of well known techniques, including formulation with
macromolecules such as, for example, polyesters, polyamino acids, polyvinylpyrrolidone, ethylenevinylacetate,
methylcellulose, carboxymethylcellulose, or protamine sulfate, adjusting the concentration of the macromolecules and the agent
in the formulation, and by appropriate use of methods of incorporation, which can be manipulated to effectuate a desired
time course of release. Another possible method to control the duration of action by controlled release preparations is
to incorporate agents of the present invention into particles of a polymeric material such as polyesters, polyamino
acids, hydrogels, poly(lactic acid) or ethylene vinylacetate copolymers. Alternatively, instead of incorporating these
agents into polymeric particles, it is possible to entrap these materials in microcapsules prepared, for example, by
coacervation techniques or by interfacial polymerization with, for example, hydroxymethylcellulose or
gelatine-microcapsules and poly(methylmethacrylate) microcapsules, respectively, or in colloidal drug delivery systems, for example,
liposomes, albumin microspheres, microemulsions, nanoparticles, and nanocapsules, or in macroemulsions. Such
techniques are disclosed in REMINGTON'S PHARMACEUTICAL SCIENCES (1980).

The invention further provides a pharmaceutical pack or kit comprising one or more containers filled with one or
more of the ingredients of the pharmaceutical compositions of the invention. Associated with such container(s) can be
a notice in the form prescribed by a governmental agency regulating the manufacture, use or sale of pharmaceuticals
or biological products, which notice reflects approval by the agency of manufacture, use or sale for human
administration.

In addition, the agents of the present invention may be employed in conjunction with other therapeutic compounds.

6. Shot-Gun Approach to Megabase DNA Sequencing

The present invention further demonstrates that a large sequence can be sequenced using a random shotgun 
approach. This procedure, described in detail in the examples that follow, has eliminated the up front cost of isolating 
and ordering overlapping or contiguous subclones prior to the start of the sequencing protocols. 

Certain aspects of the present invention are described in greater detail in the examples that follow. The examples
are provided by way of illustration. Other aspects and embodiments of the present invention are contemplated by the
inventors, as will be clear to those of skill in the art from reading the present disclosure.

ILLUSTRATIVE EXAMPLES

LIBRARIES AND SEQUENCING 

1. Shotgun Sequencing Probability Analysis

The overall strategy for a shotgun approach to whole genome sequencing follows from the Lander and Waterman
(Lander and Waterman, Genomics 2: 231 (1988)) application of the equation for the Poisson distribution. According
to this treatment, the probability, $P_0$, that any given base in a sequence of size L, in nucleotides, is not sequenced after
a certain amount, n, in nucleotides, of random sequence has been determined can be calculated by the equation
$P_0 = e^{-m}$, where m is n/L, the fold coverage. For instance, for a genome of 2.8 Mb, m = 1 when 2.8 Mb of sequence has
been randomly generated (1X coverage). At that point, $P_0 = e^{-1} = 0.37$. The probability that any given base has not
been sequenced is the same as the probability that any region of the whole sequence L has not been determined and,
therefore, is equivalent to the fraction of the whole sequence that has yet to be determined. Thus, at one-fold coverage,
approximately 37% of a polynucleotide of size L, in nucleotides, has not been sequenced. When 14 Mb of sequence
has been generated, coverage is 5X for a 2.8 Mb genome and the unsequenced fraction drops to 0.0067, or 0.67%. 5X coverage
of a 2.8 Mb sequence can be attained by sequencing approximately 17,000 random clones from both insert ends with
an average sequence read length of 410 bp.
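
The figures in this paragraph can be checked numerically. The short Python sketch below evaluates the unsequenced fraction for a given amount of random sequence, taking the fold coverage as the amount of sequence generated divided by the genome size (which is what the 1X and 5X examples imply). The function and variable names are illustrative choices, not part of the original disclosure.

```python
import math


def unsequenced_fraction(bases_sequenced: float, genome_size: float) -> float:
    """Poisson estimate of the fraction of the genome not yet sequenced."""
    fold_coverage = bases_sequenced / genome_size  # m
    return math.exp(-fold_coverage)                # P0 = e^(-m)


GENOME_SIZE = 2.8e6  # bp, as stated in the text

p0_at_1x = unsequenced_fraction(2.8e6, GENOME_SIZE)  # 2.8 Mb of random sequence
p0_at_5x = unsequenced_fraction(14e6, GENOME_SIZE)   # 14 Mb of random sequence

# 17,000 clones, read from both insert ends, 410 bp per read on average
bases_from_clones = 17_000 * 2 * 410
coverage_from_clones = bases_from_clones / GENOME_SIZE

print(f"unsequenced at 1X: {p0_at_1x:.2f}")
print(f"unsequenced at 5X: {p0_at_5x:.4f}")
print(f"coverage from 17,000 clones: {coverage_from_clones:.2f}X")
```

The 17,000-clone figure gives slightly under 14 Mb of raw sequence, so the resulting coverage is just below 5X, consistent with the word "approximately" in the text.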

Similarly, the total gap length, G, is determined by the equation $G = Le^{-m}$, and the average gap size, g, follows the
equation $g = L/n$, where n in this equation is the number of random sequence reads. Thus, 5X coverage leaves about
240 gaps averaging about 82 bp in size in a sequence of a polynucleotide 2.8 Mb long.
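
The gap estimates can be reproduced with the same inputs. The sketch below is a reading of the formulas rather than a definitive implementation: it assumes that the divisor in the average gap size formula is the number of sequence reads (with a divisor in nucleotides the ratio would not be a length), and it takes the read count and read length from the preceding paragraph (17,000 clones read from both ends, 410 bp per read). Variable names are illustrative.

```python
import math

GENOME_SIZE = 2.8e6        # L, in bp
READ_LENGTH = 410          # average read length, in bp
NUMBER_OF_READS = 17_000 * 2  # both insert ends of 17,000 clones

fold_coverage = NUMBER_OF_READS * READ_LENGTH / GENOME_SIZE   # m, close to 5

total_gap_length = GENOME_SIZE * math.exp(-fold_coverage)     # G = L * e^(-m)
average_gap_size = GENOME_SIZE / NUMBER_OF_READS              # g = L / (number of reads)
estimated_gap_count = total_gap_length / average_gap_size

print(f"total gap length: {total_gap_length:.0f} bp")
print(f"average gap size: {average_gap_size:.0f} bp")
print(f"estimated number of gaps: {estimated_gap_count:.0f}")
```

With these inputs the average gap comes out at about 82 bp and the gap count in the low 230s, the same order as the roughly 240 gaps quoted in the text.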

The treatment above is essentially that of Lander and Waterman, Genomics 2: 231 (1988).

2. Random Library Construction 


In order to approximate the random model described above during actual sequencing, a nearly ideal library of 
cloned genomic fragments is required. The following library construction procedure was developed to achieve this end. 

Staphylococcus aureus DNA was prepared by phenol extraction. A mixture containing 600 ug DNA in 3.3 ml of
300 mM sodium acetate, 10 mM Tris-HCl, 1 mM Na-EDTA, 30% glycerol was sonicated for 1 min. at 0°C in a Branson
Model 450 Sonicator at the lowest energy setting using a 3 mm probe. The sonicated DNA was ethanol precipitated
and redissolved in 500 ul TE buffer.

To create blunt ends, a 100 ul aliquot of the resuspended DNA was digested with 5 units of BAL31 nuclease (New
England BioLabs) for 10 min at 30°C in 200 ul BAL31 buffer. The digested DNA was phenol-extracted,
ethanol-precipitated, redissolved in 100 ul TE buffer, and then size-fractionated by electrophoresis through a 1.0% low melting
temperature agarose gel. The section containing DNA fragments 1.6-2.0 kb in size was excised from the gel, the
LGT agarose was melted, and the resulting solution was extracted with phenol to separate the agarose from the DNA.
DNA was ethanol precipitated and redissolved in 20 ul of TE buffer for ligation to vector.

A two-step ligation procedure was used to produce a plasmid library with 97% inserts, of which >99% were single
inserts. The first ligation mixture (50 ul) contained 2 ug of DNA fragments, 2 ug pUC18 DNA (Pharmacia) cut with SmaI
and dephosphorylated with bacterial alkaline phosphatase, and 10 units of T4 ligase (GIBCO/BRL) and was incubated
at 14°C for 4 hr. The ligation mixture then was phenol extracted and ethanol precipitated, and the precipitated DNA
was dissolved in 20 ul TE buffer and electrophoresed on a 1.0% low melting agarose gel. Discrete bands in a ladder
were visualized by ethidium bromide staining and UV illumination and identified by size as insert (i), vector (v), v+i,
v+2i, v+3i, etc. The portion of the gel containing v+i DNA was excised and the v+i DNA was recovered and resuspended
into 20 ul TE. The v+i DNA then was blunt-ended by T4 polymerase treatment for 5 min. at 37°C in a reaction mixture
(50 ul) containing the v+i linears, 500 uM each of the 4 dNTPs, and 9 units of T4 polymerase (New England BioLabs),
under recommended buffer conditions. After phenol extraction and ethanol precipitation the repaired v+i linears were
dissolved in 20 ul TE. The final ligation to produce circles was carried out in a 50 ul reaction containing 5 ul of v+i
linears and 5 units of T4 ligase at 14°C overnight. After 10 min. at 70°C the following day, the reaction mixture was
stored at -20°C.

This two-stage procedure resulted in a molecularly random collection of single-insert plasmid recombinants with 
minimal contamination from double-insert chimeras (<1%) or free vector (<3%). 

Since deviation from randomness can arise from propagation of the DNA in the host, E. coli host cells deficient in all
recombination and restriction functions (A. Greener, Strategies 3 (1):5 (1990)) were used to prevent rearrangements,
deletions, and loss of clones by restriction. Furthermore, transformed cells were plated directly on antibiotic diffusion
plates to avoid the usual broth recovery phase which allows multiplication and selection of the most rapidly growing cells.

Plating was carried out as follows. A 100 ul aliquot of Epicurian Coli SURE II Supercompetent Cells (Stratagene
200152) was thawed on ice and transferred to a chilled Falcon 2059 tube on ice. A 1.7 ul aliquot of 1.42 M
beta-mercaptoethanol was added to the aliquot of cells to a final concentration of 25 mM. Cells were incubated on ice for
10 min. A 1 ul aliquot of the final ligation was added to the cells and incubated on ice for 30 min. The cells were heat
pulsed for 30 sec. at 42°C and placed back on ice for 2 min. The outgrowth period in liquid culture was eliminated
from this protocol in order to minimize the preferential growth of any given transformed cell. Instead, the transformation
mixture was plated directly on a nutrient rich SOB plate containing a 5 ml bottom layer of SOB agar (5% SOB agar:
20 g tryptone, 5 g yeast extract, 0.5 g NaCl, 1.5% Difco Agar per liter of media). The 5 ml bottom layer is supplemented
with 0.4 ml of 50 mg/ml ampicillin per 100 ml SOB agar. The 15 ml top layer of SOB agar is supplemented with 1 ml
X-Gal (2%), 1 ml MgCl2 (1 M), and 1 ml MgSO4/100 ml SOB agar. The 15 ml top layer was poured just prior to plating.
Our titer was approximately 100 colonies/10 ul aliquot of transformation.
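
The working concentrations quoted in the plating protocol follow from simple dilution arithmetic. The Python sketch below is a worked check only: the helper name is hypothetical, and it assumes that volumes are additive and that the stock volumes and concentrations are exactly those stated above.

```python
def final_concentration(stock_conc: float, stock_volume: float, base_volume: float) -> float:
    """Concentration after adding stock_volume of stock to base_volume (same units)."""
    return stock_conc * stock_volume / (stock_volume + base_volume)


# 1.7 ul of 1.42 M (1420 mM) beta-mercaptoethanol added to 100 ul of cells
bme_mM = final_concentration(1420.0, 1.7, 100.0)

# 0.4 ml of 50 mg/ml (50,000 ug/ml) ampicillin added per 100 ml of SOB agar
ampicillin_ug_per_ml = final_concentration(50_000.0, 0.4, 100.0)

print(f"beta-mercaptoethanol: {bme_mM:.1f} mM")
print(f"ampicillin: {ampicillin_ug_per_ml:.0f} ug/ml")
```

Under these assumptions the beta-mercaptoethanol comes out a little under the nominal 25 mM (about 24 mM), and the ampicillin in the bottom layer at roughly 200 ug/ml.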

All colonies were picked for template preparation regardless of size. Thus, only clones lost due to "poison" DNA
or deleterious gene products would be deleted from the library, resulting in a slight increase in gap number over that
expected.

3. Random DNA Sequencing 


High quality double stranded DNA plasmid templates were prepared using an alkaline lysis method developed in
collaboration with 5Prime 3Prime Inc. (Boulder, CO). Plasmid preparation was performed in a 96-well format for all
stages of DNA preparation from bacterial growth through final DNA purification. Average template concentration was
determined by running 25% of the samples on an agarose gel. DNA concentrations were not adjusted.

Templates were also prepared from a Staphylococcus aureus lambda genomic library. An unamplified library was
constructed in Lambda DASH II vector (Stratagene). Staphylococcus aureus DNA (> 100 kb) was partially digested in
a reaction mixture (200 ul) containing 50 ug DNA, 1X Sau3AI buffer, 20 units Sau3AI for 6 min. at 23°C. The digested
DNA was phenol-extracted and centrifuged over a 10-40% sucrose gradient. Fractions containing genomic DNA of
15-25 kb were recovered by precipitation. One ul of fragments was used with 1 ul of DASH II vector (Stratagene) in
the recommended ligation reaction. One ul of the ligation mixture was used per packaging reaction following the
recommended protocol with the Gigapack II XL Packaging Extract. Phage were plated directly without amplification from
the packaging mixture (after dilution with 500 ul of recommended SM buffer and chloroform treatment). Yield was about
2.5x109 pfu/ul.

An amplified library was prepared from the primary packaging mixture according to the manufacturer's protocol.
The amplified library is stored frozen in 7% dimethylsulfoxide. The phage titer is approximately 1x10^ pfu/ml.

Mini-liquid lysates (0.1 ul) are prepared from randomly selected plaques and template is prepared by long range
PCR. Samples are PCR amplified using modified T3 and T7 primers, and Elongase Supermix (LTI).

Sequencing reactions are carried out on plasmid templates using a combination of two workstations (BIOMEK
1000 and Hamilton Microlab 2200) and the Perkin-Elmer 9600 thermocycler with Applied Biosystems PRISM Ready
Reaction Dye Primer Cycle Sequencing Kits for the M13 forward (M13-21) and the M13 reverse (M13RP1) primers.
Dye terminator sequencing reactions are carried out on the lambda templates on a Perkin-Elmer 9600 Thermocycler
using the Applied Biosystems Ready Reaction Dye Terminator Cycle Sequencing kits. Modified T7 and T3 primers are
used to sequence the ends of the inserts from the Lambda DASH II library. Sequencing reactions are run on a combination
of ABI 373 DNA sequencers and ABI 377 DNA sequencers. All of the dye terminator sequencing reactions are analyzed
using the 2X 9 hour module on the ABI 377. Dye primer reactions are analyzed on a combination of ABI 373 and ABI
377 DNA sequencers. The overall sequencing success rate is approximately 85% for M13-21 and M13RP1
sequences and 65% for dye-terminator reactions. The average usable read length is 485 bp for M13-21 sequences,
445 bp for M13RP1 sequences, and 375 bp for dye-terminator reactions.
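
One way to read the success rates and read lengths together is as an expected yield of usable bases per attempted reaction, that is, success rate multiplied by average usable read length. The Python sketch below performs that multiplication for the three reaction types; this combined figure is an illustrative derivation and is not a quantity reported in the text, and the dictionary and function names are hypothetical.

```python
# (approximate success rate, average usable read length in bp), from the text
REACTION_TYPES = {
    "M13-21 dye primer": (0.85, 485),
    "M13RP1 dye primer": (0.85, 445),
    "dye terminator": (0.65, 375),
}


def expected_usable_bases(success_rate: float, read_length_bp: float) -> float:
    """Expected usable bases per attempted reaction (failed reactions count as 0)."""
    return success_rate * read_length_bp


expected_yield = {
    name: expected_usable_bases(rate, length)
    for name, (rate, length) in REACTION_TYPES.items()
}

for name, bases in expected_yield.items():
    print(f"{name}: about {bases:.0f} usable bp per attempted reaction")
```

On this reading, a dye primer reaction yields on the order of 380 to 410 usable bases per attempt, against roughly 240 for a dye terminator reaction.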

4. Protocol for Automated Cycle Sequencing

The sequencing was carried out using Hamilton Microstation 2200, Perkin Elmer 9600 thermocyclers, ABI 373
and ABI 377 Automated DNA Sequencers. The Hamilton combines pre-aliquoted templates and reaction mixes consisting
of deoxy- and dideoxynucleotides, the thermostable Taq DNA polymerase, fluorescently-labelled sequencing
primers, and reaction buffer. Reaction mixes and templates were combined in the wells of a 96-well thermocycling
plate and transferred to the Perkin Elmer 9600 thermocycler. Thirty consecutive cycles of linear amplification (i.e., one
primer synthesis) steps were performed including denaturation, annealing of primer and template, and extension; i.e.,
DNA synthesis. A heated lid with rubber gaskets on the thermocycling plate prevents evaporation without the need for
an oil overlay.
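
The distinction between linear (one-primer) amplification and conventional two-primer PCR can be illustrated numerically. The sketch below assumes an idealized reaction in which every template is extended once per cycle; the function names and the efficiency parameter are illustrative assumptions and are not taken from the protocol.

```python
def linear_products(templates, cycles, efficiency=1.0):
    """One primer: each cycle adds at most one new strand per original template."""
    return templates * cycles * efficiency


def exponential_copies(templates, cycles, efficiency=1.0):
    """Two primers (ordinary PCR): products serve as templates in later cycles."""
    return templates * (1 + efficiency) ** cycles


if __name__ == "__main__":
    cycles = 30  # number of cycles given in the text
    print("linear products per template:", linear_products(1, cycles))
    print("exponential copies per template:", exponential_copies(1, cycles))
```

Under these idealized assumptions thirty cycles yield thirty extension products per template, which is why cycle sequencing increases signal without the exponential accumulation seen in PCR.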

Two sequencing protocols were used: one for dye-labelled primers and a second for dye-labelled dideoxy chain
terminators. The shotgun sequencing involves use of four dye-labelled sequencing primers, one for each of the four
terminator nucleotides. Each dye-primer was labelled with a different fluorescent dye, permitting the four individual
reactions to be combined into one lane of the 373 or 377 DNA Sequencer for electrophoresis, detection, and base-calling.
ABI currently supplies premixed reaction mixes in bulk packages containing all the necessary non-template
reagents for sequencing. Sequencing can be done with both plasmid and PCR-generated templates with both dye-primers
and dye-terminators with approximately equal fidelity, although plasmid templates generally give longer usable
sequences.

Thirty-two reactions were loaded per ABI 373 Sequencer each day and 96 samples can be loaded on an ABI 377
per day. Electrophoresis was run overnight (ABI 373) or for 2 1/2 hours (ABI 377) following the manufacturer's protocols.
Following electrophoresis and fluorescence detection, the ABI 373 or ABI 377 performs automatic lane tracking and
base-calling. The lane-tracking was confirmed visually. Each sequence electropherogram (or fluorescence lane trace)
was inspected visually and assessed for quality. Trailing sequences of low quality were removed and the sequence
itself was loaded via software to a Sybase database (archived daily to 8mm tape). Leading vector polylinker sequence
was removed automatically by a software program. Average edited lengths of sequences from the standard ABI 373
or ABI 377 were around 400 bp and depend mostly on the quality of the template used for the sequencing reaction.
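
The two editing steps described above (removal of leading vector polylinker sequence and of trailing low-quality sequence) can be sketched in code. The text does not specify the software or the quality criterion used, and trailing sequence was assessed visually, so the following is only an illustration under stated assumptions: low quality is approximated by the density of ambiguous base calls ("N") in a sliding window, and the window size, threshold, function names and the example polylinker prefix are all hypothetical.

```python
def strip_leading_vector(read, vector_prefix):
    """Remove a known vector/polylinker prefix if the read starts with it."""
    read = read.upper()
    vector_prefix = vector_prefix.upper()
    if vector_prefix and read.startswith(vector_prefix):
        return read[len(vector_prefix):]
    return read


def trim_trailing_low_quality(read, window=20, max_n=2):
    """Cut the read at the first ambiguous call of the first window that
    contains more than max_n ambiguous calls (assumed quality proxy)."""
    read = read.upper()
    last_start = max(len(read) - window, 0)
    for start in range(last_start + 1):
        if read[start:start + window].count("N") > max_n:
            return read[:read.find("N", start)]
    return read


def edit_read(read, vector_prefix, window=20, max_n=2):
    """Apply both editing steps in the order described in the text."""
    trimmed_front = strip_leading_vector(read, vector_prefix)
    return trim_trailing_low_quality(trimmed_front, window, max_n)


if __name__ == "__main__":
    raw = "GAATTC" + "ACGT" * 10 + "ANNANNTNNANNCNNANNGN"
    print(edit_read(raw, "GAATTC"))  # "GAATTC" is a hypothetical polylinker prefix
```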

INFORMATICS 

1. Data Management 

A number of information management systems for a large-scale sequencing lab have been developed. (For review
see, for instance, Kerlavage et al., Proceedings of the Twenty-Sixth Annual Hawaii International Conference on System
Sciences, IEEE Computer Society Press, Washington D.C., 585 (1993)). The system used to collect and assemble
the sequence data was developed using the Sybase relational database management system and was designed to
automate data flow wherever possible and to reduce user error. The database stores and correlates all information
collected during the entire operation from template preparation to final analysis of the genome. Because the raw output
of the ABI 373 Sequencers was based on a Macintosh platform and the data management system chosen was based
on a Unix platform, it was necessary to design and implement a variety of multi-user, client-server applications which
allow the raw data as well as analysis results to flow seamlessly into the database with a minimum of user effort.

2. Assembly 

An assembly engine (TIGR Assembler) developed for the rapid and accurate assembly of thousands of sequence
fragments was employed to generate contigs. The TIGR Assembler simultaneously clusters and assembles fragments
of the genome. In order to obtain the speed necessary to assemble more than 10^ fragments, the algorithm builds a
hash table of 12 bp oligonucleotide subsequences to generate a list of potential sequence fragment overlaps. The
number of potential overlaps for each fragment determines which fragments are likely to fall into repetitive elements.
Beginning with a single seed sequence fragment, TIGR Assembler extends the current contig by attempting to add
the best matching fragment based on oligonucleotide content. The contig and candidate fragment are aligned using a
modified version of the Smith-Waterman algorithm which provides for optimal gapped alignments (Waterman, M. S.,
Methods in Enzymology 264: 765 (1988)). The contig is extended by the fragment only if strict criteria for the quality
of the match are met. The match criteria include the minimum length of overlap, the maximum length of an unmatched
end, and the minimum percentage match. These criteria are automatically lowered by the algorithm in regions of minimal
coverage and raised in regions with a possible repetitive element. The number of potential overlaps for each fragment
determines which fragments are likely to fall into repetitive elements. Fragments representing the boundaries of repetitive
elements and potentially chimeric fragments are often rejected based on partial mismatches at the ends of alignments
and excluded from the current contig. TIGR Assembler is designed to take advantage of clone size information
coupled with sequencing from both ends of each template. It enforces the constraint that sequence fragments from
two ends of the same template point toward one another in the contig and are located within a certain range of base
pairs (definable for each clone based on the known clone size range for a given library).
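
The use of a hash table of 12 bp oligonucleotides to list potential fragment overlaps can be sketched as follows. This is a simplified illustration and not the TIGR Assembler code: it indexes only the forward strand, ignores sequencing errors, and the function names are assumptions introduced here. Only the oligonucleotide length of 12 is taken from the text.

```python
from collections import defaultdict

K = 12  # oligonucleotide length given in the text


def build_kmer_index(fragments, k=K):
    """Map each k-mer to the set of fragment ids that contain it."""
    index = defaultdict(set)
    for frag_id, seq in fragments.items():
        for i in range(len(seq) - k + 1):
            index[seq[i:i + k]].add(frag_id)
    return index


def candidate_overlaps(fragments, k=K):
    """Count distinct shared k-mers for every pair of fragments sharing any."""
    shared = defaultdict(int)
    for ids in build_kmer_index(fragments, k).values():
        ordered = sorted(ids)
        for pos, first in enumerate(ordered):
            for second in ordered[pos + 1:]:
                shared[(first, second)] += 1
    return dict(shared)


def overlap_partner_counts(pairs, fragment_ids):
    """Number of potential overlap partners per fragment; unusually high
    counts would flag fragments likely to lie in repetitive elements."""
    counts = {frag_id: 0 for frag_id in fragment_ids}
    for first, second in pairs:
        counts[first] += 1
        counts[second] += 1
    return counts


if __name__ == "__main__":
    genome = "ACGTTGCAAGGCTTAACCGGTATCGATTGACCATGGCAATCGGTTAAC"
    frags = {"f1": genome[0:30], "f2": genome[15:45], "f3": "T" * 24}
    pairs = candidate_overlaps(frags)
    print(pairs)
    print(overlap_partner_counts(pairs, frags))
```

Pairs reported by such an index are only candidates; in the scheme described above each candidate is then aligned and accepted or rejected against the match criteria.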

3. Identifying Genes 

The predicted coding regions of the Staphylococcus aureus genome were initially defined with the program zorf,
which finds ORFs of a minimum length. The predicted coding region sequences were used in searches against a
database of all Staphylococcus aureus nucleotide sequences from GenBank (release 92.0), using the BLASTN search
method to identify overlaps of 50 or more nucleotides with at least 95% identity. Those ORFs with nucleotide sequence
matches are shown in Table 1. The ORFs without such matches were translated to protein sequences and compared
to a non-redundant database of known proteins generated by combining the Swiss-prot, PIR and GenPept
databases. ORFs of at least 80 amino acids that matched a database protein with BLASTP probability less than or
equal to 0.01 are shown in Table 2. The table also lists assigned functions based on the closest match in the databases.
ORFs of at least 120 amino acids that did not match protein or nucleotide sequences in the databases at these levels
are shown in Table 3.
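
The two steps described above, finding ORFs of a minimum length and sorting them into Tables 1 to 3 by database match, can be sketched as follows. The program zorf is not described in detail in the text, so this sketch makes its own assumptions: an ORF is taken to be a stop-free stretch of codons in any of the six reading frames, coordinates are reported on the strand scanned, and all function names and argument conventions are illustrative. Only the thresholds (50 nucleotides, 95% identity, 80 and 120 amino acids, probability 0.01) come from the text.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def reverse_complement(seq):
    """Reverse complement of an upper-case A/C/G/T sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def orfs_in_frame(seq, frame, min_codons):
    """Stop-free stretches of at least min_codons codons in one frame."""
    orfs = []
    start = frame
    for pos in range(frame, len(seq) - 2, 3):
        if seq[pos:pos + 3] in STOP_CODONS:
            if (pos - start) // 3 >= min_codons:
                orfs.append((start, pos))
            start = pos + 3
    end = frame + ((len(seq) - frame) // 3) * 3  # last complete codon boundary
    if (end - start) // 3 >= min_codons:
        orfs.append((start, end))
    return orfs


def find_orfs(seq, min_codons):
    """Return (strand, frame, start, end) tuples over all six frames."""
    seq = seq.upper()
    found = []
    for strand, strand_seq in (("+", seq), ("-", reverse_complement(seq))):
        for frame in range(3):
            for start, end in orfs_in_frame(strand_seq, frame, min_codons):
                found.append((strand, frame, start, end))
    return found


def assign_table(orf_aa_len, nt_hit=None, protein_p=None):
    """nt_hit: (overlap length in nt, percent identity) of the best BLASTN hit,
    or None. protein_p: BLASTP probability of the best protein hit, or None."""
    if nt_hit is not None and nt_hit[0] >= 50 and nt_hit[1] >= 95.0:
        return "Table 1"
    if protein_p is not None and protein_p <= 0.01 and orf_aa_len >= 80:
        return "Table 2"
    if orf_aa_len >= 120:
        return "Table 3"
    return None


if __name__ == "__main__":
    print(find_orfs("ATG" + "GCT" * 5 + "TAA", min_codons=5))
    print(assign_table(150, nt_hit=(60, 97.0)))
```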

ILLUSTRATIVE APPLICATIONS 


1. Production of an Antibody to a Staphylococcus aureus Protein 

Substantially pure protein or polypeptide is isolated from the transfected or transformed cells using any one of the
methods known in the art. The protein can also be produced in a recombinant prokaryotic expression system, such as
E. coli, or can be chemically synthesized. Concentration of protein in the final preparation is adjusted, for example, by
concentration on an Amicon filter device, to the level of a few micrograms/ml. Monoclonal or polyclonal antibody to the
protein can then be prepared as follows.

2. Monoclonal Antibody Production by Hybridoma Fusion 

Monoclonal antibody to epitopes of any of the peptides identified and isolated as described can be prepared from
murine hybridomas according to the classical method of Kohler, G. and Milstein, C., Nature 256:495 (1975) or modifications
of the methods thereof. Briefly, a mouse is repetitively inoculated with a few micrograms of the selected protein
over a period of a few weeks. The mouse is then sacrificed, and the antibody producing cells of the spleen isolated.
The spleen cells are fused by means of polyethylene glycol with mouse myeloma cells, and the excess unfused cells
destroyed by growth of the system on selective media comprising aminopterin (HAT media). The successfully fused
cells are diluted and aliquots of the dilution placed in wells of a microtiter plate where growth of the culture is continued.
Antibody-producing clones are identified by detection of antibody in the supernatant fluid of the wells by immunoassay
procedures, such as ELISA, as originally described by Engvall, E., Meth. Enzymol. 70:419 (1980), and modified methods
thereof. Selected positive clones can be expanded and their monoclonal antibody product harvested for use.
Detailed procedures for monoclonal antibody production are described in Davis, L. et al., Basic Methods in Molecular
Biology, Elsevier, New York, Section 21-2 (1989).

3. Polyclonal Antibody Production by Immunization 

Polyclonal antiserum containing antibodies to heterogeneous epitopes of a single protein can be prepared by immunizing
suitable animals with the expressed protein described above, which can be unmodified or modified to enhance
immunogenicity. Effective polyclonal antibody production is affected by many factors related both to the antigen and
the host species. For example, small molecules tend to be less immunogenic than others and may require the use of
carriers and adjuvant. Also, host animals vary in response to site of inoculations and dose, with both inadequate and
excessive doses of antigen resulting in low titer antisera. Small doses (ng level) of antigen administered at multiple
intradermal sites appear to be most reliable. An effective immunization protocol for rabbits can be found in Vaitukaitis,
J. et al., J. Clin. Endocrinol. Metab. 33:988-991 (1971).

Booster injections can be given at regular intervals, and antiserum harvested when antibody titer thereof, as determined
semi-quantitatively, for example, by double immunodiffusion in agar against known concentrations of the
antigen, begins to fall. See, for example, Ouchterlony, O. et al., Chap. 19 in: Handbook of Experimental Immunology,
Wier, D., ed., Blackwell (1973). Plateau concentration of antibody is usually in the range of 0.1 to 0.2 mg/ml of serum
(about 12 uM). Affinity of the antisera for the antigen is determined by preparing competitive binding curves, as described,
for example, by Fisher, D., Chap. 42 in: Manual of Clinical Immunology, second edition, Rose and Friedman, eds., Amer.
Soc. for Microbiology, Washington, D.C. (1980).

Antibody preparations prepared according to either protocol are useful in quantitative immunoassays which determine
concentrations of antigen-bearing substances in biological samples; they are also used semi-quantitatively
or qualitatively to identify the presence of antigen in a biological sample. In addition, they are useful in various animal
models of Staphylococcal disease known to those of skill in the art as a means of evaluating the protein used to make
the antibody as a potential vaccine target or as a means of evaluating the antibody as a potential immunotherapeutic
reagent.

3. Preparation of PCR Primers and Amplification of DNA

Various fragments of the Staphylococcus aureus genome, such as those of Tables 1-3 and SEQ ID NOS: 1-5,191,
can be used, in accordance with the present invention, to prepare PCR primers for a variety of uses. The PCR primers
are preferably at least 15 bases, and more preferably at least 18 bases in length. When selecting a primer sequence,
it is preferred that the primer pairs have approximately the same G/C ratio, so that melting temperatures are approximately
the same. The PCR primers and amplified DNA of this Example find use in the Examples that follow.
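
The primer preferences stated above (minimum length and similar G/C ratio within a pair) can be checked with a short script. The text gives no melting-temperature formula, so the sketch below substitutes the common Wallace rule of thumb for short oligonucleotides (2 degrees C per A or T, 4 degrees C per G or C) as an assumption; the function names and the tolerance on G/C difference are likewise illustrative.

```python
def gc_fraction(primer):
    """Fraction of G and C bases in the primer."""
    primer = primer.upper()
    return (primer.count("G") + primer.count("C")) / len(primer)


def wallace_tm(primer):
    """Rule-of-thumb melting temperature in degrees C (assumed formula)."""
    primer = primer.upper()
    at = primer.count("A") + primer.count("T")
    gc = primer.count("G") + primer.count("C")
    return 2 * at + 4 * gc


def check_primer_pair(forward, reverse, min_len=15, max_gc_diff=0.1):
    """Return a list of problems; an empty list means the pair passes."""
    problems = []
    for name, primer in (("forward", forward), ("reverse", reverse)):
        if len(primer) < min_len:
            problems.append(f"{name} primer shorter than {min_len} bases")
    if abs(gc_fraction(forward) - gc_fraction(reverse)) > max_gc_diff:
        problems.append("G/C ratios of the pair differ too much")
    return problems


if __name__ == "__main__":
    fwd = "ATGCATGCATGCATGCAT"  # illustrative sequence, not from the listing
    rev = "GCGCGCGCGCGCGCGCGC"  # illustrative sequence, not from the listing
    print(wallace_tm(fwd), wallace_tm(rev))
    print(check_primer_pair(fwd, rev))
```
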
4. Gene expression from DNA Sequences Corresponding to ORFs 

A fragment of the Staphylococcus aureus genome provided in Tables 1-3 is introduced into an expression vector
using conventional technology. Techniques to transfer cloned sequences into expression vectors that direct protein
translation in mammalian, yeast, insect or bacterial expression systems are well known in the art. Commercially available
vectors and expression systems are available from a variety of suppliers including Stratagene (La Jolla, California),
Promega (Madison, Wisconsin), and Invitrogen (San Diego, California). If desired, to enhance expression and facilitate
proper protein folding, the codon context and codon pairing of the sequence may be optimized for the particular expression
organism, as explained by Hatfield et al., U.S. Patent No. 5,082,767, incorporated herein by this reference.

The following is provided as one exemplary method to generate polypeptide(s) from cloned ORFs of the Staphylococcus
aureus genome fragment. Bacterial ORFs generally lack a poly A addition signal. The addition signal sequence
can be added to the construct by, for example, splicing out the poly A addition sequence from pSG5 (Stratagene) using
BglI and SalI restriction endonuclease enzymes and incorporating it into the mammalian expression vector pXT1 (Stratagene)
for use in eukaryotic expression systems. pXT1 contains the LTRs and a portion of the gag gene of Moloney
Murine Leukemia Virus. The positions of the LTRs in the construct allow efficient stable transfection. The vector includes
the Herpes Simplex thymidine kinase promoter and the selectable neomycin gene. The Staphylococcus aureus DNA
is obtained by PCR from the bacterial vector using oligonucleotide primers complementary to the Staphylococcus
aureus DNA and containing restriction endonuclease sequences for PstI incorporated into the 5' primer and BglII at
the 5' end of the corresponding Staphylococcus aureus DNA 3' primer, taking care to ensure that the Staphylococcus
aureus DNA is positioned such that it is followed by the poly A addition sequence. The purified fragment obtained from
the resulting PCR reaction is digested with PstI, blunt ended with an exonuclease, digested with BglII, purified and
ligated to pXT1, now containing a poly A addition sequence and digested with BglII.
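
The construction of primers carrying restriction sites at their 5' ends can be sketched as follows. The recognition sequences used (CTGCAG for PstI and AGATCT for BglII) are the standard ones; everything else, including the function names, the annealing length of 18 bases, the optional extra 5' bases and the example target sequence, is an illustrative assumption and not part of the described method.

```python
SITES = {"PstI": "CTGCAG", "BglII": "AGATCT"}  # standard recognition sequences


def reverse_complement(seq):
    """Reverse complement of an upper-case A/C/G/T sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def tailed_primers(target, anneal_len=18, fwd_site="PstI", rev_site="BglII", clamp=""):
    """Build a 5' primer and a 3' primer, each with a restriction site at its
    5' end. clamp is an optional run of extra 5' bases (hypothetical option)."""
    target = target.upper()
    if len(target) < anneal_len:
        raise ValueError("target shorter than the annealing length")
    forward = clamp + SITES[fwd_site] + target[:anneal_len]
    reverse = clamp + SITES[rev_site] + reverse_complement(target[-anneal_len:])
    return forward, reverse


def internal_sites(target, enzymes=("PstI", "BglII")):
    """Count recognition sites inside the target, since internal sites would
    be cut during digestion of the PCR product."""
    target = target.upper()
    return {enzyme: target.count(SITES[enzyme]) for enzyme in enzymes}


if __name__ == "__main__":
    orf = "ATGACCGGTAAAGTTCCGGAAACCATTGGCTAA"  # illustrative sequence only
    fwd, rev = tailed_primers(orf)
    print(fwd)
    print(rev)
    print(internal_sites(orf))
```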

The ligated product is transfected into mouse NIH 3T3 cells using Lipofectin (Life Technologies, Inc., Grand Island,
New York) under conditions outlined in the product specification. Positive transfectants are selected after growing the
transfected cells in 600 ug/ml G418 (Sigma, St. Louis, Missouri). The protein is preferably released into the supernatant.
However, if the protein has membrane binding domains, the protein may additionally be retained within the cell or
expression may be restricted to the cell surface. Since it may be necessary to purify and locate the transfected product,
synthetic 15-mer peptides synthesized from the predicted Staphylococcus aureus DNA sequence are injected into
mice to generate antibody to the polypeptide encoded by the Staphylococcus aureus DNA.

Alternatively, and if antibody production is not possible, the Staphylococcus aureus DNA sequence is additionally
incorporated into eukaryotic expression vectors and expressed as, for example, a globin fusion. Antibody to the globin
moiety then is used to purify the chimeric protein. Corresponding protease cleavage sites are engineered between the
globin moiety and the polypeptide encoded by the Staphylococcus aureus DNA so that the latter may be freed from
the former by simple protease digestion. One useful expression vector for generating globin chimerics is pSG5 (Stratagene).
This vector encodes a rabbit globin. Intron II of the rabbit globin gene facilitates splicing of the expressed
transcript, and the polyadenylation signal incorporated into the construct increases the level of expression. These
techniques are well known to those skilled in the art of molecular biology. Standard methods are published in methods
texts such as Davis et al., cited elsewhere herein, and many of the methods are available from the technical assistance
representatives from Stratagene, Life Technologies, Inc., or Promega. Polypeptides of the invention also may be produced
using in vitro translation systems such as in vitro Express(TM) Translation Kit (Stratagene).

While the present invention has been described in some detail for purposes of clarity and understanding, one
skilled in the art will appreciate that various changes in form and detail can be made without departing from the true
scope of the invention.

All patents, patent applications and publications referred to above are hereby incorporated by reference.
60-73


90-99 


171.11 


5224 


EF 


20-31 


91-110 


j 


53.4 


5225 


• peniciinn binding protein 2b 






95-104 ! 


353_2 


5226 






! 62-71 




743C1 


5227 


29 kDa protein in fimA regi« 




68-79 


94-103 


: 175-184 


342.4 


5228 


Twitchina motility 


1 rv.1 Q 


"TO ov 


83-92 


' 111-121 


59_3 


i 5229 


arahinoaatactan orotetn 


97^106 


132-141 


158-157 


180-189 


70_6 


; 5230 


nodulin 


36-45 


48-57 


» 137-160 


■ 179-188 


129_2 


i 5231 


' alvcerol diester phosphodie 


8-17 


41-50 


55-74 


97-106 


58_5 


5232 


PBP (S. aureus) 


26-35 


70-79 


. 117-126 


; 152-161 


188.3 


5233 


MHC class II analog (S. aure 


72-81 


94-103 


115-124 


i 135-145 


236.6 


5234 


histidine kinase domain (Die 


24-33 


52-67 


81-94 


• 106-121 


310.8 


5235 


clumping factor (S. aureus) 


59-71 


77-86 


93-102 


! 118-127 


601.1 


5236 


novel anti9en/0RF2 (S. aui 


45-54 


91-104 


108-117 


: 185-195 


544.3 


5237 


ORF YJR151c(S. cerevisae: 


76-90 


101-111 


131-140 


154-164 


662.1 


5238 


MHC class 11 analog (S. aure 


22-32 


71-80 


89-98 


114-122 


87.7 


5239 


5* nucleotidase precursor (' 


29-45 


62-71 


i 105-114 


125-137 


120.1 


5240 


B65G qene product CB. sub 


102-111 
Table 4



ORF Antig enic 1 Regions (cont) 



Region J Regi on 6 ] Region 7 ' Region 8 ! Re gio n 9 Reg ion 1 0 



168_6 1 244-272 303-315 | 


! i '' 


238_1 i 260-269 291-301 j 


308-317 ! ; 


51^2 140-152 188-208 ! 


211-220 j 256-266 i 273-283 ^ 


278_3 1 198-209 I 


i i 


276_2 i 255-268 1 


1 


! i 


45_4 i 177-199 : 221-230 1 


234-243 


258-279 ! 284-293 ! 304-313 


316.8 , j 


1 




154.15 \ 148-157 177-187 


202-211 


! 


1 


228.3 j 10M19 . 139-154 


166-181 






228_6 1 






i 




50_1 








i 




112_7 


136-149 197-211 


218-229 


253-273 


i 




442.1 


199-210 : 247-257 


264-277 


287-309 






66.2 












304.2 


178-187 250-259 










44.1 


• 










161.4 


■ 










46.5 


131-141 ' 162-176 


206-215 


243-252 i 264-273 


285-294 


942_1 












5.4 


189-205 230-239 i 246-264 


301-318 


340-354 


378-387 


20.4 


202-212 217-234 


260-275 


314-336 


366-373 


380-391 


328_2 












520.2 












771_1 i 145-154 ! 








999.1 i ' 






j 


853.1 1 




! 1 


287.1 1 154-164 




! 1 


288.2 1 




i 




596.2 121-130 








217.5 244-253 259-268 


288-297 


302-311 i 




217.6 i 144-158 174-183 


188-197 


207-216 


! 226-242 




528_3 










171.11 






i 1 


63.4 i 








353.2 1 








74J.1 ' 197-207 






i 


342.4 ; 




j 


69.3 195-211 




1 


70.6 206-215 263-272 i 291-301 


i 331-340 358-371 390-414 


129 2 117-127 141-157 = 168-183 • 202-211 222-231 261-270 


58.5 184-203 260-269 275-299 : 330-344 372-381 424-433 


188_3 1 i ; 


236_6 138-147 163-172 : 187-198 i 244-261 268-278 308-317 


310.8 131-140 144-153 \ 177-186 ! 190-199 


i 204-213 216-227 


601 _1 208-218 ! 




544 3 170-179 184-193 ! 224-235 i 274-287 327-336 352-361 


662 1 ; 


87.7 i 1 


120 1 i 
Table 4



ORF ! Antigenic 


Regions |(cont) ! 


Kegion 1 1 Region 1 Z 


Region 1 3 1 Region 1 4 . Region 1 5 1 Region 1 6 


168^6 1 1 


1 i 


238.1 1 




51-2 i 1 


i 


278_3 1 1 


: 1 


276.2 i 


i i 


45.4 1 


i i 1 




315.8 


! : ' 


1 54.1 5 1 i 




I 




228.3 1 i 








228.6 






1 

! 




50.1 


1 




1 




112_7 










442.1 






1 




66.2 


j 




1 




304.2 










44.1 






1 




161.4 








46.5 


306-315 ; 








942.1 


; 




1 

! 




5.4 


393-407 ! 416-426 


456-465 


1 




20.4 


396-405 : 410-419 


461-481 






328_2 


i 




1 




520.2 






! 

.. i ^ 




771.1 


! 








999.1 










853.1 










287.1 






: i 


288.2 


1 

I 




i i 

: i 


596.2 


* 


\ • ! 


217.5 


1 




1 


217.6 1 








528.3 i 








i7i_n ; 








63.4 1 








353.2 1 








743-.1 ! 






342.4 1 








69.3 ! 


i 




70.6 ! 453-471 506-515 i ! 


129.2 1 296-315 i 1 


58.5 ^ i : 


188.3 ' ! i i 


236.6 ' 358-377 410-423 i 428-439 


1442-457 467-476 '480-493 


310.8 : 238-251 256-275 i 281-290 


1296-310 314-333 3^8-347 


601.1 ! ; 


! 1 


544.3 1 






662.1 ' ! 






87.7 ! i 






120.1 ' 
Table 4



ORF 




Antigenic] Regions 


icont) 








Region 1 7 


Region 1 8 i 


Region 19 


Region 20 1 


Region 21 


Region 22 


1 68^6 i 




j 




1 






238^1 • ' i 1 


5U2 ^ 1 ! i . 


278_3 i 1 i i ' 


276.2 , 


t : 


45^4 1 


, ^ , 


316_8 i 




154_.15 1 


! 1 




228J 


1 






228_6 ! 




! 


50.1 










112.7 












44Z_1 










66^2 


i 










304_2 




» 






44_1 




1 






161.4 








46_5 


^ - 


; 


1 


942.1 




1 




5_4 




■ 


i 


20.4 i 




! 


328.2 j 


1 


j 


520.2 1 


^ , 






771^1 ! 








999.1 " 








B53.1 1 






287_1 i 






288.2 " 






596.2 i 






217_5 






217.6 


• 




528.3 1 




171.11 • 


. 1 


63_4 1 


I'll 


353.2 




III! 


743-1 




: : i i 


342.4 1 ; • 1 i 


69^3 




J 

! _ • 




I 




70.6 














129 2 . ! ! 


58 5 i i ' 


188 3 i 


236_6 ; * 


310.8 


357-366 


370-379 


; 429-438 


443-452 


.478-487 


551-560 


601.1 














544.3 














662 1 


87 7 


120 1 
Table 4






ORF i Antigenic 


Regions ! 


(cont) 






i Region 23 Region 24 ! 


Region 25 ! 


Region 26 


Region 27 ! 


Region 28 


168^6 : 


1 




! 




238.1 1 


! 




i 




5i_2 ; \ 


1 




1 




278_3 i 1 






1 




276.2 ! ! 










45_4 1 . i i : i 


3 1 6^8 






154_15 i : 




: i 


228_3 




1 




228_6 I 








. 50.1 1 




1 
1 




1 1 2.7 1 




i 




442.1 1 




I 




66.2 : 1 




1 




304.2 ! 




1 




44.1 i 








161.4 




1 




46.5 ! 








942.1 i 












1 




20_4 ! i 










328.2 ! 










520.2 j 




1 




771.1 i 








999.1 ! 








853_1 ! t 






287.1 ! 








288.2 


1 




596.2 : 




217.5 1 










217.6 1 






528.3 ! 






171.11 : ! 






63.4 i 1 


i 




3S3_2 ! 1 


1 




743_1 : 


1 






342_4 








69_3 








70.6 






129.2 ! 


58.5 


188.3 










236.6 








i 


310-8 622-632 670-685 


1708-7 18 


823-836 


858-867 


;877-886 


601.1 ' i ! 


544 3 ' i • 1 


662_1 


87 7 1 I . 1 


120.1 
Table 4



ORF Antigenic; Regions (cont) 


Region 29 : 


Region 30 ; 




168_6 i i 


238.1 1 1 


51.2 1 i 


278.3 i i 


276.2 i i j 


45.4 i 1 ! 


316.8 


154.15 • ! 1 




228.3 ; 1 ! 




228_5 ! 




50.1 1 


■ 


112.7 






442.1 ; 






66.2 i 






304^2 






44.1 






161.4 






46.5 : 






942J 






5_4 ! 




20.4 1 






328.2 






520.2 . 






771.1 1 






999.1 






853.1 






287^1 






288.2 






596.2 { 




217.5 1 


i 


217.6 t ! 




528.3 1 


1 


171.11 i ! 




63.4 1 1 


i 


353.2 . ! 1 




743-.1 ! 




342.4 i ! 




6913 




70.6 i 


129.2 ! 


58.5 


1 88.3 


235.6 i 


310.8 


601.1 


544.3 






662.1 


87.7 


1 20.1 
Table 4



ORF 




BLAST 


Antigenic! Regions 


< 








HOMOLOG 


Region 1 


Region 2 


Region 3 


fAcgion «t 


46.1 


;5241 


aldehyde dehydrogenase 


8-17 


36-52 _i 


l3^6^_J^ 


1 1 7-1 71 


63_4 


S242 


glycerol ester hydrolase (P. 


9:26 ; 


57-73 


93-Vd7 


1 <L J- 1 O 0 


174.6 


5243 ketopantoate hydroxymeth 


71-80 


' 203-21 2 ' 


242-254 "T 




206.15.5244 


ornithine acetyl transferase 


1-10 


34-43 . .: 5J-63.._;. 


1 Gld.71 C\ 


267.1 


15245 


NaH-antiporter protein (E. ^ 


120-129 


332-347 


398-408 ; 




322_1 


i5246 


acriflavin resistance protein 


58-75 


153-164 


203-231 1 


264-284 


415.2 


5247 


transport ATP-binding prot< 


108-126 


218-227 J 


298-308 ; 


315-334 


214_3 


15248 


2-nitropropane dioxygenasc 


123-136 


216-233 


^ 283-292 i 


297-306 


587.3 


15249 


clumpinq factor 


5-14 


43-54 


59-68 X_ 


76-95 


685.1 


;5250 


signal peptidase 


59-68 


72-81 


, 86-95 ! 


99-108 


S4_3 


iS251 


fibronectin binding protein 1 


23-32 


37-46 


50-59 i 


89-98 


54_4 


"'5252 


fibrcnectin binding protein 1 


43-52 


66-75"" 


95-104 i 


_ 147-156 


54_S 


15253 


fibronectin binding protein 1 


49-60 


81-90 


• 




54.6 


15254 


fibronectin binding protein 1 


55-71 


82-97 


139-158_j 
96-105 i 


lA75:f8£ . 


328_1 


•5255 


lipoprotein (H. flu) 


11-20 


61-70 





Table 4 




ORF 




Antigenic; Regions 


(cont) 1 1 




Region 5 


Reqion 6 ! 


Region 7 


Region 8 I 


Region 9 


Region 10 


46_1 


215-242 i 


333-352 ! 


376-385 


416-432 1 


471-487 ! 




63^4 


145-154 ; 


191-202 1 


212-223 


245-265 : 


274-283 ■ 


291-300 


174 6 : i • 




1 




206.16 


239-259 1 


275-284 ! 




1 






267.1 




— . ._. ( 
! 










322.1 


298-319 


350-359 ; 










41S_2 


344-353 


3 71-380 


395-404 


456-465 


486-495 : 


Vs 18:5277.1 


214.3 


318:337 __ 


"365-375 










'537.3 i 


106-115 


142-151 


156-166 


173-182 


186-198 


204-213 


685_1 J 


113-122 


130-145 










54.3 


128-138 


185-194 


217-226 


251-260 


268-277 


295-305 


54.4 ^ 


175-188 


191-200 


203-212 


220-229 






54.5 














54_6 


2Z0-230 


287-304 


317-326 


344-353 


364-373 


378-387 


328.1 
Table 4



ORF 




1 Antigenic! Regions !(conO ' 








Region 1 1 


Region 1 2 ' 


Region 13 


Region 14 i 


Region 1 5 


Region 17 


46.1 i i 




63«4 


306-315 


: 319-328 


366-376 


395-420 


453-462 


_467-476 


174_6 ! i 








206_.16 : 1 


1 i * 


267_1 j 1 ! 


1 ! 


322.1 i i 




415_2 


539-555 






1 i 


214^3 1 ' 




■ ■ 1 1 

i 1 


587^3 


217-226 


278-287 


318-327 


332-342 


351-360 


_j 377-386 


685.1 1 








1 


S4.3 


. 316-325 


j 329-345 


355-372 " 


387^316 


4J^25 


..,138:^18 


54.4 ' ' 








54_5 1 1 










54.6 


1 396-407 


i 427-436 


514-531 


541-550 


569-578 


'lei 2-622 


328_1 ' • ' 


1 1 



Table 4 



ORF 1 


Antigenic! Regions j(cont> ' 




Region 18 Region 19 . Region 20 ! Region 21 : Region 22 ^ Region_ 23 


46.1 ' 






63_4 


485-500 513-525 


1 




174_6 






206_16 




t 




267.1 








322,1 

415.2 














214.3 










587.3 


396-405 1426-442 


459-470 


485-494 


505-514 .531-562 


685.1 


1 _ 








54.3 


455-462 :472-491 1517-536 




i 


54.4 


i 




1 


54.5 


1 1 




1 


54.6 


1639-548 673-681 1703-715 


723-732 


7^-760 [772:733. ... 

i 


328.1 


! ^ ! 
Table 4



ORF Antigenic Regions (cont) 


1 


Region 24 ! Region Z5 ! Region 26 Region 27 • 


Region 28 ! Region 29 


46_1 ^ . i 1 


1 


63_4 • i : i 






174.6 1 t 






206_16 : i 




i 

1 


267^1 i i 




322_1 




1 








21 4 J i 


1 _ _ 


587.3 567-578 •584-601 ;607-840 


844-854 


858-870 ; 877-886 


685.1 \ \ 




( 


54^3 1 i 




1 
1 


54^4 ! ; 




1 


54.5 1 ! 






54_6 793-802 |8n-826 834-848 


866-876 


893-903 j 907-9 18 


328.1 1 ! 







Table 4 




ORF 


! Antigenic 


Regions 


(cont) 




' Region 30 


Region 31 




46.1 








63.4 








174.6 


206.16 


1 

! 






267.1 1 ; j 


322.1 1 ^ 1 


415.2 




214^3 








587.3 


"^89-911 


927-936 




685.1 1 ! 




_54^1-_ 


! 






54.4 


i 






54_5 


i 






54.6 


925-944 


951-997 




328.1 1 1 1 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Human Genome Sciences, Inc. 

(B) STREET: 9410 Key West Avenue 

(C) CITY: Rockville 

(D) STATE: Maryland 

(E) COUNTRY: US 

(F) POSTAL CODE: 20850 

(ii) TITLE OF INVENTION: Staphylococcus aureus Polynucleotides and Sequences

(iii) NUMBER OF SEQUENCES: 5255

(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Diskette, 3.50 inch, 1.4 Mb storage

(B) COMPUTER: HP Vectra 486/33 

(C) OPERATING SYSTEM: MSDOS version 6.2 

(D) SOFTWARE: ASCII Text 


(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 


(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/009,861 

(B) FILING DATE: 05-JAN-1996

(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5895 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

TCCATTATGA AGTCACAAGT ACTATAAGCT GCGATGTTAC CAATGTTTTT TAAAATCCCA 60 

GTAATAAAAT CAAAAAATAA GTTAAATAAT GTATTCATTT TAA6TCCTCC TTAATAAAGa 120 

aaataGGTAA TAATGTAATA GCTTCTATTA TGATGCCTAA TTGAATGAAT TGGGCAAATG 180

GCTCTTTGAT GATAAGTGTG ATAATGAAAA GGGTTAAACT AACAATAATC GCATAATATT 240 

TTTTTCGTTT AATAAGTCGC ACAGGAATGG GCTTCTTTTT AGTTGCTGCA GGAGCATATA 300 

CTGAGATTAC ACCTAAAGAA ATAACTGTTA AAATAATCAT AATTAAAAAG TTAATATGAA 360 

AATTTACTAT TACTAAAGGT AAAAGTATAA ATAGTATAAT ACTTTCTACA TAACACCAAA 420 

AAGAAGAAGG TGCATGTGCa CCATGTGCAT GtCTTCTTAT TAAATAAAAT GTTAAATTCG 480 

TAATTAACGT AAACAGAAAA ATGTTTAAAA TATAGGCAAT AGTATACATA ACAATTAATT 540

TACCTATATT TTTAGCTAAG ACCTGCATCC CTAATCGTAC TTGCAAAAAT TGAATATGAT 600 

CTAAGTTATT TCTCTTTTGA AGATACGTGG CAAACTGGTC AATTTTATTA TCAAAATAAT 660 

TCAATTTTAC ACCACTCTCC TCACTGTCAT TATACGATTT AGTACAATCT TTTATCATTA 720 

TATTGCCTAA CTGTAGGAAA TAAATACTTA ACTGTTAAAT GTAATTTGTA TTTAATATTT 780 

TAACATAAAA AAATTTACAG TTAAGAATAA AAAACGACTA GTTAAGAAAA ATTGGAAAAT 840

AAATGCTTTT AGCATGTTTT AATATAACTA GATCACAGAG ATGTGATGGA AAATAGTTGA 900 

TGAGTTGTTT AATTTTAAGA ATTTTTATCT TAATTAAGGA AGGAGTGATT TCAATGGCAC 960 

AAGATATCAT TTCAACAATC GGTGACTTAG TAAAATGGAT TATCGACACA GTGAACAAAT 1020 

TCACTAAAAA ATAAGATGAA TAATTAATTA CTTTCATTGT AAATTTGTTA TCTTCGTATA 1080 

GTACTAAAAG TATGAGTTAT TAAGCCATCC CAACTTAATA ACCATGTAAA ATTAGCAAGT 1140 

GAGTAACATT TGCTAGTAGA GTTAGTTTCC TTGGACTCAG TGCTATGTAT TTTTCTTAAT 1200

TATCATTACA GATAATTATT TCTAGCATGT AAGCTATCGT AAACAACATC GATTTATCAT 1260 

TATTTGATAA ATAAAATTTT TTTCATAATT AATAACATCC CCAAAAATAG ATTGAAAAAA 1320

TAACTGTAAA ACATTCCCTT AATAATAAGT ATGGTCGTGA GCCCCTCCCA AGCTCGCGGC 1380 

CTTTTTTGTA ATGAAGAAGG GATGAGTTAA TCATCATTAT GAGACCCGCC GTTAAAATAT 1440 

TCATTTGCAA AGGGCGAAAT GGGTTCTTAC TGAGTTATCT ATTATAAAAA AATAAACATA 1560

GACTTATGAA AAATCTCTCA TAAATCTATG TTTAGTCATG aCATGTGTTA AATATTATTT 1620 

CGGGCGCTTC TTATTTATAC AAATCTAATT TAATACTTTT AAATACAGGT ATATTTTCgC 1680 

GTTGCTGTTC TACTTCATTT AAGTTTAAAT CTACAGTCAA AATATCTGCG GATTCATTTA 1740 

ATTCTCCAAC TAAATCTCCA TTTGGGTTTA TAACTATCGA ATGACCAGCA TATTCTGTGT 1800 

TACCATCGAA TCCAGTGCTA TTAGTTCCAA TGACAAACAT ATTATTTTCA ATTGCACGTG 1860 

CCTTTAGTAA TGAATGCCAA TGTTGAAGAC GTGACATAGG CCATTGCGCC ACATAAAATG 1920 

CAATTTTAGC ACCACTACGA GCAGGATATC TTAATAATTC TQGAAAACGT AAATCATAAC 1980

AGATAAGTTG GGTCACATAA GTACCGTCAG ACAATTGAAA GGGTTCAGCT ACGTATTCGC 2040 

CAGCGGTTAA AAATTCATGC TCTCTTAACA TAGGAACTAA ATGAACTTTG TCGTATTCaT 2100 

TAATCAGCTG GCCACTTTTA TTCACACTAA AAGCTGTATT AAATATTTGA TTGTTTCTAA 2160 

TGITAGAAAC TGACCCAGCT ACGATATCGA CTTTATATTT TTCAGCTAAA TOTTTAATAA 2220 

ATGAAAAACT TTGTCCTAGA TTATTATCTG CTTTTTCATT TAAATGCTCT AAATCATAGC 2280 

CATTATTCCA CATTTCAGGT AAAACGACTA CATCTACTTC AGCATTCATA TTTTTTTCGA 2340

ACCATTGCGT TATTTGAGTT TCATTTTTAG AACTATCTCC AAAAACAATC GGTAATTGAT 2400 

AAATTTGGAC TTTCATAACA TCACATCCTT GATAGATCTT ATATATAACT TACTAAAAGT 2460 

TATGTTGAAA CGCAAAAAAC GAGCACAAGA CATAAAATCA AAGTCCTAGG CTCTACAAAG 2520 

TTATATTGAC AGTAGTTGAT GGGGCCCCAA CATAGAGAAA TTGGAACACC AATTTCTACA 2580 

GACAATGCAA GTTGGGGTGG GCTCTAACAT AAAGAAATAC TTTTTCTTTA GAAATTAGTA 2640

TTTCTTATAC ATGAGTTTTA CTCATGTATT CCTATTCTTA AGTGCACATT AGCAGCGGCT 2700 

AATGTGTAAG AACTACTACA TAATGAATAA CTAATGATTC TTTATCATTT CTGTCCCATT 2760 

CCTAACAATA TATTGATTAT TTTTTTATTA CGAAACGATC TTCCACTGGA TTAAATGTTT 2820 

TTTCGCCAGC AGCTTCACGA ATATCACCAA ATGGCATTTG AGCAATAAGT TTCCAACTTT 2880 

TAGGAATATT AAATTCATTT GAAGTCATCT CATCAACAAG TGGATTATAG TGTTGTAATG 2940 

AAGCACCTAT GCCTTTAGTA GCTAATGCAG TCCAAATTGC AAATTGATGC ATGGCATTTG 3000 

TTTGAGTTGA CCATATTGCA AAATTATCAT AGTAGTTTGG CATTTGTTCT TGTAAACCAC 3060 

TTACAACATC TTGATCTTCA TAAAACAAAA TTGTACCGTA TGAATGTTTG AAGTTATCAA 3120

TTTTTTGTTC AGTTGGCTCG AAATCACGAT TCTCTCCCAT GACTTCTTTT AAAATTGCTT 3180 

TTGTGTTATC CCAAAATTTA TTATTGTTGT CATTTAACAA GAGAACAATT CTAGTTGATT 3240 

CATCGCTAAT TGATATCGAA TCTTTCAAAT TATATATTGA ACGTCTTTCT TCCATTGCAT 3360

TGTCAAAAGT CATTGCTTTT TTATCTTTTT TAAATAAGCC CATAATTATT GCTCCTTCTT 3420

TAGTAAAGAA TACTTAATAG ACTAAGTATA AAATTTATAC TCGTACTTGT AAAGCAATAT 3480

TTACGAAAAT TTCAAGAATA TTAATATTCA TTTTCAAATT CCAAATATAA ATGCATTTTC 3540

AACGCATATT TATTATACTT AGATTAATAC TTACATGAAA AAGGGAGGTG TCTCGTGAAA 3600

TGTCATATCA TTGGTTTAAG AAAATGTTAC TTTCAACAAG TATTTTAATT TTAAGTAGTA 3660

GTAGTTTAGG GCTTGCAACXS CACACAGTTG AAGCAAAGGA TAACTTAAAT GGAGAAAAAC 3720

CAACTACTAA TTTGAATCAT AATATAACTT CACCATCAGT AAATAGTGAA ATGAATAATA 3780

ATGAGACTGG GACACCTCAC GAATCAAATC AAACGGGTAA TGAAGGAACA GGTTCGAATA 3840

GTCGTGATGC TAATCCTGAT TCGAATAATG TGAAGCCAGA CTCAAACAAC CAAAACCCAA 3900

GTACAGATTC AAAACCAGAC CCAAATAACC AAAACTCAAG TCCGAATCCT AAACCAGATC 3960

CAGATAACCC GAAACCAAAA CCX3GATCCAA AACCAGACCC AGATAAACCA AAGCCAAATC 4020

CGGATCCAAA ACCAGATCCA GATAACCCGA AACCAAATCC AGATCCAAAA CCAGACCCAG 4080

ATAAACCAAA GCCAAATCCG GATCCAAAAC CAGATCCAGA TAAACCAAAG CCAAATCCGA 4140

ATCCAAAACC AGACCCTAAT AAGCCAAATC CTAACCCGTC ACCAGATCCC GATCAACCTG 4200

GGGATTCCAA TCATTCTGGT GGCTCGAAAA ATGGGGGGAC ATGGAACCCA AATGCTTCAG 4260

ATGGATCTAA TCAAGGTCAA TGGCAACCAA ATGGGAATCA AGGAAACTCA CAAAATCCTA 4320

CTGGTAATGA TTTTGTATCC CAACGATTTT TAGCCTTGGC AAATGGGGCT TACAAGTATA 4380

ATCCGTATAT TTTAAATCAA ATTAATAAGT TGGGCAAAGA TTATGGAGAA GTTACTGATG 4440

AAGACATTTA TAATATTATT CGAAAACAAa ATTTCAGCGG AAATGCATAT TTAAATGGAT 4500

TACMCAGCA ATCGAATTAC TTTAGATTCC aATATTTCAA TCCATTGAAA TCAGAAAGGT 4560

ACTATC6TAA TTTAGATGAA CAAGTACTCG CATTAATTAC TGGTGAAATT G6ATCAATGC 4620

CAGATTTGAA AAAGCCCGAA GATAAGCCGG ATTCAAAACA ACGCTCATTT GAACCGCATG 4680

AAAAAGACGA TTTTACAGTA GTTAAAAAAC AAGAAGATAA TAAGAAAAGT GCGTCAACTG 4740

CATATAGTAA AAGTTGGCTA GCAATTGTAT GTTCTATGAT GGTGGTATTT TCAATCATGC 4800

TATTCTTATT TGTAAAGCGA AATAAAAAGA AAAATAAAAA CGAATCACAG CGACGATAAT 4860

CCGTGTGTGA TTCGTTTTTT TTATTATGGA ATAAAAATGT GATATATAAA ATTCGCTTGT 4920

TCCGTGGCTT TTTTCAAAGC CTCAGGATTA AGTAATTGGA ATATAACGAC AAATCCGTTT 4980

TGTAACATAT GGATAATAAT TGGAACAGCA AGCCGTTTTG TCCAAACATA TGCTAATGAA 5040

AATATTAATG AACTTACTGT TGTAGCAATA ATAAATGCCA CGATACGATT ACCTTTAATC 5160

GCATTAAATA ATTCTCCAAA GATTACTTTT CTGAATACAT ATTCTTCTAA TAAAGGACCA 5220 

ATAATAGATA CAAAGAAGAT AAATATAGGT ATTTTTCGAG CAATAATAAT TAGCTTTTCT 5280 

GTATTAGGAC TTACTTGTTG TCCACCATAA ATTTGCGTTA ATACAATGCT CACTACCATT 5340 

TGATAAATCA TTACCAATGC AAATCCAAGC AATGCCCATG GAATGATATA TTTTTTAGGT 5400 

TCTTTAACTT CTAATTCTAA TTTTGTTGGA TTTTTAATTT TTAAATTAAT TAAAATAATC 5460 

GTCGTGGCGG CGATTAAAAA TAGAACAAGT TGTATGTAAA TGACTGCTTT AGTCAGTTCT 5520 

ATGCCACTAT ATTGTACAAA TGGTAATTTT TTTACAATGA GAAGCGGTAA AAATTGAGAC 5580

AA7ATATAAA TAATAACAGT TAGCAATGAT GCCCATAATC tTGTCATAAT TTTCCTCCAA 5640 

ATATTTGTTT ATAATTTATT TTATCGTAAA TAACTTGAAG TTACAAAACT TAATTAAAAG 5700 

GTTATGACTT GAAATTTTGA CCAAATTTGA TTATTATAAA TGTATGTTAG CACTCTTTAA 5760 

TGTTAAGTGC TAAACTTTAG GTTTTTTAAG GAGGAACAAT CATGCTAAAA CCAATTGGAA 5820 

ATCGTGTGAT TATTGAGAAA AAAGAACAAG AACAAACAAC TAAAAGTGGn ATTGTTTAAC 5880 

TGATAGTGCT AAAGA 5895
(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6796 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

TTT»AAAAA CAAGGTACGA TTGGTTTAAT AACATATATG AGAACCGATT CTACACGTAT 60 

TTCaGATACT GCCAAAGTTG AAGCAAAACA GTATATAACT GATAAATACG GTGAATCTTA 120 

CACTTCTAAA CGTAAAGCAT CAGGGAAACA AGGTGACCaA GATGCCCATG AGGCTATTAG 180 

ACCTTCAAGT ACTATGCGTA CGCCAGATGA TATGAAGTCA TTTTTGACGA AAGACCAATA 240 

CCGATTATAC AAATTAATTT GGGAACGATT TGTTGCTAGT CAAATGGCTC CAGCAATACT 300 

TGATACAGTC TCATTAGACA TAACACAAGG TGACATTAAA TTTAGAGCGA ATGGTCAAAC 360 

AATCAAGTTT AAAGGATTTA TGACACTTTA TGTAGAAACT AAAGATGATA GTGATAGCGA 420 

AAAGGAAAAT AAACTGCCTA AATTAGAGCA AGGTGATAAA GTCACAGCAA CTCAAATTGA 480 

ACCAGCTCAA CACTATACAC AACCACCTCC AAGATATACT GAGGCQAGAT TAGTAAAAAC 540 

AAAGCGTAAC TATGTCAAAT TAGAAAGTAA GCGTTTTGTT CCTACTGAGT TGGGAGAAAT 660

AGTTCATGAA CAAGTGAAAG AATACTTCCC AGAGATTATT GATGTGGAAT TCACAGTGAA 720

TATGGAAACG TTACTTGATA AGATTGCAGA AGGCGACATT ACATGGAGGA AAGTAATCGA 780

OGGTTTCTTT AGTAGCTTTA AACAAGATGT TGAACGTGCT GAAGAAGAGA TGGAAAAGAT 840

TGAAATCAAA GATGAGCCAG CCGGTGAAGA CTGTGAAATT TGTGGTTCTC CTATGGTTAT 900

AAAAATGGGA CGCTATGGTA AGTTCATGGC TTGCTCAAAC TTCCCGGATT GTCGTAATAC 960

AAAAGCGATA GTTAAGTCTA TTGGTGTTAA ATOTCCAAAA TGTAATGaTG GTGACGTCGT 1020

AGAAAGAAAA TCTAAAAAGA ATCXSTGTCTT TTATGGATGT TCGAAATATC CTGAATGCGA 1080

CTTTATCTCT TGGGATAAGC CGATTGGAAG AGATTGTCCA AAATGTAACC AATATCTTGT 1140

TGAAAATAAA AAAGGCAAGA CAACACAAGT AATATGTTCA AATTGCGATT ATAAAGAGGC 1200

AGCGCAGAAA TAATATTTTT ATTTCCTAGA TACATTTTAA GATTGTTAAA TAGAATCATT 1260

AGTGAATCTT ATTTTAAAGA TAGTAAAGGA TTAATCTAAA TAAGTGCGGA TAATATAAAC 1320

ATAACAACAT AATTAAmAGA CATAAATGAC aATAAAAGGA GTATAGAAAT GACTCAAACT 1380

GTAAATGTAA TAGGTGCTGG TCTTGCCGGT TCAGAAGCGG CATATCAATT AGCTGAAAGA 1440

GGAATTAAAG TTAATCTAAT AGAGATGAGA CCTGTTAAAC AAACACCAGC GCACCATACT 1500

GATAAATTTG CGGAACTTGT ATGTTCCAAT TCATTACGCG GAAATGCTTT AACTAATGGT 1560

GTGGGTGTTT TAAAAGAAGA AATGAGAAGA TTGAATTCTA TAATTATTGA AGCGGCTGAT 1620

AAGGCACGAG TTCCAGCTGG TGGTGCATTA GCAGTTGATA GACACGATTT TTCAGGTTAT 1680

ATTACTGAAA CACTTAAAAA TCATGAAAAT ATCACAGTTA TTAATGAAGA AATTAATGCC 1740

ATTCCAGATG GATACACAAT TATCGCAACA GGACCACTTA CTACAGAAAC CCTTGCGCAA 1800

GAAATAGTGG ACATTACTGG TAAAGATCAA CTTTATTTCT ATGATGCGGC TGCTCCAATT 1860

ATTGAAAAAG AATCTATTGA TATGGATAAA GTTTACTTAA AGTCCCGTTA TGATAAAGGT 1920

GAAGCTGCAT ATTTAAACTG TCCTATGACT GAGGATGAAT TTAATCGCTT TTATGATGCA 1980

GTATTAGAAG CTGAAGTTGC GCCTGTAAAT TCATTTGAAA AAGAAAAATA TTTCGAGGGT 2040

TGTATGCCTT TTGAAGTAAT GGCAGAACGC GGACGCAAGA CATTACTATT TGGACCAATG 2100

AAACCAGTAG GATTAGAAGA TCCAAAGACT GGGAAACGTC CTTATGCGGT GGTTCAATTA 2160

AGACAAGATG ACGCTGCTGG TACACTCTAC AATATTGTTG GCTTCCAAAC GCATTTAAAA 2220

TGGGGAGCTC AAAAAGAAGT CATTAAATTA ATTCCAGGCT TAGAAAATGT TGATATTGTT 2280

AGATATGGTG TGATGCATAG AAATACCTTC ATTAATTCAC CGGACGTATT AAACGAGAAA 2340

TATGTAGAAA GCGCAgcTAG CGGCTTAGTT GCAGGTATCA ATCTTGCGCA TAAAATATTA 2460

GGCAAGGGTG AGGTAGTATT TCCGAGAGAA ACAATGATTG GAAGTATGGC TTACTATATT 2520 

TCTCATGCTA AAAACAATAA GAATTTCCAA CCTATGAATG CTAACTTCGG GTTATTACCA 2580 

TCTTTAGAAA CTAGAATTAA AGATAAAAAA GAACGCTATG AAGCACAAGC TAATAGAGCT 2640 

TTGGATTACT TAGAAAATTT CAAAAAAACT TTATAAAATA GTTAGAAAGA CTAGATATGC 2700 

TATTCATTCT TAAGTCATCA ACGAGTAAGT AATGACTTTC TAAATGGAAA ATACTTATCC 2760 

TAGTCTTTTT AATTTTGGAA TTGTTACGTA TTTCTGACAA TTTAGAATTC GCATTCAAAA 2820 

AATATCTAAA TAAATAACAC GCAATAAGTT GATTGATGTA ACATGTAAGA GAATGTnTA 2880 

AATAAACTTT ATTTAAAAGG CAATGAAATA ATAAATGGCA AGGCTATTAA TAAAGACTTT 2940 

TAGTAATTAA TTTAAAAAAG AGGTATTCTA ATTAACAGGT TTTCCGAITA GTTACAATTA 3000 

TTTAATTCTC AAAAGATTTA GAATTGATTA TCAAATTACT GTAAGCCCTT TGCTGTATAT 3060 

GCTACAATTC TTATTGATGG AGGGTAAATG TATTGAATCA TATTCAAGAT GCGTTTTTAA 3120 

ATACATTGAA AGTTGAACGG AATTTTTCGG AACACACATT GAAATCATAT CAAGATGACT 3180 

TAATTCAGTT TAATCAATTT TTAGAACAAG AACATTTAGA GTTGAATACT TTTGAATACA 3240 

GAGATGCTAG AAATTATTTG AGCTATTTAT ATTCAAATCA TTTG7UJ\AGA ACATCTGTTT 3300 

CTCGTAAAAT CTCAACGTTA AGAACTTTCT ATGAATATTG GATGACGCTT GATGAGAACA 3360

TTATTAATCC ATTTGTTCAA TTAGTACATC CGAAAAAAGA AAAATATCTT CCGCAATTCT 3420 

TTTACGAAGA AGAAATGGAA GCGTTATTCA AAACTGTAGA AGAGGACACT TCAAAAAATT 3480 

TACGGGATCG AGTTATTCTT GAATTGTTGT ATGCTACAGG CATCCGTGTT TCGGAATTAG 3540 

TAAATATTAA AAAACAAGAT ATAGATTTTT ACGCGAATGG TGTTACCGTA TTAGGAAAAG 3600 

GGAqtAAAGA GCGCTTTGTA CCGTTTGGTG CTTATTGTAG ACAAAGCATC GAAAATTATT 3660 

TAGAACATTT CAAACCAATT CAGTCATGCA ATCATGATTT TCTTATTGTA AATATGAAGG 3720 

GTGAAGCAAT CACTGAACGC GGTGTACGAT ATGTTTTAAA TGATATTGTT AAACGAACAG 3780 

CAGGCGTAAG TGaGATTCAT CCCCACAAGC TCAGACATAC ATTTGCAACG CATTTATTGA 3840

ATCAAGGTGC AGACCTAAGA ACAGTACAAT CGTTATTAGG TCATGTTAAT TTGTCAACAA 3900 

CTGGTAAATA TACACACGTA TCTAACCAAC AATTAAGAAA AGTGTATCTA AATGCACATC 3960 

CTCGAGCGAA AAAGGAGAAT GAAACATGAG TAATACAACA TTACATGCAA CAACAATTTA 4020

TGCTGTAAGA CATAATGGGA AAGCAGCTAT GGCTGGAGAT GGGCAAGTAA CGCTTGGTCA 4080 

ACAAGTCATC ATGAAACAAA CGGCAAGAAA AGTGCX5ACGT TTATATGAAG GTAAAGTGTT 4140 

AT7ACAACAG TTTAGTGGTA ACTTAGAAAG AGCTGCTGTT GAATTGGCAC AAGAATGGCG 4260

AGGCGATAAA CAATTACGTC AATTAGAAGC TATGCTAATT GTAATGGATA AAGATGCTAT 4320 

TTTAGTTGTC AGTGGAACTG GCGAAGTTAT TGCTCCAGAT GATGACCTTA TCGCTATTGG 4380 

ATCAGGAGGC AACTACGCAT TAAGCGCAGG ACGTGCATTG AAAOGCCATG CATCGCATTT 4440 

GTCTGCTGAA GAAATGGCAT ATGAGAGCTT GAAAGTAGCG GCTGATATTT GTGTCTTTAC 4500 

CAACGATAAT ATTGTTGTC6 AAACACTATA ATAATCAGAG CACGATAAAT AATTACGAGC 4560 

AATTAATTTT AGTTAAAAGA CGGAGGAATG AAATTAATGG ATACAGCTGG AATAAGATTA 4620 

ACTCCAAAAG AAATCGTATC TAAATTAAAT GAATACATCG TTGGACAAAA TGATGCTAAA 4680

CGTAAAGTGG CAATTGCCCT ACGTAATCGA TACAGAAGAA GTTTATTAGA TGAGGAATCA 4740 

AAGCAAGAAA TTTCACCTAA AAATATTTTG ATGATTGGAC CAACCGGCGT TGGTAAAACT 4800 

GAAATTGCAA GAAGAATGGC CAAAGTTGTC GGCGCGCCAT TTATAAAAGT AGAAGCTACT 4860 

AAATTTACTG AGGTAGGTTA TGTAGGACGA GATGTTGAAA GTATGGTTAG AGATCTTGTT 4920 

GATGTTTCAG TAAGATTAGT CAAGGCGCAG AAAAAATCAT TGGTACAAGA TGAAGCAACA 4980 

GCTAAGGCCA ATGAAAAACT TGTTAAGTTA TTAGTTCCAA GTATGAAAAA GAAAGCGTCT 5040 

CAAACGAATA ATCCTTTAGA GTCACTTTTC GGAGGTGCAA TTCCAAATTT CGGACAAAAT 5100 

AACGAAGATG AAGAAGAACC ACCTACTGAG GAAATTAAAA CAAAACGTTC TGAAATTAAG 5160

AGACAGCTAG AAGAAGGCAA ACTTGAAAAA GAAAAGGT7VA GAATTAAAGT CGAACAAGAT 5220 

CCTGGTGCTT TAGGTATGCT AGGTACAAAT CAAAATCAGC AAATGCAAGA GAT6ATGAAT 5280 

CAATTAATGC CTAAAAAGAA AGTTGAGCGA GAAGTTGCTG TTGAGACGGC AAGGAAAATC 5340

TTAGCTGATA GTTATGCGGA TGAACTAATT GATCAAGAAA GCGCTAACCA AGAAGCGCTT 5400 

GAAtTAGCAG AACAAATGGG TATCATCTTT ATAGATGAAA TCGACAAAGT TGCGACGAAT 5460 

AATCATAATA GTGGTCAAGA TGTCTCAAGA CAAGGTGTTC AAAGAGATAT TTTACCTATA 5520 

CTTGAAGGTA GCGTTATTCA AACCAAATAT GGTACTGTGA ATACTGAACA TATGCTGTTT 5580

ATAGGTGCTG GAGCTTTCCA TGTATCTAAG CCGAGTGACT TGATACCAGA ATTGCAAGGT 5640

CXjTTTTCCGA TTAGAGTTGA ACTTGATAGT TTATCGGTAO AAGATTTTGT AAGAATTTTG 5700 

ACAGAACCAA AATTGTCATT AATTAAACAA TATGAAGCAT TGCTTCAAAC AGAAGAAGTT 5760 

ACTGTAAACT TTACCGATGA AGCAATTACT CGCTTAGCTG AGATTGCTTA TCAAGTAAAT 5820 

CAAGATACAG ACAACATTGG TGCACGTCGA CTTCATACAA TTTTAGAAAA GATGCTAGAA 5880 

GATTTATCAT TCGAAGCACC AAGTATGCCG AATGCAGTTG TAGATATTAC CCCACAATAT 5940 

AAATATACAA AAGGAGAAAA ATTCATGAGC TTATTATCTA AAACGAGAGA GTTAAACACG 6060

TTACTTCAAA AACACAAAGG TATTGCGGTT GATTTTAAAG ATGTAGCACA AACGATTAGT 6120 

AGCGTAACTG TAACAAATGT ATTTATTGTA TCGCGTCGAG GTAAAATTTT AGGATCGAGT 6180 

CTAAATGAAT TATTAAAAAG TCAAAGAATT ATTCAAATGT TGGAAGAAAG ACATATTCCA 6240 

AGTGAATATA CAGAACGATT AATGGAAGTT AAACAAACAG AATCAAATAT TGATATCX3AC 6300 

AATGTATTAA CAGTATTCCC ACCTGAAAAC AGAGAATTAT TCATAGATAG TCX3TACAACT 6360 

ATCTTCCCAA TTTTAGGTGG AGGGGAAAGA TTAGGTACAT TAGTACTTGG TCnAGTACAT 6420 

GATGATTTTA ATGaAAATGA TTTGGTACTA GGTGAATATG CTGCTACAGT TATTGGTATG 6480

GAAaTCTTAC GTGAGAAGCA TAGTGAAGTA GAAAnAGAAG CGCGCGATAA AGCTGCTATT 6540 

ACAATGGCAA TTAATTCATT ATCTTATTCT GAAAAAGAAG CX5ATTGAACA TATCTTTGAA 6600 

GAACTTGGCG GTACGGAAGG CCTATTAATC GCATCAAAAG TTGCAGATAG AGTTGGTATT 6660 

ACTAGATCTG TAATTGTAAA TGCACTACGT AAATTAGAAA GTGCTGGTGT AATTGAATCA 6720 

CGTTCTTTAG GAATGAAAGG TACTTTCATT AAAGTTAAAA AAGAAAAATT CTTAGATGAA 6780 

TTAGAAAAAA GTAAAT 6796 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2073 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:
ATCCTAAAAT TnAAAATTAT CACGCCTTTT GaACAGCTTT GTAACCaTCt GGACGATCAT 60 

kAAATTCCaA TGTAAATCCT GGTTTAAaGT TGATCTTTAA CCTTATTTAA AyCACCAATT 120 
GTACGTATAT TATGTTGTTT AGCAAAATCA CGTTTTACAG CTAAAGCATA CGTATTGTTA 180 

TACTTCATTG GTTTTAACAT AGTCATTTGA TATTTCTTTT CAAGACTTTG CTTAGCTTGT 240
TCATAAACTT TTTTCTCTTC TTTTGACTTC AATGGTTCTT TTGTTAATTC ACCTAAAACT 300 
GTTCCAGTAA ATTCTAAATA CCCATCTATA TCGTCAGATT TTAAAGCATT AAATAAAAAT 360 

GCTGTTTTGC CCATACCATC TTTCACTTCT ACAGTATTTT TGGTCTCTTC TTCTATTAAA 420
ATTTTATACA TATTTGTAAT AATCGATGGC TCGGAGCCAA GCTTTCCAGC TAACGTAATT 480 
TTATCACCTT TTTGTGCAAA CATAGGAATA GCGATAGCCA GTATAATAAT CATCACTATA 540 

TCAAATATAA TTGCCAATAA GGCTGCTGGA ATTGCACCTA ATAATATCAA CGATGCATTG 660
TTACGGTCTA TACCTAATAA AATTAAATCT CCTAGTCCX5C CTGCACCAAT TAATGCTGCT 720
AGTGTTGCTG TACCTATAAT TAATACCATA GCCGTTCTTA CACCAGCCAT TATAACAGGC 780
ATTGCTATCG GAAGTTCGAC TTTAGTTAAA CGTCTAAATG GTTTCATACC TATACCTTTA 840
GCCGCTTCAA TGAGTGATGG ATCAACTTCT TTAATTCCAG TATACGTATT CCTTAAAATT 900
GGTAACAACG CATACACTAC AAGTGCAATA ATTGCTGGCA CACGACCGAT ACCAAATAAA 960
GGAATCATTA AACCTAATAA TGCCAACGAT GGTATGGTTT GAAGAATTGC CGCAATATTC 1020
ATTACGATTT CAGATATCGT TTTAGTCTTC GTTAATAAAA TACCTAATGG TACCGCAATA 1080
GCAGTTGCAA TCAATAATGC GATAAATGAT ATTTGAATAT GTTCTATCAT TGTCGAAAAG 1140
AGTTGCCCCT TACGTTCACT CAATATGTCg AAAAAGTTAG TCATGTTGAG CTACCTCCTT 1200
TTTCTGGGAC AAATATTTGA AGATATCTTT CCTATCAATA ACATATTGAC CTACGCTATC 1260
TTCTTGCATG ACAATGACAC GCTCGCTCTC TGATAAAAGT TGATACAATA CTTCAATTGG 1320
TTGATTGTCA TAAACAATTG GATAAGCGCT CATAGATGTA ACCTCATCGA TTGGTTTCAT 1380
AATATCCAAG TCACGGATAA TTGCGTTCTC TTCAACACAT GGCX3CATCAT CTTCTAAATG 1440
ACTACCCATA AATTGTTTAA CAAATTCACT TTGAGGATTA TTTTTAAATC CTTCTGGTGT 1500
GTCAATTTGT TCAATATGCC CTTCATTCAA AAGACAAATC TTATCACCAA GTTTCATCGC 1560
CTCTTGAATA TCATGTGTAA CAAATATGAT TGTCTTCTTA ATTTTAGTTT GTAATTCAAT 1620
TAAATCATCT TGAAGTTTTT CTCGGCTGAT TGGGTCTAAT GCACTAAACG GTTCATCCAT 1680
TAAAATAACT GGTGGATCAG CTGCTAACGC ACGTATAACT CCTACACGTT GTCGTTGCCC 1740
CCCTGACAAT TCATCAGGTT TTCTGTTTTT ATATTTTTCA GGTTCTAATC CAACCATTTC 1800
AAGT&ATTCA TCTACTCTTT TATCTATATC TTTTTCTTTC CACTTTTTCA TTTGTGGCAC 1860
TTGTGCAAtA TTTTCTTTGa wTGTCaTATG TGGGAATAAT GCAATCTGCT GcAATACGTA 1920
TCCAATATCC CAACkCATTT CGTATACTGG ATAATCACTT ATTGGTTTAT CTTTAAAATA 1980
AATATAACCT TCACTTAAGT GAATGAGTCG ATTAATCATT TTTAATGTCG TAGTTTTTCC 2040
ACAACCTGAA GGTCCAATTA GCACAAAAAA TTC 2073



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 13321 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

ACTATTCTAG CTTCATCAGT TATCATATAT TCTTTGAAAC ACTTGTAAGA AAATATAATG 60

AGTATTTACT ACATAATGAT ATTTCAAATT AGAAAAAAGG AAGTTATGAT TTAATGGCCT 120

TGAGCCTATC ATAACTTCCT TTTATCATTT TATTGTTGTG TTGATGTTTC GATAACGTGG 180 

TACATCTTAT CAAACATCAA TTCGAAACCA TGCACCATGG CATCATGATA TTCTTTTTTC 240 

TnTGCTTGT ATTCTAAATT AGTAAATCGT CTTTCTTTTT CAACTAATGA ACGATAATAA 300 

AATAGCATTT GGGTGCCACC TGTTTCACGT TCAAAAAATT CTACCTCAAT GACATCTTGC 360 

GTTTCACTTA GTCCAGGCAT ACCGATAGTC ATCTTAACGT ATTCATCCAT AACTAAAGAT 420

TCATAAATGC CTTCAATCAC ATTTACTTTG CCATTACGTT GTTGATCTAC AATACGATAT 480 

TTACCGCCTT CTTTAACGTC CGCTTCAATC TCTTTATTCG TTCTGGCTGA TGTCATAAAC 540 

CATTGTTTCA ACAAATCTTT CTTTGTCCAA GCTTCGTATA CTAACTCTGG AGAAAATTTA 600

TAAAGCTTTT CAATTTCAAC TTCGACATGT TCATTCTCTA CATTAAATTT TGCCACTGTT 660 

GTCCACCCAC TTTCGCTCTT ACTTTTATTT TAACGTATTT TTGCTCAGTT CCAAACATAG 720 

ATGAXCATCA TTTTTAAAAG ATTAGCGTTA TACGGTGAGT ACAACATGAT CTGTTAATAT 780 

AACAAGCCAC CTTACTTGGC TACATCGATA TATTGTTAAG CATTAATGTT TCATTTCTTG 840 

ACTAGTGTTC TTTTTTAGCT TTGGAAAATT AAATAAAATC GCAATAAGTC CGCATACACC 900 

TAATAATATA GGATAAATGC TGTATGGGAA TAACATTAAC GGTGAAATAC CAGCTACACC 960 

AGCCGCTGaA ATGACTTGCG GGCTATATGG TAATAAACCT TGGAAGCAGC CTCCAAATAT 1020 

ATCAAGAATA CTTGCTGATT TCCTTGAATC TACATCATAT TCATCTGCAA TATTTTTAGC 1080

TAAAGGACCT GACATAATAA TAGAGATGGT GTTGTTTGCC GTGGCAATAT CTGCGACACT 1140 

TACC2AAACTA GCAATTCCTA ATTCTGCGCC ACGCTTTGAT TTCACTTTAG AGCGAACAAA 1200 

TTGCAACAAC CATTCAATAC CACCATTGTG TTGAATAATA CCQACTAAAC CACCAATTAG 1260 

CAACGCAATC ATAGCAATAT CITCCATGCT TATAATACCT TTGGACACTG CATCTAGTAG 1320 

CCCCATCCAA CCGAATGAAC CATCTATGAG ACCAATGATT CCGGCTAATA ATGTTCCGCC 1380 

AATCAATACG ATAATGACAT TTACACCTAA TAATGCTAAT ACCAATACTA AGATATACGG 1440 

TACAACTTTA ATTAGATTAT AATCATAGTt TTTAGCATGA TTTAAAGAAA TGCCATTCGT 1500 

TAAGAAATAC AGAATAATAA TCGTTAAAAT AGCACCTGGC AATACAATTT TAAAGTTTAC 1560

TCTGAATTTA TCTTTCATTT TCGTATGTTG TGTTCTAACC GCAGCAATTG TTGTATCTGA 1620 

AATCATTGAT AGATTATCGC CGAACATTGC ACCTCCAACA ACTGTAGCCa tTGCtAGCQC 1680

TCCTACAGAC GTCCCCATAG ATATAGAAAC AAACATACAA ATCACAAACA ATCCTACAAT 1800 

AATTAAATTT TCTGGGATTA ATGATAGTCC TAAATTAACT GTCGACTTTA CGCCACCCAT 1860 

TTTTTCAGCT GTATTTGAAA ATGCACCTGC TAAAATAAAA ATCAACATCA TTAAAACAAT 1920 

GTTTGAATGG CCTGCACCTT TCGTGAAGAC CTCAACTTTT TTAGCAAATG ATTCTTTTCG 1980 

ATTCATTAAT AACGCCACAA TTACCGTTAT CGTAATTGCA ACATTTAATG GCATTGAAGT 2040

AAAATCACCT GTGATAATAC CTACGCCTAA AAACAACGCC ACAAATAATA ACAAGGGGAA 2100 

TAATGCCCAA GCATTGCTCT TTTTATGTAC TTCCATCCTT TTTACCTGCT TTCCAATTAA 2160 

AAATACCTCT TTCTCACAAA CGATGAAGAA AGAGGTTTTC ATGT6CTTTA CCTGCTTATC 2220

TTCAAACCAT TACGGTTACT GGAATTGGCA CATTCGAGAT GTTGCCGAGG CTTCATAGGG 2280 

CCAGTCCCTC CACCTCTCTA GATAAGTGAT GCTTATTTAC GTTTACGTTA CAAGATAATC 2340 

CTTAGTACGT CAATCATAAA TTAATCAGGA GTCGTATAAT ATTTTTCATA AACAATCATT 2400 

GCTACTGTAA TAATAATCAA AACAATAATG CTAATAACAA GTAAAAGCCA CCATTTAAGC 2460 

ATTAATGCAA TAAAAATGAA CACGATAGAC ACACTTACTA ATATTAATGA TATGACTTTA 2520 

AATTGCTGAA CACGTTGCTT GGAGATGACT TTCAACTGTT TGTTTGATAG ACGCGTATTT 2580 

TTTATACTGA TTCCCAGTAT ATTTTCTAAT ATTTGAACCA ATACGATACT TATTGCAAAT 2640 

ATAATAATTG GTAAAACATC ATAGCTCCCT ATAGTTAATG TATAAATTAC AAATCCAATG 2700

TAAAGTAACC CTGAGACAAA GGATAAAAAG TATGCGACGT ATTTGTTAAA CTTAATGATA 2760 

TGCTTTTTAA CGTTTTGATG TGTAAACCAT ACATTCXSAAA CGATCGCAAC TGCTACAAAT 2820 

AATGTGAATA CTATATATAA TGGTAATTTT TGTTCAGGAA AAACAGTCGC TATTCCAAAA 2880 

GCTAATGCTA AAATCAAAAA TAATATAGCT CTAGATACTA TTAATGCCAT AATAACAACC 2940 

CCTTTGTTTA ATATCGAGTT TGCAAATTTA CGTTTATCAG CGTTTCTATG ATCAGTACTT 3000 

CTACGGGTAG CGTTTCTATG TAATTTACAT CATCTTAACA TATAAATACT TCGCTATTTA 3060 

ATTGAAAACA TATCCTATTA TTCTTTGTCC GTTCTGACGT TTAATATCTA GCCTTAGGCA 3120

TTTCACTTGT TAATGAATTT AACTTTCTTC CACTAACCGT CCCTAAACCC AATCCCGCAA 3180

CAGTTTTTAA CTTTTTCGTT GTTGTCCTGA CATCCTCATT AAGAAAGTTT ATTCTGCTTA 3240 

AAACTTATAA TCCACACCCT GAGCAAACGC TCCTTATGAC AGAGTATTAA AATAAGCCGA 3300 

TAAAGATACA CACCTTTACC GACTATTTAA AATACACTTC ACCAATTCAT TTTAATTTAA 3360 

TGGATTGAAG TAACTAAATT AATATTATGT TGTTCAATTA AAAGCTTCAT ACAAACCTAA 3420 

TCTATTTGCA CTCCACCGCT AACACOGAAC ACTTGTCCGG TTGTATAACT TGATTCTTCT 3480

CAGCTATACA ACTCAGACCT TGTCCCACTT CAGCAACAAG CCCAGTCGCA AGTGGTAAGT 5400

CTAATATTCT AATATGATGT TCTGTTAAAA TATCTTTAAT TTTTTGTGTA GTCTTAAATT 5460

CTTTATCGGA TAGTTCTGGA AATTGATGAA AATACCTTCT CCAGGTAACA GCTTGATCTT 5520

TTAATCCCAT CGGTCATTCC CCTTCCTTAA GTCAATGATA TGTTGTCTAC CCTACGATGA 5580

TCATCTTTGA CTATTAAACG ATGATTTCAC AACAATGTAC TCTTGTTAAT TGCTTTCGTT 5640

AATGATA6AC AGTTGTTTAA TAATATCGTA ACACTGTTGT CAAACTATTC TAACTTTTAT 5700

AATTGAGACT CTATACAAAA ACGTGTTCTC GAATATACTT GTTTTTACAA ACCACAAAAA 5760

GCTCTAAACA TTAGTTTAAA CCAATGCTTA GAGCTTTCTA ATTATTTTAT GCTTTAAAAG 5820

ATACTGTGTT ATCTACGATG ACCTTACCGT CTTTAATAAC TTTTTCTGCG TGATTGATAC 5880

CAAAATGATA TGGAATATAT TCATGATTTG GTGCATCCCA AATTACTAAA TTAGCCTTAT 5940

CACCTGTGTT AATTGTACCC GCGTTAATGT CTATTGCTTT AGCAGCATTG ACCGTAACAG 6000

CATTCCAAAC TTCATTAGGT GATAGCTTTA ATTTCAAGGC TGCAATCGCC ATAACAAGTT 6060

GTAAGTTGTT TGTGACACTA CTACCAGGGT TATAATCAGT TGCTAATGCA ATCGCACCGT 6120

TATTGTCAAG CATGCCTCTT GCATCTGCAT AATCTTCTTT ACCTAAATAG AACX5TCGTTG 6180

CAGGTAAGAG GACAGCTACA GTATCACTAT TTCGCAACTT TTCTTTTCCT TTATCACTAG 6240

AAGCTACTAA GTGGTCTGCT GATATTGCTT GTTCATCAAT TGCTAATTCC AGTCCGCCTA 6300

ACGGATCAAT TTCATCCGCA TGTATTTTCA CTTTAAAACC TGCTTCTTTG GCTTTTTGCA 6360

TATAATGTTG CGATTGTTCT ATTGTAAATA CACCTGTTTC ACAGAAAATA TCCGCAAAGT 6420

CTGCATATTG TTTTACTTCC GGAAGTAACG CAATCATTTC TTCTAAAAAT GCCTCATTTG 6480

AACTTGCCTC TTTAGGTACA GCATGAGGCC CTAGGAAAGT ATGTTTCATG TCTAAATCAT 6540

ATTTCTCAGC TAAACGATTA GACACTTTCA ATTGCTTCAG TTCATTTTCT CTATCTAATC 6600

CATAACCACT CTTACTTTCA ACTGCAAGCA CGCCGTGTTT AATCATAGTA AGCAAATCAT 6660

GCTCTGCTTT TTTAAACAAG TCATCTTCGG ATGTTTCTCT AGTAGCATTA ACGGTAGATA 6720

ATATGCCACC ACCCATTTCT AATATTTCAA GGTAAGACTT ACCTTGACGT TTTAATGACA 6780

TCTCATGTTC TCGAGATCCA CCAAATGTTA AATGGGTATG TGCATCTACT AATGCTGGGG 6840

ACACTACCTT CCCACTAGCA TCAATCGTCT CAGTCGCATC GTAGTCATCT GTATGTGTTC 6900

CAGCATATAC AATTTTGCCA TCTTTAATGA CAACTGTACC ATTTTTCACA ACATTTAATT 6960

CATCTAATTC CTTACCCTTC AAAGGTTTAT CT6TTGATCT CGGTAAAATT AATTCTGCTA 7020

TATGATTAAT TATTAAATCA TTCATTACTT ATCACCTGCT TTATCAATCA TTGGAATATG 7080

AACACCCATA CCTGGGTCAG TCX3TCAATAC ACGTTCCAAT CTTCTTTCAG CACGCTCTGA 7200

TCCATCTGCT ACAACAACCA TACCCGCATG AAGTGAATAT CCCATGCCAA CACCGCCACC 7260

GTGATGGAAT GAAATCCATG AACCACCTGC AGCTGTGTTA ATGAGTGCAT TCAATACAGC 7320

CCAATCACCA ACCGCGTCAC TACCATCTTT CATACTTTCT GTTTCACGGT TAGGACTAGC 7380

AACTGAACCA GCATCTAAAT GGTCTCX3TCC AATAACAATT GGTGCTGAAA TTTCACCGTC 7440

ACGTACAAGA CGATTTAAAG CTAAGCCCAT TTTCGCTCTT TCTCCATAGC CTAACCAAGC 7500

AATACGTGAT GGTAGTCCTT GATATGAAAT TTTTTCTTCA GCTAAATCAA GCCATCTTAA 7560

TAACTTTTCA TTTTCTGGGA AAAGTTTGCG CATTTCTTCA TCCGCACGCT CGATATCTTT 7620

TGGATCACCA CTCAACGCAG CAAAGCGGAA TGGCCCTTTA CCTTCACAGA ATAATGGTCT 7680

AATGTAAGCT GGTACAAAGC CTGGGAAGTC AAAAGCATTT TTCACTCCGT TATTGAAGGC 7740

TACTTGACGA ATATTGTTAC CATAATCAAA TGCTACAGCG CCACGTTTTT GGAATTCAAG 7800

CATTAATTCA ACATGCTTTG CCATTGAAGC TTGTGACAGT TCAACATATT TTTTCGGATC 7860

TTTTTCACGC AATACTTTCG CTTCTTCTAC AGAGTATCCT TGTGGCACAT ATCCATTTAG 7920

CGGATCATGT GCACTTGTTT GGTCAGTAAT AATGTCAATT TTAAATCCTT TTTCTAGAAT 7980

CGCTTGATGG ATGTCTACAG CATTTCCAAC TAACCCGATT GATAATCCTT CTCCACGTTC 8040

TTTCGCCTCT TCTGCTAATT TTAATGCTTC ATCTAAATCA GCTGTTTTAA CATCACAGTA 8100

TTTCX3TATCA ATTCGCTTAT CAACACGTGT TTCATCAACA TCCACGCAAA TTGCTACCCC 8160

ATGATTCATA GTAATTGCTA ACGGTTGCGC ACCACCCATA CCACCTAAAC CTGCTGTCAG 8220

TGTAACAGTG CCTGCTAAAT CTCCATTAAA GTGTTGATTA CCTAGCTCGG CAAATGTCTC 8280

ATAAGTACCT TGCACAATAC CTTGAGAACC AATATATATC CAACTACCGG CTGTCATCTG 8340

TCCMACATG ATTAAACCTT TTTTATCTAA TTCATTAAAA TGATCCCAGT TTGCCCATTC 8400

AGGCACTAAT ACTGAATTTG AAATTAATAC ACGTGGCGCT TCTTCATGTG TTTTAAATAC 8460

AGCAACTGGC TTTCCTGATT GTACTAACAT TGTCTCATCT GATTCTAATT CTCGTAACGT 8520

TTTCTCTATT GCTTCAAAAG CTTCCCAATT ACGTGCTGCT TTTCCAATAC CACCATAAAC 8580

AACTAAATCT TCTGGTCTTT CAGCAACTTC TGGGTCTAAA TTGTTGTATA ACATTCTAAG 8640

TACTGCTTCT TGTTCCCAAC CTTTACACTC AATACTCAAA CCTTTTTTTG CTTGAATTTT 8700

TCTCATAAAA TTCGCTCCTG TTCTTTTAAG AAGTTAATTC CACTAAATTT AAAACGCTTA 8760

CATTATTATC TTCAATATTC ATTATAGTAT GTTAAAATAT AGCCAACAAA TATAAATAAA 8820

CTAATTATCC ATAGCTTGAA TCTATAAATA AAAGGAGGAA AACACATGAA AATTATTCAG 8880

CATATTAGCC AGCCATCTTT AACTGCTACG ATTAAAAAAA TGGAAGCAGA TTTAGGTTAT 9000

GACTTATTTA CACGTTCAAC AAAAGACATC AAGATTACCG AAAAAGGAAT ACAGTTTTAT 9060 

CX3TTATGCGA GCGAATTAGT TCAACAATAT CGATCCACGA TGGAAAAAAT GTATGATTTA 9120 

AGCGTTACA7 CAGAACCAAG GATAAAAATT GGGACTCTTG AATCTACGAA TCAATGGATT 9180 

GCGAATTTAA TTCGAAAGCA CCATTCCGAC TACCCTGAAC AGCAATATCG TTTATATGAA 9240

ATACATGATA AACATCAATC TATAGAGCAA TTACTGAATT TTAATATTCA TTTAGCTATA 9300 

ACAAATGAAA AAATAACCCA CGAAGATATA AGATCCATTC CTTTATATGA GGAATCTTAC 9360 

ATTTTATTAG CACCCAAGGA AACATTTAAA AATCAAAATT GGGTAGATGT TGAAAATTTG 9420 

CCACTCATAT TACCAAACAA AAATTCTCAA GTGCGCAAAC ACTTAOATGA CTATTTTAAT 9480 

AGAAGAAATA TTCGTCCAAA TGTCGTTGTA GAAACAGATC GATTCGAATC AGCAGTTGGA 9540

TTTGTTCATC TCGGCTTAGG TTACGCTATC ATTCCGAGAT TTTATTACCA ATCATTTCAC 9600 

ACGTCTAATT TAGAATATAA AAAAATTCGT CCAAACTTAG GCCGAAAAAT TTATATCAAT 9660 

TACCATAAAA AACGCAAACA CTCCGAACAA GTACATACAT TCGTACAACA ATGCCAAGAT 9720 

TATTTMkTG GACTTTTAGA GGCTCTTTAA CTTAAGTTAT TAGAGCCTCT TATGCAGTTG 9780 

CTCAGTCAAC TGTATACCTT TTGCCTTTAA CTTAAGTTAT TAGAGCCTCT TATGCAGTTG 9840 

CTCAGTCAAC TGTATACCTT TTGCCTTTAA CTTAAGTTAT TAGAGCCTCT TATGCAGTTG 9900

CTCAGTCAAC TGTATACCTT TTTCCTTTAA CTTAAGTTAT TAGAGCCTCT TATGCAGTTG 9960 

CTCAGTCAAC TGTATACCTT TTGCCTTTAA CTTAAGTTAT TAGTGCCTCT TATGTAGTTG 10020 

CGTAGTCAaC TGTaTACCTT TTGCCTTTAA CTTAAGTTAT TAGAGCCTCT TATGCAGTTG 10080

CGCAGATCAT CGTATAAAAA TTAATGACGT CATTTCAAAA ATCGATACAA AAATAATTTA 10140 

TTATAAAAAT TCTAAGAAAG AAGTGAAGCA GATGTTAAAA TCTATTAATC ATATATGCTT 10200 

TTCAGTCAGA AATTTAAACG ATTCAATACA TTTTTATAGA GATATTTTAC TTGGGAAATT 10260 

GCTATTGACT GGTAAAAAAA CTGCTTATTT TGAGCTTGCA GGCCTATGGA TTGCTTTAAA 10320 

TGAAGAAAAA GATATACCAC GTAATGAAAT TCACTTTTCA TATACACATA TAGCTTTCAC 10380

TATAGATGAC AGCGAATTTA AATATTGGCA TCAGAGGTTA AAAGATAATA ACGTGAATAT 10440 

TTTAGAAGGA AGAGTTAGAG ATATTAGAGA TAGACAATCA ATTTACTTTA CCGACCCTGA 10500

TGGTCATAAG CTAGAATTAC ATACTGGCAC ACTTGAGAAC AGATTAAATT ATTATAAAGA 10560

GGCTAAACCA CATATGACAT TTTACAAATA AGGTGTCATT ATAAAAAGGC CTCTTGAACT 10620 

CCGTTAAAAT TTTAATTAAT TATTATATAA TAAGAGAACT TTTCAAACAA TACAGTTGTT 10680 

TTACTGCAAT TATTTTTCAA ATATATCAAC GTTAATATAA CTTCTATTAA GAAATACTCA 10800

CATTCTGCCC TGCAATGCAA ATCTCGTCAC ATATAAATAT TTTTAATTAT TTTAAAAAAT 10860

GATGCACTAA ATTAGCAACG AGCTTAGCAG TTCTATTGTC AGCGTCATAT GTTGGATTCA 10920

TCTCAGCAAT ACTAACTGAA GACACCTTAT CACTTGGAAT AATACGTTTT GCTAATTCAA 10980

GAACAGTATG TGGATACAAA CCTAACACTG CCGGCGCACT TACCCCAGGC GCAAACGCAC 11040

TATCAATGAC ATCCATACAA ATCGTAAACA TAATGACATC ATGTTCATGT ACAAAACGTT 11100

CAATCATATC TTTAATTGTT GGTGATACGT GACTCAATAA TTCATCTGCA AAGACATAAT 11160

CAATCTTTTT CTCTTTAGCA TAATCAAATA AACTTTGCGT ATTACCACCT TGAGCAATAC 11220

CAAGCACTAA ATAATCTGTG TTTTCATCTT CTTCTAAAAT TTGTCTAAAG CTCGTTCCAG 11280

ATGTAGATTG TTGTTCAGCA CGTGTATCAA AATGCGCATC AATATTTATC ACACCAATAG 11340

ATTGTGTTGG ATAGACTTTA CGTGTTGCTA AATATTGAGC ATACGCAATA TCATGTCCAC 11400

CACCTAATAA AAATGTTTGT CTATGATTAG CAATTGACTT CGCTGCAAGC ATAGCAAATT 11460

CTTTTTGAGT ATCAATTAAT TCCTCATGAT CATGATAAAC ATTTCCGTAA TCGACTAAAG 11520

TTcACATTGA TTCAAATCCX5 GCAAACCTGC AAATGCTTGT TTAATCGCAT CTGGTCCTTC 11580

TTTTGCACCA ATGCGCCCCT TGTTTAAAGC AACACCTTTG TCAACAGCAT AGCCTAATAT 11640

ACCGACCCCT GATGGCATAC TACTCTTTTC CAGCTTAGAC AAATCTTCAA ATGTTACTGT 11700

TTGAAAATGT CTAAATTTTT TCGGGTCTGT TTCACTATCT AACCTTCCAG TCCATAAATT 11760

TGGTTCACCT TGCTTGTACA CAGCATTTCC CCCTCTTAT7 TATGTGGCTT ATTAACAATT 11820

AAAGTATAAC GTATAGGAAA TTTTGAATTC AATTCATAGT TAAATCCGTA TCTTAAAAAT 11880

ACTTATCTAC ATTACTTTTA CCCCTATTTT CTATGTAATA ACXyUlTACTT AGCTGATTTA 11940

TGTTAATAAA ATACGTCAAG ACTATTACAT TTTCATTAAT ATTGACATAG ACAATTTATC 12000

TCTCGGCTTG TAATATGTAT AATTGTTACT AAAAGATATT TTGCTTGTTA CCTAATGGAG 12060

GTTACATATA ATGAAGAACA ATAAAATTTC TGGTTTTCAA TGGGCAATGA CGATTTTCGT 12120

CTTCTTTGTC ATTACAATGG CGTTATCCAT TATGCTCAGA GATTTCCAGT CTATAATTGG 12180

TGTCAAACAC TTTATATTTG AAGTTACAGA TCTAGCACCA TTAATTGCTG CAATCATTTG 12240

TATACTCGTT TTCAAATATA AAAAGGTCCA ACTTGCAGGT TTAAAATTCT CAATCAGCCT 12300

GAAAGTAATT GAACGTCTAT TGCTAGCTTT AATTTTACCT TTAATTATTC TAATTATTGG 12360

TATGTACAGC TTTAATACAT TTGCAGATAG CTTTATTTTA TTACAATCAA CAGGCTTATC 12420

AGTACCTATT ACACACATTC TGATTGGACA TATTCTGATG GCGTTCGTAG TAGAATTCGG 12480

TGTTGTTGGT TTGATGTATT CAGTTTTCTC AGCAAATACA ACTTATGGTA CAGAATTTGC 12600

TGCTTATAAC TTCCTTTATA CATTCTCATT CTCTATGATT CTTGGTGAAT TAATTAGAGC 12660 

GACTAAAGGA CGTACAATTT ATATTGCAAC GACATTCCAT GCTTCAATGA CATTCGGACT 12720 

TATTTTCnTG TTTAGCGAAG AAATOGGCGA TCTATTTTCA ATCAAAGTCA TCGCCATTTC 12780 

AACAGCAATC GTTGCAGTAG GATACATTGG TTTAAGCTTA ATTATCCGAG GTATTGCATA 12840 

TTTAACAACA AGACGAAACC TTGAAGAACT TGAGCCTAAT AATTATTTAG ACCATGTCAA 12900 

TGACGATGAA GAAACTAATC ATACTGAGGC TGAAAAATCT TCTTCAAATA TTAAA6ATGC 12960 

TGAAAAAACA GGTGTAGCTA CTGCATCAAC GGTTGGTGTT GCTAAAAATG ATACTGAAAA 13020

TACAGTGGCT GACGAACCAA GCATTCATGA AGGTACTGAA AAAACAGAAC CTCAACATCA 13080 

CATAGGTAAT CAAACTGAAT CTAATCATGA TGAAGATCAt GACATCACTT CGGAGTCAGT 13140 

AGAATCAGCm GaATCAGTTA AACAAGCACC ACmAAGTGAC gATTTaACAA ACGATTCAAA 13200 

TGAAGATGAA ATAGAGCAAT CATTAnAAGA ACCTGCGACT TATAAAGAAG ACAGACGTnC 13260 

ATCAGTTGTA ATTGATGCAG AAAAACATAT CGAAAAAGCT GAAGAnCAAT CTTCAGATAA 13320 

A 13321 
(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 8549 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:
ATGTGTTGTA AACTTTTATG TTGAAAAAGC TACTTATCTC AATGAAAACA AGTAGCATTT 60 

AATAAATTAA TTAGTATACA GCTAGTTTTT CTAATTGTTC TTTAACTTGA ATTAAGTTTG 120 
ACCGTATTAG AGAGGCAGAT TGATCCATCG TTTGAATTGC TTGTCCTTCA TTTTCGTTCA 180 
AGCCATTACA AACAACTTCA AACTGTTGTG CCATTTGATC AAGACGCGCA TGAGCTTGTG 240
TGTTTAAAAT AAACATATCG TCATAATGTG ATGGCGAATA GATAATTCGT CGTTGTATAC 300 
AAAOGTATAA AAACCTTGTC ATATCAACGG TTTTGGCATT TTTAAACCTC TGTGTTTTCC 360 
ACGCATGTTT GCCCTTATTT AAATAATTTG CCCTTTTTTC GCCCCGAAAA AAAAACACAA 420 
AAAAATAACC ACACTCCTAA ATTAATAGGT GGTGTGGTTT TGTTGATTGT AGGGGTATAA 480 
AAATAACCGC ATTATTAAAG ATACGGTTAC TCTGTTATCT GTAAATATAA TAGTAGTTTA 540 
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10 



AAACAGGACT CCACATAAAA ATCAACTCCT TTATATACCA TAATGATACT ATATTTTCTA 660 

GTTTATTTCA ATTTTTCAGT TTTTAAAAAT GAGTTTCTGT TTTTATTTAT ACGCTTTTCT 720 

GTTTTCTTTT TAAATTTTAT CTTTTTGTTA TTCCATTCAT TGTAAAATTC TATTAAATTA 780

ACATAAAATT TTTCATGCCC TATTTTATTT GTTGATGAGA TATCAATGTA AAGACTCAAT 840 

ATTGTTTTTA AATAGATTTG ATGCAACGAC TGATAAACCG TATTACTATC TGCTATGTTA 900 

TTGGTAAAAT GCATAGAAAA ATATTCTAAT TTATTCATGC AATATATATG GGTTTCATTA 960 

TACTTCTTAA TGAGTCTATT TATACCTTGC AATACGTCAT TACTTTTAAT AACAATTTCT 1020 

TTTTCACCTG TCGAAAAAGT CCACTGTTTA TCTCCTATAT TTTCTTTAAT TGTTTTCTTG 1080

TTGTCAAATT CTAAAATTAT AGCCCGTAAA CACTCTTCTT TATAATTCTC GTTCTTGAAA 1140 

GTACGAAGCA AAATTTTTAT AAATTCGGTA TTGGTGACTT TTTTATAAGT GTGATATTTT 1200 

GCAATCTCTT TATCAGTAAA GACTGTTCTT AGTTCGTGAT TATCAAAACT TAAATTCATC 1260 

TTATTCTCTA ATTCATTAAT TTTATCTTGC AAACCAACAT TTTCTAAAAT TTTCTTGTTT 1320 

ATCTCCCCTA TATCAAAACT CCTTTTCGAA ATTAATTTTG AAAACTCGTC TGCCATTTCA 1380 

ACAGCCTTTT CTTTCCTTTT ATACCTTTTG TTAAATTTAT GAACCACCGT TGCAGCATAA 1440 

TACGATATCC CACCAGATAA AATAGATGaT ATTATCGGTA TGTATATATC ACCTTTCATA 1500 

TTTCCACCTC TTTTAACACA ATTAAGTATT ATGATACACA ACTTGOGCAA AAAGATGTAG 1560

ACAGAACATA ATGGCGAACA AAAACAACCA CCCAGTAACT AGTATGGGTG GCGTAgACTA 1620 

TAACAACTCT ATGTTATCAA GATATATGTA TCGAGTGATG GCAAGGAAGA AGTCTCCTGC 1680 

GGGACCAACA GTCAGATATA TGGCCTCTGC CGGGCTATAT AGTTCACTCC TACTATATAA 1740 

AAGTAAGTAT AACATAAAAA GCACCCCGTA AACTGTTATA CGGGAATGCT AAAGTCATAT 1800 

ATACTACGGG GAGTAGTATG AAAACTATGC TCTCTATCGT AAGAAAAAAC ACCCAGTGAC 1860 

ATGCTTGGGT GAACAAGGAT AGATGTAAAT AGTTGATGCA TGTGTAcACA TCATAACAAA 1920 

AAACTAGCCC GAAGcTAGCT ATAACATAAA AAAATAGGCA AGTACCGAAG TACCTGCCAG 1980 

TTACGCACAT TTAAATCTTG AGAGTAATGT TAAAAAGTGT ATAGGAATAT TAACATCCAT 2040

CCAAATAGTT ATTTAATAAC TGTAAGATTC CCTATAATTA ATGTAGCaAA ATTTTTATTC 2100 

TAAGTAAATA CTAAATCGTG CTAAACTTAC CAAAACTACT TATTCTATTA CCTGCCTTGT 2160 

CTACCTCTCC TGTCGCTATA TAACGACGTT GTCCACTATT AGCAATATAA GTAATCCATC 2220

TATAGCCATT QATGCAATAT GCGCCGTCAT ATTTAATTGT TGCGTTATTA GGTAATACTIC 2280 

CTGTAATTCT TGAATTAGTT GAATAGCCGT CCCTTACGTT ATTACCTTTA ACATTGGCAA 2340 
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CTGGCACTGG TGGATTTTTT TGGTTTTTAG CTGATGTTTT AACATTACCA GCTACCAAAC 24 60 

CACCTATAGG CTTACCATCA ATCGCACCGG CTATTAATTT AGAATACAAG TCATAGTTTT 2520 

TCTTAATCCA ATCCATATCA TTTTTATTAG TAATAAAACC TAATTCAGAT AAACGATAGT 2580 

TTATATTTAT TTCTGCTGAT ACATTAACGT TTAGTAAATC ATTACGAGGT GTTACACCTC 2640 

TTATTTGTCC TAAGTTATTT TTAATAACAT CTTGTATACT TTTATCAATA GTATCTGCAT 2700 

TGAATTGACT TGAAATAATA ACATGCCCAC CACTTGCACT TTCTCCTGCT GCGTCTAAAT 2760 

GAATCTCTAG AACAATGTCA TACCCATGTG ATTTAACCCA ATATAAGCCA TAATCTTTAT 2820 

TATTTCCTAC ATTAACACCG TAAGCAGTAT CTTGATACAT ATCTTGTGAT TGACTTGAGC 2880

CACCATATAA TGCAACTTCG TGACCTGCAT GTCTTAAATA CTTAGCGATA TTTGGTGTTA 2940 

TATATTTACG GATAAAATCA CGTTCATTTG TTCCGTTTCC GACTGCTCCA GGATCGTTAT 3000 

AACCATGACC G6CTACAAGC ATAATTTTTT TAGGTTTAAT TACTGCTTGC TTTTTGGCAG 3060 

TTGCTTGCTT AATAACGCTT TTAGCTTTAT CTCCAACACT TACTTTATCT GGGAAATTTA 3120 

ATCTAATAAA ATACATTGGG TCATCGTAAT AATGAACATG TCTTGTAACG GTTTCGGGAC 3180 

CCCAACCAGG TTGCGCAACG CCATTTGTCC AACCTTTACC ATTCCAATTT TGGCCAAACX3 3240 

ATGTGAAAGT GTTTAGATTA GCGCTCTCAA CAATTTCAAC ATGTCCaGct CCGCCACCAT 3300 

ACTTTGACGG GAAAACGACA ATGTCCAACT TTTGCGGTAA AAAGCTATCA TAGTTTTTAA 3360 

TTATTTGCCC GTATTTTTCA ATCCTTGCTT TATTATCAAA TGGAATATTA TAAGCGTATA 3420 

AACCTTGTAA CcTTTCGCCT GTTGCTATCA TAAAAAACAT ATTTGCGTAA TC6TAACACT 3480 

GAAATCCATA AAACAAATCA GGATTGAACT GCTTCCCTAA TGAATTATCA AACCATTTTT 3540

CTGCTTGGTT TTTTGTTATC AACATTGGTC AACACCTACC CTAAATCATT TCTGTCGTTC 3600 

ATATOrGTAG GTGTCATTAC TTCTTTAATT GGCGCTTGCC CTGTTGCTTT TCTATACTTG 3660 

TTTTCAGCTT TATATTTCTT TAGCTTTTGA TTTGCCCATT TACCTTCTTG AGATGTTGGA 3720 

TTATCTTTAT ATGTAGTATA TAAAGCAACA ACTGTTAAGA TAATCGATGA AACACTTTCT 3780 

TCATCTACTG GTATCGGACT TATACCTTTA TTCGCTAAAA ACTGATTGAC TAATGCTAAG 3840

ATCAATACGA TGTATCTTGT TATTACTTTT GCATCCATTT GTTTGCTCCT TTTATCCAAA 3900 

ATAAAAAGCC AGTGCCGAAG CACTGACTCT TAACTATTAC TTACACTTAC TAAACCAGAA 3960 

ACACGACCAA AAGCTATATC CTAAAATTCC CTTAAGCATG GTAATCACCT CCTTTAAATG 4020

CCAAAAATAG TTTTTAACAA GGCTATAACA AATGTACTTA GAATCGTCCC TATTAATCCT 4080 

AGAATCCACA TCTTGATGTC TCTAATATTT TTAGCATTTT TCTCTTTATT TTTTTCATCT 4140 
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TGCGTTCTCA GACTGTCTTC TATTCTGTCG AATTTTTCAA ACATAGTCTT ATCATTTTCT 4260 

TCTAATCGCG TTAAACGCCA ATCTTGTTCG TGTCGTTTGG TAAATCCAAA CATTACACCA 4320 

CCCACTTTAT TCAAATTAAA AAGCCATAAG ATTATAACCT ATGACTCTAG ATTTTCTGGA 4380 

TACTTTTCTC CTGTAATAAT TGCATATTCC TCTTTATCTA TAACTTCCAT ATCTACATAC 4440 

CACGCTATAT CTTCTTTACT ATATTCTTTC AATTGATACC ATGTTTTAAT ATCTTCGAAT 4500 

GTTGGTGAAA TTAATTTAAG CATTTTCAGT CTCTCCTTTA ACCTCTTCTA ATTTTTTATT 4560 

AAGTGTCACA AGTTGTTTTG CCATTAGTGC ATTTTGCTTA TTAACTTGCA TCGATAACTT 4620 

TGTACTTTGA ACAACTTGTT TCTGCATACT AGCAACCATT TTTCGTAAGA TGTCATCAGA 4680

AGCGACTGTG TTTTGTTCTT CACTGTCAAT CTGTTGATGC AAGTCATCTT TTTCTTCTGA 4740 

ATAATCTTC6 TTAAAAACTA TTTCCCCATT TGAATATTTA AAGGCTTTAG GTCTAAAAAC 4800 

TTGAGAGAAA TTTTCTGGTA AATTTTCAAT ATCAATACCT TCTTCAAAGC CACCAATGAT 4860 

AGCGTATGAA ATTATCTCAT TACGCTTGTT AACTAATATT TGCATTATTT TCTCACTCCT 4920

ATAATTTTGT TAATTGTCCC TCTATTTGCG TTCGCACCAG AGCCTCTTTG ACTTCCTAAG 4980

TCGAAATAGA CATCGTTTGA TATAGTTAAA GATGTACGAC TAGATTTAGT TAATCCAAAC 5040 

TCATAAACAC CTCCACCATT TCCATCACCA TCTGGAAGAT TTGAGGGATT CAATGAAATC 5100 

TTTCCTCCTC CAAAAGGACT GCCAAACTCT GTAAAGTCAC CACCTGGAAA AGTCCCATAA 5160

AAAATTAATA AAATAAATTG GTCTAAACTC TCATTTAAGT ACAATGTAGA GCCCACACCA 5220 

TTTGCTGTTC CATCAAAAAT AACCGAATAC CTTTTATTAA ACTTGTCATC TGCGTATAAT 5280 

TTAGCGTTAC TTTCGGCCAT ATTAGCTTTT GATTGGGCAC TTTGAACAGT TTCAAAAGGT 5340

GTATTGTAAT CATTAATAGC TAATTCTGAC CACTCAGACC ATGAACCCGC TTCTTTTCTT 5400 

TTA«aVAATA CTTTATTTGT ACCGTTCGGT CGATAAGTCA TACGCTTGTA ATCTGAAGTT 5460

ACTACTAAAT ATTCGACAGT ACCGTTAGTA CTAACACCTC TTGGATAATT TATAGCTTGC 5520 

GAAACATAAA TAAATTGGGT TGAATCACCT ATTCTTTGTT CTGGATTATT AAAATCAAAT 5580 

CCAGTAATCT GCATTATCTT ACCATCATCT TTAGTAATCT TAGCTTTTTG CCAATTTGAA 5640

GTAGAACCAC TTGTGACTAA ACCACCACTA TTCACTGACT GCTTGAAGGC TTCATGTTTC 5700 

TCATCCATAT ATCGCTTTTG CTCATCGAAT GTTCTTGAAT ATGCTTGCGC TTTATTTTCC 5760 

AAATCAGATA TATGGCTATT AGCAAGTTGC TTTAATTCAT CTATACTTGA AGATTTTGCT 5820

ATTTGAATAT CTGATAGACC TTTTTCTTTA GCTTTTTCAA TCAGACTCGC ATAATCTTCA 5880 

CCATTTTTTA TAGCCTCGTC CATTGCTTTC GCACGATCCA TAATAGTTTT TTCTAATTCC 5940 
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TCAACGTTAA ATGTGATAGT TCTCTCGACA ACTACCACGT CTGAATTACC TAATTCTGCA 6060 

ACCGAAACTT GAGCTTGATA ACTTCCATCT CGTTTAATTA CATCATTAGG TAATTGAAAT 6120 

TTTAAAATAC CTTTAAATGG ATCTAATATT TCTAGTGGAG CAACTACCAT GACTCCTTTA 6180 

CCTCGAATCG CTATTCGTGC kTTGATATTT tCTTCACTCA ATAATAACGG TTGATTATTT 6240 

TTAGTGATAT TAAAAAGAAG AACAGAAGAA TCACTCTCTC CTGTTCTAAA AGTTATATCT 6300 

AGATTTGAAA TATTTCCATA ATGCGCTGTG TTTTCTAAAT TTATAGCTAC AGATTTCTCT 6360 

AAATTACTCA TTAACTTATA ATTCTCCCTT CGTGTAAAGT CCATGGCCCT GAACTTGTTT 6420 

TACTATCATA ATTTTTCAAT AGTATCTCAG CAGATGCTGT AACACTATTA CGAACTAGCC 6480

TATGAACAAA GCCACCTGTG TTTGAAGCTT CTACATATAA GTTCCAACCA GCTACCCCTT 6540 

TACGTTCAGT TGGAAAATCT GTAAAACGTT TTGTATCATC CGTAGTTAAA TAAAACGACA 6600 

TGCCTACTAT GTTAATATCT GACATTTTTG TGATGAATGA AGGTACTCTC TCCCATTTAC 6660 

CACTATTTTT AGGCACATAA TTCCAGTCCG AAATGTCTCC AGTTCTTCCA GAAAGCACCC 6720 

TTTCAAAAGT CATCATATTC CTTGCATAAC TATTACGCGT CAATATCTGA ATTACATCAC 6780 

CGCCAGTTTG TGGTGGCTTA ACTTCCAAGA ACCAACCTGC ATCACGCCAT TCTCTTGGTA 6840 

ATGGGAAATC ATCGATTTGA ACTGTATGAT CAGTGTATAA ATAGTAAAGA CCTGGCTCTG 6900 

TTAACATCCC AAGATTCTTA AGTTTATCAG GCCTCATTGG TAAAGGTTTA ACTCTACCAC 6960

CTGTGTCACT CaTGATAAAA GGAACGCCTC TTGAGTGAAG TATTTCTAAA ATACCTCTTT 7020 

GCCCAATCAT GAAAATACGA TQTGTTCTAT TTCCaTCACC ACCGACAGTA ACACCTAGCA 7080 

TCAAAGCTTT TTTACCACTA TCTTTGTCAT AGTATATTTG CAAACCTTtC TgCTTCCGCA 7140

AATTCGCCAG GAAATGAATC tAgTGTTCCA CCATAGTCAG CATTAACCTG ATACGCTTCT 7200 

TCTCCTGTTT CTAAATCGAA AGCCGTTAAA TAGTTTCTAT TATTTGQATT ACTGTCTCCT 7260 

GTATACCAAT ACAAGTATTT TTCATCAAAA GTCACACCCT GCATTGGTTG GGTTTCGTTT 7320 

GTTAGTCTCA TAGGGATACT GATTTTATGC AAAACTTTAT CAATATTTTT ATCAACATCG 7380 

TCTAAACTTC TTATCTCTAT ATAAnTCATT GAGTTTTCAA GTTCCCACTG ACTTCTAGGT 7440 

CTCTCaATTC TGTATAGAAT TTTATTTTCT TTTTCATTTA TGACAGGGGT GATGTAGGGT 7500 

TTTTCTGGGT GTCCTGTAAA TACATCTTGC ATACCATACT TGCCATAGCT AATTTCCACA 7560 

TTAGGCGTAT ACTTGAAACG AACTAATGTA TTCTCATTAT TACCATTTAA GATAAAACTA 7620

TAAATCCATA ACTCATcATC AATATATCTA TAACCGTTAT GTGTACCATG ACCCCCACCT 7680 

ACAATCAATG AGCTGTCTAT AAATTGACCA TTAGGTCTTA GACGACTTAG CATATAGCCA 7740 
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ATTACTGCAT TTGTAAgAGG TGCAAGTTCT GTCACAAATA AAAATTCTTG CTTATCAGGT 7860 

TCAAAACGAT ACTCGATATC AAGAATTTCT TGTTTGGTCT TATTTAATTC TCTTATAGTT 7920 

TCCTCTTTAT TAATTTGAGT TTTGGTTTCC CAATCGTCTA AATGTTCTTT TAATGTGTCA 7980 

AAGGTTTCGC CGTTTACATT AACTCGAGCT TGAACAATCT CATTAGCACT GTTATTACGT 8040 

GGTGCCACAA CAAGTGCGTT AATTTGACTT TGTAAAGATT TOTTTACTGC TGCTTGCGAT 8100

CTACCATTAT AATAAATTTG CTCAGCGAAG TGTTGAATTG TTTTAGCTyT CTGATGCAAC 8160 

TTAAACTCTG TTGTCAAGCC AAGCGCAAAT TGCTCTATTC TTTGTAAGTT TTGTATTTCC 8220 

TTAGCTCTAT AATCTCGACC TGCTAAAGCT CCCAAATCCT TTATTAAATA CAAATTTTCC 8280

ATAATGCACC TTCCTTTCTA ATAAAATAGC ACTGTACCAA GTTTCCCACT ATCGTCAACT 8340 

GTTATTTTCC ACAATTTACC GTTTGGGGAT TTCTGTACAA TGCTATTTTG AATAATTgcC 8400 

TGctTCGCCT ATTTTTAAAT TATCTAATTT ATTTkTATCA TTTACCGAAA TGATACCGTC 8460 

TTGAGGCAAT CCATCAATAn CACTACTGCC TGCATAAGGT ATCCCATTTA TAGCTTTCCA 8520 

ATGTGTAGCT GGAAAGTACT GTTTATCGT 8549 

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3601 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

AGGCGTGTAG TGACTTACGG nTAGGAAACT ATGTATCCGA ATGATTTATT GAGACCAAAA 60 

AGGCATTAAA GTCCATTGAA ATATCnGGTA GCGmGTTGGT ACgTGGACGT GGGGGCCCTA 120 

GATGTATGAG TCAACCATTA TTCAGAGAGG ACATTTAACG TAATAAATTA TAGAmACGAG 180 

GGTGAAAATA ATGACAGAAA TTCAAAAACC GTATGATTTA AAAGGCAGAT CATTATTAAA 240 

AGAAAGTGAT TTTACCAAAG CAGAATTCGA AGGACTTATT GATTTTGCAA TTACATTAAA 300

AGAGTATAAG AAAAACGGTA TTAAGCATCA CTACTTATCT GGAAAAAATA TTGCACTACT 360 

ATTCGAAAAG AATTCGACGA GAACGCGTGC TGCGTTTACA GTTGCGTCTA TTGATTTAGG 420 

TGCGCATCCA GAATTTTTAG GAAAAAATGA TATTCAATTA GGCAAAAAAG AATCTGTAGA 480

GGATACTGCG AAAGTATTAG GTAGAATGTT CGATGGTATT GAATTCCGTG GTTTTTCACA 540 

ACAAGCTGTT GAAGATTTAG CGAAGTTCTC TGGTGTACCG GTOTGGAATG GATTAACAGA 600 
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TCTAGAAGGA ATAAACTTAA 


CTTACGTTGG AGATGGACGT AATAATATTG CGCATTCATT 720

AATGGTAGCA GGTGCTATGT TAGGTGTTAA TGTAAGAATT TGTACACCTA AATCATTAAA 780

TCCAAAAGAG GCATATGTTG ATATTGcAAA rGAAAAaGCG AGTCAaTATG GTGGTyCAGT 840

CATGATTACG GATAATATTG CAGArcCAGT TGAAAaTwCm GATGCTATAT ATmCAGATGT 900

TTGGGTATCG ATGGGTGAAG AAAGTGAATT TGAACAcGTA TTAATTTATT AAAAGACTAT 960

CAAGTGAATC AACAGATGTT TGATTTAACA GGTAAA6ATT CAACGATATT CTTACATTGT 1020

TTACCAGCAT TCCATGATAC AAATACACTT TATGGACAAG AAATTTATGA AAAATATGGA 1080

TTAGCTGAAA TGGAAGTTAC AGACCAAATC TTTAGAAGTG AACATTCAAA AGTGTTTGAT 1140

CAAGCTGAAA ATAGAATGCA TACAATTAAG GCAGTAATGG CAGCAACATT GGGGAGTTAA 1200

TCACTAAATG GAACGATATG AATATGATGT GTCTGATGAT ATAAGTGTCA TGTACAGACA 1260

CCTCATATTG GTATTAAAGG AGAAATGAAT ATGAACX^T CAGGAGATAA CAAACTCAGT 1320

AAATCTTCTT TAATTGGACT AGTTATAGGA TCCATGATTG GTGGCGGTGC GTTCAATATA 1380

ATGTCTGATA TGGGCGGTAA AGCCGGTGGA TTAGCCATTA TTATTGGTTG GATTATTACA 1440

GCTATAGGAA TGATTTCATT AGCGTTCGTA TTTCAAAATT TAACCAATGA ACGGCCGGAG 1500

CTAGACGGTG GTATTTATAG TTATGmTCAA GCAGGATTTG GCGATTTTGT AGGATTTATC 1560

AGTGmTTGGG GATATTGGTT CTGAGCGTTT TTAGGCAATG TTGCCTATGC AACACTATTG 1620

ATGTCAGCAG TAGGTAACTT TTTCCCGATT TTTAAAGGAG GCAACACATT ACCAAGTGTT 1680

ATTGTCGCCT CGTTACTACT CTGGGGTGTC CATTTCTTGA TTTTAAAAGG CGTTGAAACA 1740

GCAGCATTTA TCAATAGTAT TGTTACTGTT GGAAAGTTAA TACCGATTTT ACTTGTAATC 1800

ATATGCATGA TAATTGCATT CAATTTTGAC ACTTTTAAAA CAGGCTTTTT CAGTATGACG 1860

TCAGAGGGTG TATTGCCATT TAGTTGGGCG AGCACAATGA GCCaaGTtAA AAGTACGrTG 1920

CTAGTGACAG TTTGGGTGTT TATCGGTATC GAAGGTGCAG TAATTTTTTC TAGTAGAGCT 1980

nAAAATGAGA AAGATGTAGG TAGTGCCACG GTTATAGGAC TTATATCAGT TTTAATTATC 2040

TATyTCTTAT TAACTGTATT AGCTCAAGGC GTGATTTTGC AAAATCATAT TTCGCAATTA 2100

GATTCGCCAA GTATGGCACA GGTGCTTGCA ACTATTGTAG GTGGTTGGGG ATCTACACTT 2160

GTAAATATTG GTTTAATTAT TTCGGTACTA GGTGCATGGT TAGGATGGAC ACTGCTTGCT 2220

GGTGAATTAC CTTTCATTGT TGCAAAAGAT GGATTATTTC CAAAATGGTT TGCTAAAGAA 2280

AATAAAAATG GAGCACCTGT AAATGCACTG CTTATTACCA ATATATTAGT ACAATTATTT 2340

TTAATAAGTA TGCTATTTAC ACAGAGTGCG TATCAATTTG CATTTTCACT AGCATCAAGT 2400
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10 



CGACAGCAAG CPJ\CTACTPA ACAATGGACG ATTGGTATCA TAGCCTCAAT TTATGCTATA 2520

TGGCTTATAT ATGCAGCAGG TATCAATTAC TTATTATTGA CGATGTTACT TTATATTCCA 2580

GCTCTTCTTG TTTATACaAT CGkTCmAAAG rATwATCAGa CACGTTTGAT TAAATCAGrC 2640

TATATTCtTT TTATGATTAT tATCGTACTT GCAGTTATCG GGTTAATTAA GTTATTGATG 2700

GGAACGATAA ATGTTTTTTA AAAGGAGCGA CAAAAATATG AAAGAGAAAA TTGTCATTGC 2760

ATTAGGCGGT AATGCGATAC AGACAACAGA AGCAACAGCT GAAGCACAAC AAACAGCTAT 2820

TAGATGTGCG ATGCAAAACC TTAAACCTTT ATTTGATTCA CCAGCXSCGTA TTGTCATTTC 2880

ACATGGTAAT GGTCCACAAA TTGGAAGTTT ATTAATCCAA CAAGCTAAAT CXSAACAGTGA 2940

CACAACGCCG GCAATGCCAT TGGATACTTG TGGTGCAATG TCACAGGGTA TGATAGGCTA 3000

TTGGTTGGAA ACTGAAATCA ATCGCATTTT AACTGAAATG AATAGTGATA GAACTGTAGG 3060

CACAATCGTT ACACGTGTGG AAGTAGATAA AGATGATCCA CXSATTTGATa ACCCAACTAA 3120

AcCAaTTGGT CCimTATA CGAAAGAAGA AGTTGAAGAA TTACAAAAAG AACAGCCAGA 3180

CTCAGTCTTT aAAGAAGATG CAGGACX^TGG TTATAGAAAA GTAGTTGcGT CACCACTACC 3240

TC5ATCTATA CTAGAACACC AGTTAATTCG AACTTTAGCA GACGGTAAAA ATATTGTCAT 3300

TGCATGCGGT GGTGGCGGTA TTCCAGTTAT AAAAAAAGAA AATACCTATG AAGGTGTTGA 3360

AGCGGTTATA GATAAAGATT TTGCTAGTGA GAAATTAGCA ACGCTQATTG AAGCAGATAC 3420

CTTAATGATT CTTACGAATG TAGAAAATGT ATTTATTAAC TTTAATGAAC CTAATCAACA 3480

ACAAATCGAT GATATTGATG TAGCAACACT GAAAAAAtAC GCGGCACAAG GTAAGTTTGT 3540

GGAAGGATCG tGTTGCCAAA AATAGAAGCT GCGtACgtTT GTTGAaAGtG GGGaAACCAA 3600

A 3601



(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 573 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

CGACACTATT AAATGAATTA GAGCACAATC TAACAAATCA AATTCATTTT TCAAAAGATG 60 

AACGACTCAC ACATATCGCT TTAAAGTTAT TCGAAACAAC CGATCCTGTT TCAACAAAGC 120 

AACTTGCGCA AGATGTTAAT GTTTCGCGTC GGACAATTGC AGATGATATT AAAATGATTC 180 
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AGATCTCTTG CTATTTCTAA TTCAGTATCT GAAATATAAT GCTTTGTTAA ATTCTGAAGT 


1140 




TGTACACCTC TACCTGCCCA TCTTCCAGTA CCGGCACCGT AAAATTGAAA CAGACCTCTT 1200

ACCCGTTCAT CACTGCACAT C 1221




(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1090 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:








TTTTGTTTGG TATGAGGTAG CAATGACGAC GTGTCATTGG TGGAGATTGT AAAAATACAT 60

AATAAAAAGA AGCGGCAATG TATACCGCTC CTTmTATA CTACATACCG ATTTTCAACC 120

ATCTCTTTCT ACTTAGTAAT AAGACAATAG TATTAACTAT AAATAGAAGA ACGAAGAATG 180

ATACTATATT TATAATTTCA GTAGGACACA TAAATGTTGA CTCGTTATTC AATArrrrrr 240

CTACGGCACG ATACATCGTA TTGCTCGCCT CAAATGGAGC AACGATACCA AATATATTTT 300

TATTAATGGC AACTAAGATG ACTGAACCAA TCCAATATAC AATGCTGATA CCTAAGCTGA 360

TTAAAATGTT AGGTGAAACC ATACTAATCG TTCCAACAAC TAAGATATAT TGTAAGATAA 420

CGAGTGAAAA TAAGATTATT AATAGTAAGT AATGTGAGAA ATCCGAATAT ATAATTGAAA 480

TAATAGTGAT ACTTAGAATT ATGAACACTA AACATTCAAA AAATAACACT GCTACCTTTT 540

TATAGAAGAA GGTAAAGATA TTATCGCCAA TCAATTTATA AAACAGGATA TTTTTATTCG 600

AATACTCTTT ATTAATAAAA TATGCAATAA CAAATGAAAA TAGTAAGAAC CCTAATTGCG 660

TTGCAACAGT ATATGAACTG AAGAAAAACT GGCTATAGCT TAAACTTTTA ACTTTGTCTA 720

TACCTATTGG TAAAAAATAC CCAAGTAAGA AAAGGAATGT GAATAGCACA ACAAGCGTGT 780

AAATAATTTT ATTGGAAATA CTTTTTTTAA ATTCTAATTT CAAAGTGGAC ACCTCAATTA 840

TAAATTAATG TAATCATTTA TGACTTCTTC TTTTGATTGG TACTCTTCTA TTTGAAGGTC 900

TTTAAAAATA AAGTATTTAC CCGGCAAAGC ACTTAAATCG GATAAATTaT GTGTAATATT 960

GATAATAGTT TTAGTTTGAT GGCTTTGAAT AAAATCATTT AAAAATTCAT AAATTTCATT 1020

AACTGTTTTC TTGTCTAAAG CGTTTGTAAC TTCATCTAAT ATGATTAAAT CATGATCTTC 1080

CAATAAGAAA 1090



(2) INFORMATION FOR SEQ ID NO: 10: 

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IS 




(A) LENGTH: 904 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:
TTAGGACTAT TTTATCATAT TCATTTAAAT TACGGCTAAA AATTTTAAAA ACGGGGATTA 60
ATATATGGAA TTAAGCTATG AAAGTTAATT GATACTTGCA TTTTACGCTG ATTTATATAA 120 
GAATAACTAT TGTATAGTTT TAAAAACGAA CGTACGTTTG CAGGAGGCGA AATCATTGGC 180 
AATGAATAAA CAAAATAATT ATTCAGATGA TTCAATACAG GTTTTAGAGG GGTTAGAAGC 240 
AGTTCGTAAA AGACCTGGTA TGTATATTGG ATCAACTGAT AAACGGGGAT TACATCATCT 300 
AGTATATGAA ATTGTCGATA ACTCCGTCGA TGAAGTATTG AATGGTTACG GTAACGAAAT 360 
AGATGTAACA ATTAATAAAG ATGGTAGTAT TTCTATAGAA GATAATGGAC GTGGTATGCC 420 
AACAGGTATA CATAAATCAG GTAAACCGAC AGTCGAAGTT ATCTTTACTG TTTTACATGC 480 
AGGAGGTAAA TTTGGACAAG GCGGCTATAA AACTTCAGGT GGTCTTCACG GTGTTGGTGC 540
TTCAGTTGTA AATGCATTGA GTGAATGGCT TGAAGTTGAA ATCCATCGAG ATGGTAATAT 600 
ATATCATCAA AGTTTTAAAA ACGGTQGTTC GCCATCTTCT GGTTTAGTGA AAAAAGGTAA 660 
AACTAAGAAA ACAGGTACCA AAGTAACATT TAAACCTGAT GACACAATTT TTAAAGCATC 720 
TACATCATTT AATTTTGATG TTTTAAGTGA ACGACTACAA GAGTCTGCGT TCTTATTGAA 780 
AAATTTAAAA ATAACGCTTA ATGATTTACG CnwGGgTAAA GAGCGTCAAG AGCATTACCA 840 

TTATGAAGAA GGGAtCaAAG rGTTgTTAGT atGTCCAaTG ArGGAAAAGA AGTTTTGCCT 900 
GACG 904 
(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 11271 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

GATTTCTAAA TCAAGATCTG TTTTACGATA ACCATTCAAA CCTTGACGTT CATCTTCTTC 60 

AGGTTGATTT TGTTGCTGTG TGTCTTTGTT GTCAGAAGTC GCTACTGTTT TTTTATTATC 120 

TGTTTCTTTA GTCATAACAA ACGCCTCCGT TATAAAACGC TATATTTAAT GATATGTGAT 180 
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10 



TTAATAAGAC GATTCAGCAA GTTTTAAAGT ATTATTTGAC TATGTTGGAT TAGGCATCTA 300 

GTCCTATAAT ATCACTGACA TTGTCAAAAT GATGATCTTT TAAGTAACGT GCGATGCCTT 360 

TGTTCATTTT CTTAGTTAAA CCTGGGCCTT CAATAACAAG TGATGAATAA ATTTGAATAA 420 

GTGACGCACC GTGACGCATC ATTTTGATTG CATCTTCAGT ACTGAATACG CCGCCTGTAC 480 

CTATAATTAA AAATTCACCA TTTGTTTGCT GATAAgCATa CTTAATCAAT TTTAAATTAC 540 

GTTCAAATAA TGGACX3ACCA CTCAAACCGC CTTCTTCGAC TTTATTAGCA GAAGTTAAAC 600 

CATCTCGTTG TCGCX3TTGTG TTTGCTAAGA TGATACCGTC AAATGTCTCA GTAATCGCTG 660 

GTAATAGTGC TTTTAAGCCA TCGAAATCCA TATCAGACGT TAGTTTTAAA TAAATTGGCA 720

CTGTTACATC ATGTTGTTTT TTAAATGCTG TTAAAGCTTG GCATAACATT GAAAATTCAT 780 

CTTTATCATG GAAGTTTTGA AGATTTTCAG TATTTGGAGA ACTGATGTTG ACTGTGAAAA 840 

ATGAAACX5TC GTGTTTAAAC GTATCAATAA CCTTTATATA ATCTTGATAA CGCGCTTCAT 900 

AAGGTGTCAT TTTATTCACA CCAACATTGA TACCAACAGG TACTTGATAA GCATTTTTAC 960 

GCAAATGACT TAGTGCTTTG TTCATACCAA TATTATTGAA GCCCATTCGA TTTATCAAGG 1020 

CGTCATCTTC TAATAATCTA AACATGCGTG GTTGAGGGTT ACCCGGTTGA GGTTTAGGTG 1080 

TGATACCACC TAATTCTAAA GCACCGAATC CAAGGTGTTC CAATGCTTTT GGTACTTCGC 1140

AAGATTTGTC GAAACCAGCT GCTAAgCCAA TTGGATTGTC GTACGTATTA CCTTGTATCG 1200 

TTTGTGATAA CGTTGGATTC TTATAAGTAA ATAGTTTATC GACGACTGGG AATAAAACCG 1260 

GaAACTTTTG TaACXSTTTTT AATGCATCGA TAGTTAGTCC GTGTGCTTTT TCGGGTTCGA 1320 

TTTTGAATAA GAAAGGTTTA ATTAATTTGT ACATGAGTAT GCTCCTATTT CATTATATTT 1380

GAGGCTTACT ATCCTCAACT TAATATATGT GAAATATATT CTTTTAATAG ACTAGCATTT 1440 

CCAT&CATAA TTTCCTAGTT AAAACTAAAA AGTTTTGAAA ATTGACGCAA gTTTGAATAA 1500 

CGTTTTTAAG ATTAAATCAT CCTAATTAGG CAATATTATA GTATAAAGTA AGTAGATTGG 1560

AAGGTGTTTG TATGAATGAA CAATGGTTAG AGCATTTACC TTTAAAAGAT ATTAAAGAGA 1620 

TTTCACCAGT GAGTGGTGGT GATGTAAACG AAGCATATCG AGTCGAAACA GATACGGATA 1680 

CATTTTTCTT ACTTGTCCAA CGTGGACGTA AAGAATCATT TTATGCTGCA GAAATTGCAG 1740 

GTTTAAATGA ATTTGAACGT GCAGGTATCA CGGCACCTAG AGTAATTGCA AGTGGCGAGG 1800

TTAACGGTGA TGCGTATTTA GTGATGACGT ATTTAGAAGA AGGGGCTTCA GGGAGTCAAC 1860

GCCAATTAGG GCAACTCGTA GCTCAATTAC ACAGTCAGCA ACAAGAAGAA GGCAAATTTG 1920 

GCTTCTCATT ACCTTATGAA GGTGGCGATA TTTCTTTTGA TAATCATTGG CAAGACGATT 1980 

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ATGTCATTCA TAATCATTTG AACTAAACGT 
AGCAGCCTTA AATTTTAAAA AAAGACACAT 3900

ACCAACTTCC GAAATGTAGA TGAATTCTCT ACAATAACGG AAGTTTTTCT TTTAATATTG 3960

AAATTTCTCA AGGATAGGTC TATACTTTAT AAATCGTAAT TATTACGATT TATAATCAAA 4020

AACAATAACT TGAAATAGAT CATTGAGGGA GTGTTAATAT GCAACATCAT AAAGTGGCTA 4080

TTATcGGTGC CGGTGCTGCA GGTATAGGTA TGGCCATTAC CTTAAAAGAT TTCGGTATAA 4140

CAGATGTCAT TATTTTAGAA AAAGGAACAG TAGGACATTC ATTTAAACAT TGGCCGAAAT 4200

CGACCCGTAC GATCACGCCA TCATTTACGT CTAATGGATT TGGCATGCCT GATATGAATG 4260

CAATTTCCAT GGATACTTCA CCAGCATTTA CATTTAATGA AGAACATATT TCCGGAGAAA 4320

CATATGCTGA ATATTTACAA GTGGTTGCCA ACCATTACGA GCTGAATATC TTTGAAAATA 4380

CAGTTGTCAC AAATATATCT GTAGATGATG CATATTATAC GATTGCAACG ACAACAGAGA 4440

TATATCACGC GGATTATATC TTTGTCGCAA CAGGTGATTA TAATTTCCCT AAAAAgCCAT 4500

TTAAATATGG TATTCATTAT AGTGAAATTG AAGACTTTGA TAACTTTAAT AAGGGGCaAT 4560

ATGTGGTTAT CGGAGGTAAT GAAAGTGGCT TTGATGCTGC ATATCAACTT GCAAAAAATG 4620

GCTCTGACAT CGCACTTTAT ACTAGCACAA CCGGTTTAAA TGATCCGGAT GCTGATCCTA 4680

GTGTTAGATT GTCACCTTAT ACACGTCAGC GACTAGGTAA TGTCATTAAG CAAGGTGCTC 4740

GCATCGAAAT GAATGTACAT TATACAGTTA AAGATATTGA TTTTAACAAT GGACAGTATC 4800

ATATCAGTTT TGATAGCGGA CAAAGTGTGC TTACACCTCA TGAACCAATA CTAGCAACTG 4860

GCTTTGATGC AACAAAAAAT CCAATCGTTC AACAATTATT TGTGACAACA AATCAAGATA 4920

TTAAATTAAC AACACATGAT GAATCGACAC GTTATCCGAA TATTTTTATG ATTGGTGCAA 4980

CAGTTGAAAA TGATAATGCC AAATTATGCT ATATCTATAA ATTTAGAGCG CGATTTGCAG 5040

TACftGCACA TCTTTTAACA CAGCGGGAAG GcTTACCAGC TAAACAAGAT GTCATTGAAA 5100

ATTATCAAAA AAATCAAATG TATTTAGATG ATTATTCATG TTGTGAAGTG TCATGCACAT 5160

GTTAGAAGTG AAATATGATA TGAGAACTGG GCATTATACG CCCATACCTA ATGAACCTCA 5220

TTATTTGGTT ATTAGTCATG CGGATAAACT TACCGCAACA GAAAAAGCGA AATTAAGATT 5280

ATTAATCATA AAACAGAAAT TAGATATTTC ATTGGCAGAA AGTGTAGTTT CTTcGCCTAT 5340

AGCGAGTGAA CATGTGATAG AACAATTGAC ACTATTTCAA CATGAGCGAC GACATTTAAG 5400

ACCTAAAATA AGTGCGACAT TTTTAGCCTG GTTGTTGATA TTTTTAATGT TTGCATTGCC 5460

AATCGGTATC GCTTATCAAT TTTCAGATTG GTTTCAAAAT CAGTATGTGT CAGCATGGAT 5520

AGAATATTTA ACTCAAACAA CATTGCTCAA TCACQATATA TTACAGCATA TATTATTTGG 5580
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ATTGATTAGT TTATCAACTG CTATAATTGA 
TCAAACAGGA CTCAAATCAT GGATGATATG 5700

GGCAATTGAA CCGTCAATGT TATGGATAGG ATTACAAGGT AATGATATCG TGCCACTATT 5760

AGAAGGGTTT GGATGTAATG CAGCAGCTAT TTCACAAGCA GCACACCAAT GCCATACCTG 5820

CACGAAGACA CAGTGTATGA GTTTAATAAG CTTTGGTAGT TCTTGTAGTT ATCAAATAGG 5880

TGCGACATTA TCTATTTTTA GTGTAGCTGG AAAGTCATGG CTATTTATGC CGTACTTAAT 5940

ATTAGTACTT TTAGGTGGCA TCTTACATAA AGGATATGGT TGAAAAAGAA TGATCAACAA 6000

CTTAGCGTTC OGCTACCTTA TGATAGGCAA TTACATATGC CAAATATACG TCAAATGTTG 6060

CTACAAATGT GGCAAAATAT ACAAATGTTT ATCGTTCAAG CGCTACCTAT TTTTATCACA 6120

ATCTGTCTTA TTGTTAGTAT TTTATCACTA ACGCCAATTT TGAATGTTTT ATCACAAATA 6180

TTTACACCTA TATTATCGTT ATTAGGCATC TCGTCAGAAT TGTCACCAGG GATTTTATTT 6240

TCAATGATTC GAAAAGACGG CATGCTCTTG TTTAATTTGC ATCAGGGCGC CTTATTACAA 6300

GGAATGACAG CAACACAGTT ACTACTACTT GTGTTTTTTA GTTCAACATT TACAGCGTGC 6360

TCGGTCACAA TGACGATGCT TTTGAAACAT TTAGGTGGTC AGTCAGCACT AAAATTAATT 6420

GGAAAGCAAA TGGTGACATC ATTGTCTTTA GTTATTGGTG TAGGCATCAT TGTTAAAATA 6480

GTAATGCTGA TTATTTAAAA AAAATGAACT ATAACTGAAT ATAGAGTCAT GTCAGTCAAT 6540

AGGAGATCTA TCTTGGAATA TGCTATTCAT ATGAAGTATA AGAGGAGAGT CGCAGATGAA 6600

AATAGTTATT ATAGGTGGGT TTTTAGGTGG CGGTAAAACG ACTGTCTTAA ATCATTTGCT 6660

CGCTGAATCA TTAAAGGAAT CGCTGAAACC AGCAGTCATC ATGAATGAAT TTGGGAAAAT 6720

GAGTGTTGAT GGTGCCTTAG TATCTGAAGA CATACCTTTA AGTGAACTGA CAGAGGGGTG 6780

TATCTGTTGT GCAATGAAAG CAGATGTATC AGAACAGTTA CATCAATTAT ATTTAAAAGA 6840

GCAACCAGAC ATTGTATTTA TTGAATGTAG TGGQATTGCA GAACCGGTCT CTGTCTTAGA 6900

TGCTTGTTTA ACGCCTATTT TAGCTCCGTT TACAACAATT ACACATATGA TTGGTGTAAT 6960

AGACGCAAGC ATGTATAAAC ACATTAAATC ATTCCCTAAA GACATCCAAG GCTTATTTTA 7020

TGAGCAATTA GCATATTGTT CTGTCTTATT TGTTAATAAA ATAGATTCAG CAGATGTTGA 7080

AACAACGAGC AAACTATTGA AAGATTTAGA AGTTATTAAC CCAGAGGCCG ATATACAAGT 7140

CGGTATGCAT GGCAGCGTCA CTTTGCCAAT ATCAGTTAGA CAAATGACAG CAACTTCTGA 7200

CAATAAACAT AAGTCTTTAC ATCAAATGAT TAATCATCAA TTTGTGCAAT CACCAGTCAA 7260

ATGTACTAAA GCAGAGTTTA TAAAACGTTT AGCATGCCTT CCGTCTCATA TTTATAGGTT 7320

GAAAGGGTTT ATGACATTTG AAGACACCGC ACATACGTAT CTCATTCAAT TTACACAAGG 7380
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CGGAAAGGGT ATTTCAAAAG AAGACTATCA 
ATGTTTGGAA CAGTAGTGTT TTCAGTGGAA 7500

GAGAATGGTT AACATGCCTT CATGTATAAT AACGAGTTGA TTTGAACGTT TAAGCGTAAA 7560

TAAAAATAAG CTTGGTCAGC CATCAAATAT AATTTGAAAA CTGTCCAAGC TGTTTTATTA 7620

GAGAACAATC AATTAACCCC ACATATTTAA TAATACATCA GCAAAGCCTT CAGGTTTTTG 7680

AATATAACCT AAGTGACCGC CTGGAATATC TACAATAGGT ATGCCAGTTT CTTTATTTAT 7740

ATAAAAGTTA ACATCTTGTG GGAAGGAGCC TCTAGAATCT GTCCCATTTA GTAGGGTGAT 7800

TTTATCGCTG TATTTTGTGA AATCATCCAA AGTAATATCT GAATGCGTAT ATTGTCTAAT 7860

TTCAAATTCT GACCAGAACA TCGTACGTTT GTACTGTTCT ATACGTCCTT CTTCAGTATC 7920

AGCAGGTTGA GACATCATTT TTGCATCAAT TGGTGCGATA TTTAATGTTT CGCCAAATGT 7980

TTTCATGCCT TTTTCTAAGC CTTCTGTTAA AATTTGATGC ACAATGTCAT CATTTTTATC 8040

TTTCCAATAA GTACTGTCTG GTAAAAATGT ATTAATTGGT GGTTCGTGAA ATGCAATCTT 8100

TTTAACGACT TCAGGGTAAT CTTTTAACAC ATGCATCGCA ACGATTGAAC CTGAACTTGA 8160

ACCTAATATA TAGACAGGTT CATCACTTAA TGACTTTGCA AGTTCGGCAA TGTCCTGTGC 8220

GTCGCGTTTG ACACGATAAT CACTGTCAGG GTTTGAAGCG GAATCAGGGA GTGGTTCAGT 8280

TAACTCGCTT TCTCCATAAT CACGACGATC AACGGCTACA ACAGTAAAAT GGTCTTTTAA 8340

CTGTTCTGCA AGAGGCAGAA AAATGTCTCC GGTACCGTTT GCACCAGGAA TAAAGATGAG 8400

CACGGGTCCT TGTCCX5ACTT GGTGGTATCG TAATTTAGCG CCTTGTAATT CTAAAGTTTC 8460

CATATTCAAT GACCTCCATT TGTTAATTGT TAGGTGATAA ACCTAATAAT TTAGCACCAT 8520

TTGTATAACT TATTTTCTCT TTTTCTTCAT CTGTTAAACC CAGTTCATCT AAAAATACAC 8580

CTAATTTTTC AGGCTCAATA TATGGATAAT CAGCAGCATA AAGAATTCTA TCAATACCTA 8640

CTTCTTTCTT GACTAAATCA AACTGTGGCT TCGTTAACAT GCCACTCGGT GTGATATAAA 8700

AATTATTTTT AAAGTAATAG CTTACAGGGT GGTTCAAATG TTCAGCGAAT AAAGCTTCAT 8760

CCATACGTTC TAAGAAGAAT GGGATAAACT CACCCCAATG TCCAATAATC ATATTTAACT 8820

TTGGATAACG ATCAAAAATA CCAGATAATA CTAGATGTAT TGTATGAATG CCGACATCAA 8880

TGTGCCAACC ATAACCAAAA CAAGCAAATG TTGCCGCAGT TACTTCAGGA TAATTTCCTT 8940

TATAGTATGA TTGATAAATG TCACTGTTAA CTGGCGCGGG ATGTAGATAA ATCGGTACGT 9000

CTAAATTTTC AGCTGTTTTG AAAATAATGT CATATTTGTC TTGATCAAGA AAACCATCTT 9060

GTGCACGTCC CATAATGAGC GCACCTTTGA ATCCTAAATC ATTGATGCAA CGTTCGAATT 9120

CTCGCGCTGC GGCTTCAGGC TCATTGATAG GTAAAGTTGC AAAGCCTACA AAGCGATTGG 9180
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TCTGACCAAC CAAATTTGAA GGAGAACCAT TTCCATAAGA TAAGACTTGA ATTTGAACGT 93 00 

CTTGATTATT CATAAATTGG ATACGTTCAT CATGATGTGA TAATTCGTCG GCATTTGTAA 9360 

AACCTGTCTT TTTTTcAAGG CCTTCTAACA TTACTTTCAT CGGTACACCT TTAGGATCTG 9420 

CTGATATCGC ATTCATCGTT TCTTTTTGAA TATCTTCAAT GACATAATGT TCTTCAAACG 9480 

TAATACTTTT CATTTACTTC GCCTCCATAT TGTATTGCAT GTTTATTGCA TCTATTGCAG 9540 

AAGCATTTTT TATATACCTC TAATTTCAAT GTTTGTAACA TAAAATTGAT CTACCAAGGC 9600 

ATCTCTCCAT CGCCATTAAT AAATGTACCT GTTGGGCCAT CTGCACCAAT CGTTGCTAAT 9660 

TGAATGATTC GCTTGATTCC TTCAGAAACG TGTTTGGAAT TATTACTAAA ATCACCAACT 9720 

AAATCAGTAT TTGTAGCGCC TGGATCAGCA GCATTGATTT GCATGTTAGG TAATCCTTTA 9780 

GCGTATTGTA GCGTTAGCAT TGTTACTGCC GATTTAGACG AACAATAAGC TAATGAATTC 9840 

ACTTTAGATT CAGCTGTTTC GGGGTTTGTA ACCATTCCAA ATGAACCTAA ACCACTTGAT 9900 

ACGTTGACGA CAACAGGTTG TTCAGATTTT TCTAAGAGAG GGACGAATGT ATTCATCATT 9960 

CGTACGATAC CGAATACATT CGTTTGATAT ACTTCTTCAA CGTCACGAGG TGTCAATTTG 10020 

GAAGGTGCTG AAAATTGACC AGATATACCT GCATTGTTAA TGAGGATATC AAGACGGCCT 10080 

TCTTTTTCAG CAATCATGTT ATAAGCATTT TTGACTGAGT AGTCACTTGT AACATCTAAT 10140

TGTACATAAT GAACACCTAA TTTTTGTGAT GCTTGTTGTC CTCTTACATC ATTCCGAGAA 10200 

CCTATATAAA CTTTGTAACC CAATGCTTTA AGTGCCTCTG CACTTGCATA GCCTAACCCT 10260 

TTATTGCCTC CTGTGATTAA CACAATTTTA GTCATTACGT CCCACCTCAT CTAAATAAAT 10320 

GTTTAATAAA TAATTTCTGT ACGCTTCAAT TGAAATATGG CGATGCTCTA TTTGGAAGGC 10380 

AAATACACTA GTTGATAATG ATTGCAACAG CATATCTGTT TTGAAtTCGT GTAAGTGTCG 10440 

TCAT€GCTTT TAAATAAGTC ATAATAAAAA TCAAATAATT CTTGATAAAA TGCGCTTTGG 10500 

TAAAAACGTA ATTTATTGTT GCCTGCTTCA ATACATTGCA GTAGTGCCTT ATTATCGATT 10560 

TTAAATTGTA AAAGATAATC TAAC6ACACT TGCATAACCT CATAATTAGA ATGATAGTCA 10620 

TCTTTAATTT GCTTAAAATG AGTGATAAAA ATATCAAGGT CTCTTTGTAT GACGTAGTAG 10680 

CATAAATCGC TTTTATCTTT GAAATGTCGA TACAATGTCC CCATACCGAT ACCTAGTTCT 10740 

TTAGCAATAC GATTCATACT AATGTTTTCA ACGCCTTCTT CATCAAAAAG TTTGTGCGCT 10800 

ATTTCTTCAA TTCGTTGCCT ATTCTCTTTT GCATCTTTTC GCATGATTAC ACCTACTTAA 10860 

AATTCTCTAA AATTGACAAA CGGATAACTC TCCGTTTATT ATAAAACGTG TTAAGAAAGT 10920 

TAGCAATGAA TTTGCAATAA CTATTAAATA TCATAAAAGA AAAGAGTGTT GATAATGTCT 10980 
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ACCTTATCGG TTCAAATGAT TGCTGAAAAA CTGAATGTCA CTACAGAAGA TGTGGAAAAA IHOO 

GTATTAGCTA TGACAGCGCC ACTAGGCATT TTTAGTCATC AATTACAACG ATTTATTCAT 11160 

TTAGTATGGG ATGTCAGAGA TGTAATAAAC GACAATATTA AAGGAAATGG ACAAACACCA 11220 

GAACCATATA CGTATTTAAA AGGTGAAAAA GAGGACTATT GGTTTTTAAG A 11271 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6261 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

CAACCCGTTC AGAACAAAAT AAAAACCGTA CAATTTTATC ATCTTAATGA TTATTGTACG 60

GAAAAACTTT TTTACATCAT ATCTGCATGT GCATAATCGA TATCGGTAAA TTTATTATAT 120 

TGTTTCATAA AATGTAACTT AACTGTGCCT GTTGGACCGT TACGTTGCTT AGCAATGATA 180 

ATTTCAATTT CACCGTTTTC ATCATTCGTT TGTGGCTCGA AACCACCATC ATCGTCATCA 240

TCTTCATCGC CGCCACGGTT ATAGTAATCA TCACGGTATA AGAATGCAAC GATATCGGCA 300 

TCTTGCTCAA TCGAACCAGA TTCACGAATA TCACTCATCA TTGGACGTTT ATCTTGTCGT 360 

TGTTCAACAC CACGAGATAA CTGACTTAAT GCGATAACTG GACATTTTAA TTCACGGGCT 420

AATGCTTTTA ATGTACGAGA GATTTCAGAA ACTTCCTGTT GTCTGTTATC GGACGCACGT 480

GAACCACTAC CTTGAATCAA CTGTAAGTAG TCAATCACAA TCATGTCTAA GCCATGTTCT 540

TGCTTTAATC GACGACATTT AGAACGTAAA TCATTAATTC GAATACCCGG TGTATCATCA 600 

ATAA&AATCT TCGTACGTGA TAATTTACCT ACCGCTATAG TAAAACGACT CCAATCTTCC 660 

TCAGTCATAG TACCCGTTCT TAAGCGGTTT GAGTCAACAT TTCCAGAACT ACAAATCATA 720 

CGTGTGGCTA ACTGATCAGC ACCCATCTCT AGCGAGAAAA TACCAACTGT ATACATATCT 780 

TCATGCGTTG CAACTTTTTG TGCAATATTA AGTGCGAACG CAGTCTTACC TACAGATGGA 840 

CGCGCTGCAA GGATAATTAA ATCATTTCGG TTGAACCCTG CTGTCATTTG GTCTAAATCT 900 

CGATATCCTG TAGGTATACC TGGTGTTTGA CCACTATTTT GATCAAGCTC TTCAGCTGTT 960 

TCATACACTT GTCCTAAGAC GTCTCGAATG TCTTTAAAGC CATCGCTTTC ACGAGAAGAT 1020

GATAGCTCTA AAATTCGACG TTCTGCATCA CTTAAAATCG CATCTAGTTC AAGTTCATCA 1080 

TTATATCCAT CATTGGCAAT ACTATCTGCA GTTTGAATCA ATCTACGTTT TAATGCATGC 1140 

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TCTGCAAGAT ATTGCGGGCC ACCCGCTTcA TTCAACGTAC CTTCCGTCGA TAATTGATCC 1260 

ATCAATGTTA CAACATCAAT TTCTTTATTA TCTTCATTTA AGTGCATCAT TGCACGGAAA 1320 

ATATGTTGAT GGGCACCCCT ATAAAACGAC TCAGGAAGCA AAACTTCCTG AGTAGTATTA 1380

ATCAATTCTG GATCTATAAT AATTGAACCT AAGACAGACT GTTCAGCTTC ATTGTTATGC 1440 

GGCATTTGAT TTTGCTCATA CATTCTATCC ATGAATGGTT ACACCTCTTA TTTCAATCCA 1500 

ACTTTATTGT TCAACTGTGT GTACGCGAAT TGTACCTTCA ACTTCTTTAT CTAATTTAAC 1560 

AGGTACATTC GTATATCCTA GGGAATGAAT TCCATTTGGT AAATCCATTT TACGTTTATC 1620 

AATTTTAATA TCATGTTGTG CTTTTAGTGC TTCGGCAATT TGTTTTGTAC TTACTQACCC 1680

AAACAATTTA CCACCTTCAC CAGTTTTTGC TGaTACTTCA ACTTCAATGT TTGATAACGT 1740 

TTCTTTTAAT GCTTTAgCAT CTTCAATTTC TTGTTGGCGT TCTTGTTTTG CACGTTTTTT 1800 

CTGTAACTCT AATTGTTTAA GGTTACCTGG TGTTGCTTCT ACAGCATAAT TCTTTTTCAA 1860 

TAAGAAGTTA TTTGCATAAC CTACTGGTAC TTCTTTAACT TCACCTTTTT TACCTTTACC 1920 

TTTACCTTTA ACATCTTGTG TAAAAATTAC TTTCATGCAT CTTCACTCCT ACTTAATTGT 1980 

TCTGTAATTG CTTGTTGTAA TTGTGCTATC GCCTCTTCGA CTGTCACACC TTTAAGTTGT 2040

GTTGCCGCAT TGGTTAAATG TCCACCGCCA CCAAGTGCTT CCATTGTTAA CTGGACATTT 2100 

ACTGAACCGA GTGAACGCGC AGATATACCA ATCAGATTAT CTTCACGTCT CGCAACAACA 2160 

TATGATGCTT CAATACCTTC TAAACTTAAC AGTTCATCTG CTGCTTGTGC AACTGTTACT 2220 

GGATGATAAA TTTTATCGTC TGAACCATGC GcAATGGCTA TGCCATTATC TTCAACTTTT 2280 

ACAGTTCGAA TTAATTCAGA TCGATTAATG TAAGTATCCA CATCATCTTT TAAGAAATGT 2340

TGCGTTAAAA TCGTATCTGC ACCATGTGCA CGTAAATAAC TCGCTGCATC GAATGTTCTT 2400 

GATGCTGTTC GTAATGTAAA GTTTCTTGTA TCTACAATAA TACCTGCATA CATCACTGTT 2460 

GATTCAAGAC GTGTTAAACG TTGTTCTGTT GGTTGATATT CCAGTAACTC TGTTACCAAT 2520

TCAGCTGTCG AACTTGCGTA TGGTTCCATA TATATCAACA ATGGATTAGA GATGAAGCTT 2580 

TCACCACGTC TATGATGATC GATAACAACT TTACGGTTTG CTTTATTTAA GACATTTTCA 2640 

TCTAAAACCA GTTCCGGTTT ATGCGTATCA ACAATCACTA CGGTTGTCTT AGATGTCATC 2700 

ATATCCCAAG CATCATCTGA TGTAATAAAT CGCTCTCTTA ACTCTGGCTT TTTATCTATT 2760 

TCGTTCATCA CGCGTCGTAA TGTTGGATCA ATGTCAGTCT CATTTAATAC GATGTATGCT 2820 

TCTAAATTAT TCATCATTGC AAATCTAGAC ACACCGATTG CTGCACCAAT TGCATCTAAG 2880 

TCAGGACGTT TATGTCCCAT GATAATGACT TTGTCACCCT CTGCT^GGAT ATCTTTTAAC 2940 

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10 



CCATAGAAAC GCACATTACC ATTAATACTT TTAATTGCAA CTTGGTCGCC ACCGCGTCCT 3060 

AATGCTAAGT CTAGGCCTGA TTGTGATAAT TCACCTAAGT CGATTAAATT TTCAGTACCT 3120 

TCACCAACAC CGATACTTAA TGTTAATTGG GCACGATAAC CAACACTTTT TTCACGTAAT 3180 

TGACTCAAGA TATCAAATTT AGATTCTTCT AAGTCAGCTA ATATTTTTTG ATTTAAATAG 3240 

GCTACGAATT GATCGGAACT GTATCTTTTG AAAAATATAT TATACTCAGT TGCCCATCGA 3300 

CTAATGACAC GCGTTACCAT TGAGTTGATT TCCGAACGCT GCGTATCATT CATATTTTGC 3360 

GTAATCTCAT CGTAGTTATC TAAAAATAAT GTCGCAATGA TTGGTTTAGA ATTTTCATAT 3420 

AGTTCATTTG TTTGTACTTG TTCAGTTATA TCAAAGAAAT AGAGGCAGTG ATCATTCTCA 3480 

GAATAACGTA CTTGGAAATG ATACTGATTA TATTCTATTT cAACGGATTT CACTCTATCT 3540 

AATTGCTTTA AAATGTTTGG AAATACTTCA TTTACAGATT CAGAAATGAC ATTCGCTTCC 3600 

ATATGATCTG TCATAAATTG GTTAACCCAT TCGATGTGAT CATTTTCATC TAAAACAATG 3660

ATACCAATTG GTAAATGTTT GATTGCTTTA TTATTTGTTG TTGAAATTTG AGCACTCAAA 3720 

CCATCTACAT AACTATCCAT TTTCATTAAA GCTTGTCTGA ATAAAATGAT GCTAACAATA 3780 

ATCATCACGA CAAGAACGAT AGATGCAATT AGTGCTATAA GACTATTAAA GATAAACCAT 3840 

ACACCCATTA AAACAATTGC TGTGATGATC ATGATGACAA ATGGTATTAG TAAAGCTTTC 3900

TTAGTGGACT GCCGATTCAT TATTCCACCT CTATTCACTT TTTAGAATTA TTTTTCATGA 3960 

TTCGCTTCAA ATTCAAACTT AAATCGATAA CACCAAGTAG TCCTACAATA TGTGTCGTAG 4020 

GTGTCAGTAT TGTACCGATA ACCAATAGTA AAATCGTTAC TGCATTCGGC AAACCTTTCG 4080 

CTTTACCAAA GAAATGAATA ACACTTAAAC CTTGAATATA CATTACTAAT GATAACACAA 4140 

GTTGGAAGTT TAAAAGAATG CTCTGGAACA CACTCGGTTG ACCTGTAAAT AATAAACATA 4200 

TGA-BAACAAT AATGTATATC CATAATAAAA TACCOCTCAT TTGCCACGCG AAAAGTGGCT 4260 

TAAATACAGG TGTAGCGATT TTAAATTTTC GTAAAATCGG AAATGTAACG ATTAAGTTAA 4320

TTAAGACGAT TAAAAATGTA ATGATAATGA TGAAACCTGG TAATTGAACG GTCGCTTGTC 4380 

TAAACCCTTC TTCTAATATT TGGGTCATAT TCGCATCGGC ACCGCTCATC GTAATCGCTT 4440 

CATGTT^TGT TTGCTTGAAA GGTTTTACTA TGCTCGCTGA TGGTGGAATC CTTCCGAATG 4500 

TTTGTAGTAA CATAAAAGCG ATTAATGAAA TTtiArCTCAT CGCTACTGTT GTTACGTATA 4560 

ATATTCTTTC TTTAGACGTT CTTTCTTTGA GCAATTGACC AATAATTAAA CTTGCAATTA 4620 

AGACTAATAT GATGGCACTT AAAACGAAAG TATTACCTAA AACAGTTGTT ATAATTACTG 4680

TAATAAGTGC ACTAATCCCG AAAGATTGTA TTGATTTATT CCATAAAACG ATACCTGGTA 4740 
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CAAATACCAA CGCAATCGTT GCAATTATTG TTGCTTTAGG TTGTATTTTT GAAAACACAT 4860 

AAGCCACTCC CATATTTTTA ACTATAGCTA TTATTTTAAC CTCTTTAATG AAAATTAACA 4920 

ATTTATAGAT TGTATGCTTC TATTTCATTT AATTGAATAA TAACTTTCAT GTTTTATAAG 4980

TAATTAACAT ACTCATTTGA ATCGCTTTTG TGTGCTTTCA TTTTCAACAT GATTATTTAA 5040

TCCCACTACA TAGCAATCAA GCTTGATTTA GATTTACAAT ACATTTCCAC TCTCATGTAC 5100

TCTAGATGTT TTTGAATATG ATAACTGTGA TTTAGTGGCT TCATTCTTTG AAAATATATA 5160 

TTATTACTTA CGCTTAAAAT GCTTTAAATT TAAGAAATGA TATAAGTTAG GTGCCCAGGT 5220 

ACTAAAGTTT AGTAGGaATC CATCATGCCC AACATTATCA GGCACGAAGA AATGACGATG 5280 

ATATTTAAAA CGTTCACCTA ATGCACGAAC TTGATCATCC GGATATAGCA AATCATCTAT 5340 

GAACCCCATC GTTAACACTT TTGTTTCTAA ATTTTTAAAA ACATGCGTTA CGTCTGTGCG 5400 

ACCTCGGTCA ATGTTGTGAC TATCCAATAC ATCTAGCAGT GTCAGATAAC AATTCAAATC 5460

AAAATGTTCT TTAAATTTAT TACCTTGATG TTGTTGGTAT GCGACTACTT CATCCGGCGT 5520 

AAAACGTTCA TCATAACTTT TTGATGATCG ATATGTCAAA AAACCTAATT GGCGTGCAAT 5580 

ACTTAGACCT TCCTTACCAC CAAGATGAAT GGCTTGCCTT GCAATTTCAT TGAAAGCTCT 5640 

ACTATAAGAT GATGTTCGAC TTGTTGCAGC AAGGATAATG GCTTTATCTA CTTCAAACTG 5700 

TTGATTGTAG AGTAGTTCCA TTGCTTGCAT ACCTCCAAGA CTTCCCCCTA TTAAAATATT 5760 

AATCTTATCA TAACCAAGGG CTTGTATACC TCGTTCATTC GCTCTGACTA TATCTCTTAA 5820 

TGTTAATTTT TTAGGAAAAT GAGGGTCGTT TAAAGGTGAA CTTGAACCGA AAGGACTACC 5880 

AATAACATCA AATGTTAAAA ATTGATAATC GTGAATGGGT ATATATCCCC CATCAATAAT 5940 

TTCTCGCCAC CAACCCGGAT AATCATCTGT TCCATATGTT AAATGATTGC CAGTTAATGC 6000 

ATGAGAAACT ACAACTAATG GTTGTCCATG ATAACCGACA TGCTCATATC TCAAACGCAA 6060 

GTnATCTATG ACTTCCCCAG ATTCTGTAAT AAATTCCCCT AAATTTAAAG TATCTACTGT 6120

GTAATTTGTC ATrGTTCTTT CCTCCTTAAA CAAAAAAACT TCTCACCCTA TTGAAAAGTA 6180 

AGAAGTCTTT ATACTTATCA TTCGAGTAAC TCGTTGGTTT TAGCACCGTG CTATAAAGTC 6240 

GGTTGCTGAA GTATCACAGG G 6261 
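
Each sequence row in this listing ends with the cumulative number of bases printed so far (60, 120, 180, and so on, up to the stated LENGTH). The following is a minimal sketch, not part of the original disclosure, of how that running count could be checked mechanically. The function name `check_running_counts` and the sample rows are hypothetical, and the sketch assumes contiguous rows made of whitespace-separated base blocks followed by a decimal count.

```python
def check_running_counts(lines):
    """Return (line_index, expected, found) for each row whose trailing count is off.

    Assumption: every sequence row is "BLOCK BLOCK ... COUNT", where COUNT is
    the cumulative number of bases up to and including that row. Rows that do
    not end in a decimal number (headers, blank lines) are skipped.
    """
    problems = []
    total = 0
    for idx, raw in enumerate(lines):
        parts = raw.split()
        if len(parts) < 2 or not parts[-1].isdigit():
            continue  # not a "blocks + count" row
        blocks, found = parts[:-1], int(parts[-1])
        total += sum(len(block) for block in blocks)
        if found != total:
            problems.append((idx, total, found))
    return problems


if __name__ == "__main__":
    row = " ".join(["ACGTACGTAC"] * 6)  # hypothetical 60-base row
    sample = [row + " 60", row + " 130", row + " 180"]
    for idx, expected, found in check_running_counts(sample):
        print(f"row {idx}: expected {expected}, found {found}")
```

Because the expected total is computed from the bases themselves, a single misprinted count is reported once, whereas a missing row shifts every later count and is reported on each following row.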

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1222 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
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TTCTTTTGGC ACGACATAAT TGTCTTTATC TTGAACTAAA TATCCGCCAG ATACTGAAAC 180 

AAACTCTTCT TCGTTACTGT CTATAGTCAT ATCAATTTCT AATAATCTTA CATTCTTCTT 240 

TTGTTTTAAA ATATCTAATG CTTCATCTGT AAATTTTGGT GCAATAATGA CTTCCAAAAA 300

GATACTATGC AATTGCTCTG CTAACTCAGG TGTTACAGCT CGGTTTAATG CAACAATTCC 360 

ACCAAATATT GATTGACTAT CCGCTTCATA CGCATGTTGA AATGCTTGTT CTATCGTGTC 420 

ACCGATACCA ACACCACATG GATTCATGTG TTTAACCGCA ACTGTAGCAG GTGTATCAAA 480 

CTTTTTAACT AAAGCTAGT6 TAGCATCTGC ATCTTTAATA TTGTTATAOC TTAATTGTTT 540 

CCCATGTAAT TGTTTAGCGC CTGCAATCGT GTGCTTAGCA TTCGAAGTTC TCACAAAATA 600 

CGCTGATTGT TGTGGATTTT CTCCATATCT TAAAGTTTCT TTATCCCCTT TAAAGAAACX5 660 

TACAATCGCT TCATCATATT CTGCAGTATG CTCAAAAACT TTAATCATTA ATGATTGTCT 720 

ATATGACTCA TCTAACGAAT CGTTTCTTAA TCXSCGTCAAT ACTTCTTGAT AATCTGCCGG 780

ATGTACAATT GTTGTTACAT GTTTATAGTT TTTAGCTGCA GCACGTAACA TTGTTGGACC 840 

ACCAATATCA ATATTTTCAA TTGCTTCGTC CATCGTCACA TCAGGGTTTG CAACAGTTTG 900 

TTGGAATGGA TATAAATTAA CTACTACCAT ATCAATTAAA TCTATATGTT GTTCTGATAA 960 

TTCATTTAAA TGCTGCGGTT TATTTCGATC AGCTAAAATG CCACCATGAA CAGCCGGATG 1020 

T 1021 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3759 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

TCATTCACTC CTAAATTGTT ATTACACTAT TACACaTAGC TAATCATCAA TGTGAAATCA 60 

CCTTCAAAGA CACTATCCAA ATCTTCAGAA GTCAAAATAA AGTTTGTACC AGTAGTCAGT 120 

TTGAAAATTT CACCATCGAC AATCATTTGC CCTTCGCCTT CCAACACTGT AACTAAACAG 180 

AACTCTCTAG GCTTCATATA ATTTAACGTG CCAGAAATTT CCCATTTAAC CAATGTAAAG 240 

AAATCATTCG ATACAATGTG TGTACACTTA TGGTTTTCAA TAATTTCGCT TTCAGGCAAA 300 

ATATTAGGTA ATGGTGCATT GTACTGAATA ACGTCTAAAG CTTTTTCAAT ATTTAACGGT 360 

CTATCATTAT ATTGATTATC TTGACGATTG AAATCATAAA GTCTATATGT AATGTCTGAC 420 
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ATAAAAtAGa ATTCyCCAGG kTTTACtTTA AtatATCyAA gTAtCGaCtC tATCGTTCCG 


540 




TGTTGAACAT GATTCGCAAC TTCTTCTCTA GACTCTGCTA ATGTCCCtAT AACTATTTCT 600

GCATCTTCTT CTGCATCTAT AATATACCAA CATTCAGATT TGCCATATTG CCCgTTTTCA 660

TGCTCATAAG CATAAGAATT ATCAGGGTGC ACATGAATAG AAAGTGATTC TCTTGCATCC 720

ACTATTTTAG TTAGAAGCGG AAAATCTTTG CTTGGGAAAT CACCAAACAA TTCACGATGT 780

TCTGACCAAA TACGGTCTAA TGTTTGACCT TGATATGGTC CATTAATAAT CTCGCTCGTA 840

CCATTTGGAT GTGCTGACAC ACACCAACAT TCCCCCAGTT GTATCATTGT CTAATTGATA 900

TCCAAACTCA CTTAGACGTT GACCQCCCCA TAATTTTGTT TTTAAAATTG GTTGTAAAAA 960

TAATGGCATT GTTGCACCTC CATTGTGATT AAGTAAGCAA TAGAACTCTG ATGTTGTTGT 1020

TCCATTATAT TTTGATTTTG TTCTCATTTA CATCX5TATTA TTAACTTCCA CATTTCAAAT 1080

TAACTATTAG TGATTGTACC ATATTTACTA ACATTGCAGT ACTGCCAATT AAAAGnGCTT 1140

CACTTAAATT TACAGTACTT TAACATTTTC AAAAATTTAT AGCATAGAGA TTATATCTCT 1200

CTTACATTTG TACATATTTC CCTTTAAATT TACTCGCCCA TTATACCAAT TAATAaACAA 1260

CTTTAATAGT TGTGCCATAC ATTGTTCAAA TTCTTTGTAA AACGCATAGA CAATACGTAC 1320

TTATTCATAC TTATAATTCA TCATTTTCAA AAAATAACGA GTTACGAAAA AGTAACCCGC 1380

TTCAAATCAT ATTTACTATC CTTATTAATC CGTTTCATTT TCAAATTGAG TTAAAGCATC 1440

TTTAATGTCC TGATCACCAC TAATAATTTG AAACTCTTGG TGATTAAAAT GATTGGATGT 1500

GACAATTTCT TTTAATACTG TCGCAACATC TTCTCTAGGA ATTTCACCTT TACCATCAAA 1560

ATATTGTGCA GCTTCTATCT TTCCAGATCC TGCTGCATTT GTAAGTGCCC CTGGATGTAA 1620

AATTGTATAA TTCAAACCTG nAACGTCTTA AATAGTCATC AGCGTAATGT TTAGCTATTG 1680

TATATGGCTT TAAATCACCG CTATCATCAA AAGCCTGACG TCTCGAATCA TATGTTGAAA 1740

CCATGACATA GTGTTTAATA TTGGCCTCTT TACTCGCAAT CATTGATTTA ACAGCACCAT 1800

CTAAATCGAC AATAATTGTT TTATCTGCAC CCGTGTTCCC TCCAGAACCT ACTGAAAAGA 1860

TAACTTTATC GAATGGTTTA AACGTCTCAG TTAAAGTCTC TATTGAATCA TTTTCAACAT 1920

CAACAAGAAT TGCTTTCATA CCTTGTGATT TTAAOGCATT AAGTTGATCT GATTGCCTAA 1980

CACCAGCAGT AAATGGTACA TTTTCTTTTG CTAATTGTTG CACTAGTAAC GAACCTACAC 2040

CGCCATTAGC ACCTATAACC AAAATATTCA TTTACAACAC TCTCCTATkT ATTATTCTCT 2100

ATGCCATACC ACTTTATGAG ATATGTAAAA CTTGTTACAA CTATAAAAAT CAATTGACAT 2160

ACTACTGGGA ACGTATTAAA TTAATATATG AACAAATATT CATATGAAAG GATTGTCATA 2220
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tCaAGGCATT AGcGATTACA ATCGAATACG TATCaTGGAA TTGTTATCaG TCAGCGAAgC 2340 

AAGTGTTGGT CACATTtCAC ATCAATTGAA TTTATCTCAA TCAAATGTCT CGCACCAATT 2400 

AAAATTACTT AAAAGTGTGC ATCTTGTGAA AGCAAAACGA CAAGGCCAAT CAATGATTTA 2460 

TTCATTAGAT GACATCCACG TAGCAACTAT GTTAAAGCAA GCCATACATC ACGCGAATCA 2520 

TCCTAAAGAA AGTGGGTTAT AATATGTCTC ATTCACATCA TCATCATGAC CATATGCATA 2580 

GTCATGTAAC TACAAATAAT AAGAAAGTAT TGTTTATATC GTTTTTAATA ATCGGTCTAT 2640 

ATATGTTTAT CGAAATCATC GGCGGTCTCC TTGCTAACAG CTTGGCATTA CTATCTGACG 2700 

GTATCCATAT GTTTAGCGAC ACATTCTCAT TAGGTGTTGC ACTTGTCGCA TTTATTTATG 2760

CTGAAAAGAA TGCCACAACT ACAAAAACAT TTGGTTATAA ACGTTTCGAA GTACTCGCAG 2820 

CGTTATTTAA CGGTGTAACG CnTTTGTAA TT^GTATTTT GATTGTTTTT GAAGCGATTA 2880 

AACX5TTTCTT TGTTCCTTCT OAAGTTCAAT CAAAAGAAAT GTTAATCATT AGTATTATCG 2940

GTTTAATTGT CAATATCGTT GTTGCATTCT TTATGTTTAA AGGCGGCGAC ACTTCACACA 3000 

ATTTAAATAT GCGTGGTGCT TTTCTACATG TTATCGGAGA CTTATTAGGT TCAGTTGGCXS 3060 

CCATTACTGC AGCTAkTTTA ATTTGGGCAT TTGGATGGAC AATCGCCGAT CCTATCGCAA 3120

GTATTTTAGT TTCCGTTATT ATTTTAAAAA GTGCTTGGGG TATCACAAAA TCTTCAATTA 3180 

ACATTTTAAT GGaAGGCACA CCAAGTGATG TTGATATAGA TGAAGTTATA ACTACTATTA 3240 

AAAAGGATTC ACGAATACAA AGTGTGCATG ATTGCCATGT TTGGACAATT TCAAATGATA 3300

TGAATGCATT GAGTTGTCAT GTTGTTGTAG ACCATACATT GACAATGAAA GAATGTGAAT 3360 

TATTATTAGA AAaCATTGAG CATGATTTAT TACATTTAAA TATTCACCAT ATGACTATTC 3420 

AATTAGAAAC GCCTAATCAC AAACATGATG AATCGATTAT ATGTTCAGGA ACACATAGTC 3480

ATTCACATAA CCATCATGCT CATCATCACG CGCATGTACA TTAATAATTT TAACCTACTG 3540 

CCATTGCATC GATTAAACTT TTCAATGGCA GTAGGTTTTT TATGTCTTTA TGGCGACTTG 3600

TTTGGTCTTT GATGATGCAA TGTTTATTAA CAAATTTTCA ACTATTATTT CTTACATTAG 3660 

TCATATTTTT GACAATTTAC TATTATAATT CTCTAACTTT AGTCACTTTA ATTAATTTTT 3720 

ATTAGATATT AATATGAAAA TAACGTGTTT TTTGTTATT 3759
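
The sequences in this listing are written in the single-letter IUPAC nucleotide alphabet, including ambiguity codes such as n, y and k that appear in some rows. As a hedged illustration, not part of the original disclosure, the sketch below flags any block containing a character outside that alphabet, for example a digit left behind by text extraction. The name `suspicious_blocks` is hypothetical, and the sketch assumes whitespace-separated blocks with an optional trailing decimal count.

```python
# IUPAC single-letter nucleotide codes for DNA (bases plus ambiguity codes).
IUPAC_NUCLEOTIDES = set("ACGTRYSWKMBDHVN")


def suspicious_blocks(line):
    """Return (block, offending_characters) for blocks with non-IUPAC characters.

    Assumption: a row is "BLOCK BLOCK ... [COUNT]"; a trailing decimal count,
    if present, is ignored. Case is ignored, so lower-case bases are accepted.
    """
    tokens = line.split()
    if tokens and tokens[-1].isdigit():
        tokens = tokens[:-1]  # drop the running base count
    flagged = []
    for block in tokens:
        bad = "".join(ch for ch in block if ch.upper() not in IUPAC_NUCLEOTIDES)
        if bad:
            flagged.append((block, bad))
    return flagged


if __name__ == "__main__":
    # Hypothetical row mixing valid ambiguity codes with one damaged block.
    print(suspicious_blocks("TAAC6ACACT ATTCyCCAGG kTTTACtTTA 540"))
```

The check only reports where a block departs from the alphabet; it does not attempt to guess which base was intended.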

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 13086 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: i 
SEQ ID NO: 16:

TAATTATCGC GCATAACAAA ACATTA6CAG GACAATTATA TAGTGAGTTT AAAGAATTTT 60

TTCCTGAAAA CAGGGTGGAA TACTTTGTAA GTtACTATGA TTATTATCAn CCAOAGGCAT 120

ACGTACCGTC TACTGACACT TTTATTGAAA nAGATGCCTC AATCAnTGAT GAAATTGATC 180

AACTACGACA TTCTGCTACA AGTGCATTAT TTGAACGCGA TGATGTAATT ATTATTGCTA 240

GTGTAAGTTG TATATATGGT TTAGGTAATC CTGAAGAATA TAAAGATTTA GTAGTAAGTG 300

TTCGAGTTGG TATGGAAATG GATAGAAGTG AATTACTTAG AAAACTTGTc AGATGTGCAA 360

TATACACGAA ATGACATCgA TTTcCAACGA GGAACGTTTC GAGTGC6TGG TGAT6TAGTG 420

GAAATATTCC CAGCCTCTAA AGAAGAACTT TGTATAAGGG TTGAGTTTTT CGGCGATGAG 480

ATTGACCGTA TCCGAGAAGT TAACTACCTA ACAGGTGAAG TGTTGAAAGA AAGAGAACAT 540

TTTGCGATAT TCCCAGCTTC TCACTTCGTA ACACGTGAAG AAAAGTTGAA AGTTGCGATT 600

GAACGTATTG AAAAAGAATT GGAAGAACGA TTGAAAGAAT TACGAGATGA GAATAAATTA 660

CTAGAAGCGC AAAGGTTAGA ACAGCGTACC AACTATGATT TAGAAATGAT GCGAGAGATG 720

GGATTCTGTT CAGGAATTGA AAACTATTCC GTACATTTAA CTTTGCGACC ACTGGGTTCG 780

ACACCATATA CTTTATTGGA TTACTITGGC GATGATTGGT TAGTAATGAT TGATGAATCA 840

CATGTGACAT TACCGCAAGT TCGAGGCATG TATAACGGAG ACAGAGCGCG TAAACAAGTT 900

TTGGTGGATC ATGGGTTTAG ATTACCGAGT GCATTAGATA ACCGTCCACT TAAATTTGAA 960

GAATTTGAAG mAAAGACAAA ACAACTTGTG TATGTATCTG CAACGCCTGG ACCATACGAA 1020

ATTGAACATA CGGATAAGAT GGTTGAACAA ATTATTCGTC CTACTGGTTT ACTGGATCCT 1080

AAGATTGAGG TTAGACCTAC TGAAAATCAA ATTGAOGATT TATTAAGTGA AATTCAAACA 1140

AGAGTgAGCG TAATGAACGC GTACTTGTTA CAACGCrCAC TAAAAAGATG AGTGAAGATT 1200

aACCACATAC ATGAAAGAaG CGGGTATTAA aGTtAATTAT CTGCATTCAG AAATCAAGAC 1260

ATTAGAACGA ATTGAAATAA TTAGAGACTT ACGAATGGGT ACATATGATG TTATCGTAGG 1320

TATTAATTTA TTAAGAGAGG GTATTGATAT ACCAGAAGTT TCTCTAGTTG TCATATTAGA 1380

TGCAGATAAA GAAGGGTTTT TACGTTCTAA CCGCTCATTA ATTCAAaCAA TAGGTAGAgC 1440

TGCGCGTAAC GATAAaGGTG AAGTCATTAT GTATGCCGAT AAAATGACTG ATTCGATGAA 1500

GTATGCAATT GATGAGACAC AACGTCGTCG AGAAATACAG AT6AAACATA ATGAAAAACA 1560

TGGTATTACA CCTAAAACAA TTAATAAAAA AATACATGAT TTAATTAGTG CTACTGTTGA 1620

AAATGACGAA AATAATGACA AAGCACAAAC TGTGATACCT AAGAAGATGA CGAAAAAAGA 1680
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TTTCGAGAAA GCTACAGAAT TAAGAGATAT 
GTTATTTGAA TTAAAAGCAG AAGGGTGACA 1800

AGTAAATGAA AGAACCATCC ATAGTAGTAA AAGGTGCTCG TGCGCATAAC TTGAAAGATA 1860

TTGATATCGA ACTACCTAAA AaTAAATTAA TTGTTATGAC AGGTTTATCT GGGTCAGGTA 1920

AATCGTCATT AGCATTCGAT ACTATATATG CTGAAGGACA ACGACGTTAT GTTGAATCAT 1980

TAAGTGCCTA TGCGCGTCAA TTTTTAGGCC AAATGGACAA ACCAGATGTT GATACAATTG 2040

AAGGATTATC GCCAGCAATT TCAATAGATC AAAAAACAAC AAGTAAAAAT CCAAGATCAA 2100

CTGTAGCAAC AGTAACAGAA ATATATGATT ATATACGTTT GTTATATGCA CGTGTTGGTA 2160

AACCTTACTG TCCAAATCAC AATATAGAAA TTGAATCGCA AACAGTACAA CAAATGGTT6 2220

ACCGCATTAT GGAATTAGAG GCACGTACAA AGATTCAATT ATTAGCACCT GTCATCGCTC 2280

ATCGTAAAGG TAGTCATGAA AAGC7AATCG AAGATATTGG TAAAAAAGGT TATGTACGTT 2340

TAAGAATCGA TGGCGAAATT GTTGATGTAA ATGATGTACC TACTTTAGAT AAGAACAAGA 2400

ATCATACAAT AGAAGTTGTT GTAGACCGAT TAGTTGTTAA AGATGGAATT GAAACACGAC 2460

TAGCTGACTC TATAGAAACT GCCTTAGAGC TTTCACAAGG ACAATTAACA GTCGATGTCA 2520

TTGACGGGGA AGACCTTAAG TTTTCAGAAA GCCATGCTTG TCCTATATGT GGATTTTCAA 2580

TCGGAGAGTT AGAACCAAGA ATGTTTAGCT TTAACAGTCG TTTTGGTGCT TGTCCGACAT 2640

GTQATGGCTT AGGCCAAAAG TTAACAGTCG ATGTAGACTT GGTTGTTCCC GACAAAGATA 2700

AGACGCTAAA CGAAGGTGCA ATAGAACCTT GGATACCGAC GAGTTCTGAT TTTTATCCAA 2760

CATTGTTAAA ACGTGTTTGT GAAGTTTATA AAATCAATAT GGATAAACCT TTTAAAAAGT 2820

TAACAGAACG TCAACGTGAT ATTTTATTGT ATGGTTCTGG TGACAAAGAA ATTGAATTTA 2880

CATTTACACA ACGTCAAGGT GGTACTAGAA AACGAACAAT GGTTTTCGAG GGTGTAGTTC 2940

CTAMATAAG TAGACGATTC CATGAATCTC CTTCAGAATA TACACGTGAA ATGATGAGTA 3000

AATATATGAC TGAACTACCT TGCGAAACTT GTCATGGAAA GCGATTGAGT CGTGAAGCkT 3060

TATCTGTTTA TGTAGGTGGT TTAAATATTG GTGAAGTAGT CGAATATTCA ATCAGTCAAG 3120

CGCTGAACTA TTATAAAAAC ATTGATTTGT CAGAACAAGA TCAAGCGATT GCAAATCAAA 3180

TATTGAAAGA AATTATTTCC CGACTCACTT TTTTAAATAA TGTGGGACTT GAATATTTAA 3240

CGTTAAACAG AGCTTCAGGT ACACTTTCAG GTGGTGAAGC ACAACGTATT CGATTAGCAA 3300

CGCAAATTGG GTCGCGTTTG ACTGGTGTCT TATATGTATT AGATGAGCCA TCAATTGGAC 3360

TGCATCAAAG AGATAATGAT CGATTAATTA ATACACTTAA AGAAATGAGA GATTTAGGAA 3420

ATACTTTAAT TGTAGTTGAA CACGATGATG ATACAATGCG TGCGGCTGAT TACTTAGTGG 3480
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AGGTAATGAA AGATAAAAAA TCATTAACAG GACAATACTT GAGTGGTAAG AAACGTATTG 3600 

AAGTACCTGA ATATCGCAGA CCGGCTTCAG ATCGTAAAAT TTCTATACGT GGAGCTAGAA 3660 

GCAACAATCT TAAAGGGGTT GATGTGGACA TACCACTATC AATCATGACG GTTGTTACAG 3720 

GTGTATCAGG TTCTGGTAAA AGCTCATTAG TAAATGAAGT ATTATACAAA TCATTAGCTC 3780

AAAAAATTAA TAAATCTAAA GTAAAGCCAG GATTGTACGA TAAGATTGAA GGTATTGATC 3840 

AACTTGATAA AATTATTGAT ATTGATCAAT CACCAATAGG TAGAACGCCA CGCTCTAATC 3900 

CAGCAACATA TACTGGTGTG TTTGATGATA TACGTGATGT GTTTGCGCAA ACAAATGAAG 3960 

CTAAAATTCG AGGATATCAA AAAGGGCGTT TTAGTTTTAA TGTAAAAGGT GGACGCTGTG 4020 

AAgcTTGTAA AGGTGACGGT ATTATTAAAA TTQAAATGCA TTTTTTACCT GATGTTTATG 4080 

TTCCTTGTGA AGTGTGTGAT GGTAAACGAT ATAATCGTGA GACACTAGAG GTTACTTACA 4140 

AAGGTAAAAA TATTGCTGAC ATTTTAGAAA TGACTGTTGA AGAAGCAACA CAATTTTTTG 4200 

AAAATATTCC TAAGATTAAG CGCAAGTTAC AAACACTAGT TGATGTTGGT CTTGGATACG 4260 

TCACATTAGG TCAACAAGCT ACAACGTTAT CAGGTGGTGA GGCTCAACGT GTGAaACTTG 4320 

CATCTGAACT TCATAAACGT TCAACTGGTA AATCTATTTA TATCCTAGAT GAACCGACAA 4380

CAGGGTTACA TGTTGACGAT ATTAGTAGAT TATTAAAA6T ATTAAACCGA TTAGTTGAAA 4440 

AtGGTGATAC TGTTGTAATT ATTGAACATA ACCTAGATGT TATCAAAACA GCAGACTATA 4500 

TTATAGACTT AGGTCCTGAA GGTGGTAGTG GCGGTGGTAC TATTGTTGCG ACTGGCACAC 4560

CCGAAGATAT TGCTCAGACA AAGTCATCAT ATACAGGAAA GTATTTAAAA GAAGTACTTG 4620 

AACGAGATAA ACAAAATACT GAAGATAAAT AAGATTAAAA GAAGTGAAGG ATGTTATAAA 4680 

TTTATCCTTC GCTTCTTTTT ATTAATTTAG TAATGAATAG TAGAAAGAAA AGATGC6TAA 4740 

AAAGAATTAT GTTAAGATAG GGTCAATCTA GAGTAGTTAA ACATAAATCG AACTGGGA6T 4800 

GGGACAGAAA TGATAAAGAA TCACTAATGA TTTATTATGT AGTGGTTCTT TGTCATTAGC 4860

CACAGCTATT GTGTACTTAA AAATAGGaat GCaTgAGTGC AACTCATGCA TAAGaAATAC 4920

TAATTTCTAA AGAAAAAGTA TTTCTTTATG TTGGGGCCCC GCCAACTTGC ATTGTTTGTA 4980

GAATTTCTTT TCGAAATTCT TTATGTTGGG GCCCCGCCAA CTTGCATTGT TTGTAGAATT 5040

TCTTTTCGAA ATTCTTTATG TTGGGGCCCC GCCAACTAAT TCCAATATAT CATTGTAGAG 5100 

CTTAGGTCAT TGATTTTTGG CTCGGACITT TATGGCGATA TX3AACCATGT AAATTAAGCA 5160 

AGCAATAAAT TAATGATTGA TATTGACTTG TAAAATAATA ACAATAATGA ACAATTAATA 5220

TTTATTTTAG CTTTTCAATG TAGATTGGTG TTATATTTTT GATATGATAA GAAGAGATGT 5280 
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ACATTAAAGT TAGATTTAAT CGCTGGTGAA GAAGGACTAT CGAAGCCAAT TAAAAATGCT 5400 

GATATATCAA GACCGGGCTT AGAGATGGCA GGTTATTTTT CACATTATGC GTCAGATAGA 5460 

ATACAACTAT TAGGAACAAC GGAACTATCG TTTTACAATT TATTACCAGA TAAGGATCGC 5520 

GCAGGTCGTA TGCGTAAACT ATGCAGACCA GAAACGCCTG CAATTATTGT GACACGTGGA 5580 

TTGCAGCCAC CAGAAGAATT AGTTGAAGCT GCAAAAGAAT TAAATACCCC ACTTATAGTT 5640

GCTAAAGATG CGACTACAAG TTTAATGAGT CGCTTAACAA CGTTnTAGA GCATGCACTT 5700 

GCAAAGACGA CATCTTTACA TGGTGTTTTA GTAGATGTTT ACGGTGTTGG TGTACTAATT 5760 

ACCGGTGATT CAGGAATAGG TAAAAGTGAG ACTGCGTTGG AATTAGTTAA ACGTGGGCAT 5820 

AGATTAGTAG CAGATGATAA TGTAGAAATA CGTCAAATTA ATAAAGATGA ACTAATAGGG 5880 

AAACCACCAA AGTTAATAGA ACATCTATTA GAAATACGTG GACTAGGTAT TATCAATGTT 5940 

ATGACTTTAT TTGGCGCGGG TTCAATATTA ACTGAAAAAC GAATTAGATT AAATATTAAT 6000 

TTGGAAAACT GGAACAAGCA AAAGTTATAT GACCGCX^TAG GTCTTAATGA AGAGACGCTA 6060 

AGTATTTTAG ATACTGAAAT CACTAAAAAA ACAATACCTG TAAGACCTGG TAGAAATGTT 6120 

GCGGTAATTA TTGAGGTCXSC TGCAATGAAC TATCGATTAA ATATCATGGG CATTAACACX3 6180

GCCGAAGAAT TTAGTGAAAG ATTAAATGAA GAAATTATCA AGAACAGTCA TAAGAGTGAG 6240 

GAGTAGGTTG AATGGGTATT GTATTTAACT ATATAGATCC TGTGGCATTT AACTTAGGAC 6300 

CACTGAGTGT ACGATGGTAT GGAATTATCA TTGCTGTCGG AATATTACTT GGTTACTTTG 6360

TTgCACAACG TGCACTAGTT AAAGCAGGAT TACATAAAGA TACTTTAGTA GATATTATTT 6420 

TTTATAGTGC ACTATTTGGA TTTATCGCGG CACGAATCTA TTTTGTGATT TTCCAATGGC 6480 

CATATTACGC GGAAAATCCA AGTGAAATTA TTAAAATATG GCATGGTGGA ATAGCAATAC 6540 

ATGGTGGTTT AATAGGTGGC TTTATTGCTG GTGTTATTGT ATGTAAAGTG AAAAATTTAA 6600 

ACCCATTTCA AATTGGTGAT ATCGTTGCGC CAAGTATAAT TTTAGCGCAA GGAATTGGAC 6660 

GCTGGGGTAA CTTTATGAAT CACGAGGCAC ATGGTGGATC GGTGTCACGC GCTTTTTTAG 6720 

AACAATTACA TTTGCCTAAT TTTATAATAG AAAATATGTA TATTAACGGC CAATATTATC 6780 

ATCCAACATT CTTATATGAA TCCATTTGGG ATGTCGCTGG ATTTATTATC TTAGTTAATA 6840 

TTCGTAAACA TTTAAAATTA GGAGAAACAT TCTTTTTATA TTTAACTTGG TATTCAATTG 6900 

GTCGATTCTT TATAGAAGGA TTACGTACAG ATAGCTTAAT GCTCACAAGT AATATTAGAG 6960 

TTGCACAAIT AGTATCAATT CTTTTAATTT TAATAAGTAT AAGTTTAATT GTATATAGAA 7020

GGATTAAGTA TAATCCACCG TTGTATAGCA AAGTTGGGGC GCTTCCATGG CCAACAAAAA 7080 
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TTATGGCGTG TATACCGTCT TGTTAAATTT TCGAAAGTTT TTAAGAATGT AATTATCATT 7200 

GAATTTTCGA AATTTATTCC AAGTATGGTA CTGAAAAGAC ATATATATAA ACAACTTTTA 7260 

AATATTAATA TCGGTAATCA ATCGTCGATA GCTTATAAAG TAATGTTAGA TATTTTTTAC 7320 

CCAGAACTGA TTACGATTGG TAGTAACAGT GTTATTGGTT ACAATGTAAC AATTTTGACG 7380 

CATGAAGCAT TAGTTGATGA ATTTCGTTAT GGACCAGTGA CGATAGGATC TAACACTTTG 7440

ATTGGTGCAA ATGCTACCAT TTTACCCGGT ATAACGATTG GTGACAATGT AAAAGTTGCA 7500 

GCTGGTACGG TTGTTTCAAA AGATATACCG GATAATGGAT TTGCATATGG CAACCCTATG 7560 

TATATAAAAA TGATTAGGAG GTGACAATTT TATGGCGCAA AAGAATAATA ATGTAATTCC 7620 

AATGACTTTT GATGATGCAT TTTATCGTAA AATGGCTAAA CAGAAGTTTA AACAAAGAGA 7680 

ATATAAACGA GCTGCTGAAT ACTTT6AAAA AGTOTTAGAA TTGTCACCTG ATGATCTGGA 7740 

AATTCAAATT GATTATGCAC AATGTCTAGT GCAACTTGGT ATTGCTAAAA AAGCAGAACA 7800 

TTTATTTTAT GACAATATTA TTTATAATAG GCATCTAGAA GATAGCTTTT ATGAATTGAG 7860 

TCAGCTCAAC ATTGAAGTTA ACGAACCAAA CAAGGCATTC TTGTTTGGTA TTAATTATGT 7920 

TATTGTTAGC GACGACCAAG ATTATAGAGA TGAATTAGAT CAAATGTTTG ATGTGAAATA 7980

TCAAAGTGAA GAACAAATTG AACTTGAAGC TCAATTGTTT GTAGTTCAAA TACTATTCCA 8040 

ATATCTTTTT TCTCAAGGTC GATTAAAAGA TGCAAAGAAT TATGTCTTAC ATCAACCACA 8100 

AGAAGTTCAA GATCATCGTG TAGTACGTAA TTTATTGGCA ATGTGTTATT TATATCTCGG 8160 

TGAATATGAT ACgGCTAAAG CATTGTACGA aGCACtATTA CAAGAGGATA GTACaGATAT 8220 

ATATGCATTA TGCCATTATA CTTTGCTACT TTATAACACT AAGGAAAATG AACAATATCA 8280 

AAAATATTTA AAAATATTAA ACAAAGTTGT ACCTATGAAT GACGATGAAA GTTTTAAATT 8340 

AGC^ATTGTA TTAAGTTATT TAAAGCAGTA TCGTGCATCA CAACAATTGT TGTACCCTTT 8400 

ATATAAAAAA GGGAAATTTT TATCAATTCA AATGTACAAT GCTTTAGCAT ATAATTATTA 8460 

TTATTTAGGT GAAGAAGACG AAAGTCATTA CTACTGGGAT AAATTGAAGC AAATTTCTAA 8520 

AGTGGAAATT GGACATGCGC CTTGGGTAAT TGAAAATAGC AAAGAAGTTT TTGACCAACA 8580 

TATTTTGCCA TTACTTCAAA GTGATGACAG TCATTATCGT TTATATGGTA TTTTTTTATT 8640 

GGATCAATTA AATGGTAAAG AAATTGTGAT GACGGAAAGT ATTTGGCAGG TTTTGGAAAA 8700 

TCTAAATAAT TATGAGAAAT TGTATTTAAC GTATTTAGTT CAAGGTTTAA CGCTCAATAA 8760 

ATTAGACTTC ATTCATCGCG GCTTATTAAC GCTTTAGCAT AATGAATTAT TTGTAAGTGA 8820

AAATGATGTA ATGGTTGCAT GGATTAATCA AGGTGAACTC ATAATTGCTG AAAAAGTAGA 8880 
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ACTCGCATTG 
CCTGTAGAAT TTCTTTTCGA AATTCTCTGT GTTGGGGCCC CTGACTAGAG 10800

TTGAAAAAAG CTTGTTGCAA GCGCATTTTC ATTCAGTCAA CTACTAGCAA TATAATATTA 10860

TAGACCCTAG GACATTGATT TATGTCCCAA GCTCCTTTTA AATGATGTAT ATTTTTAGAA 10920

ATTTAATCTA GACATAGTTG GAAATAAATA TAAAACATCG TTGCTTAATT TTGTCATAGA 10980

ACATTTAAAT TAACATCATG AAATTCGTTT TGGCGGTGAA AAAATAATGG ATAATAATGA 11040

AAAAGAAAAA AGTAAAAGTG AACTATTAGT TGTAACAGGT TTATCTGGCG CAGGTAAATC 11100

TTTGGTTATT CAATGTTTAG AAGACATGGG ATATTTTTGT GTAGATAATC TACCACCAGT 11160

GTTATTGCCT AAATTTGTAG AGTTGATGGA ACAAGGAAAT CCATCCTTAA GAAAAGTGGC 11220

AATTGCAATT GATTTAAGAG GTAAGGAACT ATTTAATTCA TTAGTTGCA6 TAGTGGATAA 11280

AGTCAAAAGT GAAAGTGACG TCATCATTGA TGTTATGTTT TTAGAAGCAA GTACTGAAAA 11340

ATTAATTTCA AGATATAAGG AAACGCGTCG TGCACATCCT TTGATGGAAC AAGGTAAAA6 11400

ATCGTTAATC AATGCAATTA ATGATGAGCG AGAGCATTTG TCTCAAATTA GAAGTATAGC 11460

taattttgtt ATAGATACTA CAAAGTTATC ACCTAAAGAA TTAAAAGAAC GCATTCOTCG 11520

atactatgaa GATGAAGAGT TTGAAACTTT TACAATTAAT GTCACAAGTT TCGGTTTTAA 11580

acatgggatt CAGATGGATG CAGATTTAGT ATTTGATGTA OGATTTTTAC CAAATCCATA 11640

ttatgtagta GATTTAAGAG CTTTAACAGG ATTAGATAAA GACGTTTATA ATTATGTTAT 11700

gaaatggaaa GAGACGGAGA TTTTCTTTGA AAAATTAACT GATTTGTTAG ATTTTATGAT 11760

acccgggtat AAAAAAGAAG GGAAATCTCA ATTAGTAATT GCCATCGGTT GTACGGGTGG 11820

acaacatcga TCTGTAGCAT TAGCAQAACX3 ACTAGGTAAT TATCTAAATG AAGTATTTGA 11880

atataatgtt TATGTGCATC ATAGGGACGC ACATATTGAA AGTGGCX3AGA AAAAATGAGA 11940

caaataaaag TTGTACTTAT CGGTGGTGGC ACTGGCTTAT CAGTTATGGC TAGGGGATTA 12000

agagaattcc CAATTGATAT TACGGCGATT GTAACAGTTG CTGATAATGG TGGGAGTACA 12060

GGGAAAATCa GAGATGAAAT GGATATACCA GCACCAGGAG ACATCAGAAA TGTGATTGCA 12120

GCTTTAAGTG ATTCTGAGTC AGTTTTAAGC CAACTTTTTC AGTATCGCTT TGAAGAAAAT 12180

CAAATTAGCG GTCACTCATT AGGTAATTTA TTAATCGCAG GTATGACTAA TATTACGAAT 12240

GATTTCGGAC ATGCCATTAA AGCATTAAGT AAAATTTTAA ATATTAAAGG TAGAGTCATT 12300

CCATCTACAA ATACAAGTGT GCAAITAAAT GCTGTTATGG AAGATGGAGA AATTGTTTTT 12360

GGAGAAACAA ATATTCCTAA AAAACATAAA AAAATTGATC GTGTGTTTTT AGAACCTAAC 12420

GATGTGCAAC CAATGGAAGA AGCAATCGAT GCTTTAAGGG AAGCAGATTT AATCGTTCTT 12480
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GCGTTAATTC ATTCTGATGC GCCTAAGCTA TATGTTTCTA ATGTGATGAC GCAACCTGGG 12600 

GAAACAGATG GTTATAGCGT GAAAGATyAT ATCX3ATGCGA TTCATAGACA AGCTGGACAA 12660 

CCGTTTATTG ATTATGTCAT TTGTAGTACA CAAACTTTCA ATGCTCAAGT TTTGAAAAAA 12720

TATGAAGAAA AACATTCTAA ACCAGTTGAA GTTAATAAGG CTGAACTTGA AAAAGAAAGC 12780 

ATAAATGTAA AAACATCTTC AAATTTAGTT GAAATTTCTG AAAATCATTT AGTAAGACAT 12840

AATACTAAAG TGTTATCGAC AATGATTTAT GACATAGCTT TAGAATTAAT TAGTACTATT 12900 

CCTTTCGTAC CAAGTGATAA ACGTnAATAA TATAGAACGT AATCATATTA TGATATGATA 12960 

ATAGAGCTGT GAAAAAAATG AAnATAGACA GTGGTTCTAA GGTGAATCAT GTTTTAAATA 13020 

AGAAAGGAAT GACTGTACGA TGAGCTTTGC ATCAGAAATG AAAAATGAAT TAACTAGAAT 13080 

AGACGT 13086 
(2) INFORMATION FOR SEQ ID NO: 17:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1350 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:
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CATTAGTCAT 


GAAAATAGCC 


GACAACTTCA 


TCT6TGAAAT 


CACCGGCCTT 


TTATTTTAGC 


60 


TAACTTTATT TCTGATTTTA CGATTTTAAT TGATCATACA GAGAAAGTGA TCTTTTTACA 120

ATTTCTAAAA ACTCATGATC TATATTGGAC ATTTGATGAA AATAAGACAA AATGTTTTCT 180

GTTAGCTTCT CTTGTTTTGG GAATGAATCA TCTTCTTTAA TCCAAATCGC TAATTCGCCT 240

AATGGTGTTT TATCATCTTT AAATGTTTGT ATATATTCGT AAAAGCTCAT AGTATTCCTT 300

CTCTCAATTT ACTTATATAA ATCCTACCAC GAAAGCTTTC AAGAAAACAC AATTAAATGT 360

CTATTTAGTG AACTTTTTAA GGTTGTGCAC TCTTTTAATG TCTGCCAATT AGGTCAATTA 420

ATCATCACAA TGTACAATTA ACTCTATTTT CAGTTCATAT ACTCACACAC CGTTTTTGAA 480

CAACACATTA ACTTCTCATT TAGATAAAAC GCAAAAAAGC CTGGCACCAA TACAATAGAT 540

GCCAGACTAA GAGTCTACTA TATAAATTTA TTTAGCGTAT GGTTTTACTT CGATTGCACC 600

TTCATTTTCA TCATGAACAC CATGCTTATA ATAATCAATA TATTGTGGCT CTAAAGGCTT 660

TCTGCCACGT ATAATGTCTG CTGCTTTTTC AGCTAACATT AAAACAGGTG CGTGTATATT 720

GCCATTTGTC GTACGTGGCA TAGCTGATGC ATCAACTACA G6TAAATTTT CCATACCGTG 780
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ACTACAAGAT GGGTGTAATG CTGTTTCACC ATCTCTACGA ACCCAATCAA GAATTTCTTC 900 

GTCTGTTTGC ACTTCTGGTC CTGGTGAAAT TTCTCCACCA TTGAATGGAT CCATTGCTTT 960 

TTGAGATAAG ATATTTCTTG CTACACGAAT TGCTTCTACC CATTCTTTTT TATCTTCTTC 1020 

TGTTGATAAA TAATTAAAGC GGATACTTGG TTTTTCGAAT GGATCTTTAG ATTTGATTTT 1080 

CAAGCTACCA CGAGAGTTTG AATACATTGG TCCTACGTGA ACTTGATAAC CATGTGCGAC 1140 

CGCTGCCTTT TGACCATCAT ATCTTACAGC TATTGGTAAG AAATGGAACA TTAAGTTAGG 1200 

ATAAtCAACT TCGTTATTTG AACGTACAAA TCCGCCACCT TCAAAATGGT TAGATGCTGC 1260 

TGCACCTGTA CGTGTGAAAA TCCATTGTAA ACCAATAAAT GGcATGCGCT TGAtATCTAA 1320

GCTTGGCtGt AATGATACAG GTTCCTTACA 1350 
(2) INFORMATION FOR SEQ ID NO: 18:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1376 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:
TAATGCTATT GGCAACACCA TATATGAAAn CTCCAAACGA TCCTAAACCG ACTATAGATT 60 

CACCAAATTT nACAATCCAT GAATAAAGTA GTGGCCATAA GAATAACAAT ATGACAACTA 120 
AAAATGTACA GTAAAATGCA GTCATAATTG GAACTAGACG TTTACCACTA AAAAATGATA 180 
ATGCTAATGG TAATTCTGTT TCACTAAACT TATTGTATGC ATAAGCTGCT ATTAAACCTA 240 

TTACAATACC AACAAAGACA TTGCCATTAT TCATCTTTTC AAAAGCTGAA TTTATTTCCG 300 
ArGCTTTCAT TCCTAATAAA GGCGCTAATT TCATTGGTGA TAATACAACT GTAACTAAAA 360 
AATATCCTAA CGTrGCTGCA rGCGsGACTG CACCATCATT TTTCTTTGCC ATTCCTATAG 420 

CTACACCAAT TGCAAATAAA ATACCTAATT GCTCTAAAAT CGTAGTACCT ACCGTAGTAA 480

AGAACATTGC GATTTTCGGC GTCGCATGAA GTGCATTTAA CGTATTACCA ATTCCGGCAA 540 
TAATTGCTGC AGCOGGTAAA ATGGCAACTG GTAACATTAA CGAACGCCCT AAATTTTGGA 600
AAAATTTATA CATTGAATGT CATCCTTCTT AAAATAATGT AGAAATATAA AGATTACTAA 660 
TGTAACTAGA ATAACTACTT CGATACTCCG TTATAGTCAC CTAGGCTTAC TAACCAGCTA 720 
TATTTCTACC TCAAGTTATT TTATAAACTT TTTACAATTT CATGCAATTC TTGTTGTAAC 780 
TTTGCTGTTC GTGTTTCAAT CTCTTTTGTA ATATAATCGA TACGCTCGTT TCGTTTTAAA 840 
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AAAGACCGTG AATCTTAGTA GGACCAACAT AAGCAACAGG TAATATTGGT GACTTACTTA 960 

ACATTGCAAT TGTTGAAGCA CCaCXSTTTCA AAGGTGCACC TTCTTGCGAT GTGCGAGAAC 1020 

CTGTTGGGAA C5ATACCAACT GTCTTATTAT CTTTCAACAA ATTGATTGGG CGTTTTAAAG 1080 

TACTAGGTCC TGGATTTTCA CGATCTACAG GAAATGCATT TAAAGACGTT AAAAATTTAC 1140 

CAATCCATTT ATTTTTGAAT AATTCTTTTT TAGCCATATA ATGAATTTGA TTAGGATATA 1200 

ATGCCATACC TAGCATAATG ACTTCGTTAT AACTTTCATG CGTACAAGTT ACGACATATT 1260 

TACTATCCTT AGGAATATTA TCTTTACCGA TTACGTATAA TGATTTTGAC ATTTTAACTA 1320 

AAATGAAATT CAAAATCTTA CTAATCACTG AATACATTGT GCCACCTACT TAACTT 1376 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7363 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:

TTGTCATACC AATATTTTGT AAAATATGGA ACACAAGTAA AGTGACGAAA CCAACGATAA 60 

AGATTTTGTT AAATTGATCT TCAATTTTCG CAGCTAATCT TATTAGATGG AAGATTAAAA 120 

ATAAAAATAT TAAGATCAAT ATGACAGAAC CGATAAAGCC AAGTTCCTCT CCAATCACTG 180

AAAAGATAAA GTCAGTATGA TTTTCAGGTA TATAAACTTC ACCGTGATTG TATCCTTTAC 240 

CTAGTAACTG TCCAGAACCG ATAGCTTTAA GTGATTCAGT TAAATGaTAG CCATCACCAC 300 

TACTATATGT ATAGGGGTCA AGCCATGAAT TGATTGGTCC CATTTGATAC AGTTGGaCAC 360 

CTAATAAATT TTCAATTAAT GCGGGTGCAT ATAGaATACC TAAAATGACT GTCATTGCAC 420 

CAACaATACC TGTAATAAAG ATAGGTGCTA AGATACGCCA TGTTATACCA CTTACTAACA 480 

TCACACCTGC AATAATAGCA GCTAATACTA ATGTAGTTCC TAGGTCATTT TGCAGTAATA 540 

TTAAAATACT TGGTACTAAC GAGACACCAA TAATTTTGAA AAATAATAAC AAATCACTTT 600 

GGAATGATTT ATTGAATGTG AATTGATTAT GTCTAGAAAC GACACGCGCT AATGCTAAAA 660 

TTAAAATAAT TTTCATGAAT TCAGATGGCT GAATACTGAT AGGGCCAAAC GTGTACCAAC 720 

TTTTGGCACC ATTGATAATA GGTGTAATAG GTGACTCAGG AATAACGAGC AAGCCTATTA 780 

ATAATAGACA GATTAAGAAA TACAATAAAT ATGTATAATG TTTAATCTTT TTAGGTGAAA 840

TAAACATGAT GATACCTGCA AAAATTGCAC CTAAAATGTA ATAAAAAATT TGTCTGATAC 900 
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TTGCTAAAAC AGCTATAGTG GCTACTAATA CCCAGTCTAC TTTGCGAAnC aATGCTTATC 1020 

CGGCTGTTGA CGAGATGAAT AATTCATTGC AAACTCCTTT TATACTCACT AATGTTTATA 1080 

TCAATTTTAC ATGACTTTTT AAAAATTAGC TAGAATATCA CAGTGATATC AGCTATAGAT 1140 

TTCAATTTGA ATTAGGAATA AAATAGAAGG GAATATTGTT CTGATTATAA ATGAATCAAC 1200 

ATAGATACAG ACACATAAGT CCTCGTTTTT AAAATGCAAA ATAGCATTAA AATGTGATAC 1260 

TATTAAGATT CAAAGATGCG AATAAATCAA TTAACAATAG GACyAAATCA ATATTAATTT 1320 

ATATTAAGGT AGCAAACCCT GATATATCAT TGGAGGAAAA CGAAATGACA AAAGAAAATA 1380 

TTTGTATCGT mTGGAGGG AAAAGTGCAG AACACGAAGT ATCGATTCTG ACAGCACAAA 1440 

A7GTATTAAA TGCAATAGAT AAAGACAAAT ATCATGTTGA TATCATTTAT ATTACCAATG 1500 

ATGGTGATT6 GAGAAAGCAA AATAATATTA CAQCTGAAAT TAAATCTACT GATGA6CTTC 1560 

ATTTAGAAAA TGGAGAGGCG CTTGAGATTT CACAGCTATT GAAAGAAAGT AGTTCAGGAC 1620

AACCATACGA TGCAGTATTC CCATTATTAC ATQGTCCTAA TGGTGAAGAT GGCACTGATTC 1680 

AAGGGCTTTT TGAA6TTTTG GATGTACCAT ATGTAGGAAA TGGTGTATTG TCAGCTGCAA 1740 

GTTCTATGGA CAAACTTGTA ATGAAACAAT TATTTGAACA TCGAGGGTTA CCACAGTTAC 1800

CTTATATTAG TTTCTTACGT TCTGAATATG AAAAATATGA ACATAACATT TTAAAATTAG 1860

TAAATGATAA ATTAAATTAC CCAGTCrTTG TTAAACCTGC TAACTTAGGG TCAAGTGTAG 1920 

GTATCAGTAA ATGTAATAAT GAAGCGGAAC TTAAAGAAGG TATTAAAGAA GCATTCCAAT 1980 

TTGACOGTAA GCTTGTTATA GAACAAGGCG TTAACGCACG TGAAATTGAA GTAGCAGTTT 2040 

TAGGAAATGA CTATCCTGAA GCGACATGGC CAGGTGAAGT CGTAAAAGAT GTCGCGTTTT 2100 

ACGATTACAA ATCAAAATAT AAAGATGGTA AGGTTCAATT ACAAATTCCA GCTGACTTAG 2160 

ACGAAGATGT TCAATTAACG CTTAGAAATA TGGCATTAGA GGCATTCAAA GCX3ACAGATT 2220 

GTTCTGGTTT AGTCCGTGCT GATTTCTTTG T7UVCAGAAGA CAACCAAATA TATATTAATG 2280 

AAACAAATGC AATGCCTGGA TTTACGGCTT TCAGTATGTA TCCAAAGTTA TGGGAAAATA 2340 

TGGGCTTATC TTATCCAGAA TTGATTACAA AACTTATCGA GCTTGCTAAA GAACGTCACC 2400 

AGGATAAACA GAAAAATAAA TACAAAATTG ACTAACTGAG GTTGTTATTA TGATTAATGT 2460 

TACATTAAAG CAAATTCAAT CATGGATTCC TTGTGAAATT GAAGATCAAT TTTTAAATCA 2520 

AGAGATAAAT GGAGTCACAA TTGATTCACG AGCAATTTCT AAAAATATGT TATTTATACC 2580 

ATTTAAAGGT GAAAATGTTG ACGGTCATCG CTTTGTCTCT AAAGCATTAC AAGATGGTGC 2640

TGGQGCTGCT TTTTATCAAA GAGGGACACC TATAGATGAA AATGTAAGCG GGCCTATTAT 2700 
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AAACCCTAAA GTAATTGCCG TCACAGGGTC TAATGGTAAA ACAACGACTA AAGATATGAT 2820 

TGAAAGTGTA TTGCATACCG AATTTAAAGT TAAGAAAACX3 CAAGGTAATT ACAATAATGA 2880 

AATTGGTTTA CCTTTAACTA TTTTGGAATT AGATAATGAT ACTGAAATAT CAATATTGGA 2940

GATGGGGATG TCAGGTTTCC ATGAAATTGA ATTTCTGTCA AACCTCGCTC AACCAGATAT 3000 

TGCAGTTATA ACTAATATTG GTGAGTCACA TATGCAAGAT TTAGGTTCGC GCGAGGGGAT 3060 

TGCTAAAGCT AAATCTGAAA TTACAATAGG TCTAAAAGAT AATGGTACGT TTATATATGA 3120 

TGGCGATGAA CCATTATTGA AACCACATGT TAAAGAAGTT GAAAATGCAA AATGTATTAG 3180 

TATTGGTGTT GCTACTGATA ATGCATTAGT TrGTTCTGTT GATGATAGAG ATACTACAGG 3240 

TATTTCATTT ACGATTAATA ATAAAGAACA TTACGATCTG CCAATATTAG GAAAGCATAA 3300 

TATGAAAAAT GCX3ACGATT6 CCATTGC6GT TGGTCATGAA TTAGGTTTGA CATATAACAC 3360 

AATCTATCAA AATTTAAAAA ATGTCAGCTT AACTGGTATG CGTATGGAAC AACATACATT 3420

AGAAAATGAT ATTACTGTGA TAAATGATGC CTATAATGCA AGTCCTACAA GTATGAGAGC 3480 

AGCTATTGAT ACACTGAGTA CTTTGACAGG GCGTCGCATT CTAATTTTAG GAGATGTTTT 3540 

AGAATTAGGT GAAAATAGCA AAGAAATGCA TATCGGTGTA GGTAATTATT TAGAAGAAAA 3600

GCJ^TATAGAT GTGTTGTATA CGTTTGGTAA TGAAGCGAAG TATATTTATG ATTCGGGCCA 3660 

GCAACATGTC GAAAAAGCAC AACACTTCAA TTCTAAAGAC GATATGATAG AAGTTTTAAT 3720 

AAACGATTTA AAAGCGCATG ACCGTGTATT AGTTAAAGGA TCACGTGGTA TGAAATTAGA 3780 

AGAAGTGGTA AATGCTTTAA TTTCATAGAG ATTAGTCGAG GGACCTTTTA CTTATAAAAA 3840 

TGATTTGAAT TAATACTAAA AGATTACAAA GAAGAGGTGG TTTTGTGTGT AAATACAAAA 3900 

TTGCCnTTT CTTTTTATGT TAAATCTATA AATTTGAAAC TAAATCAAGG TTAATTCTAT 3960 

GTAGACACTT TATATAGGAA GTAGTTTGAA TGTTTATATA ATGTTTTACA AAAAGATGTA 4020 

GTATTATAAT GTCTAATTTC ACATGTGTTT CAGTAAAATT TGTTGTGGAA TGTTAACX3AT 4080 

ATACGTATTT TATAAAAaAT TTTTTATAAT GATTATTCGA ATGATGCGTA ACGCTTACAT 4140 

CTTATCTAAT GCTAGCTTTT TGACAAAAAT ATGACAATCA ATTAATGTGA TTCTAATAAA 4200 

TATTCGCAAA TTGCTTTATT GCGATTAAAT TTTTTTGGTG GTACTATATA GAAGTTGATG 4260

AAATATTAAT GAACTTATAT GCAAAAGTAT ATTGAGAAAT AAACAGGTAA AAAGGAGAAT 4320 

TATTTTGCAA AATTTTAAAG AACTAGGGAT TTCGGATAAT ACGGTTCAGT CACTTGAATC 4380 

AATGGGATTT AAAGAGCCGA CACCTATCCA AAAAQACAGT ATCCCTTATG CGTTACAAGG 4440

AATTGATATC CTTGGGCAAG CTCAAACCGG TACAGGTAAA ACAGGAGCAT TCGGTATTCC 4500 
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AGAATTGGCA ATGCAGGTAG CTGAACAATT AAGAGAATTT AGCCGTGGAC AAGGTGTCCA 4620 

AGTTGTTACT GTATTCGGTG GTATGCCTAT CGAACGCCAA ATTAAAGCCT TGAAAAAAG6 4680 

CCCACAAATC GTAGTCGGAA CACCTGGGCG TGTTATCGAC CATTTAAATC GTCGCACATT 4740

AAAAACGGAC GGAATTCATA CTTTGATTTT AGATGAAGCT GATGAAATGA TGAATATGGG 4800 

ATTCATCGAT GATATGAGAT TTATTATGGA TAAAATTCCA GCAGTACAAC GTCAAACAAT 4860

GTTGTTCTCA GCTACAATGC CTAAAGCAAT CCAAGCTTTA GTACAACAAT TTATGAAATC 4920 

ACCAAAAATC ATTAAGACAA TGAATAATGA AATGTCTGAT CCACAAATCG AAGAATTCTA 4980 

TACAATTGTT AAAGAATTAG AGAAATTTGA TACATTTACA AATTTCCTAG ATGTTCATCA 5040 

ACCTGAATTA GCAATC6TAT TCGGACGTAC AAAACGTCGT GTTGATGAAT TAACAAGTGC 5100 

TTTGATTTCT AAAGGATATA AAGCTGAAGG TTTACATGGT GATATTACAC AAGCGAAACg 5160 

TTtAGAAGTA TTanAGAAAT TTAAAAATGA CCAAATTAAT ATTTTAGTCG CTACTGATGT 5220

AGCAGCaAGA GGACTAGATA TTTCTGGTGT GAGTCATGTT TATAACTTTG ATATACCTCA 5280 

AGATACTGAA AGCTATACAC ACCGTATTGG TCGTACGGGT CGTGCTGGTA AAGAAGGTAT 5340 

CGCTGTAACG TTTGTTAATC CAATCGAAAT GGATTATATC AQACAAATTG AAGATGCAAA 5400

CGGTAGAAAA ATGAGTGCAy TcGTCCACCA CATCGTAAAG AAGTACTTCA AGCACGTGAA 5460 

GATGACATCA AAGAAAAAGT TGAAAACTGG ATGTCTAAA6 AGTCAGAATC ACGCTTGAAA 5520 

CGCATTTCTA CAGAGTTGTT AAATGAATAT AACGATGTTG ATTTAGTTGC TGCACTTTTA 5580 

CAAGAGTTAG TAGAAGCAAA CGATGAAGTT GAAGTTCAAT TAACTTTTGA AAAACCATTA 5640 

TCTCGCAAAG GCCGTAACGG TAAACCAAGT GGTTCTCGTA ACAGAAATAG TAAGCGTGGT 5700 

AATCCTAAAT TTGACAGTAA GAGTAAACGT TCAAAAGGAT ACTCAAGTAA GAAGAAAAGT 5760 

ACAAAAAAAT TCGACCGTAA AGAGAAGAGC AGCGGTGGAA GCAGACCTAT GAAAGGTCGC 5820 

ACATTTGCTG ACCATCAAAA ATAATTTATA GATTAAGAGC TTAAAGATGT AATGTCTTGA 5880 

GCTCTTTTTT GTTTTCAATA ATTGATTCTC TGTAGATATC ciAAGTaCTAA CGTTTTAAAG 5940 

GTTAAATATT TAATTGGATT GAGATCTGTA TGCGGTTATA TCaTTCTGTG TAAATATGGT 6000 

TCTCCACCAA ATGTGGTGAG TATATAATTT AAAGAACTAT TTTTAAATTA AGAATAATCG 6060 

AACATAAATA AACTTTATGA AATTTCAGTA TCATGTTCTT ATAAAAAACA ATAGGGCTTT 6120 

TTGCtGACGC TAGTGCGCGA TAAATAATAA GTTGAATATA AAAAAGATCA CTGCCAATCA 6180 

TTCGTTTAAT GGCAGCGATC TTTTTTATTT AATTATTTCT CTTTCCACTG CAACATTTGA 6240

TAACCAATGC GTGGATGTGT TTTAATAATA TCTTTTGCX;T CCTCATGACA TTGTGAAAGT 6300 
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CCATATATTC GTTTTAATAT CATCTCATAA GTGAGTACTT TTCCTTTATG ATTTGACAAT €420 

AGTTCTAACA AGCTAAATTC ATTTGGCGTC AAATGTACCT CCTGATTATT AATAACAACA 6480

GATTTGGAGC CAAAGTCGAT GCTTAGCAAA CCGTTAGTAA ATACAATGTT AGTTTCTTGA 6540 

TGTGACTTAG CGATTCTCTC GATGACTCGT ATTCGTGCCC GAAGCTCATC AACATTAAAA 6600 

GGTTTAGTCA TATAGTCATT CGCACCGTTA TCTAAAGCTT GAATAATTGT TTGTTCTTCT 6660 

TGTCTTGCAC TTATTACAAT GATAGGAATG TCAGTATGTT GCCTGATTTC TGAAATCAAA 6720 

CATAATCCAT CTTTATCTGG TAAACCTAAA TCTAATAAAA TGACATCTGG TTTATCAATT 6780 

T6AATTTTAA AGTGTGCTTG TGTGGCATTG TCGGCTGTAG TTACATTGTA ATAATCTAAA 6840 

GTTAATGCAA CATCAAGTAA ATGTGTQATT GCGTGATCAT CTTCAATTAT CAATATTTTA 6900 

GATTGCATTA TACGTCTCCT TOGTTAAAGT CTGTATATAT ATTGAAATAG AATATACTGC 6960 

CGTGTGGTTG GTTCGGTTTA TATTGTAAGT TTGATTGATQ TTTGTGTAGG ATAGTCTGTA 7020

CTAAATATAA GCCTAGTCCC ATGCTTTCTT TTTGGTTATC TTTAAAATAT TTATTTGATC 7080 

CTGTGTAAAA AGGCTCGAAT ATCTTTTGTt GTTCTTCTAA ACTAATTCCA GGTCCTTCGT 7140 

CTATAACGGC AAATTCGATT TGTTCATAGC TAGCATAACG AATAGATAAA TTGATTTTGG 7200 

TGTCAGTAGA AGTGTGTTTA ACTGCATTTT CAATCAAATT GAAtAAAgCT TGTAAAATCA 7260 

ACTTACTGTC AATGTGTATA AACtGTAAAT TTACTGAGGA TGATACAGTT ATACGCTTTT 7320 

TTAAATGGCG ACGTTCTAAA ATACATATCG ATTTCTTATA CTA 7363 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10470 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:

TTAACAATCG ATAACCACAA TACTTCTATT GTAATTGTTT AACGATTTCn CGATTAAAAT 60 

CATCTAAATC GTCTGGTACT CGACTTGTTA CAATATTGTT GTCTACAcTa CTGACTCATC 120

AACTACATGT GCGCCTGCAT TTGATAAATC TTTGCGTACA TTTAATACTG CTGTTAACGT 180 

ACGACCTTTT AAATCGTCTG TATCTATTAG TATTTGTGGC CCATGACAAA TGGCAAATGT 240 

TGGTACATCA TTTTTAGTAA AGTATTTAGC AAATGTGCCA TATCGACCTT CTGTATCTCC 300

ACGTAAATGA TCTGGTGAAA ATCCTCCAGG AATTAATAAT GCATCATAAT CTTCTGGTTT 360 
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ATTTQCAGTA TCTCCAATCA CTACAGTATT AAAGCCTGCA TTCTCTAATG CCTCTTTAGG 480 

GCTTGAATAT TCTATATCTT CAAATTCGTT TGCTAGAATA ATTGCTACTT TTTTAGTCAT 540 

TGAAAATCAC CTTTCTATAT ATCATTGATA TAATTACTAT AGACAAGTAA ATCAGTGATT 600 

AAACATACAA GATATAAAAA ATATTAAGCG ACTGTCGCGA TATCTAACCC TAACACATCT 660 

TATGTGGCAT TTACTTAGAT ACTAATTTAA CCTTTTCTTC AAGCTGATCT AACAATCCAA 720 

TCCATTCATC TATATCTTCA ACACX5TACTT CATCAGGATT TACATGATCG ATATCCTCAA 780 

TAAACTTATT TAAACGCGCT TTTATCTGTT CGATTGTTTG CTGTTCATTC ATAAAAAGTT 840 

AACTCCTTTT ATTTTGTTTr CTTTTTCATT ATTATCCTAA CAGAAATTGC GTTAAAGOGA 900 

TATAATCTTA GCTATATTTA TGACATTCAA ATTATTTTGA CTTTTAAAAA TCCCCTTTTC 960 

AATTAACTAA AATTAAGAGA TAATTTGTTA CGAGTGATAA TACGAaGkGG TaTCATACCG 1020 

ATATGAACCA AATAGAAAGA AGGAAGTTTA AGACGATGAA TAGCGTCAAA TTGAAGCAAC 1080

CTGTTAGCAT TTACAATQAT CCATGGGAAG TGAAATTTAT ATACATTTAA ATTTCATGAG 1140 

ACAATAAACG TTGATTTAAT GCGTTTTTTT GCCTTTTTTA TTTTCCTTAT TTTTTCTGTT 1200 

TTACAACAAA ATGGTATCAA AAATGGTATC ATTTGTAGTT ATTTTAGCTT CACATATTAA 1260

AACAACCACA CTCCTAAATT AATAGGTGGT GTGGTTTTGT TGGTTGTGTG GGGATAAAAA 1320 

TAACCGCATC AGTTAAGATG CGGTTATCTA GCAAGGGCCA CGTATTTATA AATACGTTTA 1380 

GAATCTCTTC GGCAACTTTG CTATAQACAG TCTATGCTGT TACTAAATTA TACCACCACA 1440

CAAACCTACT CCCATTCAGG AACACAGAGC TTTGTCGCTC GTCAGCAACG TCATATGAAT 1500 

TCTCAGTTCA TGTTGTGGTG ACACTTTAAA CGGTCTGTGC CAGTAGCGAC CGAGTCATTT 1560 

CAAGAATGAC CATTTCACAT TTATATTATA ACACTTGTCG TGCGTAACTG TATAGTTTTT 1620 

CAGftGTATT TAAAGTTAAG TTATCTACTT CGCGCTTTCC TTGCCTTAAT TGTGAAATTA 1680 

CATATTGCGC TACX3CCAGTT TGTTTGTGAA TTTGGTAACC TGTTATATCA CTTTTGATCA 1740 

ATTCAATTAT TTTTAATTTA TAATCACTCA TATTATCTAC GTCCATTCTT TTTATCTAAA 1800 

CAATAAAAAT GTGTCTTTCT CCCGATAAAT AATAACAATG GTAGGCTTAA TAAAAACAAT 1860 

ATTAAATACA TTTGTTCTGT CATAATTGAA AACCTCCAAA TAATATTATA TTATATAAGT 1920

GTAAGGAGGA GCCATCAGGC TCCAAGCATA ATGTTAATCT TTGTTGTTTG GCTTTCGGTC 1980 

TAGGTAGCCG AGATGCCaTT CTCTAAGTTG TTTTAACACT TCTGGAATTA TCAGTACTGC 2040 

CAATACTTGA TGTTCTAGAA GTGTTTTTAT TATGTCTAGC ATGAGGCTTT TCACCTCCTT 2100

ACACATAATT TGTAAGTCAT CAACTAACCT ACAAATATAA TTATACTAAA CAAATGTTTA 2160 

GTTATCTACA TTTAAATCTT GAGAGAAATG TTAAAAAGTT CTAGTAAAAT AATAGCACAT 2280 

TTTATCTTTA AATGTAAATA GAAAGCAGGT ATGTAACGCA CCTGCTTAAA TAGaCATGAC 2340 

TATGTCATTC TAACTGATTT CTCCCCATAA GTCACCTAAT ATCTGATTAG GTGGGGCAGA 2400

ACCATTCCAT GTTCTAATAG GCAAGTAATA ACGTTGCCCC TCCCATGTAT ATCCTACCCA 2460 

AACATGACCA TCTTGTAACA TCACTTCTGT ATAATCACAA TACCCACCAG GTTGGAACTG 2520 

ATAACCCACT GGACAAGATA AGAATGGCCC CACTTTTCTT ACTGTGATTG GTTGATTGCC 2580 

GTTTGTGAAT CTAGCACTTT CTTCCATGTA GTAAGTACCA TATTTATTAC GTTTCCATGC 2640 

ACTTGCAACT GGTTTAACTG TATTACTTGA AGCGCTTGAC TCATTAGAGA CAGTGGCAAC 2700 

CGGTATTTTA CCATCCATGT ACX3CCCTAAT CTGCTTGATA AAGTAGTCTT TAAGTTGCAA 2760 

CCGCTTGTCT TCTGGCAATA GACOGCGAGT TACTGGGTCA AAACCAGTGT GTAAAACCGA 2820 

ACTTCTATGA GGGCATGATG TTGAAGTAAA TTCATTGTGC AATCTGATTG TATTTCTGTT 2880

TGCTGGTAAT CCCCATTTTT TCAACAATCT AGCX5CATTCT TGGAAAGTTG CCTGTTCATT 2940 

TTTTAAGAAT GTCGCGTTAT CTGCGCCCAT TGATTGACAT ACTTCAATAC CGTAATAATA 3000 

TTTATTACCT ATTTGATTAG CGGTATGCCA ACCTACTTGT GATTCATCTA AGGCTTGCCA 3060

AACTGTGTTG CCTGATACGT AACTATGCGC AATGCCCX3CT TCTAATCTTG ATAAAGGTGC 3120 

ATTTACTAAT CCGTTACGAT ATGCTTCAGC AGTCGCCCCT TTGCTCCCTG CGTCGTTGTG 3180 

TATAACTATA CCTTTAGGGT TACTACCACG CTTAGGTAGG TCATAACCTT TAACCACATC 3240 

TTTGATGATT TTAAGTTCTA CTGCTTTAGG TTGTGGCTTA GCTGTTTCTT TTTTAGGTGC 3300 

TTGTGTAGGA GATTGAACTG ATCGTGGCGC TGTCTCACTT TTATUU^TTCG GACGGATAAA 3360 

CCACATAGGG AAATCATAAG CATGTTGTCG TCTTGTAACT TTTTCCCAAC CCCAGCCGGG 3420 

TTGTTCGATT CCGTCAGTCC AGCCACCGCC TAGCCAATTC TGCTCATATA CAATGATGTA 3480

ATCTAAAGTT GCTTCAATTA CCCATGCAAC GTGACCATAT CCAGCACCGT AGTTGCTACC 3540 

GAATACCACC ATGTCGCCAG GTTGTGCTAA GAAGTCCGGT GTATTTTGGT ATACAGTAGC 3600 

TAATCCGTCG AAGTTGTTAG CGAACGGAAT ATCTTTTGCA CCTAAACCTT TTAGAAGTAA 3660 

TCCAAACAAA ACTTTCCAAC CAGCATTGGC ATAATCAAAG CATTGAAATC CATACCATAA 3720

GTCCACATTG AATTGmTC CCTCAGAAGT TTTCAACCAC TCTATAAACT CATTTTTAGT 3780

TAATTTTGCT TGCATTGTCG CCACCTCCAT GATGATACTC ATTCACATCA AAGCCAACAT 3840 

CGTTAGAGGC GTCTGTGAAA GGTTGTGATG TATCATATTC TTTTGGTGcT TTCGCGCTTA 3900

ATTCCGGCGT TAAACTACTG TCTTGTGATG ATTTCCACGT AACTTGTTGT TCTTCTTTTT 3960 
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TTGGGTCAGT AATAACGCCA ATACCTGTAA GTAACGTGAG GATAGCGCCT ATAATTGCGC 4080 

TAGCTTGATT TAATTGAGTA GATAAATCTA ATCCGAATAA ATCCGTGACT TGCTTGATAA 4140 

ATAGCAACAA TGCTCCAACT AAACCAGTTA GTACTGCTTT GTTTTTGAAT CTCAATTTCC 4200 

AGTTAATATC CATTTGTTTG CTCCTTTTAT CCAAAATAAA AAAACGACTA AAAATTAGTC 4260 

GTTTAAAATT ATTCAATGGT CAATGTCGGA GATCCTGAAT AAACATCACT TATAGTGACG 4320 

TACAACATCC CTGAAGGATT ACTAAAGTTG ATATTTTTAC TTGCAACTCC GCTATTGACT 4380 

CCTGATATTC CTAAATCACT TGACCCTAAA TTAGTTTGCG AAATCCTCAT TATACCGCTA 4440 

CGTACATTTT CTATTGTCAC CTGATAACTT TTATTGGGTT CAACTCCATT TATTGTCCAT 4500 

TTTGCTGTTG ATTCTTCTAT GCTATCCGGA TATTTATTTT TAGGTAAGGG TTTTATTACA 4560 

AAAGATGAAG GCTTTTTCCA TACTTGGATA TTTCCAGCAT ATACTTTTGT ATATTCTTCA 4620 

CCTTCGTAAA TAAACTTCTT TACATTTTTA AAATTACCTT CCATAAAAAT CACCCTTTAA 4680 

TTAAATATAA CGTATTCGGG TCTTTTTGAT ATATATAGTT ATATTCATTT TCTGTTCCTG 4740 

TCCAAATTTT AACCGTCGGT TGAGATGCGC TTTTTAGTTG ATATAAATTA TCCGCTTGTT 4800 

GTTTAGTAAA AGCTTGAGAT GACAAAACAT ACCGCTCGTC ATGATTATGA TTTTTTGGAG 4860

CATATAAATC ATTTAGTGTT TGTTTGAATT CCTCAAAATC TTCTGTATTA ACTTTTGAGC 4920 

CAATCTGTTG CAATACACTT TCTGAAATAG AGTTGTTTTG TATTGCTTCT GCTAATTCTC 4980 

TTAATGTGTT CATAGATTCA GGCGCGCTAT CAACTAGTTC AGCAATTTTT GTATCCGTAT 5040

ACGTTTTAGA GTCGTTGAQA GTTGTATCTT TGATTTTTTC AACTTCTTGC AATTTATTTT 5100 

CTAACCCTTC AACATTTGCX3 ATATTGATTT TGTCCAATAA CTCAGGTTCT GCTTTGATAT 5160 

CTGTATCTTT ACCATCAATT TGCCACATTT TAGTGTCAGG ATTGATTGAT ACTACAGTAC 5220 

CGTTJTTACC GGGTGCGCCT TGTTCTCCTT TTTTACCTGC TTCACCTTTT GCTCCAGGTT 5280 

GTCCCGGTTC ACCTTTATCA CCTTTCGCAC CTTTAAATCT ACTTTCATTC TTTTCGATGT 5340 

AAGAAATGAC ATCTTTATCT ATTTTCTCTT TAAAGTCTTT GCTCAATAAA TCTGTCGCGT 5400 

TATCTTTTAA AATTCTCGTA ATAGCATCAT CTACCAATTT AACATCGATT TCTTTTGCTA 5460 

CAGCAGATTC AATACCACTA TCAACGATAT TGAAAGAAAA GTTTGCGACA TGTATTTTTT 5520 

CTTCTTCTTT CTCTAAAAAC AGCTTACAGC GAACATAACC AGCGTGTTTG ATAACCTTTT 5580 

TAGGTATCTT GTAGGTAAGG AAACCTTTTA CAACATCGTC GATAATAAGG GGCTCATTTT 5640 

TGAATATAGA GCCATCTTCC ATAAACAAAT GTAATCTAGG TGTTAAGCCA TGTGCTTTTA 5700

GATCGATACG ACCTTGTTTG TCATTGATAC CTATTCTTAT AGATGCTGTA TTTTCATCTT 5760 
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CAACATCTTT TATTTTGTAC ATTTACACAC CTCTTTATTT ATATTTATCC CTTGTGAAGT 5880 

AGATACCTTT TAAGCCGATT TGTTTATATA ACTTAGCGAT TGTACTTGCT TGATGTTGGC 5940 

ACCACTCTAT AGCAGTAGCG TATTGGTGGG TAGCTGGATT CTTAGGATTC CATCTAATTC 6000 

GGTACAATGT GTTTTGACCT TTATTGATGT AATCCTTTCT TACGAAGCTA GCACCGCCCA 6060 

TGATTGCTTT TGCTGGAGAT GTCCAACCTT TATTCCTTGC AAACGTCATT GCGTAGTTAG 6120 

GATTGTTGTC GTAAGCGCCA ATGCCGAAGT AGTTGTATAC TCCATCTTTT CCGTTAGCGA 6180 

AGTTACTTGT TCCATATCCA CTTTCTAAGA AAGCATGCGC GATTAAATAA ATTTCATTAA 6240 

TGTTGTGCTT TTTACAAGCT TCTGCGAACG CTTTACCTTG ATTATTCAAT GTTCCCTTAC 6300 

CTTTAAGTAT CTTATTAAGT GCGCTAACTG AAACACCTTG ATACTTGCCT AAATTAAGCA 6360 

TTTGGTAGCA TTGTGTGTTA CTTTCCCATA TACGCTTTAC ATTCATTGCT GAACTCGTTT 6420 

GTGCTCGTGT AGCGTTAscC AACCCCAAGC ATTAGATTTT TTCGGGTTAC CTCTTGCCAT 6480

TTGTTTATCC AGTGCTTGTT TGAATGTATA AGGACTCGTT TCTGTTATGA TCTGCGGTTG 6540 

TTTAGATGCC GAACCATTGT TGGCTGTTGG TGACGAGTCT CTTACATTAG CTATATCAGC 6600 

GTTTTTATTA TCTACCATAA CTTTTATTCT AGATTTTGTT ACTGTTGGCT TAGTTATAGA 6660

ATTTAATAAT TTTTCTCTGT TTTTAAATAT ATTAAGTAAT GCCTTTTCTA AT6CTTCGTA 6720 

TTTATCTTTA GGAGGAACAC CGTTGTCAAT CATATTCCAA TTAACATGTT CCAACATTGA 6780 

ACGCCAAATG CTGTCGTCTA CTTTTAAATT TTCAATACTT AGAGGTATCT CATATTTGGC 6840 

CATCATATCT ACAGCTACAA CCATTGCGTG AATCTCATTA AAAATAAATT CATTTTTACT 6900 

CGCACTATAA TCTTCACATA CGTCTATAAC TATATAATCA GGTTCATTAG GAACTTCAAA 6960 

TACAGCTCTT CTAGGTGCCC AAATATTATG TCTATCAACA TAAAAGTGGG GATATTCTAC 7020 

ATCCTGTTTG TATTTCTTCC TACTGTTATA TAAACTTTCT ACCGAGCTCA TCGTTTGTGC 7080 

GTTTCTAATC ATTATTCCTT TAGGTTTTTC GAGTCGTCGA TTACCTTCTA CTATAAAGTG 7140 

ATAAATATAT TCTGGATAAT TAACCTCTTG GCTAGAAATA GTGTACTTTA TAGTTGTTAC 7200

ATCTTTCCAA ATTGGAACTT TTTTATTATT TTTTTCGTTA TCATCACTAT CATCTTCTGG 7260 

TTTAGGTGCC GGTGTAGTTT TGTCTGGATG ATATGGTGGT CTAACAAAAT ATTTAACCCC 7320 

TCCACCTGGT CCATCATGAT AAGAGTGTTT AATTTTATAA GGTGGACTTC CTGTTGCGTT 7380 

ATTTGTATAC CAGTTTTGAT CTACGCCATA CCAATAGTCT TTTGTGCATG GTCCCACTAC 7440 

AATGTTTACA TGTCCTGCCC AACCACCAGT CCAAACACCC CAGTCGCCTG GTTGTGGTAC 7500

AAAATCTTTT GTATTTCTAA TTATCTTGAA ATCTCTACCT CTATAATTGG ATTTTTGAGC 7560 
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TAAATCCCAG CATTGTGCTC CCATTCCAGA ACCAGGTACA TCAATAGCTA TTTTGTTTTT 7680 

AGCGATATAT AACGCCCATT CAACCACTTC ACTAGCTGTG GGCTTTCTAT TTTTCGGATT 7740 

AGGTAATCCC ATGTATGCAC CTCATTTCAA TCAAAATAAA AAGCCAGTGC CGAAGCACTG 7800 

ACTCTTAACT GTTATTTACA TTTACCAAAC CAGAAGCACG CCCAGAAGCT ATATCCTAAA 7860 

ATCCCTTTAA GCATGGTAAT CACCTCCTTT AAATACCAAA AACAGTTCTT AGTAAAGCTA 7920 

TGACAATCGT ACTGAAGATA GTCCCTATCA AACCTAGAAT CCACATTTTT ATGTCTCTAA 7980 

TATTCTTGGC ATTCTTTTCT TTATTCTTTT CATCTTCTAC CTTGTCGCGC TTTAATTCTT 8040 

CAAAATTTCT ATCTAATTTG TCATAAATCT TTTCTTGCGC TCTAAGACTA TCTTCTATTC 8100 

TGTCGAATTT TTCAAACATA GTCTTATCAT TTTCTTCTAA TCGCGTTAAA CGCCAATCTT 8160 

GTTCATGTCG TTTGGTAAAT CCAAACATTA TGCCACCCAC TTTATTCAAA TTAAAAAGCC 8220 

ACAAGCATTA CACCTGTGAC TTTTCATCTT TTGTTTCTGG ATATTTTTCT CCAGTGATTA 8280 

AAGCGTATTC TTCTTTATCG ATTAAACCCT TGTCTACGTA CCACTTAATT TGCTCGTTTT 8340 

TATAGTAACC CCAAACATAA AAAGTTTTAA TGTCTTTAAA AGTTGGATAA ATCATCTTCA 8400 

TTATTTAAAC GTCCCCCTCA GTACTTGTTT TGTTAGTTTT CAGTTCAGTC AACTGTTGTG 8460

TTAACATAGC GTTTTGTTGA GCTAATTCCA TTGTTAATAC GTTTACTTGT GCCACCTGCA 8520 

TTTGCATACT CGCAACCATT CCGCGAAGTT CCTCATCACT TAAATCTGAC GCACTTTGTT 8580 

GGTTTGATGC ATTCGGTACG TCTTCTTTTT CGAAATTGCT ATTGTATTTA ATTTCGCCGT 8640

TAGTGAAAAC AAACTTTCTA GGTTCGAACT CTTCTTTAAA TTTAATAGGC ACATTGTTAT 8700 

CATCTACATC TAAACTATTG CGTAAACCGC CAGTATTAAC GAATCCGATA ACTTCGTTTT 8760 

TATCGTTTAC TGTGATTTTC ATTATTTCCA CCCCATAATT TTAGTTATAG TAACTTTGTT 8820

GGCA3TCGCT CCAGAACCTG ATGTTTTACC TAAATCAAAG TACACATCGT TATCTATTCT 8880 

TAAAGTAGTG GTACTTGTTT TGGATAGTAA GCACTCATAA ATACCGCCAC CGTTGCCGTC 8940 

TGAGTCAACT ACATTCGCTT TACTCAATTG AATCGCGTTA GGTAATGCGG TTAGTCCGAA 9000 

TCCCTCAATA ACGCCACCTG GATAAGTTCC ACTTACCAAC AAAATAGAAT AGTTTGTGTA 9060 

CGGTTCAGTT AGATTGATTG TTGTACCTAG ACCATTTGCG CCACCGTCGA ACAATACCX5T 9120 

TGATTTATGT TCATTAGGAA CTGTCCACTG TTGCTCAAGT CTGCCGTTTG TGATTGATCG 9180 

TGTGTAAATC TTTTTAGAGT TATAAGGTGT GAAGTTAAAT AGCTTGTTTG TATCATCTTT 9240 

AACGAATACC GATAAATAAC CCTCATAACT TTCAACGCTA CCTGGTAAAT CCGGCACTCT 9300 

TGTTGCATAG TAATTACCAG CAGTTAAATA TCCCAAATCG CCTTGCGCAT TAITTAAGTT 9360 
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GAATTTATCA TCTACATACT GCTTAGCTTG ATTTAAAGCG TTGTTAGACG TTTCTTCAAC 9480 

AAATTGCTTA GTTAAGTTTC CATCATTCTT TTTATAAAAC GGGTACCATG TGCCGTAGAT 9540 

TTTGTATTTT GTGTACTCAT OGTTTGAATC GTCTGGGTAC CATGTTGCAC GAGCAGTATT 9600 

ATTATCAACA ACATAAACAA CTAACACACC AGATTTGCTT GATGTATAAG TTGATTCATC 9660 

GAACGAAGAA CCGTCATCAA CACCATCTTG TCCAGGCTTC TCTAACGTGC CTATATCCGT 9720 

CTTTTCTGGC GCATCTGTTG CATTAGTAAT ATGAATAATC CTAGATGTGT TAACTGCGCT 9780 

TAAAACGCTA TCTATGGACT GCTCATACGA TTCAATTGCT TTACCGTAAT CATCTGTAAG 9840 

TTTAGACnT TGCCAATTCG TTGTTGAATT ACCTTTAACA AGGTCAGCGC CATTGATTTG 9900 

TTGTTCAACT TCGTTAACAC GTTCAAAAAT CGCTTGCTCT TTTTCAACTA TTTTATCGAA 9960 

TTCAGCTGTA ACAGCTTGTG TTGCACTAGT TTGOGTCGCA GTAATAGCTT GTATAGCTTC 10020 

GTTTTGCTTG ATTTCGATTT GTTGAATGCC TTTTGTCX3CA CTATCATTCA CTTTTGCTAT 10080 

TAAOGTTTGT GTATCAGCCA TATTTTGCTT TAATTGGTrA AAATCTTTAC CGACAGCTTC 10140 

GATAGTATCT TGAATAGATT TGATATAAAC AAGCTTTGTT ATACCATCAA ACCCACTAAC 10200 

TAAATCATTT TCAATATTGA AGCTAAATTG ACGTTCAACA ACAACATTAT TACTCCCGTT 10260 

TTG7GTAAAG AATGCCTGAG CATGCACCTT GCCTGAATGT TTTAAAAATT CATTCGGTAT 10320 

CACATACTGC AAACGCCCAT TAATT6CGTC TACTATCGTT AATTCGTCTG AAATATAAGC 10380 

GCCTCTATCT ACGTTATAAT CATCGGTTTT TAAnACGATA GATGTTTTAA CATGTTCAGA 10440 

ACTTATAGAT AAGGGTCTGT TATnCTTAGT 10470 

(2) INFORMATION FOR SEQ ID NO: 21:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3647 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

ATCAGATCTT GAGAATCGAG TTATTAAGTC TATCGAAGAC TTAACTAAAA TCCAACCATT 60 

CATGCCTACA CAAGATTTTG ATTTTAAAAC TAAAGAAATT CAATCAAACA TTTCTGAAGA 120 

AAGATTTATC GAAATGATTC AGTATTTCAA AGAGAAAATA ACAGAAGGGG ATATGTTCCA 180 

AGTTGTGCCA TCAAGAATTT ACAAATATGC GcATCATGCT AGTCAGCATT TAAATCAACT 240 

TTCGTTTCAA CTGTATCAAA ATTTAAAACG ACAAAACCCA AGTCCATATA TGTATTATCT 300 
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TCAAATTGTA ACAACTAATC CTATTGCAGG TACGATTCAA CGTGGTGAGA CGACACAAAT 420 

AGATAATGAG AATATGAAAC AACTACTTAA TGATCCAAAA GAATGCAGCG AACATCGTAT 480 

GCTAGTTGAT TTAGGACGTA ATGATATTCA TAGAGTAAGT AAAATCGGTA CCTCAAAAAT 540

TACTAAATTA ATGGTTATTG AAAAATATGA ACATGTTATG CATATCGTAA GTGAAGTCAC 600

AGGTAAAATA AATCAAAATT TATCGCCAAT GACAGTTATT GCGAATTTAT TACCAACAGG 660 

TACCGTTTCA GGTGCACCAA AATTACGTGC AATTGAAAGA ATATATGAAC AATATCCACA 720 

TAAACGGGGC GTTTATAGTG GTGGTGTTGG ATACATAAAT TGTAATCATA ACTTAGATTT 780 

TGCATTAGCA ATTCGAACGA TGATGATAGA TGAGCAGTAT ATCAACGTAG AAGCTGGTTG 840 

TGGCGTTGTA TATGATTCTA TTCCTGAAAA AGAACTGAAT GAAACGAAAT TGAAAGCTAA 900 

AAGCTTATTG GAGGTGAGCC CATGATCTTA GTTGTAGATA ATTATGATTC CTTTACATAT 960 

AACCTAGTGG ATATTGTTGC TCAACATACT 6ACGTCATTG TTCAATACCC TGATGATGAT 1020 

AATGTGCTGA ATCAATCGGT GGACGCTGTT ATTATATCTC CTGGTCCAGG GCATCCATTA 1080 

GACGATCAAC AGTTAATGAA AATCATATCA ACCTATCAAC ACAAACCCAT TTTAGGTATT 1140

TGTTTAGGGG CTCAGGCACT GACTTGTTAC TACGGTGGAG AAGTCATTAA AGGCGACAAG 1200

GTTATGCACG GCAAAGTTGA TACACTAAAG GTTATATCGC ATCATCAACA TCTGTTATAT 1260 

CAAGATATAC CAGAACAGTT TTCAATTATG AGATATCATT CATTAATAAG TAACCCTGAC 1320 

AATTTTCCAG AAGAATTGAA AATTACTGGA CGTACCAAAG ATTGTATACA GTCATTCGAG 1380

CATAAAGAAA GACCGCATTA TGGTATTCAG TACCATCCTG AATCATTTGC TACAGACTAT 1440 

GGTGTCAAAA TAATTACAAA TTTCATTAAT CTAGTGAAGG AAGGATGAAA ACCATGACAT 1500 

TACTAACAAG AATAAAAACT GAAACTATAT TACTTGAAAG CGACATTAAA GAGCTAATCG 1560 

ATATfiCTTAT TTCTCCTAGT ATTGGAACTG ATATTAAATA TGAATTACTT AGTTCCTATT 1620 

OGGAGCGAGA AATCCAACAA CAAGAATTAA CATATATTGT ACX^TAGCTTA ATTAATACAA 1680 

TGTATCCACA TCAACCATGT TATGAAGGGG CTATGTGTGT GTGCGGCACA GGTGGTGACA 1740 

AGTCAAATAG TTTCAACATT TCAACGACTG TTGCTTTTGT TGTAGCAAGT GCTGGcGTAA 1800

AAGTTATAAA ACATGGtAAT AAAAGTATTA CCTCaAATTC aGGTAGTACG GATTTGtTAA 1860 

ATCAAATGAA CATACAAaCA ACAACTGTTG ATGATACACC TAACCAATTA AATGAnAAAG 1920 

ACCTTGTATT CATTGGTGCA aCTGAATCAT ATCCAATCAT GAAGTATATG CAACCAGTTA 1980 

GAAAAATGAT TGGAAAGCCT ACAATATTAA ACCTTGTGGG TCCATTAATT AATCCATATC 2040

ACTTAACGTA TCAAATGGTA GGCGTCTTTG ATCCTACAAA GTTAAAGTTA GTTGCTAAAA 2100 
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AAGCAACACT ATCTGGTCAT AATTTGATAT ATGAATTGAC TGAAGATGGA GAAATCAAAA 2220 

ATTACACATT AAATGCGACT GATTATGGTT TGAAACATGC GCCGAATAGT GATTTTAAAG 2280 

GCGGTTCACC TGAAGAAAAT TTAGCAATCT CCCTTAATAT CTTGAATGGT AAAGATCAGT 2340

CAAGTCXIACG TGATGTTGTC TTACTAAATG CGGGTTTAAG CCTTTATGTT GCAGAGAAAr 2400 

TGGATACCAT CGCAGAAGGC ATAGAACTTG CAACTACATT GATTGATAAT GGTGAAGCAT 2460 

TGGAAAAATA CCATCAAATG AGAGGTGAAT AATATGACGA TTTTATCAGA AATTGTTAAA 2520 

TATAAACAGT CACTTTTACA AAATGGCTAT TATCAAGACA AACTTAATAC CTTGAAAAGT 2580 

GTGAAGATTC AGAATAAAAA ATCTTTTATA AACGCAATTG AGAAAGAACC AAAGCTAGCA 2640 

ATTATTGCAG AAATTAAATC GAAGAGTCCT ACAGTTAATG ACTTACCTGA ACGAGATTTA 2700 

TCGCAACAAA TCTCAGATTA TGACCAATAT GGTGCAAATG CCGTGTCCAT TTTAACTGAT 2760 

GAAAAGTACT TTGGTGGTAG TTTTGAAAGA TTACAAGCAT TGACGACAAA AACAACATTA 2820 

CCCGTATTAT GCAAAGACTT TATTATAGAC CCGCTTCAAA TTGATGITGC TAAACAAGCT 2880 

GGTGCATCTA TGATTTTATT GATCGTTAAC ATCTTATCTG ATAAACAATT GAAAGATTTA 2940 

TATAACTACG CTATATCGCA AAATCTAGAA GTGTTAGTTG AAGTACATGA TCGCCATGAA 3000

TTAGAACGTG CCTATAAGGT TAATGCTAAA TTGATTGGTG TAAATAACAG GGACTTAAAA 3060 

CGATTTGTTA CAAATGTGGA ACATACAAAT ACTATTTTAG AAAATAAAAA AACAAATCAT 3120 

TATTATATTT CTGAAAGTGG TATTCACGAT GCATCTGATG TAAGAAAAAT CTTGCATAGT 3180

GGTATCGATG GCTTACTAAT AGGTGAGGC6 CTTATGCGTT GTGACAATCT ATCTGAATTT 3240 

TTACCACAAC TGAAAATGCA AAAGGTGAAG TCATGATGAA ATTGAAATTT TGTGGCTTTA 3300 

CATCAATAAA GGATGTTACA GCGGCCAGTC AATTACCTAT TGATGCGATA GGTTTCATCC 3360

ATTATGAAAA AAGTAAAAGG CATCAAACAA TTACCCAAAT AAAAAAGTTA GCGTCTGCTG 3420 

TTCCAAATCA TATCGATAAA GTATGTGTCA TGGTAAATCC TGATTTAACA ACAATTGAAC 3480

ACGTATTAAG CAATACGTCA ATTAACACAA TACAGTTACA CgGCACAGAA TCTATTGATT 3540 

TTATACAGGA AATTAAAAAG AAATATTCAA GCATTAAAAT CACTAAAGCT TTAGCTGCaG 3600 

ATGgAAAACm TwATCCCAAA caTtAAtnAA tnTTAgGGGG TCCGTGG 3647 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5966 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:

CcAcCTTGAC CACCTTTACG TGGAATCTTT TCmCCTkGAG CAACaTCGaT AATaTATATT 60

GAAAgTCAAC AAGTTCTGGA CTAAATGTTG CTGCTAAGTT ATCGCCACCA GATTCTATGA 120

AAATTAGTTC TATATCGTCA TGACGTTCTA ATAATTCGTC TATTGCTGCA AAGTTCATAG 180

ATGCATCTTC ACGAATCGCA GTATGAGGAC ATCCACCAGT TTCAACACCA ATGATACGAC 240

TTTCAGGTAG AACTCCTGAA TTTACTAATA TCTTTTCGTC TTCTTTTGTA TATATATCAT 300

TTGTAATAAC GCCGATACTC ATTTCTTTTG AAAGACGTTT TACAACTTTT TCAATTAATT 360

GTGTTTTACC TGCACCTACA GGACCACCAA TACCAATTTT AATCGGATTT GCCACAATTA 420

TAACCTCCTA TGATATGAAA tTCTAACATT GaCGTTCTCA TGCGCCATTT GATTTAGTTC 480

TAAACCAGGC GCTGTCATGC CAAAATCTGC TTCTTTTAAT TCGAAAATCT GCTTTCTTGT 540

TCCTTCTATA TAAGGAATCA TGTGAGTAAC TATCTTTTGA CCAGCAGTTT GTCCAAGTGG 600

AATAGCACGA ACAGCATTTT GAGTTAAACT TGAAACATTT TGATATAAAT AGTAATCAAT 660

AATCGTTTCA ATATCTACAC CTAAATGATG GCCTAGCATA GTAAAACAAA TAGCTGGATT 720

TnACTTTGCT TTCTTATCTT GCATTTGTTG ATGATACCAA GCAATCCATG GGCTATtATA 780

AAGTTCTAAA GCCAATTTAA CCATGCGAGT CCCCATTTGT kTTGCACCAA CACGTGTTTC 840

TTTAGGTAAG TTTTGrACAr ACATCAGTTT ATCTATGTGT AATACTTTTT GTGTATGATC 900

ATTTTCCAAT GCATCATAAA CTAaACGCAT GGCTAAACCA TCAGAATAGG TAAGTTGCTC 960

TTGTAAAAAC ATTTTTAACC AAGCAATAAA AGTATGATCG TCATGAATTA TATTTCGTTG 1020

AATATATGTT TCAAGACCAA ATGAATGACT GAAAGCACCT GTTGGAAACT GTGAATCACA 1080

GAACTGAAAT AATCTTAAGT GTGTATGATC AATCATGAGA ATGCCCTATA TGTCTGAAAG 1140

CCnATTAAC TTTACGGTCT TCTCGAACAT ATGGGATGCC TAAACTTTTT AATAAATCTT 1200

CAACTAAATA ATCATATTGT ACTAGCATTT CAGTCTCTGT AAATTGTGCT GGCAAATGAC 1260

GATTTCCTAA TTGATGGGCT ATATCTCCCA TTTCTTGCAA TGTTCTTGGT TGAATCACTA 1320

AAAGATCTTC TGAATTAACA TCCACAATAA TCATATTATG GTCATCTGCG TATAAAATAT 1380

CTCCATATTG TAAGTCAATA GGTTGTTTTA AACGAATGCC TATTTCAGTG CCATGGTCTG 1440

TAACGACTCT TTGAATACGT TTAACAAGAT CTGAATTTTC AAGGTATACT TTTTCGACGT 1500

GCTTTTGTTT TTCTGAATTT GACAAATTGG CAATATTGCC TTGGATTTCT TCAACAATCA 1560

TTCTATGTTC CTCCTAGAAT AAGAAGTATC TTTGAGTTAA TGGTAACTCA GTTGCTGCAT 1620

TACTTGTAAT TTTTTCTCCA TCTACATATA CTTCATATGT TTGTGGATCA ACGTCTAATT 1680
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GACGCACCAT GCGTTTTAAA TTTAATGCAC GATTGATACC ATTTTCATAA GCAGTTTTAG 1800 

ACACGAATGT CATTGACGTA CTTGTAAGGT TTCCGCCGTA TTGACCATAC ATTTTACGGT 1860 

ACTTCATCGG TTCAGATGTA GGTATAGAAC CATTTGCATC GCCATTTACG GCAGAGTTAA 1920 

TTAATCCGCC CTTTACAACT AATTCAGGTT TAACCCCAAA GAAAATTGGG TCCCATAAGA 1980 

CAATGTCAGC TAGTTTGCCC GGCTCX3ATAG ATCCTACATA TTCAGAAATA CCATGTGTAA 2040 

TTGCTGGGTT AATTGTATAT TTAGCGATAT AACGTTTGAT GCGATTATTA TCATTATGTT 2100 

CAAAATCACC ATCTAAAGGA CCACGTTGTT CTTTCATGCG ATGTGCTACT TGCCATGTTC 2160 

GTGTAA1TAC TTCACCTACA CGGCCCATTG CTTGTGAATC GGAACTAATC ATACTGAATA 2220

CACCCATATC TTGCAGAACA TCTTCTGCTG CAATCGTTTC TTTACGAATA CGTGAATCTG 2280 

CGAATGCGAT ATCTTCAGGA ATAGCCGCAT TTAAATGGTG AGTAATCATT ACCATATCTA 2340 

AATGTTCATC TACAGTATTA TGTGTATAAG GCAAAGTTGG ATTTGTAGAT GAAGGTAAAA 2400

TATTTGAAAA TGCAGCGGAT TTAATTAAAT CAGGCGCATG ACCGCCACCA GCACCTTCAG 2460 

TATGGTACAT ATGAAGTACA CGGTCTTTAA CAGCAGCCAT TGTGTCTTCC ATAAATCCTG 2520 

CTTCATTTAA AGTATCTGCA TGTAATGCAA TTTGAACATC AAATTCATCA GCAACATCTA 2580 

ATGCATGACT CAAAGCAGAT GGTGTTGCAC CCCAGTCTTC ATGTACTTTT AATCCAATTG 2640 

CTCCGGCATT GATTTGTTCA ATGAGTGCAG TTGGATTTGT TGCTTGTCCT TTACCTGTAA 2700 

AACCGACATT AATCGGTAAA CcTTCGGCAG CTTCTAACAT TCTATGAATA TGCCATGGAC 2760 

CTGGAGTTAC AGTTGTTGCT TTAGAACCTT CTGAAGCACC AGTACCACCA CCAATATGAG 2820 

TCGTAATACC ACTTTCTAAT GCGACCTCTG CTTGTTCAGG ATTAATAAAA TGAACATGAG 2880

TATCAATACC ACCAGCAGTG ACGATTTTAC CTTCAGCGGC AATGATATCT GTTGTTGAAC 2940 

CTATAATAAT GTCGACATTA TCCATTATAT CTGGGTTGCC QGCATTACCT ATGGCGAAAA 3000 

TATAACCATT TTTAATGCCT ATATCAGCTT TAACCACTTT ATCGTAATCG ATAATAACGG 3060 

CATTAGAAAT GACAAGGTCT GCAACGTTCA CGTCATCACG TGTTACACGA GGATTTTGCG 3120 

CCATACCGTC TCTAATAGAT TTACCACCAC CAAAAGTAGC TTCTTCACCA TAAACCGCAT 3180 

AGTCTTTTTC TATTTGAGCA AATAGATTCG TATCACCTAA ACGAATGGAA TCTCCAACAG 3240 

TTGGACCGTA TAAGCTCGTA TATTGATTTT GCGTCATTTT AAAGCTCATG ATCTTTTTCC 3300 

TCCTTTTTTA TTCACGTTTT CAGCACCGTT ATCTCCGAAT ACACCTGCAT ATTCATCATT 3360 

TTCATCAGTT GGGCGATAGA CACGTGACTC ATCGATAGGA CCATTGACCA TACCACGAAA 3420 

ACCAAAAATT TTACGTTTGC CAGCATATTC AACTAATTGA ACTTCTTTTT TATCCCCAGG 3480 
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TTCGAAATCT AATGCTGCAT TTGCTTCATA AAAATGAAAA TGTGAGCCCA CTTGAATTGG 3 600 

TCGATCTCCT GTATTTTCAA CTTCGATAAC TGTTTCAGGA TGATGGTTAT TAATTTCAAC 3660 

CTCTGTACTT TTTGTAATAA TTTCTCCTGG TATCATTTGA CTGCCTCCTT TAAACAATAG 3720 

GGTGATGTAC TGTGATTAAC TTAGTACCAT CGGGGAACGT AGCCTCGATT TCGATATCTG 3780 

TAATCATGTG TTCGACACCA TCCATGACAT CTTCTTTGTT TAGAATTTGT CTACCATAAC 3840 

TCATTAACTC TGCAACGGTC TTACCATCGC GTGCACCTTC TAATAATTCA TCGCTGATTA 3900 

AAGCTAATGC CTCAGGATGA TTTAGTTTCA AACCACGTGC TTTACGACGA CGTGCAACTT 3960 

CCGCCGCCAC TACAATCATT AATTTGTCTT GCTCTCGTTG TGTAAAATGC AAATTAAAAC 4020 

CCCCAATTTC ATATTAGATA CaATTTACAA AATTTATATT AATCCTAATT GTTGTGATAA 4080 

ACAAGTAATA TACAAAGTTC AATGTGTAAT TAGAAAATTA TATTTTTAGC ATATCCGATA 4140 

TTGAAGCAAA CAATCTAATC GAAAACAAAT AGTGGAATAT ATTTATGTAA AAACCAAAAT 4200 

AGTTTTTAAT ATAACTTTTC ATAGAATAGT AGTATATTAA TGAGTAATGA TTCAAAGGAA 4260 

AGGTGAAAGA TTTGAAGATA ATAGATGTGC TTTTGAAAAA TATATCTCAG GTTGTGTTAA 4320 

TTAGTAATAA ATGGACAGGA TTATTTATCT TAATAGGATT ATTTGTAGCC GATTGGACAA 4380 

TTGGATTAGC GGCTATTGTA GGTAGCATCA TCGCCTATAC TTTTGCGCGT TTTATAAATT 4440 

ATAGTGAGGC AGAGATTAAT GATGGGTTAG CTGGATTTAA TCCAGTGCTA ACTGCCATTG 4500 

CX3TTAACAAT CTTTTTAGAT AAGTCAGGAT TAGATATTGT TATAACAATG ATAGCAACTT 4560 

ATTAACGTT ACCAGTTGCT GCTGCAGTGA GAGAAGTTTT AAGACCATAT AAAGTTCCGA 4620 

.XjCTGACGAT GCCTTTTGTC ATTGTGACTT GGTTTACAAT TTTACTTTCA GGACAGGTTA 4680 

AATTTGTAGA TACATCGTTA AAGTTAATGC CTCAAAACAT TGAAACGGTT AATTTTAGCA 4740 

-CAATGATAG AATaCATTTC ATTCAGTCAT TATTTGAAGG ATTCAGTCAA GTATTTATCG 4800 

AAGCGAGTGT AATTGGTGGC GTATGTATTT TAATCGGCAT ATTGATAGCA TCAAGAAAAG 4860 

CAACACTCTT AGCTGTTATA GCTAGTTTGT TAAGCTTTAT CATTGTAGCT CTATTAGGTG 4920 

GTAATTATGA TGATATTAAT CAGGGATTAT TCGGTTATAA CTTTGTATTA ATGGCAATCG 4980 

CACTAGGATA TACATTTAAA ACAGCGATTA ACCCTTATAT TTCGACTTTT TTAGGTGTGT 5040 

TATTAACAGT AGTGGTGCAA CTAGGTACAA CAACATTGCT TGAACCGTTT GGCTTACCTG 5100 

CATTAACATT GCCATTTATT ATCGTGACAT GGATTTTATT ATTTGCTGGT ATTAAACATG 5160 

ACAAAGTAGA TGCTTGATAG TTAAATCAAA CCTAATATTG TTTGAATATC ACCTTAAACT 5220 

ATACAGCGAA TTGTATAGTT TAAGGTGTAT TTTTATGGAT AAAATTAAGT GCATACTTAA 5280 
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TTATTAAAAT AAACGTATCG TATTGTGATA ATAAATGACT CGCATTAATG ACATTGCCCA 840 

AAAATGTGAC ATCATTTTCT AACCCAGCTT GTACAACTTG TTGCTGACAA TCATTTAATG 900 

TAGGTCCATC GCCTATAAAT GTA?^TGCG CATGATTACT GTTATGTAAT TTCAATATCT 960 

CTATTGCCGC GATTAGATTT TGTGGCAATT TTGGATAAGC AAATCTTGCA ATCATAACAA 1020 

ATTGATGCTT TGTCGGGGCA TTAATCTGTA AATCTTGTTT ATTAGGCAAC ATTCCAACTA 1080 

CTTCGCCAAT ATTGTTATGT GATTGGCTTT TTAGOGTTTG CTTAACAGCG GGAACATCTG 1140 

CAATACCATT ATGTATTGTG GTTAATTTCA ATCGATTAAA TCGATATTTT AACGCTAACT 1200 

GTTTATCGAA ATCTGAAACA CAAATAATGC TATCTGTAAT AAGTGACATT AATTTTTCGA 1260 

TAACTAAATA TAGAAATTTT TTAGCTGGTT TAACACCCTC TGTAAAAGCC CATCCATGTG 1320 

CAGTAAAAAC TATACXSTGTG TCTTTCGATT TCGAAATGAa CTtCGCAATT CGTCcGACCG 1380 

TtCCAGCTTT GGAAGAATGT AAATGGATAA CATCAGGTTT AATTTTCGAG AATAACTGTG 1440 

CTAACACTTT GACAGCTAAA ATATCTTGTT TAAAGTCAAT TGGACCTACT AAATGTTCGA 1500 

TAATAATTAC ATTAACTCTT GCATCTAGTT GTTCAATCAT TGGTCCATGA TTGCCTACAA 1560 

TGACATAAAC ATCATTGTGT ACGCAAAAAT GGTTGGCGAG TTGAATGAGA TGTGTTTGTG 1620 

CACCACCATT GTCTGCTTTA GTAATACAAT ATATAATTTT CAACTGTTAC AAACCCCTTT 1680 

AATGCTATAC TTTCAATTTC TTAACATGGC TATCTCATCA GATGAATAGT ATTTATAGCC 1740 

ATGCAAATCA ATGATGGCAC ATATTTCTTA ATGCCATTT6 ATACTGTCTC AAGGGATTCC 1800 

TCGTTATACT GTAACAATTG GTCACAATCT TTAAAATATA ACTTTTATTT GAACTTATTA 1860 

AGTAAATTAA GACTACCTTG AGCCTTCCCC TGTAATAACA ACCATCAATG TTCTAATTQA 1920 

TATATATAGT TCCATCATTA AACTACCTTT ATGTATATAT TTCATGTCAT ATTTCAGTTT 1980 

TTGTTGCOGT GTTAAGTCAT ATCCACCTTG AATTTGCGCA AGTCCTGTTA ACCCTGGTGT 2040 

AACAAGACAT CTTTGCTCGA AACCTATCAC TTCTGAACTA AATAATTCTA CAAATTCCGG 2100 

ACX3TTCCGGG CGTGGTCCAA TAAAACTCAT TTCCCCTTTA ACAACATTAA TTAGTTGTGG 2160 

TAATTCATCA ATGCGTGTTT TACGAATAAA CTTCCCGACA TTTGTTATAC GATCATCATC 2220 

TTTATCAGCC CATTGCGCAC CGTTTTTCTC TGCGTTTTTG CACATCGAAC GTAATTTGTA 2280 

TATTTTAATT AATTTACCCA TCTTCCCAAC TCTAACCTGA CTATAAATAG GGTTTCCTGG 2340 

CGAATCTATG ACGATAGCAA TGGCGAATAT AACCATAATC GGTAAAGTTA AAAATAATAA 2400 

AACAATGCTT AAAATTAAGT CAATCGCACG TTTAATTGGG TAATAGCTTT TTCTCACTTC 2460 

TTCTAGTTTG TCTAATTTTC TTTGATAGGC ATAACCCTTA TTATTATGGA CAGCTTCAAT 2520 
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AATTAAAGTA ATCCTTTAAA 
CCTGTTTCTA CTGTATATTT AGGAACAAAT CCTAATGCCT 2640 

TTAAGTTAGA AATATCTGCA TAAGAATGCT TAATATCTCC TTTTCGTGCT TCTTTAAATT 2700 

CATGCTCX3AC TGATTTTCCA TATAATTCAC CAATAATACG ATAAACCTCT AATAAATTAG 2760 

TAAAAGTGCC TGTACCAATG TTATAACCGT GTCCAATTGC ATCTTTGTGT TCCATAATTA 2820 

AGCGTACAGA TTGAACAACA TCATATACAT ATACAAAATC TCTAGTTTGC AGTCCGTCAC 2880 

GAAAAAATGT AAATGGCTTG TTATGCTCAA ATGAATCGAA CATCTTTGAA ATCACACCTG 2940 

AATATTGTGA CTTAGGATCC TGTCTTGGCC CAAATACATT AAAAAATTTA ACAACCGCTG 3000 

TTGGTATGTT ATATAACGAA CAATAATTTA ATGTCGTCCG TTCGCCGTAA TATTTATCTA 3060 

TTGCATATGG TGATAATGGT AAGATTAATG ATTGATCACT TTTAGGCAAA TCAGGAAGAT 3120 

CACCATAAAC AGCTGCTGAC GAAGCAAAGA TAAAACGTTT TATATGATTA TTATATTTTT 3180 

TAATGATTTC TAACAATCTT AATGTTGCTA CGACGTTTAT TTCTTGAGAT AAGATAGGTT 3240 

TCTCAACCGA CTCAGCAACA CTAACTAATG CTGCTAAATG AATAACATAA TCAAATTGAt 3300 

ATGTCTTCAT GATTTGTTCA ACTGCATCAT ATTCACGAAT ATCTAATTCA AACACATGAT 3360 

CGTCAGCCAA ACTTTTAATA TTTTCTCGTT TACCTGTTCT ATAGTTATCT AGAACATAAA 3420 

CATCATAATC TTGTTGTAAA TCATCTACTA AATGCGACCC AATAAAACCA GCCCCACCAG 3480 

TTATCAAAAC TCTTTCCAAA TCTTCCACCT CATTTATACA TTAAAAATAT ATCATAAAAA 3540 

CATAAAGTAT TGTAAGCTTT TTATCGATAT TTTTTATTTA TAAAAATAAA ATGAGATAAC 3600 

TTTGTGAATT TTTATTGAGA TAAATTAGAT AGTGGTGTTT TTGTGATGTT TTATAATATC 3660 

TTGGGTGTGT TAATACTAAT AATGCTTTCA ACTGATGCAT TAGACTGTGA CATCATAACT 3720 

CACTTAAGAA CTTCGCTTAT TAATTTTCTA CCAATACACT CCCTTCTAAG TGCACTAAAA 3780 

AATCCTTACT GCTAAGTGAT TAAACTTAAC AATAAGGATT TATTTATCAT TAGTGGATGA 3840 

TTATTAAOGG AATCTCATAC CACCATCTAC AATAATTGTT TGTCCAGTAA T6TAATCAGA 3900 

GTCTTTACCA GCTAAGAAGC TCACTACATT TGAAACATCT TCTGGTTGAG AAACTCTGCC 3960 

CAAAGCAATC TGACTTGTAA ATTGTTCCCA ACCCCATGCT TCAGGTTTAC CTGCTTCTTC 4020 

GGCTGTTGCC ACTGCGATAC TTTCCATCAT TGGTGTTTGA ACGATACCAG GTGCGAATGC 4080 

ATTCACAGTA ATACCTTCAG ACGCTAAATC TTGTGCGGCT ACTTGTGTTA AACCTCX5CAC 4140 

TGCGAATTTT GTACTGCAAT ATAAAGACAA GCCTGGGTTA CCCTCAACGC CTGCTTGAGA 4200 

TGTTGCATTG ATAATTTTAC CGCCATGATT GAATTTTTTA AATTGTTCAT GTGCGGCTTG 4260 

AATACCCCAT AGCACACCTG CAACGTTCAC GCCATATACT GTTTTAAACT GTTCTTCAGT 4320 
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GCCAAATTGC GCGGCAGTTT GTCTTAcTGC GTTAAATACA TCATCACGGT TTGATACATC 4440 

TGCTTTGATA GCAATAGCTT TTGTACCATC ACTTGATAAT TTAAGTGCAG CTGCTTTTGC 4500 

CCCTTCTTCA TTGAAATCAA CAACTGCTAC TTTGAAACCA TCTTCCACTA AACGTTCTGC 4560 

AATTTTAAAA CCAATCCCTT GTGcTCCGCC AGTTACTAAT GCTACTTTGT TGTTTGTCAT 4620 

AAAGATCACT CCTCAAATTT CTTTCCTTTA ATTACATTTT ACTCCTCTTC ATTTGAATAG 4680 

TACAACAAAG GTAGCTCCAT TTAACAAAAT ATTCAGATAT TTAAGGTATA GTTAAACGCA 4740 

CTACCATTAG TGATTGGCAA TGCGTTTAAA TGTCGTTTTA AAAGTTCTTA TGTTGAATAT 4800 

TATTTTTTTA AGTCTCTCGA TTAGTTTGTC ATCAATCTTT TTTCGAGACA TGGTCTTTTG 4860 

ATTCAATAGG CGGTTCCGTG TTATCACTGA CAACTTTAGT TGTAGCTTCA TCTTTATGTA 4920 

TTTCTTCGTT AAATCCTTCA AGGTTTTTAG TCX3TGGGATT TTTAACCTCA GGATGTTCCA 4980 

TCATGTCTTG ACTATCAAGT TCCTTTTTAC ACGTGTCTTT ATGTGATGCT TGATTTGCGT 5040 

TCCCTTTACT TTTTTGAATA GTGGTAGTAT CTGCTGCAGC TACTAATTTT TTTCTACCTA 5100 

AAATAGATAT GGCTGAAACA AACCAGAGTA TTGCAGATAC AAAGTTGCAT AATACTAAAG 5160 

CGATAATAGC CAATACAATT AATATGACAC CTTTTGAAAT CCTTTCTTTA AATAAGTCAG 5220 

ATGCCAATAC GATGACAGGT ACGATTGAAA GTATAATTAC AAATATAGAA ATTATTGCCG 5280 

ATATAACTAT TGTTACTATT AAATAATCAG CTCTGCTACC TGATAATAAA TAGAAAAGGC 5340 

CGAAAATTAG TCCATAGCAA ATTACAAACC CACATAAAGT TATAGCCATG AGTACTATAT 5400 

AAGCTATTTG AAAATATAAA CCTATCTTTA TGAATGATTT TTCTACATTT TTTTCCATGT 5460 

CTATTCCCCA TTTATTTAAA ATTTATACTT TACCTTAAAT ATTCTCTTTA TTCTTTAGTG 5520 

ATTTTATCTT TAGATTCAAA TTGATTCTCT GTACTTTCAA TATCAACTTT TTCATTTTCG 5580 

TCTGTCGATT CATCTTTTGA GTATTTATTC CAAATCAGCA AAATACCACC AATCAGCCAT 5640 

AAAATTGACG AAAGGAAATT ATATAAACAC AGTGCAATAA TAGCATAAAC AATAAAAAGT 5700 

GCACCTCCGA TTACAGAGTA ACTTTCCATA TAAATCGCAG TAAAGATGGT TGGTAAAACA 5760 

GTGAAAAGAG CCAATATTAA TCCTAATAAA AAAATTGTTT CGTAATCAGA TCCTCCAGCA 5820 

ATATTAATAG ATATCATCCT AACAAAAACG ACACTAAAAT ATATTTGAGC TACGATGCCT 5880 

ATCCAAATTG CTATTTTTCC TATAATTGAG CTCATACTCA TTCCCCATTT ATTTAAAATT 5940 

TATACTTTAC CTTAATATAC CTTATTTTAT TTAATTTTTA TATGCAAAAT ACAAAAATGG 6000 

AGAACTTCAA TATTTATAAA ATATCAAAAG TTCTCCACAC TATATTGTTT TATTATATTT 6060 

TCGCTATCAA TACGCTAAAT CATCATATTT CCCTCAACAT CACAGTAAAA CTATTGCTCC 6120 
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TTCCAATTGC GCAGTTGTTC AACATCATCA 
TCTTGTTTAA GTAATGCCAG TGGTACTTGA 6240 

AGATTAAGAC ATCGTCCTGA AATATTAAAG CGTGTCACAC CTGCTGGCAC AGTTTCCCCT 6300 

TTATGAACAA CCGCTTCAAT TTCCTTATAA CTCAATGGCT GATACTTCAT GAGTACATCT 6360 

TGTTGAGAAA GACAAGGATA TGTACCTTGT GCAATTCTCT CTACAGAACA ACAACCACTA 6420 

TAACTTGCGA CAACCTTTTC CCATACTTGA AAATGTGCTT CGCCTAAATC TTTTGTATAC 6480 

AAATATTGTT CTGTATCACC ATGACACATT GTAATAAATG GCGCTTCTTG TCTTGTCTCA 6540 

GTAGTCCATG GCAAGCGATG TTCTTGTTGT AACGTTTCCC ACCACACACC AAATGGAACT 6600 

TTATGTTGCC ATGTACTAAT TGAATATTGT GTTTCATGGA TTTCTTGCAC TGGAACTTTC 6660 

TTACATCCTA AOGCTTTCAA ACTTGTATAC CGATGCACAC CATCTATAAC CATATATCTA 6720 

CCATGTTGCA TCX3CTGTCAC TAAAATAGGA TGACGTATAA AATCATCTGC TTCAATACTA 6780 

CTTTTCGTTT TTTCCAATCT TAAAGGTTCG AATGTTTCGT GAAGATCAAT CTTATCTACT 6840 

GGTACCAATT TTAAATGTTC ATGAATATGA TTCAATAGTT ATTCATCCTC CTTTGTTTGT 6900 

GTTAAATAAA TAAATTCAGG ATGTGGATGG CTTAAGAAAT CGTGATGTGA AATAGACCAT 6960 

CCGTATGCAC CTGCATATTT GAAAACAATA ACGTCGCCTG TACTGATTGC GTCTATCTGT 7020 

ACTTCTCTAG CAAAGACATC TTTCGGTGTA CATAATTGAC CGACTAACGT TGTGTCCTGT 7080 

CTCGAAATTG AAACTTTTTC AAATGAATAT GGATTGTCCT TATAGCGATA AATGTCAAAA 7140 

GGATGGTTAT GTTGCCAAGA TACCGGCAGT CTAAATTGTT GCGTACCTCC TCTTAATATG 7200 

GCATACCAAG CACCATGTAC TTTCTTAATG TCTAGCACTT CTGTCACATA GTAACCAATA 7260 

TGTGCCACAA TAAAGCGCCC ACATTCAAAG TTCAATGTCA CATCTTCCAT TTCTTGCTCA 7320 

ACGATAAGTG TTTTAAAACG TTCTACAAAA TTATCCCATT CAAATTGGTT AGTTAAATCT 7380 

GCA'H\GTTAA CGCCTATGCC ACCACCAAGA TTGATATGTT TGAGTGGAAA TCGATGTTTT 7440 

TCAGACCATG CCTTTGCTTT TTTAAAATAA AGTTTCACTA CATCGACATG TAAATTCGAG 7500 

TCTAAATTGT TAGAAATAGA ATGAAAATGA AATCCATCTA GATGAATCTT TGGCATTGCG 7560 

AGCGCAgcTT CAATGACATC ATCAACTTCG TCTTCAGAAA TACCAAATTG TGTTGGGCGT 7620 

CCTGCCATAT GCAACGTTGC ATTGGGAAAT GGTCCTGCTA AATTAACACG CAATAAAATG 7680 

TGTTGTGTCT TATCTTCATC TTCTAAGATG GCATTTAGCC GTTGTAATTC ATGCATACTT 7740 

TCAACATGAA TACGCTGAAC ACCTTCACTT ACTGCATATC TTAGTTCCTC GTCTGTCTTA 7800 

CCAGGGCCAC CAAAAATAAT ATGATTTGCT GGTTTAAAAG CAAGACCTTT TGCTATTTCA 7860 

CCTTGAGATG CAACTTCGAA TCCTTCAACA TACTGACTAA TTGTATCTAG GATTTTTCGT 7920 
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TGTTGCAAAT GATGTTCCAG TCCGACTAAA 
TGTGCTTTTA ATTGTTCAAT AACAGGTTGA 
GTTTAGACGT CGCTAGAGAT GCACTTAAAT 
AAATAAATGT TTGTACACCT TGTGCCTGCC 
ATGCACAAAA ATGTTTACCA TGTGCATTCA 

TTACTTGATC ATCACGCGTT TGCCATGGTA 
CTTCGACTAT CATGTCTAAA CCTTCGACTT 

CAACATCTTC TATCATGGCA ATCACCATAA 
GTAATGGTGT ACGTCCAAAT CTTGCCATGC 
GGTAATAACG ACTTAATTTC ACAATATGCT 

CAATAATACC TCTCGCACCC ATATCCAACA 
TGACACGTAC AATTGGTATA ATATGCGCTG 
TCTCATCATT AATCGCCACG TGTTCTGTAT 

CGATAACCTC GATCATCAAT GGGTCCGGTA 
CATTGTTTAA TCTATGTTTC AGAGATAGTT 
GGATTTGTAA CATGATGAAT TCTTAACTCG 

TTTTCAACTT GAATCGTAGG TTCAAACAAA 
AATGCTTCTT GATACGCCTC GATGATGCCT 
ATACCATATT GCTTTTCAAT AAATAAGATG 
TCATGTAAAA AGTCGCGTAC TAAACGTTCG 

ACrrnTTAT gtgcttctgg cattggcttt 

TGctCACGCT TAAAACGAAC ACCATCATGG 
CCATTTTCAT GAATGAGCAT CATATTTTGT 
TAAAGCATAT GAATCATTGG ACGAATCGCT 

GAACCATATT GTTTAATCCA ATTTTCAATG 
AGTGCATTAA ATGGTATCGC ATCCTCTTCA 
CATATAACAC CTAACGCACC ATAAACTTGA 
AAATAAGACT GTCCTAAGAC TTCCCCTAGA 
ATATCTTGTT GCTGTATCTG CTTTAACCAA 

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TCATAGATAT AATGACAAAC TGGATGAGAT 8040 

ACTATACGCA TTAGCCTTCA TCCCCTTTCT 8100 

GGCGATATAT TTTTCCGCGA TCATCACCTA 8160 

ATTTTGCAAT ATCTTCATCT TCACGTGGTA 8220 

CAACTTCAAA AATATGTTGA ACATGTGATG 8280 

TGCCAAGTGA CTGCGATAAA TCTGCGGCAC 8340 

GTGCTATATC GTCAATGGCC ATAACCCCTT 8400 

TATGCTCATT AGCCATCTCC ATTGCATCAA 8460 

GACCACCATT CAAACTTCTT AATCCTTGCG 8520 

CAACTGTCTC ACGATCTTTA ACGTGTGGCA 8580 

CTTTAATGAT ATCTCTATCT ATCACTGCAG 8640 

CTTCAGCTGC ACGAATTAAA TGCGCTAGTG 8700 

CAATCACAAC AAAGTCATAC CCGCTTGCTG 8760 

TAGAATTAAA AATGCCATAA ACTGAATCAC 8820 

GTTGCATCAT TGATACCTCC TACACCTAAT 8880 

GAGTCACTTA ATAATCGACG TGTCGTTAAC 8940 

TCGAAATGTT GATAGTTATT CAACTCTGGA 9000 

TTAACCCATT GCCATTGCAG CTCCTCATCG 9060 

ATTTCGGCGA TATTAATAAA GAAAAATGCA 9120 

TCATCTGTTT CAATAAATGA ATTACTATTC 9180 

AATGTCAOGT GTGAAGCAGC TTCACTTAAA 9240 

AAATCTTTTA AGGCAATACG TGTAGGCCAA 9300 

GCATGCGATT CAAAGGCAAT ACCGTGATAA 9360 

ACAGCTAAAA ATTGCTTTGT CCAAGCTTCA 9420 

AATGGTACAC CATCCTTATC ACTTGCATAA 9480 

TCGATTAACA TATGATATAT ATTTTCACGC 9540 

GTTTGTTTAT AAGGCGAAAG TTGTGTATTT 9600 

AAAACTGTCT TTAATTCATC TTTTAAATAC 9660 

TCCGTAATTT GCGCTGCATT TTCAATTGTA 9720 
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TATTTTGTCX5 TGTCTATTGG CGACATCGTA CGAATCGATT GTTGAGGGTG ATATAGCTCA 9840 

TCACTTTCCC CTAACCATAG TACTGTGCCA TTAAGCCTTT CTTCAGCCAA ATCAACTTGG 9900 

ATGACATGTT CAAACTGCCA TGGGTGTACA GGTATCATCT CAACATCATT TACATGTTTG 9960 

CCAGATGCTT CAATTTGCTG TACAAAATGT TCATAAGTCT TATCGCCAAC TTGTTGACGT 10020 

AACATTTCGT TAACTACAAC ATTTCTTGAT ACCGTCGTTT CTACTTTATC TTTGTCGATA 10080 

GCTAACCACT GCAGTTTAAC GTTTGGTACA AAATCAGGAC CAAATTTCAA ATTATCACTC 10140 

AAOGTAAATC CTAAACGTGA TTTGTAACTT GGATGATACT GATGCCCTTC CATCGCATAA 10200 

AATTCATAGT CGTTAAATGT CTCAGGTOTT GCTGGTGGGT TTGATTCTCG ATACTGCATA 10260 

CTTTGOGTAT CTTTTAATTC TGTCTGTAAT AACTCGACAA TAAATTGTTC TAGCTTTTCA 10320 

TCATTTTTAG GAAATGTAAA TACAACCTCT CTCAATAATT GTGTATAGTC TGTTGTTGTA 10380 

TCTGCCTCAT CTCCTACGAC ACGCTCAATT GGTGATGTGA TACGTATACG ATCAAAGCTA 10440 

TGTGTCTTTT CAGCAGTAAA ACGATACTCT GAATCATGTC CTTCTATTGT AAAATGACCG 10500 

ACACCGTCIT GATATGACGC TTTATACACA ACAATATTCT CATAAATAAG TGATGATACC 10560 

AGTTGGTGCA TCACTCTAGT CTTTACACGA TTAAGAATTG TTTGATTCAC AATACGATAC 10620 

CTCCTTGTTA TGACAAATTG GATTTGGTAT ATGTGTATAA ATAGGGTTTG CACCACAATC 10680 

ATTCAATTTA CTCATCAAAT TCGCTTTAGC CGcAATGGTC GGCGTTTGAT ATAAATCTTC 10740 

TACACAGTCA ACAAATACTG CGTTATTCGC GTATTCTTTT TTCCAAGTCA TAAGACGATG 10800 

CGCTACAAGT TGCCATAACA CAACTTCATT TCTAGTCGCT TTACCAATAG TTGATACTAA 10860 

ATGTCCTAAG TGATTTACTA CAACGTAATA TTTAAGACXy^ TGCCATGCTT CATCATGTGC 10920 

ATATACAACA GGGCTTGATG CTGCCACAAC ATTTGGCACA AGCTGTTTTT CAGTAGCAAT 10980 

CGTTSTAGAT AGACAAATGC CTTCAAGATC TCTGACAAAG CATACGTCGG GTATGCCATC 11040 

TTTTAATTCA ATTAATGTAT TTTGTACATG TGCTTCTAGA CTAATGCCTG TGTTACTAAA 11100 

CAGCTTTAAT ATCGGCAATA ATGTACGATT CAAATAACAT TCAAGCCATG CTTCTGGTGC 11160 

TAAACCACTT TGCTCAATCA CTTGTGATAA CTTAGACATC GGTGAATCAG GCATCGTTTC 11220 

AAATAATGAC GCCAATACAT GAATATCTTT ATCAGCATGG TAATTCGGTA TCCCTTCACG 11280 

AACAATCATG GCACTATTTG TTAATAAATC CATTTCAGGT TCAACTGTTT GCCCTAATGG 11340 

ATTCGGTAAC AATGCACGAT ATCCTTCTTC AAACATCAAT TTAAAATGGG GTGTTTC/U^C 11400 

CTCATCTTTG ACTGATGCGA TAACTTGCGC GGCATCAATT GTCCGTTCAA TCTGTTCAAG 11460 

GTCATTCGTA CGTATAAAAT TAGTGATTTT AACGTGTATC GGTAATTTTA AATAAATGTT 11520 



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GCCAAGGTCT TTTATTAAAC CTTGTTCACT 
CACATTGATT TGATAAGGAT GTGTTGGTAA 
ATCTATGTCT GCTAATTGAT ACAACACTTT 

GCGCGTGAGC AGAACATCTT GATGCACAGC 
TTCGGGTGCA TATTTCTCTA AATCTGCTTC 

ATGAAATGGA TGACCTAAGT ATAAAGATTG 
GTCTATTGTG TTACTTTGCA AATAACGTGC 

CATAATTTGC GCCATATGTT GTTGCACTGC 
TTGCAAAATA CGCGCAATTG CTTCTTTATA 
AAGCCATACC TCTGGATGAT ACATATGATG 

CGTTAAAGTT TCGAGCTCTG ATAATTGTAT 
ATATAAATTT TCTTCTCTAA AATATTCATT 

ATGTTGTATT AATTCTTTAT TTTGCACTTT 

TGTGATCGTT GATTTGATTA GTGATGGTTG 

ATACTACGCC CATAACGATA AACGTAGTAG 

cACTaAGACT GCCAATAATT TGACCAACAA 

TGCCTTTAAG TTGTTGATGA CACGCATTCA 
CACTATATGT TAATCCTTGA AGTATTCTTG 
AACCTTGCAG TATCGCACTA CAACCACATG 
CATATGATTT ATCATTAAAG CGTCCCCATA 
ATGCGGACTG TAAAAATCCA ATCACACTAC 
AAGCAAGTGG TGATAATGCA GTTAGCATGC 
CGATAATAAA TCGACATGTT TGTTGTGTGC 
CTTTATTAAT ATTTGGTGTT TGTGATTTTG 

CACCGAAAAT ACAGACAATA AAAGTAATAA 

CTAATATCGA AGCTGTAACA CCGCCAATTA 
AACTTTGCAG TCTTCCTAAT ACCTTTCCAC 
ACGCACTTGA TGCATCAACA ACACCACCAA 
ACTGTAATGG TGTCGTACAC AATGCCATTA 

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ATATTGCATA TACTGTGGAT GCTGTCGCAA 11640 

TAAAATAAAA TCTTTGGGTA TCTCTGATAT 11700 

CTCAACCTGA TCTTCTTTAC CTTCTACATA 11760 

TAAATAATGC AATTGGAATG ATGTATGACA 11820 

TGAAAACCCA CTTGCACTCT TAGGAGTCGG 11880 

TTCTGAAACG ATATAACGAT CCTCTACGTA 11940 

CGTGCGATGA ATGCTATTAT CGATGTCAGA 12000 

C6TTTGATTA TCTGCACTTT GAGCCATATG 12060 

AGTTGTTATT TTTTTACTTT TTCCATCGAT 12120 

CCCCATCGCA GACCAATAGC GAAATTCACC 12180 

AGACCATTGA TGATTTTGAG GTGGTACTTG 12240 

TAAAATGCGT TCGATAGCCG CATAC6CTGC 12300 

TTTGTTTCAA CTCCCATAAT TTCATTAATG 12360 

AACAAATTAA AAATAAACTA CTTACTGCAA 12420 

CTGGTGTAGT ATAACTTGTA ATGGCAGCX5C 12480 

CTAACATACT GTTCGTCGTT CCAACAAATG 12540 

CGACAACAAA CATGACACTT TGAATCAATG 12600 

CAGCCATTAA AAACTCTATA TTCGTCGCTA 12660 

CAATCGTGGC AAATATATAT ACTGATTTAA 12720 

AAGGCGCGCT TAATATCGAA GCCGTCCAAA 12780 

GGTCATCTAT CGCTGTATGA TTCACTGATG 12840 

CATACATAGC AAAGTTTGCT AAAACGCCAA 12900 

ATAATAGACA TTGAAATGAA CGGCGAATAC 12960 

GCATATGTGT CGTTTCAATC AATTTTAATG 13020 

CGGCAATACT CATCAGTAAC GCACTAAAAC 13080 

ATGGCCCCAC AAGAGACCCT GCGCTGACTG 13140 

GATCTTCAGC TGGCGCCTCT GCACTCGCAA 13200 

ATAGTCCCTG CAATAACCTC ACAAGTACAA 13260 

AAAATAAGCA TACCGCCAAA CCAAGTAACG 13320 
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ctATCATCGT CGTTACAGCT GGAGCAGCAA TCGCTATACC ACTCCACAAC TGTATTTCTA 1344 0 

CGACTGATAG ATTTTGTAGT GATGCCATAT AAATTGGCAA TAATGGCACA AGTACTGTCA 13500 

GTCCAGCAAT CGCTATAAAC TGACTGAGCC ATAAAATGCG AAAGTTACTG CGCCATATAG 13560 

ACTGATTAAT CATATGTCAC CATTGGATTT GGTACGGTAG TTAAACCTGA AGGCATACTA 13620 

CCTCCACCAC TATCACGTTG ATATAGCAAT GGTAATAAAA TTTGTTTGAA TGGCCACGTC 13680 

TGTTTATCAA ATAAAATGTG TCTGACAGCT AGCTGATCAG TTGTAACCCA GQAAATAGTT 13740 

GCCACTTCAT TTTTTAAAAT TTGTTTTAAC AACGACATAA GTTCATGCTC ACTTACACCA 13800 

AATAAATCTT GAATTGCATC AATAATGGCA TATAGATTTA CCGATACAGC TAATGTTTGA 13860 

AAATAAGCAA AGAATGTTTC CAAATCCTCA TTAATTAGCG TATTAGGTCT ATCTTCTCTG 13920 

ACGACATACT TCGGCAATGA AAGCTGATGT GCTGTTAGCC ATGGTTTATA AATTCTGACA 13980 

GTATCATGAT CACGTAACAC GCATTTTTGT ACACGTCCAT CTTCAAATGA CAACAATATA 14040 

TTTTGACCAT GCAACTCTGG TAATGCGCCG TATTGCATAA ATGATAGTGT TACCTTTAAA 14100 

AAGACTTGCG CGATATCTTC AAATAACGTC ATGACATCAT TTTTAGAAAT ATTATCTTTT 14160 

CCACAAATCA TTTGATATAA AGTGCGATCA TTTGCGGCGA GTGCTGCCAT TGACACTAGC 14220 

TGTTGCGTAT CATTTTTGGC TAGCACTTCG GGATACTTTC TTAGCTGAAC AGTTAGATGA 14280 

CCTAATTGAT CTTTGAAAAT ATCATTATCT TGACCCATAT ATGACCACCA AGCTGTTTCA 14340 

TCACAAACCA TGACATACTT AGCTAGTGCT TCATCTTTTT CTATAAGCTG ACGTAATAAT 14400 

TGTTCTGCTT GTTCTCCGTT TTTCATGTAA CGCGTAGGCG TTAGCCTTAA TGCGCCTAAT 14460 

GACTGCATTG CAAATGGTAC TTTGACATGG TTATACGGTG CGCCAATATC AATTAATGAA 14520 

CGCATACTTG AAGACGACAG ATAATCTCCA AATTTTAACG GTAATAGTAC AACCAACTTT 14580 

TCACTAATCT CTTTCXKIAAA GACGTTCGGC AGAATATGCT GATATTGCCA AGGATGTACC 14640 

GGAAATAGTA CATAGTCATC TATTGATAAC CCTTGATCAT TTAACATGTC TGTCGCTTGT 14700 

TCTTTTATAG GTACTGTCAA ATTTTCTAAT TCATCGATAT TTGCAGTATC GCCATGAATC 14760 

ATATGTGTCT TTTTAACTGC TGCAACCATT AAAGGAAATG ATTGATTTAA TTCAGCTTGA 14820 

TACACTTGAT AATCCGCTTC TCTTAATCCT CTTTTTTCTT TAGCTAATGG ATGAAATGGA 14880 

CGATCTTTTA AACTTGCAAA CTGCTCTGAC ATCACAAAAG GATGTGACGC TAAATCTAAT 14940 

TCTGATAATT GTTTAGCAAG CTGTGTGGCA GCAGTAGTCA GTCCTTCTTC AACGCGAGCC 15000 

ACTTCCCATT CATGACTTAG ATCACAATTC ATATTAGCAA TTGTTTGCCA AAATTCAGCT 15060 

GCCGTTAAAG GTTGCTTAGA CACCCTTCCC TCTATCGTAA TTGGTTGTGA ACTTTCGTAA 15120 
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TATATCAAAA GCGTTTGTCC GTTTTCTTTA 
GTAATCTCAC TATTCGATAC AATTCCGGCT 15240 

ATATCTTCAA ATAATAATGC ATCAACTAAA TCTCTTAATA TTATCGCTTG TGCTGTATTG 15300 

ACTGCTGTAT GATTCTGCAA TGTTCAGACA CCTCGCATTC TTAATATAGG TTCAATGTTG 15360 

TCCCAATATT TTGTTGTTGT GCCTGTTGAT AAATAAAATA AGCACTTGAA ATATCTTCGA 15420 

TAGCCATACC CATCGGATTA AGTAATATGA TCTCATCATC GTCTTCACGT CCTGGTATGT 15480 

CACCTGTCAC AAGTTGTCCT AGTTCA6CAT GAAGAGCTTC TTTGCTGAAT TTACCTTCTA 15540 

ACACCAATTG GTTAATAGTT TTCTTTTCTC GATTACATTG TGACCAGTCA TCTACTACGA 15600 

CTTTGTCAGC TTTAATAAAG ACTTCTTTAT GCACATCCAT 6ATAGAAATG TTGCTAATAA 15660 

ATGCACCCTT TTGTAACCAA TCATATTCAA TGTATGGTTG ATCCGTTACG GTACATGTAA 15720 

TGACTACTTC ACCATTTGAT ACTGCTTCTT TAOCATTTTC TGTCGCAATA AAATTAATTT 15780 

CCGGACGCTG TTGTTGCCAT CTATCAACAA AGCGTGCACA TGCTTCAGAG AATTGATCGT 15840 

AAACAAACAC GCGTTCAATA TGATCGAATT GCTCTAACAT ACTTTGTAAT TGCTTGTCTC 15900 

CGATTAGCCC GCATCCAATG ATTGTTAAGT CTTTAAATCC TTTTTTAGCC AAATGCTTTG 15960 

CTGCAATCAC TGAAACTGCT GCAGTACGCA TACTACTAAT TAAACTTGCT TCCATAACTG 16020 

CAATTGGATA ATTCGTTTCT GGATCATTCA AAATAATGAC GCCACTTGCA CGCTCCATAT 16080 

TACGTTTCGA TGGATTGTCG TGCTTACTAC CTATCCACTT AATACCTGAA ATTGCGTGTT 16140 

CACCACCGAT ATGACTTGGC ATTGCAATAA TTCGATCTGC GATGTGTCCA TTTTCAGGAT 16200 

CCtGTCTTAA ATACGGCTTA AGCGGTTGTA CAAAATCATT GTGCGCATGG GCTGTTAATG 16260 

CTTCTGTTAA TGCGTCCACA TAAACTTGTG AATGATTACC TCCCGCTTGT TCAATATCTG 16320 

ATCTATTTAA ATACAACATC TCTCTatTCa TTCTGaTTTA ACTCCTTGTC TTGATTTCAT 16380 

TTTTTCTAAC CATGTATCTG AATAAACTAA ATCTAAGTAA CGATCX5CCTC GATCTGGTAA 16440 

AATCGTGACA ATTGTTGCAC CTTCTTCAAT TGACGTTATC AACTGCTCAA TCGCTGCAAT 16500 

AATCGAACCT GTTGAACCTC CGGCAAATAT GCCTTCATAA TCAATCAGTT TTCGACAGCC 16560 

CAAAGCAGAT TGATAATCAT CTACATGGAT CACTTGATTA ATTTCTGATC TATTCAATAT 16620 

TTCGGGTACA CGACTAGCAC CGATACCAGG TAATTCTCTA TTAATAGGTT TGTCACCAAA 16680 

AATGACTGAC CCTTTCGCAT CAACAGCAAC AATTTGTGCG TTTGGATGCA CTTCTTTTAT 16740 

TTTTCTACTC ATACCCATAA TGCTACCTGT CGTGCTGACT GGCGCGACAA AATAATCTAT 16800 

AGGTTGCTTA ATTGTTTCAA CAATCTCTGT GCCTGCACCA TGATAATGGG ATTGCCAATT 16860 

TAACTCATTC GCATATTGAT TAATCCAATA TGCATCGTCA ATAGTGGCTA ACAGTTCTTG 16920 
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TACATTGGCA CCATAACTTT TAATAATTTT CAAATTTGTT GGTGATATTT TAGGATCAAC 17040 

AACACACGTG AGTTTTAATC CCTTGATTTT AGCTATCATT GCCAACGCAA TGCCTAAATT 17100 

ACCAGAAGTA CTTTCAATTA AATGTGTATT CTCAGTGATT AAACCATGTT TAATACCATG 17160 

TTCAATGATG TACTTGGCAG GTCGATCTTT CATGCTGCCT CCAGGATTCA TATACTCTAA 17220 

CTTTGCAAAC ACTTCATGTT TCGGAAATAG TTGATGAAGT TGAACCATAG GT6TTTGCCC 17280 

TACAGAATCT AACAATGAAT CGTGCACATG 17310 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5423 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
ATACTAGTAA GCGCATCGGT TATTGACATC GAATTCAACT TTAACAGTTT TCATGTTCGG 60 

TGATGTTTCa ATAGAATGTG TGTGTTGTAC TTGCGCATTT ATATTTCCAC CTAAATTACT 120 
TAAGTTTCCT GTAATACTAG AAATGTCAGG TGCGTTTAAT GTAGGTTGAA ATGCATCAAC 180 

TACTTTATCT GCAACATTAG AAACATTACG GATAACTTTA CTTGAATGAT TATCTATACC 240 
TTTAACGAAA CCTAACATTG AATACATACC AACATCCATG AATTCACGTG AAGGTGAGTG 300 
AATACCTAGC GCTCTTTTGG CTGCATTTAA AGCACCTTTT GCTACACTAG CTGCTTTTTC 360 

AGCTAAGTCT CTAGCCATAT TACCAATACC TCTCATCAAA CCACGGATCA TATCAGCACC 420 
TGCTGATACA AAGTCATCCA CAAAGCTTTT AACTTTATTT ACTGCATTTG TCATACCTTG 480 
ACTAACTTTG TTTACAACAT TAACGAATCC TTGAACAACT CTATTAACAA rCTTAATTAG 540 

CGTACtTGTt ATAGTAGATA CCCaTnGCAT ACCTTTAGTG ACmATGAAGT TCCAAGCTTG 600 
AGACATTTTG TCTGATATAG TTGAAACAAC TTGTGTGAAT ATGCTTACAA CTTTATTCCA 660 
AATTGTCGTT AATATACCAG ATAAGAAACT CCAAATCGTA TTCCATATAT TAGAAATAAA 720 

ACTCCATGCC GCTTGTAACG CAGTAGATAT AGCTGTAGTG ATAGCGTTCC AAACCTTAGT 780 
TGCCACAGTA ACTATAGTGT TCCACAACGT TTGTAAGAAC GTCCAAATAG CGTTCCAAAT 840 
TGTTATTGCG ATAGTCATAA TTGTGGTAAA CACTGTAGTT ATTACAGTGA CTAACAAATT 900 
CCAAATCGTA GTAGCGATTG TAATTATCGT ATTCCAGATT GTACTTAAGA ACGTCCAAAT 960 
AGCTGTCCAT ATCGTCATAA CTATTGTCAT TATCGTCGTG AAAACAGTTG TAATGATTGT 1020 

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ATAAGCGACT ATTTGATTCC AAACAATCAT TATAAAATTG TAAACATTCG ATACTGCTGT 1140 

AGTGATAGCT GTTAAAATAG CATTCCATAC AACCGAAGCT ACAGCTTTTA ATACATTCCA 1200 

AACATTAACC ATAAACGTTT TTATCGCATT CCAAGCATTT ATAATAAAGT TTCTGAATCC 1260 

TTCATTTTTA TTCCACAATA AAACGAATAT AGCTATTAAT GCAGCAATTA CACCAATTAC 1320 

TATTGTTATT GGACCGCCTA AAATACCAAA CACAGTTACT AGTCCTGTGA TAGCATTTCT 1380 

AATTAATCCA ATCTTACCGA ATAACAATTG GAATATAACT GATATAATTT TTAATGGTCC 1440 

TTTTAATAAC ATGAACGCAC CTTTTAAAAT TGTTAATCCC GCTCTTAATA AACCGAACTT 1500 

ACTTACTAAT GCAATGrTTC TACCTATTAA TCCGCCACCC ATAAAGTTAG ATACAGCAAG 1560 

AATAATCGGT ATTAAAAATC TAAATGCACC AACTAAAGTT ATAATGACAC CAACTAATTG 1620 

TGCTGTAGCT GGATGCGCCT CAAACAAGTT AGCTATCCAA CCAGTTATTG CAACTGCAAC 1680 

GCGTAATACT GCACTAGCTA TAGGAGCCAT TGCTGTTGCG AATGCArmTA ATCCTCTTGC 1740 

GATGTTTCCA ATCAATTGCA TTATTAGTGG TCCATTTGTT TGTATATAAC TGACAAAGTC 1800 

TTTAAACCCT TGAGATTGTC CTACTTGTTC AGACCATTCC CTAAACTTAG CTGTCATTTG 1860 

TTCAAGAGAT TGGAATATGC CAGTTGATGA TCCGCTGAAT GCATTCATCA AATTGTTAAT 1920 

TCCAACGAAA ACATTTTTGA AAATATTACC AATGATAGGT AAGTTTGTTT TTGTGTATTC 1980 

AATAAAACGA GTTATCGAAT TTTCTCCAGC TGCACTATTA GCCCAGTTAG AGAAAGATTG 2040 

ACCTAATCTA TCCAACCAAT CAGCCGACCA TTGAAACAGT GGTGCTAATT GCGTGAATAC 2100 

ATTGACTAAT CCGTCACCAA AACCACCTGC AGCACTTAAT AGCTTGTTAA ATACCGAAAC 2160 

ACCCGTTGTA TTCATCATAT TAAAGAATCT TGAAGCTACA CTGCTATTTT CAGCCCATTT 2220 

AAGCACGCTT TGAGACGCTT CTTCCATTCC TCTTGAAATA CCACTAAAAA ACX3GTTGTAA 2280 

GCTCTGCATT GCAGTTTTAA CAGTATTTAA ACCATTTGCA AGAGTTGTGA AGATAGCGGA 2340 

TTGATTTTGC TTTATAATAT CAGTCCATGC TGACTTTACG CCATCTAACG CTTTTTTGTA 2400 

TTCGTTTGTT QCTGAGCTAG CTTGTAAAGT GCCATCATTA AGCATCTTTA TAGCGCTGAT 2460 

AGCCATTGCG CCAAACGCTA CAAATCCTGC TCCCGCTATT GCTACGGCAC CACCTAAAGC 2520 

AAGTACACCA CCAGTTAACA CTTTGATAGC GTTTAATAGC GCAAATACTA CAGGTACTAC 2580 

GCTCGCTATT ACAGGTATTA AGATACTAAA AGATGATGTA AGTAATCCAC CAACCATATT 2640 

AGAACCTACA GTACCGAACA CACGGAACAT ATTAGCTAAA TTCCCCATCT GTCTTTGAAA 2700 

ATTGTCATTT GCTTTTATTA TGTAGGCATA AGCTTTCTTT AAACCATTAG TATCGACATC 2760 

TACCTTTGTT GTTTTTTTGT TCGGCAATGC GTCTAATGAT TTTTTAAACG CATAAATAGT 2820 
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70 



AAGTTCTTCT TTAGTACGTT TGATTTTAGA GTTAGCAACA CCATTGTCCA CGTCTATAAT 2940 

AGCTTTGGCT TTAGACCTAT TTAATGCTTC GAGACTAGCT TTAGATACTT TTAACACTCG 3000 

ATTGAATTTA CTGTTATCTG CATTGACGTC AATATTGACA CGTTTCTTTT CTAATTCTGA 3060 

TAATTTAGCT TCTGTTTCAG CGATATCTTT AATCAACTTT TGTTTTTGCA ACTTAACTTC 3120 

TGGTGTAACT TCTTTAGAGT TTAGTTTGTC TAGTTCAAAA TTCGATTCTA GTACCTTTTG 3180 

TTGTAAATCT TGTATACTAG CATCTAATTT AGCTTTTACA TTTTTGTTAC TAAAGGCATC 3240 

TAAAGACTTT TTAGCAACTT TGATAGTTTT TTGTAAATTT TTATCGTTAG CGTTTAATTC 3300 

AACATCTTTA GTTTGATCTG CTACTCGTTT AAATCTTTGC ACAGACTTAA CCX3CACTATC 3360 

AATTTGCCTT TTGAATTTGG CTACACTAGC TTCAATAGTC GCTTTAATTT TATATTCCGT 3420 

CACATTAACA CCTCTCTTTC TATTGCTTAT TAAATTCTGC TATAACTTTA AAGAATTCAT 3480 

TATTTTGTGG TTCGTATTCA TCACGTTCGC TACTAAATCT TATATCTTTA CCTTCGTTAA 3540 

GCCGTTGGAT ATTTTCTTCA TAAGGCAATA CGTCGTTTGC ATTGTTAAAA ACATATTCCT 3600 

CTTTAGGTTT ATTTTCTGTC CCAACATTTT TAGTAGCTGC AGCATCACGA ATAGCAAACG 3660 

CAAGTTTGTA ACGTTCGAAT TCTTGGGTTA GCATTTCATA CTCTTTCGCA TACATTCX3AT 3720 

AGTTATATTC TGTTAATGTC ATTTGCTCAA TAACGTTCAA ATCTGTAATA CCAAGTGTTG 3780 

ACATACAAGT TATAACGATT CTGTCGTAAG TTATTAGGcT TCCGCTGGTT TTTCTTCCGT 3840 

TTCCACTACT TCGACTAGGT TTCGGGTCAT AGGTCGCTTT CCCAACTCCG TTAAAATATC 3900 

C6AACCGAAT TCTTCTAGTC CGATATTTTC TGCGATTTCA TCTAATGCTT CATCAATGTT 3960 

ATTAATAGTA ATTGCTTGTT TTTTTAAGTG AGATGTAGCT GCGATTAAAA CTTCX5CCAAT 4020 

CACAACCGGA TTTCCACTTT CTAAACCTAC AGGCAACATT GATACACCTT GACCGATAQA 4080 

AGCTTGTTCA ACTTTTAAAC CTAATCGGTT ATCGATTTCT CTTAAAAATT TAAAACCAAA 4140 

ACTTAATTCT AATGACTTTC CGTTAATTTC TACATTCATA ACTTAAAATC TCCATTCATA 4200 

ATTAATTTAA ACAAAATAAA cnArGCTTAAC GCCCTATTTT TATACCTCTC TTGGTGCAAC 4260 

CGGTGGTGAA TCTACTTTAG GTTGTGGAAT TGCTGTTAAA TCTTCGCCAG TTAATGCATC 4320 

TGCTTTTGTA GTGTCGTGGA ATCTGTATcC AGTCGCCTTA AGTTTCTTTG TTACAGCCTC 4380 

AGGTAGTGTT GCAAATCCAC GTTGGAAACG ACCATTCACT CCATATTCAT ATTCATATTC 4440 

ATCAATACCG TTAGCTTCTG CTTTTAATTC AAATTTATTG TGGAAACCTT GGAAATATTT 4500 

CGCTTTAAAT TTAGCGGAAT CCCCATTTTT GCCTGGTATT CTACTTTCAA CTTCCCAAGC 4560 

TTCATACAAT ACGCX3ATCTA CAACTGCATC TTCAATTTCA TCTGCAAAAT CGTCACCATA 4620 
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5 



GTCCATTGTA TCCTCTGTAT CTGTATCAGC TTCATGTGAT AAGCCGTATT CAGTTAAAAA 4740 

AAGCATTTTA GTAGCATCTA CTTTTTCGCC AGCTTTTCTA AATAAAATAA TACGATCATT 4800 

ACTATTTTTC ATATTTGCCA TTCAATATTC CTCCGTTTTT TAAAATGTTT TGTAAGATAT 4860 

CGTTACTGAT GTGTGTAGCA ATTCTTGATT GGTAGTATCA TCAACTAACT GTGTGATGTT 4920 

AGTATCTTCT TCTTCAAAGT CATAATCGTT TGTTTTAACG CTAGGTGTTA AATCATCAAT 4980 

ACATCTTTTA ACAAGTCCGT CATGATGTCC TAAATCATCG CTTACACTCC AAATATCAAT 5040 

AACTAAATTC GTATCGCCAG AATAACTATC AAACGTGTAC TTACTTCTAT TTGACTCCGG 5100 

CATTTTTATT ACAAAAAAAG GATACGGAAT CTCTTGTTGC ATCTCTTTAC GAGAAATAAC 5160 

AGGGAATCCA TATCCTTGTA GCGTTTCATA CGCTTTATTA TAAAGTTGTA AGTTCGGTGT 5220 

CATGCTTTTA TCTCCTATTC AAACAACX3CT TTCAATTCTT CTACAGTTGA TTTCCTAATC 5280 

ACTTCGTATA CCGGCCACAT AAAAGGTTCA GCCTCCATGT ATCGAGTACC AAATTCTAAG 5340 

AAACCACTAT AAGCTGCGTG CGATGTGATA GTGTATTGCA AATCGCCAGT TTTTTTATAT 5400 

CTGATATTGC GTGATaAATT ACC 5423 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6251 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

AAACGCAGAT GTTCAATTAG AACCAGTCTA TCGTATTAAG GAAGGTATTA AACAAAAGCA 60 

AATACGAGAC CAAATTAGAC AAGCGTTAAA TGATGTGACA ATTCATGAAT GGTTAACTGA 120 

TGAACTAAGA GAAAAATATA AATTAGAGAC CTTGGACTTT ACTTTGAACA CATTACATCA 180 

TCCTAAAAGT AAAGAGGATT TATTACGTGC TCGTAGAACC TATGCATTTA CTGAACTGTT 240 

TTTATTCGAA TTACGTATGC AATGGCTAAA TAGATTAGAA AAGTCATCTG ACGAAGCAAT 300 

TGAAATTGAT TATGACATAG ACCAAGTTAA ATCATTTATT GATCGTTTAC CTTTTGAACT 360 

AACTGAAGCA CAGAAATCCA GTGTTAATGA AATTTTTAGA GATTTAAAAG CACCAATACG 420 

TATGCATCGA TTACTTCAAG GTGATGTAGG TTCAGGAAAA ACAGTAGTTG CTGCAATTTG 480 

TATGTATGCG TTAAAAACTG CTGGTTATCA ATCAGCATTG ATGGTACCAA CTGAAATTTT 540 

AGCAGAGCAA CATGCTGAAA GTTTAATGGC TTTATTTGGA GATTCTATGA ACGTTGCATT 600 

TACGATTGAT TGTTTAATTG GAACCCATGC TTTGATTCAA GATGATGTGA TTTTCCATAA 720 

TGTTGGTTTA GTAATTACAG ATGAACAACA TCGATTTGGT GTGAATCAAC GCCAGCTTTT 780 

AAGAGAAAAA GGTGCAATGA CGAATGTGTT ATTTATGACA GCAACGCCGA TACCAAGAAC 840 

ACTAGCAATA TCAGTTTTTG GTGAGATGGA TGTGTCTTCA ATTAAACAAT TACCAAAAGG 900 

TCGTAAACCT ATCATTACTA CTTGGGCAAA GCATGAGCAA TACGATAAAG TTTTGATGCA 960 

AATGACCTCA GAGITGAAAA AAGGTCX7TCA AGCATATGTC ATTTGCCCGC TAATAGAAAG 1020 

TTCTGAGCAT CTCGAAGATG TTCAAAATOT TGTCGCATTG TACGAGTCTT TACAACAGTA 1080

TTATGGTGTT TCCCGTGTAG GQTTATTGCA TGGTAAGTTA TCTGCOGATG AAAAAGATGA 1140

GGTCATGCAA AAGTTTAGTA ATCATGAGAT AAATGTTTTA GTTTCTACTA CTGTTGTTGA 1200 

AGTAGGTGTT AATGTACCGA ATGCAACTTT TATGATGATT TATGATGCGG ATCX3CTTTGG 1260 

ATTATCAACT TTACATCAGT TACGCGGTCG TGTAGGTAGA AGTGACCAGC AAAGTTACTG 1320 

TGTTTTAATT GCATCCCCTA AAACAGAAAC AGGAATTGAA AGAATGACAA TTATGACACA 1380 

AACAACGGAT GGATTTGAAT TGAGTGAACX5 AGACTTAGAA ATGCGTGGTC CTGGAGATTT 1440 

CTTTGGTGTT AAACAAAGTG GaTTGCCAGA TTTCTTAGTT GCCAATTTAG TTGAAGATTA 1500 

TCGTATGTTA GAAGTTGCTC GTGATGAAGC AGCTGAACTT ATTCAATCTG GCGTATTCTT 1560 

TGAAAATACG TATCAACATT TACGTCATTT TGTTGAAGAA AATTTATTAC ATCGTAGTTT 1620 

TGACTAATTG CCATGCTGAT TTGTCAATTT GAGTGCAACa CTTCGTTAAT TGAGTGATAT 1680 

GACACTTGAA CTATTTAAAT GTAAAGTGGT ATTTTAACAA TTTATAAATT TTCX3ACTAAA 1740 

TAATAGCTAA ATATTACAGT TATTTGTTGA GTCGGTTAAA TAGAAAGTGT TATGATATGT 1800

GAGGAATGTT TAAGACTAGG TACTAAAAAA TGAGGGGTGA GACGTTGAAA CTAAAGAAAG 1860 

ATAflACGTAG AGAAGCAATC AGACAACAAA TTGATAGCAA TCCCTTCATC ACAGACCATG 1920 

AACTAAGCGA CTTATTTCAA GTGAGTATAC AAACAATTCG TTtAGaTCGC ACTTATTTAA 1980 

ACATACCAGA ATTAAGGAAG CGTATTAAAT TAGTTGCTGA AAAGAATTAT GACCAAATAA 2040 

GTTCTATTGA AGAACAAGAA TTTATTGGTG ATTTGATTCA AGTCAATCCa AATGTTAAAG 2100 

CGCAATCAAT TTTAGATATT ACATCGGATT CTGrTTTTCA TAAAACTGGA ATTGCGCGTG 2160 

GTCATGTGCT GTTTGCTCAG GCAAATTCGT TATGTGTTGC GCTAATTAAG CAACCAACAG 2220 

TTTTAACTCA TGAGAGTAGC ATTCAATTTA TTGAAAAAGT AAAATTAAAT GATACGGTAA 2280

GAGCAGAAGC ACGAGTTGTA AATCAAACTG CAAAACATTA TTACGTCGAA GTAAAGTCAT 2340 

ATGTTAAACA TACATTAGTT TTCAAAGGAA ATTTTAAAAT GTTTTATGAT AAGCGAGGAT 2400 

TTAGAAGCCG TACAAAAGGC TGTTGAAGAC TTTAAAGATC TAGAAATTAT ACTTTTCGGT 2520 

GACGAAAAAA AGTATAATCT GAACCATGAA CGAATCGAAT TTAGACATTG TTCTGAAAAG 2580 

ATTGAAATGG AAGATGAGCC TGTTAGAGCG ATTAAACGTA AAAAAGATAG CTCAATGGTA 2640

AAAATGGCTG AAGCTGTGAA ATCTGGTGAA GCAGATGGAT GTGTGTCAGC AGGTAATACT 2700 

GGTGCTTTAA TGTCAGCTGG TTTATTCATT GTTGGACGTA TTAAAGGTGT AGCTAGACCG 2760 

GCTTTAGTAG TAACATTGCC AACGATTGAT GGAAAAGGTT TTGTCrTTTT AGACGTTGGT 2820 

GCAAATGCTG ATGCTAAACC TGAACACTTA TTACAGTATG CGCAACTAGG GGATATTTAT 2880 

GCTCAAAAAA TTAGAGGTAT TGATAATCCG AAAATCTCAT TATTAAATAT AGGAACCGAG 2940

CCAGCTAAAG GTAATAGTTT AACGAAAAAA TCATATGAGT TATTAAATCA TGATCATTCA 3000 

TTGAATTTTG TTGGGAATAT TGAAGCGAAG ACATTAATGG ATGGCGATAC AGATGTTGTA 3060 

GTTACCX3ATG GCTATACTGG GAACATGGTC CTTAAAAATT TAGAAGGTAC TGCAAAATCA 3120 

ATCGGTAAAA TGTTAAAAGA TACGATTATG AGTAGTACTA AAAATAAATT AGCAGGTGCA 3180 

ATATTGAAGA AAGATTTAGC TGAATTCGCT AAAAAGATGG ATTACTCAGA ATACGGTGGT 3240

TCCGTATTAT TAGGATTGGA AGGTACTGTA GTTAAAGCAC ACGGTAGTTC AAATGCTAAA 3300 

GCTTTTTATT CTGCAATTAG ACAAGCGAAA ATCGCAGGAG AACAAAATAT TGTACAAACA 3360 

ATGAAAGAGA CTGTAGGTGA AtCAAATGaG TaAAACAGCA ATTATTTTTC CGGGACAAGG 3420 

TGCCCAAAAA GTTGGTATGG CGCAAGATTT GTTTAACAAC AATGATCAAG CAACTGAAAT 3480 

TTTAACTTCA GCAGCGAACA CATTAGACTT TGATATTTTA GAGACAATGT TTACTGATGA 3540 

AGAAGGTAAA TTGGGTGAAA CTGAAAACAC ACAACCAGCT TTaTTGaCGC aTAGTTCGGC 3600

ATTATTAGCA GCGCTAAAAA ATTTGAATCC TGATTTTACT ATGGGGCATA GTTTAGGTGA 3660 

ATATTCAAGT TTAGTTGCAG CTGACGTATT ATCATTTGAA GATGCAGTTA AAATTGTTAG 3720 

AAAACGTGGT CAATTAATGG CGCAAGCATT TCCTACTGGT GTAGGAAGCA TGGCTGCAGT 3780 

ATTGGGATTA GATTTTGATA AAGTCGATGA AATTTGTAAG TCATTATCAT CTGATGACAA 3840 

AATAATTGAA CCAGCAAACA TTAATTGCCC AGGTCAAATT GTTGTTTCAG GTCACAAAGC 3900 

TTTAATTGAT GAGCTAGTAG AAAAAGGTAA ATCATTAGGT GCAAAACGTG TCATGCCTTT 3960 

AGCAGTATCT GGACCATTCC ATTCATCGCT AATGAAAGTG ATTGAAGAAG ATTTTTCAAG 4020 

TTACATTAAT CAATTTGAAT GGCGTGATGC TAAGTTTCCT GTAGTTCAAA ATGTAAATGC 4080

GCAAGGTGAA ACTGACAAAG AAGTAATTAA ATCTAATATG GTCAAGCAAT TATATTCACC 4140 

AGTACAATTC ATTAACTCAA CAGAATG6CT AATAGACCAA GGTGTTGATC ATTTTATTGA 4200

AACATCAATT CAAACTTTAG AAGATGTGAA AGGATGGAAT GAAAATGACT AAGAGTGCTT 4320

TAGTAACAGG TGCATCAAGA GGAATTGGAC GTAGTATTGC GTTACAATTA GCAGAAGAAG 4380

GATATAATGT AGCAGTAAAC TATGCAGGCA GCAAAGAGAA AGCTGAAGcA GTAGTCGAAG 4440

AAATCAAAGC TAAAGGTGTT GACAGTTTTG CGATTCAAGC AAATGTTGCC GATGCTGATG 4500

AAGTTAAAGC AATGATTAAA GAAGTAGTTA GCCAATTTGG TTCTTTAGAT GTTTTAGTAA 4560

ATAATGCAGG TATTACTCGC GATAATTTAT TAATGCX^TAT GAAAGAACAA GAGTGGGATG 4620

ATGTTATrGA CACAAACTTA AAAGGTGTAT TTAACTGTAT CCAAAAAGCA ACACCACAAA 4680

TGTTAAGACA ACGTAGTGGT GCTATCATCA ATTTATCAAG TGTTGTTGGA GCAGTAGGTA 4740

ATCCGGGACA AGCAAACTAT GTTGCAACAA AAGCAGGTGT TATTGGTTTA ACTAAATCTG 4800

CGGC6CGTGA ATTAGCATCT CGTGGTATCA CTGTAAATGC AGTTGCACCT GGTTTTATTG 4860

TTTCTGATAT GACAGATGCT TTAAGTGATG AOCTTAAAGA ACAAATGTTG ACTCAAATTC 4920

CGTTAGCACG TTTTGGTCAA GACACAGATA TTGCTAATAC AGTAGCGTTC TTAGCATCAG 4980

ACAAAGCAAA ATATATTACA GGTCAAACAA TCCATGTAAA TGGTGGAATG TACATGTAAT 5040

ATATTTGAGC TAAAGCTCAT TGACGCAGTG GTTGACTGGT CATCCAATGG AGAATTGTCT 5100

GACCTAGTCA ACTTTGCGGG GGAAATTCTA AGCAACCTAG ATAAGGTTCC AGAATTTCTC 5160

CCTAAGAAAC ACTAATCAAT aAATTGwTAA GTGTTTCTAA AATTTCTACT TGTTTTTTAG 5220

AATTTAAAAT GGGAAAATAT AGTAGTCTAT GTATAGGCAT TTTTAAAGGA GGTGAATCGA 5280

CGTGGAAAAT TTCGATAAAG TAAAAGATAT CATCGTTGAC CgTTTAGGTG TAGACGCTGA 5340

TAAAGTAACT GAAGATGCAT CTTTCAAAGA TGATTTAGGC GCTGACTCAC TTGATATCGC 5400

TGAATTAGTA ATGGAATTAG AAGACGAGTT TGGTACTGAA ATTCCTGATG AAGAnGCTGA 5460

AAAAffrCAAC ACTGTTGGTG ATGCTGTTAA ATTTATTAAC AGTCTTGAAA AATAATAAAT 5520

CTTACATCTG GGTCGTCAGT ATTGTCGACT CAGTTTTTTT CTTTAATTAT CAATAGTTTT 5580

AACGTAAAAT TAAAGATGAT TCAAGAGCAA CACATAAAGG AGATAAAATA ATGTCTAAAC 5640

AAAAGAAAAG TGAGATAGTT AATCGTTTTA GAAAGCGCTT TGATACTAAA ATGACAGAGT 5700

TAGGCTTTAC TTATCAAAAT ATTGATTTAT ACCAACAAGC ATTTTCGCAT TCGAGTTTTA 5760

TTAATGATTT TAATATGAAT CGTTTAGACC ATAATGAGCG TTTAGAGTTT TTGGGTGATG 5820

CGGTATTAGA ATTGACGGTT TCACGATATT TATTTGATAa ACATCCCAAC TTGCCAGAAG 5880

GGAATTTAAC AAAAATGCGT GCCaCTATTG TATGTGAGCC CtCACTkGTA ATATTTGCGA 5940

ATAAAATTGG ATTGAACX5AA ATGATTTTAC TTGGTAAAGG TGAAGAGAAA ACAGGGGGAC 6000

ATCAAGGACT AGATATAGTT TGGAAATTTG CTGAGAAAGT CATTTTCCCA CATGTAGAAC 6120

AAAATGAGTT ATTAGGCGTG GTAGATTTTA AAACACAATT CCAAGAATAT GTGCACCAGC 6180

AAAATAAAGG TGATGTAACC TATAATTTAA TAAAAGAAGA GGGACCGGCA CATCATCGTC 6240

TATTCACTTC A 6251



(2) INFORMATION FOR SEQ ID NO: 26:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4920 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
ACCTACTGAA GTTGCTAATT TTTTGGAGCA ACTAAGCACT GAAATTGAAC GTCTTAAAGA 60 

AGATAAAAAA CAACTTGAAA AAGTAATCGA AGAGAGaGAT ACTAATATTA AGTCTTATCA 120 

AGACGTGgCA TCAATCTGTA AGTGaTGCTT TGATACAAGC TCAAAAAGCT GGTGAAGAAA 180 

CTAAGCAAGC TGCAGAGAAA CAAGCTGAAG CGATTATAGC TAAGGCAGAA GCGCAAgcTA 240 

ATCAAATGGT TGGTGACGCG GTAGAAAAAG CACGCCGTTT AGCATTCCAG ACTGAAGATA 300 

TGAAACGTCA ATCAAAAGTA TTTAGATCGC GTTTCCGTAT GTTAGTTGAA GCGCAATTAG 360

ACTTATTAAA AAACGAAGAT TGGGATTACT TGTTGAATTA TGATTTAGAC GCTGAACAAG 420 

TGACGCTTGA AAATATTCAT CATTTGCATG AAAATGATTT AAAGCCAGAT GAAGTTGCAG 480 

CAAATGCACA AAATAATGCA TCAAATACAC CAGACAATAA TCAACAATCC AATGATTCAG 540 

AAACAACTAA GAAGTAAGAA TTAAATAAAG ACAGACGCGT AATATACATT TAACTTTTCA 600 

CAGCGAATTA GGTAATGGTG AGAGCCTAGT AAAAOCATGT ATGTTATATC ACTGGCTTTT 660 

TAATATTTAA ATAATGTAAT GAGAGAACTC TAAGTTGAGT TAATAAGGGT GGTACCGCGA 720 

GCAATCGTCC CTTTTAATTT AACTTAGAGT TTTTTAAATT TTTAAGGAGT GAAAAAAATG 780 

GATTACAAAG AAACGTTATT AATGCCTAAA ACAGATTTCC CAATGCGAGG TGGTTTACCA 840 

AACAAGGAAC CGCAAATTCA AGAAAAATGG GATGCAGAAG ATCAATACCA TAAAGCGTTA 900 

GAAAAAAATA AAGGTAACGA AACATTCATT TTACATGATG GCCCACCATA CGCGAATGGT 960 

AACTTACATA TGGGACATGC CTTGAACAAA ATTTTAAAAG ACTTTATTGT ACGTTATAAA 1020 

ACTATGCAAG GGTTCTATGC ACCATACGTA CCAGGTTGGG ATACACATGG TTTACCAATT 1080 

GAACAAGCAT TAACGAAAAA AGGTGTTGAC CGAAAGAAAA TGTCAACAGC TGAATTCCGT 1140

TTAGGTGTTC GTGGTGACTT TAATGATCCA TATATTACAT TAAAACCTGA ATACGAAGCT 1260

GCACAAATTC GTATTTTTGG AGAAATGGCA GATAAAGGTT TAATTTATAA AGGTAAAAAG 1320 

CCAGTTTATT GGTCTCCTTC AAGTGAGTCT TCATTAGCAG AAGCAGAAAT TGAATATCAC 1380 

GATAAACGTT CAGCATCAAT TTACGTTGCA TTTGACGTTA AAGATGACAA AGGTGTCGTT 1440 

GATGCAGATG CTAAATTTAT TATCTGGACA ACAACGCCAT GGACAATTCC ATCAAATGTT 1500 

GCGATTACOG TTCATCCTGA ATTAAAATAT GGTCAATACA ATGTAAATGG CGAAAAATAT 1560 

ATTATTGCAG AAGCCTTGTC TGACGCTGTA GCAGAAGCAC TGGaTTGGGA TAAAGCATCA 1620 

ATCAAAITAQ AAAAAGAATA CACAGGTAAA GAATTAGA6T ATGTTGTAGC ACAACATCCA 1680

TTCTTAGACA GAGAATCGTT AGTGATTAAT GGTGATCATG TTACTACAGA TGCTGGTACA 1740 

GgTTGTGTAC ATACAGCACC AGGTCACGGG GAAGATGACT ATATTGTTGG TCAAAAATAT 1800 

GAATTGCCAG TAATTAGTCC AATCGATGAT AAAGGTGTAT TTACTGAAGA AGGCGGCCAA 1860 

TTTGAAGGGA TGTTCTATGA TAAAGCTAAT AAAGCCGTTA CTGATTTATT AACAGAAAAA 1920 

GGTGCACTAT TAAAATTAGA CTTTATTACA CATAGCTATC CACACGACTG GAGAACAAAA 1980 

AAACCTGTAA TCTTCCGTGC TACACCACAA TGGTTTGCCT CAATCAGTAA AGTAAGACAA 2040 

GATATTTTAG ATGCAATCGA AAATACAAAC TTCAAAGTAA ATTGGGGTAA AACACGTATT 2100 

TACAATATGG TTCGTGACCG TGGCGAATGG GTTATTTCTC GTCAACGTGT GTGGGGTGTA 2160

CCGTTACCAG TATTTTATGC TGAAAATGGC GAAATTATCA TGACGAAAGA AACAGTGAAT 2220 

CATGTTGCTG ATTTATTTGC AGAACACGGT TCAAATATTT GGTTTGAAAG AGAAGCGAAA 2280 

GACTTACTAC CAGAAGGATT TACACATCCA GGCAGCCCTA ACGGTACATT TACTAAAGAA 2340

ACAGACATTA TGGACGTTTG GTTTGATTCT GGTTCATCAC ACCGTGGCGT GTTGGAAACA 2400 

AGAGCGGAAT TAAGTTTCCC AGCGGATATG TATTTAGAAG GTAGTGACCA ATATCXSTGGT 2460 

TGGTTCAACT CTTCTATCAC AACTTCAGTT GCTACAAGAG GAGTATCACC TTATAAATTC 2520 

TTACTTTCTC ATGGTTTTGT TATGGACGGT GAAGGTAAGA AAATGAGTAA ATCTTTAGGT 2580 

AATGTGATTG TACCTGACCA AGTGGTTAAA CAAAAAGGTG CTGATATTGC GAGACTTTGG 2640 

GTAAGTAGTA CGGACTATTT AGCTGATGTT AGAATTTCTG ATGAAATTTT AAAACAAACA 2700 

TCTGATGTTT ATCGTAAAAT CAGAAATACA TTAAGATTTA TGTTAGGTAA CATTAACGAT 2760 

TTCAATCCTG ACACAGATAG CATTCCTGAA TCAGAGTTAT TAGAAGTGGA TCGTTACTTG 2820

CTAAATCGTT TACGTGAATT TACTGCAAGT ACGATTAACA ACTATGAAAA CTTTGACTAC 2880 

TTAAATATTT ATCAAGAAGT TCAAAACTTT ATCAATGTTG AGTTAAGTAA TTTCTATTTG 2940

CAAACAGTGT TATATCAAAT TTTAGTTGAT ATGACGAAGT TGTTAGCACC AATCTTAGTG 3060

CATACAGCTG AAGAAGTTTG GTCTCATACA CCACATGTTA AAGAAGAAAG TGTTCACTTA 3120 

GCAGACATGC CTAAAGTTGT AGAAGTAGAT CAAGCTTTAT TGGATAAATG GCGTACATTT 3180 

ATGAATTTAC GTGATGATGT GAACCGTGCA TTAGAAACTG CTCGTAATGA AAAAGTTATT 3240 

GGTAAATCAT TAGAAGCTAA AGTTACGATT GCTAGTAACG ATAAATTTAA TGCATCTGAA 3300 

TTCTTAACTT CATTTGATGC ATTACATCAA TTATTTATCG TGTCACAAGT TAAAGTTGTA 3360 

GATAAGTTAG ACGATCAGGC AACAGCTTAT GAACATGGTG ATATTGTCAT CGAACATGCA 3420 

GATGGTGAAA AATGT6AAAG ATGTTGGAAC TATTCAGAGG ATCTTGGTGC TGTTGATGAA 3480 

TTGACGCATC TATGTCCACG ATGCCAACAA GTTGTAAAAT CACTTGTATA ATTGAAATTG 3540 

TATAAAGTAC TCATACAGAT GATATAAATT AAAGCTCTCT TCATAATCAT GTTGTAGTTT 3600 

TTGTT6ACAT GATGAAGAGA GTTTTTTTGT GAATAAAAAA ATGACGAAGT TACCGGTCAT 3660 

ATATGTAAAA AATGTGCGAT TTACTAAAAT AAAAATTATT CAGGAATGGT ACAAATTCTC 3720 

TGAGGCATAT AAATGCGTTA TAGTTGCTAT TCTCAATTAT GTTCGCGATA ATTTTAAGTA 3780 

AAAGTAAGCA CAGATATTGA ATTTGATAGG AGTTAATTGA ATGTATCATA ACAGTAACGC 3840 

AAACTTTGTC AATGGTATCA CTTTAAAT6T GAGAGATAAG AATGAATTAA AGCCATTTTA 3900 

TGAGGACATA TTAGGATTAA ATATTATAAA TGAGACATTA ACATCGATAC AATATGAAGT 3960

AGGTCAAAAT AATCATGTCA TTACACTTGT TGAATTACAA AATGGACGTG AACCTTTAAT 4020 

6TCCX5AAGCG GGACTGTTTC ATATCGCAAT TAAACTACCT CAAATTAGTG ATTTAGCTAA 4080 

TTTACTAATT CATTTAAGCG AATATGATAT TCCAGTTAAC GGAGGTATAC AGCCTGCTTC 4140 

GTTATCATTA TTTTTTGAAG ACCCGGAAGG AAACX3GTTTT AAATTTTATG TTGATAAAGA 4200 

CGAA6CGCAA TGGACGAGGC AAAATAATTT AGTAAAAATT GATATTAGAC CATTAAATGT 4260 

ACCGAGATTA GTGAGTCATG CAACAAAATT GTTATGGTTA GGTATTCCAG ATGACGCTAT 4320

TATAGGTGCA TTGCATATTA AGACAATTCA TTTATCAGAG GTAAAAGAGT ACTACCTCGA 4380 

TTATTTTGGA TTAGAGCAAT CGGCATATAT GGATGATTAT TCAATATTTT TAGCATCGAA 4440

TGGCTATTAT CAACATTTGG CCATQAATGA TTGGGTATCA GCAACGAAAC GTGTAGAAAA 4500 

TTTTGATACG TATGGATTAG CAATTGTTGA CTTTCATTAT CCTGAAACAA CACATTTAAA 4560 

TTTACAAGGT CCGGATGGTA TCTATTATCG CTTTAATCAT ATCGAAGTTG AAGATTAGTA 4620

TATACTTTGA ATGGACGAAC CATATAATGA ATCGTTTTTA ATGATCTTTT TATACAAGTT 4680 

ATGAAGGAGG CTGGGACATT AAGTTCTTAG GCAATGTAAA AAGCTGATTT CTATTAATTA 4740

TTTTCCTTAT ATTAATTGCC ATTAATACAA AACCTAGCTC TCGTTTAACT TTATTTATTC 4860
CTCGAACTGA CATTCGnGTG AACTCAAAAT nGCCTACTTn CTTAAATTAC CAATATCTAT 4920 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 626 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:

TGGATTGCCA TTACATGGAC AAGATTTAAC TGAATCAATT ACACCATATG AAGGTGGTAT 60 

CGCTTTTGCA AGTAAACCAT TAATTGAT6C TGATTTTATT GGTAAATCTG TATTAAAAGA 120 

TCAAAAAGAA AATGGTGCAC CAAGAAGAAC AGTGGGATTA GAATTACTTG AAAAAGGAAT 180 

TGCAAGAACT GGTTATGAAG TTATGGATTT AGATGGAAAT ATTATTGGAG AAGTAACTTC 240 

AGGAACACAG TCTCCATCAT CAGGAAAATC AATTGCACTT GCAATGATAA AAAGAGATGA 300 

GTTTGAAATG GGTAGAGAGT TGCTTGTTCA AOTTCGTAAG CGTCAATTAA AAGCGAAAAT 360 

TGTTAAGAAA AATCAAATTG ATAAATAATT AAAAAGGGGT GTGCATTGTG AGTCATCGTT 420 

ATATACCTTT AACTGAAAAA GACAAGCAAG AAATGTTACA AACAATTGGT GCAAAATCTA 480 

TAGGAGAATT ATTCGGTGAT GTACCAAGTG ACATTTTATT AAATAGAGAT TTAAATATTG 540 

CTGAAGGCGA ACGGAGAACA ACGTTACTTA GAAGATTnAA TCGCATTGCA AGCAAGAGTA 600 

TCACTAGAGG AACGCGTACA TCGTTT 626

(2) INFORMATION FOR SEQ ID NO: 28:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1126 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:
nGGAAGTGGT GTATATATTT GTAATGAGTG TATTGAATTA TGCTCAGAAA TCGTCGAAGA 60

AGAATTAGCT CAAAACACTT CTGAAGCGAT GACAGAATTA CCTACTCCTA AAGAAATTAT 120

GGATCATTTA AACGAATATG TTATTGGTCA AGAAAAAGCT AAAAAATCTT TAGCTGTAGC 180

TGTTTATAAC CACTATAAGC GTATTCAACA ATTAGGACCA AAAGAAGATG ATGTTGAATT 240

AACCTTAGCC AAGACGTTGA ATGTACCATT TGCAATTGCA GATGCGACAA GTTTAACTGA 360

AGCTGGTTAT GTAGGCGATG ATGTTGAAAA TATCTTGTTG AGATTAATTC AAGCAGCTGA 420

CTTTGACATT GATAAAGCCG AAAAAGGTAT TATTTATGTA GATGAAATTG ATAAAATTGC 480

ACGTAAATCT GAAAACACAT CTATAACACG TGACGTTTCA GGTGAAGGTG TTCAACAAGC 540

ATTGCTTAAA ATCTTAGAAG GTACGACTGC AAGTGTTCCG CCACAAGGTG GACGCAAACA 600

TCCAAACCAA GAAATGATTC AAATTGATAC AACAAATATC TTATTTATTC TTGGTGGTGC 660

CTTTGATGGT ATTGAAGAAG TGATTAAGCG CCGTCTTGGT GAAAAAGTTA TTGGTTTCTC 720

AAGCAATGAA GCTGATAAAT ATGAC6AACA AGCATTATTA GCACAAATTC GCCCAGAAGA 780

TTTGCAAGCC TATGGTTTGA TTCCTGAATT TATCGGACGT GTGCCAATTG TAGCTAATTT 840

CGTTGAAAAA CATCTTAACX3 CAACCTAAAA ATGCACTTGT 900

GAAACAATAT ACTAAAATGC TGGAATTAGA TGATGTGGAT TTAGAGTTCA CTGAAGAAGC 960

TTTATCAGCA ATTAGTGAAA AAGCAATTGA AAGAAAAACA GGTGCGCGTG GTTTACGTTC 1020

AATCATAGAA GAATCGTTAA TCGATATTAT GTTTGATGTG CCTTCTAACG AAAATGTAAC 1080

GAaGGTAGTT ATTACAGCAC AAACmATTAA TGrAGaACTG AACCAG 1126



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4392 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 



ATTGACTTCT TAGCAATnAA TaTGAGTGAA QAACGTACTG TT6AAGTACC AGTTCAATTA 60

GTTGGTGAAG CAGTAGGCGC TAAAGAAGGC GGC6TAGTTG AACAACCATT ATTCAACTTA 120

GAAGTAACTG CTACTCCAGA CAATATTCCA GAAGCAATCG AAGTAGACAT TACTGAATTA 180

AACATTAACG ACAGCTTAAC TGTTGCTGAT GTTAAAGTAA CTGGCGACTT CAAAATCGAA 240

AACGATTCAG CTGAATCAGT AGTAACAGTA GTTGCTCCAA CTGAAGAACC AACTGAAGAA 300

GAAATCGAAG CTATGGAAGG CGAACAACAA ACTGAAGAAC CAGAAGTTGT TGGCGAAAGC 360

AAAGAAGACG AAGAAAAAAC TGAAGAGTAA TTTTAATCTG TTACATTAAA GTTTTTATAC 420

TTTGTTTAAC AAGCACTGTG CTTATTTTAA TATAAGCATG GTGCTTTTTG TGTTATTATA 480

AAGCTTAATT AAACTTTATT ACTTTGTACT AAAGTTTAAT TAATTTTAGT GAGTAAAAGA 540

CTTACTAAGC TAAAGAATAA TGATAATTGA TGGCAATGGC GGAAAATGGA TGTTGTCATT 660

ATAATAATAA ATGAAACAAT TATGTTGGAG GTAAACACGC ATGAAATGTA TTGTAGGTCT 720 

AGGTAATATA GGTAAACGTT TTGAACTTAC AAGACATAAT ATCGGCTTTG AAGTCGTTGA 780 

TTATATTTTA GAGAAAAATA ATTTTTCATT AGATAAACAA AAGTTTAAAG GTGCATATAC 840 

AATTGAACGA ATGAACGGCG ATAAAGTGTT ATTTATCGAA CCAATGACAA TGATGAATTT 900 

GTCAGGTGAA GCaGTTGCAC CGATTATGGA TTATTACAAT GTTAATCCAG AAGATTTAAT 960 

TCTCTTATAT GATGATTTAG ATTTAGAACA AGGACAAGTT CGCTTAAGAC AAAAAGGAAG 1020

TGCGGGCGGT CACAATG6TA TGAAATCAAT TATTAAAATG CTTGGTACAG ACCAATTTAA 1080

ACGTATT06T ATTGGTGTGG GAAGACCAAC GAATGGTATG ACGGTACCTG ATTATGTTTT 1140 

ACAACGCTTT TCAAATGATG AAATGGTAAC GATGGAAAAA GTTATCGAAC ACGCAGCACG 1200 

CGCAATTGAA AAGTTTGTTG AAACATCACG ATTTGACCAT GTTATGAATG AATTTAATOG 1260 

TGAAGTGAAA TAATGACAAT ATTGACAACG CTTATAAAAG AAGATAATCA TTTTCAAGAC 1320 

CTTAATCAGG TATTTGGACA AGCAAACACA CTAGTAACTG GTCTTTCCCC GTCAGCTAAA 1380 

GTGACGATGA TTGCTGAAAA ATATGCACAA AGTAATCAAC AGTTATTATT AATTACCAAT 1440 

AATTTATACC AAGCAGATAA ATTAGAAACA GATTTACTTC AATTTATAGA TGCTGAAGAA 1500 

TTGTATAAGT ATCCTGTGCA AGATATTATG ACCGAAGAGT TTTCAACACA AAGCCCTCAA 1560

CTGATGAGTG AACGTATTAG AACTTTAACT GCGTTAGCTC AAGGTAAGAA AGGGTTATTT 1620 

ATCGTTCCTT TAAATGGTTT GAAAAAGTGG TTAACTCCTG TTGAAATOTG GCAAAATCAC 1680 

CAAATGACAT TGCGTGTTGG TGAGGATATC GATGTGGACC AATTTCTTAA CAAATTAGTT 1740

AATATGGGGT ACAAACGGGA ATCCGTGGTA TCGCATATTG GTGAATTCTC ATTGCX3AGGA 1800 

GGTATTATCG ATATCTTTCC GCTAATTGGG GAACcAATCA GAATTGAGCT ATTTGATACC 1860 

GAAATTGATT CTATTCGGGA TTTTGATGTT GAAAOGCAGC GTTCCAAAGA TAATGTTGAA 1920 

GAAGTCGATA TCACAACTGC AAGTGATTAT ATCATTACTG AAGAAGTGAT CAGCCATCTT 1980 

AAAGAAGAGT TAAAAACTGC ATATGAAAAT ACAAGACCCA AAATAGATAA ATCAGTGCX3C 2040 

AATGATTTGA AAGAAACGTA TGAAAGCTTT AAATTATTCG AAAGTACATA CTTTGATCAT 2100 

CAAATACTAC GTCGCTTAGT AGCGTTTATG TATGAAACAC CTTCGACAAT TATTGAGTAT 2160 

TTCCAAAAAG ATGCAATCAT TGCAGTTGAT GAATTTAATC GTATTAAAGA AACTGAAGAA 2220

AGTTTAACAG TAGAGTCTGA TTCGTTTATT AGCAATATTA TTGAAAGTGG TAATGGATTT 2280 

ATAGGACAAA GTTTTATAAA ATATGATGAT TTTGAAACAT TGATTGAAGG CTATCCTGTC 2340



TCATGTAAAC CTGTCCAACA ATTTTATGGG CAATATGACA TTATGCGTTC TGAATTTCAA 2460 

CGATATGTTA ATCAAAACTA TCATATCGTG GTTTTGGTCG AAACCGAAAC TAAAGTTGAA 2520 

CGTATGCAAG CGATGTTAAG TGAAAtGCAT ATTCCATCAA TAACAAAATT GCATCGCTCA 2580 

ATGTCATCGG GGCAAGCAGT GATTATTGAA GGCAGTTTAT CTGAAGGATT TGAACTACCT 2640

GATATGGGAT TAGTTGTCAT TACTGAGCGT GAgcTTTTTA AATCAAAACA GAAAAAGCAA 2700 

CGAAAACGTA CGAAAGCTAT CTCAAATGCT GAAAAAATTA AGTCTTACCA AGATTTAAAT 2760 

GTGGGAGATT ATATTGTTCA TGTGCATCAT GGTGTTGGTA GATATTTAGG TGTTGAGACG 2820 

CTCGAAGTGG GGCAAACGCA TCGTGATTAT ATTAAATTGC AATATAAAGG TACGGATCAA 2880 

CTATTTGTTC CAGTAGATCA AATGGATCAA GTTCAAAAAT ATGTAGCTTC GGAAGATAAG 2940 

ACGCCAAAAT TAAATAAACT CGGTGGCAGT GAATGGAAAA AAACAAAAGC TAAAGTTGAA 3000 

CAAAGTGTTG AAGATATTGC TGAAGAGTTG ATTGATTTAT ATAAAGAAAG AGAAATGGCA 3060 

GAAGGTTATC AATATGGGGA AGACACAGCT GAGCAAACAA CATTTGAATT AGATTTTCCA 3120 

TATGAACTTA CGCCTGACCA AGCTAAATCT ATCGATGAAA TTAAAGATGA CATGCAAAAA 3180 

TCGCGTCCAA TGGATCGCTT GCTATGTGGT GATGTTGGTT ATGGTAAAAC TGAAGTTGCA 3240

GTGAGAGCAG CATTCAAAGC TGTAATGGAA GGAAAGCAGG TTGCATTTTT AGTTCCTACA 3300 

ACTATTTTAG CTCAGCAACA TTATGAGACG TTAATTGAGC GTATGCAAGA TTTTCCTGTT 3360 

GAAATTCAAT TAATGAGTCG TTTTAGAACG CCTAAAGAGA TAAAACAAAC TAAGGAAGGA 3420 

CTTAAAACTG GATTTGTTGA CATAGTTGTT GGTACACACA AATTACTTAG TAAAGATATA 3480 

CAGTATAAAG ATTTAGGGCT GTTGATT6TA GATGAAGAAC AACGATTTGG TGTACGCCAT 3540

AAAGAGCGTA TTAAAACATT AAAACATAAT GTAGATGTAC TAACATTGAC TGCAACCCCA 3600 

ATAdCTAGAA CATTGCATAT GAGTATGCTA GGTGTGCGGG ATTTGTCAGT GATTGAAACG 3660 

CCGCCAGAAA ATCGTTTCCC AGTTCAAACA TATGTATTAG AACAGAACAT GAGTTTTATC 3720 

AAAGAAGCTT TAGAAAGAGA ACTATCCCGT GATGGCCAAG TGTTTTATCT TTATAATAAA 3780 

GTGCAATCCA TTTATGaAAA ACGAGAACAA CTCCAGATGT TAATGCCAGA TGCTAACATT 3840 

GCAGTTGCTC ATGGACAAAT GACAGAGCGC GATTTAGAAG AAACGATGTT AAGTTTTATC 3900 

AATAATgAAT ATGATATTTT AGTAACGACG ACGATTATTG AAACAGGTGT CGATGTCCCA 3960

AATGCAAATA CTTTGATCAT TGAAGATGCA GATCGCTTTG GATTGAGTCA GTTGTATCAA 4020

TTAAGAGGTC GTGTTGGTCG TTCAAGTCGT ATTGGTTATG CATACTTCTT ACATCCAGCA 4080 

AATAAGGTAC TAACTGAGAC TGCAGAAGAT CGATTACAAG CGATTAAAGA ATTTACGGAG 4140

TTAGGTAAAC AACAGCACGG CTTTATTGAT ACAGTTGGAT TTGATTTGTA CAGTCAAATG 4260
TTAGAAGAAG CTGTAAATGA AAAACGTGGT ATTAAGGAAC CAGAATCTGA GGTGCCAGAA 4320 

GTCGAAGTTG ATTTAAACTT GGATGCATAT TTGCCAACAG AATATATTGC AAATGAACAA 4380 
GCTAAAATTG AA 4392 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 729 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
TTTCTTTTGA ATCTATATCG AGGTGGTTGG TAGGTTCATC TAAAATAAGT ACATTGTCAC 60
GTTGCAACAT AAGTAGTGCT AGTTGTAAAC GTGCTTTTTC ACCACCAGAT AAATCATTAA 120 
TTATCTTTTT AACATCGTCT TGTACAAATA AGAAACGTCC AAGAACTGCT CGAATATCTT 180 

TTTCATTCAT TAACGGATAT TGATCCCACA CATAATCTAA AATCGTTTTA CTAGATTTAA 240 
ATTCTGCTTG CTTTTGATCA TAATAACCAA TTTGTAAATT TGCGCCGAAA GTAATATCGC 300 
CATTAAGCGC TTTTTGTTGA TTAGCAATAG TTTTAATTAA GGTCGATTTT CCAATACCAT 360 
TTGGCCCAAT GATTGCTATA TGATCGCCTT TAGAGACCTC TATACTCATA GGTTTGGTAA 420 
TTGCAGTTTG ATAACCGATT TCTAAATTTT TTACATGCAT GACGTCATTA CCTGTATTCC 480 
GGTCAAAGCC AAATTCaAATA TTTGCACTTT TGGCATCTAA CATTGGTTTA TCAATGCGTT 540
CCATTTTTTC TAAAATCTTA CGTCTACTTT TTGCCATTCC ACTTGTTGAA GCACGGGTAA 600 
TATTTTTCTC AACAAAAGTT TCTAATCGTT TTATTTCTGC TTGTTGACTT TCATATTCTT 660 

GCATTCGTTT TTGATAATAT AAATCCCGTT GCTGTATAAA TTCCTCGTAA TTACCAACAT 720 
AGCGTTTGA 729
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13856 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 


TGATGTTTCG ATACATTTGT TGCACCTTGT GGATATACTT TAAAGGTTGT GTCGTATGTT 120 

TCCTTACTAT CTTTAGCTTC AGATTCCTGT GATTCAACCG TTTTATATTT TTCAAGTGCA 180 

TGTCCTTCAA TATCAACTCG TGGAATAATG CGATTCAACC ATGCTGGTAA ATACCACGAA 240

CCTTTtCCAA ACAATTTCGt TAATGCAGGA ATTAACATCA TtCTGACTAC GAAGGCATCA 300

AAGAGTACAC CAAACGCTAA TGCCATACCC ATTGATTTAA TCATGACATC TTCTTGGAAT 360 

ACAAACGCAA AGAAGACACT AAACATAATT AATGCAGCTG CTACAATAAC AGGACCGCTT 420 

TCTTTCAATC CTACTTTGAT AGAATAATCA TTATCCCCTG TTTTACTATm yyCTTCATGr 480 

ATTCGCGACA TAAGGAAGAC TTCATAATCC ATCGCTAATC CAAATAAGAT ACCTATAGTA 540

ATAACCGGTA AAAATGCTAG CATTGGTCCT GTCGTTTCAA TACCAAACAG ACCTTTCATA 600 

AAACCATCTT GCATTACTAA TGTTGTAAAT CCTAATGTTG CCATTAATGA CAAGACGAAT 660 

CCTAAAACTG CTTTTAATGG TATTAGAATT GAACGGAAGA CAATCATTAA TAAGAAAAAT 720

GCTAATACAA CAATGACTGA GGCAAATAAA GGTATCGCCT CATTTAACTT TTTAGACATA 780 

TCAATATTAA TGACACTTTG TCCCGAAATC TCCGTTTTGA ACCCATATTT ATCTTGTGCA 840 

TCTTTATGAT AATCTCGTAA ATCATGCACT AAATCATTTG TACTCTCTGC ATTAGGCCCT 900 

TGCTTAGGTA TCACGACCAT CAAAGCGTAA TCATTATCTT TACTCATTTG TGGTGGCGTA 960 

ACGATATCTA CATTTTTCTT ATCTTTAATA TCTTTATATA CAGACTGTAA ATCTTGTTGT 1020 

AATCCTTGTG GATCATCCTT TTTATCTTTC ACATTTATCA ACATCGGTAT TTGGCCATTA 1080 

AATCCTTCAC CAAATTTATC CGAGATAATA TOGTAAGCTT TTTTCTGTGT AGAATCTGCT 1140 

GGTTTAACAC CGTCATCTGG AATACCAAGT CGCATATGAC TAACTGGTAT TGCAGCTGCT 1200

ACTAATATGA TTAAACCTAG TAATACTGCC GCAAGTGCAT TTCCTGTAAT AAATTTAQAC 1260 

CAT06CGTAT CAATATCTTT TTTGAATTTA GACTGTAATT TATTCACTTT AATGCGTTtA 1320 

TGGAAAATGC TTATTAATGC AGGTAATAAA GTTAAAGCGC TAAGTACTGC AAAAACAACA 1380

CTAATTGCCG AAGCAAATCC CATTACCGCT AAGAAGTCAA TGCCTACTAA TGATAAACCA 1440 

CATACTGCAA TTACAACTGT TACACCAGCA AAAACAACTG CACTACCTGC TGTTCCTATT 1500 

GCAAGACCAA TGCCTTTAAT GTAATCTGTT TCAGTTTTCA TAACTTGTCG ATATCTGAAT 1560 

AAAATAAATA ATGCATAATC GATACCAACT GCTAGTCCAA TCATTACGGC TAATGTCAGT 1620 

GTGACATTTG GTATATCGAA TGCATAAGTT AACAAACTGA TAATACCTAC ACCAGAGGCT 1680

AGACCAATCA ATGCACTTAT AATTGGTAAT CCTGCAGCAA TGACTGAACC GAATGTGATT 1740 

AACAGTACAA CAAATGCAAC AATAATACCA ACTAGTTCAG AATTACCGCC TACTTCTGTA 1800

AAATGACTTT TAACATTATC TCTAGAGCCA TCTTTTAAAG ATGTTTGACT AACGTCATAT 1920

GTGATATCTG CAAATGCAGT TGTTTTATCT TTACTAATTT GCTTATTTTC ATAAGGATCT 1980 

GATATTTTAT CAATGTGCTT GTCATCTTTT TTAATATCAT CTAACGTTTT CTTAATATCT 2040 

TTAGTAATGT TCGGTTGCAC AATACCATCA TCTTTAGTCG TCTTAAAGAC AACACGTATT 2100 

TGTGCCTTTT CACTATCTTG ATTAAAATGT TTTTCAATCT TTTTATTCGT ATCTAACGAC 2160 

TCTAATCCTG TCATTTTAAT ATCATTGTCA AATTTCX3GTG CATTTGTAGC AAGTGGTATC 2220 

AATATTGCAG CTACAATCAC TATCCATGCA ATGACCGCGG ACCATTTATG TTTTGCGATG 2280 

AATGTCCCCA TCTTATATAA AAATTTTGCC AAAGTATATT GCCTCCTTTT AAAATCAACG 2340 

TTATAGTTTA AATATACAGT GTAGATTATT GTTCGATTAT AGTATCTATC CCCGACCTCT 2400 

TAAAGAATCA ATTGGAAAAT TTTGTATATT AAACTACACA CAAAGGAGAA ATGTAGATGA 2460 

AAGAGACTGA TTTACGAGTT ATAAAGACAA AAAAAGCATT GTCGAGTAGC TTGCTACAAT 2520 

TCTTAGAACA GCAATTATTC CAAACGATTA CTGTCAATCA AATTTGCGAC AACXKACTCG 2580 

TACACCGTAC AACATTTTAT AAACATTTTT ATGATAAATA TGATCTTCTA GAGTACTTGT 2640 

TCAATCAATT GACTAAAGAC TACTTTGCTA GAGATATCAG TGACCGTCTT AATCATCCAT 2700 

TCCAAACGAT GAGTGATACG ATTAATAATA AAGAGGATTT GAGAGAAATC GCAGAATTCC 2760 

AAGAAGAAGA CGCTGAATTT AATAAAGTAT TAAAAAATGT CTGCATTAAA ATTATGCATA 2820 

ACGATATCAA AAATAATAGA GACCGTATCG ATATTGACAG CGACATCCCA GATAATCTCA 2880 

TATTTTATAT TTATGACTCG TTGATTGAAG GTTTTATACA TTGGATAAAA GATGAAAAAA 2940 

TTGATTGGCC TGGCGAAGAT ATTGATAACA TTTTCCATAG ATTAATCAAT ATTAAGATTA 3000 

AATAGTAGAT GAGAAACTCA TGAGCGTTAC CAACATTCAT AATAAAAACG ATAGTGkACA 3060 

CGTTAATGAA TTCGTGTACT ACTATCGTTT TTTATTnTA TCGTGCTTAT CGCTATTAAA 3120 

ACAACTGATA CACAACACAT AAACTATGAA GAAAAAAATA AATCCGCTAT CTAAATGACT 3180 

TTGACTCAGT TGTTTAAATG ACCAAATTGC TAATACAATT CCCATTATTA TTGAAATAAC 3240 

GTATCTCACA TTCTTATACC TATAATCCTT TTCTAAAAAT ATGGTTGCTA TTACTTAATT 3300 

TTTAAAGTTA TAAATAAAAA GAGCCAACCG CAATGGATGG CCCTTGTTCA TTATGAAGCA 3360 

TTAGAACATT TCTGAAACAA CCTTTTGTTC TAAGAAGTGT AATAAGTAGT CTGGACTACC 3420 

TGTTTTAGCG TCCGTACCTG ACATTTTGAA ACCACCAAAT GGATGGTATC CAACAACTGC 3480

TGAAGTACAG CCTCTGTTAA GGTATAAATT GCCTACATCA AATTCGTTTA CCGCTTTAAT 3540 

CCAATGCTCG CGATTATTTG TAATCACTGC ACCAGTTAAA CCGTAATCTG TATCATTTGC 3600

TTCTTCTTGC ATGATTCTAT CTTTAGATTT AAGTCCTGAA ATGATTGTTG GTTCTACAAA 3720

GTAACCTTTT GAATCATCAG TGCCGCCACC TTGTTCTAAT TTACCTTCTT CTTTACCAAT 3780

CTCAATATAA TTTTTAATCT TATCAAATTG TTTTTTATTA ATAACTGGGC CCATATACGT 3840

ATTGTCTACA GTATTGCCCA ACGTTAATTC TTTTGTTAAT TTGATTGATT TCTCTAATAC 3900

TTCGTCATAA ACGTCTTTAT GCACAATTGC ACGTGAACAT GCTGAACATT TTTGACCAGA 3960

AAAACCAAAT GCTGACGTTA CAATAGCTTC TGCTGCCATA TCTGTATCAA TATTTTCATC 4020

AACTACAATG GCATCTTTAC CACCCATTTC AGCGATAACA CGTTTCAAGA AGTTTTGACC 4080

TTCTTGAACA ACGGCACTAC GTTCATAAAT TCTAGTACCT GTCGCACGTG ATCCTGTAAA 4140

TGTAACX3AAA TGCGTATCTT TATGATCAAC TAAGTAATCA CCAATTTCTT TCGGATCACC 4200

AGGAACAAAG TTAACTACGC CmTGGTAA TCCTGCTTCT TCTAAAATTT CCATTAATTT 4260

ATAAGCGATA TAAGGTGTAT CCTCAGCAGG TTTGAATAAC ACTGTATTAC CTGCCACAAC 4320

TGGTGCTAAA GTTGTACCAG CCATAATCGC AAACGGGAAG TTCCACGGCG GAATTGTAAC 4380

ACCTGTACCA ATTGATTTAT AGAAATATTT ATTGTGTTCA CCTTCACGAT CAAGTACTGG 4440

CTTACCTTGA GCCAAGTCCA TCATTGAACG TGCATAGTAT TCAATAAAAT CAATAGCTTC 4500

AGCTGCATCA CCAACTGCTT CATCCCATGG CTTACCTGCT TCATAAACCA TAATTGCTGC 4560

AATTTCCGCT TTTCGACGAC GAATAATTGC CGAAACACGT AACATAAGCT CTGCACGATC 4620

ATTTGCTGAC CATGTTTTCC AAGATTTATA AGCTTCGTTT GCTGCTTTAA ACGCATCTTC 4680

AACATCTTGT TTTGTTGCCT TTGATGCATT TGCAATCACT TGTGATGTGT CTGCAGGATT 4740

GATTGATTTA ATTTTGTCAT CTTTGAAAAT CTTCTCTCCA TTAATCACTA ATGGTATGTC 4800

TTGACCTAAT TCTTTTTCCA CGTCTTTCAA TGCTTTCTTA AACATATCCA CATTTTCTTG 4860

GAC^AAAAA TCGTAACCAG GTTCATTTTT AAATTCTACT ACCATGTACA CTTACCCCCT 4920

ATAAATTTTG AAAGTGGTTT AACCCTTTGA TTTAATGATA TAACATCATT TAAACTCATT 4980

TTACTATGAT TAAGGTTAGT TTTGCAATCG CTTTCATTTT TATGTTTTAT CACTTATTCT 5040

CAAGTATTTT GAAATTGATT GGTTACTTTT TAAAATTTAT ATGGGTCGCA ACTGCTACTT 5100

TATCGTTTCG TCATTTAATG TTTCGGATGG TAGGTCATTA TCAATTTTAC GAACGACTTT 5160

ACAAGGGTTT CCAACCGCTA AGCTGTGTGG CGGAATATCT TTAGTGACAA CACTACCAGC 5220

ACCAATCACA CTGCCTTCTC CAATCGTCAC CCCTGGTAAC ACGGCTACAT GACCGCCAAA 5280

CCAAGTATTA CTGCCAATAT GAATGGGTCC GGCTTTTTCA AAACCTTCAT TTCTATGATG 5340

GAAATTAAGT GGATGTGTCG CTGTGTAGAA TCCACAATTA GGTCCTATAA AAACATTATC 5400

TCCTAGTTTA ACGTTCCAAC CATAATCTGT ATCAAAAGGA ATCGAAATAC TTACATTGTC 5520

TGTTGTTGTT TGAAATAATT GATCAATTAA TTCCTTTCTT TTATTTGTAG CACTCGGTCT 5580 

TGTATGATTT AATTCAAAGC AAATATCTTT CX3CTCGTGCA CGTTCATTGA TTAAGTATTG 5640 

ATCAAAGTTT GCATCGTACC ATTTTTCTGC TAACATTTTT TCTTTTTCAG TCATTACACC 5700 

TTTCAACTCC TAATAACTTA TTTACTTGTT TAAAAGTTAA TCAAATAAAC CTTCGCCTAT 5760 

GCAACTAATA CGCTATAACA TTATGAAATC ATGACCTTAT CACCCTTATC TATACAATTC 5820 

TCGCATCAAA TACTGCTAAA GTAGTAGATA AATTCAATAC TACAGACGCA TT»TTTTTT 5880 

AATCTATTAA CGTACAAT6T GAGTAAGAGA AATATAAAGG AGTATGATAG CGATGAGAAT 5940

ATTAATTACA GGCACAGTTG CTATCTTAAT CATTCTAGGT TTGGTCAAAA CGATACAAGA 6000 

TTACGAAATG ACAAACGACA CGAGTCGTcA GTT6TCAGAC AACAAAGATG ATGATAAAGT 6060 

CATCCATCTT AATAATTTTA AAAATTTACA TGCGAAAGAA TTTAACCCAT CTGATTTCTT 6120 

TTAAGTCACC TAAGAATTGC AAATCCAGAA GTCATTTAAG TTTTACCTTT CATTCATACA 6180 

TCCITTAATA TTAATTACGA CTTCTTTTAT ATAGATGCTA AGTAGAGAGA TTGTTGTGCA 6240 

ATGTTTGCAC GGCAATCTCT CTTTTTCTTT TTAAAATTGG TAAAAGTAAA ACGCAACGAT 6300 

TGACTTATAT ACCTATAGGG GGTACATTAG ACGTGTAACA ATGAATCACA GGGAGGCAAT 6360 

AATGTGGCTA ATACX3AAAAA AACAACATTA GATATCACTG GTATGACTTG TGCCGCATGT 6420

TCAAATCGTA TCGAAAAGAA ACTGAATAAA CTTGATGACG TTAATGCCCA AGTGAATTTA 6480 

ACTACAGAGA AAGCAACTGT TGAGTATAAC CCTGATCAAC ATGATGTCCA AGAATTTATT 6540 

AATACGATTC AACATTTAGG TTACGGTGTC GCT6TAGAAA CTGTCGAATT AGACATTACA 6600

GGTATGACTT GTGCTGCATG CTC7UVGCCGT ATTGAAAAAG TGTTAAATAA AATGGACGGC 6660 

GTTCAAAATG CAACGGTCAA TTTAACAACA GAGCAAGCTA AAGTTGACTA TTATCCTGAA 6720

GAAACAGATG CTGATAAACT TGTCACTCGC ATTCAAAAAT TAGGTTATGA CGCGTCTATT 6780 

AAAGATAACA ATAAAGATCA AACGTCACGC AAAGCTGAAG CGCTACAACA TAAATTGATT 6840 

AAGCTTATCA TATCAGCAGT ATTATCTTTA CCACTATTAA TGTTAATGTT TGTACATCTT 6900 

TTCAATATGC ATATACCAGC ACTATTTACG AATCCATGGT TCCAATTTAT TTTAGCTACA 6960 

CCTGTACAAT TTATTATTGG ATGGCAATTT TATGTAGGTG CTTATAAAAA CTTAAGAAAT 7020 

GGTGGCGCCA ATATGGATGT ACTTGTTGCT GTTGGTACAA GTGCAGCATA TTTTTACAGT 7080

ATTTATGAAA TGGTTCGTTG GCTAAATGGC TCAACAACGC AACCGCATTT ATACTTTGAA 7140 

ACAAGCGCCG TACTAATTAC CTTAATCTTA TTCX^GTAAGT ATTTAGAAGC TAGAGCGAAG 7200

TTAAAAGATG GTAATGAAGT GATGATTCCT CTAAATGAAG TACATGTTGG AQATACACTT 7320

ATCGTTAAAC CAGGTGAAAA GATACCTGTT GATGGCAAAA TTATTAAAGG TATGACTGCC 7380

ATCGACGAAT CTATGTTAAC AGGTGAATCT ATGCCTGTTG AGAAGAATGT TGATGATACT 7440

GTAATTGGTT CAACGATGAA CAAAAACGGT ACTATTACTA TGACAGCAAC AAAAGTTGGC 7500

GGGGACACTG CGTTGGCAAA TATTATTAAA GTTGTCGAAG AAGCTCAAAG TTCTAAAGCG 7560

CCGATTCAAC GATTGGCAGA TATTATTTCT GGTTATTTCO TTCCTATCGT TGTTGGTATC 7620

GCACTATTAA CATTTATCX5T GTGGATTACT TTAGTTACAC CAGGTACATT TGAACCTGCA 7680

CTTGTTGCGA GTATTTCCGT TCTCGTCATT GCTTGTCCAT GCGCATTGGG ACTTGCTACA 7740

CCAACTTCTA TTATGGTAG6 TACTGGTCGC GCTGCTGaAA ATGGTATTTT ATTTAAAGGT 7800

GGCGAGTTTG TTGAACGCAC ACATCAAATT GATACCATCG TTTTAGATAA GACGGGTACC 7860

ATTACAAATO GTCGTCCAGT CGTGACAGAT TATCATGGTG ACAATCAAAC GCTACAACTA 7920

CTTGCTACTG CTGAAAAAGA TTCTGAACAC CCATTGGCAG AAGCCATTGT CAATTATGCA 7980

AAAGAAAAGC AATTAATATT AACTGA6ACA ACAACATTTA AAGCAGTACC TGGCCATGGT 8040

ATTGAAGCAA CGATTGATCA TCACCATATA TTGGTTGGTA ACCGTAAATT AATGGCTGAC 8100

AATGATATTA GCTTGCCTAA GCATATTTCT GATGATTTAA CACATTATGA ACGAGATGGT 8160

AAAACTGCTA TGCTCATTGC TGTTAATTAT TCATTAACTG GTATCATCGC AGTGGCAGAT 8220

ACTGTCAAAG ATCATGCCAA AGATGCTATA AAACAATTGC ATGATATGGG CATTGAAGTT 8280

GCCATGTTAA CTGGCGATAA TAAAAACACT GCTCAAGCCA TTGCAAAACA AGTAGGCATA 8340

GATACTGTTA TTGCAGATAT TTTACCAGAA GAAAAAGCTG CACAAATTGC GAAACTACAG 8400

CAACAAGGTA AGAAGGTTGC GATGGTTGGT GACGGTGTAA ATGATGCACC TGCATTAGTT 8460

AAAGCTGATA TCGGTATCGC CATTGGTACA GGTACAGAAG TTGCCATTGA AGCAGCTGAT 8520

ATTACTATTC TTGGTGGCGA CTTGATGCTT ATTCCTAAAG CCATTTATGC AAGTAAAGCA 8580

ACCATTCGTA ATATTCGTCA AAATCTATTT TGGGCATTCG GCTATAATAT TGCCGGTATC 8640

CCTATAGCTG CATTGGGCTT ACTTGCGCCA TGGGTTGCTG GTGCTGCAAT GGCACTAAGT 8700

TCAGTAAGTG TTGTCACAAA CGCACTTAGA TTGAAAAAGA TGCGATTAGA ACCACGCCGT 8760

AAAGATGCCT AGATTCCTTA ATAATGAAGG ATTCGTTGGT GATTCTGAGA TAGGCTAGTG 8820

ATTGGCTCTA TAATGTCGCG GTTTAyaGTt

CACTTTTCGC TTGGCGAATT AGTGTATCTT

ATTATTATAA ATAATAAGTA CACTACGGtT

GGATCTTCGC TCCAACTGCA TATATAGTnA 8880 

ACCTAATAGc TCCGCCTATT AGGTTCCATC 8940 

TACAGTTGGA TCTTCGCTCC AACTGCATAA 9000 
GAAATTTTAA ATGTTGAAGG TATGAGCTGT GGTCACTGCA AAAGTGCTGT TGAATCTGCA 9120 
TTAAATAATA TTGACGGTGT CACTTCAGCT GACGTTAACC TTGAAAATGG TCAAGTAAGT 9180 
GTTCAATATG ATGACAGTAA AGTTGCTGTA TCTCAAATGA AAGACGCAAT TGAAGATCAA 9240 
GGTTACGATG TCGTTTAATT AGGCAATATT CAACGTCATC AACACCAAAT TAAAAAATCG 9300 
AACTGATGAG AATCCCAACA ATCCAAATTA TCTCATCAGT TCGATTTTTA ATTTACTCGT 9360 
AACCTAGTAT CTCCAGTCTG CAATACATCT AATGTTGCAT CTAATGCATC GACAATTAGA 9420 
TTTTTAACTG CAGCTTCAGT ATAAAACGCA ATATGTGGTG TTAATATGAC ATCTTCCCTG 9480 
TCAATCAAOG ATTCTAACAA TGGATCGTTC AGTGTTTTGC CCCTTTGATC ACTTGGGAAA 9540 
AGTTTGCGTT CAAATTCATA CX3TATCAAGT GCTGCACCTT TAATCACACC ATTGTCTAAT 9600 
GCGTCTAATA ACCCCTTAGT ATCTACTAAA GAACCTCTCG CACAATTGAC AAATACTGCG 9660 
CCCTTTTTAA AATGTTTAAA TAATTCAGCA TTAAATAGAT AATGATTATA TTTCGTTGCA 9720 
GGTACATGTA ATGTCACGAT ATCAGCACCT TCAACCGCTT CCTCAATCGT ATCTTTGTAA 9780 
TCGACATACG TTGCAATTTT AGCATTAGGA AACGGtCGTA TGCGACCACA TCACTTTGAT 9840 
AACCATTGGC AAATATATCG GCTACTACAC GGCCAATTCG ACCTGTACCA ATAACAGCTA 9900 
CTTTTAAATC TTTAATGGAT TTCGATAAAA TAGTAGGTTC CCATCTAAAA TCATGcTCCC 9960 
GCACTTTCGT TTGAATTTGA TTAAAATGAC GAACCACATT AATAGCCTGG TTCACAGCAA 10020 
ACTCCGCAAT TGAATTCGGA GAGTATGACG GCACATTTGA CACAATAAAG TTATACTTGT 10080 
TTGCTAACTC CAAATCATAT GTATCAAATC CAGCACTACG TTGTGCGATT TGTTTAATAC 10140 
CTAGTTCATT TAATCGTTTA TAAACATGCT CTGATAATGG TATTTGTTGT GATAGCGATA 10200 
AGCCATCATA ACCAGCGACA CCTTCAACAT TGTCATCAGT TAATGCTTCT TTAGTAATAT 10260 
CTACCTCAAC ATGATGTTTC TCTGCCCACG CCTTGATATA AGGCATATCT TCATCACGTA 10320 
CACTCATGAT TTTAATTTTT GTCATTTTAA CATCACCCTT AACTTTATTA TTCATATAAA 10380 
TATGCTAGTT CTGTTAATCT TATTGCAGCT TCGTCTAATT TCTGGTCATC TAACGCCAAT 10440 
GAAATTCTCA CATAACGATT ACCATTCTCT CCAAATGGTT TCCCTGGAGC AACAAGTATT 10500 
GACTTCTCTT GCACTAAAAA TTGCTCAAAT TGCTCGCTGT CATAACCAGG CGGTGTTTCC 10560 
AACCATACAT ATATGCCACC TTTAGCATGA ACAAATGGCA AATCAGCTTT TGCAAGCATG 10620 
GCTTCGAATC GGTCACGACG TGTTTTAAAT ACATTGCTTT GTTCTTCTAA AAAATCATCA 10680 
TAATGATTCA AAGCATATAT TGCGGCATCT TGTAATGCAC CAAACATCCC AGCATTTGTG 10740 
TGCGTTTGGT ACTTTTTCAA AGCTTGAATC ATATCTTTAT TACCAACTGC AAAACCGACT 10800 
CCATTTTCCG AAGCAAGTAT ACTAGGATTT TTAGCGTCGA AACCGAAAGC ACCATAAGCA 10920 
AAATCATGCA CGATTTTAGT GTCTGTACCT TTAAATTTAG cTATCGCTTC ATCAAAAACT 10980 
TCTTTCGTAG CTGTCGATCC AGTTGGATTA TTTGGATACG TTAAATAAAT GAGTTTTGTT 11040 
TTATCTATTA TTTGTGAATC AACTTTGGAC CAATCTGGCA AATAATGTGG CGGTTCTAAA 11100 
TTAAGCGGGA CTGGCTTGCC ATCAGCTAAA AGTACACCTG CTAAATAATC CGTGTAGCCT 11160 
GGATCAGGTA GTAATACATA GTCTCCTGGA TTCATAACAC ATGTTGGTAC TGCCACTAAT 11220 
CCATTTTTTG TACCATATAA AATGCATACT TCATCTTCTT TATCTAACGT CACATTATAT 11280 
T6TCTTTGAT AAAAATCTAC AATAGCTTGC TTGAACX;CTT CTTTACCATG AAAAGCACCA 11340 
TATTTTTGAT TTTCAGGAAT AGTTAGTGCT TTTTGAAAAT GATCAATAAT ACCTTGTGGC 11400 
GTGGGCCCAT CAGGGATTCC AACTGCCATA TTAATTAATG GCAATGGTCC ATGTTCGATT 11460 
TTACGTCCCA TCGTTTTCCC GAAATAACTA TCAGGGATAT TTGCTAATTT GTTAGAGATC 11520 
ATCAAATTCC TCCTCTATCA TTAAACATAG CCTGGGCGAC TATCATAATC CTAACAACTT 11580 
GTATCACTCT CATTTAGATG GTTACAATGA CATCGCCATT CACCGTTATG TTCAACAGAA 11640 
CTTATGACAC ACGTTGTATT GAATGAATTT ATTTTCATTT TAGGTAGGTA TAATATTATT 11700 
GTCAATATTA GGAATTTTCA GATTAATATG CACTCAATCG TTATGATTTA ACTGTCATGC 11760 
ATATCCGCAT GCGCAACCAG TTAGATATGC TTATATAAAG TATAACGCCC ATCAAGGTAC 11820 
GTATTCAAAC GTGAACCTTA ACAGGCGTCA TTCATTGTTA AATAAAACTT CTTAAGCACA 11880 
TACTTATTTC ACTATGCCTT TTACGTTCCC CTTATACTTT TCTCACATCT TTCTCTTAGA 11940 
CTACTCCCTT ATACGCCCCG CTCAATATCT TTAATCATTT CATCTACAGT TATTTTCGCA 12000 
CTCGTTAAGA CAATAGGAAC GCCTGCACCT GGATGCGTAC TTGCACCTGC AAAATATAAA 12060 
TCTTTATAAT CTCGCGATAC ATTTTGTGGA CGATAATAAT TACTTTGCGC TAAAGTTGGC 12120 
ATTAAACCGA ATGCCGAACC AAATTTCGCA TGATACGTTT GCTCAAAATC ATTTGGCGTA 12180 
AAGATTGTTT CTGAAACAAT ATGCGATTTT ATATCTTCAA ATACTTCAAT CGTTGCTAAT 12240 
TTACGATAAA TAATTTCCTT TATTTGTTGC GTCAAAGCTT CATCTGACCA ATCGATTCCG 12300 
CTACCTGTTT TAAGTTCCGG CGTCGGCATT AGCACATAAA TACCAGTTTT GCCTTCTGGC 12360 
GCAAGTGATT TATCAGCGAC CGCTGGTACA TACACATAAA TAGAAGGATC ATATGATAAA 12420 
CGTCCCTCAA ATATTTCTTC AATATTGCCT CTAAAGTCAT CTGAAAAAAT AACATTATGA 12480 
AGTCTCACTT GATCTGTCAC ATCAATATCT ATACCGATAT ACATTAAAAA TGCTGAACAA 12540 
GAGTAATCTA AGTCTGCAAT TTTATGTGGT GGATACTTTT TAATAGGTGC AAAATCTGGC 12600 
ATGTCACCAT TCACTTTTAT CGCATCGGCC CGTTTGAATT TAGGATCAAT AATAATTTGC 12720 
TCAATTTCAG CATTTAGTTC AATATTAACG CCTAAGTCTT TATTTAATTG CGCTAGcCCT 12780 

TGAGCCATGC CATACATACC GCCTTTAATA AAATGCACAC CAAACATCAT TTCAATCATA 12840 
GGAATAATTG AATATAGTGA CGGGCCTCGT TTTGGATCAA TTCCTATGTA TAACGTTTGA 12900 
AACGCTAAAA GCTTTTGTAT CTTTTCGTTA TCAATATAAT GTTCAATTAG CTGATCTGCA 12960 

TGATTTAACG TTTTTAACTT AGCACCITGC ACAAGTGACG TCATATTATA AAAGTCACTC 13020 
GGTTTGCGAT ACGTTCTTTC TAAGAAATAG CGACGTGCAA TTTCATATTT TTTATAAACA 13080 
TCCGTTAAAA AGGACATAAA ACCATGCGTT GAACCAGGTT CTATACTTTC TAGCATTTGC 13140 
TGTAATTCAG CTAAATCTGT AGGCACCX5TT ATACGATCAT CGTGGTCAAA ATACACATCG 13200 
TAAATATAAC GTAATT6TCT CAATTCAATA TAATCTTCAT AATTTTTACC ACACGCTGTA 13260 
AAAACATCTT TATAAACATC TGGCATCATG ACAATTGTGG GACCCATATC AAATGTAAAG 13320 
CCGTCTTTCT TTAATTGATT CATACGCCCG CCTACATTAT TATTTTTTTC AAATATC6TC 13380 
ACTTCATGAC CTTGAGAAGC AATACGGGCT GCCGCTGCTA ATCCTGTGAC ACCTGCACCA 13440 

ATTACTGCAA TCTTCATTAT TCAACCACCT ATATTCTATG ATATTTACTA TTTATTTCAT 13500 
GAAACAACTT TGCCTTTTTC CTCTTATCCA CAAAAACACG TTCATGTAAT GTATAGTTAG 13560 
CCTGTCTCAC TTCGTCCAGT ATTTCAATAT ATATACGTGC TGCTAATTCT ATGATTGGTT 13620 

GTGCTTCAAT ACTAAATACT TTGATTTGAT CCATAACATC TTGAAAATCT TTTTCTGCGA 13680 
TAGCTGCATA ATATTCCCAT AAGTCAATAT AATGATTATT AACACCATTT TGGTACACTT 13740 
CAGCAATATC AACTTCATAT TGCTTTAATC GTTGCTTACT AAAATATATC CGTTCATTGT 13800 
C3UUW^TCTTC ACCGACATCT CTTAATATAT TAAnGGGATC CTCTAGAGTC GACCTG 13856 
(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10088 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
ATATATAAAT ATAGATTAAG TATATAGATT AATCAACTTT TTTGGAAGAG CAAATCACGC 60 
AATCAACAAA TAATATAAGA AGTTTTTGCG ATAGTTTTAA AATAGCTGTA ATAGAATACT 120 
AAATGTGACA AACTTAGAAC TAATATCAAG TGTTGATGTT TTGAATATAA AAATGCTAAT 180 
ATAATTGGTT AATATATGAG TAATTAGAAA ATAGACAAAG GATGACGATT TATGTATATC 300 
AATATGAAAG ATTATGGGTT AACAGGCATA AACAAAACTA AAGATACTCX3 AGCAATACAA 360 
CGTGCGTTAA ATCGTGGAAG ATGTAAACCA ACGACAGTTT ATATACCGAA AGGGACGTAT 420 
GATATTTGCA AACCATTAAC GATATATGGC AATACAACAC TTTTGTTAGA TAATGAAACT 480 
ATTTTACGCC GATGTCATTC TGGTCCTTTA TTAAAAAATG GTCGTCGCTT TGGTTTTTaT 540 
COTGGTTATA ATGGACACAG TCATATTCAT ATTAAAGGCG GCAAGTTTGA TATGAATGGT 600 
GTATCX5TATC CTTATAACAA TACAGCTATG TGCATTQGGC ATGCTGAAGA TATTCAATTA 660 
ATA6GTGT0A CCATTAAGAA TGTAGTGAGT GGTCATGCAA TTGATGCTTG TGGGATTAAC 720 
GGACTCTATA TTAAAAGCTG TTCATTTGAA GGATTCATAG ACTATAGTGG CGAACcTTTT 780 
ATTCTOAAGC AATACAA7TA GACATTCAAG TACCTGGTGC TTTTCCAAAA TTCG6AAC9A 840 
CAGATGGTAC GATAACGAAA AATGTCATTA TCGAAGATTG TTATTTTGGA CCTTCAGAAT 900 
TGCCCGAAAT GGGAA6TTGG AATCGTGCTA TTGGCTCACA TGCAA6TAGA CATAATCGAT 960 
ACTATGAGAA TATTCATATT AGAAATAATA TATTTGAAGA TATACAAGGT TATGCATTAA 1020 
CTCCCTTGaA GTATAAAGAT GCTTTCATTA TTAATAATAA GTTTATTAAC TGTGaGGGTG 1080 
GCATTAGATA TTTAGGAGTT AGAGATGGTA AAAATGCAGC AGATGTGaTG ACAGGaAAAG 1140 
ACTTAGGTTC CCAAGCAGGC ATAAATATGA ATATAATTGG AAATGAATTT AAAGGATCAA 1200 
TGTCTAAAGA TGCGATACAT GTACGTAATT ATAATAATGT TAAACATAAA GATGTATTAA 1260 
TCGTTGGGAA TACATTCAAT AATTCGACTC AATCAATTCA TTTAGAAGAT ATTGATACAG 1320 
TGTTTTTAAG TCCTGTTGAA GCGGGTATTC AAGTTACTAC AATCAAT6TA GATGAAATAA 1380 
AAAAGTAAAA AGTTTCGCAT GACATTAQGA TTAAGAATAG TAQATAATTT TTGAAAGCGC 1440 
ATTGATAAAA CGGTATAAAT ATGCTATAAT AAACCCAATT ATCTGATAAA AGGGGTATTT 1500 
TGACGGTAAT GATAATACAA GATAGACAAC TTTCTATACT CTAATATAGT GAGTTGAAGT 1560 
AGCTTGTCAT AATCATCATG AGGGGGAAAT TTATGGCTTA TTTCAATCAA CATCAATCAA 1620 
TGATATCGAA AAGGTATTTA ACATTCTTTT CAAAATCAAA GAAAAAGAAA CCGTTTAGTG 1680 
CGGGACAACT TATTGGACTA ATATTAGGTC CATTACTTTT CCTATTAACA TTATTATTCT 1740 
TTCATCCACA AGACTTACCT TGGAAAGGCG TCTATGTTTT AGCGATTACT TTATGGATTG 1800 
CGACTTGGTG GATTACTGAA GCAATTCCTA TTGCAGCAAC GAGCTTATTA CCAATTGTGT 1860 
TATTACCATT AGGTCATATA CTTACACCAG AACAAGTATC ATCCGAATAT GGCAATGATA 1920 
TTATCTTTTT GTTTTTAGGT GGATTTATTT TGGCAATTGC AATGGAAAGA TGGAATTTAC 1980 
TTGGATTCAT GGTGGCAACA GGATTCTTAT CTATGTTTGT ATCGAACACT GCAGCTGTAA 2100 

TGATTATGAT TCCGATTGGT TTAGCAATTA TTAAGGAAGC ACATQATTTA CAAGAAGCCA 2160 

ATACGAATCA AACAAGTATT CAAAAGTTTG AAAAATCTCT AGTTTTAGCA ATTGGCTATG 2220 

CAGGTACGAT TGGTGGCTTG GGTACATTAA TCGGAACCCC GCCATTAATT ATTTTAAAAG 2280 

GACAATACAT GCAACATTTT GGACATGAAA TTAGTTTTGC TAAATGGATG ATTGTAGGGA 2340 

TTCCAACGGT CATTGTTTTG TTAGGTATTA CTTGGCTCTA TTTAAGATAT GTTGCGTTTA 2400 

GACATGATTT GAAATATTTa CCTGGTGGTC AGACGTTAAT TAAACAAAAG TTAGACGAGC 2460 

TTG6CAAAAT GAAGTATGAA GAAAAOGTAG TACAAACTAT CTTTGTACTT GCTAGCTTAT 2520 

TATGGATTAC AAGAGAGTTT CTTCTGAAAA AATGGGAAGT TACGTCATCT GTTGCAGATG 2580 

GTACGATTGC TATTTTTATA TCAATATTAT TATTTATTAT TCCAGCTAAA AATACTGAAA 2640 

AACATCGCCG TATCATTGAC TGGGAAGTTG CAAAAGAGCT CCCTTGGGGT GTATTAATTT 2700 

TATTTGGTGG CGGTTTAGCA TTAGCGAAAG OTATTTCTGA AAGTGGTTTA GCAAAATGGT 2760 

TAGGCX3AACA GTTGAAATCA TTAAATGGTG TTAGTCCGAT TCTTATTGTA ATTGTCATAA 2820 

CAATqTTTGT CTTATTTTTA ACTGAAGTGA CATCTAATAC TGCAACTGCA ACGATGATTT 2880 

TACCGATTTT AGCAACGTTG TCTGTTGCTG TTGGAGTGCA TCCATTACTA CTTATGGCAC 2940 

CTGCAGCTAT GGCGGCTAAC TGTGCATACA TGTTACCAGT AGGGACACCA CCGAATGCAA 3000 

TTATCTTTGG TTCTGGTAAA ATATCTATCA AACAAATGGC ATCAGTAGGA TTCTGGGTAA 3060 

ACTTAATCAG TGCAATAATT ATTATTTTAG TCGTGTATTA TGTAATGCCT ATAGTTTTAG 3120 

GTATTGATAT AAATCAACCA CTGCCATTGA AATAGTAATT GCAGATTAGA ACGAAAAATA 3180 

AAAGGTTACA TTAGCAATTG CTTGGACX3AG TGGTAACX3AA ACGTATACCG CAGCATCGTG 3240 

TAA2AACAAT ACAAACAAAA GAAAGTCAAC CAAGGATGGA TTCCTATTT7 AATCCTTGGT 3300 

TGACTCTTTA TTTTATTTAA ATTGTAGAAC CTAGAAAATA AAGTTTAATT AAAAGCACCA 3360 

ATCATTTCTA CTTTGAAATC TAAGGTTTCT AAAATAGCAA TGACTTTCTT TATATCGGTT 3420 

GTAATTGCAG AATCAGCCTG AACGAAAAAT CGATACATAC CTAATTGTGT TTTTAAAGGA 3480 

CGAGACTCAA TCCAGGATAA ATTAATATTA AACAAAGCAA ATGTATTAAG CACACTTGCT 3540 

AACAACCCAG GTTTATCATG CATTGGTGTA ATTAAAAACA TCAATGATGT CGCATTTTGA 3600 

TCAAATTGCT GCTGATTTTT TATAACTAAA AAACGTGTCA CGTTATGTGG ATAGTCTTCA 3660 

ATATGTGTAT CAATAGGTGT AAAACCATAA GctTCGCCAC TACCTAAAGG TGCAATTGCT 3720 

GCAACGCCAT TTTCAATTTT AGTCAAACTT TGAATTGTAC TGTCGACATA ATCATAGTCA 3780 
TTTTTAATAT CAGAAATGGA ATCTGTTCCA TTACCATATA ATGCAAAGTT AATATCTAAA 3900 

CGTATTTCAC CGTGTGCAAA GACATCTTGC TGTGCAAGTG CATCTGCCAC AATGTTGATT 3960 

GTTCCTTCTA TAGAATTTTC AATAGGGACA ACACCAATCG ATGTGTCATC ATCTGCAACT 4020 

GCCTTGATGA CTTCAAATAA ATTTGACTTT GGTTGAAAAG TTGCTTCATT TTCAGAAAAA 4080 

TACTGACGAC AAGCCAAATA TGAAAATGTA CCTTTAGGGC CTAAATAATA TAATTGCATA 4140 

TGCTACACCT CTACTAACTT AATGATGGAA AGGGCACTGG TTAGCATTTG ATTCTTTCTT 4200 

TTTATAGAAA AAGTTTGGAT CTTTTACTGT ATTGTCATAT CCQTQATGAT AATTTGACGT 4260 

CAATGTTGGA GATAATGGCG GTGCTAGCCA AGACCATTTT CCGGTAACTT GACGACCTTG 4320 

TTGTGCTTCG TTACOTTCGA ATAGTTCGAA TTGCTTTQCA OCGOTCAAAT GATCX3ACAAT 4380 

TQATACGCCT TCTTTTTTAA AGGAATGATA CACAGCATAG TTCAATTCAA CAAGTGCTCG 4440 

ATCTTTATTA AATGAATTAT TTTTAAGTGT ATCAAATTCA AACGCATCTQ CAACTTTTTC 4500 

TAGTAAATTG TAACGGTAAT CATCAATAAA GTTACGTACG CCAATTTCAG TTACCATATA 4560 

CCAACCGTTA AAGGGTGCAG TTGGATATAC AATGCCACCG ATTTTTAAGT CCATATTGGA 4620 

AATGATAGGG ACTGCATACC ATTTTAAGTT CAATTTTCTT AATTTTGGAT AATGATTATG 4680 

TTCAATAGGT ACTTCTTTAA TTAATGAAGT AGGATATTCG TAAAATTTAA CTGACTCATT 4740 

AGGTAATTGG TAAATCAGTG GTAACACGTC AAAATTAGTA CCTTTTCCTT TCCAACCTAA 4800 

GTGATTTGCT AAGCGTGTAA CTTCTTTTTC AGCAGGATCA CCACAATTGT CATAGCCAGC 4860 

ATAGCGAATT AATTGATTGT TGAAAATTTT AGGTCCATCC TTTGGAGCAT ATATAGTAAT 4920 

ATACGGCTTT AATTTACCTT CATTTGTAGC CTGTGTAATA TGATAAGTAA TTGATGATAA 4980 

6AACGATGCT TCGTCAGTAA CATCTCTTGC ATCAATGACA TTTAACGAAT CCCAAAATAA 5040 

ACGACCAATG CAACGATTTG AATTACGCCA AGCCATTTTA GCACCATAAA TAAGTTCTTC 5100 

TTCTGTATGT GTATATOTCC CAGTTTCTTT TATTTCTAGT TCAATGTCAT GTAAACGTTT 5160 

ATTGATAATT TGCGTTTCAT AATGACACTC TTTATACATG TTTTCTATGA AAGCTTGAGC 5220 

CTCTTTAAAT AACATTAACA ACACCTCGCT TTATATTATA GTCTACATTA TTAAAATACT 5280 

CTTAAAAATT ATGTATATGT CATTAAATTG TTGGTTGATT TTAATTAAAA GTATGGAAAT 5340 

TAAGGGGCTC TTATGTATAT AAAAAAATGA ATTATGATAA AATGTAAGAA AATATTTAGG 5400 

TCGATTGGAG AGATACAAGT GTACCAATTA GAAGACGACA GTTTAATGTT ACATAATGAC 5460 

TTATATCAAA TAAATATGGC TGAAAGTTAT TGGAATGATA ATATTCATGA AAAAATGGCT 5520 

GTATTTGATT TGTATTTTAG AAAAATGCCA TTTAATAGTG GCTATGCTGT TTTTAATGGT 5580 
TTAAAGTCTA TTGGCTACAA GGATGATTTC TTATCATATT TAAAAGATTT AAAATTCACA 5700 

GGCAGCATCC GTTCGATGCA AGAAGGCGAA TTATGCTTTG GTAACC3AACC ATTGTTACGC 5760 

GTAGAAGCAC CATTGATTCA AGCGCAATTA ATAGAAACAA TnTATTAAA CATTGTAAAT 5820 

TTCCATACAT TAATTACAAC AAAGGCTAGC AGAATTCGTC AAATTGCATC AAATGATAAA 5880 

TTAATGGAGT TTGGTACACG TCGTGCGCAA GAAATTQATG CAGCATTGTG GGGCGCTAGA 5940 

GCTGCTTACA TCGGGGGCTT TGATTCTACA AGTAATGTTA GGGCGGGGAA ATTATTTGGT 6000 

ATACCTGTGT CTGGTACACA TGCACATGCA TTTGTCCAAA CITATGGAGA CGAATATGTT 6060 

GCCTTCAAAA AATATGCTGA AAGACATAAA AATTGTGTGT TCCTAGTAGA TACATTCCAT 6120 

ACTTTAAAAT CTGGCGT6CC AAATGCAATA AAAGTTGCAA AAGAATTAGG TGACAAAATT 6180 

AACTTTGTAG 6TATTCGATT AGATTCTGGA GATATCGCTT ATTTATCTAA AGAGGCAAGA 6240 

CGTATGCTTG ATGAAGCAGG ATTTACTGAA ACTAAAATTA TCGCGTCTAA TGATTTGGAT 6300 

GAAGAAACGA TTACGAGTTT GAAAGCACAA GGTGCAAAAG TAGATTCTTG GGGCX5TTGGT 6360 

ACAAAGCTGA TTACAGGATA CGATCAACCA GCATTAGGTG CAGTATATAA ACTTGTAGCT 6420 

ATTGAAAATG AAGATGGTTC ATATAGTGAT CGTATTAAAT TATCAAATAA CGCTGAAAAG 6480 

GTTACGACGC CAGGTAAGAA AAATGTATAT CGCATTATAA ACAAGAAAAC AGGTAAGGCA 6540 

GAAGGCGATT ATATTACTTT GGAAAATGAA AATCCATACX5 ATGAACAACC TTTAAAATTA 6600 

TTCCATCCAG TGCATACTTA TAAAATGAAA TTTATAAAAT CTTTCGAAGC CATTGATTTG 6660 

CATCATAATA TTTATGAAAA TGGTAAATTA GTATATCAAA TGCCAACAGA AGATGAATCA 6720 

CGTGAATATT TAGCACTAGG ATTACAATCT ATTTGGGATG AAAATAAGCX5 TTTCCTGAAT 6780 

CCACAAGAAT ATCCAGTCGA TTTAAGCAAG GCATGTTGGG ATAATAAACA TAAACGTATT 6840 

TTTGAAGTTG CGGAACACGT TAAOGAGATG GAAGAAGATA ATGAGTAAAT TACAAGACGT 6900 

TATTGTACAA GAAATGAAAG TGAAAAAGCG TATCQATAGT GCTGAAGAAA TTATGGAATT 6960 

AAAGCAATTT ATAAAAAATT ATGTACAATC ACATTCATTT ATAAAATCTT TAGTGTTAGG 7020 

TATTTCAGGA GGACAGGATT CTACATTAGT TOQAAAACTA GTACAAATGT CTGTTAACGA 7080 

ATTACGTGAA GAAGGCATTG ATTGTACGTT TATTGCAGTT AAATTACCTT ATGGAGTTCA 7140 

AAAAGATGCT GATGAAGTTG AGCAAGCTTT GCGATTCATT GAACCAGATG AAATAGTAAC 7200 

AGTCAATATT AAGCCTGCAG TTGATCAAAG TGTGCAATCA TTAAAAGAAG CCGGTATTGT 7260 

TCTTACAGAT TTCCAAAAAG GAAATGAAAA AGCGCGTGAA CX?rATGAAAG TACAATTTTC 7320 

AATTGCTTCA AACCGACAAG GTATTGTAGT AGGAACAGAT CATTCAGCTG AAAATATAAC 7380 
TAAACGACAA GGTCGTCAAT TATTAGCGTA TCTTGGTGCG CCAAAGGAAT TATATGAAAA 7500 

AACGCCAACT GCTGATTTAG AAGATGATAA ACCACAGCTT CCAGATGAAG ATGCATTAGG 7560 

TGTAACTTAT GAGGCGATTG ATAATTATTT AGAAGGTAAG CCAGTTACGC CAGAAGAACA 7620 

AAAAGTAATT GAAAATCATT ATATACGAAA TGCACACAAA CGTGAACTTG CATATACAAG 7680 

ATACACGTGG CCAAAATCCT AATTTAATTT TTTCTTCTAA CGTGTGACTT AAATTAAATA 7740 

TGAGTTAGAA TTAATAACAT TAAACCACAT TCAGCTAGAC TACTTCAGTG TATAAATTGA 7800 

AAGTGTATGA ACTAAAGTAA GTATGTTCAT TTGAGAATAA ATTTTTATTT ATGACAAATT 7860 

CGCTATTTAT TTATGAGAGT TTTC6TACTA TATTATATTA ATATGCATTC ATTAAGGTTA 7920 

G6TTGAAGCA GTTTGGTATT TAAAGTGTAA TTGAAAGA6A GTGGGGCGCC TTATGTCATT 7980 

CGTAACAGAA AATCCATGGT TAATG6TACT AACTATATTT ATCATTAACG TTT6TTATGT 8040 

AACGTTTTTA ACGATGCGAA CAATTTTAAC GTTGAAAGGT TATCGTTATA TTGCTGCATC 8100 

AGTTAGTTTT TTAGAAGTAT TAGTTTATAT CGTTGGTTTA GGTTTGGTTA TGTCTAATTT 8160 

AGACCATATT CAAAATATTA TTGCCTACGC ATTTGGTTTT TCAATAGGTA TCATTGTTGG 8220 

TATGAAAATA GAAGAAAAAC TGGCATTAGG TTATACAGTT GTAAATGTAA CTTCAGCAGA 8280 

ATATGAGTTA GATTTACCGA ATGAACTTCG AAATTTAGGA TATGGCGTTA CGCACTATGC 8340 

TGCX5TTTGGT AGAGATGGTA GTCGTATGGT GATGCAAATT TTAACACCAA GAAAATATGA 8400 

ACGTAAATTG ATGGATACGA TAAAAAATTT AGATCCGAAA GCATTTATCA TTGCGTATGA 8460 

ACCTCGAAAC ATACATGGTG GATTCTGGAC TAAAGGCATT CGTCGTAGAA AGCTTAAAGA 8520 

TTATGAACCA GAAGAACTGG AAaGTGTAGT AGAaCATGAA aTTCtciAAGTA AaTGAGAaTG 8580 

AAmCAATtGC TGATTGTTTG TCACGAATGA AAtGCAAGGG TATATGCCGG TAAAACGTAT 8640 

TGAAAAACCC GTGTTTCAAG AGCAAAAAGA TGGCACGGTT GAAGTATCAC ATCAAGAAAT 8700 

CGTtTTTGTA GGTAAGAAAA TCCAATAACA TAATCCAATT TAAATAAAGA CTATTTGAAG 8760 

AGGAAAGGCT ATTCAAAGTT TGAGTAATTT TACTTTGAAT AGCCTATTTG TTTATACATG 8820 

CAAGATGCTC GATCCATATT GTATGAGAAA CCCCCAGCAA GCTATATAAA GCATATGCTG 8880 

GGGGTTCTTA ATATTTTAAA AATTATTGTT AGATTATATA TATCGTCGCT TTTTCTAAAA 8940 

CAATCTCATC GCATGAAATT TTTTCTTCCT AGAGACCTTT AATAAGATTA ATAGTTTACT 9000 

TAATCATATC TAGATAGTCT TATGACTTAT GCTTAATGAA AGTCATTCTA GGAGAAGTTC 9060 

CCAAAGCTTC TGTGTTCATA ATTGTTAGTA GTATTTTATT ATCATTTGGT ATAAATATTT 9120 

CAATAACAAT TGAGCTATTA TTTTTATTAT ATAATGTGAG TTGTTTGTGT TCTGTATTTA 9180 
CATTTAAATC TTGAGGATGC CATTCTCCCT CAATAATATT AAGATAATAC TTAGCCTCTG 9300 
AATTACATTT GAATTTATCA ATACTAAATA ATTCAATTTG TTCCATAATA TTATTTACCT 9360 
TTCTAAAATA CAAATTTTAA TAACCATAAA TAGATGAATA CCATCGATAA TGGTCGCCAT 9420 

TGGATACTGG AATAACATTG TTTTTAGCAT CTTGAGTCAT AAAACCATTA TCCCATGGAT 9480 
TCCATATAAT TATAACCTCT TGTCCATTAT CTAATTTAGC GTTCCCAACA ACTGCCATGG 9540 

CATGCCCTGC 6TGCATACCA TTTCTTGATT CTACTCTACT ACCTAAAACA GCAATTCCTT 9600 
TATTATTTTT AGTAAGATTG TCAACTTCAT TATATGTAGT CATTCTATTA AGAAGTTGTG 9660 

GACTTCTTCC CTGAGTTTGT CCAAAATAAA TCATCTCTCT TGGCX3TTAAA CCAGTAAATT 9720 

GGAATCGTTG TCCTTGTAAG TTTGGGTGTA AAAATCTCAT CACAGCTTCT GCATGATATT 9780 
TGTTAGTATT ATAAGTCGCA TTTAGTAATT CAGACATC6T ATAGCCTGCA CACCAACCAT 9840 

TGTTACCTTG AGTTTCTCTT ATCTTGAAAT TCTCAAGTTT ATTTATATAT TGsTC3GTTGT 9900 

AAGTATAATT ATTACTTTTA AATTGACTAG TTGGCATAGT QACAOAAGCT TTTTGCTTTA 9960 
GTTGCGTTAC ATTATTGCCA GTAGGTATAC TCTCAGTCTT TnTnAACTtiT nTATCTTCTA 10020 
GACGTGGTGT TTTTAGTACT AGTTTAGCTT TATGATTTTG AGTACCACAT AGTAACCTTT 10080 
TGAGTTGT 10088 
(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7563 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

CGGAAACGnA CCCnATGCGT ATGCTTGACG TGCCAAAATT AAATACGAAG TTCATAGCTT 60 

TGAGGTACCA GAAGAACATT TATCTGGTCA AGAAGTCGCA GnACTCATAC AAGCAAATGT 120 

TAAAACAGTA TTTAAAACGC TTGTTCTAGA AAATACAAAA CATGAACATT TTGTATTTGT 180 

TATCCCAGTA AGTGAAACTT TAGATATGAA AAAGGCAGCT GCTTTGGTTG GAGAGAAGAA 240 

ATTGCAGCTT ATGCCTTTAG ATAATTTGAA AAATGTAACG GGATACATTC GTGGTGGGTG 300 

TTCGCCTGTT GGTATGAAAA CATTGTTTCC AACAGTCGTT GACAAATCGT GTGAAAATTA 360 

TAGTCATATC AGTGTGAGTG GTGGGCTTCG AACAATGCAA ATCACAATAG CTGTTGAGGA 420 

TTTGATTACA ATAACTAAAG GCAAAATTGG AGCAGTTATC CATGAATGAT TAATAACAAC 480 
TGCCACACTC CTTTTTGATT GAATTAGCAT TTTACGATCA TAAACAGTCA TTATAATTGA 600 
GTATTTGAAC ATAAAAATGT AATTTTATCG TAACAATTTG AGTGTTTGTG ATTGTTTTTG 660 
GTAATTTATG AITGAAAAGT GAAAGCGTAC TCATTATAAT ACAAAGTGAG ATGGGGTGAT 720 
GATGATAATT ACTGaAAAAA GACACGAGTT AATATTAGAA GAACTTTCGC ACAAAGATTT 780 
TTTGACTTTA CAAGAATTAA TAGATCGAAC TGGTTGCAGT GCTTCAACAA TACGArGAGA 840 
TTTATCTAAA CTACAACAAT TAGGGAAATT GCAACGTGTG CATGGTGGTG CAATGTTAAA 900 
AGAAAATCX^T ATGGTTGAGG CGAATTTAAC TGAAAAATTA GCAACGAATC TTGATGAAAA 960 
GAAAATGATT GCTAAAATAG CAGCTAATCA AATCAACGAT AATGAATGCT TATTTATCGA 1020 
TGCTGGtTCA TCTACATrGG AGCTAATTAA ATATATTCAA GCGAAAGATA TCATTGTGGT 1080 
AACCAATGGT TTAACACATG TAGAAGCTTT ACTTAAAAAA GGTATTAAAA CAATTATGCT 1140 
AGGTGGTCAA GTTAAAGAAA ATACACTTGC TACGATTGGT TCTAGTGCTA TGGAGATATT 1200 
AAGACGATAT TGTTTCGATA AAGCTTTTAT CGGGATGAAT GGATTAGATA TTGAACTTGG 1260 
ATTAACTACT CCCGATGAGC AAGAGGCATT AGTTAAACAA ACAGCAATGT CATTAGCCAA 1320 
TCAATCATTT GTACTTATAG ATCATTCTAA GTTTAATAAA GTATATTTTG CTCGTGTACC 1380 
TTTGCTAGAA AGTACGACAA TCATCACATC TGAAAAAGCA TTAAATCAAG AATCGTTAAA 1440 
AGAATACCAA CAAAAGTATC ACTTTATAGG AGGGACTTTA TGATTTATAC AGTGACTTTC 1500 
AATCCTTCAA TTGACTATGT CATTTTTACG AATGATTTTA AAATTGATGG TTTGAACAGA 1560 
GCAACAGCAA CATATAAATT CGCTGGGGGG AAAGGTATTA ATGTCTCGCG CX3TCTTAAAG 1620 
ACATTGGATG TTGAGTCAAC TGCCTTGGGA TTTGCAGGTG GATTTCCTGG GAAATTCATT 1680 
ATAGATACAT TAAATAACAG TGCAATTCAA TCGAATTTTA TTGAAGTTGA TGAAGATACA 1740 
CGTAJTAATG TGAAATTAAA AACAGGACAA GAAACAGAAA TCAATGCACC GGGTCCTCAT 1800 
ATAACGTCAA CACAATTTGA ACAACTGTTA CAACAAATTA AAAATACAAC AAGCGAAGAT 1860 
ATAGTTATTG TTGCTGGAAG TGTACCAAGT AGTATTCCAA GCGATGCX3TA TGCGCA7UVTT 1920 
GCACAAATTA CAGCACAGAC AGGTGCTAAA TTAGTAGTCG ACGCTGAAAA AGAATTGGCT 1980 
GAAAgCGTTT TACCATATCA TCCACTATTT ATTAAACCTA ATAAAGATGA ATTAGAAGTG 2040 
ATGTTTAATA CAACAGTGAA CTCAGACACA GATGTTATTA AATATGGTCG TTTGTTAGTT 2100 
GATAAAGGTG CGCAATCTGT TATTGTCTCG CTTGGCGGTG ATGGTGCTAT TTATATTGAT 2160 
AAAGAAATCA GTATTAAAGC AGTTAATCCA CAAGGGAAAG TGGTTAATAC AGTTGGCTCT 2220 
GGTGATAGTA CAGTTGCAGG CATGGTGGCT GGAATTGCTT CAGGTTTAAC GATTGAAAAA 2280 
CGGGACGCTA TAGAAAAAAT AAAATCACAA GTTACGATTA GCGTACTTGA TGGGGAGTGA 2400 

AAATAATGAG AGTAACAGAG TTATTAACAA AAGATACAAT AGCAATGGAT TTAATGGCAA 2460 

ATGACAAAAA TGGTGTTATT GATGAGTTAG TAAATCAATT AGACAAAGCA GGTAAATTAA 2520 

GTGATGTCGC GTCATTTAAG GAAGCGATTC ACAATCGAGA ATCACAAAGT ACAACTGGTA 2580 

TCGGCGAAGG TATTGCCATT CXACATGCCA AAGTGGCCGC AGTTAAGTCA CCAGCTATTG 2640 

CGTTTGGTAA ATCTAAAGCA GGCGTAGATT ATCAAAGTTT GGATATGCAA CCAGCACACT 2700 

TATTCTTTAT GATTGcAGcG CCAGAAGGTG GCGCCCAAAC ACATCTAGAT GCTTTAGCTA 2760 

AGTTGTCTGG TATTTTAATG GATQAAAATG TACXTTGAGAA ATTATTACAT GCTTCATCAC 2820 

CTGAAGAAGT ACTAGCGATC ATAGATGAGG CTGATGATGA AGTGACAAAA GAAGAAGAGG 2880 

CAGAAGCTGA AGCACAACAA GTTGCAACTG CAGAACAATC ATCTAAACAA TCTAATGAGC 2940 

CATATGTGTT AGCAGTAACT GCTTGTCCAA CAGGTATTGC ACACACATAT ATGGCACX3TG 3000 

ATGCATTGAA AAAGCAAGCG GATAAAATGG GTATTAAAAT TAAAGTAGAA ACGAATGGTT 3060 

CAAGCGGCAT TAAAAACCAT TTAACTGAAC AAGATATTGA AAATGCAACA GGTATCATTG 3120 

TTGCTGCTGA TGTTCATGTT GAGACGGATC GCTTCX3ATGG TAAAAATGTC GTAGAAGTAC 3180 

CAGTAGCAGA TGGTATTAAA CGCCCAQAAO AATTAATTAA TAAAGCATTA GATACAAGTC 3240 

GTAAACCTTT TGTTGCCCGT GATGGTCAAA GAAAAGGTAA CTCAAATGAC AGTCAAGAAA 3300 

AATTAAGCCC AGGTAAAGCA TTCTATAAAC ACTTAATGAA CGGTGTTTCT AACATGTTGC 3360 

CACTTGTAAT ATCTGGTGGT ATTTTAATGG CAATTGTATT TTTATTTGGA GCAAATTCAT 3420 

TTAATCCAAA AAGCTCAGAG TACAATGCGT TTGCAGAGCA GCTTTGGAAC ATTGGTAGTA 3480 

AAAGTGCATT CGCGTTAATC ATTCCAATTT TATCTGGATT CATTGCACGT AGTATTGCGG 3540 

ATAAACCTGG TTTCGCTTCA GGTCTTGTAG GTGGTATGTT AGCAATTTCA GGTGGTTCAG 3600 

GATTTATTGG TGGTATTATT GCAGGTTTCT TAGCAGGTTA CTTAACACAA GGTGTTAAAG 3660 

CCATGACACG TAAGTTACCA CAAGCATTAG AGGGATTAAA GCCAACATTA ATTTATCCAC 3720 

TATTAACAGT GACGGCTACA GGCTTATTGA TGATTTATGC CTTTAATCCA CCAGCATCTT 3780 

GGTTAAATCA TTTGTTATTA GATGGATTAA ACAATTTATC AGGTTCTAAT ATTGTATTAT 3840 

TAGGTTTAGT TATTGGCGCT ATGATGGCGA TTGATATGGG CGGTCCATTC AACAAAGCGG 3900 

CATATGTTTT TGCAACAGGT GCGTTGATTG AAGGTAATGC AGCACCAATT ACAGCTGCAA 3960 

TGATTGGTGG TATGATTCCA CCGTTAGCAA TTGCGACAGC GATGTTAATT TTTAGACGTA 4020 

AATTTACAAA AGAACAACX5T GGTTCAATTA TCCCTAACTA TGTGATGGGT ATGTCATTTA 4080 
TGATrGGTTC AGGTATAGGT GGCGCAATTG CTTTAGGCTT AGGTTCACGA ATTACTGCGC 4200 
CACATGGTGG TATTATTGTA ATTGTTGGTA CTGATGGTGC ACACTTACTT CAAACTCTTA 4260 
TTGCACTTCT AGTTGGCACA TTAGTTTCAG CATTAATTTA CGGTTTAATC AAACCAAAGT 4320 
TAACTGAAAC AGAAATCGAA GCTTCAAAAT CAATGGACGA GTAGTTTTAA TGATGTAAAA 4380 
TGATTGTTAG CAAAGAGCTT CATATTAAGT TGTATGTTCA ATGAATATAT GTTAGTTTTA 4440 
TATATCGTGT TAACGGTAGC TTATACAAAG CTGTAAAAAC ACTTTCTATT AATTCAGTTT 4500 
TTATGAATTG ATATGAAAGT GTTTTTATTT TTAGATAAAT GAATGAAGAA ATAGACACCA 4560 
CAAATGTATA GACTTTTTTA ATATTTTGCA AAAAGTTATG CCAAACGAAG CAGATATAGT 4620 
AAAATATGAG TGTCTTAAAG TGAAAATTTA TAAATAAAGA AGGGTTTATA CGTGTCAGAA 4680 
TTAATTATAT ATAACGGCAA AGTTTATACT GAAGATGGCA AAATCGATAA TGGTTACATT 4740 
CATGTGAAAG ATGGACAGAT TGTTGCAATT GGAGAAGTGG ATGATAAAGC AGCAATTGAT 4800 
AATGATACGA CAAATAAAAT TCAAGTGATT GATGCTAAAG GTCATCATGT ATTACCAGGT 4860 
TTTATTGATA TACATATTCA TGGTGGTTAT GGTCAAGATG CAATGGATGG GTCATACGAT 4920 
GGCTTAAAAT ATCTATCCGA AAATTTGTTG TCTGAAGGGA CGACATCATA CTTGGCCACT 4980 
ACAATGACGC AATCGACTGA TAAAATAGAT AATGCACTTA CAAATATTGC TAAATATGAA 5040 
GCGGAgCAAG ATGTTCACAA TGCAGCGGAA ATTGTAGGTA TACATTTAGA AGGACCATTT 5100 
ATATCTGAAA ATAAAGTTGG TGCTCAACAT CCGCAATACG TTGTACGCCC ATTTATCGAT 5160 
AAAATTAAAC ATTTTCAAGA GACTGCTAAC GGATTAATAA AGATTATGAC GTTTGCACCT 5220 
GAAATTGAAG GTGCAAAAGA AGCGCTTGAA ACGTATAAAG ATGACATTAT TTTTTCAATT 5280 
GGTCATACAG TAGCAACATA CGAAGAAGCA GTTGAAGCTG TTGAGCGAGG AGCTAAACAT 5340 
GTCACGCATT TATATAATGC AGCGACGCCA TTCCAACATA GAGAACCAGG TGTTTTTGGA 5400 
GCAGCATG6T TGAATGATGC TCTACATACC GAAATGATTG TTGATGGCAC TCATTCTCAT 5460 
CCGGCATCGG TTGCAATTGC TTACCGTATG AAAGGTAATG AACGTTTTTA TTTAATTACC 5520 
GATGCAATGC GTGCAAAAGG TATGCCTGAA GGAGAATATG ATTTGGGTGG ACAAAAAGTA 5580 
ACTGTTCAAT CGCAACAAGC ACGTCTTGCA AATGGTGCGC TTGCTGGTAG TATTTTAAAA 5640 
ATGAATCATG GGTTACGTAA CTTAATATCA TTTACAGGTG ATACATTAGA TCATTTATGG 5700 
CGAGTAACAA GTTTAAATCA AGCCATTGCA TTAGGTATCG ATGATAGAAA AGGTAGTATT 5760 
AAAGTAAATA AGGATGCAGA TCTTGTTATT CTAGATGATG ATATGAATGT AAAATCTACA 5820 
ATAAAACAAG GCAAGGTTCA CACATTTAGC TAATAAATAA TCATAATTAA ATGTATGCAA 5880 
TTTTCTGGGG GTGTCTAAAT GGGAAGGCGA TAACATGTAG TTGTAATTTA AGTCATAGTG 6000 

ATAAATTTGA ATGCGTGTTA CCCATGAGTG ACACATATAA CATGGAGGTG AATCCCTAGA 6060 

AATAGGGAAT TAATTGGAAA CTTCGACCAT AATTAGTTTG ATTATATTTA TTCTATTAAT 6120 

TGCATTAACC ACTGTATTTG TTGGTTCAGA ATTTGCATTA GTAAAAATTA GAGCAACAAG 6180 

AATTGAACAG CTAGCAGATG AAGGAAATAA ACCTGCTAAA ATAGT/^AAAA AGATGATTGC 6240 

TAATCTAGAT TATTATCTTT CTGCTTGTCA GTTAGGTATA ACAGTAACAT CTTTAGGGTT 6300 

AGGTTGGCTT GGTGAACCAA CX3TTTGAAAA GCTATTACAC CCAATATTTG AAGCAATCAA 6360 

TTTACCAACT GCATTAACX3A CGACGATTTC GTTTGCAGTG TCATTTATAA TCGTTACGTA 6420 

TTTGCATGTA GTACTTGGTG AATTAGCGCC TAAATCTATA GCTATTCAAC ATACTGAAAA 6480 

GCTTGCTTTA GTATATGCAA GACCATTGTT CTATTTCGGT AACATTATGA AACCATTGAT 6540 

TTGGCTGATG AATGGTTCTG CACGTGTTAT TATTAGAATG TTTGGTGTAA ATCCTGATGC 6600 

CCAAACTGAT GCAATGTCAG AAGAAGAAAT CAAAATTATT ATTAACAATA GTTATAATGG 6660 

TGGAGAAATC AACCAAACTG AATTGGCATA TATGCAAAAT ATCTTTTCAT TCGATGAAAG 6720 

ACATGCAAAA GATATAATGG TACCTAGAAC TCAAATGATT ACACTAAATG AACCTTTTAA 6780 

TGTAGACGAA TTACTAGAAA CAATAAAAGA ACATCAATTT ACGOGTTATC CAATTACTGA 6840 

TGATGGTGAT AAAGACCACA TTAAAGGATT TATTAACGTC AAAGAATTTT TAACT6AATA 6900 

CGCTTCTGGA AAAACGATTA AAATAGCAAA CTATATaCAT GAGTTGCCAA TGATTTCAGA 6960 

GACAACACGT ATCAGTGATG CATTAATTAG AATGCAACGT GAACATGTAC ATATGAGTCT 7020 

TATTATAGAT GAATATGGTG GAACGGCAGG TATTTTAACG ATGGAAGATA TTTTAGAAGA 7080 

AATCGTTGGA GAAATTCGTG ATGAATTTGA TGATGATGAA GTGAATGATA TCGTTAAAAT 7140 

TGAT5ATAAG ACATTCCAAG TAAATGGCAG AGTACTATTG GATGATTTAA CTGAAGAGTT 7200 

CGGTATAGAA TTTGATGACT CTGAGGATAT TGATACGATA GGTGGATGGT TACAATCTCG 7260 

TAATACCAAT TTACAAAAAG ATGATTACGT GGATACAACT TATGATCGCT GGGTTGTTTC 7320 

AGAAATCGAT AACCACCAAA TTATTTGGGT GATATTAAAC TATGAATTTA ATGAAGCGAG 7380 

ACCTACTATC GGACAGTCTG ATGAAGATGA AAAATCAGAA TAGATATTAA TATATAAACC 7440 

AACTAAGAAT GATTTAATTC ATTTTTGGTT GGTTATTTTT TTGACTAAAA TTAAnGAAAA 7500 

GTGAAAATAG TATTGGAACT CAATATCTTT AATGATTTAA TGAATAAnTT TTATTGAAAG 7560 

CGA 7563 
(2) INFORMATION FOR SEQ ID NO: 34: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3492 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

TTATATCAAC TTCATGGCGG AACCATTGAT GACCCATTAG ACGAAACAAT AAGCGCATTT 60 

SATGAATTGA AACAAGAAGG AATTATACGT GCTTACGGTA TTTCTTCTAT TCGCCCAAAT 120 

GTAATTGATT ATTATTTAAA ACATAGTCAA ATCGAAACGA TAATGTCTCA ATTCAATTTG 180 

ATTGATAATC GTCCAGAATC ATTATTAGAT GCAATTCACA ACAATGATGT TAAAGTATTG 240 

GCAAGAGGAC CTGTGTCTAA AGGATTATTA ACTTCAAACA GTGTTAATGT GCTCGACAAT 300 

AAATTTAAA6 ATGGTATTTT TGATTATTCT CATGATGAAT TGGGTGAAAC AATAGCCTCT 360 

ATTAAAGAAA TTGAAAGTAA TTTATCTGCA TTGACATTTA GTTATTTAAC ATCACATGAC 420 

GTGCTTGGTT CCATCATTGT AGGTGCAAGT AGCGTCGACC AATTAAAAGA AAATATTGAA 480 

AACTATCATA CTAAAGTTAG TTTAGATCAG ATTAAAACAG CAAGAGCTCG TGTAAAGGAT 540 

TTGGAATATA CCAATCATTT AGTGTAGAAG TCATTTTCAG TAATAAAAAC AGCAGCATGA 600 

GGCGTTTCAT TATAAAAATG CCTTACTGCT GTTGTTTATG TACAATTCGC TATAATTTAT 660 

GATTATGATT ACTCACTTAT GATAGAAATT AAAGCGTTGT CCTCACGCAT CAGTATTTAG 720 

TAATTTCGCC TTGCGGCATT GCCTTAAGCA AACTTCTGCC ACTTCATCTC TTAATAATTT 780 

TATTAAAACA TCTTTCTATA TTTCACTTCG CATGTTGATT CATCATTATT AGTTATTATT 840 

TGTACACCCA GCACATTTCC TTGCAACACA AGTAGTTTGA ATTTTTCACA AGTATAATAT 900 

AATGTACCGT CTGAAATTTG GTCTACAGAA ATATCGCCTA AAATATCCAG CACTGTAAAT 960 

TCTTCAAATA CTGATAGTTG TTCCGCATAT CGTACACAAA GTCTTACCAC ACTCTCCGAT 1020 

TGACAGTTCA TTGCCATCCC ACCTATTTAT GCTTTATTTT TAAATAATTT AGGGAAACAT 1080 

CGTTCAAAAA ATCTAGGCGC AATTTGATAC ATTTTCAACG CATGaTGCAT CCATTTAGGC 1140 

CGATTAATTT CCAATTGTTT TGTTTTAATG CCATAAATGA TATCTTCTGC AAGCTGATTA 1200 

GCATCAAGCA TAATTTCGCC CATCTTTTTA gCATACTTCA TTGATGGGTC GGCTTTTTGA 1260 

TGAAAAGGTG TATCAATCGG GCCAACATTA ACTGTCATGA TATGTAAGTT TGGTGACTCT 1320 

AGTCTTAAAG CATTCATTAA TGCATAAAAC CCTGCTTTCG ATGCCCCATA ATGTGCAGCA 1380 

TTTGCTTGTG TGGAAAATGC AGCTTGACTT GAAATACCTA CAATATGTGC GTTAGATGTT 1440

AAATATGGTC TCAACACAGT ATATAAAACA TTAAAACTAA TTAAATTAAG CTGATACGTT 1500 
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TAAATGAATC CATCGAATGA TGTATTGTCT TCAAATTGCA GTGCCTGTAT CGACTTCAAA 1620 

TCATTTAAGT CACAAGGAAT AACATTTATA GTTTTCCCCA ATTCCTGTTC AAAGATTCTA 1680 

GTTGCTTTAT CAACATCACG CACCAACAAC GTTACATGCA CTTTATTTTC TAGTAACTTT 1740 

CGGACAATCG ATAAACCTAA ACCACTCGTA CCACCAGTCA CTATAAAATG TTGTCCTTTC 1800 

ATCAATTAAC CTTCCTTTTC AATTATATAG AATGCAATTT ATCAACTTTA CATAATTGAG 1860 

ACAAGTTGAT TATCTTTCCT AATATATATA CAATAATAAG AAAATATAAC ATACAAATCA 1920 

AAAACTAAAG GGATGTGaCG TTAATGrAAC TCX3TATTTTA TGGAGCTGGT AATATGGCAC 1980 

AAGCTATATT TACAGGrATT ATTAACTCmA GCAACTTAGA TGCCAATGAT ATATATTTAA 2040

CAAATAAATC TAATGAACAA GCTTTAAAAG CATTCGCTGA AAAACTAGGT GTTAACTATA 2100 

GTTATGAtGA TGCGACATTA TTAAAAGATG CAGAyTATGT ATTTTTAGGT ACCAAACCAC 2160 

ATGACTTTGA TGCTCTAGCA ACACGCATCA AACCACATAT TACAAAAGwC AATTGCTTCA 2220

TTTCAATTAT GGCAGGTATT CCGATTGATT ATATTAAACA ACAATTAGAA T6CCAAAATC 2280 

CaGTTGCTAG AATTATGCCA AACACAAATG COCAAGTTGG ACACTCTGTT ACTGGCATTA 2340 

GTTTTTCAAA CAACTTTGAC CCTAAATCTA AAGATGAAAT TAACGATTTA GTTAAAGCAT 2400 

TTGGTTCTGT AATTGAAGTA TCAGAAGATC ATTTACATCA AGTAACAGCT ATCACCGGAA 2460 

GCGGCCCAGC ATTTTTATAT CATGTATTCG AGCAATATGT TAAAGCTGGT aCsAAACTTG 2520 

GTCTAGAAAA AGAACAAGTT GAAGAATCTA TACGCAACCT TATTATAGGT ACAAGTAAGA 2580 

TGATTGAACG TTCAGAtTTG AGCATGGCTC AATTAAGAAA AAATATTACC TCTAAAGGTG 2640 

GTACGACACA AGCTGGCCTT GATACATTGT CACAATATGA TTTAGTATCT ATTTTCGAAG 2700 

ATTGTCTAAA CGCTGCCGTC GACCGTAGTA TTGAACTTTC TAATATAGAA GACCAATAAA 2760 

AACAI^CCCG CCAACACATG TATGCATCAT CGCAAGCACT GTGTTTGACG GGTTATmT 2820 

ATAATTTATT GTTATTTGGC AAGCATTGTT TATTACTTTG TCATTAGATT TTAAAACTAT 2880

CAAAATCTTT TACAAAATTA AAATTAGGTG TATCTTCATT TTGTATCAAT GTTTGATAAA 2940 

TTTCATTTAT ATCTTCTGTA TTATAGCGAT TGCTCAAATG TGTAATCAAC GTACX5TTTAA 3000 

CATTGGCTTC TTTTATCAAT GCAAATACGT CTTCAATATG GCTATGATGA TAATTGTTGG 3060 

CTAAATGCTT TTCACCATCT ATATAGGTCG CTTCATGTAC CATCACATCA GCATCTCTAG 3120 

AAATCACACG TTCATTAGAA CATGGTTTTG TATCACCAAA AATTGCTACA ACTGGACCCT 3180 

GTTTGGACTC ACCTCTAAAA TCTTTTGATT GATAAACTTG ACCATTATGT TCAAATGTAT 3240 

CATGAGATTT TACTTCTTGA TATTTAGGAC CTGGTTCAAG ACCAATGTTT TTTAACGCTT 3300 
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CATGATTAAG TAAATGCGCC TCTACAGTAA AACCATCCAT GATGATATGT CAGATGATCA 3420 

TCGATTTCAA TATATGtAAT TGGATAGTTT AAATGTGACT CTGATAAATT CATAGACATT 3480 

TCCACATATG CT 3492

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1973 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS : double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

ATCTAGCGGT ACAAGC6TCT TGGAGGCTAG TATGTTGAAC ATTGTAAACC CTGAA6ATCA 60 

CTTCGTTGTC ATTGTTTCAG 6TGCCTTTGG TAACCGATTT AAACAAATTG CACAAACTTA 120

TTACAAAAAT GTGCATATTT ATGACGTAAC ATGGGGAGAA GCTGTAGATG TCAAAGATTT 180 

CATCAATTTC CTTTCAACTT TAAATGTTGA AGTTAAAGCA GTATTTAGTC AATATTGOGA 240 

AACATCTACG ACAGTGCTAC ACCCTATTCA CGAGTTAGGA AATGCCATTA ATCAATTTAA 300 

TAGTAATATT TATTTTGTAG TTGAOGGCGT AAGTtGCATT GGTGCTGTTG ATGTTGACAT 360 

TAACAAAGAT AAAATTGATG TACTTGTTTC TGGTAGTCAA AAAGCAATTA TGTTACCTCC 420 

AGGATTAGCT TTTGTAGCTT ATAGCCACCG TGCAAAAGAA CATTTCAAAG AAGTAACTAC 480 

GCCAAAATTT TATCTAGACT TAAATAAATA CATTTCGTCA CAAGCTGACA ATTCTACACC 540 

GTTCACACCA AATGTGTCTT TATTTAGAGG TGTAAATGCA TACGTTGAAA CCGTAAAAGC 600 

AGAAGGTTTC AATCACGTAA TAGCACGACA CTATGCAATT AGAAATGCAT TAAGAAGCGC 660 

CTTAAAAGCA TTAGATTTAA CTTTATTAGT CAATGATAAA GATGCATCTC CAACGGTTAC 720 

AGCATTCAAA CCTAATACAA ATGATGAAGT GAAAATAATC mAAGATGAAC TTAAAAATnG 780

CTTTAAAATA ACAATTGCnG GTGGTCAAGG CCATCTTAAA GGTCAAATTT TnAGAATTOG 840 

TCATATGGGG AAAATTAGTC CTTTCGATAT TTTATCGGTA GTATCTGCTT TAGAAATTAT 900 

TTTAACTGAA CACCGTAAAG TTAACTATAT CGGTAAAGGT ATATCAAAAT ATATGGAGGT 960

TATTCATGAA GCAATTTAAT GTACTCGTTG CAGATCCCAT ATCAAAAGAT GGTATCAAAG 1020 

CATTATTAGA TCACGAACAA TTCAATGTAG ATATTCAAAC TGGCTTGTCC GAAGAAGCAT 1080

TAATCAAAAT TATACCTTCA TACCATGCTT TAATCGTTOG TAGTCAAACT ACGGTTACTG 1140 

AAAATATCAT AAATGCTGCT GATTCTTTAA AAGTAATCGC ACG06CCGGT GTTGGTGTAG 1200 
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GTAATACGAT TTCAGCTACT GAACATACAC TGGCAATGTT ATTATCAATG GCACGAAATA 1320

TTCCGCAAGC ACACCAATCA CTTACAAATA AAGAATGGAA TCGAAATGCA TTTAAAGGTA 1380

CTGAGCTTTA TCATAAAACA TTAGGTGTCA TTGGTGCTGG TAGAATTGGT TTAGGTGTTG 1440

CTAAACGTGC GCAAAGTTTC GGAATGAAAA TACTAGCTTT TGACCCTTAC TTAACGGATG 1500

AAAAAGCAAA ATCTTTAAGC ATTACGAAGG CAACAGTTGA TGAGATTGCC CAACATTCTG 1560

ATTTOGTTAC ATTACATACA CCACTAACAC CTAAAACAAA AGGCTTAATT AATGCTGTCT 1620

TTTTTGCCAA AGCAAAACCT AGTTTGCAAA TAATCAATGT GGCACGTGGT GGTATTATTG 1680

ATGAAAAGGC GCTAATAAAA GCATTAGACG AA6GACAAAT TAGTCGOGCA GCTATCGATG 1740

TGTTTGAACA TGAACCTGCA ACTGACTCGC CTCTTGTTQC ACATGATAAA ATTATTGTTA 1800

CACCTCATTT GGGTGCTTCA ACAGTCGAAG CTCAAGAAAA AGTGGCAATT TCTGTTTCAA 1860

ATGAAATCAT CGAAATTTTA ATTGATGGTA CTGTAACGCA TGCAgTGAAT GCACCTAAAA 1920

TGGACTTAAG CAATATAGAT GATACT6TAA AATCATTCAT CAATTTAAGC CAA 1973




(2) INFORMATION FOR SEQ ID NO: 36:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 7620 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS : double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36:






GGTGTTTCAG ATGTCACTGG TTGATTTTTA ATTGTAGACG GGTATTTTGG GCTTTCGCCA 60

TATTTATTTG CCGGCTTACT GTCAAAGCAT AGGAATACTA TCATAACAAT TGTTAGGCCT 120

AAATj;AACAA AATAAAGAAG TACTAACAAA ATATTAAGAC CCATCGGCAT TAATGTAAAA 180

TCACTGTCAT AATAACTATC GATAATCTGT AATACTATAT AAAATATAAT ACTGAATACT 240

GTCATAATCA TTGGAAATAA CATTGTTCTT GATATATCGT GAAATCTTCG AACGCACAAC 300

GCTAAATTTG GAATAAACGT TGCCAAACTA TAGACAAAAG TATACACAGA TGTAAGGATA 360

ATCATCAATA TACTCATAAC TATTAATGTT TCGTTATCCG CCGCTATAGA AA 1. AAAIoMA i

AGAAATAGGT TTATTATTAG CACACACACA GCTGGAACCA TAA6TATCAA ATGCCATAGT 480

GCCATATACC AATATTCACT ACGTCTTGAT CTCCCCTTAA AATTTACATA ATTTTTCCAA 540

AATAAAACGA ATGATTTCAT AAAACCTACT TGAGGTAATT GTTCCATTGT AATCTCCCTT 600

TCGTTAATCA TATTTATATT TTTAATTATT GTTACCGTTA TAATTTACAA GATTCATTAT 660

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GTAAAATGAA AACCCGCTAC AAGTACACAT CTATATGGAG ACTCATTTGA AAGTCAACGC 780 

TTCGTTAACT ATACTAAAAA TATGTCATAC TGCAATGTTC ACGTTTAAAA GAGTCTCAAT 840 

CTATGCAAAT AAAATATTCC ATAACAAAGT ATATACTTTA CATTTTTATA ATTCTTAACA 900 

ATACTATTTT ATCAAACATT TACCACAATA AAAATATCTT TTTCATTTTT ATTTAAATTA 960 

ATCATATAAT TGCGAGGAGA ATATTATGGA TTTCGTTAAT AATGATACAA GACAAATTGC 1020

TAAAAACTTA TTAGGTGTCA AAGTGATTTA TCAGGATACC ACTCAAACGT ATACAGGCTA 1080 

CATCGTGGAA ACGGAAGCTT ACTTAGGTTT GAATGATCGT GCGGCTCATG GCTATGGCGG 1140 

TAAAATAACA CCTAAAGTCA CGTCATTATA TAAACGTGGT GGTACAATTT ATGCACATGT 1200 

CATGCATACG CATTTACTCA TTAATTTTGT AACAAAATCT GAAGGTATAC CTGAAGGCGT 1260 

ACTTATCOGC GCAATTGAAC CAGAAGAAGG TTTATCCGCT ATGTTCCGTA ACAGAGGTAA 1320 

GAAAGGCTAC GAGGTAACGA ATGGCCCAGG AAAATGGACT AAGGCATTTA ACATTCCACG 1380

GGCTATCGAT GGCGCTACGT TAAATGACTG TAGATTGTCT ATTGATACTA AGAATCGTAA 1440 

ATATCCTAAA GATATTATTG CTAGTCCACG AATCGGTATT CCAAATAAAG GTGATTGGAC 1500 

ACATAAATCT TTACGTTACA CAGTGAAAGG TAATCCATTT GTGTCTCGCA TGCGTAAATC 1560 

AGATTGTATG TTTCCCGAAG ATACTTGGAA ATAAATGCCA TCTTTCATTG ATTACTATCA 1620 

TGAAAATGAA ATCTATCTCC TTATAAGTCA ATCAATCGTG CCGTCAACAT GCGGATGGGT 1680 

TGATTGTTTT TCTTTGTATC CATCATATTT TTTGATTCAT CTCCTCTTAT TGAACTTGTT 1740 

CTTAATTATA AAATATAACA ATAGAATTAT TTATAATTAT TAAATTTAGA TGCATTAATA 1800

TTATTGATAT TATTTTCAAA AACTAGAAAT ATTGATTTGT TGCATGTATA ATGTTAAAAG 1860 

CGCCCTTTTA TAACGCTTAC ATATAAAAGC TTATTTAGGG AGAGGGATAT TCAACAAGGG 1920 

GGATTTGAAA ATGATAGAAC TTAATGCAAT TACAACATTA TGTTTAGCTT 6TATCCTTTA 1980 

TTTACTTGGT AAGGCTATCG TTAATCACGT TAATTTTTTA AAACGTATTT GTATACCAGC 2040

ACCAGTGATT GGCGGCTTAA TCTTTGCTAT TTTAGTTGCG GCTTTGGATT CATTTGGCAT 2100 

GGTTAAGATT AAATTAGATG CTTCATTCAT TCAAGATTTC TTCATGTTAG CATTCTTTAC 2160 

GACAATCGGT CTTGGTGCAT CATTGAAATT ATTTAAATTA GGTGGCAAAG TCTTGCTATT 2220

ATACTTTATG TTTTGTGCTA TCATTTCAGT CATTCAAAAC ATAGTTGGTG TATCACTAGC 2280 

AAAAGTATTA AATATTAAAC CTTTGTTAGG ATTAACAGCA GGTTCCATGT CTATGGAAGG 2340 

CGGTCATGGT AATGCTGCTG CTTATGGTAA GACAATTCAA GATTTAGGTA TTGATTCGGC 2400 

ACTGACAGCG GCTCTTGCAG CTGCAACTTT AGGTCTTGTA TTTGGAGGGC TTATCGGTGG 2460 
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ATTTAAAGAT TATAGCCAAG TAGCATATAA CGAACATTTA CATAGTAAAT TTAATGCCAC 2580 

TGAAGTATTC TTCATTCAAT TTACAATCX5T TGTATTCTGT ATGGCAGTTG GAAGTTATTT 2640

CAGTCATTTG TTTACAGCTC AAACAGGGAT TAATGTTCCA ATTTACGTTG GCTCATTATT 2700 

TGTAGCTGTT ATTGTCCGAA ATATCTCTGA AAGTTTTAAT TTTAATATTG TAGATTTAAA 2760 

AATTACTAAT CAAATTGGCG ATGTCGCATT AGGTATTTTC TTATCTCTTG CGCTAATGAG 2820 

CATTCAATTA ATCGAAATTT ATAAACTTGC TATACCTCTT ATTATTATCG TTTTAGTTCA 2880 

AGTTGTCGTT ATGATTTTAT TTGCTQTTTT AATTTTATTT AGAGGTTTAG GAAAAGATTA 2940 

TGATGCTGCA GTAATGGTAG GTGGTTTTAT CGGTCATGGG CTTGGTGCAc GCCAAATGCC 3000

ATGGCAAATT TAGATGTTAT TACTAAAAAA TATGGAAACT CACCTAAAGC ATATTTAGTT 3060 

GTACCTATTG TTGGTGCATT CTTAATCGAT TTAATTGGTG TTATAGTCAT TATGGGATTC 3120 

ATACAATGGT TTAGTTAAAC ACCAAACTCA TAAATAAAAG AGGAGGCCTT CGCCTCcTcT 3180

TTTATTTATC CTCGATGTAT ATTCAAGTTA CX3TTGTTCTA TCCATGACAA TATTTCCGGA 3240 

CTAAATACGA TTTGTTTTTG TGTTAAGTCG TCAATATTTT TAGCATCTAA CATCGTCATT 3300 

ATTGATTTCA TGTGTTCAAT AAATGATTCT ACATAAGCTA CTGTATGTGC AATGCCATTA 3360 

TTTTCAACTT GATTTAAAAA CGQACGTGAC ATACCAGTTG CCTTTGCACC AAGTGCTAAA 3420 

CTTTTAATTG CATCGAGTGG TGTACGTAAA CCACCACTCG CGAAAACTGA AATTTCGCTT 3480

TGATAAGCCG TTGTTTCAAG TAATGACTCA ACTGTAGACT GTCCCCATGA TGATAAGTAA 3540 

TCCATATCTT TATTTGCACG ACGTTCATTT TCAATATCTA CAAAGTTAGT ACCACCTTTG 3600 

CCACTAACAT CGACATACTT GACGCCTATT TGTTGTAAGT CATGCATTAA TTCTTTGCTC 3660

ATACCAAATC CAACTTCTTT TATAATGACT GGAACAGACA CTCGTGATAC AATCGACGCT 3720 

ATAia'ATCTA ACCAAGTCAC AAATTCACGA TTCCCTTCAG GCATAACTAA TTCTTGAGGA 3780 

GAATTAACAT GGATTTGTAA CGCTTGTGCC TCAAGTAATT CAACTGCTTC CAAAGCCTTT 3840

TCTACTGGTA CX5TCCGCACC AACATTGCTA AAAATCATGC CTTCAGGATT CATTTTTOGC 3900 

GCAATCGTAA ACGTCTCAGC CATGCGTGGA TTTCTCAATG CCGCATGTGT TGATCCAACT 3960 

GCCATCGCTA AGCCAGTTTC TCTTGCAACT ACAGCTAGCT TTTCATTGAT GTTTTTCGTC 4020 

CACTCGCTAC CACCCGTCAT TGCATTAATA TAAACCGGAT ATGCCATCGT TAAGTCAGGC 4080 

GTCTGTGATG TCAAATCGAT ATCATTTACA TTAATTGATG GGATAGAATG ATGCACAAAA 4140 

CGCATCTTAT CAAAATCTGA ATGCATTGCG TCAGATTGGG CCATTGCTAT TTCAACATGT 4200 

TCATTTTTTC TCTGTTCTCT TTGAAAATCA CTCATOATTA AACCTACCTT TTCGTCATTT 4260 
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ATTACAGCTA AGCAAATATA ATATCCATAA TGTAAATGTA ATGCCGGCAT ATTTACAAAG 4380 

TTCATACCAT AAATCCCAGC TATGAATGTT AACGGTGAAA ATATAACTGA TACTAATGTC 4440 

AGTACTTGCA TAATACTATT CATTCTAAAT GACGTGTATG ACTCAAAATT TTCTCGTATT 4500 

TCGTTTGTCA TTTCTTGAGC AGTACGAATG ATATTACGTT GCTTAATCAA GTGGTCATCG 4560 

ATATGTTGAA TGTATAGCGA ATGTTTATTA TCTATAATCA AATCACCATT TTGTTTCATT 4620 

6TATCAATTA GCTCTTGCAT AGGAAACAGT ACACGTTTTA CTTTAATCAA ATCCGAACGT 4680 

AACTTAAAGA CACTATCCAT GACCATTTTA TTAAAGCX3AT CATCTACATG GCGGTCTTCA 4740 

AAATGATAAA CACTATCTTC AAGTGCATAT ACAAAGTTGA AATATTTATC AACCATCATA 4800

TCTAAAATTA ATATGACGAC ATCTGCACAA TCTAATTCTG CATCTAATGT ATTCATATAC 4860 

TTATAGACTA CTTTATTTAA TGATTCCAAC GTTTGATGAT GATATGTTAC TAATACATTG 4920 

TCTT6TATAA AAATATTTAG TGCTATTGGT GAATAGTTTG ACCCCATAAT ACTATGGAAT 4980 

ACTAAGTATT GATAATCTTT ATAAGATTTA TATTTAQCTC QTGGCATACC GTTAATTGCA 5040 

TCATCCACTT CTAAATCATT AAAATTAAAA TGTGCTTTAA ACCATTCATT TTCTTGTTCA 5100 

TTCGGTTCAT CAAAATCATA CCAAACAATA GTCGCATCTT TTGGTATCTC TTTGATATCA 5160 

TCAACTACTT TAAACGGTTC ATATGTAGTT TGATACCGTA TCTTTAAAGC CATCGATACT 5220 

CCCCCTAAAT AACGAATTCT CTATTATTTT ATCATGAATT AAATAACGTG TATGTCTTAA 5280 

TTTATTTTAG TATGATAGTC ACTAAGGAGA TGGTTATTAT CAAACAACTT TTTACACATA 5340

CTCAAACCGT AACATCTGAA TTCATTGACC ATAACAATCA TATGCATGAT GCAAATTATA 5400 

ATATCATTTT TAGTGACGTC GTGAATCGTT TTAATTACAG CCACGGTCTT TCTTTAAAAG 5460

AACGCGAAAA TTTAGCATAT ACGCTATTTA CACTAGAAGA ACATACGACA TACCTCTCAG 5520 

AATTCTCTCT TGGCGATGTA TTTACTGTTA CTTTATATAT TTATGATTAC GATTATAAGC 5580 

GGTTGCATTT ATTTTTAACA TTAACTAAAG AAGATGGTAC ACTAGCATCA ACAAATGAAG 5640

TAATGATGAT GGGAATTAAT CAGCACACAC GTCGTTCTGA TGCTTTTCCT GAATCATTTT 5700 

CAACACAAAT AGCACACTAT TATAAAAATC AATCAACTAT CACTTGGCCT GAACAATTAG 5760 

GACATAAAAT AGCAATTCCA CACAAAGGAG CATTAAAATG ACAGATGCAT TACAACAAAA 5820

GATTCATATC GAATTACTAG ATTTATTAGA TGATGTTAAG TTTGAATTAA CAGAATTAAA 5880 

TGCACAAAAA GGGTTATACA TTAACGGACC AGCAAATCAG CTACTTAAGC GTGGCGTGCA 5940 

TATGGCTTAT GTTCAAGGAC AAAAGCAAGC CATCGATAAT ATTATGACTA TTGTGGAACA 6000 

ACAGCTTGAA AGATCAACAT TTCCTAGAAC ATTATGATAA ATTTCAAAAT GAGGTTGCTC 6060 
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ATAATTTTTT AGATCAATTT TATCAAATTA AAGGGCAATA CTTTATCATC ACACATATCA 6180 

ATACACTTAT TGGTGATTTT CACTCAGAAG CTCATTAACA ATTAGTCTAT ATAACCCTTG 6240 

CTATATTTTC AAAAACAAAA CCCAATTACG TTTTCATGTC AAATATCATC TTGCATGAAA 6300 

TCGTAACTGG GTCATTTATA TGTTATTAGT TATTTTGTGT TACATCCTCA TCTATCGATT 6360 

TGGCAATTTG TTTAATAGCT TTATGTGATT GTCTAATTGG ATAAATTGGA AAATCATGTA 6420 

CCATCTTAGG ATAATCATAA AACTCAATGT ATTGATQATG TTGCAACATC ATTTGTTCAA 6480 

ATAGCTTCAT ATCAGGATGT GTCATTTCAC GTCCACCACC AAACATATAA ACTGGTGGCA 6540 

ATCCTTCTAT TGTGCCATTA ATTGGCGATA TGCGCTTATC TGTTAATGGT AGGCCATTCG 6600

CCCATTTTTT CATAATCTCA TTGACACCAA ACTGACTTAG aACCGCATCT TGTTCGATTA 6660 

AGGCGTCCGA AATATCTTTA TTAGATAGTG TTGCATCTAA AATTGGTGAG ATTAAATACA 6720 

ATTTATTCGG TAATGGCTGT TGATTAkCTA AAAGAGATTG TACAAAGGAT AATGCCAGTG 6780

CACCACCTGA ACCATCACCC ATGACTACGA CATTTTGATG TCCTACTTCA GATACTAATT 6840 

GaTCATAAAC ACGTTGTATC GCTTGGnAAA GTATCGTCaA TATGnAAACT CTGGTGTCTT 6900 

TGGATAGATA GGCAGTACAA CCTCATATAA TGtACTTAAA GTGATTTTAT CCCAACAATC 6960 

TCCAATGGAA CGGTGATGGT TGTAGTGCAT TGAATCCACC GTGAATATAT AAAATTTTCT 7020 

TATCAATTTG ATGTCTGAAA TTAAAGCGAA AGACTTGCAT ATCATCTAAT GACAATTTTT 7080 

CTAAATTTGC TTTAACATTT AATGTTGAAG GCTGCTTATG TTTTTTTCTA TTTTCAATTT 7140 

CTCTTTTATA AAAAAATCTT TCAACATCTT GATCATTTTT AAACATAATC GAGCGATTGT 7200 

GAAGCAAATA TTTATTGACA ACGCTATTCA TAACACGGTT TCTAATCAAT GTCTTAACCT 7260 

ACCTTTATAT ATTTTATGTA TCCAATGATk GTCTATCCCC TACATTCTTT GCCAAAAAAA 7320 

GTAT^TAATG TAGAAGATAT TTTCTTTTTC ACTTTCAAAT TTAAGACTAC AATTGAACAG 7380 

TGATTTTTCA TCATTATAAC AGACAACTAG ACATATTGAT AAGTAAAGAA AAGAACTTTA 7440

TACGGAGGTA CCTTGCATGA CAAATCCAAA TCAACGATTA GAACCATTTG ATGA6ACATT 7500 

TCAACAACCG AATATTCATC GTGGTAAGCG ATATGGTAAG AAAAAACX3TT CATTGGTAAG 7560 

CATGATTATT CAAATCATTG TTGTwATATT AACCACCATC GCTGGAATAC AGCATGGTGG 7620 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 9834 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS : double

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

GTCATtACCG amTTTCtTAG AaTCATTTAA AGATGATAAA TATACAAACG TTGGTAATTT 60 

AAAAGAAGTG AATTTTGATA AAATTGCTOC GACGAAACCC GAAGTAATCT TTATCTCTGG 120

ACGTACAGCT AATCAAAAGA ATTTAGATGA ATTCAAAAAA GCTGCACCTA AAGCGAAAAT 180 

TGTTTATGTT GGTGCAGATG AAAAGAACTT AATTGGTTCA ATGAAACAAA ACACTGAAAA 240

TATCGGAAAA ATTTACGATA AAGAAGATAA AGCTAAAGAA TTAAATAAAG ATTTAGATAA 300 

CAAAATTGCT TCAATGAAAG ATAAAACGAA AAACTTCAAT AAAACTGTTA TGTATTTACT 360 

AGTTAACGAA GGTGAATTAT CAACATTTGG ACCTAAAGGT CGTTTTGGTG GATTAQTTTA 420 

CGATACATTA GGATTCAATG CAGTTGATAA AAAAGTAAGT AATAGCAATC ATGGACAAAA 480 

TGTTTCTAAC 6AATATGTTA ATAAAGAAAA TCCAGATGTT ATTTTAGCGA TGGATAGAGG 540 

TCAAGCGATA AGTCGTAAAT CAACTGCGAA ACAAGCATTA AATAATCCTG TATTAAAAAA 600

TGTTAAAGCA ATTAAAGAAG ACAAAGTATA TAATTTAGAT CCTAAATTAT GGTACTTTGC 660 

AGCTGGATCA ACTACAACTA CAATTAAACA AATTGAGGAA CTTGATAAAG TTGTAAAATA 720 

ATTTTAAAAG AGGGGAACAA TGGTTAAAGG TCTTAATCAT TGCTCCCCTC TTTTCTTTAA 780

AAAAGGAAAT CTGGGACGTC AATCAATGTC CTAGACTCTA AAATGTTCTG TTGTCAGTCG 840 

TTGGTTGAAT GAACATGTAC TTGTAACAAG TTCATTTCAA TACTAGTGGG CTCCAAACAT 900 

AGAGAAATTT GATTTTCAAT TTCTACTGAC AATGCAAGTT GGCGGGGCCC AAACATAGAG 960

AATTTCAAAA AGGAATTCTA CAGAAGTGGT GCTTTATCAT GTCTGACCCA CTCCCTATAA 1020 

TGTTTTGACT ATGTTGTTTA AATTTCAAAA TAAATATGAT AGTGATATTT ACAGCGATTG 1080 

TTAAACCGAG ATTGGCAATT TGGACAACGC TCTACCATCA TATATTCATT GATTGTTAAT 1140 

TCGTQTTTGC ATACACCGCA TAAGATTGCT TTTTCGTTAA ATGAAGGCTC AGACCAACGC 1200 

TTAATGGCGT GCTTTTCAAA CTCATTATGG CACTTATAGC ATGGATAGTA TTTATTACAA 1260 

CATTTAAATT TAATAGCAAT AATATCTTCT TCGGTAAAAT AATGGCGACA SCgTGTTTCA 1320 

GTATCGATTA ATGAACCATA AACTTTAGGC ATAGACAAAG CTCCTTAACT TACGATTCCT 1380 

TTGGATGTTC ACCAATAATG CGAACTTCAC GATTTAATTC AATGCCAAAT TTTTCTTTGA 1440

CGQTCTTTTG TACATAATGA ATAAGGTTTT CATAATCTGT AGCAGTTCCA TTGTCTACAT 1500 

TTACCATAAA ACCAGCGTGT TTGGTTGAAA CTTCAACGCC GCCAATACGG TGACCTTGCA 1560 

AATTAGAATC TTGTATCAAT TTACCTGCAA AATGACCAGG CGGTCTTTGG AATACACTAC 1620 

CACATGAAGG ATACTCTAAA GGTTGTTTAG ATTCTCTACG TTCTGTTAAA TCATCCATTT 1680 


EP 0 786 519 A2 





AGTGTTCTTT TTGAATAATG CTATTAOGAT AATCTAACTC TAATTCTTTT GTTGTAAGTT 1800

TAATTAACGA GCCTTGTTCG TTTACGCAAA GCGCATAGTC TATACAATCT TTAACTTCGC 1860

CACCATAAGC GCCAGCATTC ATATACACTG CACCACCAAT TGAACCTGGA ATACCACATG 1920

CAAATTCAAG GCCAGTAAGT GCGTAATCAC GAGCAACACG TGAGACATCA ATAATTGCAG 1980

CGCCGCTACC GGCTATTATC GCATCATCAG ATACTTCGAT ATGATCTAGT GATAATAAAC 2040

TAATTACAAT ACCGCGAATA CCACCTTCAC GGATAATAAT ATTTGAGCCA TTTCCTAAAT 2100

ATGTAACAGG AATCTCATTT TGaTAGGCAT ATTTAACAAC TGCTTGTACT TCTTCATTTT 2160

TAGTAGGGGT AATGTAAAAG TCGGCATTAC CACCTGTTTT AGTATAAGTG TATCGTTTTA 2220

AAGGTTCATC AACTTTAATT TTTTCATTTG GGATAAGTTG TTGTAAAGCT TGATAGATGT 2280

CTTTATTTAT CACTTCTCAG TACATCCTTT CTCATGTCTT TAATATCATA TAGTATTATA 2340

CCAATTTTAA AATTCATTTG CGAAAATTGA AAAGAAAGTA TTAGAATTAG TATAATTATA 2400

AAATACGGCA TTATTGTCGT TATAAGTATT TTTTACATAG TTTTTCAAAG TATTGTTGCT 2460

TTTGCATCTC ATATTGTCTA ATTGTTAAGC TATGTTGCAA TATTTGGTGT TTTTTTGTAT 2520

TGAATTGCAA AGCAATATCA TCATTAGTTG ATAAGAGGTA ATCAAGTGCA AGATAAGATT 2580

CAAATGTTTG GGTATTCATT TGAATGATAT GTAGACGCAC CTGTTGTTTT AGTTCATGAA 2640

AATTGTTAAA CTTCGCCATC ATAACTTTCT TAGTATATTT ATGATGCAAA CGATAAAACC 2700

CTACATAATT TAAGCGTTTT TCATCTAAGG ATGTAATATC ATGCAAATTT TCTACACCTA 2760

CTAAAATATC TAAAATTGGC TCTGTTGAAT ATTTAAAATG aTGctACCXSC CAATATGTTT 2820

TGTATATTTT ACTGGGCTGT CTAAGAGGTT GAATAATAAT GATTCAATTT CAGTGTATTG 2880

TGATTGAAAA CAATTAGTTA AATCACTATT AATGAATGGT TGAACATTTG AATACATGAT 2940

AAACTcCTTT GATATTGAAA ATTAATTTAA TCACGATAAA GTCTGGAATA CTATAACATA 3000

ATTCATTTTC ATAATAAACA TGTTITTGTA TAATGAATCT GTTAAGGAGT GCAATCATGA 3060

AAAAAATTGT TATTATCGCT GTTTTAGCGA TTTTATTTGT AGTAATAAGT GCTTGTGGTA 3120

ATAAAGAAAA AGAGGCACAA CATCAATTTA CTAAGCAATT TAAAGATGTT GAGCAAAAAC 3180

AAAAAGAATT ACAACATGTC ATGGATAATA TACATTTGAA AGAAATTGAT CATCTAAGTA 3240

AAACTGATAC AACTGATAAA AATAGTAAAG AATTTAAGGC ACTACAAGAA GATGTTAAAA 3300

ACCATCTCAT ACCTAAATTT GAAGCATATT ATAAGTCAGC AAAAAATTTG CCTGATGATA 3360

CAATGAAAGT TAAGAAATTA AAAAAAGAAT ATATGACGCT TGCAAATGAG AAGAAGGATG 3420

CGATATATCA ATTAAAAAAA TTCATAGGTT TATGTAATCA ATCTATCAAG TATAACGAAG 3480

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AATTAGCTGA TAATAAAAGT GAAGCAACTA ATCTTACGAC AAAATTAGAA CATAATAATA 3600 

AAGCGTTAAG AGATACT6CG AAGAAGAACC TAGATGATAG TAAAGAAAAT GAAGTAAAAG 3660 

GCGCGATTAA AAATCACATT ATGCCAATGA TTGAAAAGCA AATTACCX5AT ATTAACCAAA 3720 

CTAATATTAG TGATAAGCAT GTTAATAATG CAAGGAAAAA CGCAATAGAA ATGTATTACA 3780 

GTCTGCAGAA CTATTATAAT ACACGTATTG AAACAATAAA GGTTAGTGAG AAGTTATCAm 3840 

AAGTCGATGT AGATAAGTTG CCGAAAAAGG GTATAGATAT AACTCACGGC GATAAAGCCT 3900

TTGAAAAAAA GCTTGAAAAA TTAGAAGAAA AATAACTATA ATCATTTTTC AAAGTTAAAA 3960 

ATTTTGAATT TATGGTTAAC ATGTCAACTT ACTATGTGTA TAATGGTAAA CATTGATATT 4020 

AACTATATGT ATAAAAATGT CACGCAGATG CTATTTAAAT GTGATAAATA TTTTTAGAGG 4080 

TGAATAGAGT GGCTATAAAG CTAAGTTCAA TTGACCAATT TGAACAGGTT ATTGAGGAAA 4140 

ATAAATATGT TTTTGTATTA AAACATAGTG AAACTTGTCC AATATCGGCA AATGCGTACG 4200

ATCAATTTAA TAAATTTTTA TATGAACX5CG ATATGGACX5G TTATTATTTG ATTGTCCAAC 4260 

AAGAACGCGA TTTGTCAGAT TATATTGCTA AAAAAACGAA CGTTAAACAT GAATCACCTC 4320 

AAGCATTTTA TTTTGTAAAT GGTGAAATGG TTTGGAATCG AGACCACGGT GATATCAATG 4380

TGTCGTCATT AGCACAAGCA GAAGAATAAT GAAACTATAG GGTTGGAACA TTTTGCCTTA 4440 

CACTACTAGA CGTGAATAGC ACAACTTAAA TTCGTGTGAA TCAGAGTAGT TTGGCTATAA 4500 

TGATGTTCTG ACCTTTTATT TTATGTCACC TTTAGAAGCA GTTAAGTTAG TACTTTTTTA 4560 

CAAACATATG TATAATATAT TCXSAGTATTT TTATTGAAAa tATTTTGGAA AACGACGAAT 4620 

CCAATAAGAA AATTTAAACA TGATTTGTAA GTTAGTTTAA TAGGAAATAT ATGCTAAACC 4680 

AAAAGAAGCA TATTGTTATT TACTGGAATA ATTAATAATC ATGTCATGTT AAATGTTAGC 4740

ATATAATCAC GAGATAAAAT CTAAAATTTA AGATTAATCT TTTATGAATA AAAAACGTAT 4800 

CACAACAAAT AATAAAGTAA GGTGGTCAAG GTTATGAAAG TATTAGTAGC CATGGATGAG 4860 

TTTCATGGAA TTATTTCAAG TTATCAAGCT AATAGATATG TTGAAGAGGC AGTTGCAAGC 4920 

CAAATTGAAA CTGCAGATGT AGTTCAAGTA CCATTGTTTA ATGGAAGACA TGAATTATTA 4980 

GATTCTGTAT TTTTATGGcm ATCTGGGcaA AAGTATCGTA TACCAGTACA TGATGCAGAT 5040

ATGAATGAAG TTGAAGGTGT TTACGGACAA ACTGATACAG GGATGACCX3T TATCGAGGGG 5100 

AATTTATTTT TAAAAGGTAA AAAACCAATT GTTGAACGAA CAAGTTATGG TTTAGGAGAA 5160 

ATGATTAAAC ATGCATTAGA TAACGACGCA AAACATGTTG TAATTTCACT AGGTGGGATT 5220 

GATAGTTTTG ATGCTX3GTGC AGGTATGTTA CAAGCATTAG GTGCTCAATT CTATGATGAC 5280 
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GATATGTCGA ACTTACACCC TAAAATGGAA 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ACAGCAAGAA TTCAAGTAAT 6TCGGATTTT 5400

TCAAGTCGAT TATATGGTAA 6CAAAGTGAA ATCATGCAAA CTTATGATGC GCATCAGTTG 5460

AATCATAATC AAGCAGCAGA AATCGATAAT TTAATTTGGT ATTTTAGTGA GTTATTTAAA 5520

AGTGAATTGA AAATTGCAAT TGGTCCAGTT GAACGTGGTG GTGCTGGTGG TGGAATTGCA 5580

GCAGTCTTGA ATGGACTGTA TCAAGCTGAA ATATTAACCA GTCATGCATT AGTAGACCAA 5640

CTAACACATT TAGAAAATTT AGTTGAACAA GCGGATTTAA TTATTTTTGG AGAAGGATTA 5700

AATGAAAATG ATCAGTTGCT AGAAACGACA ACATTGCGTA TTGCAGAACT TTGTCATAAA 5760

CATCAAAAGG TTGCCATTGC AATTTGTGCA ACTGCTGAAA AGTTTGATTT ATTTGAATCA 5820

CAAGGGGTTA CAGCAATGTT TAATACATTT ATCGATATGC CAGAAACTTA TACTGACTTT 5880

AAAATGGGtT ACAAATTAGG CATTATACGG TTCAGTCTTT AAAACTGTTG AAAACACATT 5940

TTAATGTTGA GGTTTAGTAA AGAAGGACTA AATTGGTGAT OCTGTCATQA TGGTTAATAA 6000

CATTTATGAT GGTTAGCAAA ACGAATTAGA AGATCXSAAAG TATACGTAAA AAATATGAAA 6060

AATCACGCTA TCATTGCACT GAATGTTAGC GTGATTTTTA TATATTAATT AAGCCTGAGT 6120

TGAACTAGTA TATAATCGTT GGTTTTTAGT GATTTTCAGC 6ATATCTTCT ACAATTCCAA 6180

TGATTACTTG TACTGCTTTT TCCaTAACAT CAATGGATGC aTATTCATAT GGGCCGTGGA 6240

AGTTACCGCA ACCTGTAAAG ATGTTTGGAG TTGGTAACCC CATAAATGAC AATTGTX5AAC 6300

CATCTGTACC ACCGCGAATA GGTTCAGTGT TTGCTGGAAT ATCTAATTTG GCAAAGACAC 6360

GTTTAGGTAT ATCAATAATA TGAGGCAATG GTAATATTTT TTCTGCCATA TTGAAATATT 6420

GATCCGATAT ATCAACTTTA ACTGGATAAT TTTCAAAATG GGCATTGATA TCGTCACGTA 6480

tttctaaaat acx3tttctta cgcaattcga ATTGTTTTTT ATCATGATCA CGAATAATGT 6540

attgEaaagt TGCTTTTTCA ACAGTTCCTT CAAAGTTCAT TAAGTGATAA AAGCCTTCGT 6600

ATCCTTCTGT TCGCTCCGGA ACTTCACTAT CAGGTAGCAA ACTATCGAAT TGTTCACCTA 6660

AACGTATTGC GTTTACCATT GCATTTTTAG CTGAACCAGG ATGAACATTT ACACCGTGGC 6720

ATGTAATAAC CGCTTCAGCA GCGTTAAAGC TTTCATATTG TAATTCTCCA TATTGACTAC 6780

CATCCATAGT ATAAGCAAAA TCAGCATTGA AGCGGTCAAC ATCAAATTTA TGTGGACCAC 6840

GACCGATTTC TTCX5TCTGGT GTAAATCCAA TGCGAATGGT ACCATGTTTA ATTTCTGGAT 6900

GTTCTTGTAA ATAACAAATA GCTTCCATAA TTTCCACAAT ACCCGCTTTA TCX5TCTGCAC 6960

CTAGTAACGA TGTACCATCA GTTACCATTA ATGTATGACC AACTAAACTG TTAAGTTCTG 7020

GAAATACTTT AGGATCTAAG ACACGTTTAG TATTGCCTAG TTTGTATGGC TTACCATCAT 7080
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10 



GCGCCAAAAA TCCAACTGTT QGGACGTCGA CATCGATGTT ACTTTCTAAT GTAGCAAATA 7200 

AGTAGCCATT TTCATCTAAA TCAGTTGGCA ATCCTAATTG TTGTAATTCT TTTTCTAATA 7260 

AATGTAACAA ATCCCATTGC TTTTCAGTTG AAGGTGTTGT TGTAGATTTT GGATCAGATT 7320 

GCGTATCAAT TGTCGTATAT CTTGTTAATC TATCTATCAA TTGGTTCTTC ATTATATTCG 7380 

ACCCCTTAAA CTCTATTATT CATGTTGTAA GATTTTTTAT ATGTCTTACC TTTGATTTTA 7440 

CCATACAGTT GTTTGATACG TGTGTATAGG TAATATAGAA TTTCAGAAAC TAATATACCG 7500 

AAAGCAATCG CACCTGAAAT CAGTGTAcTT CTAAAAATGT ATTTACAGCA CTTGTATAAT 7560 

CATTTGATAC TAAAAAACGA GTCGCTTGAT AAGCTGCACC ACCAGGTACT AATGGTATAA 7620 

TGCCTGGCAC TATGAATATA AITACCGGTC GTTTATATCT GCGACTCATA GTATGACTCA 7680 

TTAAGCCTAA AATTAAGCTT CCCAAAAATG AAGCGCCAAC TTTTCCAAAC TCTAAATCTA 7740 

CCGTTAATTG GTAAATCGTC CATGCAATGG CACCCACAAA TCCACATGCT ACTAAGAGGC 7800 

GTTTGGGTGC ATTGAAAATG ATAGAGAAAA GTACTGTTGA TATAAAGCTG ATTGTAAAAT 7860 

GAAATAAATA AAATAGCATG CTTTAACAGT CCTTCCTTAA ATGATTAATA AAACGATTGC 7920 

25 GACACCAGCA CCGATTGCGA ATGCTGTTAA TGCAGCTTCA ACACCGCX3AG ACATACCTGC 7980 

AAGTAATTCA CCCGCTAATA AATCTCGAAT GGCATTGGTA ATTAATATAC CAGGGACAAG 8040 

TGGCATGACA CTGGCTATAG TAATGATATC TTGATTGGTT GCAATGCCTA ATTTAGTAAA 8100 

TGTGGCTGCA ATGGATATGA CCACAGCGGC TGCAACAAAC TCTGAGAAAA ATTTAATTTG 8160 

TATATAGCGT tGCACAAAQC TGAATGTTAA AAAT6CGGAT CCGCCAGCAA TGACTGCAAT 8220 

CCAACAATCT GATGCGACAC CACCAAACAT AAATAGGAAG AAGCCACAT6 CAATGGCAGC 8280 

TGCAAAGAAA TTCGTTAAAA AAGAATATTG TAATGATGCA TGCTGTAAAT GAATAAATTC 8340 

AGATTTAGCT TCATCAATTG TGAGTTCTTT ATTTGATATT TTACGTGAAA GACTATTCGT 8400 

TAAAGCGATT TTCTCTAAAT CTGTTGTACG CTCTTGTACA CXrAATTAATC TTGTACTTGT 8460 

TCGATCGTTT AATGAAAAAA TAATTGCAGT TGAACTGACA AAACTATATG TATTATGAAG 8520 

ACCATAACTA TGTGCGATAC GGTTCATTGT ATCTTCAACT CGATATGTTT CAGCACCTGA 8580

TTCaAGTAAA ATTCTACCTG CAATTAATAC AACATCAATC ACTTTGTTTT CATCTATAAT 8640

TGTGATTGAA TCTGGCATAT CAATTCACCT CCAATGATAT GTGTTATTTA TTTGAACAAT 8700 

TGaAGTTTAC AACTTGTTGT TACAACTTTC AATAGTGAGA CTTTGTGTTA GTATGATGAA 8760 

CTTGTATGGT TCAAATTTAA ATAAGAAAAA CTGTTAATCT ITGCTATTAT ACTATGATTT 8820

AATAATAGCA AAGGATTAAC AGTTTTGTCG TTGTTATAAA TTGATAATAG GGTTAAACAT 8880 




TTTACGCTGT GATTTTGGAT CGTCATCTGT TAAATAACCA ACACCGATAG ACACTGACAA 9000

TTTAATAACT TCTTTGTTTG GTAAATGGAA TGATGATTTT TCAACACCCG AACGAATATT 9060

TTCAGCTAAT TTAACACTTT GATCAAGTGA ATAATTGTGA ATGACAACTG AGAACTCTTC 9120

GCCACCATTT CTAAAAATTT TAAATTGATT CGGCACATAG TTTTTAAGTA ATTGAGACAT 9180

TTGTTTTAAT ACAGCATCAC CTGATTTGTG TGAGTAGGTA TCATTGaCAT CTTTAAATCC 9240

ATCGATATCX5 ATTAATAATA ATGCGATACT TTGATGTTCT TTTTCAGCTT TTCGTGAAAT 9300

TTCATTTAAA TGTCTATCAA ATTCTTTTAC ATTACCTAAG CCTGTTAAGT AATCATATTT 9360

ATCTTCGTTT TCATAACGAT TTACX3AGTGA GAAGAAATGC CAAATATCGA CAAATGTTAT 9420

CX3CTGAAGCT AAAGTGATAA TTAATGAAAT TGGTATTAAA AT6ATAACTT CCGATAGTGT 9480

GTAAATAGGA CTCACTAACG CGACACCAAA TAAAATGATT ATTGTAACAA CATTAAGTAT 9540

TAATAAT6AT AGCACATCAT TTTGTTTTAA AAATGGTCCA ATAGCACTTG TTACTGCAGC 9600

AATAACAATC AACGTAACAC CGTACATAAT CGAGTTGTTA AATACTACAA TTTCAACAAT 9660

TGCTACAATT ACTGTGGCAG ATAATGTATA GACCATATTT GTAAATCTAC CTAAAAACAA 9720

TAAAQGAACG AATGTTAAGT GAATTAAATA ATCTTCACGA TAAGGGATAG GGTAGACAGA 9780

TAATAATAAT GATACGATTG TCATTAAAAC AGTGACATAA GCCTTAGAAA AAAC 9834



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 23439 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS : double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38:

TCTCAATCAG ATGAAAAATT GCATATCGTA GGTTTTACAG AAAGTGCAAA ATATAATGCG 60

TCATCAGTCA TTTTCACGAA TGACGCTACC ATTGCCAAGA TCAATCCTAG ATTGACTGGA 120

GATAAAATTA ATGCAGTTGT TGTACGTGAT ACAAATTGGA AAGACAAAAA ATTAAACCAA 180

GAGCTTGAAG CGGTAAGTAT TAATGACTTT ATTGAAAATT TACCAGGTTA TAAACCACAG 240

AACTTAACAT TAAACTTTAT GATTTCATTC TTATTTGTCA TTTCAGCTAC AGTTATAGGC 300

ATTTTCCTAT ATGTCATGAC ATTACAAAAG ACGAGTTTAT TTGGCATATT AAAAGCTCAA 360

GGATTTACGA ATGGCTATTT GGCGAATGTG GTAATTTCGC AGACGGTCAT ATTAGCACTA 420

TTTGGTACGG CATTTGGCTT ACTGTTAACA GGCGTTACAG GTGCATTTTT ACCTGATGCA 480 

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TCTGTATTAG GAAGTTTATT CTCCATTTTA ACAATTAGAA AAATAGATCC GTTAAAGGCG 600

ATTGGGTAGG AGGTGTAGCA AATGTTGAAA TTTGAAAATG TAACAAA6TC ATTTAAAGAT 660

OGGAATCGTA ACATTGAAGC GGTTAAAGAT ACAAATTTTG AGATAAATAA AGGTGATATT 720

ATAGCATTGG TTGGACCTTC TGGCTCTGGT AAAAGTACAT TTCTAACTAT GGCAGGTGCT 780

TTACAAACAC CGACATCTGG GCACATTTTA ATCAATAACC AAGATATTAC GACAATGAAG 840

CAAAAAGCAT TGGCAAAAGT TAGAATGTCT GAAATAGGTT TTATTTTACA AGCTACAAAC 900

CTTGTACCAT TTTTAACXSGT AAAGCAACAA TTTACATTAT TGAAAAAGAA AAATAAGAAT 960

GTTATGTCTA ATGAAGACTA TCAGCAACTT ATGTCACAAT TAGGTCTAAC TTCATTGCTT 1020

AATAAGTTAC CTTCAGAAAT TTCAGGTGGT CAGAAACAAC GTGTGGCGAT AgCaAAGCGT 1080

TATATACGAA TCCGTCGATT ATTTTAGCX3G ATGAACCTAC CGCGGCGTTA GATACTGAAA 1140

ATGCGATTGA AGTCATTAAA ATTCTACGTG ATCAAGCCAA ACAAAGAAAG AAAGCATGTA 1200

TTATTGTTAC ACATGATGAA CGACTTAAAG CATATTGTOA TCGTTCATAT CATATGAAAG 1260

ATGGCGTCCT TAATCTTGAA AATGAAACAG TAGAATAGTT TTATTAAGCC GGTACATCAT 1320

GTGCCGGTAT TTTTATGTTT ATGTATTATT TGAATAAACT TTCACATTCA ATTAATAATA 1380

ATTATTATCG AAAATCAGAA ATATTCCGTG AAATATAATA TTTTTTGTAG TAAAATGGCC 1440

TCTAAGTATT CAATATTTAA ATATGGGGAT TGAATATAAA ATTATCGTAA TGGGGGTCAA 1500

TGGTTATGGA TTTATTGATA GGTACTTTAT TTTTATTTTT GGTCTTAGTG ATTTTTACAT 1560

TATTTACATA TAAAGCGCCT AATGGTATGC GTGCCATGGG AGCATTAGCT AATGCAGCAA 1620

TCGCAACATT TTTAGTGGAA GCATTTAATA AATATGTTGG TGGCGAAGTA TTCGGTATTA 1680

AATTTTTAGA AGAGCTAGGA GACGCTGCGG GAGGTCTAGG TGGTGTCGCT GCCGCTGGAT 1740

TAAC3LGCATT AGCTATCGGT GTGTCACCAG TATATGCATT AGTTATAGCA GCCGCGTGCG 1800

GTGGTATGGA TTTATTACCA GGTTTCTTTG CGGGTTATAT GATTGGATAT GTGATGAAAT 1860

ATACAGAGAA ATATGTGCCG GATGGTGTCG ACTTAATTGG ATCGATTGTC ATCTTAGCGC 1920

CATTAGCTCG TCTTATTGCA GTATTATTAA CGCCAGTAGT GAATAGTACA TTGATTCGAA 1980

TTGGTGATAT TATCCAAAGT AGTACGAATA CGAATCCAAT TATCATGGGT ATCATTTTAG 2040

GTGGTATTAT TACGGTTGTC GGCACAGCGC CATTGAGTTC AATGGCATTG ACAGCATTAT 2100

TAGGTTTAAC GGGTGTACCT ATGGCTATTG GTGCCATGGC AGCATTTAGT TCGGCATTTA 2160

TGAATGGGAC GCTATTCCAT CGCTTAAAAT TAGGTGATCG TAAGTCTACG ATTGCAGTAA 2220

GTATTGAACC TTTATCACAA GCAGATATTG TATCAGCCAA TCCAATTCCA ATCTATATTA 2280
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GCCATtCCAG CAACCATAGC CATTATAGAT GGCAAAATTA AAATTGGTTT A6AAAGCGAA 4200 

GATTTAGAAA TACTGGCAAC TAGTAAAGAC GTTGCTAAAG TATCTAGAAG GGATTTAGCA 4260 

GAAGTTATTG CGATGAAGTG TGTTGGTGCT ACTACTGTAG CGACGACGAT GATATGTGCT 4320 

GCAATGGCTG GTATTCAATT TTTTGTTACA GGAGGTATTG GGGGCGTCCA TAAAGGTGCA 4380 

GAACATACGA TGGACATTTC AGCAGACTTA GAAGAACTGT CTAAAACAAA TGTCACTGTT 4440 

ATCTGTGCAG GTGCCAAATC AATTTTAGAC TTACCTAAGA CGATGGAGTA TTTAGAAACA 4500 

AAAGGCGTTC CAGTTATTGG ATATCAAACG AATGAATTGC CAGCATTCTT CACTCGCGAA 4560 

AGCGGTGTTA AGTTAACAAG TTCGGTTGAA ACGCCAGAAC QACTTGCTQA CATTCATTTA 4620 

ACAAAACAGC AGTTAAATCT TGAAGGTGGC ATTGTTGTTG CTAATCCAAT TCCATATGAG 4680 

CATGCCTTAT CAAAAGCATA TATTGAGGCA ATCATAAATG AAGCTQTTGT TGAAGCGGAA 4740 

AATCAAQGTA TTAAAGGTAA GGACOCCACA CCGTTCTTGT TAGGGAAAAT TGTAGAAAAA 4800

AOGAATGGTA AAAGTTTAGC AGCAAATATA AAACTTGTTG AAAACAATGC GGC6TTGGGT 4860 

GCTAAAATTG CTGTCGCTGT TAATAAATTA TTGTAGGTGA TGATACATGA ATATTTTATT 4920 

CGCTATCACA GGGATAGCAT TTGCACTATT TGTrOOGTTT TTATTCAGTT TTGATCOTAA 4980

AAAAATAGAC TTCAAAAAGA CGTTAATAAT GATATTTATT CAAGTGTTGA TCGTGTTATT 5040 

TATGATGAAC ACAACGATTG GTTTGACAAT TTTAACTGCA CTAGGTTCAT TTTTTGAAGG 5100 

GCTAATAAAT ATTAGTAAAG CAGGCATAAA TTTTGTTTTT GGAGATATAC AAAATAAAAA 5160 

TGGCTTTACG TTCTTTTTAA ACGTATTACT GCCATTAGTT TTTATTTCTG TATTAATAGG 5220 

CATCTTTAAT TATATTAAGG TATTACCATT TATTATCAAA TATGTAGGTA TCGCTATTAA 5280 

TAAAATAACT AGAATGGGGC GCTTAGAAAG TTATTTTGCT ATTTCAACAG CAATGTTTGG 5340

GCAftCCAGAA GTATATTTAA CAATAAAAGA TATTATTCCA AGATTATCTA GAGCGAAATT 5400

ATATACAATT GCGACGTCTG GTATGAGTGC TGTTAGTATG GCAATGCTAG GTTCATATAT 5460

GCAGATGATT GAACCCAAGT TCGTAGTTAC AGCAGTAATG TTAAATATTT TTAGTGCGCT 5520 

TATCATCGCC AGTGTAATCA ATCCCTATAA ATCTGATGAT ACTGATGTTG AAATTGATAA 5580 

CTTAACGAAA TCCACAGAAA CTAAAACATT GAATGGAAAA ACAGGAAAAC CTAAGAAAGT 5640

TGCCTTTTTC CAAATGATTG GTGATAGTGC GATGGATGGG TTTAAAATCG CTGTTGTAGT 5700 

AGCCGTAATG TTGTTAGCAT TTATTTCATT AATGGAAGCA ATTAATATCA TGTTTGGTAG 5760 

TGTTGGTTTG AACTTTAAAC AGCTTATTGG CTATGTGTTT GCACCAATCG CATTCTTAAT 5820

GGGGATTCCA TGGAGCGAAC TGTTCCAGCT GGCTCTTTAA TGGCGACTAA ATTAATTACA 5880 
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CAAGGTATCA TTTCAGTTTA CTTAGTAAGc TTCGCTAATT TTGGTACGGT TGOTATCATC 6000 

GTAGGTTCAA TTAAAGGCAT TAGTGATAAA CAAGGAGAAA AAGTTGCATC CTTTGCAATG 6060 

AGGTTGCTAC TTGGTTCAAC TCTAGCTTCA ATCATTTCAG GATCAATCAT TGGCTTAGTA 6120 

TTGTAAATGA ATCGAAGTAC CTAAATTAAA TTCATGGCAA AGCTAAACCC CGTCACCAAG 6180 

TTGGCGCAAC AGCGcATgcA TAACTTAGTG ACGGGGTTTT ATCATAACAA TCTACTTTTT 6240 

CGTAGCCGTT TTTGAAATGT ATGTTGATGG TTTATCTTTT TCAAAAATTG TTAATCCCGT 6300 

TATATCTTTT TTATGTTTTG AAGGGACAAT GAAGCTAAGT ATATAAGCAA AGACAAAAGC 6360 

AACTGTAAAT GAAATGGTAG ATACATAGAA AGGTGAGTTA CCTTTGCCAA CACCATTATA 6420 

GACATAAGCA AAGAT6ATAC CCAATATTAA TCCACAAATA ACACCGAATG TATTCGTACG 6480 

TTTAGTCAAA ATACCAACTO CAAATACACC AGCCAATGGA ACGCCGAATA ATCCAGTCAC 6540 

AAACAAGAAT AAATCCCATA AGTCATTTGA ATTAGAAGCA ATTAAGTATA GTGACATTCC 6600

AAAACCGAAA ATACCTGCAA TGATAATAAT GAAACGTOCA AAGTTAACTT OGTGTCGCTC 6660 

GCTACCTTTT CCGAAGAAGC GTTGCTTAAT GTCGATTGAA ATACAAGCAG ATATAOAATT 6720 

TAAACTAGAT GAAATGGTAG ACTGTGCAGC GGCGAAAATG GCTGCAATAA GTAATCCTGC 6780 

TACAAATGGT GGCATCTCAG TCAAAATGAA ATATGGCACT ACAGATGATG TATTGAAGCC 6840 

TTTTGGTAAA ACAGCTTCAT GTGTATAAAA TGAATACAGC ATTGTACCCA TACCATAAAA 6900 

TAAGGGTGCT GAAATTAAAG CTAGQATACC ATTTGTCCAT AACGATTTAT TTGTTTCTTT 6960 

TAAACTATCA GAAGCTTGAT AACGCTGCAC GACGTCTTGA CTCGCTGTGT ATTGATACAA 7020 

GTTGTTGAAA ATATTTCCTA GGAAAATAAT TGGAATGGCA GCTGCCGCAG TATTTAGTTT 7080 

CCAATTGTCT GCACTAATTA ATTTrTTGTG CTCAATCGCA TCTGCAAAGA CAGTGCCGAA 7140 

ACCGCCTTTA ATGTTCACAA CACCTAGAAT AATAATAACT AAAGCGCCGC CTAATAAAAT 7200 

GACGCCTTGA ATGAAATCAC TCCAAACCAC ACCTTCGAAA CCACCTAAAA ATGTATATAA 7260 

AATACATAGT AAACCAACGA GTGATGCAAC GATATAAGGG TTCATGTCTG ATACAGATGT 7320 

GATTGCTAAT GTTGGTAAGT AGATAACAAT TGCAACACGC CCTAAATGGT AAACGACAAA 7380 

TAATAATGAG CCAATGACAC GTATGCTAGG GCCAAATCTA GCTTCTAAAT ATTCATATGC 7440

AGATGTTACC TTTAACTTTT TAAAGAAAGG GACATAGAAA TAAATAAGTA ATGGAATAAT 7500 

TGCGACGATA GCAATGTTAC CAGCGATATA TGACCAATCT GTTAAAAATG CTTTCTCTGG 7560 

TGTCGACATA AATGTAATCG CACTTAACGT AGTAGCATAA ATTGAAAAGC CAACTACCCA 7620

AGATGGCAAG CGACCACTTG CGGTAAAGAA ACTATTGGTA CTTTGGCTCG CGCGCTTGGT 7680 
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TGTGCCAAAT CCAACTTCTT TCATGGGCAA CATCCCCTTT ACAATGTATT GATTCTTTGA 7800 

TGTCTATAAA TCGTATTTTG CAATGAGTTG ATCTAATGTT TGTCGATGTG CTTCGTTAAA 7860 

AGGTTTGAAA GGTCTTTTCG GTAATCCTGC ATCAATGCCA CGATGACGTA ATATTTCTTT 7920 

CAATGTTGGA TAAATCCCCA TTGATAACAC TGTTTCGATA ATGTCGTTTG AATCATGTTG 7980 

CAGTTGGTAA GCTTCTTGAA TTTGACCTTG TCGTGCTAAG TCGAAGATTT TTCTTGCACG 8040

GCGACCATTA ACGTTATATG TAGAACCAAT TGCACCATCT ACGCCAGAAA TCGTAGCTTG 8100 

AACTAACATT TCATCAAAGC CAGATAAGAT TAATTTGTCT GGGAATGCTT TTCTAATACG 8160 

TTCGAGTAGG AAGAAGTTTG GCGCTGTATA TTTAACACCA ACAATTTTTT CATGATTAAA 8220 

TAGCTCGCTG AATTGTTCAA TAGAAATATT CACACCTGTT AAATCTGGTA TTGCATAAAT 8280 

AATCATATTG TTCTGAGTTG CTTCGATAAT ATCGAAATAG TAATCTCTAA TTTCTTCAAA 8340 

AGTAAATGGA TAGTAGAATG GTGTTACGGC AGAAA6TGCA TCATAACCGA GTTCTGTGGC 8400 

ATATTTTCCA AGTTCAATGG CTTCATTTAA ATCTAACGAA CCTACTTGAG CAATCAATTT 8460 

CACTTTATCC CCAACTGCCT CTTTGGCAAC CTTGAAAACT TGCTTCTTCT GCTCTGTATT 8520 

TAATAAAAAG TTTTCGCCTG AGCTACCATT TACATAAAGA CCGTCTAATT CTTCAGTTTC 8580

AATGGCATTT TGAGCAATTT GTTTAAGTCC TTGTTCATTT ACTTGACCAT TTTCATCAAA 8640 

AGGAACGAGT AACGCTGCAT ATAAACCTTT TAAATCTTTG TTCATTATGA AGTCCCTCCA 8700 

AAAATCATTT GATAATATAG TTTACAGCTA TAATTGTAAA CGCTATCATA AAATGTAACA 8760

ATATCTTTTT GAAAATTGTA GTCATATTTA TGTATAATTA ATGAAAATGT TTTTCAAAAT 8820 

CAATAGAAAT GGAGTGAGTA AGGTGTATTA CATCGCAATC GATATTGGAG GCACTCAAAT 8880 

TAAATCGGCA GTTATTGATA AGCAATTGAA TATGTTTGAC TATCAACAAA TATCAACGCC 8940 

GGACAACAAA AGTGAGCTTA TTACTGACAA AGTATATGAG ATTGTAACAG GATATATGAA 9000 

GCAATATCAG TTGATCCAAC CTGTCATAGG TATTTCATCA GCAGGCGTTO TTGATGAACA 9060 

AAAAGGCGAA ATTGTATACG CAGGGCCAAC CATTCCGAAT TATAAAGGTA CTAATTTTAA 9120 

GCGATTATTA AAATCACTGT CTCCTTATGT CAAAGTAAAA AATGATGTAA ACGCTGCATT 9180 

ACTAGGCGAA TTGAAATTAC ATCAATATCA AGCAGAACGG ATCTTTTGTA TGACGCTTGG 9240 

TACAGGCATT GGGGGTGCGT ACAAGAATAA TCAAGGTCAT ATTGATAATG GTGAGCTTCA 9300 

TAAGGCAAAT GAAGTTGGGT ATTTATTGTA TCGTCCAACT GAAAATACAA CGTTTGAGCA 9360 

ACGTGCTGCA ACGAGTGCAT TGAAAAAGCG CATGATTGCC GGAGGATTTA CGAGAAGCAC 9420

ACATGTGCCA GTATTGTTTG AAGCAGCTGA AGAAGGTGAT GATATTGCAA AACAAATATT 9480 
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40 



AGGGCTTATA TTAATTGGGG GCGGTATATC TGAACAAGGA GATAATCTCA TTAAATATAT 9600

CGAGCCGAAA GTTGCACACT ATTTACCAAA AGACTATGTT TATGCACCAA TACAAACGAC 9660

TAAGAGTAAA AATGATGCAG CATTATATGG CTGTTTGCAA TGATAGTTGA AAGAAGGAGT 9720

CATTCTAAAA TAGAATTTGA AACCGTTACG AGAGATGAGA GCTGTTGTTA GTTCCACACA 9780

TCACACTCTA TCTAGGACCA ATCTAAACTA TATCAACCAA CAGTGTGCCA CGGGCAAATT 9840

AAATTGAAGA AGCTQAGATA TTAAAATTTT AGAAAATGTA AAAAAATATT TGGTATTGAA 9900

ATTAAAAAA6 CACCTAGCAA CTCGTTGGGA CAATCACGAT GATTGTCTAC AGTTGCAGGT 9960

GGATTTGAAT ATACTACTAG TTATTTGTTG TCTAGGATAA TAGATTTAGT ATGTTGATAA 10020

GTTTGACTCA GATTCGTATT TTCTAATAAA TGATAACTCA C6ATATCGAT TAAAAAGAGT 10080

GTCGCAATTT GTGTGTTGAT AAATTGATGG TOGGTATTAC GCQATTGATC CGTTGTTAAA 10140

AGTACTAAAT CTGCACAATC TGTAAGTTTA CTACCTTCAA AATTTCTGAT GGCAACX^ACA 10200

TATGCACCAT GAGATTTGGC GACTTCC6CT GCAGAAATTA ATTCCGAA6T ATTACCACTA 10260

TTTGACATAG CAATAAACAT ATCCGAATGA GATAGTAGGG ATGCCGATAT TTTCATTAAA 10320

TGTGAATCGG TAGTAACATT ACCTTTTAGC CCCATACGAA TCATACGATA ATAAAATTCA 10380

GTCGCTGATA AACCAGAGCT ACCTAGTCCA GCAAAGAGTA TATGTCGACT TGATTGAAGT 10440

TTGTCGATAA AGGTTTGGAT AATGTCGTTA TCAATAAATT CACCAGTTTG TTGAATGATT 10500

TGTTGATGAT ATTTATGAAT TCTTTGAATA ATTGGGCTAT TTTCAATAAC TGTCTCTGTC 10560

ATTTCrrGTT GAATATTAAA TTTTAAATCT TGGAAATTCT CATAATCCAG CTTATGACTA 10620

AAGCGTGTCA TCGTTGCTGG TGATGTACCA ATCGCATGGG CTAAGGAGTT AATCGTTGAA 10680

AAGGCATC6C TATAACCATT TTGTCTTATA TAATTGACGA TGCGTTTATC AGTTTTTGTA 10740

AATAAATGTT GATAACGTTG AACACGATTC TCAAATTTCA TTGTGTCACC CCTTCATCTT 10800

AATGATTACT ATTATATATG AAAAATATTT TCAAGATAGT AAAAAGCATT GATAAAAATT 10860

ATCTTAATGA TATATTGTAA ATGACTTTAC GTGAAAAAAC GACTTATGGA GTGAGGAATA 10920

ATGTTACCAC ATGGATTAAT AGTATCTTGT CAGGCACTAC CAGATGAACC ATTGCATTCA 10980

TCTTTTATTA TGTCGAAAAT GGCATTAGCT GCGTATGAAG GTGGTGCTGT TGGTATTCGC 11040

GCAAATACTA AGGAAGACAT TTTAGCAATT AAAGAAACGG TAGATTTACC AGTTATTGGC 11100

ATTGTGAAAC GTGACTATGA TCACTCAGAT GTTTTCATTA CTGCAACGTC AAAAGAAGTT 11160

GATGAACTGA TAGAAAGCCA ATGTGAAGTC ATTGCATTGG ATGCAACGTT ACAGCAACGT 11220

CCGAAAGAAA CGTTAGACGA ATTAGTATCA TATATTAGAA CACATGCACC GAACGTTGAA 11280
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TATATTGGCA CGACGTTACA TGGCTATACT AGTTATACGC AAGGACAATT ACTTTATCAA 11400 

AATGACTTCC AATTTTTAAA AGATGTACTA CAAAGTGTTG ATGCAAAAGT TATTGCGGAA 11460 

GGTAATGTCA TTACACCGGA TATGTATAAA CGTGTGATGG ACTTAGGCGT TCATTGTTCA 11520

GTCGTTGGTG GTGCGATAAC ACGACCAAAA GAAATTACGA AACGTTTTGT TCAAATTATG 11580 

GAAGATTAAA TGATAACGAT AAAAAAACGA GATGACCATC ATTAATTAAA GGCACCTAAT 11640

TATCTTAGGT GGCTGAATGA ATGTAATGGG TTCATCTCGT TTTGTTTGTT TATGATAGTG 11700 

ATTTTATTTT CAACTTTATC CAAAAATAAG TAAAGCGACG GGGATGGTGA TTAATAGCGA 11760 

CAACGCCACG CGTAAAAACC AAATGATGAT GAGTITCCAG ACAGGTATTT TAATTTCAGT 11820 

TGCTAGTATA CATGGCACTA ATGCTGAGAA AAAGATAATG GCTGATACGC TTACTACACC 11880 

GACGACAAAT TTAGTACTCA TTGCAGCTTT AGTTACTAAC AAAGATGGTA GAAACATCTC 11940 

TACAATAGAA AckCTGACGC TTTTGCTAGT AAAGCCTGAT CAGCAATTGG GAAAATATAA 12000 

ATAAATGGAT AGAAGATATA GCCAAGCCAA TCAATGAATG GTGTATAGTT CGCTACAATC 12060 

AGTCCTAAAA AACCAATCGA TAATATAGAA GGTAAAATAC CAACAGTCAT TTCTAAACCG 12120 

TCTTTCAAAT TGTCCCAAAC GTTCTTCACXS AGAGATGGTG TTAATGCATT TTGTTTCATC 12180

GCCTCTGCAT ATGCAGTTTT CAGTCTGCTT CCTTCAATAG CAACTTCTTG TTCTCCTTCT 12240 

TGTCCGTTAT AATATTCTGT TGATTCATTG CTGATTGGCG GTAGCCATGC AGTAATT6CA 12300 

GTCACGACAA ATGTGATGAC TAAAGTTATC CAAAAGTATA AATTCCAATG CGGCATTAAT 12360

CCTAAAGTTT TAGCAACGAT AATCATAAAA GTTGCTGAAA CTGTTGAAAA GCCAGTCGCA 12420 

ATAATCGTGG CTTCTCGTTT GTTGTACATC CCTTGCTTAT AGACACGATT AGTAATCAAT 12480 

AATCCTAAGG AATAACTGCC GACAAACGAA GCCACTGCAT CGACAGCXiGA TTTTCCTGGT 12540 

GimAAAAA TAGGTCTCAT AATAGGCTCC ATATAAACAC CGACAAATTC TAATAAGCCA 12600 

TAGCCCACTA ATAAAGAAAG cGcAATTGCA CCTACTGGAA TTAAGATACT TAATGGCATC 12660 

ATTAATTTTT CAAACAAAAA CGGACCATAG TTAGCTTTAA ATAGTATTGA TGGACCGATT 12720 

TTAAATACAT ACATTATACC GATCATTGCA CCTGCAACTT TAAATAATQT AATGACCAAG 12780 

TTTGTGATTG AAGTCATAAA AGTACGTCTC ACTATTGGTA ACGCTGTACC AATTAAAATC 12840

ATAATCAGTG CAACATAGGG CATAAGTGGA CCTATGATTG AGCGAATGGC TAGATGAACA 12900

TGATCGACGA AAATAGTGTT GTTACCATTA ATCGTAAAAG GAATAAAGAA ACATAGTATG 12960

CCCACTAAAC TATAGACAAA AAAACGCCAT GCACTTGGTT GTTGTGCATT AGAATGATAT 13020

TGATTCATTA AAGCAACCCC TTTGTTTAAA TGAATACACA AAACTGTATG ATGCATCTTC 13080 




ATAGTTTGAA TTATTTTCAT ACCAATACAA ATTAACTAAT TATATATAGA TTGAAACTAT 13200

ATTACTTAAT AAAATATTTA TCTTAAATGT TGTTGTGTTG ATTCAACACC ACAACTAAAA 13260

GTGTTTATAA ATTATTTGGA AATACACATA TTTGTAAATG ATTAGTATCG ATTTAATATC 13320

GTATTATTAA ATTTTTATTA ATTTTGTAGT CTTAATCmAA AAATAATATA TGTCATGTTA 13380

TATTGAAC3GT GCAGTTGTTT TTCATTCTCA AGAGGGGGTC AAAAAAATAC TTTTGAGGTG 13440

ATTATATGTT AAGAGGACAA GAAGAAAGAA AGTATAGTAT TAGAAAGTAT TCAATAGGCQ 13500

TGGTGTCAGT GTTAGCGGCT ACAATGTTTG TTGTGTCATC ACATGAAGCA CAAGCCTCGG 13560

AAAAAACATC AACTAATGCA GCGGCACAAA AAGAAACACT AAATCAACCG GGAGAACAAG 13620

GGAATGCGAT AACGTCACAT CAAATGCAGT CAGGAAAGCA ATTAGACGAT ATGCATAAAG 13680

AGAATGGTAA AAGTGGAACA GTGACAQAAG GTAAAGATAC GCTTCAATCA TCGAAGCATC 13740

AATCAACACA AAATAGTAAA ACAATCAGAA CGCAAAATGA TAATCAAGTA AAGCAAGATT 13800

CTGAACGACA AGGTTCTAAA CAGTCACACC AAAATAATGC GACTAATAAT ACTGAACGTC 13860

AAAATGATCA GGTTCAAAAT ACCCATCATG CTGAACGTAA TGGATCACAA TCGACAACGT 13920

CACAATGGAA TGATGTTGAT AAATCACAAC CATCCATTCC GGCACAAAAG GTAATACCCA 13980

ATCATGATAA AGCAGCACCA ACTTCAACTA CACCCCOGTC TAATGATAAA ACTGCACCTA 14040

AATCAACAAA AGCACAAGAT GCAACCACGG ACAAACATCC AAATCAACAA 6ATACACATC 14100

AACCTGCX»CA TCAAATCATA GATGCAAAGC AAGATGATAC TGTTCGCCAA AGTGAACAGA 14160

AACCACAAGT TGGCGATTTA AGTAAACATA TCGATGGTCA AAATTCCCCA GAGAAACCGA 14220

CAGATAAAAA TACTGATaAT AAACAACTAA TCAAAGATGC GCTTCAAGCG CCTAAAACAC 14280

GTTCGACTAC AAATGCAGCA GCAGATGCTA AAAAGGTTCG ACCACTTAAA GCGAATCAAG 14340

TACAACCACT TAACAAATAT CCAGTTGTTT TTGTACATGG ATTTTTAGGA TTAGTAGGCG 14400

ATAATGCACC TGCTTTATAT CCAAATTATT GGGGTGQAAA TAAATTTAAA GTTATCGAAG 14460

AATTGAGAAA GCAAGGCTAT AATGTACATC AAGCAAGTGT AAGTGCATTT GGTAGTAACT 14520

ATGATCGCGC TGTAGAACTT TATTATTACA TTAAAGGTGG TCGCGTAGAT TATGGCGCAG 14580

CACATGCAGC TAAATACGGA CATGAGCGCT ATGGTAAGAC TTATAAAGGA ATCATGCCTA 14640

ATTGGGAACC TGGTAAAAAG GTACATCTTG TAGGGCATAG TATGGGTGGT CAAACAATTC 14700

GTTTAATGGA AGAGTTTTTA AGAAATGGTA ACAAAGAAGA AATTGCCTAT CATAAAGCGC 14760

ATGGTGGAGA AATATCACCA TTATTCACTG GTGGTCATAA CAATATGGTT GCATCAATCA 14820

CAACATTAGC AACACCACAT AATGGTTCAC AAGCAGCTGA TAAGTTTGGA AATACAGAAG 14880






ATTTAGGATT AACGCAATGG GGCTTTAAAC AATTACCAAA TGAGAGTTAC ATTGACTATA 15000

TAAAACGCGT TAGTAAAAGC AAAATTTGGA CATCAGACGA CAATGCTGCC TATGATTTAA 15060

CGTTAGATGG CTCTGCAAAA TTGAACAACA TGACAAGTAT GAATCCTAAT ATTACGTATA 15120

CGACTTATAC AGGTGTATCA TCTCATACTG GTCCATTAGG TTATGAAAAT CCTGATTTAG 15180

GTACATTTTT CTTAATGGCT ACAACGAGTA GAATTATTGG TCATGATGCA AGAGAAGAAT 15240

GGCGTAAAAA TGATGGTGTC GTACCAGTGA TTTCGTCATT ACATCCGTCC AATCAACCAT 15300

TTGTTAATGT TACGAATGAT GAACCTGCCA CACGCAGAGG TATCTGGCAA GTTAAACCAA 15360

TCATACAAGG ATGGGATCAT GTCGATTTTA TCGGTGTGGA CTTCCTGGAT TTCAAACGTA 15420

AAGGT6CAGA ACTTGCCAAC TTCTATACAG GTATTATAAA TGACTTGTTG CGTGTTGAAG 15480

CGACTGAAAG TAAAGGAACA CAATTGAAAG CAAGTTAAAT TCATCTTCTG AATTTAATAT 15540

GCTATGTAAA TCGTGCTGTT ATCATGGCAC ATCAGATATA AGTAGCATCA CAGTGTTGAA 15600

TTTAAAAATA GTAAAGTGAA ATAAAGCGCC TGTCTCATTA GCGAAAACTA AAGGGACAGG 15660

OGTATCTGTT TATGAGCTTA ATAAATTGTA TGAATAATAT GGTTGATCGA ATAACTGTTT 15720

ATCATGATGA TAAATTGAGT TTTTTAAAAT AATGATATAT TACATCATTG TTATAGCGTT 15780

TAAGAAATCA ACAACTTTAC GATAAATAGT GATTGCTTCG TCATTAGGTC TACGATCAAA 15840

ATCATGCTCG TTTTTATTCA CGCGTTCAAA TGTT6AATGT GGAACATGAT TCATGATATG 15900

TTCGCTTTCC TCAACGGGAA CATCATAATC GCCATTACAA TGCGCAATGA AAACAGGTGG 15960



AAGTGTTTTA AGTTCATCTG GTGCAATATT ATATTTTGAA TTAGTATAAT CAGCAATGTT 16020 

AATCATATTT ATCCATTTAC CTGTGCCACG TGCATAAACG TAGATTAAAA AACGTTGTGC 16080 

GATTTGATCT TGAACAACCG GTGTTGGTGA AGTGAGTTGT GCAATCATTG TTTCGTTTAC 16140 

GCTTTGAGCT ATTTTTGCGT AATAACTATT AGTTGTTTTA AAAGGTTCAG TGTTGATGCG 16200 

ACTATAACCA TAAAAATCAA TAACACCATC AATATCTCTG TCTCGTGCAA TTAATAGACT 16260 

TAAATATGCA CCTGATGATC TGCCAAAGGT AAAAATAGGG CAATTAGAAT ATTGTGATTG 16320 

AATCGCATCG AATGAtGCgn AGnACATCCT CAATAATGCA ATCGAGACTT ACTTCTGGTA 16380 

ATAAACGATA ACTTAGTTGA ATTAAATCGT AATGTTCCGT AAgATATCGA TATACTGTGG 16440

GGATAAATCG TTAGCTTTAC CGAACATTAA TCCACCACCG TGGATGTAGA CAATAGCGCC 16500 

TTTTGTTGGT TGATTTTTTG CTTTAATAAT TGTGTAAGGT AATGCAAATG CATCTTTAGT 16560 

AATTACTTTA TCTTTAArTT CAGTCACGAT TTAATAGGCT CCTTATTTTT GATATTGATG 16620

TCATTATAAC ACTGTCTTAA ATTTCCATGA AAAATAGTCT TAAGACGATG AGTCATGATA 16680 
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CATCATTTTA ACAATATCTT TAAAAGCAGC ATGTGGAATG GCTAAATCTT CTAAATCTGC 16800 

CATAGAAAAT TCAAGATTGA TATCATGTGG TCGCTGTTCA GCAAGTTTAT GCACAAAGTC 16860 

AGGTTCTGTG ACAAAAGGCG AAGACATGCC GACCATATCT GCATGTTGTA AAGCATCTAA 16920 

AGCAGACTCT GGAGAATTAA TCCCGCCACT TGCAAITAAA GGGATACGAC CTGCTAAATG 16980 

TTCATAGACA ATTTGGTTAA CTGGTCGACC GAAATGATCA CCTGGTGTAC GAGACGTATT 17040 

TTGATAAATA TGTCGACCCC AGCTAGCGAT TGCTAAGTAT TGGATGTTTG AAACQTCCAT 17100 

GACCCAATTG ATTAATTGGT TGAACTCGTC AATGGTATAT CCTAAATCAC TGCCTCTGGT 17160 

TTCTTCTGGC GTTGCTCGAA ATCCTAAAAT AAAATTGTCA GGTGCTTCTT TATCAATCAC 17220

TTCTTOTACC GCACGCATAA CTTCTAAACA TAATCTTGCA CGATTTTTTA ATGAGTCGGC 17280 

ACCGTAATGG TCTGTACGTT TATTCGAAAA AGTTGAGAAA AATGTTTGAA TCAGCAAACG 17340 

TTGTGCAATC GAAATTTCCA CACCATCAAA ACCTGCTTTA ATCGCGCGTA ATGTAGCATC 17400

GCGATACT6C TGAATGATGC TATTGATTTT CTCATGAGAC ATGGCGATAA CATCGTGTTC 17460 

AATCGGTGAA TGCAATGTCA TAGGGCTTGG TCCATACACC TTTCCAAAAT TTAAAATGGC 17520 

TTGATTTGAA AAACGACCAG CATGCGCTAg CTGGATAATA GCGAGGCTAC CATGTTGTTT 17580 

CATCGTAGAT GCCATGTTAG TTAATCCAGG GATACAAGCA TCATGATCAA TATTAAAGCC 17640 

ATATTCAAAC AATTGACCAT AAGGTTCAAT GTAAGCAGCG CCGGTGACTT GCATTCCAGC 17700 

TGAATTAGAG CGACGTGCAG CATAAGCCAA GTCTTCTTTT GTAATATAGC CTTCTTTTGT 17760 

TGATGTGTTT ACGGTCATTG GTGATAATAC AAAGCGATTC GAAATTTTGA TGCCATTAGG 17820 

TAAGTGGATT GATTGTAAAA GTGGTTTGTA TCGGTACATA CTATGATTCC TTTTCTATTC 17880 

AATATTGTTT TCAAAGTACC ATGGAAAGAA TOAATAATCA ATGATGAACA GTCTTGATAG 17940 

AATAGAATTG GTACATGGAA AGTATTTTTA AAATTAAACT AATGAATGGC ATTTGTAGGT 18000 

CTGAAAATAT GAATATGAAA AAGAAAAATA AAGGCGAAAA GATATAAAAG TTAATTGAAA 18060 

AACGTTATCA TATACGTGGG TATATGAAGA GGGAATGGTA TTAAGAACGC TAAAATGTTA 18120 

TGTCGGTTTG ACATGACAGG ATAAGTTTGG AGATGACGGA TTGGTTAAAT TAAGCGTATT 18180 

AGACTATGCC TTAATAGATG AAGGTAAGGA TGCACAAAAG GCATTGCAAG ATTCAGTGAC 18240

ACTTGCAAAA TTAGCAGATC GACTTGGCTT TAAGCXjAATT TGGTTTACGG AACATCATAA 18300 

TGTACCAGCG TTT6CGTGTA GTAGTCCAGA ACTTTTGATG ATGCATACAT TGGCGCAGAC 18360 

AAATCACATA CGAGTTGGCT CTGGTGGTGT GATOCTGCCX; CACTATCGAC CTTATAAAAT 18420 

TGCTGAGCAT TTTAGAATGA TGGCAGCOTT ATATCCAAAT CGTATTGATT TAQGTATTGG 18480 
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20 



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40 



TAGTTACGAT GAATCGATTT CGTTATTACG TGATTATCTT ACAATAAAGG ATAAACCAAG 18600

TGCGCATACG TTAGGTGTCC AACCACACAT TGATCATTTT CCAGAAATGT GGTTATTAAG 18660

TAGTAGCGCA ACATCTGCCA AAATAGCTGC CGAACTAGGT ATAGGGCTTT CTGTTGGAAC 18720

ATTTTTGCTA CCAGATATAA ATGCGATACA TACAGCGAAG GATAACATTG ATATTTACAA 18780

AAAACATTTC CAAGCATCAA CGATTAAAAT GGACGCAAAG GTQATGGCAT CTGTATTTGT 18840

CATTGTAGCT GATAACGAAG CGGAAGTAGC AGCATTACAA CATGCCTTAG ATGTTTGGTT 18900

ATTAGGTAAA TTACAATTTG CAGAATTTGA AGATTTTCCT TCAGTAGACA CAGCACAAAA 18960

GTATAAGCTT AATGATCGAG ACAAAGAGAT GATTCAAGCA CATCAAGCAC GCATCATTGC 19020

AGGTACACAA GAAAAGGTTA AAGCACAATT AGATGATTTC ATTGCTACGT TTGAAGTTGA 19080

TGAGGTGTTA GTAGCACCGC TTATTCCAGG TATTGAACAG CGTTGTAAAA CATTAAAATT 19140

ACTCGCGGAA ATTTATTTGT AGCATTTTAA ATAGAAGAGA AAGGATGAAG ATAAGATGAA 19200

AAAGTTAGCC AATTATTTAT GGGTAGAAAA AGTAGGAGAT TTGTATGTGT TTAGTATGAC 19260

ACCTGAATTG CAAGATGATA TTGGGACAGT AGGTTATGTT GAATTCGTAA GTCCAGATGA 19320

AGTTAAAGTG GATGATGAAA TTGTGAGTAT CGAAGCATCG AAAACGGTCA TTGATGTGCA 19380

AACGCCATTG TCAGGAACGA TTATTGAGCG AAATACAAAA GCGGAAGAAG AACCGACAAT 19440

TTTAAACTCT GAAAAACCAG AAGAAAATTG GTTGTTCAAA TTGGATGATG TCGATAAAGA 19500

AGCATTCCTA GCATTACCGG AGGCTTAAAT GGAAACGTTA AAATCAAATA AAGCGAGACT 19560

TGAATATTTA ATCAATGATA TGCATCGAGA GAGAAATGAC AATGACGTAT TGGTAATGCC 19620

ATCTTCATTT GAAGATTTGT GGGAATTATA TOGAGGCTTA GCAAATGTCA GACCGGCATT 19680

ACCTGTAAGT GATGAATATT TAGCTGTACA AGATGCTAT6 TTAAGTGATT TGAATCGTCA 19740

ACATCTTACG GATTTGAAGG ATTTGAAGCC GATAAAAGGT GACAATATCT TTGTTTGGCA 19800

AGGfGATATC ACGACGTTAA AAATCOATGC TATTGTTAAT GCTGCAAATA GTCGTTTTCT 19860

AGGATGTATG CAAGCTAATC ATGACTGCAT TGATAATATT ATTCATACAA AAGCGGGTGT 19920

TCAAGTTCQA CTTGATTGTG CAGAGATCAT TCGACAACAA GGGCGCAATG AAGGTGTAGG 19980

TAAAGCCAAA ATAACACGTG GATATAATTT GCCAGCAAAG TATATAATTC ATACGGTTGG 20040

TCCGCAAATA CGTCGATTGC CTGTTTCAAA GATGAATCAG GACTTGTTAG CTAAATGTTA 20100

TCTTAGCTGT CTTAAATTGG CTGATCAACA TAGTTTAAAT CATGTCGCTT TTTGCTGTAT 20160

ATCTACAGGT GTATTTGCTT TTCCTCAAGA TGAAGCAGCA GAAATTGCTG TTCGAACAGT 20220

AGAAAGCTAT CTCAAAGAAA CAAATTCAAC ATTGAAAGTC GTGTTCAATG TATTTACAGA 20280
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CAATGTCTCT GTTAATGGAT GACAAGACAA AGCAGGCTGA AGTATTGCGT ACTGCGATTG 20400 

ATGAAGCAGA TGCGATAGTG ATTGGAATTG GTGCAGGCAT GTCTGCATCT GACGGATTTA 20460 

CATATGTAGG AGAGCGTTTT ACGGAAAATT TCCCAGATTT TATTGAAAAA TATCGCTTCT 20520 

TTGATATGTT GCAAGCGAGT TTACATCCTT ATGGCAGTTG GCAAGAGTAT TGGGCATTTG 20580 

AGAGTCGTTT TATTACATTA AACTATTTAG ATCAACCTGT AGGTCAGTCT TACCTCGCTT 20640 

TAAAATCCTT GGTGGAAGGT AAACAGTACC ACATTATAAC TACGAATGCA GATAATGCTT 20700 

TCGATGTAGC TGATTATGAT ATGACTCATG TATTTCATAT ACAAGGGGAG TATATACTGC 20760 

AACAGTGTAG CTCAGCATTG TCATGCTGAA ACGTATCGCA ATGATGATTT AATTCXSTAAA 20820 

ATGGTTGTTG CGCAACAAGA TATGCTTATA CCTTGGGAGA TGATTCCAAG ATGTCCAAAA 20880 

TGTGATGCCC CAATGGAAGT GAATAAACXTT AAAGCGGAAG TTGGGAT6GT TGAAGATGCT 20940 

GAATTTCATG CX3CAACTACA TCGTTATAAT GCTTTTCTAG AGCAACATCA AGATGATAAA 21000 

GTGTTGTATT TGGAAATTGG AATTGGTTAT ACTACACCAC AATTTGTGAA GCATCCTTTT 21060 

CAGCGTATGA CACGTAAAAA T6AAAATGCC CTTTATATGA CGATGAATAA AAAGGCATAT 21120 

CGCATTCCGA ATTCAATTCA AGAACGTACC ATACATTTAA CTGAGGATAT CTCAACATTG 21180 

ATTACAGCAG CACTCCGGAA CGACAGCACA ACGAAAAATA ACAACATTGG A6AGACAGAA 21240 

GATGTACTTA ATAGAACCGA TTAGAAATGG AGAATATATT ACTGATGGTG CGATTGCACT 21300 

CGCTATGCAA GTTTATGTTA ACCAGCATAT CTTTTTAGAT GAAGATATTT TATTCCCTTA 21360 

TTATTGTGAT CCAAAAGTGG AAATTGGACG TTTTCAAAAT ACTGCTATAG AAGTGAATCA 21420 

AGATTATATA GATAAACACA GTATTCAAGT AGTTCGCCGA GATACTGGTG GTGGCGCTGT 21480 

GTATGTTGAT AAAGGTGCCG TTAATATGTG TTGTATTTTA GAACAAGACA CTTCAATTTA 21540 

TGGT&Arrrr CAACGATTTT ATCAACCAGC TATAAAGGCG TTGCATACAT TAGGTGCAAC 21600 

AGATGTGGTA CAAAGCGGTA GAAATGATTT AACATTGAAT GGTAAAAAAG TGTCAGGCGC 21660 

CGCAATGACA TTAATGAATA ATCGTATTTA TGGCGGTTAT TCGCTATTAC TTGATGTTAA 21720 

TTATGAAGCA ATGGATAAAG TGTTAAAGCC TAATCGCAAA AAGATTGCAT CGAAAGGGAT 21780 

TAAATCTGTG CGCGCACGTG TTGGTCATCT TAGAGAAGCA CTGGATGAAA AGTATCGTGA 21840 

TATAACCATT GAAGAATTTA AAAATTTAAT GGTGACGCAG ATTTTGGGAA TCGATGACAT 21900 

TAAAGAGGCG AAACGATATG AATTAACGGA TGCAGATTGG GAAGCGATTG ATGAATTAGC 21960 

TGATAAAAAG TATAAAAATT GGGATTGGAA TTATGGCAAG TCACCCAAAT ATGAATACAA 22020 

TCGAAGTGAA AGATTATCTT CAGGTACGGT AGACATAACA ATTTCTGTTG AACAAAATCG 22080 
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AGAAGCATTA CAAGGAACAA AAATGACAAG AGAAGATTTA ACGCATCAGT TAAAGCAATT 22200 

AGACATCGTT TATTATTTTG GCAATGTTAC GGTAGAAGCA TTAGTGGATA TGATTTTAAG 22260 

TTAATATTGT TATTTTATGT ATGCTGAATC ATTGGAAGTG TTTGCTTGCT CTTGAAAAGG 22320

TGACAATAGT GTTTGGTGAA GGTTGAACAT ATGAGTGGAA ATTATTGCCT TTAACTATTC 22380 

AAAGTATGAT ATATATATGG TTTTTGTTTC TAAATGATTG GGTATTTGAA AATAGATGAG 22440 

TTTAATATTT TAAGGAATAT AATGATGTTT ACTTTTATAA TTCATATAGA ATATTAAGCA 22500 

ATATAAGTCT GTTGATATAT ACAAAATATA ATGACTGCTA TAATGAGTAA TCAATAGACA 22560 

CAAAGAGGAG ATTATGTGAT GAATAATAAA GTATTAGTAA CCGGTGGTAC AGGGTTTGTT 22620 

GGCATGCGAA TTATTTCACG ATTATTAGAA CAAGGTTATG AC6TACAAAC GACGATAC6T 22680 

GATTTAAGTA AAGCTGATAA AOTAATTAAA ACAATGCAAG ACAATGGCAT TTCCACAGAG 22740 

CGATTAATGT TTGTCGAAGC GGATTTATCA CAAGATGAAC ATTGGGATGA AGCAATGAAA 22800

GATTGCAAGT ATGTCTTGAG TGTAGCATCT CCOGTGTTTT TCG6TAAAAC AGACX^TGCA 22860 

GAAGTQATGG CGAaCTGcAA TTGAAGGTAT ACAACGTATT TTAAGAGCTG CAGAACATGC 22920 

GGGTGTTAAA CGTGTGGTAA TGACTGCAAA CTTTGGTGCA GTTGGTTTTA GTAATAAAGA 22980

TAAAAATTCA ATCACAAATG AAAGTCATTG GACAAATGAA GAT6AACCAG GCTTATCAGT 23040 

ATATGAAAAA TCAAAATTGT TAGCTGAAAA GGCAGCGTGG GATTTTGTTG AGAATGAAAA 23100 

TACAACAGTA GAATTTGCCA CAATCAATCC AGTTGCAATT TTTGGGCCAT CATTAGATGC 23160 

ACACGTTTCA GGAAGCTTTC ATTTATTAGA AAATTTATTG AATGGTTCAA TGAAACGTCT 23220 

ACCGCAAATT CCGTTAAATG TTGTTGATGT GAGAGACGTA GCTGAACTGC ACATTTTGGC 23280 

AATGACAAAT GAACAAGCTA ATGGCAAGCG ATTTATTGCG ACOGCTGATG GACmAATTwA 23340 

tTTGlTGGGA ATIGcCAAAt TAATTAAAGA AAAGGGCCTG GAAATAGCTC CAAAAGTTCC 23400 

TACTAAAAAA TTACCCAGCT TTATTTTGAG CnAnGnGCC 23439 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4522 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39:

CCCTTTGAGA GTATATCATC TAGTCAAATT ATGCCTGTCA TTAGAGCGAC TAGCTTTGAT 60 

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20 



TATTATGCAG TCGATTTAGG GAAATCATAT CX5TCTAATTG ACX3AAAGCAT GTTAGAGGAT 180 

TTGAAGTTAA CTGAACAACA AATAAGAGAA ATGTCTCTGT TTAATGTTAG AAAATTGTCA 240 

AATTCATATA CGACTGATGA AGTAAAAGGT AATATnTTT ATTTTATTAA CTCAAATGAC 300 

GGGTATGATG CAAGTAGGAT ACTAAATACT GCATTTTTAA ATGAAATTGA GGCACAATGT 360 

CAAGGCGAAA TGCTCGTAGC AGTGCCACAC CAAGATGTGT TAATTATTGC AGATATACGC 420

AATAAAACAG GATATGATGT GATGGCACAT TTAACAATGG AATTTTTCAC TAAAGGTCTA 480 

GTTCCAATTA CATCATTATC CTTTGGATAT AAACAGGGTC ATCTTGAACC GATATTTATT 540 

TTAGGTAAAA ATAATAAACA AAAAAGAGAT CCAAACGTGA TTCAGCGTTT AGAAGCAAAT 600 

CGTCGTAAAT TTAATAAAGA TAAATAGAAA TAATTGGATA AGGAGTTTTG TCATAATGAA 660 

TTTATTTTAC AATCCTAAAT ATGTAGGAGA TGTCGCATTT TTACAAATTG AACCAGTTGA 720 

AGGTGAATTA AACTACAATA AAAAAGGTAA TGTTGTTGAA ATTACtAATG AAGGTAATGT 780 

TGTAGGTTAT AATATTTTTG AAATTTCAAA AGATATAACA ATTGAAGAAA AAGGTCATAT 840 

TAAATTAACT GATGAACTTG TAAATGTATT CCAAAA6CGT ATTTCAGAAG CTGGTTTTGA 900 

TTATAAATTA AATGCTGATC TATCACCGAA ATTTGTAGTT GGCTACGTTG AAACTAAAGA 960

CAAACATCCT GATGCAGATA AATTAAGTGT ACTAAATGTA AACGTTGGAA ATGACACATT 1020 

ACAAATTGTA TGTGGCX3CQC CTAACGTTGA AGCTGGACAG AAAGTTGTTG TTGCTAAAGT 1080 

AGGTGCAGTG ATGCCTAGCG GTATGGTAAT TAAAGATGCT GAATTACGTC GTGTTGCCTC 1140

AAGCGGTATG ATTTGTTCAA TGAAAGAATT GAATTTACCT AATGCACCTG AAGAAAAAGG 1200 

TATTATGGTA TTAAATGACA GCTATGAAAT TGGACAAGCA TTtTTTGAAT AATTAAGGAA 1260 

Q6TAGTGAAA ATATGAGCTG GTTTGATAAA TTATTCXSGCG AAGATAATGA TTCAAATGAT 1320 

GACnGATTC ATAGAAAGAA AAAAAGACXST CAAGAATCAC AAAATATAGA TrACGATCAT 1380

GACTCATTAC TGCCTCAAAA TAATGATATT TATAGTCGTC CGAGGGGAAA ATTCCGTTTT 1440 

CCTATGAGCX5 TAGCTTATGA AAATGAAAAT GTTGAACAAT CTGCAGATAC TATTTCAGAT 1500 

GAAAAAGAAC AATACCATCG AGACTATCGC AAACAAAGCC ACGATTCTCG TTCACAAAAA 1560 

CGACATCGCC GTAQAAGAAA TCAAACAACT GAAGAACAAA ATTATAGTGA ACAACGTGGG 1620 

AATTCTAAAA TATCACAGCA AAGTATAAAA TATAAAGATC ATTCACATTA CCATACGAAT 1680 

AAGCCAGGTA CATATGTTTC TGCAATTAAT GGTATTGAGA AGGAAACGCA CAAGCCAAAA 1740 

ACACATAATA TGTATTCTAA TAATACAAAT CATCGTGCTA AAGATTCAAC TCCAGATTAT 1800

CACAAAGAAA GTTTCAAGAC TTCAGAGGTA CCGTCAGCTA TTTTTGGCAC AATGAAACCT 1860 
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AAACAAAAAT ATGATAAATA TGTAGCTAAG 
ACGCAAACGT CTCAAAATAA ACAATTAGAA 1980

CAAGAAAAAC AAAATGATAG TGTTGTCAAA CAAGGAACTG CATCTAAATC ATCTGATGAA 2040

AATGTATCAT CAACAACAAA ATCAATGCCT AATTATTCAA AAGTTGATAA TACTATCAAA 2100

ATTGAAAATA TTTATGCTTC ACAAATTGTT GAAGAAATTA GACGTGAACG AGAACGTAAA 2160

GTGCTTCAAA AGCGTCX3ATT TAAAAAAGCG TTGCAACAAA AGCGTGAAGA ACATAAAAAC 2220

GAAGAGCAAG ATGCAATACA ACGTGCAATT GATGAAATGT ATGCTAAACA AGcGGAACgC 2280

TATGTTGGTG ATAGTTCATT AAATGATGAT AGTGACTTAA CAGATAATAG TACAGATGCT 2340

AGTCAGCTTC ATACAAATGG CATAGAGAAT GAAACTGTAT CAAATGATGA AAATAAACAA 2400

GCGTCAATAC AAAATGAAGA CACTAATGAC ACTCATGTAG ATGAAAGTCC ATACAATTAT 2460

GAGGAAGTTA GTTTGAaTCA AGTATCGACA ACAAAACAAT TGTCAGATGA TGAAGTTACG 2520

GTTTCGAATG TAACGTCTCA ACATCAATCA GCACTACAAC ATAACGTTGA AGTAAATGAT 2580

AAAGATGAAC TAAAAAATCA ATCCAGATTA ATTGCTGATT CAGAAGAAGA TGGAGCAACG 2640

aATAAAGAAG AATATTCAGk AAGTCAAATC GATGATGCAG AATTTTATGA ATTAAATGAT 2700

ACAGAAGTAG ATGAGGATAC TACTTCAAAT ATCGAAGATA ATACCAATAG AAACGCGTCT 2760

GAAATGCATG TAGACGCTCC TAAAACGCAA GAGTACGCAG TAACTGAATC TCAAGTAAAT 2820

AATATCGATA AAACGGTTGA TAATGAAATT GAATTAGCAC CGCGTCATAA AAAAGATGAC 2880

CAAACAAACT TAAGTGTCAA CTCATTGAAA ACXiAATGATG TGAATGATAA TCATGTTGTG 2940

GAAGATTCAA GCATGAATGA AATAGAAAAG AATAACGCAG AAATTACAGA AAATGTGCAA 3000

AACGAAGCAG CTGAAAGTGA ACAAAATGTC GAAGAGAAAA CTATTGAAAA CGTAAATCCA 3060

AAGAAACAGA CTGAAAAGGT TTCAACTTTA AGTAAAAGAC CATTTAATGT TGTGATGACG 3120

CCATCTGATA AAAAGCGTAT GATGGATCGT AAAAAGCATT CAAAAGTCAA TGTGCCTGAA 3180

TTAAAGCCTG TACAAAGTAA GCAAGCTGTG AGTGAAAGAA TGCCTGCGAG TCAAGCCACA 3240

CCATCATCAA GATCTGATTC ACAAGAGTCA AATACAAATG CATATAAAAC AAATAATATG 3300

ACATCAAACA ATGTTGaGAA CAATCAACTT ATTGGTCATG CAGAAACAGA AAATGATTAT 3360

CAAAATGCAC AACAATATTC AGAGCAGAAA CCTTCTGTTG aTTCAACTCA AACGGAAATA 3420

TTTGAAGAAA GTCAAGATGA TAATCAATTG GAAAATGAGC AAGTTGATCA ATCAACTTCG 3480

TCTTCAGTTT CAGAAGTAAG CX^ACATAACT GAAGAAAGCG AAGAAACAAC ACATCCAAAC 3540

AATACTAGTG GACAACAAGA TAATGATGAT CAACAAAAAG ATTTACAGTC ATCATTTTCA 3600

AATAAAAATG AAGATACAGC TAATGAAAAT AGACCTCGGA CGAACCAACA AGATGTTGCA 3660
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CCAAGTGTTT 


CATTACTAGA 


AGAACCACAA 


GTTATTGAGT 


CGGACGAGGA 


CTGGATTACA 


3780 


GATAAAAAGA AAGAACTGAA TGACGCATTA TTTTACTTTA ATGTACCTGC AGAA6TACAA 3840

GATGTAACTG AAGGTCCAAG TGTTACAAGA TTTGAATTAT CAGTTGAAAA AGGTGTTAAA 3900

GTTTCAAGAA TTACGGCATT ACAAGAT6AC ATTAAAATGG CATTGGCAGC GAAAGATATT 3960

CGTATAGAAG CGCCTATTCC AGGAACTAGT CX5TGTTGGTA TTGAAGTTCC GAACCAAAAT 4020

CCAACGACAG TCAACTTACG TTCTATTATT GAATCTCCaA GTTTTAAAAA TGCTGAATCT 4080

AAATTAACAG TTGCGATGGG GTATAGAATT AATAATGAAC CATTACTTAT GGATATTGCT 4140

AAAACGCCAC ACGCACTAAT TGCAGGTGCA ACTQGATCAG GGAAATCAGT TTGTATCAAT 4200

AGTATTTTGA TGTCTTTACT ATATAAAAAT CATCCTGAGG AATTAAGATT ATTACTTATC 4260

GATCCAAAAA TGGTTGAATT AGCTCCTTAT AATGGTTTGC CACATTTAGT TGCACCGGTA 4320

ATTACAGATG TCAAAGCAGC TACACAGAGT TTAAAATGGG CCGTAGAAGA AATGGAACGA 4380

CGTTATAAGT TATTTGCACA TTACCCATGT ACGTAnTATA ACAGCATTTA ACnAAAAAGC 4440

CCCATATGAT GAAAGAATGn CAAAAATTGT CATTGTAaTT GATGAGTTGG CTGATTTAAT 4500

GATGATGGTC CGCAAGAAGT TG 4522



(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 751 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40:

TCAACTTTAC GGATACGTAT ATATTTTGCA TGACATTTAG TGCAATAATA TTCATAATTT 60 

GCCCGTTGTT GATAGCTTTC AATGCTGTTA CAAAATCTAG 6CGCTCCAAC CTGTTGGCTC 120 

AATCGTTTAA AATCTTGATC TTTATGTTGA TAACCTTTAC CAGCAATATG CAAGTGATAA 180 

TGACACAATT CGTGCAGTAT AATTTTTACA ACAGCATCTT CTCCATAATG CTCATATTGT 240 

TTTGGATTAA TTTCAATATC ATGGGACTTT AAAAGATAAC GTCCGCCTGT TGTACGTAAC 300 

CTTTTATTAA AATATGCACA ATGTCGAAAC GTACGTCCAA ATTTTTCTTC CGAAAGATTC 360 

TCAACCATTC GCTGAAGTTT GTCATTATTC ATGTGGATCA ATCATCGTTA ATGATACTTT 420 

GTCTTTATTT TTGTCAATAC TGTAAATCCA AACGTCAACG ATATCACCAA CACTGACAAT 480 

ATCCATTGGA TTTTTTACGA ACTTCTTAGA AAGTTTCGAA ACATGGACAA GTCCATCTTG 540 

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TTTCATTCCT TCTTGTAAAT CTTCAATTGA TAGCACATCG GATTTAAGGA TTGGTGTTTC 660 
AAACTCGTCC CTTGGATCTC GATTAGGTGC GTTCAAGGAT TTAATAATAT CCTCTAATGT 720 

AGGTACACCG ACTTGTAATT CAATCGCCAG T 751 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1076 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41:



TCTCCAGCTT TAACTTuATw TGGCACTTTA ACAATTGTCT GATCCATACA TACGCGACCA 60




« »fVTV^ &rTV^ A/^f^ 


ATTTACATTT 


ACAAAGCTAC 


CTTGCATTAT GCGTAAATGG 


120 








ATT6TAGTTG 


GGTCAGTA6C 


TGTATAAGTT 


180 






X X\9X 


AGCGTCTTTG 


TTTGAACTAC 


ATTAGCAATT 




AATTGCACAC TTGGTTTAAG GTGTACTTTA ACTTTTTGCT GTACATACTC TGATGGATAA 300

TATCCATAAA GGGAAATTCC TGGTCTTATT GCATTACAGA ATTGGCAATC CATTAATAGA 360

GAGCCTGCTG AGTTCTGACA ATGTATATAT TCAGGTTTAA TTGCTTCATT GACCATATCT 420

TTAAAACGTT GATATTGTTC AGTTGTCATA TCTCCTGGTT CGTCAGCACA GGCAAAGTGT 480

GTAAACACGC CTTCAAATAC AAGTTGCTCA TATTGTTGAA TGATTTCAAT CACTTCTTGA 540

TACGTTTTAG TATCTTTAAT ACCTAAACGT CCCATTCCTG TATCTAATTT AATGTGCAAC 600

CATAACTTTT TCTCTTGCTC ACCAGAAAT6 TTTTTAATTG CTTCTTTCAA CCACTOTTTA 660

GACCS&AACCG TTAAGGCAAC TCGGTGTTGT ATCGCTTTAT CAATATCTTT AGCTGGTAAC 720

ACACCTAAGA CTAAAATTTT AGCAGTAATC CCATGCATTC TAAGTTCTAT CGCTTCATCT 780

AACGTTGCTA CAGCAAAAAA TGTGGCGCCA TTTTCCATTA AATGACGTGC TACTTTAACA 840

CTACCTAGTC CATAGGCATT GGCTTTAACG ACAGCCATCA CTGTTTTATT TGGATGCAAT 900

GTACTGAATA CTTTGAAATT TGATGCAACA GCGTTTAAAT CTACATTCAT ATACGCAGAT 960

CTATAATATT TATCCGACAT ATTACTTCCT CCTGTAATTC CCACACGTTT TAAAACTAGA 1020

TCTTAATTAT CATTGTATAA CAAATTTAAA ATGCTGACTT TTCTAAAACA ACTTGG 1076



(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2930 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

TGACCACAAT GCCCAATACA ACCATCCCAT GGTAAAGCCA AGAGATGAGT CAATAAAGCG 60 

TGTTGAATAA GAGCTGAATG AACCTGATAC TGGATAAAAT GTTGCCAACT CTCCAATTGA 120 

TGACATTAAG AAATATAGCA TGACACCAAT AACAAGATAA GCOAQTATAG CX3CCTCCAGQ 180 

ACCAGCTTGA GAAATGATAT TACCAGTAGC TACAAATAGA CCAGTCCCAA TTGCACCACC 240 

TATAGCAATC ATGGAAATGT GTCTTQAGTT AAGACTACGQ TTCATTTTAT TATCTTCCAT 300 

ATTTAGTCTC CCATCTATTT AAATATACCC ATTATTGTAA GCTTTTTAAG TGTACTATTC 360 

AATAACTATT TAGTACTGTA AAGCGAAAAA ATTAAAATTT TCTGATTTTT TAATCATCTT 420 

GAGCATGTTT AATTGTAATT TTGATGGGGT TAAATTATAA TATGTATTAA ATTATAATTA 480

TnATAAATTG TGGAGGGaTG ACTATGTCAC AACAAGACAA AAAGTTAACT GGTGTTTTTG 540 

GGCATCCAGT ATCAGACCGA GAAAATAGTA TGACAGCAGG GCCTAGGGGA CCTCTTTTAA 600 

TGCAAGATAT TTACTTTTTA GAGCAAATGT CTCAATTTGA TAGAGAAGTA ATACCAGAAC 660

GTCGAATGCA TGCCAAAGGT TCTGGTGCAT TTGGGACATT TACTGTAACT AAAGATATAA 720 

CAAAATATAC GAATGCTAAA AtATTCTCTG AAATAGGTAA GCAAACCGAA ATGTTTGCCC 780 

GTTTCTCTAC TGTAGCAGGA GAACGTGGTG CTOCTGATGC GGAcGTGACA TTCGAGGATT 840

TGCGTTAAAG TTCTACACTG AAGAAGGGAA CTGGGaTTTA GTAGGGAATA ACACACCaGT 900 

ATTCTTCTTT AGAGATCCAA AGTTATTTGT TAGTTTAAAT CGTGCGGTGA AACGAGATCC 960 

TAGAACAAAT ATGAGAGATG CACAAAATAA CTGGGATrTC TGGaCGGGTt TCCAGAAGCA 1020 

TTGOICCAAG TAACGATCTT AATGTCAGAT AGAGGGATTC CTAAAGATTT ACX3TCATATG 1080

CATdGGTTCG GTTCTCACAC ATACTCTATG TATAATGATT CTGGTGAACG TGTTTGGGTT 1140 

AAATTCCATT TTAGAACGCA ACAAGGTATT GAAAACTTAA CTGATGAAGA AGCTGCTGAA 1200 

ATTATAGCTA CAGATCGTGA TTCATCTCAA CGCGATTTAT TCGAAGCCAT TGAAAAAGGT 1260 

GATTATCCAA AATGGACAAT GTATATTCAA GTAATGACTG AGGAACAAGC TAAAAACCAT 1320 

AAAGATAATC CATTTGATTT AACAAAAGTA TGGTATCACG ATGAGTATCC TCTAATTGAA 1380 

GTTGGAGAGT TTGAATTAAA TAGAAATCCA GATAATTACT TTATGGATGT TGAACAAGCT 1440 

GCGTTTGCAC CAACTAATAT TATTCCAGGA TTAGATTTTT CTCCAGACAA AATGCTGCAA 1500

GGGCGTTTAT TCTCATATGG CGATGCGCAA AGATATCGAT TAGGAGTTAA TCATTGGCAG 1560 

EP 0 786 519 A2 



GGTCAAATGC GCGTAGTTGA CAATAACCAA GGTGGAGGAA CACATTATTA TCCAAATAAC 1680

CATGGTAAAT TTGATTCTCA ACCTGAATAT AAAAAGCCAC CATTCCCAAC TGATGGATAC 1740

GGCTATGAAT ATAATCAACX3 TCAAGATGAT GATAATTATT TTGAACAACC AGGTAAATTG 1800

TTTAGATTAC AATCAGAGGA CGCTAAAGAA AGAATTTTTA CAAATACAGC AAATGCAATG 1860

GAAGGCGTAA CGGATGATGT TAAACGACGT CATATTCGTC ATTGTTACAA AGCTGACCCA 1920

GAATATGGTA AAGGTGTTGC AAAAGCATTA GGTATT6ATA TAAATTCTAT TGATCTTGAA 1980

ACTGAAAATG ATGAAACATA CGAAAACTTT GAAAAATAAA TTTGATATGT AGTTTCTATA 2040

TTGCGTAGTT GAGCAGTTTA TGATATCATA ATAAATCGTA AAGATTCCTA ACAAGAGAGG 2100

GTGTTTAACG TGCGCGTAAA CGTAACATTA GCATGCACAG AATGTGGCGA TCGTAACTAT 2160

ATCACTACTA AAAATAAACG TAATAATCCT GAGCGTATTG AAATGAAAAA ATATTGCCCA 2220

AGATTAAACA AATATACGTT ACATCGTGAA ACTAAGTAAT TCTTATCATT CAAATACGAC 2280

GATTTGAAAA TAAAGCGGGC TTACCTATTA TATTGGGGAG CTCGCTTTTT TATGAAATTT 2340

TTGTGAAGAG TGATTAATGG ATTGAGTTTC ATCGGTAGAA CAATATATGA TTATATTAGT 2400

TGTTACTTTA TTAAAaTTTG AGAATATTTA TAGAAGGAAA TAGATTACTG ATTTTATAAA 2460

GTCACTTTGT TAGCGAATGC TTGAAAGAGT ATTTAATATA GTAGAATTTA AAATTTCAAA 2520

GCGGAATTTA ATAAGTACGA AGTAGTTCTG GGTATGTTTT ATAAATGTTC GATAATACAC 2580

TTTAATCTTA AATATGATGG TTTAGAAAAT GATTTAACAA AGAAATGAaA CTTTACTGTT 2640

GAATTATGTG AGGATTGTGT TATTATATAA ATCGTAATAA TTACGATTTG ATAAAAAGTG 2700

AGGTAACTAT ATATGGCTAA GAAATCTAAA ATAGCAAAAG AGAGAAAAAG AGAAGAGTTA 2760

GTAAATAAAT ATTACGAATT ACGTAAAGAG TTAAAAGCAA AAGGTGATTA C3GAAGCGTTA 2820

AGAAAATTAC CAAGAGATTC ATCACCTACA CGTTTAACTA GAAGAT6TAA AGTAACTGGA 2880

AGACCTAGAG GTGTATTACG TAAATTTGAA ATGTCTCGTA TTGC3GTTTAG 2930



(2) INFORMATION FOR SEQ ID NO: 43:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3606 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43:

CTTCTTGCCA TGGCTCTCTT TATTTAAAAA TGCTTCCAAC TTGTCCATTT GATTGTTTCT 60

TTATAAAAAA CTAATTTTAC AAATGCTTTT GCGTTCTTAC AAAAAATGCA TTTGACTATT 180

ATTATAATAA GCGTATAATT GTCGCATATT ATTTTTTGTA TTTTTGGCAA TAACGAAGGA 240

GTATTTATGA ATAAAGACAA GCAATTGCAC AACGACAAAA TCAATCTATC CCAATTAGTC 300

TTATTAGGGT TAGGCTCTTT AATAGGATCT GGTTGGCTAT TTGGTGCGTG GGAAGCATCA 360

TCAATAGCTG GACCAGCAGC AATCATATCA TGGGTTCTTG GATTCCTAGT CATTGGAACC 420

ATTGCCTATA ACTACATTQA AATCGGCACA ATGTTTCCTC AATCAGGTGG CATGAGTAAC 480

TATGCCCAGT ATACACATGG CTCATTATTA GGCTTTATTG CTGCTTGGGC GAATTGGGTG 540

TCTTTGGTGA CAATAATACC TATC6AAGCT 6TGTCAGCTG TTCAATATAT GAGTTCTTGG 600

CCGTGGCATT GGGCGAAACC AATGAGATAT TTAATGGAAA ATGGCTCTAT TAGCACATAC 660

GGATTGCTAG CTGTATATCT CATCATTGTT ATTTTTTCAT TATTAAACTA TTGGTCCGTA 720

AAACTTTTAA CATCATTTAC GAGTTTAATT TCTCTAITTA AATTAGGCGT ACCCATGTTA 780

ACCATCATCA TGTTGATGCT ATCAGGATTC GACACTTCAA ATTACGGCCA TTCGGCAAGC 840

ACATTTATGC CTTACGGAAG TGCACCGATT TTTGCTGCAA CAACAGCATC AGGGATTATT 900

TTTTCATTCA ATTCATTCCA GACAATTATT AATATGGGTT CAGAAATTAA AAATCCTGAA 960

AAAAATATCG CAAGAGGCAT CGCTATCTCA CTGTCAATCA GTGCAGTGTT GTACATCATT 1020

TTACAAAGTA CGTTTATCAC TTCTATGCCT CAATCAATGT TACAACATAG TGGATGGAAT 1080

GGCATCAACT TCAATTCACC ATTTGCTGAT TTAGCTATCT TATTAGGAAT TAATTGGCTC 1140

GCAATTTTAC TATACATTGA AGCTTTTGTA TCACCATTCG GTACTGGCGT GTCATTTGTC 1200

GCCGTTACAG GTCGAGTTTT ACGAGCAATG GAGAAAAATG GACATATCCC TAAATTTCTT 1260

GGGAAGATGA ATGAAAAATA TCATATCCCA CGTGTAGCAA TCATCTTTAA TGCCATCATT 1320

agtjCtgatta TGGTTACATT ATTTAGAGAT TGGGGTACGC TAGCAGCAGT TATTTCTACT 1380

GCAACTTTAG TAGCCTATTT AACTGGCCCA ACGACAGTGA TTQCATTAAG AAAAATGGGA 1440

CCAACAATGA CTCGTCCATT TAGAGCAAAA ATTTTAAAAG TAATGGCACC ATTATCATTT 1500

GTATTAGCTT CATTAGCTAT ATATTGGGCA ATGTGGCCAA CAAGGGCTGA AGTTATTTTA 1560

ATCATTATAC TTGGATTACC AATCTACTTC TTCTATGAAT ATCGTATGAA TTGGCGTAAT 1620

ACAAAGAAAC AAATTGGTGG TAGCTTATGG ATTATTGTAT ATTTAATCGT GCTATCAATA 1680

CTGTCATTTA TAGGAAGCAA AGAATTTAAA GGCTTAAATA TGATTCACTA TCCATTTGAC 1740

TTTATCGTTA TTATTATTGT GGCACTTATC TTCTATTACA TCGGTACAAC GAGTTCATTT 1800

GAAAGCGTCT ATTTCCGTCG CGCAACACGA ATCAATACGA AGATGCGTGA GTCACTAAAT 1860

CACACACATT AACCAACCAT TGATTTCAAC ATCTTGGTTG GTTTTTTATT TTGAAAATCG 1980

GTTATAAATA ACTAACATAA CAAGATGATG ATCAGGCTGG GACATAAATC AATGTTCTAT 2040

GCTCTACGAA gTTATATTGG CAGTAGTTGA CTGAACX3AAA ATGCGCTTGT AACAAGCTTT 2100

TTTCGATTCT AGTCAGGGGC CCCAACACAG AGAATTTCGA AAAGAAATTC TACAGGCAAT 2160

GCAAGTTGGG GTGGGACGAC GATAAAGAAA TACTTTTTCT ATAGAAATTA GTATytCTTA 2220

TGCATGAGTT TTACTCATGT ATTCATATTT TTAAGTACAC ATTAGCTGTG GCTAATGTAT 2280

AAGAACCACT ACATAATAAA TCATTTGTGG CTCTTTATCA TTTCTGTCCC ACTCCCGTAG 2340

AAGTACATCA TATAATGCT6 AAAATGGTTT GAGTTAAAAC AGATATCAAG CTCGTCTGAT 2400

TCAGTCACAA AATTGTCTTG TTATACTTGT CACCTATCAT CTATAGACCG TGGTATGATT 2460

AAATTGGGGA TGATAAAGGA GGTTAATAAA TATGAAGATT AATACTACAG GTGGTCAAAT 2520

TCATGGTATT ACACAAGATG GTTTAGATAT CTTCTTAGGC ATTCCTTATG CAGAACCACC 2580

AGTTCATGAC AATCGCTTTA AACATTCTAC GTTAAAAACA CAATGGTCAG AGCCAATTGA 2640

TGCAACTGAA ATACAACCCA TCCCACCGCA ACCAGACAAC AAATTAGAAG ATTTTTTCTC 2700

CTCACAATCT ACAACTTTTA CTGAACATGA AGACTGTTTA TATCTAAATA TTTGGAAACA 2760

ACAT7ATGAT CAGACGAAGA AACCTGTCAT CATTTATTTT TATGGTGGTA GTTTTGAAAA 2820

TGGTCATGGT ACAGCCGAAC TCTATCAACC GGCACATTTA GTACAAAATA ACGACATTAT 2880

CGTTATTACA TGCAATTATC GTTTAGGCGC ATTAGGATAT TTAGACTGGT CATATTTTAA 2940

TAAAGATTTT CATTCCAATA ATGGCCTTTC AGATCAAATC AATGTCATAA AATGGGTGCA 3000

TCAATTTATT GAATCCTTCG GTGGCGACGC TAATAACATT ACTTTAATGG GTCAGTCTGC 3060

AGGCAGTATG AGCATTTTGA CTTTACTTAA AATACCTGAC ATTGAGCCAT ACTTCCATAA 3120

AGTQGTTCTA CTAAGTGGCG CACTACGATT AGACACCCTT GAGAGTGCAC GCAATAAAGC 3180

ACAACATTTC CAAAAAATGA TGCTCGATTA TTTAGATACA GATGATGTTA CATCATTATC 3240

GACAAATGAT ATTCTTATGC TGATGGCGAA gcTAAAACAA TCTCGAGGAC CTTCTAAAGG 3300

GCTTGATTTA ATATATGCGC CTATTAAAAC AGATTATATA CAAAATAATT ATCCAACAAC 3360

GAAACCAATT TTTGCATGTT ATACAAAAGA TGAAGGCGAT ATTTATATTA CTAGTGAACA 3420

GAAAAAATTA TCGCCGCAAC GCTTTATCGA CATTATGGAA TTAAATGATA TTCCTTTAAA 3480

ATACGAAGAT GTTCAGACGG CGAAGCAACA ATCTTTAGCG ATTACACATT GTTATTTCaA 3540

ACAGCCGATG aAGCAATTTT TACtnACtnACT CAATATACmA GATTCCAACC GCACCAACTA 3600

TGGCTT 3606

(2) INFORMATION FOR SEQ ID NO: 44:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 15109 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44:

GAAATTAAAA AAGCAATTGG nACAAGATGC AACAGTGTCA TTGTTTGATG AATTTGATAA 60 

AAAATTATAC ACTTACGGCG ATAACTGGGG TCGTGGTGGA GAAGTATTAT ATCAAGCATT 120 

TGGTTTGAAA ATGCAACsAG AACAACAAAA GTTAACTGCA AAAGCAGGTT GGGCTGAAGT 180 

GAAACAAGAA GAAATTGAAA AATATGCTGG TGATTACATT GTGAGTACAA GTGAAGGTAA 240 

ACCTACACCA GGATACGAAT CAACAAACAT GTGGaAGAAT TTGAAAGCTA CTAAAGAAGG 300 

ACATATTGTT AAAGTTGATG CTGGTACATA CTGGTACAAC GATCCTTATA CATTAGATTT 360 

CATGCGTAAA GATTTAAAAG AmAAATTAAT TAAAGCTGCA AAATAATTCA GCTATATAAG 420 

TTAGTGAAAT GAGAGTCTGA AACATATCAA TCTTTTGATA TTGTATTAGG CTCTTATTTT 480 

TATAGCTAGA AAGTTAGATA TTTGTATTTT TTTAAATAAT AAGTGCCGTT GTTATCGTTC 540 

AATTTAATTA ATGATAGATT AGTATTATTA TAGCTAAAGT AGTATACCTG AGAAAATAGC 600 

TCAATGTATC TCTTTATTAA TAAGTTATAT CATAATTATT TTAGTGCATA CTTTATGGAA 660 

GGGATATCAG GGAATGGCTT TCAATTAAAG AAGAGGTTTA AAAGGATTAC AACAGAATGT 720 

TATGATTTTG TAGAAAGATA TATAACAACG TTTTATAAAA ACATAATATT GTTAATGGAA 780 

AATGAAATGT AAGGGGGATT TCGAGTGACT AAGAAAGTTT ATTTTAACCA CGATGGTGGT 840 

GTAGATGATT TAGTATCTCT ATTTTTATTA TTACAAATGG AAAACGTTCA ATTGATAGGG 900 

GTaC5TACAA TTGGTGCTGA TTGTTATTTA GAGCCATCTT TGAGCGCATC AGTAAAAATT 960 

ATTAATCGTT TTTCAAATGA AGATATTCAA GTTGCGCCAT CATATGAACG AGGAAAAAAT 1020 

CCATTTCCTA AAGAATGGCG TATGCATGCC TTTTTTATGG ACGCATTGCC AATTTTAAAT 1080

GAGCCAGTCA AACATGTTGC TTCAAATGTG AGCQACAAAG AAGCCTTTGA AGACATTATT 1140 

CAAACTTTAA AGAGACAATC AGAAAAAGTA ACATTATTAT TTACAGGCCC GCTTACAGAT 1200 

TTAGCAAAAG CACTACAAAA AGATTCATCT ATCGTTCAGT ATATAGAAAA ATTAGTTTGG 1260

ATGGGTGGCA CCTTTTTACC AAAAGGAAAT GTTGAAGAAC CTGAGCATGA TGGTTCTGCA 1320 

GAATGGAATG CATATTGGGA TCCAGAAGCG GTTAAAATTG TTTTTGATAG CGATATAGA6 1380 

ATTGATATGG TTGCTTTAGA AAGTACGAAT CAAGTACCGC TAACC5TTAGA TGTTAGACAA 1440



GTACCACCAT TAACACACTT TATAACAAAT TCTACTTACT TTTTATGGGA TGTTTTAACG 1560

ACTGCTTATA TTGGTAACAA GGACTTGGTT CATTCAATTG AGAAAAAAGT CGATGTAATA 1620 

AGTTATGGAC CAAGTCAAGG TAAGACATTT GAGTGTAAAG ATGGGCGCAA AATTAATGTC 1680

ATAAATCATG TAGATAACAA CGCATTTTTT GATTATATAA CTGCACTTGC TAAAAAAGTA 1740 

AATTAACAGC TGTGTAGAAT AATTAAGGTT TTAATTTATA TAGAACAACT TATTGTAAAC 1800 

TTTTCATTTC TTAAAGTTTA CAATGGTGCT ATAATAATGG TCATGAAATA C3QAAAGGAAG 1860

TAAAAAATGA CAACAAAACA GTTAGTATAT ACAGCTTTAA TGACAGCGAT TATCGCTATT 1920 

TTAGGATTGG TACCGGTAAT TCCACTACCA TTTTCTTCAG TACCAATTGT ACTTCAAAAC 1980 

ATTGGTATTT TCTTAGCAGG TGCX3ATTTTA GGAOGTAAAT ATGGCACATT AAGTGTTATC 2040 

GTCTTTTTAT TATTAGTAGT TGCTGGCTTG CCATTGTTAr CAGGTGGTOG CGGTGGCATC 2100 

GGTGTATTCG CAGGTCCTTC AGCAGGGTTT TTACTATTAT ATCGAGTTGT AGCATTCATG 2160 

ATTGGGGCGA TTCGAGATAG ATTCATCAAT GAAATTAATT TCTGGATTTT ATTCGTTGGT 2220 

ATTTTAGTTT TTGGTGTTAT AGCATTAGAT GTTATTGGTA CATTGATTAT GGGCATGATT 2280 

ATTAACATAC CATTTACGAA AGCTATTTCA ATTTCATTAG CTTATTTGCC TGGTGATATA 2340 

TTAAAAGCAA TTGTAGCAAG TTTGATTGGT ACAGCTTTAC TTAATCACTC GCAGTTTCGT 2400 

CAAATTATGG GAATAAAATA ATCATATTTA AGATAGTAAA GTAATTGAAT AAGTTGCTTT 2460 

GAAATTTATA AAAGTGAAAG GAGTAGGTGT CAATGGCTAG TATAAGTATG TCAGATATAT 2520 

ATTGTAACGG CACTATATTT GAAAATGACG ACGAGCAGTT GATTTATTTA ACGCCTTCTT 2580 

TTCCACAACG ATACACAAGT AACACATGGA TATATAAAAA GACGCCTACC CAAGAGCGAT 2640 

GGCTGAAAGA CTTAGAACGT CAACATCAAT TACATACAAA TCAAGGTTCA AATCATTATG 2700 

CGTTTAGTTT CCCGGAAAAT GAACAACTTG ATAATCATTG GATGGCTATG TTTAAAGATA 2760 

TGAATTTTGA ACTAGGTATT ATGGAATTGT ATGCCATAGA AAGTGATGCG CTTGCCAATT 2820 

TGCCGCGTAA CTCTGACGTT GAAATTGCCA TCGTTGACGA GTCGCATATA GATGCCTATT 2880 

TAAAAGTTGC ATATCAGTTT AGTTTGCCAT TTGGAAAAGA CTATGCAGAT GCACATGAAG 2940 

AAATGGTAAG GGAACATTAT CAAAAAGATG TGATTAAACG CTTAGTAGCT TATTTAAATA 3000 

ATGAACCTAT TGGCGTTGTA GATGTCATTG AAAGTGAAAA TTACATTGAA TTAGATGGAT 3060

TTGGTGTATT AGAACAATTT CGGCACCAAG GAATTGGATC TACAATTCAA TCGTTGATAG 3120 

GTGAATACX3C CATATCAAAA AATCACAAAC CAATCATATT AGTTGCAGAT GGTGAAGATA 3180 

CAGCAAAAGA TATGTATGCA AAGCAAGGTT ATGTCTATCA ATCGTTTTGT TATCAAATAT 3240

TAAGCTGGTT TCGAGTAGAA ATCAACTTAC TGCTTTTTAA ATTGTTTTGA GCTACTTATA 3360 

CTTATAAAAA TAGTGCGTTT AAATTGTTGA TTCATGTAGA ATATCGTTCA TTATGACACA 3420 

CTATAATGAA TATGTTATTG TTCAGAATCA ATGATACGTT CTGGATGACT GTATATATTA 3480 

AAGCCACCAT TTCGAATAAA TCCAACTGCC GTAATATTTA GGTCATTAGC TAAGGTTACA 3540 

GCAAGCGTTG TCGGAGCTGA TTTAGATAAA ATGACGCCAA CACCAATTTT TGCGGCTTTA 3600 

ATTAAAATTT CTGATGAAAT ACGTCCACTA AAAATTAATA CTTTATCTCG GACAGTAATA 3660 

TGTCGCTGAA TACAAAATCC ATATAATTTA TCTAGAGCGT TATGTCTACC AATGTCTTGT 3720 

CGATGTACAA AAAATGTCAA ACCATCGCTT ATAGCAGCAT TATGTAAGCC ACCTGTTTCT 3780 

TGGTAAATAT GACTTGCACT TTGTAATCX3A OTCATCATGT TAATAATTTG CATTGGAGTT 3840 

AAAGTGATTT TAGACATAGA TGTTTTAGCG ATAGCAGCAT CATTTTGAAA ATAAAACTCA 3900 

CGACTCTTTC CGCAACAAGA TGCAATCATT CGTTTTGTGG AATATTGAAA GCGATOGCCT 3960

AAATCTTTAT TAAGTTCAAC ATGGGCAAAA CCTTTACTAT CATCAATCAG TACAGATTTT 4020 

AATTCATCTC GCTTTAAAAT GGCACCTTCC GAAGCCAGAA ATCCAATGAC TAACTCCTCA 4080 

AGGTTTGTTG GACTGCATAT AACAGTCGCA AATTCTTCAC CATTCACCAT AATTGTAAGT 4140

GGAAATTCTG TCACATATT6 ATCTGTTGTA TTGAATAATT TTCCATCTTC ATATCTAACA 4200 

ATTGGTTGAC CTAAAGATAC ATCTTTGTTC ATTATCTAAC CCCTTTAATT AGCTTAAACT 4260 

TTATTTTAAA GCAATTTGCT TAAAATTTTA ACATATTTGC TTAAGTTTGA AATTTGATTG 4320 

ATAAAAATTA ATAGCGAGCA ATCTGTTTGA TTTAAATTGA ATTCGAGAAT ATACATACTA 4380 

GGGCATCAAT TAATAAATAT CAATCTTATG CAAATTTGAC AATTGTTTGA ATCAATATAT 4440 

AAACAGGCAA CGGTTCTTTT CAAATATAAT AGTAAGTGTA TAATGAAAAT GTAAATATTA 4500

TTAAAAATGG GGGTTCACTC AATGAAATTG AAACGTTTAT TTGCTGTTGT GATTGCAATG 4560 

CTTTTAGTAT TAGCTGGTTG CTCTAATTCT AACGATAATA ATGAAAGTAA AAAAGATGAC 4620 

GCAGACAATG GTAAGAAACA AGAGATTCAA GTTGCAGCGG CAGCAAGTTT AACAGATGTA 4680 

ACCAAGAAAT TAGCTTCAGA ATTTAAAAAA GAGCATAAAA ATGCTGATAT TAAATTTAAC 4740 

TATGGTGGAT CAGGGGCATT AAGAAAACAA ATTGAATCAG GCGCACCTGT TGACGTATTT 4800

ATGTCTGCAA ATACTAAAGA TGTAGATGCA TTAAAA6ACA AGAATAAAGC GCATGATACA 4860 

TATAAATATG OGAAAAATAG TCTAGTATTA ATTGGTGATA AAGATTCAAA TTACACTTCA 4920 

GTAAAAGACT TAAAAGACAA TGATAAATTA GCATTAGGTG AAGTGAAAAC TGTACCAGCA 4980 

GGAAAATATG CGAAACAGTA TTTAGATAAC AATAACTTAT TTAAAGAAGT CXaUUWSTAAA 5040

CAAGGTTTTG TGTATAAAAC TGACTTATAT AAACAAAATA AAAAAATTGA TACTGTAAAA 5160

GTAATTAAAG AAGTAGAACT TAAGAAGCCA ATCACATACG AAGCTGGTGC TACATCAGAT 5220

AGTAAATTAG CAAAAGAGTG GATGGAATTC TTAAAATCAG ATAAAGCTAA AGAAATACTA 5280

AAAGAATACC ACTTTGCAGC ATAAGGAGTT GTAATCCATG CCTGACTTAA CACCTTTITG 5340

GATATCAATA CGAGTTGCTG TAATCAGTAC GATTATTGTA ACGGTTTTAG GTATTTTTAT 5400

ATCTAAATGG TTGTATCGTC GTAAGGGTTC GTGGGTTAAA GTATTGGAAA GTTTATTGAT 5460

ATTACCTATT GTTTTGCCGC CAACGGTATT AGGTTTTATT CTATTAATCA TCTTCTCGCC 5520

AAGAGGACCA ATCGGTCAAT TCTTTGCGAA TGTACTACAT TTACCTGTAG TGTTCACTTT 5580

GACAGGTGCT GTGATAGCAT CTGTCATTGT TAGTTTTCCA CTAATGTATC AACATACTGT 5640

GCAAGGCTTC AGAGGTATAG ACACGAAAAT GATTAATACA GCTAGAACGA TGGGAGCAAG 5700

TGAAACGAAA ATTTTCCTCA AATTAATTTT ACCATTAGCT AAACGCTCTA TTTTAGCAGG 5760

TATAATGATG AGTTTTGCTC GTGCATTAGG TGAGTTTGGT GCTACATTAA TGGTTGCAGG 5820

ATATATTCCA AATAAAACGA ATACACTACC TTTAGAAATA TACTTCTTAG TGGAACAAGG 5880

TAGAGAAAAT GAAGCGTGGT TATGGGTATT AGTGCTAGTC GCATTCTCTA TTGTGGTTAT 5940

ATCTACAATT AATTTATTGA ATAAAGATAA ATATAAGGAG GTCGACTAGA TGCTTAAAAT 6000

CAATGTQAAA TATCAATTAA AGAACACTTT AATTCGCATC AATATAGATG ATACTGAACC 6060

AAAAATTTAT GCAGTTCGTG GTCCATCTGG CATTGGTAAA ACTACTGTTT TAAATATGAT 6120

TGCCGGATTA CGTAAAGCAG ATGAAGCTAT TATCGAAGTG AATGGGCAAT TACTTACTGA 6180

TACGGCAAAA AACGTGAATG TTAAAATTCA ACAACGACGT ATTGGATATC 7X3TTTCAAGA 6240

CTACCAATTG TTTCCTAATA TGACGGTCTA TAAAAATATT ACTTTTATGG CTGAACCATC 6300

TGAACACATC GATCAATTAA TTCAAACTTT AAACATTGAT CATTTGATGA AACAATATCC 6360

TATGACATTG TCAGGTGGAG AGGCACAACG TGTAGCACTT GCACGTQCAC TTAGCACrAA 6420

ACCAGATTTA ATTTTATTAG ATGAACCTTT TTCTAGTTTG GATGATACTA CAAAAGATGA 6480

GAGTATTACA TTAGTTAAAC GTATTTTCAA CGAATGGCAA ATACCAATCA TATTTGTGAC 6540

ACATTCAAAC TATGAAGCAG AACAAATGGC TCATGAAATT ATTACAATTG GGTAATCATT 6600

TATTTGCCAT TAAAGAGTTT AGAACGTATT TAAAATTGTA GAAGTGAATG CTTCTATCAG 6660

CATTTTAATG ATGTTTTAAA CTCTTTTTTA GGGGCAGTTT TTTTQAGAGA CATTGACGCG 6720

CGTCATATAA TGAAAGTAAT GATAAAAAGA AAGGATAACT TAATGTGAGT CAAGAACGTT 6780

ATTCAAGGCA AATTTTATTT AAACAAATAG GTGAAATAGG TCAAAGCAAA ATAAATCAAA 6840

GAGCAGGCAT TGCCAAACTA ATCATTGTTG ATAGAGATTA TATTGAATTT AGTAATTTAC 6960

AAAGACAAAC ATTGTTTACT GAAGAAGATG CTTTGAAAAT GATGCCTAAG GTGGTTGCAG 7020 

CTAAAAAGCA TTTGCTAGCG TTACGTAGTG ATGTTGATAT TGATGATTAT ATTGCCCATG 7080 

TGGATTATTA TTTTTTGGAA ACACATGGAC AGGACGTTGA CGTTATTATT GATGCAACCG 7140 

ATAACTTTGA AACACXSACAA CTGATTAATG ATTTTGCATA TAAATATCGT ATACCTTGGA 7200 

TTTATGGTGG TGTTGTACAG AGTACATATA CAGAAGCTGC ATTTATACCT GGTAAAACAC 7260 

CTTGCTTTAA CTGTTTGGTA CCACAATTGC CAGCATTAAA TTTAACATGT GATACAGTAG 7320 

GGGTCATTCA ACCTGCCGTG ACGATGGCAA CAAGTTTACA ATTAAGAGAT GCGATGAAAG 7380 

TATTAACXX5A ACAACCAATT GACACAAAAA TAACTTATGG CGATATTTGG GAAGOTAGTC 7440 

ATTATTCATT TGGTTTCAGT AAAATGCAAC GTTCAGACTG TACAACTTGT GGAGATGTAC 7500 

CAAGTTATCC GTATTTAAAC AAGAATGAAC AACGTTATGC AACATTGTGT GGTAGAGACA 7560 

CTGTACAGTA TGAAAATGCA TCAATTACAC ACGACATTCT TGTTCAATTT TTAAAACAAC 7620 

ATCAGTTAAA TTATCGCAGT AATTCGTATA TGGTTATGTT TGAATTTAAA GGACACCGCA 7680 

TTGTTGCTTT TAAAGGTGGA AGGTTTTTAA TACATGGCAT GACACGCACA TCAGATGCCA 7740

CACATCTAAT GAATTTATTG TTTGGATAAA AAAAGATAAG ACAAAAGGAG TGTAATATTA 7800 

TGGGCGAACA TCAAAACGTT AAATTGAATC GTACAGTTAA AGCAGCCGTA CTAACGGTAT 7860 

CAGATACTAG AGACTTTGAT ACAGATAAAG GTGGTCAATG CGTGCXKTCAA CTATTACAAG 7920 

CAGATGACGT TGAAGTGAGT GACGCACATT ATACAATTGT GAAAGATGAA AAAGTAGCCA 7980 

TCACGACGCA GGTGAAGAAG TGGTTAGAAG AAGATATTGA TGTCATCATT ACX3ACTGGTG 8040 

GAACAGGTAT TGCACAACGT GATGTGACGA TTGAAGCAGT AAAACCACTT TTAACTAAAG 8100 

AGAtAGAAGG CTTTGGGGAA TTGTTTAGAT ATTTGAGTTA TGTTGAAGAT GTTGGCACGC 8160 

GTGCATTATT GTCTCGTGCT GTAGCAGGTA CAGTTAATAA TAAATTGATA TTTTCGATTC 8220 

CAGGATCAAC AGGCGCAGTT AAATTAGCAT TAGAAAAGCT CATTAAACCA GAATTAAATC 8280

ATCTGATTCA TGAGCTTACA AAATAATTTA TTGATTTGAT TGGCGTTGAA AATCTCCAGA 8340

TTTACCGCCA GACTTGCTTT CAAGGTAGGT TTCGCCAATA ATCATACCTT TATCAACTGC 8400

TTTCGTCATG TCGTAAATGG TTAAAGCCGT TGCTGATGCA GCGGTTAAAG CTTCCATTTC 8460 

AACACCGGTT TTGCCAGTTG TAGAGACAGT TGT7TGAATG TTTAAAGTAT AAAGGGGTGC 8520 

ATTTGTTTCA TCCCAGCTGA AGTGAACATC TATGCCAGTC AATGGTAATG GATQGCACAT 8580

CGGAATAAGT GTTGATGTAT TTTTGGCAGC CATAATACCA OCQATTTGAG CAGTGTTCAA 8640

AATGCTTGAA TGAGCGACAG CAGTTCTTTT TGTAATTTGT TTGTCTGATA CATCGACCAT 8760

TTTGGCGTGG CCTTGTTGAT TAATATGAGT AAACTCAGTC ATTTTACCCC TCCTAGTGCA 8820

TCTAGTATAT CATGAAAAAA TAAAAGTTTT GGAGATGATT TTTAATGGTA GTAGAAAAAA 8880

GAAACCCAAT CCCAGTTAAA GAAGCAATTC AACGTATCGT TAATCAGCAG AGTTCAATGC 8940

CGGCAATTAC GGTAGCACTT GAAAAAAGTC TAAATCATAT CTTAGCAGAA GATATTGTAG 9000

CTACTTATGA TATACCAAGG TTTGATAAAT CACCTTATGA TGGTTTTGCA ATTCGCAGTG 9060

TTGATTCACA AGGGGCAAGT GGTCAGAATC GCATTGAGTT TAAAGTGATT GATCATATTG 9120

GTGCAGGTTC AGTTTCTGAT AAATTAGTTG GGGATCACGA AGCGGTGC6T ATTATGACFG 9180

GAGCACAAAT ACCTAATGGC GCAGATGCTG TTGTTATGTT TGAACAAACG ATTQAACTAG 9240

AAGATACATT TACAATTCGT AAACCATTTT CAAAAAATGA AAATATATCT TTAAAAGGTG 9300

AAGAAACAAA GACA6GCGAT GTTGTTCTAA AAAAAGGACA AGTAATTAAT CCAGGGGCTA 9360

TCGCGGTCCT TGCAACATAT GGCTATGCAG AGGTTAAAGT TATTAAGCAA CCGAGTGTCG 9420

CTGTTATTGC AACAGGAAGC GAATTATTAG ATGTTAATGA TGTATTAGAA GATGGGAAAA 9480

TTCGTAACTC TAATGGCCCA ATGATTCGTG CCTTAGCAGA AAAATTAGGT CTTOAAGTTG 9540

GTATTTACAA AACACAAAAA GATGATTTAG ATAGTGGCAT CCAAGTCGTT AAAGAAGCTA 9600

TGGAAAAACA TGATATCGTT ATTACAACGG GCGGAGTTTC TGTTGGAGAT nTGACTATT 9660

TACCTGAGAT TTATAAGGCT GTAAAGGCGG AAGTGTTATT TAATAAAGTA GCAATGCGTC 9720

CTGGTAGCGT AACAACGGTT GCATTTGTAG ATGGaAAGTA TTTGTTTGGa TTATCTGGAA 9780

ATCCATCAGC TTGTTTTACA GGATTTGAAC TATTTGTGAA nCCAGCTGTT AAACATATGT 9840

GTGGCGCACT AGAAGTCTTC CCGCAAATAA TTAAAGCAAC ATTAATGGAA GATTTTACCA 9900

AGGGAAACCC ATTCACACGA TTTATACGTG CTAAAGCAAC GTTAACAAGT GCTGGAGCTA 9960

CTGTAGTACC TTCAGGATTC AATAAATCAG GTGCGGTTGT AGCGATTGCA CATGCTAACT 10020

GTATGGTCAT GTTACCAGGA GGGTCACGTG GTTTTAAAGC GGGGCATACA GTAGATATTA 10080

TATTGACTGA ATCTGACGCT GCTGAAGAGG AACTTCTTTT ATGATTTTAC AAATTGTAGG 10140

TTACAAAAAG TCTGGTAAGA CAACATTGAT GAGGCATATT GTCTCTTTCT TAAAGTCACA 10200

TGGTTATACA GTTGCTACTA TTAAACATCA TGGGCATGGT AAGGAAGATA TTCAATTACA 10260

GGATTCAGAC GTCGATCACA TGAAGCATTT TGAAGCGGGG GCAGATCAAA GTATTGTACA 10320

AGGTTTTCAA TATCAGCAAA CTGTAACACG TGTAGATAAT CAAAATCTTA CTCAAATTAT 10380

TGAAAAATCT GTTACAATTG ACACCAATAT CGTATTAGTT GAAGGCTTTA AAAATGCTGA 10440

GAATGTTTGT TATAGCATTA ATGTAAGGGA GCATGAAGAT TTTACAGCAT TTGAGCAATG 10560

GTTATTAAAT AAAATTAAAA ATGATTGTGA TACACAATTA ACATAGAGGA TTGAAATGAA 10620

TGAAACAATT TGAAATCGTG ACAGAACCGA TACAAACAGA ACAATATCGT GAATTCACTA 10680

TAAATGAATA TCAAGGTGCA GTAGTTGTTT TTACCGGTCA TGTTCGCGAA TGGACTAAAG 10740

GCGTCAAAAC GGAATATTTA GAATATGAAG CGTATATTCC AATGGCTGAA AAGAAATTGG 10800

CACAAATTGG AGATGAAATA AATGAAAAAT GGCCTGGAAC GATAACGAGT ATTGTTCATA 10860

GAATAGGGCC ATTACAAATT TCAGATATCG CTGTATTAAT TGCGGTTTCT TCACCGCATC 10920

GTAAAGATGC CTATCGAGCA AATGAATATG CAATTGAGCG TATAAAAGAA ATTGTTCCGA 10980

TTTGGAAAAA AGAAATTTGG GAAGATGGTT CAAAATGGCA AGGGCATCAA AAAGGGAATT 11040

ATGAAGAAGC AAAGAGGGAG GAATAAGAGA GATGAAGGTA CTTTACTTCG CAGAAATTAA 11100

AGATATA7TA CAAAAAGCAC AGGAAGATAT TGTGCTTGAA CAAGCATTGA CTGTACAACA 11160

ATTTGAAGAT TTATTGTTTG AACXSTTATCC GCAAATCAAT AATAAAAAGT TTCAAGTTGC 11220

TGTAAATGAG GAATTTGTAC AAAAATCGGA TTTCATTCAA CCTAATGATA CTGTTGCATT 11280

AATTCCACCG GTTAGTGGAG GTTAAGGGAG CATGAAAGCA ATAATTCTTG CAGGTGGTCA 11340

TTCAGTX3CGA TTTGGTAAGC CCAAAGCTTT TGCGGAAGTG AACGGTGAGA CCTTTTATAG 11400

TAGAGTAATT AAGACATTAG AATCAACAAA TATGTTCAAT GAAATTATTA TTAGTACAAA 11460

TGCGCAATTG GCAACGCAAT TTAAATATCC AAATGTTGTT ATAGATGATG AGAATCATAA 11520

TGATAAAGGT CCATTAGCAG GAATTTATAC AATCATGAAG CAACATCCTG AAGAAGAATT 11580

GTTTTTTGTC GTTTCTGTTG ATACACCAAT GATTACTGGT AAAGCTGTAA GCACGTTGTA 11640

TCAGTTTTTA GTTTCTCATC TTATTGAAAA TCATTTAGAT GTCGCAGCTT TTAAAGAAGA 11700

TGGACGTTTT ATTCCAACAA TTGCATTTTA TAGTCCGAAT GCATTAGGCG CTATAACTAA 11760

AGCACTACAT TCTGATAATT ACAGTTTTAA AAATGTATAT CATGAATTAT CAACGGATTA 11820

TTTGGATGTA AGGGATGTAG ATGCGCCCTC ATATTGGTAC AAAAATATAA ATTATCAGCA 11880

TGATTTGGAC GCTTTAATTC AAAAATTGTA AGCTGTTAGG AGGTCCACAA ATGGTAGAAC 11940

AAATAAAAGA TAAACTAGGA CX3TCCCATCC GTGACTTACG GTTATCTGTG ACAGATCGGT 12000

GTAACTTTAG GTGTGATTAT TGCATGCCTA AAGAGGTATT TGGAGATGAT TTCGTATTTT 12060

TACCTAAAAA TGAACTTTTA ACGTTTGATG AAATGGCTAG AATCGCTAAG GTATATGCAG 12120

AATTAGGTGT AAAAAAAATA CGCATTACAG GTGGAGAACC ATTGATGCGA CGGGATTTAG 12180

ATGTACTTAT AGCTAAATTA AATCAAATCG ATGGTATTGA AGATATTGGT TTGACTACAA 12240

ATGTCAGTTT GGATGCTATT GATGATACGC TATTTCAATC AATCAATAAT CGTAATATTA 12360

AAGCGACTAC GATTTTAGAA CAAATTGATT ACGCGACGTC TATTGGTTTG AATGTAAAAG 12420 

TAAATGTTGT TATACAAAAA GGTATTAACG ATGATCAAAT CATACCAATG CTTGAATATT 12480 

TTAAAGATAA ACATATAGAG ATTCGATTTA TAGAATTTAT GGATGTTGGT AATGATAATG 12540 

GATGGGATTT CAGTAAAGTT GTAACTAAAG ATGAAATGCT TACAATGATA GAGCAGCACT 12600 

TTGAAATCGA TCCTGTAGAA CCAAAATATT TTGGGGAAGT AGCAAAATAT TATCGCCATA 12660 

AGGATAATGG TGTTCAATTT GGTTTGATTA CAAGTGTTTC ACAATCATTT TGTTCTACAT 12720 

GTACACGCGC AAGGCTGTCA TCAGATGGGA AGTTTTACGG ATGTTTATTT GCAACTGTCG 12780 

ATGGATTTAA CGTTAAAGCG TTTATTCGTT CTGGCGTGAC CGACGAAGAA TTAAAAGAAC 12840 

AATTTAAAGC TTTATGGCAA ATAAGAGATG ATCGATATTC AGATGAGAGA ACTGCTCAAA 12900 

CAG7TGCCAA TCGTCAACGT AAAAAGATAA ACATGAATTA TATTGGTGGT TAATGTGTAG 12960 

GGACCACTAC ATATTAAATC ATTAGAGATG TTTTAATATT TCTGTCTTAC TCCCTAAAAT 13020 

ACAATATTAT TTATTAAAGT AAAAACGGTC ATATCTATGC CAGATTTAAT AGAAATGATC 13080 

GTTTTTAAAG TTTTTACAAG TTGGCGGGGC CCCAACACAG AAGCTGACAG AAAGTCAGCT 13140 

TACAATAATG TGCAAGTTGG CGGGGCCCCA ACATAGAGAA TTTCAAAAAG AAATTCTACA 13200 

GACAATGCAA GTTGGGGAAC GGGGCCCCAA CACAGAAGGT GACGAAAAGT CAGCATACAA 13260 

TAATGTGCAA OTTGGCGGGG CCCCAACATA GAGAATTTCA AAAGAAATTC TACAGACAAT 13320 

GCAAGTTOGG GATCAACGAA ATAAATTTTA TGAGAATATC ATTTCTATCC CACTCTTAAG 13380 

AATCACTACA TAATAAATCT TTAGTGGTTC TTTAACATTG ATGTCACACT CCATXXICATT 13440 

GAGTTGTAAT ATATCTTTTT TAGGTATAAA TGTTGTCGAA TAAACAACAA GTTGTCCAAA 13500 

AGATAStAAAT CTAAACAAGA TATAGCCAGC AATTTAATAT TTGTAATAGA TAAAATGCTA 13560 

AGTTTGATAT ATAATAAATT TAAGTAATTG TATAATAATA TGAATTACAA ACATCTAAGA 13620 

AGAAACATAG GAGGCATCAT ATTATGAGTA ATAAAGTTCA ACGTTTTATA GAAGCAGAAA 13680 

GGGAGTTAAG TCAGTTAAAG CACTGGTTAA AAACAACACA TAAGATTTCA ATTGAAGAAT 13740 

TTGTAGTCCT TTTTAAAGTG TATGAAGCTG AAAAGATTAG CGGTAAAGAA TTGAGGGATm 13800 

CATTACATTT TGAAATGCTA TGGGATACAA GTAAAATCGA TGTGATTATC CGTAAAaTCT 13860

ATAAAAAAGA GCTTATTTCT AAATTGCGTT CTGAAACGGA TGAAAGACAA GTATTCTATT 13920 

TCTATAGTAC TTCTCAAAAG AAATTGTTAG ATAAAATTAC TAAAGAAATA GAAGTGTTAA 13980 

GCGTTACAAA CTAAAAACTT aAAAAgcaTG CCAATCTCTA TTCATCATAA TTGCGTCTTG 14040

GTTCATGGCA TTTCTAGTTA CATGACGTCC ATGAATTAAG AAGTAAACAA GCATAGTAAT 14160

GATTGCTAAA GCGGCCATAA AGCCGAAGAT TTCACTATAT GAAAACATAT GAGTAAATAA 14220

CCCAAGGAAT GATGGACCGA AGCCGACACC TGCATCTAGA CCAACGTAAA AAGTAGATGT 14280

CGCGATACCA TATTTAATCG GGGGTGAGAC TTTTATCGCA ATAGATTGCA TTGCAGATGA 14340

TAAATTTCCA TACCCTAAAC CTAGGCAAGC ACCAGCAAGT AATATTAACC AGCTTTGATA 14400

GCTTGAAATT AAGCATACAA ATGAAAGGAA AAGCATGATA AATGCTGGGT AGACAATAAT 14460

ATTTTCATTT TTATCATCCA TCAATCTACC AGCAATAGGT CTAGTAATTA ACGATGCTAT 14520

AGCATAGCAA ATAAAGAAAT AGCTTGCTGC AGTGACTAGG TGTCGCTCTA AAGCAAATGC 14580

TTGTAAATAA GTTAGGATGG ACGCATAGGT AACGCCAATT AAAAGCATAA TTACAGCAAC 14640

AGGAATGGCC TCTTTTGCAA TAAATTGATG AATACTAAAT CTTGGTTTAT CAATGACATT 14700

AGTTTCAGTT TTGTTATTTG TTACTTCGAA ATCAACTTTT ATAAATAATG AGATAATGAG 14760

TCCGAGTATG CCTAATATGA CACAAATAAT AAACAGTAAG TCAATTGCGT ATTTTGTAAT 14820

AAGTAACATG CCTAGAAATG GGCCAATCGC TGTACCTAAT ACTAAACTTA AGGAAAATAA 14880

ACTGATGCCT TCACTTTTTC TATTAACAGG GGTAACGTAT GCCGCAATAG TACCTGTTGC 14940

AGTT6TCACA ACTGCAGTTG CX3ATACCGTT TATGAGACGT ACAAAGATTA AAAAAGCTAA 15000

AGATCCATCA ATAAAATAAA GTAATTGCGT GATAATTAAA GCAATTAAAC CAATAAATAA 15060

TAATCGTTTA GGTCCrATTT SATTTACAAA TTTACCTGTA GCAAATCGA 15109



(2) INFORMATION FOR SEQ ID NO: 45:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 9072 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45:



GAGAGTCAAT GGCAAGAAGA ATATAAATAT TTGAGA6CGT TAATCTTTAA TGAAACAGAA 60

TTAGAGGAAG CGTATAAATG GATGCATCCT TGTTACACGT TGAATAATAA AAATGTAGTA 120

CTTATCCATG GCTTCAAAAA TTATGTTGCA CTATTATTTC ATAAAGGTGC CATTTTQGAG 180

GATAAATATC ATACACTCAT TCAACAGACT GAAAAGGTGC AAGCAGCTCG TCAGTTACGA 240

TTTGAAAATT TAACAGAGAT TCAAGCACGT ACCGAAGAAA TTAAATATTA TCTAGCCGAA 300

GCAATTAAAG CTGAAAAAGC TG6TAAAAAA GTTGAAATGA AGAAAACAGA GGAATATGTT 360

AAATTAACGC CAGGCAGACA ACATCAATAT ATATATCATA TTGGACAAGC TAAACXSCAgT 480

GgAACAAGAC AAAAGCGTGT TGAAAAGTAT ATTAACCAAA TACTAGAAGG TAAAGGGATG 540 

CATGATAAGT AATTAATGAG TAAAGCATAC CGGTTATACA ACAACATACA AOATGACACG 600 

AAACAACCAA TGGCTCATGC TGTTGGTTGT TTTTTTAGGT GTGTCTGTCA TGGGCAACAC 660 

TTTGACGTTG GAATTCCGTT ACAGGCTTGG GAGTAGAAAA TGTTAGCAAA AGGCAAGGGT 720 

GTCTACAATG AATGATGAAG ATATTAAAAT ATAAGGATGA CTTTGTGAGT GGCGGATGGG 780 

CGGTTGTCCG TCTGTAACAA TGGATGCGTG TGCATTATTA CAAAAATTCG ACTTTTGTAA 840 

TAATATTTCA CATTTTCGAC ACTTTTTTGC TATAAAACAA CCAATTGAGC GATAATAAAT 900 

TCGCTTTTAA AAAATATGAG TTATCTATTT AGTTGCCAAA GATAAAATAA TAATGTTTAA 960 

TAACATCATA TAGAGTATGT TAGTTTTAAA TGTCGAATAT ACGAATGTGc AAACAAAGTA 1020 

ATCGGTAGAA ATTCAACATA CATAGCGCCG TTTACTGTTA AGTATTCACA TTACAGATGA 1080 

AAAATATAAA ATTCTACATA ATCAAGACCA TGATGTGTAC TTGTTrAACT TATGACTCTA 1140 

TTTGTTTAAC AATTGCGATA ATGGTCTTTT TATTTTATGC GTATCATTCG TCATATTTTT 1200 

TATGAGGAAG GAGAAATGAT TATGTTAAGT ATTAAGCATT TAACGAAAAT TTATTCTGGT 1260 

AATAAAAAGG CAGTAGATGA CATCTCTTTA GATATTCAAT CTGGGGAATT TATCGCATTT 1320 

ATTGGAACCA GTGGAAGTGG CAAAACGACT GCTTTAAGAA TGATAAACCG TATGATTGAA 1380 

GCGACAGAAG GACAAATTGA AATTGATGGT AAAGATGTTC GGAGTATGAA TCCTGTCGAA 1440

TTGCGTAGAA ATATrGGCTA TGTTATTCAA CAAATTGGCT TAATGCCTCA TATGACGATT 1500 

AAAGAGAATA TTGTGTTGGT ACCCAAATTG TTGAAATGGA CTAAAGAGGA AAAGGATAAA 1560 

CGTGCAAAGG AATTAATTAA ACTTGTGGAT TTACC!GGAGT CATTTTTAGA GCGTTATCCA 1620

GCAQftACTAT CAGGTGGGCA ACAACAACGT ATCGGTGTTG TAAGAGCACT TGCGGCCGAA 1680 

CAAGATATTA TTTTAATGGA TGAACCTTTT GGTGCATTGG ATCCTATTAC GAGAGATACG 1740 

TTACAAGATT TAGTTAAAAC GTTACAACGA AAATTAGGCA AGACGTTTAT CTTTGTAACA 1800 

CATGATATGG ATGAAGOGAT TAAATTAGCA OACAAAATTT GTATTATGTC AGAAGGTAAG 1860 

GTGGTGCAAT TTGATACXX:C AGACAATATT TTAAGACATC CCGCAAATGA TTTTGTACGT 1920 

GATTTTATAG GACAAAATAG ACTGATTCAA GACCGTCCCA ATQACAAGAC TGTAGAAGGT 1980 

GTAATGATTA AACCAATCAC GATACAAGCA GAAGCAACAC TGAATGACGC CGTTCATATT 2040 

ATGAGACAAA AACGTGTTGA TACTATTTTT GTAGTAGATA GTAATAACCA TTTACTAGGT 2100 

TTCTTAGACA TTGAAGATAT AAATCAGGGT ATACGTGGAC ACAAAAGTTT ACGAGACACC 2160 
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ATTTTAAAAA GAAACGTTAG GAATGTACCT GTCGTAGATG ATCAACAGCG TTTAGTAGGA 2280 

CTGATTACGC GTGCCAATGT TGTTOATATT GTATATGACA CGATTTGGGG CGATAGTGAG 2340 

GATACAGTGC AAACAGAACA TGTGGGGGAA GACAcTGCGT CCTCAAAAGT GCATGAGCAA 2400

CACACTACTA ATGTCAAAGT ACGTGACATA GGAGATGATA AATCATGATT GAGTTCCTAC 2460 

ATGAACATGG TGGACAGTTG ATGTCX5AAAA CACTGGAACA TTTCTATATT TCTATAGTGG 2520

CATTATTACT TGCCATCATT GTTGCAGTAC CTATAGGCAT TITATTATCA AAAACAAAGC 2580 

GAACTGCCAA TATTGTATTA ACTGTGGCAG GTGTCTTACA AACTATTCCA ACACTAGCTG 2640 

TACTTGCTAT TATGATACCG ATTTTTGGTG TTGGTAAAAC GCCTGCAATT GTAGCGCTAT 2700 

TTATTTATGT ATTATTACCT ATTTTAAATA ACACGGTACT CGGTGTTCAA AATATTGATA 2760 

GCAACATTAA AGAAGCTGGA AAAAGTATGG GAATGACACA ATTTCAATTG ATGAAGGATG 2820 

TTGAATTGCC GTTAGCATTG CCGCTTATCA TTGGTQGCAT TCGTTT6TCA TCTGTGTATG 2880 

TAATTAGTTG GGCTACACTT GCAAGTTATG TAGGTGCX5GG TGGATTAGGT GATTTCATTT 2940 

TCAATGGTTT AAATTTATAT GATCCACTGA TGATTGTAAC TGCAACXX3TA CTCGTTACTG 3000 

CACTAGCATT AGGTIGTTGAT GCCTTATTAG CTTTAGTTGA AAAATGGGTA GTTCCCAAAG 3060

GCTTAAAAGT ATCTGGATAA TTAGGAGGCT AAGATAATGA AGAAAATTAA ATATATACTT 3120 

GTCGTGTTTG TCTTATCGCT TACCGTATTA TCTGGATGTA GTTTGCCCGG ACTAGGTAGT 3180 

AAGAGCACGA AAAATGATGT CAAAATTACA GCATTATCAA CAAGCGAATC GCAAATTATT 3240

TCACATATGT TACGGTTGTT AATAGAGCAT GATACACACG GTAAGATAAA GCCAACATTA 3300 

GTAAATAATT TAGGGTCAAG TACGATTCAA CATAATGCCT TAATTAATGG GGATGCTAAT 3360 

ATATCAGGTG TTAGATATAA TGGCACAGAT TTAACGGGAG CTTTGAAGGA AGCACCAATT 3420 

AAAAATCCTA AGAAAGCAAT GATAGCAACA CAACAAGGAT TTAAAAAGAA ATTTGATCAA 3480 

ACGtTTTTTG ATTCGTATGG TTTTGCGAAT ACGTATGCAT TCATGGTAAC GAAGGAAACC 3540 

GCTAAAAAAT ATCATTTAGA GACAGTTTCA GATTTAGCAA AGCATAGTAA AGATTTACGT 3600 

TTAG6TATGG ATAGTTCATG GATGAATCX5T AAAGGCGATG GCTATGAAGG ATTTAAAAAA 3660 

GAGTATGGTT TTGACTTTGG TACAGTGAGA CCAATGCAAA TAGGTCTAGT CTACGACGCA 3720 

TTAAACTCAG AGAAGTTAGA CGTTGCATTA GGTTATTCTA CAGATGGTCG AATTGCGGCG 3780 

TATGATTTGA AAGTACTTAA AGATGATAAA CAATTTTTCC CACCTTATGC TGCGAGTGCT 3840 

GTTGCAACAA ATGAATTATT ACGGCAACAC CCAGAACTTA AAACQACGAT TAATAAGTTG 3900 

ACAGGAAAGA TTTCGACTTC AGAGATGCAA CGCTTGAATT ATGAAGCGGA TGGTAAAGGT 3960 
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AAAGGTGGTC ATAAGTAATG GAAGGTAATT 
TATTACAGCA ATTATTCAAT TATTATGTTA 4080

CGAACTTTGG TTATCTATGG GATTTATTTT TCAAACACTT ATTAATGTCT GTCTATGGTG 4140

TGCTGTTTGC AgCTTTAATT GGTATTCCAT TGGGAATCTT GCTTGCaAGA TACACAAAAC 4200

TTTCTGGATT TGTAATTACA ATTGCAAATA TAATTCAAAC AGTTCCAGTC ATTGCAATGT 4260

TAGCTATTTT AATGTTAGTC ATGGGCTTAG GTTCAGAAAC AGTAGTTTTA ACAGTGTTTT 4320

TATATGCX3TT ACTTCCAATT ATAAAAAACA CTTATACTGG TATAGCTAGT GTTGATGCGA 4380

ATATTAAGGA TGCTGGCAAA GGTATGGGAA TGACACGCAA TCAAGT6CTA CGAATGATTG 4440

AATTACCGTT ATCTGTTTCG GTTATTATCG GTGGCATTCG TATTGCCTTG GTTGTTGCGA 4500

TAGGTGTTGT TGCCGTTGGA TCATTTATAG GAGCACCTAC GCTTGGTGAC ATTGTGATTC 4560

GTGGTACAAA TGCGACX3GAT GGCACAACGT TTATTTTAGC AGGTGCGATT CCGATTGCTA 4620

TCATTGCAAT CGTCATTGAT GTACTATTAA GATTTTTAGA AAAACGATTA GACCCAACAA 4680

CACGACATCX3 TAAAAATCAA TCTAATCATC GGCCGCAAAG TATTAATATG TAATAGTAGA 4740

AGATGTTTAT AATTTAGCXxA TTTCGTTTCA TGATTTATAA AAAATGAGGC TACTCAAGGA 4800

GCTCAAATAA TCTTTGAGTA GCCTTTTTAT AGGTTGTGTT TGTATGCGTT TACACTAAAA 4860

TAGCAATTAT TATCATGAAA GTTTTTGGAT AAAAAGCGTT AATTATTGTA AAAATACTAA 4920

AAAATGAGAT GTTTTATTTA TAATTTTCTG CAAATTTATG ATATTGTTTC TTAATATATC 4980

ATATTAAAAA TTTGTTTTTC TTAAACATAG GAGGCTTATC TAATTCATGG ACACATCAAA 5040

ACAATTTAGA GGTGACAACC GATTGCTTTT GGGTATCGTT TTAGGGGTTA TTACCTTTTG 5100

GCTATTCGCG CAGTCACTTG TTAATCTTGT TGTCCCATTA CAATCAACAT ATAGTAGTGA 5160

CGTTGGAACG ATAAATATCX5 CTGTTAGCTT ATCTGCCTTA TTTGCTGGTT TGTTTATCGT 5220

AGGTCCTGGT GATGTTGCTG ATAAATTTGG TCGCGTCAAA ATTACTTATG TAGGATTGAT 5280

ATTAAATGTT GTAGGTTCAT TACTCATCAT CATTACACCT TTGCCAGCAT TTTTAATTAT 5340

AGGTAGAATA ATTCAAGGTT TGTCTGCAGC ATGTATTATG CCATCAACAC TTGCTATTAT 5400

TAACGAATAT TATATTGGTA CAAGAAGACA ACGTGCCTTA AGCTATTGGT CTATTGGTTC 5460

TTGGGGTGGT AGTGGTATTT GTACGTTGTT TGGTGGCTTA ATGGCTACAT ATATAGGTTG 5520

GCGTTCAATA TTTGTTGTTT CAATTCTATT AACATTATTA GCAATGTACT TAATCAAACA 5580

TGCACCTGAG ACTAAAGCAG AACCAATCAA AGGTATGAAA GCAGAAGCTA AAAAGTTTGA 5640

CGTTATTGGT TTAGTCATTT TAGTAGTGAC GATGTTAAGT TTAAATGTAA TCATCACACA 5700

GACGTCTCAT TTTGGTTTAG TTTCACCGTT AATTCTAGGT TTAATTGTTG TGTTTATCTG 5760
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CAATCACGTG ATATTACGGT CATTATTAAG ATTGAAATGT AATAAATAAA GAACAGCAGT 7680 

AAGGTACTTT CAAATTGAAA TGATCTTGGT GCTGTTTTTC TTGATTGATC TTCGTCATAA 7740 

TTCAGATTTG TCATAGGcTA CGACATACTA TTAGTATTTA CTAGACAGTT TTTACGACGA 7800 

CACTTTGAAA AATTTTGAGG CAAATCATTT GGAAGTCTCA CGTGAATTTT GTAAACTCAT 7860 

CAAGCAAGTA ATTATATTAA AAAGACAAAT AGAGAAAAGG TGTTTATAAT GAGTAAAATT 7920 

TTTGTAACTG GTGCAACGGG CCTTATTGGC ATTAAATTAG TTCAAAGACT AAAAGAAGAG 7980 

GGGCATGAGG TTGCTGGTTT TACTACATCT GAGAATGGTC AACAAAAGCT AGCTGCTGTT 8040

AATGTAAAAG CATATATTGG TGATATATTA AAAGCTGATA CTATTGATCA AGCGTTAGCA 8100 

GATTTTAAAC CAGAAATCAT TATCAATCAA ATTACGGATT TAAAAAATGT TGATATGGCA 8160 

GCAAATACGA AAGTACGTAT TGAAGGTTCT AAAAACCTAA TTGATGCGGC GAAAAAGCAT 8220 

GACGTTAAGA AAGTAATTGC CCAAAGTATT GCCTTTATGT ATGAACCTGG CGAAGGATTA 8280 

GCAAATGAGG AAACTTCACT TGATTTTAAC TCAACTGGCG ATAGAAAAGT AACGGTTGAT 8340 

GGTGTGGTTG GTTTAGAAGA AGAAACGGCT CGTATGQATG AATACGTTGT TTTACXSTTTr 8400 

GGCTGGTTAT ATGGCCCAGG TACTTGGTAC GGAAAAGATG GCATGATTTA TAATCAATTT 8460

ATGGATGGTC AAGTGACACT TTCAGATGGC GTAACATCAT TTGTGCATCT TGATGATGCA 8520 

GTTGAAACAT CTATTCAAGC TATTCATTTT GAAAATGGTA TCTATAATGT AGCAGATGAT 8580 

GCACCTGTTA AAGGTTCTGA ATTTGCAGAA TGGTATAAAG AACAACTTGG TGTTGAACCA 8640 

AATATTGATA TTCAACCTGC GCAACCATTT GAACGTGGCG TAAGCAATGA GAAGTTTAAA 8700 

GCGCAAGGTG GTACTCTGAT TTATCAAACT TGGAAAGATG GCATGAATCC AATTAAATAA 8760 

TAATTTATCC GTTTAATATA CAAAGAATAA AGACTTGGTC GAATCGTGGA TGATATATTA 8820 

TCAAACGCAC GGCTCGAACA AGTCTTTTTT ATTATGTCTT CGTTATCTTT GTATGAAGGA 8880 

ATAACAGAAT TACAATTAAT GTACTGAATA ATGCAATTAA TGTTGTGATT AGTGCTAATT 8940 

TAATTTCTAT TGGTAGCCAA GTCAGTACAA AAGACCAATT ATTGCTACCG AGAATGAGAT 9000 

ATGGTAATGC ATATAATATG AGCGCTAAAG CX3ATACATAT ACATAATGAT AACCAACTCA 9060 

ATACAGCAAT CC 9072 
(2) INFORMATION FOR SEQ ID NO: 46: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16826 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
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(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

GTGGAACAGC TGTAACTATA TCATTTCTTT CAACATTTAT TGGGAAAATG TTAGCTACAT 60 

TTCTATATCC GATTAATAAT GTAGTACTTT CATATATnTC TGTAAATQAA AGTGACAATA 120 

TAAAGAAGCA ATATTTGaAA ACTAATCTAA TTGCTATAGC TGCCCTATGT TTAGTCATGA 180 

TTATATGTTA TCCAATTACA ATAATTATTG TCTCTTTACT GTATAACATT GATTCAAGTT 240 

TATATTCGAA GTTTATTATT TTAGGTAATA TAGGTGTTTT ATTCAATGCA GTGAGTATTA 300 

TGATCCAAAC TTTAAATACA AAACACQCAT CAATAACATT ACAAGCGAAT TATATGACGC 360 

TTCACACGAT TACATTTATA TTCATAACTA TTTTAATGAC AATTGCGTTT GGTCTAAATG 420 

GATTCTTTTG GACAACGCTG TTCAGCAACA TTATTAAGTA TGTGATTTTA AATATTATAG 480

GITTAAAGTC TAAATTCATT AATAAAAAGG ACGTCGATTA GATGAGTGAA AAAAAGATTT 540 

TGATTTTATG TCAGTATTTT TATCCGGAAT ATGTATCTTC T6CGACGTTA CCAACTCAAT 600 

TGGCGGAAGA TTTAAOTGCG AATCACATTA ATGTCGATGT CATGTGTGGA TGGCCATATG 660 

AATATA6TAA TCATAAACAG GTTTCTAAAA CCGA6ATGCA TCGTGGTATT CGCATTCGAC 720 

GTCTCAAGTA TTCGAGGTTT AATAACAAAA GTAAGGTTGG AAGGATCATC AATTTCTTTA 780 

GTTTATTTTC AAAATTCGTG ATTAATATAC CTAAAAT6TT GAAATATGAT CAGATTCTTG 840 

TTTACTCTAA TCCACCAATC TTGCCATTAA TACCAGACGT TTTACACAGA CTGCTTAAGA 900 

AAAAATATTC TTTTGTGGTG TATGATATAG CACCTGATAA TGOGATTAAG ACAGGTGCAA 960

CTCGTCCAGG TAGCATGATT GATAAGCTGA TGCGTTACAT TAATAGACAT GTCTACAAGA 1020 

ATGCTGAAAA TGTCATTGTC CTTGGTACGG AAATGAAAAA CTACTTACTA AATCATCAAA 1080 

TTTCTAAAAA TGCTGACAAT ATCCATGTGA TTCCTAACTG GTATGACATG CGTCAATTAC 1140

AAGJSPUITCG TATCTATAAT GACACATTTA AAGCTTACCG TGAGCAATAC GACAAAATTT 1200 

TATTGTATAG CGGTAATATG GGGCAGTTAC AGGATATGGA GACACTTATC TCATTTTTAA 1260 

AATTAAATAA GGATCAGTCT CAAACGTTAA CAATACTTTG TGGTCATGGT AAGAAATTTG 1320 

CAGATGTCAA AACGGCAATA GaAGACCATC GTATTGAAAA TGTTAAAATG TTTGAGTTTT 1380 

TAACAGGTAC AGACTATGCT GACGTATTAA AAATTGCGGA TGTATGTATT GCATCGCTGA 1440

TTAAAGAAGG CGTCGGTTTA GGCGTGCCGA GCAAGAATTA TGGCTATCTT GCAGCTAAGA 1500 

AAGCGTTGGT ACTCATCATG GATAAGCAAT CTGATATCGT TCAACATGTT GAACAATATG 1560 

ATGCGGGTAT CCAAATTGAT AATGGCGATG CACATGCCAT TTATAACTTC ATCAACACTC 1620 

ACTCGAGTAA GGAATTGCAC GAGATGGGTG AGCGCGCACA TCAACTGTTT AAAGATAAAT 1680 
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AAGCGATTAT TCGATGTAGT GAGTTCAATA TATGGTTTAG TAGTTTTAAG TCCGATTCTG 1800 

TTAATTACAG CATTACTAAT TAAAATGGAa TCACCTGGAC CAGCCATTTT CAAACAAAAA 1860 

AGACCGACGA TTAATAATGA ATTGTTTAAT ATTTATAAGT TTAGATCAAT GAAAATAGAC 1920

ACACCTAATG TTGCAACTGA TTTAATGGAT TCAACATCGT ATATAACAAA GACAGGGAAG 1980 

GTCATTCGTA AGACCTCTAT TGATGAATTG CCACAATTAT TGAATGTTTT AAAAGGAGAA 2040 

ATGTCAATTG TAGGTCCTAG ACCAGCGCTT TATAATCAAT ACGAATTAAT CGAAAAACGT 2100 

ACAAAAGCGA ACGTGCATAC GATTAGACCA GGTGTGACAG GACTAGCTCA AGTGATGGGG 2160 

AGAGATGATA TCACTGATGA TCAAAAAGTA GCGTATGATC ATTATTACTT AACACATCAA 2220 

TCTATGATGC TTGATATGTA TATCIATATAT AAAACAATTA AAAATATCGT TACTTCAGAA 2280 

GGTGTGCATC ACTAATGAGA AAAAATATTT TAATTACAGG CGTACATGGA TATATCGGTA 2340 

ATGCTTTAAA AGATAAGCTT ATTGAACAAG GACATCAAGT AGATCAAATT AATGTTAGGA 2400 

ATCAATTATG GAAGTC6ACC TCGTTCAAAG ATTATGATGT TTTAATTCAT ACAGCAGCTT 2460 

TGGTTCACAA CAATTCACCT CAAGCAAGGC TATCTGATTA TATGCAAGTG AATATGTTGC 2520 

TGACGAAACA ATTGGCACAA AAGGCTAAAG CTGAAGACGT TAAACAATTT ATmTATGA 2580

GTACTATGGC AGTTTATGGA AAAGAAGGTC ATGTTGGTAA ATCAGATCAA GTTGATACAC 2640 

AAACACCAAT GAACCCTACG ACCAACTATG GTATTTCCAA AAAGTTC3GCT GAACAAGCAT 2700 

TACAAGAATT GATTAGTQAT TCX3TTTAAAG TAGCAATTGT GAGACCACCA ATGATTTATG 2760

GTGCACATTG CCCAGGAAAT TTCCAACX5GT TAATGCAATT GTCAAAGCGA TTGCCAATCA 2820 

TTCCCAATAT TAACAATCAG CGCAGTGCAT TATATATTAA ACATCTGACA GCATTTATTG 2880 

ATCAATTAAT ATCATTAGAA GTGACAGGTG TGTACCATCC TCAAGATAGT TTTTACTTTG 2940

ATACATCGTC AGTAATGTAT GAAATACGTC GCCAATCACA TOGTAAAACG OTATTGATCA 3000 

ACATGCCTTC AATGCTAAAT AAGTATTTTA ATAAGTTGTC GGTCTTTAGA AAATTATTCG 3060 

GCAATTTAAT ATACAGCAAT ACGTTATATG AAAATAATAA TGCACTTGAA ATTATTCCTG 3120 

GAAAAATGTC ACTTGTTATT GCGGACATCA TGGATGAAAC 6ACAACCAAA GATAAGGCAT 3180 

AAGTCATCTA TTAAATAAAA TCAACATACA AATCGTTTTA TTTGGAGGTT ATAGTATGAA 3240 

GTTAACAGTA GTTGGCTTAG GTTATATTGG TTTACCAACA TCAATTATGT TTGCAAAACA 3300 

TGGCGTCGAT GTGCTTGGTG TTGATATTAA TCAGCAAACG ATTGATAAGT TACAAAGTGG 3360 

TCAAATTAGT ATTGAAGAAC CTGGATTACA AGAGGTTTAT GAAGAGGTAC TGTCATCGGG 3420

AAAATTGAAG GTATCTACAA CGCCAGATGC ATCTGATGTT TTTATCATTG CCGTTCCGAC 3480 
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TAGTATTTTA TCATTTTTAG AAAAAGGAAA TACCATTATT GTAGAGTCGA GAATTGCGCC 3600 

TAAAACGATG GATGATTTTG TAAAACCAGT CATTGAAAAT TTAGGGTTTA CAATAGGTGA 3660 

AGATATTTAT TTAGTGCATT GTCCAGAACG TGTACTGCCA GGAAAAATTT TAGAAGAATT 3720 

AGTTCATAAC AATCGTATCA TTGGCGGTGT GACTGAAGCT TGTATTGAAG CGGGTAAACG 3780 

TGTCTATCGC ACATTCGTTC AGGGAGAAAT GATTGAAACA GATGCACGTA CTGCTGAAAT 3840 

GAGTAAGCTA ATGGAAAACA CATATAGAGA CGTGAACATT GCTTTAGCTA ATGAATTAAC 3900 

AAAAATTTGC AATAACTTAA ATATTAATGT ATTAGATGTG ATTGAAATGG CAAACAAACA 3960 

TCCGCGTGTT AACATCCATC AGCCTGGTCC AGGTGTAGGC GGTCATTGTT TAGCTGTTGA 4020 

TCCGTACTTT ATTATTGCTA AAGACCCTGA AAATGCAAA6 TTAATTCAAA CTGGACGTGA 4080 

AATTAATAAT TCAATGCCGG CCTATGTTGT TGATACAACG AAGCAAATCA TCAAAGT6TT 4140 

GAGCGGGAAT AAAGTCACAG TATTTGGTTT AACTTATAAA GGTGATGTTG ATGATATAAG 4200 

AGAATCACCA GCATTTGATA TTTATGAGCT ATTAAATCAA GAACCAGACA TAGAAGTATG 4260 

TGCTTATGAT CCACATGTTG AATTAGATTT TGTGGAACAT GATATGTCAC ATGCTGTCAA 4320 

AGACGCATCG CTAGTATTGA TTTTAAGTGA CCACTCAGAA TTTAAAAATT TATCGGACAG 4380

TCATTTTGAT AAAATGAAGC ATAAAGTGAT TTTTGATACA AAAAATGTTG TGAAATCATC 4440 
ATTTGAAGAT GTATCGTATT ATAATTATGO CAATATATTT AATTTTATCG ACAAATAAAA 4500

TGTGTCAAAC TAGGGCATAC ATGATTAAGG AAAGATAAGC TGTCATGTGT TTGAACTTCA 4560

GAGAGGATAA TGTTATGAAA AAAATTATGG TTATTTTCGG TACGAGACCC OAAGCAATAA 4620 

AAATGGCACC ATTAGTAAAA GAAATTGATC ATAATGGGAA CTTTGAAGCG AACATTGTGA 4680 

TTACAGCACA ACATAGAGAT ATGTTAGATA GTGTGTTAAG TATATTTGAT ATTCAAGCTG 4740 

ATCMX3ATTT AAATATTATG CAAGATCAAC AAACATTA6C AGGCCTTACG GCGAATGCAC 4800 

TTGCTAAACT TGATAGCATC ATTAATGAGG AACAACCGGA TATGATTTTA GTACATGGTG 4860 

ATACTACAAC GACTTTTGTA GGAAGTTTGG CAGCATTTTA TCATCAAATT CCGGTCGGAC 4920 

ATGTAGAAGC TGGACTTCGA ACACATCAGA AATACTCACC ATTTCCTGAA GAGTTAAATC 4980 

GAGTCATGGT AAGTAATATT GCTGAATTGA ATTTTGCGCC AACAGTAATT GCAGCTAAAA 5040 

ATTTACTTTT TGAAAACAAA GACAAAGAGC GTATCTTTAT TACTGGAAAT ACAGTTATTG 5100 

ACGCATTGTC AACAACAGTT CAAAATGATT TTGTTTCAAC GATTATTAAT AAACATAAAG 5160 

GCAAGAAAGT TGTTTTACTA ACAGCGCATC GTCGTGAAAA TATTGGGGAA CCGATGCATC 5220

AGATTTTTAA AGCAGTAAGA GATTTGGCAG ATGAATATAA AGATGTTGTC TTCATTTATC 5280 
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GGATTGAATT AATTGAGCCA TTAGATGCGA TTGAGTTCCA TAATTTTACA AATCAATCGT 54 00 

ACCTCGTGCT GACAGATTCT GGTGGTATTC AAGAGGAGGC TCCTACATTT GGAAAACCTG 5460 

TGTTGGTATT AAGGAATCAT ACAGAGCGTC CCGAAGGCGT TGAGGCGGGA ACATCX3AGAG 5520

TAATTGGCAC AGATTATGAC AATATTGTTC GAAATGTGAA ACAATTGATT GAGGATGATG 5580 

AAGCGTATCA ACGTATGAGT CAAGCGAATA ATCCATATGG TGATGGACAA GCATCACGAC 5640 

GTATTTGTGA AGCAATAGAA TATTATTTTG GATTGCGCAC AGACAAGCCG GATGAATTCG 5700 

TACCTTTACG TCACAAATAA TAAAAAACCC CTAATCATGA AGTTGGTTTA GACAACCAGC 5760 

GGTGACTAGG GGTTTTTAAT ATATTTATTT TTGATAGTGG TAGCCAATAT CATATTTGAA 5820 

TACTTTATTT GATAATATTG GACTTTGCTG TCCATOGTCA TCACTTTTTA AAOGTACATT 5880 

TTTATGAGCT TCTTTAAATA CATCGGAATT CAACCAATTA TTAAAOCTAT CTTCAGATTC 5940 

CCAAATAGTT AAGATTTTAA CTTCGTCTGT ATCCTCGGTA TTTAATGTTT TAGTGACAAA 6000 

CATTTGTTGG AAGCCTTCAA TAGTTTCAAT ACCTTGTCTA TTGTAAAAAC GTTCAATCX3T 6060 

TTCTTCCGCA CTGCCTTTTT GTAATTGTAA TCTATTTTCT OCCATAAACA TGGGCAATCA 6120 

CTCCTCTATT TTATGATTTG ATTTGGGTAA TGTTTTrACA AATGTAAAGA GTACAGOGGT 6180

TTGTATGATA ACCATTATGA TTAATCCTAC ACGGACTGCA AGAACATCCA CCATATAAAT 6240 

T6AAAAACCT ATTACAATGT ATAAOCTAAT TAAAATTTTA ATTTTCTGTT GTA6CGT6TA 6300 

GCCTCGATGT AAATAAAAGT TTTCTACATA TTCmATAA ATTTTTTGAT TAATAAGCCA 6360

ATTGTAAAAG CGATCTGAAC TTCGAGCAAA GCAAAAAACT GCTACX3AGTA AAAAAGGGGT 6420 

CGTTGGCAGT AAAGGTAATA CGGCACCTGC AATACCAAGC GCTGTAAATA TTAAGCCAAT 6480 

GACGATTAAA ATAAGTCGGA TTGAAAAAAC TCCATTCTAG TACTAATGCG CATGTAATAT 6540 

TGTTTTAGTA ATATAACTCA TGCTAAATAT AATOTGTATG ATAAGTGCAA T6ACTCAGTA 6600 

AAATCAAACG ATGTTGAATT ATCCTTGTCA CATTAACGCA TTTTAAGCGC GACTTTCATA 6660 

ACAACCAAAC TATTTAATGA GAATTATTCT CAAGTATTAT AGTTATATTA TGTGTTTTAT 6720 

TTTTGAAAAG TGCAATATGT TTTCGAAAAT AAGATTATTT TTATGTGCAA AAACGACGCA 6780 

AAAGTTTTAA AAATGAGACT TCTGTQAGCT GATTATTTTA TAAAATGTAA ACGCTTACTA 6840

TATAATGTGA ATCATATCGT TTAAAAGCAT TATTAAATAT GATGCTAAGA GATTTATATT 6900 

ATAGCCAATA AACAAAGGAG AGATAATATG GCAGTAAACG TTCGAGATTA TATTGCAGAG 6960 

AATTATGGTT TATTTATCAA TGGGGAATTT GTTAAAGGTA GCAGTGACGA AACAATCGAA 7020 

GTGACTAATC CAGCAACTGG AGAAACACTA TCACATATTA CAAGAGCAAA AGATAAAGAT 7080 
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TCAGAACGTG CACAAATGTT GCX5TGATATT 
GGTGATAAAT TAATGGCACA AAAAGATAAA 7200

ATTGCAATGA TTGAAACATT AAATAATC3GT AAACCGATTC GTGAGACAAC AGCAATTGAT 7260

ATTCCATTTG CTGCAAGACA TTTCCATTAT TTCGCAAGTG TTATTGAAAC AGAAGAAGGT 7320

ACAGTGAATG ATATCGATAA AGACACAATG AGTATCGTAC GACATGAGCC GATTGGCGTC 7380

GTAGGTGCTG TTGITGCTTG GAACTTCCCA ATGCTATTAG CTGCATGGAA GATTGCGCCA 7440

gCCATTGCTG CAGGTAATAC AATTGTGATT CAACCTTCGT CTTCAACACC ATTAAGTTTA 7500

TTGGAAGTTG CTAAAATTTT CCAAGAGGTA TTACCTAAAG GTGTTGTCAA TATACTAACG 7560

GGTAAAGGTT CAGAATCAGG TAATGGAATT TTCAATCATG ATGGTGTAGA TAAATTATCA 7620

TTXACGGGCT CAACTGAT6T AGGTTATCAA GTTGCCGAAG CTGCAGCAAA ACATCTAGTA 7680

CCCGCTACAT TAGAGCTTGG TGGTAAAAGC GCCAATATCA TATTAGATGA TGCTAATTTA 7740

GACCTTGCAO TTQAAGGTAT TCAGTTAGGT ATTTTATTCA ACCAAGGTGA AGTATGTAGT 7800

GCAGGTTCTC GATTATTAGT TCATGAAAAA ATTTATQATC AATTGGTGCC ACGTTTACAA 7860

GAGGCATTTT CAAATATTAA AGTTGGAAAT CCACAAGATG AAGCTACACA AATGGGTAGT 7920

CAAACTGGTA AGGATCAATT AGATAAAATT CAATCATATA TTGATGCAGC AAAAGAATCA 7980

GATGCACAAA TTTTAGCAGG CGGTCATCGC TTAACTGAAA ATGGATTAGA TAAAGGGTTC 8040

TTCTTTGAGC C6ACATTAAT TGctGTGCCA GACAATCATC ACAAATTAGC ACAAGAAGAA 8100

ATATTTGGAC CAGTGTTAAC AGTGATTAAA GT6AAGGACG ATCAAGAAGC AATTGATATA 8160

GCTAATX3ATT CTGAGTATGG TTTAGCAGGC GGTGTATTTT CTCAAAATAT CACACGTGCA 8220

TTAAATATTG CTAAAGCTGT ACGTACAGGA C6TATTTGGA TTAACACTTA CAACCAAGTA 8280

CCAGAAGGCG CACCATTTGG TGGTTATAAA AAATCAGGTA TCGGTCGAGA AACTTATAAA 8340

6GTGGGTTAA GTAACTATCA ACAAGTTAAA AATATTTATA TTGATACAAG CAATGCTTTA 8400

AAAGGTTTGT ACTAGAATAA ATATCGTTTC TGAAGCGTGT TTGTAGGTCA GTCTAGCGGT 8460

AAGTCTTAAC ATTTAACGGC GTTGTTTAGA TTTTAAGCAA AACAAAATAT ATAGGAACAC 8520

GTATCATGAT ATTAGGATAT AATGACTAAA ATAATAGCAG TAGGATGGTT TTTAATTGCA 8580

AATCATCTTA CTGCTGTTTT TAATTATGCT AATTTGCGAT GCGGCTATTA TAAGGACAGA 8640

GTTGTTTATT AATTATGGTG ATTTAGAAAT ATGAAGTTCA ATATGCAAAG TCATCGTTTG 8700

TTTTAATATG CGGAACAATC ATTAAAGTTA TTGCGATTTT TTGAACTTAA TGAAACTAAA 8760

CAATAAATTT GAGATACTTT TTTGTCATTT TTATGTAACT AACACAATAA TCTCGTACAT 8820

TATTAAAATT TTCTATATGA TAGGAATAAA GCAAAGCGCG AGTGTGCTGT AAAAGTTTTC 8880
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GATGATGTAT AAATCATGGT TAATTACGGA AGCATTAATA TTAACCTGAG AAGCTATAAA 9000 

GAATTATTTT TAAAAGCGAC AATATTAAAT ACGACGCATT TATTTAGGAG TGGCAAACGT 9060 

ATGAATGGGA AAAAGGCGAA TACGATAAAC AGATACAAAT ATTTTCATCA TGTCAATCAT 9120

CAAAAAATTC AACAAAGTTC TAAAAAGACG CTGTGGGCAT CACTAATCAT CACATTGTTA 9180 

TTTACAGTGA TTGAATTTGT CGGAGGTTTA GTATCTAATt CATTGGCATT ACTGTCAGAT 9240 

TCATTTCATA TGCTTAGTGA TGTATTAGCA CTTGGTTTAT CTATGTTGGC CATTTATTTT 9300 

GCAAGTAAAA AGCCGACTGC ACGATACACA TTTGGATATT TAAGATTTGA GATATTAGCT 9360 

GCATTTTTAA ATGGTTTAGC ATTAATTGTA ATTTCAATCT GGATTTTATA TGAAGCTATT 9420 

GTACGTATTA TTTATCCGCA ACCAATTGAA AGTGGCATTA TGTTTATGAT TGCTAGTATT 9480 

GGTTTACTCG TCAATATTAT TTTGACTGTT ATCCTTGTAA GGTCTTTAAA ACAAGAAGAC 9540 

AATATCAATA TTCAAAGTGC ATTATGGCAT TTCATGGGAG ACTTATTGAA CTCTATTGGT 9600 

GTCATCGTTG CAGTTGTATT GATTTACTTT ACAGGATGGC GCATCATCGA CCCAATCATT 9660 

AGTATTGTAA TTTCACTCAT CATTTTACGT GGTGGTTATA AAATTACGOG TAATGCgTGG 9720 

tTAAJTTTAA TGGAAAGTGT GCCTCAACAT TTGGATACTG ATCAAATTAT GGCAGATATT 9780

AAAAACATAG ATGGCATATT AGAT6TACAT GAATTTCATT TGT6GA6TAT TACAACAGAG 9840 

CATTATTCAT TAAGTGCCCA TGTTGTGTTA GATAAAAAAT ATGAGGGTGA TGATTATCAA 9900 

6CGATTGATC AAGTATCATC ATTGTTGAAA GAAAAATATG GCATTGCACA TTCAACGTTG 9960

CAAATTGAAA ACTTGCAATT GAATCCATTA GATGAGCCAT ACTTCGACAA ATTAACATAA 10020 

ATAAAACATT GTAGCGCCTA AAACATTAAT CTATGTCATA GGCGCACX^TT TCGTITTATA 10080 

CTTATGTTGC ATCATTTAAA TGATTTTCGT CAATTTCTTT GATGCTATCT ACATCTAACA 10140

CGACSLTCTTT AGGTTTCAAA ATATGAATAT GTTTTTCATC ATTTGTATGT AAAATGC6TT 10200 

CTATGATGTA CCTTTGACCG GCCATTGTTT CTACAGCAAT CTTTTTGTTT CTAGCTAAAC 10260 

TTGCTACGAC AGATTCTTTA TCCATAATGA TAGCCCCCTA TATATATGTT TATTTACTTA 10320 

TACCCTAACA TGATTTTTAT ACTCTTTGAA AATATATTTT ACAGAATTTT ATCTAAATAT 10380 

TTAAAAAAAT ATCTTAATAT CCTTGTAATC CGATAAGAAT TATAGTAATA TTTTTTCAAC 10440 

CATtGTTATA GGAGGTCTTA TTAATGACAT TATTTTTATT AGAAGCTAAC AATCTTGATT 10500 

TTGCATCAAC GAAAGAAGAA CTAGAAGCAA AGGCAGCATC ACTATCTACG AAGACAATTC 10560 

CAACATTAAT TGAAGTACAA GCTACTGAAA ATTTAACTCA TGGTTATTTT ATTGTGGAAG 10620 

CAAATGACGA aGCAGAAGCT AAACAATTTT TAACAGAAGC AGATATTAGT ATTCAATTAG 10680 
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TTGATTACCT TGTAACTTGG AACATTCCGG AAGGCATTAC GATGGATCAA TATTTAGCAC 10800 

GTAAAAAGAA AAATTCTGTT CATTATGAAG AAGTGCCAGA AGTTGAATTT AAACGCACAT 10860 

ATGTATGTGA AGATATGTCT AAATGTATTT GTTTATACAA CGCACCTGAT GAAGAAGCGG 10920

TACGTCGCGC GCGCAAAGCA GTTGATACAC CGATTGATGG CATCGAAAAA CTTTAATAAG 10980 

ACAACAAGTT GATGAGATAT ATGTATATAG GTTTGGCATG GATTTCGATT GCAGTTAATT 11040 

AGAATAGCTC AATGCTATAA ATGTAAGTAG TTGATATGAA GAAACTAATG AACTAAATGC 11100 

AAGTATTGTC TAAAACAATC ATTTTATTGA AATTTAGTAG AGCTGAAATT AATATAACGT 11160 

CGTrAATTGA ATAACGCTTA TGTTATAAGA GCACTCATAC CAAACCATAA TCATCTATAG 11220 

ATATAACAAT TCACGATATA AGGGCTGTGT TTGGCATAGC CCTTTAGATA TACACTTAAT 11280 

TCCTATTAAA ATAGTAGGGA TTAAAAGGGG GCTTGTCATG ATTAAAATTC AACAATTACA 11340 

ACATCACTTT GGATCACATA AAQTAATTCA TAACTTTAAT TTGGACATTA GCAAGGGAGA 11400

AATAGTCACT TTCATAGGGA AAAGTGGTTG CGGAAAGTCT ACTTTACTCA ATATTATCGG 11460 

TGGATTTATT CATCCATCGT CTGGTCGTGT CATTATTGAT AACGAAATTA AACAACAGCC 11520 

ATCTCCAGAT TGTTTAATGC TATTTCAACA TCATAATTTG CTGCCATGGA AAACGATTAA 11580

TGACAACATT AGGATTGGAT TACAACAGAA AATTAGTGAT GAAGAGATTA ACGCACAGCT 11640 

TAAATTAGTT GATTTAGAAG ACAGGGGAAA OCATTTTCCC GAGCAACTGT CCGGGGGTAT 11700 

GAAACAACGT GTGGCACTAT GTCX5AGCGCA TGTGCATAAG CCTAACGTTA TATTGATGGA 11760 

TGAGCCATTA GGTGCATTAG ATGCATTTAC ACGTTATAAA CTTCAGGATC AACTAGTGCA 11820

aCTAAAACAT AAAACGCAAT CAACTATTAT TTTAGTGACG CATGACATTG ATGAAGCTAT 11880 

TTATCTTTCC GACCGCATTG TTCTGTTAGG TGAAGGGTGC AATATTATTT CTCAATATGA 11940

AATB^CAGCA TCACATCCAC GCAGTC6TAA TGATAGCCAC CTACTTAAGA TTCGTAATGA 12000 

AATTATGGAA ACATTTGCAT TGAATCATCA TCAAGTTGAA CCTGAATATT ATTTATAAGG 12060 

AGTGAGTGAC GATGAAAAGG TTAAGCATAA TOGTCATCAT TGGAATCTTT ATAATTACAG 12120 

GAK3TGATTG GCAAAGGACG TCTAAAGAAC GGTCTAAAAA TGCCCAAAAT CAGCAAGTGA 12180 

TTAAAATTGG ATATTTGCCG ATTACACATT CAGCTAATTT GATGATGACT AAAAAATTAT 12240 

TATCACAATA CAATCATCCG AAATATAAAC TAGAATTAGT TAAATTCAAT AATTGGCCAG 12300 

ATTTAATGGA CGCATTAAAC AGTGGTCGTA TTGATGGTGC ATCAACTTTA ATAGAGCTAG 12360 

CGATGAAATC AAAACAGAAG GGCTCAAATA TAAAGGCTGT GGCATTGGGC CATCATGAAG 12420 

GCAATGTCAT TATGGGACAA AAAGGTATGC ACTTAAATGA ATTTAATAAT AATGGCGATG 12480 

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GTAAACAATT AAA6A.TTAAA CCGGGGCATT 
TTAGCTATCA TOAAATGTCG CCAGCAGAAA 12600

TGCCAGCCGC ATTGAGTGAA CACAGAATTA CAGGGTATTC TGTAGCCGAA CCATTCGGTG 12660

CACTGGGTGA AAAGTTAGGC AAAGGTAAGA CTTTGAAACA TGGTGATGAC GTTATACCTG 12720

ATGCGTATTG CTOTGTGCTA GTACTGAGAG GGGAATTGCT TGATCAACAC AAGGATGTAG 12780

CGCAAgCATT TGTACAAGAT TATAAAAAGT CTGGCTTTAA AATGAATGAT CGCAAGCAAA 12840

GTGTAGACAT TATGACGCAT CATTTTAAAC AAAGTCGTGA CGTTTTAACA CAGTCAGCGG 12900

CATGGACATC CTATGGTGAT TTAACAATTA AGCCATCCX^G CTATCAAGAA ATTACGACAT 12960

TGGTAAAACA ACATCATTTG TTTAATCCAC CTGCATATGA TGACTTTGTT GAACCGTCAT 13020

TGTATAAGGA GGCATCGCGT TCATGACACG TCCCACAAAT AACAAATTTA TATTACCTAT 13080

TATCACATTT ATTATTTTCT TAGGCATTTG GGAAATGGTC ATTATTATTG GGCATTACCA 13140

ACCTGTATTG TTACCGGGTC CTGCTCTTGT AGGAAAAAGT ATATQGTCTT TCATTGTTAC 13200

TGGAOAAATT TTCCAACATT TAGCAATTAG TTTATGGAGA TTTGTAGCGG GCTTTGTTGT 13260

CGCATTGTTG GTTGCTATTC CATTGGGCTT CTTGCTTGGA AGGAATCGTT GGCTATACAA 13320

CGCTATCGAA CCGCTATTTC AATTGATTAG GCCGATATCT CCGATAGCAT GGGCACCATT 13380

TGTTGTTCTA TGGTTTGGTA TTGGTAGTTT GCCAGCGATT GCGATTATTT TTATCGCTGC 13440

TTTTTTCCCA ATTGTGTTCA ATACTATTAA AGGCGTTAGA GACATTGAAC CTCAATATTT 13500

AAAAATAGCA GCAAATTTAA ATTTAACTGG GTGGTCATTG TATCGCAATA TATTATTTCC 13560

CGGGGCATTT AAACAAATCA TGGCTGGGAT ACATATGGCG GTAGGAACAA GTTGGATATT 13620

TTTAGTTTCT GGTGAAATGA TTGGTGCACA ATCGGGATTA GGTTTTTTAA TCGTTGATGC 13680

ACGAAATATG TTGAACTTAG AAGATGTTTT AGCAGCAATA TTCTTTATCG GATTATTTGG 13740

TTrXftTTATT GATCGATTCA TTAGTTATAT TGAGCAGTTT ATACTTAGAA GATTTGGTGA 13800

ATAAGGAGAG ATGATGATGA CTTTAGAAAC GCTTATCAAA GAACAATTAG ATCCTCATTT 13860

AGTAGAAGTT GATGAAGGGA CGTATTATCC GAGAACATTT ATTCAGCAAT TATTTGTAGA 13920

TGGTTA7TTC GGTGAGGCGG CATTGAGAAA AAATGCTGAA GTAATCGAAG CTGTATCGCA 13980

GTCTTGTTTG ACAACAGGAT TTTGTTTATG GTGCCAATTA GCTTTTTCAA C6TATTTAGA 14040

AAATGCCACG CAGCCACATT TAAATAATGA CTTACAACAG CAATTGTTAT CTGGAGAAAT 14100

ATTAGGTGCT ACCGGATTGT CTAATCCGAT GAAGTCATTT AATGATTTAG AAAAGTTGAA 14160

CCTTGAACAC ACTTATGTTG ATGGACAATT GGTTGTCAGT GGACGTATGC CAGCTGTAAG 14220

TAATATTCAA GAAGACCATT ATTTTGGTGC GATTTGGAAA CATGAATCAT CAGATGAATT 14280
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TTTAGGAGTC AACGGGTCAG CAACXSTATCA AATCACATTG AATCAAGTCG TAGTGCCACA 14400 

ATCACAAATT ATCACGCATG ATGCGAAGCA GTTTGCGGCA ACTATTCGCC CGCAATTTAT 14460 

TGCTTACCAA ATTCCAATAG GATTAGGCTC AATTAAAAGT TCTTTAGAGT TAATTGATGC 14520 

ATTTTCAAAT GTGCAAAACG GAATAAATCA ATATTTAGAG TATGATGTTG AAGCTTTTAA 14580 

AAAACGTTAT CGTCAACTTA GAGAGGAATA TTATGCAATA TTAGATGACG GTAACTTAAC 14640 

TTCACATTTA AATGAATTAA TATCATTGAA GAAGGACATC GGCTATTTAT TGTTAGATGT 14700 

AAATCAAGCT TCTGTTGTCA ATGGTGGTTC TAGAGCGTAC ACACCATATT CGCCACAAGT 14760 

TCGCAAGTTA AAAGAAGGAT TCTTCTTCGC AGCATTGACA CCGACATTAA GACATTTAGG 14820 

TAAACTTGAA GCAGAGTTGA AGGGGTAA6T GTGATAAGCT GATTTTTTGT TTAGATGCGT 14880 

TTGTTGAAAC ATTTTTTAAA ATAATATAAA TCTTAGTTTA TAAACATTTT CTGTTAATTT 14940 

GTTATATCCT TTTAACTAGG AAAATATACA TTTCGTAATA ATAATAATCG TTATCATTGA 15000

AAAAGTGTTA ATAAGGTGTA TAATGAAAAT GTGAACAATT AATGAACTTC TTATTTTAAA 15060 

GAAGGTGAAT ACTATAGATA CGCATACTAA AGAACAACAU TTCTCGAATC TAGTAAGATC 15120 

TTATCGTAAA GAATACGTGG GTAAAGGACC CAATAGTAT7 CGAGTGTCGT TTAAAGATAA 15180

TTGGGCGATT GCACATATGA CAGGTGTTTT GAGTAAAGTT GAGAGTTTTT ACCTAAACGA 15240 

CAAACGCAAT GAATCGATGC TCCATTATAC ACGCACAGAG AAGATTAAAC AGATGTATAA 15300 

AGAAATAGAT GTAAATGAGA TGGAAAGTCT TGTAGGCGCT AAGTTTGTAA AATTATTTAC 15360

AGATATTGAT TTGAATGATG ATGAAGTCAT TTCAATATTT GTTTTCGATA AGTCAATAGA 15420 

ATAAGTGTTG CTGGTGTAAG GTACACGGTG CTGTTTGCTA ACTTCGCTTT 6AATTTAACA 15480 

ATAATTCAAG GGGGTGGTAT GTCAAACGGT GCCGTTTTTT TGTCATATTT TTAAAACAAG 15540 

CAACATGCAA CACGTACTTT AAGGAAGTCA AAATTTATCA TTTAGGAGAG ATGGATATGA 15600 

AAATCGTAGC ATTATTTCCA GAAGCAGTAG AAGGTCAAGA AAATCAATTA CTTAATACTA 15660 

AAAAAGCATT AGGATTAAAA ACATTTTTAG AGGAAAGAGG ACATGAGTTC ATTATATTAG 15720 

CAGATAATGG TGAAGACTTA GATAAACATT TACCAGATAT GGATGTGATT ATTAGTGCGC 15780 

CATTTTATCC TGCATATATG ACTCGTGAAC 6TATTGAAAA AGCACCGAAC TTGAAATTAG 15840 

CAATTACAGC AGGTGTAGGA TCTGACCATG TAGATTTAGC GGCAGCAAGT GAACACAATA 15900 

TTGGTGTCGT TGAAGTTACA GGAAGTAATA CAGTTAGTGT GGCAGAACAT GCGGTTATGG 15960 

ATTTATTAAT ACTTCTTAGA AACTATGAAG AAGGTCATCG TCAATCAGTA GAAGGTGAAT 16020 

GGAACTTGTC TCAAGTAGGT AATCATGCGC ATGAATTACA ACACAAAACA AITGGTATTT 16080 
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10 






GTCTAAATTT GTAAGCTTTG 
ACCATTAACA CCAGAAACTG 
ACACAGTTAT TTAGTGAATA 
AGCGTTAgCA TCCGAGCATT 
ACCTGCTGAT CATCCATGGA 
TATGACTTTA GAAGCACAAA 
CTTCAATCAT GAACCTTTCC 
TAAAAGTTAT ACAGCTAAAT 
TATATAAATG AATCATATAA 
ATTACATCTA AGTATAGTCA 
GTGTTCACTT ACGCAATTAT 



TACAACACTA TGATCCT^TC AATCAACAAG ACCATAAATT 

ATGAACTTGT TTCAACAAGT GATGCGATTA CAATTCATGC 

ATAACTTATT TGATAAAGAT GTTTTAAGTC GTATGAAAAA 

CTGCACGTGG TAAAATTGTA AATCGCGATG CGTTAGTTGA 

TACAAGGATA TGCTGGTGAT GTTTGGTATC CaCAACCtGC 

GAACAATGCC TAGAAATGCT ATGACGGTTC ACTATTCAGG 

AACGTATTGA AGATGGAGTT AAAGATATTT TAGAGCGTTT 

AAGATAAAGA TATTATTGTT GCAAGTGGTC GTATTGCTAG 

AGAATAAGGA TGCTGGGCTA GCGATTAACG CTTTCAATTT 

GCACTACTGC TGTTGTAAAG ATGGCAGTAG TTTTTTTATG 

CGGCTATGTT AGGACAATGA TTTAACATTT AOGCACATAT 

TGAnAAATnT CATTCATGTG GnAATC 

(2) INFORMATION FOR SEQ ID NO: 47:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4012 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 
TTCAATGAGA GTAGTGGGCT GATGTTTAGC GATATCGCGT AAGATTAACC 
ATATATATTG TGTTTTTCTA AAATCGGCTC GGCTAATTTT AAATAGGGGC 
TATJg^CTA TTGAAAAATT CTTGTQATAO CATAGTGACA TCTCCTAAGA 
AGCTTAGCTA mCCTTTTTAC AACAATAGTA ATTATAAAAC GGGAGCAATT 
ATATAATTAT TAAGAGCAAA AATAATTATA CTTTGTTAAA ATAAGCGTAA 
ATAGGGGGAT ACTAATGATA TTGAAATTTG aTCACATCAT TCATTATATA 
ATCGGTTTAG TTTTCCAGGA GATGTTATAA AATTACATTC AGQTGGGTAT 
ATGGAACATT CAATAAATTA GGTTATATCA ATGAAAATTA TATTGAGCTA 
AAAATAATGA AAAGTTGAAA AAGATGGCAA AAACGATAGA mGGCGGAGTC 
CTCAAATTGT TCAAGAGAAG TATGAGCAAG GCTTTAAAAA TATTTGTTTG 
ATATAGAGGC AGTTAAAAAT AAACTACAAA GTGAGCAGGT TGAAGTAGTA 



ATTGGCCATA 
GATATATTGT 
CAAAATAGTT 
A6AAATCAAT 
TTACATGTAA 
GATCAGTTAG 
CATCATAAAT 
CTAGATGTAG 
GCTTTTGCTA 
CGTACAAATG 
GGGCCGATTC 



16200 
16260 
16320 
16380 
16440 
16500 
16560 
16620 
16680 
16740 
16800 
16826 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
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JO 






ATCAGGATGA TGATGAAATT AAGCCACCAT TTTTTATTCA ATGGGAAGAA AGTGATTCCA 780 
TGCGTACTAA AAAATTGCAA AAATATTTTC AAAAACAATT TTCAATTGAA ACTGTTATTG 840 
TGAAAAGTAA AAACCGATCA CAAACAGTAT CGAATTGGTT GAAATGGTTT GATATGGACA 900 
TTGTAGAAGA GAATGACCAT TACACAGATT TGATTTTAAA AAATGATGAT ATTTATTTTA 960 

GAATTGAAGA TGGTAAAGTT TCAAAATATC ATTCGGTTAT CATAAAAGAC GCACAAGCAA 1020 

CTTCACCATA TTCAATTTTT ATCAGAGGTG CTATTTATCG CTTTGAACCA TTAGTATAAA 1080 

TATACGTAAG TGCTATGAGC GAGAATGCCC ATATGAATAA TGACAAGCAC AATGGAAAGA 1140 

ATCGTTAATA TATTATTTAA TCGTGATGAC TTAATTAAAA TGAAAAAGAT TGATAATATA 1200 

AATGTGAAAA AGATAAGTAT AACCCGTAAA CTAAAGTAAT TCACGGTGAG AGGTTGACTC 1260 

AATGTCATAA TGATTGCAAC GATGTTCATA ATTATAAATA GACTTAAAAT AATTGTTCTC 1320 

ATATCAAACA CCTCATTGTT AGATTATTGA CATTATAACA GGGGTAATTG TATATGAACA 1380

TTAATGTGGT TGCTTGAGGA AAAATTTATT CATTGAAGTC AAGTTGGTTC ATTTTAGAAA 1440 

TGAATATCGT GTTAGATGAT GAAAGTATAT TGAAGTATAG GTAACTAGTT GAAAAGTATT 1500 

AATTGTACGA TAACATTAAA TTTAACACGA AACATAGATA TAAAATGATT CACAATTAAA 1560 

ATGGGTAAAT TTGAACTTGC TAAACTATTA ATTG6AGCAT GGACATTTCA AAAATAAGAG 1620 

TTCAAATCTT ACACAAGCTC TGAATCGACA CTATAAGATA CAAACTGTAT AATTAAAGGT 1680 

ATTGTTAAAT AGAAGGAGAT ATCATAAATC ATGGAAAAGA TGCATATCAC TAATCAGGAA 1740

CATGACGCAT TTGTTAAATC CCACCCAAAT GGAGATTTAT TACAATTAAC GAAATGGGCA 1800 

GAAACAAAGA AATTAACIGG ATGGTACGCG CGAAGAATCG CTGTAGGTCG TGACGGTGAA 1860 

GTTCAGGGTG TTGCGCAGTT ACTTTTTAAA AAAGTACCTA AATTACCTTA TACGCTATGT 1920

TATMTTCGC GTGGTTTTGT T6TTGATTAT AGTAATAAAG AAGCGTTAAA TGCATTGTTA 1980 

GACAGTGCAA AAGAAATTGC TAAAGCTGAG AAAGCGTATG CAATTAAAAT CGATCCTGAT 2040 

GTTGAAGTTG ATAAAGGTAC AGATGCTTTG CAAAATTTGA AAGCGCTTGG TTTTAAACAT 2100 

AAAGGATTTA AAGAAGGTTT ATCAAAAGAC TACATCCAAC CACGTATGAC TATGATTACA 2160 

CCAATTGATA AAAATGATGA TGAGTTATTA AATAGTTTTG AAC6CCGAAA TCGTTCAAAA 2220 

GTGCGCTTGG CTTTAAAGCG AGGTACGACA GTAGAACGAT CTGATAGAGA AGGTTTAAAA 2280 

ACATTTGCTG AGTTAATGAA AATCACTGGG GAACGCGATG GCTTCTTAAC GCGTGATATT 2340 

AGTTACTTTG AAAATATTTA TGATGCGTTG CATGAAGATG GAGATGCTGA ACTATTTTTA 2400 

GTAAAGTTGG ATCCAAAAGA AAATATAGGG AAAGTAAATC AAGAATTGAA TGAACTTCAT 2460 
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50 
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5 






CAAAATATGA 


TTAATGATGC 

X X X A X v^v^ 


GrAAAATAAA 


ATTGPTAAAA 


ATGAAGATTT 

A X \ J AAU A XXX 


A A A A PG AG AC 


2580 




TAGAAAAGGA 


AP ATP PTG AA 


GGTATTTATP 


TTTPTGGTGP 
X X X v» X vjrv;r x vtv^ 


A PT ATTZIATG 

X A X X AA X w 


2640 


X wV7V> X 


CAAAATCATA 


TT A <'"■!" 1' JIT AT 

X l^^X XnXnX 


GGTGPGTPTT 

Vjw X X w X X 


PT A ATG A ATT 

\» X ;A^^ X \ X X 


TAGAGA'l"I"!"r 

X AwAO A X X X X 


2700 


TTACCAAATC 


ATCATATGCA 


GT AT A PG ATG 


ATG A AGT ATY* 


PAPGTGAAPA 


TGGTGPAAPA 


2760 


Af'TTAPf^ATT 
.n^M ^ J. n^^yjn. X X 


TCGGTGGTAP 


AG ATA ATG AT 


PPAGATAAAG 


A PTP AG A A PA 


TTATGGATTA 

X X A X WW A X X A 


2820 


X VJVTVJV../^ XXX t\ 


AAAAAGTGTG 


GGGAAPATAP 


TTAAGTGAAA 

X X nAW X VJ AAA 


AGATTGCTOA 

AW A X X \ J^J X \Js\ 


ATTTG ATTAT 

AX X XVJAX XAX 


2880 




AGPPATTGTA 
nws>V«AX XV XA 


PP A ATTA ATT 
VVvAAX XAAX X 


GA GP A AGTT A 

UAUwmAUX lA 


H. A PPGPGTTT 
AAw^VwV^X X X 


AAPAAAAGPT 
AA\..AAAAV9^ X 


9QAn 


AAAATT&A&A 


TATITTCGTAA 
1/%X w x^vaxMn 


ATTAAAA^^A 
nX XAAnAWVjA 


AAATAGATTA 
AAAXAmAX XA 


A^YS A PTG A A A 
AlvltfAW X VXAAA 


TPTVSA APGPT 
X ^ X \ jAAwIiJV> X 


1000 


r'ATAAfilAnYS 


XVmAX X XVJW^V9X 


TP A^S^Tn^TT 
XVv/UxAX X X X X 


TTAPAPAATA 
X XAV.AV.AAXA 


TAG A ATGGTT 
XA!UJU%X\JwX X 


GAGT A A A ATA 

UtAUXAAAAXA 


men 


1 X X 1 XuAMXn 


XAvXOM/vvJM 


GGGTU? A AGT& 
ouvavavjAAw X A 


PTGTGATaaA 
W L\9 X VAX AAA 


AAAGPTATTA 
AAAm^XAX XA 


V>AAX X X X ^ X X 




TAnnnAATAA 
XnUOVXnnXnn 


UX X XW^M XaX^ 


TT^TTTA AT^S^S 
X X X X X AAXwVff 


X XV>X X X XAwX 


^ftTP'F*T^/2/iP 
X W X W X X 


GGTY2TATATG 
oO XoXAXAXVJ 






ATTYIA A ATT A 


GAATTaPTaP 
vAAi XAl. X Aw 


wAAAXuIAvlA 


AAA TPP AGTT 
AAAX ^VAu X X 


ATTTCAGTTA 








APGPPapaaa 

AV.VjL>W%UAnA 


^ITAPPPAAftA 


TTSAMTaaGT 
X\xAAAXAAui 


AnT A A A ATTrt 
AoXAAAaX L\9 




A n A ATP A AfSfT 


AAftATPAT^TG 
AnUtnX^aAX X\9 


GPATATYITGA 

IV^aAX aXW X OA 


AAAAXUX XAA 


A A PGP A ATPP 
AA^vj\.*aA i 


atapaaaaty: 

A X A^k^AAAA X Vs 




CTTCAATTGT 


AACAOTT6AA 


TAfYSAAAATH 
XAX\JiAAAAXA 


AXAwUjaXAX 


VvVxAXAAAwUA 


PAAPAAPArtP 
wAAwtAwAwW 




TT A A & A A Afl A 


AATP^ATAAA 
AAX^nSAX AAA 


ATTAAATTTA 
AX XAMAX X XA 


AAvAXVjtAAu X 


TY2GTP A APP A 


G A ATT A An AP 
uAAX XAAuAw 




fST A ATTrViAT 
\9Xiv\X XWOnX 


A X X X X X 


r'P^^STTTT 
^wlJUX X X XAU 


PATATTPATT 
L>AX AX XvAX X 


TTPA AATA AA 
X X\-AAAXAAA 


GAGAATYiAPT 
uA!LaAAX\JAL. X 




TY^AAAAAAfZT 


AAPGAAAGTA 
AAWvAAAvXA 


PTGAATY2AAP 
^XVxAAXoAAV* 


AATTAATAPP 
AAX XAAXAW^ 


AAAAT7YSPAA 
AAAAX XlJ\«AA 


APGGTAGAfVS 
AwVjv X A^A X \9 




^ X w X ^j^/innA 


TGPGPAATTA 
X \s^\9\miv\ X xn 


AA1Y3GGPAGA 


PGAAPPGTGA 

WUAAVV>W X V>A 


& A^PP & PPPTT 
aA X WAW^ w X X 


A A ATTT A AGP 
aaaX X X#U%^W 


3660 


AAAATflAAfTr 


TCAAAAATAT 
X \j!/in#\fv\ X #\x 


GGrti'l^T^G A PT^G 
wvtfx AUnwxw 


P^HG A TG A TG'P 
v» X \jax uax \> X 


AGAAAAPTAT 

AkXAAAA^ XAX 


PTAAAAAPGG 

\*XaaAAAV»UW 




caacSagaac AACGCCACTT GGATTGTTCC AATTTGGTGA TAAAGATAAT CAATTGTTGT 3780 

TGATGGTCAA TATCAATCTG TTGATGCTTT TAAAAACATA AATATTCCAT TAACGTGGCA 3840 

GGAGGACCAA GGGCATCTCA TCCCAAAGTG ACCATAAACC AAATTCAGCC ATGTCAGACG 3900 

TTATCAGGCA TCACCACAGC AAATTCAAAG CGTCAGCnCC AATATATAGT GGATGCCGCA 3960 

ilGAACTAGOG GTTTAGCGnT ATCAGTGGTG TGGCGACTCT ATTCTAAACG AT 4012 

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7778 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

CAATATAGGT CGCCGAGTTT CAACTaCATC AACTGGTTCA GTTACATTAG ATAATGCGCT 60 

AGGTGTAGGT GGCTATCCTA AAGGACGAAT TATTGAAATT TATGGTCCTG AAAGTTCTGG 120 

TAAGACAACA GTAGCGCTTC ACGCTATTGC TGAAGTACAA AGTAATGGCG GGGTGGCAGC 180 

ATTTATCGAT GCTGAACATG CTTTAGATCC AGAATATGCT CAAGCATTAG GCGTAGATAT 240 

CGATAATTTA TATTTATCGC AACCGGATCA TGGTGAACAA GGTCTTGAAA TCGCCGAAGC 300 

ATTTGTTAGA AGTGGTGCAG TTGATATTGT AGTTGTAGAC TCAGTTGCTG CTTTAACACC 360 

TAAAGCTGAA ATTGAAGGAG AAATGGGAGA CACTCACGTT GGTTTACAAG CTCGTTTAAT 420 

GTCACAAGCG TTACGTAAAC TTTCAGGTGC TATTTCTAAA TCAAATACAA CTGCTATTTT 480 

CATCAACCAA ATTCGTGAAA AAGTTGGTGT TATGTTCGGT AATCCAGAGA CTACACCAGG 540 

TGGACGTGCA TTAAAATTCT ATAGTTCAGT AAGACTAGAA GTACGTCGTG CAGAACAGCT 600 

TAAACAAGGA CAAGAAATTG TAGGTAATAG AACTAAAATT AAAGTCGTTA AAAATAAAGT 660 

GGCACCACCA TTTAGAGTAG CTGAAGTTGA TATTATGTAT GGACAAGGTA TTTCTAAAGA 720 

GGGTGAACTT ATTGATTTAG GTGTTGAAAA CGACATCGTT GaTAAATCAG 6AGCATGGTA 780 

TTCTTACAAT GGCGAACGAA TGGGTCAAGG TAAGGAAAAT GTTAAAATGT ACTTGAAAGA 840 

AAATCCACAA ATTAAAGAAG AAATTGATCG TAAATTGAGA GAAAAATTAG GTATATCTGA 900 

TGGTGATGTT GAAGAAACAG AAGATGCACC AAAGTCATTA TTTGACGAAG AATAGTACAC 960 

AAATTTATAT CTATAGTTAA ACTTAGCAAA TATCCTTATA GGATTGATTG AAAGTGATAT 1020 

TCATCTCATA AAGCTAGAAT AATATCTAAC TTTATGGGAT ACACTACAAA TCGAGACTAT 1080 

AAGGTTTTTT ATTTTATTTA TTATTACATT ATCAATAGTT TTATAATCGA GCTTCAAAAC 1140 

TTTAeAAAAT AGTAGAAATA GCATTCAATA TAGTGCAAAA GTGCAAATTG ATAACTTGAC 1200 

ACTTATCTCC TATAAACCGT ACAATTAATT TGTATGATTT ATATATAATT TCATAAAGTC 1260 

ATATTGAATT TCATATAAAG AGCAAACCCT AGAAAAGGAG GTGTTTGTGT GAATTTATTA 1320 

AGCCTCCTAC TCATTTTGCT GGGGATCATT CTAGGAGTTG TTGGAGGGTA TGTTGTTGCC 1380 

CGAAATTTGT TGCTTCAAAA GCAATCACAA GCTAGACAAA CTGCCGAAGA TATTGTAAAT 1440 

CAAGCACATA AAGAAGCTGA CAATATCAAA AAAGAGAAAT TACTTGAGGC AAAAGAAGAA 1500 

AACCAAATCC TAAGAGAACA AACTGAAGCA GAACTACGAG AAAGACGTAG CGAACTTCAA 1560 

AGACAAGAAA CCCGACTTCT TCAAAAAGAA GAAAACTTAG AGCGCAAATC TGATCTATTA 1620 

GATAAAAAAG ATGAGATTTT AGAGCAAAAA GAATCAAAAA TTGAAGAAAA ACAACAACAA 1680 
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CGCATCTCCG GTCTCACTCA AGAAGAAGCT ATTAATGAGC AACTTCAAAG AGTAGAGGAA 1800 

GAACTGTCAC AAGATATTGC AGTACTTGTT AAAGAAAAAG AAAAAGAAGC TAAAGAAAAA 1860 

GTTGATAAAA CAGCAAAAGA ATTATTAGCT ACAGCAGTAC AAAGATTAGC AGCAGATCAC 1920 

ACAAGTGAAT CAACGGTATC AGTAGTTAAC TTACCTAATG ATGAGATGAA AGGTCGAATC 1980 

ATTGGACGAG AAGGACGAAA CATCCGCACA CTTGAAACTT TAACTGGCAT TGATTTAATT 2040 

ATTGATQACA CACCAGAAGC GGTTATATTA TCTGGTTTTG ATCCAATAAG AAGAGAAATT 2100 

GCTAGAACAG CACTTGTTAA CTTAGTATCT GATGGACGTA TTCATCCAGG TAGAATTGAA 2160 

GATATGGTC6 AAAAAGCTAG AAAAGAAGTA GACGATATTA TTAGAGAAGC AGGTGAACAA 2220 

GCTACATTTG AAGTGAACGC ACATAATATG CATCCTGACT TAGTAAAAAT TGTAGGGCGT 2280 

TTAAACTATC GTACGAGTTA CGGTCAAAAT GTACTTAAAC ATTCAATTGA AGTTGCGCAT 2340 

CTTGCTAGTA TGTTAGCTGC TGAGCTAGGC GAAGATGAGA CATTAGCGAA ACGAGCTGGA 2400 

CTTTTACATG ATGTTGGTAA AGCAATTGAT CATGAAGTAG AAGGTAGTCA TGTTGAAATC 2460 

GGTGTAGAAT TAGCGAAAAA ATATGGTGAA AATGAAACAG TTATTAATGC AATCCATTCT 2520 

CATCATGGTG ATGTTGAACC TACATCTATT ATATCTATCC TTGTTGCTGC TGCAGATGCA 2580 

TTGTCTGCGG CTCGTCCAG6 TGCAAGAAAA GAAACATTAG AGAATTATAT TCGTCGATTA 2640 

GAACGTTTAG AAACGTTATC AGAAAGTTAT GATGGTGTAG AAAAAGCATT TGCGATTCAG 2700 

GCAGGTAGAG AAATCCGAGT GATTGTATCT CCTGAAGAAA TTGATGATTT AAAATCTTAT 2760 

CGATTGGCTA GAGATATTAA AAATCAGATT GAAGATGAAT TACAATATCC TGGTCATATC 2820 

AAGGTGACAG TTGTTCGAGA GACTAGAGCA GTAGAATATG C6AAATAATT TTTGTCTCCC 2880 

TCACAAATTA GTGAGGGAGC TTTTTTAAGT TGTAGTCTTA AtCTAGTTAG ACAGCACTTT 2940 

ATCGGTAATA ACTATATTAA ACAGTAGTTA TTTGAAAGTA AGACGGACCT TATATTAAAT 3000 

AAGAAGTTAT TGCTTTTAAT AAAAATGTTT TAGGCTTCGT AATTACTATA TTTATATTAT 3060 

GTAAACCTAT AAAGATGATT GGTTTTCTAT CCAATAAAAA AGAAGAGAAG ATGTAACACA 3120 

TCTTCTCTTC yGCAATATTA ATTAGGATTT ATTTCTAAGT TGAGTTAITT TAATTGTAAA 3180 

TCTGTTTTCT TTAATTCTTT TATAACTTCT GCAGTATCAT AACAATTTGT TGCAATTGTT 3240 

GAATATCTCT CTGCTAAACQ ATATGCATTA ATGTAAAGCT TTAAACTTTC TTTAGCTATA 3300 

TCCTCTGCAT CTTCGAATTT TGATGGGTTA GACATAACCA CTAATTCTGC AAATTTTTCT 3360 

GGATCAATAT TAATAGACAT GTATTTATTT ACAACTCCTA TTTATTTTGA TGTCTTAATA 3420 

CTAACATATT GAAGTTTTCA GACAAAGTAA TGTCTCTCTA TAATTGAAGA AAAATAATTC 3480 
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10 




GGATGAACAA AACATGAGAA TAATGTTTAT AGGGGATATC GTAGGTAAAA TTGGACGAGA 3600 

CGCAATTGAA ACGTACATAC CTCAACTGAA GCAAAAGTAT AAACCAACAG TTACAATTGT 3660 

AAATGCTGAA AATGCAGCAC ATGGTAAAGG TTTGACTGAA AAAATATATA AACAATTACT 3720 

AAGAAATGGT GTAGATTTCA TGACTATGGG TAATCACACA TATGGTCAAC GTGAAATTTA 3780 

TGATTTTATA GATGAAGCAA AACGACTAGT AAGACCAGCG AATTTTCCGG ATGAAGCGCC 3840 

GGGAATTGGT ATGAGATTTA TACAAATTAA TGATATTAAA CTTGCAGTTA TTAATCTGCA 3900 

AGGAAGAGCG TTTATGCCAG ATATTGATGA TCCTTTTAAA AAGGCAGATC AATTAGTCAA 3960 

GGAAGCACAA GAACAAACTC CGTTTATATT TGTTGATTTT CATGCAGAAA CAACTTCTGA 4020 

AAAGTATGCA ATGGGATGGC ATTTAGATGG TAGAsTAGCG CTGTTGTTGG AACGCATACA 4080 

CACATTCAAA CAGCAGATGA ACGTATTTTA CCAAAGGGGA CAGGGTATAT AACGGATGTT 4140 

GGTATCACAG GTTTTTATGA TGGCATTTTA GGAATAAATA AAACAGAGGT AATTGAGCGT 4200 

TTTATCACTA GTTTGCCACA AAGACATGTT GTTCCAAATG AAG6TAGAAG TGTATTATCT 4260 

GGTGTTGTTA TTGATTTAGA CAAAGAAGGT AAAACAAAGC ACATCGAACG TATATTGATA 4320 

AATGATGACC ATCCATTTTC AACATTTTAA AATTACGTAA GTAAACATTC GAATTGGACC 4380 

CTATCGTCCA TTAGTATGAA TTTAATATAG TACCACTGTT TACATAGTAA ATCGGTGGTT 4440 

CTTTTTGTTA TCATTTAATA TGAAATATAT CCATAGGAGG CATATAACTA TGAAACCACA 4500 

ATTATCGTGG AAAGTTGGCG GTCAACAAGG CGAAGGTATT GAATCAACTG GGGAAATCTT 4560 

CGCTACGGCT ATGAATAGAA AAGGATATTA TTTATATGGA TATAGACATT TTTCAAGTCG 4620 

TATCAAAGGT GGACATACGA ATAATAAAAT TAGAGTTTCT ACGACGCCTG TTCATGCAAT 4680 

TAGTGATGAT TTAGATATTT TGATTGCATT TGACCAAGAA ACAATTGATG TTAACCATCA 4740 

TGAAATGAGA GAAGACAGTA TTATTTTArC TGATGCCAAG GCTAAACCTG TGAAaCCAGA 4800 

AGGATGTCAT GCACAGCTTA TTGAATTACC TTTTACAGCA ACCGCTAAAG AATTAGGTAC 4860 

AGCATTAATG AAAAACATGG TTGCAATAGG TGCTACTAGC GCATTGATGA ATTTGAATAC 4920 

AAATACATTT GAAGAACTTA TTACTAATAT GTTTTCTAAA AAAGGTGACA AGGTAQTTGA 4980 

AGTCAATATC CAAGCATTAA ACGAAGGTTA TCAATTAATG CAATCTCGCT TACCTGAAAT 5040 

CTACGGGGAC TTTGAATTAG AGTCAACAGA TGCACTACCA CATCTATATA TGATTGGTAA 5100 

CGATGCCATT GGATTAGGTG CAATTGCTGC AGGTTCACAA TTTATGGCGG CATATCCTAT 5160 

TACACCTGCG TCTGAAGTTA TGGAATATAT GATTGCCAAT ATATCTAAAG TAAACGGAGC 5220 

GGTTATTCAA ACAGAAGATG AAATTGCTGC TGTAACTATG GCTATTGGTG CAAATTATGG 5280 
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TGGATTATCT GGTATGACTG AAACGCCATT AGTCATTATT AATACCCAAC GAGGTGGACC 5400 

TTCTACTGGA TTACCTACGA AACAAGAACA GTCAGATTTA ATGCAAATGA TTTATGGTAC 5460 

ACATGGTGAT ATTCCAAAAA TTGTTGTAGC ACCAACAGAT GCAGAAGATG CATTTTATTT 5520 

AACTATGGAA GCATTTAATT TAGCAGAACA ATATCAATGC CCTGTTATAG TTCTAAGTGA 5580 

TTTGCAATTA TCTTTAGGTA AACAAACTGT TGAAAAATTA GATTATAATC GTATTGAAAT 5640 

TAAACX5TGGT GAAATCATTC AATCTGATAT TGAACGTGAA GAAGATGATA AAGGTTATTT 5700 

CAAGCGTTAT GCGTtAACAT CCGATGGTGT TTCTCCTAGA CCTATCCCCG GTGTTAAAGG 5760 

AGGTATTCAT CATATAACTG GTGTGGAaCa CAATGAAGAA GGTAAACCTA GTGAATCTGC 5820 

GTCAAATAGA CAACAACAAA TGGAAAAACG AATGCGTAAA ATTGAGCAGT TACTAATTGA 5880 

ATCGCCAGTA GAAGCTAACT TACAACATGA GGATGCAGAT ATTCTTTATA TCGGTTTTAT 5940 

TTCTACAAAA GGTGCAATTC AAGAAGGTAG TAACCGTTTG AATCAACAAG GCATAAAAGT 6000 

TAACACTATA CAAATTAGAC AATTGCATCC ATTCCCAACA AGCGTTATTC AAGATGCAGT 6060 

TAATAAAGCG AAGAAAGTCXS TTGTAGTGGA GCACAATTAT CAAGGACAAT TGGCTAGTAT 6120 

TATAAAAATG AATGTCAATA TTCATGATAA GATTGAAAAT TA7ACAAAGT ATGATGGGAC 6180 

ACCTTTCCTA CCACATGAAA TCX^AAGAAAA AGGCAAAATA ATTGCTACTG AAATAAAGGA 6240 

GATGGTATAG ATGGCXy^CAT TTAAAGATTT TAGAAATAAT GTTAAGCCTA ACTGGTGCCC 6300 

CGGATGTGGC GATTTCTCAG TACAAGCTGC AATTCAAAAA GCAGCCGCAA ATATAGGGTT 6360 

AGAACCTGAA GAAGTAGCTA TCATCACCGG TATAGGATGT TCTGGCCGTC TTTCAGGATA 6420 

TATTAATTCT TATGGCGTTC ATTCTATTCA CGGACGTGCA TTACCTTTAG CTCAAGGTGT 6480 

AAAAATGGCG AATAAAGATT TAACTGTTAT TGCATCGGGA GGAGATGGTG ATGGTTATGC 6540 

TATAGGTATG GGGCATACAA TCCATGCTTT AAGAAGAAAT ATGAACATGA CGTATATAGT 6600 

CATGGATAAT CAAATTTATG GTTTGACAAA GGGACAAACA TCGCCGTCAT CAGCAGTAGG 6660 

ATTTGTTACT AAAACAACGC CAAAAGGTAA TATAGAAAAA AATGTTGCGC CTTTAGAATT 6720 

AGTATTATCA TCTGGTGCCA CATTTGTAGC CCAAGGTTTT TCAAGCGATA TTAAAGGATT 6780 

AACAAAACTA ATTGAAGATG CAATTAATCA TGATGGATTT TCATTCGTTA ATGTCTTTTC 6840 

ACCATGTGTG ACTTATAATA AAATTAACAC ATACGATTGG TTTaAAGAAC ATTTAACAAG 6900 

TGTTGATGAc ATTGAAAATT ATGATTCTAC AGATAAACAA TTAGCGACTA AAACTGTTAT 6960 

TGAACATGAA TCTTTAGTAA CTGGTATTGT TTATCaAGAT AAAGAAACAC CATCATATGA 7020 

ATCtCAAATT AAAGAGTTAG ATGATmCACC ACTTGCTAAA AGAGATATCa AAATTaCTGA 7080 
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TGTATTTATA ACAGATCCAT TTATGCTACT CAGTTTTTTA CTATTACAAA AAATAAAGGA 7200 

GTTTTTAAAA ATGAAAGACA CATTAATGAG TATACAAATA ATTCCTAAAA CACCAAACAA 7260 

TGACAATGTT ATACCTTACG TAGACGAGGC GATTAAAATA ATTGACGAAT CTGGTTTGCA 7320 

TTTTAGAGTA GGTCCGTTAG AAACGACAGT ACAAGGAAAT ATGAATGAAT GTTTAATTTT 7380 

AATACAATCA TTAAATGAAC GAATGGTGGA ACTTGAATGT CCAAGTATTA TTAGCCAAGT 7440 

TAAGTTTTAT CATGTGCCAG ATGGCATCAC TATTGAAACT TTAACTGAAA AATATGATGA 7500 

ATAACATTAA AA6TGAAGTA A SJWM X X & GTTAGAGA*TC3 AOGTATAACT 7560 

TTAACTGTTT TTGCACTTTA TAGTTAAATT TAATATAATT ATTAAATGAT ACGGGCAAAT 7620 

AGAAAGGATT TTGTAAAGTG AACGAAGAAC AAAGAAAAGC AAGTTCTGTA GATGTTTTAG 7680 

CTGAGAGAGA TAAGAAAGCA GAAAAAGATT ATAGTAAATA TTTTGAACAT GTTTATCAGC 7740 

CGCCTAATTT AAAAGCAAGC GCAAAAAAAG AGGTnAAA 7778 



(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1128 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

AGATGAAGTT GTTACgAAAA TTGCGTACGC TGTTTCAGAA CATGTCAAAA TAGAAACAGG 60 

TAATCCATTC TTTCAAACAT CACATAGTGG TTGTGCGACG GGCGGATCCT GTAATTGTTC 120 

ATTATAAAAA ACATCGAGTC AGAAAAAGGT GGTTATTGAA cCACTAACTA GCATCTGACT 180 

CGATGTTTTT ATTTATTCGG GATTGTTTGT TTGAATTGTT GTGCTAAATC TGGTCGATCT 240 

GTCACAATCG TGTGTGCACC TTTTTGGTAT AAATCATTCA TCAGATTTAT ACTATTTACG 300 

CCATAATAGC CTGGAATGAT ATTCATATCA TTTAACCATT TGATAAAACG AGATGAAGTC 360 

AAATCAATGC CTTTAAAATG AGTAGGCATT TGGAACGTTT GTGCTAATGG TTGGTAGTAC 420 

CTACCACCTA ATAAATGATA TTTTAAAAAT GCTTCTGTAA CTTCCTGTTG GCTAGCACCA 480 

ATTGCGACGG ATCCTTGTGC AATTTTATTA AAACGAACGA TTTGTTCTTT ATAAAAACTT 540 

GTCACAAGAA CGCGGTCAAA TGCTTGATTT TCTGCAATTG TATCAAACAT AATTTGTGGT 600 

GCGATTGAGC CTTCATAGGA TTCAGGAGCA TCTTTTAAGT CTACGTTTAT ATACATATCA 660 

GGATATTGCT TCAGCAACTc ATCGAAGGTT AGTATAGCTG TGTGTGCATG ACCACGATAT 720 

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AATGTATGGG CACTAACTTT TCCAGAGCCG TTCGTCGTTC TATCAACAGT TGCGTCATGA 840 

AAAACGATAA GCTGTTGATC TTTTGTGAGT CTCACATCTG TTTCAAAGCC ATCAACGCCT 900 

AATTGTTTAG CATAGTCAAA TGCAAGTTGC GTTTGCTCTG GTCTTAAAGC CATACCACCG 960 

CGATGCGCAA ATATATATGG TGCATTGCCT TTGAAAAAAG CAGGGATGGT TTGCTTTTTA 1020 

GTAATCACTT TATTTTTATT GATCATTAAT AGACTACTTA AAAATCCAGC ACCGACTAGT 1080 

ACCGCATTTA AAATCTTTCT GTTTACnTTT TTCATAAAAA ATTCCTCC 1128 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6252 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

CAAGCAAACA ATCGTCGATA AAATTGCTAA AATAATAAAA GTAATTCGAA CTTTCATCAT 60 

GATCATCCTT TGTTTATAGA GTCAATATAA GTATGGAATA TGTTAGGTAT ATAGTCAAAT 120 

GCGTCAACTA ATGGGAATTT TGGCATAGAT AGAGAATTTA AGGCAATTAA AAAGGCATCA 180 

AACAGTAATA TGCTGCTTGA TGCCCAAATG ATGACTTTAG CTAAATTGAT TAGTCACTTT 240 

TAAAGATAAA GAATTGTCAT GAATTAAAAC TCATGTAATG ATGTGTTACA TTTCGCAATG 300 

ATGGCTTTCA GTTATTTATC GATAACATCA CTCTTGATAC CTTTAGATTT TAAGAAATCT 360 

TTAArrriAT CTTGTTGCTT TTTATTAACA TCACCGGCAT ATTTTGTTGG CACGTCGACA 420 

ACATTGATTT TATTTTGCGG TTGATAGCTA AGCTTTTCAA TATCTTCATC AACATTGGCG 480 

ATTGTACTAT TTAAAGCTTT GAAGTAATTC ATCATTAATT CAACGGGTTT CTTATATTCT 540 

TTAGGAATAT TGTTTTCAGT GACAAATTTC TTGAAATGCA AATCGTTTTT AACAGCTAAG 600 

TTAGATAAGT GGCTAAGTGT TTCTGCTTGT TTTTCAGTCA CTTTTGTTTG ACTGTCAATT 660 

TGTTTATCTA GTTTATGTTG CATAATATAT TTGTTATCAA GTATATCGCT ATTTACAGAC 720 

AAATACTTTT CTATAGCTTG CTTCATCTCT GCATCACTAA TATCACTATT TTTCTTATCT 780 

GAGTTAAAGA TATCTTTTGT tTCTAATTTT TTAGCGCTTT TAGGTGCATG GATGCCAGTA 840 

CTTGTATGAT GATCTTCGTT ATCAGATTGA TCGGACGCGC AACCTGTAAG AATTAATGTC 900 

GATGCTAAAA ATGTACTTAG TAGTAATCTC TTTTTCATAA TGTAATATAA CTCCTTAGTT 960 

TATCTTTAAT TGAAAAAATA TGTATTCATG TTTAATAGAG TAACATTGAA TTAGTTTGGA 1020 

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TCTATCAATA ATGCATCATT TTGGACGTTG TTAAGGATAG CTTTATCTAT AAATAACTGC 114 0 

ATAATTGGTT GTACTAATTT AGACGTAGGT ATCGTACGTA AAAGCATAAT AATTTCGTTC 1200 

ACATACTTTT CTTTCTCAAT ATCATTTTTC ATATTGATTT GTTTGCGAGA GGTACATACT 1260 

TTAAGCATTA TCGCACATCT CGTTGTATAT ATTAAGTTTA TCATAACATG ATTTTATGTC 1320 

GGGATAAAAA AATAACAGCA TCTTAACAAA TGTAAGATAC TGTCAGTGAA ATGAATGAAA 1380 

CTTTAGTTTC TGaTAATATA GTCAAAGGCA TTTAATGCTG CATTTGCACC AGCGCCCATT 1440 

GAAATGATAA TTTGTTTGTT CTTCTGATCT GTGACATCX5C CAGCAGCAAA TATTCCAGGA 1500 

ACATTCGTAT TATTGTTACG ATCAATCACA ATTTCACCAC GTTCGTTTAA TTCAACAGCA 1560 

TCGTTTAACC ATGATGTGTT TGGAAGTAAA CCAATTTGAA CAAAGATACC ATCTAAGTTA 1620 

AGTAGATGTT CTTCGCCGGT GTTCATGTCT TCGTAACGTA TACCTGTAAC ATGGTCTTCT 1680 

CCGACAACTT CAGTAGTITT GGCATTTGTT TTGATATCAA CATTTGATAA AGAACGTAAA 1740 

CGATCTTGTA ACACXSTTGTC TGCTTTTAAT TCGCTAGCGA ATTCGAATAA TGTAACATGA 1800 

TTAACGATAC CAGCAAGGTC AATTGCTGCT TCAACCCCAG AGTTACCGCC ACCGATAACT 1860 

GCTACGTCTT TATTTTCAAA TAGAGGTCCG TCACAGTGAG GGCAGAATGC AACACCTTTA 1920 

TTAATCAATT GCTCTTCACC TGGAATGTTT AGCTTACGCC AACCTGCACC AGTAGCAATA 1980 

ATGACTGTTT TACTTTCTAA GACAGCACCG TTTTCTAACG TAACTTTAAT TGCTTCGTCA 2040 

GTCTTTTCGA TATCTGTAGC ACGTATACCT GTCATTGCAT CAATGTCATA TTGATCAATG 2100 

TGCGCTGCTA AGTTAGAAGA AAATTCAGAA CCAGTTGTTT CTTTAACAGT AATGAAGTTC 2160 

TCAATACCAG CAGTATCATT AACTTGGCCA CCGATACGAT CAGCAACTAT ACCAGTACGT 2220 

AAACCTTTAC GTGCTGTGTA AATCGCTGCA CTACCACTAG CAGGACCACC ACCAACGATT 2280 

AAGACATCAT AAGGTTCTTT ATTTTCAAAC TCAGATGCAT CTGCCGTACT GCCTAGTTTC 2340 

GAAAGAATAT CTTGGATTGT CATACGACCA TTGCCAAATT CTTCGCCATT TAAAAAGACA 2400 

GCAGGGACTG CCATGATGTT TTCAGATTCT TCACGGAACA CTGCACCATC AATCATAGAA 2460 

TGCGTGATGT TAGGGTTGAT CACACTCATT AAGTTAAGTG CTTGAACGAC ATCAGGACAT 2520 

TTTTGACACG TTAAACTAAT GAATGTTTCA AAATGGAATG AACCTTCTAA TTTTTTAATT 2580 

TGGTCAATGA TTGACTGTTT TTCTTTAGGT GCACGACCAC TAACCTGTAA AATTGCTAAA 2640 

ACAAGTGAGT TAAACTCXyTG ACCTAATGGA ATACCTGCAA ATGTTACACC TGTTTCTTCG 2700 

CCAGGACGAT TGACTGAGAA ACTTGGTGTA CGTTTTAAAG ATTTTTCAGA AAGAGATAGT 2760 

CTAGGTGACA TATCAGTAAT TTCTGTCAAC AAATCTTTAA GTTCTTTGGA TTTATCATCT 2820 
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TGTTGTTTTA AATCAGCATT AAGCATGGTT GTAATGCCTC CTTAGATTTT ACCTACTAAA 2940 

TCTAAACCAG GTTGCAATGT TTTAGCGCCT TCTTCCCATT TAGCTGGGCA TACTTCGCCA 3000 

GGGTTTTTAC GAACATATTG AGCTGCTTTG ATTTTGTGAG CTAATGTACT AGCGTCACGG 3060 

CCAATTCCGT CAGCGTTAAT TTCAGATGCT TGTACAACAC CGTCTGGGTC GATAATGAAT 3120 

GTACCACGTT GAGCTAAACC AGTAGCTTCA TCTAATACAT CAAAATTACG AGTGATTGTT 3180 

TGTGATGGGT CACCAATCAT AGTGTAAGTG ATTTTGCTAA TTGCATCTGA ATGGTCATGC 3240 

CATGCTTTGT GTACGAAGTG AGTATCAGTT GATACTGAGA ATACATTTAC GCCTAATTTT 3300 

TGTAATTCTT CATATTGGTT TTGTAAGTCT TCTAATTCAG TTGGACAAAC GAATGAGAAG 3360 

TCAGCAGGAT AGAAGCATAC TACGCTCCAA GAACCTTTTA TVATCTTCTTG TGTAACTTCT 3420 

TTAAATTGAT CTTTTTTTGG ATCGAAArCT TGCGCTGTAA ATGGTAAGAT TTCTTTGTTA 3480 

ATTAATGACA TAAATATCTT CCTCCTAAGA ATTTAAGTAT GAATTAGAAC TATCAATTGA 3540 

TTGCGCTTAA TTATAATAAT TCTAATCTCT TAGTTAGCAT TATTACATTT TGATCCAGAA 3600 

TAGTCAACTG GATAACTTTG TAAAGTGAAT GATTACTTTT AAAATAAAGA AAGATAATAT 3660 

AAAGTGCTTT GATAATGGAT TTTGTAGTTG ATGATTTAAA AGGTTGTGTC TATATtTAAT 3720 

ATCTTGATTT TAATGTAAAA AATGTAAAAA AAGAAGATTT GTATTCTCAA CTAAGTCAAC 3780 

CTTATTGATA ATGGTATGAG AATATTTGTT CGAGATGGAT GAAGGTAATG AGTGAGAAAC 3840 

TGGATTTTTA AAGTATGAGA CAATATTTTA AAAAGTTCAA TTATTAACTT ATAAGCAAAT 3900 

AATTGCTATA AAAAAGTTTG GACGTGTACA ATTOCAATAT QAAGATTTTA AATTAATTGT 3960 

AAAGTATCGA GGAGTGGGTA ACGTGTCAGA ACATGTATAT AATCTTGTGA AAAAGCATCA 4020 

TTCTGTTAGA AAATTTAAGA ATAAACCTTT AAGTGAAGAC GTTGTTAAGA AATTGGTAGA 4080 

AGCTCGACAA AGCGCTTCGA CGTCAAGTTT CCTGCAAGCA TACTCAATTA TTGGTATCGA 4140 

CGATX3AGAAG ATTAAAGAAA ATTTACGAGA AGTTTCTGGA CAACCTTATG TTGTAGAAAA 4200 

TOGCTATTTA TTCGTCTTTG TTATTGATTA TTATCGTCAT CATTTAGTTG ATCAACATGC 4260 

TGAAACTGAT ATGGAAAATG CATATGGTTC AACX3GAAGGT TTGCTAGTAG GTGCAATCGA 4320 

TGCAGCATTA GTTGCCGAAA ATATTGCGGT AACTGCTGAA GATATGGGGT ATGGCATTGT 4380 

CTTTTTAGGA TCATTAAGAA ATGATGTTGA ACGCGTTCX5A GAAATTTTAG ACTTACCTGA 4440 

CTATGTCTTC CCGGTATTTG GTATGGCAGT AGGGGAACCC GCAGATGACG AAAATGGTGC 4500 

AGCCAAGCCA CGCTTACCAT TTGACCATGT CTTCCATCAT AATAAGTATC ATGCTGATAA 4560 

GGAAACACAG TATGCACAAA TGGCAGATTA CGACCAGACA ATCAGCGAGT ACTATGATCA 4620 
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TO 






CAAAGCAAGA TTAGATATGT TAGAACAATT GCAAAAATCA GGCTTAATAC AGCGATAgCA 4740 

AGATACCAAA ATAACCCGCC CCCCTCTAGC TTAAAATGAT AAGTATAGCT AGAGGGGGCG 4800 

GGTATTTCTT GCAATGAATT AGTGTGAAGT TAATGCAGCA TTATCATTTG AATCGAAAGT 4860 

ATCTTTATCC CAATGTTTAG TTAACTTGGC GGTACCTGTA CCAGCTAGCA TTGAATCGTT 4920 

CACGTTTAAT GCTGTTCTAC CCATGTCAAT CAATGGTTCA ACGGAGATGA GCACGCCGGc 4980 

TAAAGCGACT GGCAAGTTTA ACGTTGACAA CACCAATATG GATGCAAATG TAGCCCCGCC 5040 

ACCGACGCCA GCAACGCCGA ATGAACTAAT AATCACGACA GCGATTAACG TTACAATAAA 5100 

TTCTAAATCA ATTTCTACAT TAGCGACGGG TGCGACCATA ATTGCAAGCA TGGCAGGGTA 5160 

AATGCCTGCA CAACCATTTT GTCCAATCGA CAATCCAAAT GTCGCAGCXy^ AATTGGCAAT 5220 

ACCTTCTGGC ACGCCTAGAC GTCTTGTTTG TGTTTGTACA TTCAATGGTA AGGCACCCGC 5280 

GCTTGAGCGT GATOTGAATG CAAAGATTAA TACTTCCAAA GTCTTTTTAA CATAGCGAAT 5340 

TGGGCTAATA CCTAACAGGC TTAAAATAAT TAAGTGAATG ATATACATCG TAATTAATGC 5400 

AGCGTACGAT GCGATTAAGA ATTTTCCTAA AGTCCAAATG GCGCCAAAGT CACTTGTCGA 5460 

TAATGTGTTG GCCATAATTG CTAATACACC GTATGGCGTT AAACGTAAGA CGAACGTCAC 5520 

AATCGCCATT ACTAGTGAAT AGATAGCGTC AATCGCACGC TTAAGCAATT CACCATGATC 5580 

AGGTTGTTTG CGTnTACGCG TAAATAAGCA AATCCTATAA ACGAAGCAAA TATCACGACA 5640 

GCAATCGTGG aAGTTGCACG TTGTCCaGTG AAATCTAAGA ATGGATTTTT AGGCAATAAT 5700 

TCCAAAATTT GTTGTGGTAA CGTATGTGCT GTTAAATCTT TCGCTTGTTT AGCAATTTCG 5760 

CTTCCACGTG CTTGTTCAGC GTTACCAAGG TTAATTGTTG ATGCATCTAA ACCAAACACC 5820 

AAGGCATACA CAACACCAAC AATCGCAGCA ATGGTGACAG TGCCAATTAA AAAGATAAAA 5880 

ATGAiACTAC CAATTTTAGC AAACTTTTCT CCGATTTGAA TTTTAGTGAA TGCAGCTACA 5940 

ATAGAAATGA AAATTAAAGG CATAACAATC ATTTGCAACA AtGCAACXSTA ACCTTGTCCG 6000 

ACAATGTTGA ACCAGTCACT TGTTGATGTA ATAACATTCG AATGTGTGCC ATAAATAAGA 6060 

TGCAATAACA CACCGAATAC TATACCAATC CCTAAAGCTG TAAACACACX5 TTTCGCAAAA 6120 

GATATATGTT TGCGAGCCAT CATGTGCAAT ATTACGATGA AAATCACCAA TACAATAATA 6180 

TTAATCAGTG TAAGAAAAGC ATTCATGAAC GTCACTCCTT AAATTTTTGA ATATAATTCC 6240 

GACTAGTATG CT 6252 

(2) INFORMATION FOR SEQ ID NO: 51: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6730 base pairs 
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(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

ATCAAATCnC AAAATATTTA TTAATnAnAA GGGGATTATC CaTGTgAGAA ACAAAGTAAT 60 

GCTCTTTTTT TACCTCTTGT GGGTTGAAAA aTGGATCATC AGAGATAGAC TTCTTCTTTT 120 

TCGAAGATGA CATTTGATAC TTTAATCTTC TAAAACCATA ACTTGTCGCA TCAAAAATGC 180 

CTTCTTGTAC AAGTAAAATC AAAAATATGC TAATAAAAAT AATTAATGAA ACATAAAACA 240 

ATATATTTAA ATATGTAATG ATAGTATGGC TATTAAAAAG CCATATAATA AACGTTAATA 300 

TTGGCGTTAT TAGTGCCATT CCAAGCCATT TTTTCAACAT TTGATCACTC CCACTTATAG 360 

AAAACTCTTA CGCATAGTTT ACATTAAAAT CAGACATTGA GGAATGATTT TTTAATTTCT 420 

TCAGCTTTAT TGAAATTCTA AAATCAATCA TTCTTCATTA GTTTAAAGCA AAAAAATATT 480 

OATATATAGT AAATATTGTA TATATAATAT TAGTTAAGAT TTCaGAAAAT TTTGAAGGGA 540 

ATGGAAATTT AGAAATCGGA ATTTGTTAGA GGAGGGGATT A6ATGGGGAA ATATATTTTC 600 

AAACGATTTA TTTATATGCT TATTTCTTTA TTTATTATTA TTACAATTAC ATTTTTCTTA 660 

ATGAAATTAA TGCCAGGTTC GCCATTTAAC GATGCTAAAT TAAATGCTGA ACAAAAAGAA 720 

ATTTTAAATG AAAAATATGG ATTAAATGAT CCTGCAGCTA CGCAgTATTT ACATTATTTA 780 

AAAAATGTTG TTACAGGCGA TTTTGGTAAT TCATTCCAGT ATCATAATCA ACCTGTGTGG 840 

GATTTGATTA AACCGAGACT ACTACCTTCT TTTGAAATGG GTCTTACAGC AATGTTCaTC 900 

GGTGTGATAC TGGGACTTAT TTTAGGTGTT GCAGCAGCTA CTAAACAAAA TTCTTGGGTT 960 

GACTATACAA CTACAGTTAT TTCAGTTATT GCAGTATCTG TACCATCTTT TGTACTTGCT 1020 

GTACTTTTAC AATATGTATT TGCAGTTAAA TTAAGATGGT TCCCAGTAGC TGGATGGGAA 1080 

GGTTTTTCGA CCGCGGTATT ACCGTCACTT GCATTATCTG CAGCTGTTTT AGCAACTGTC 1140 

GCCAGATACA TAAGAGCAGA GATGATAGAG GTATTAAGTT CAGACTATAT TTTATTAGCG 1200 

AGAGCTAAAG GTAATTCGAC AATGCGTGTA CTTTTTGGAC ATGCACTTAG AAATGCTTTA 1260 

ATTCCAATTA TTACAATTAT CGTTCCCATG TTAGCAAGTA TTTTAACAGG CACTTTAACA 1320 

ATTGAAAATA TTTTTOGAGT TCCTGGATTA GGGGATCAAT TCGTACGTTC AATTACAACA 1380 

AATGATTTCT CAGTAATCAT GGCAATCACA CTATTATTTA GCACACTGTT TATCGTTTCT 1440 

ATTTTTATTG TAGATATTTT GTACGGTGTG ATAGATCCAC GAATTCGTGT TCcAAGgAGG 1500 

TAAAAAATAA TGGCTGAAAA TAAAAACAAT TTGTCGATTA ACGACGATCA TTCTAATGCA 1560 
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TGAATCAGGA ACCTGAAATG CAACGAGAAA GCAAAAACTT TTGGCAAGAT GCTTGGGCTC 1680 

AGTTAAAACG AAATAAGTTA GCTGTTGTCG GTATGATAGG TTTAATTATC ATTGTAATAT 1740 

TTGCTTTTAT CGGTCCAGTT ATAAATAAAC ATGATTATGC TGAACAAAAT GTAGAACATA 1800 

GAAATCTTCC GGCAAAAATA CCTGTATTAG ACAAAGTTCC ATTTTTACCT TTTGATGGTA 1860 

AAGATGCAGA TGGCAAGGAT GCTTATAAAG CAGCAAATGC TAAAGAAAAT TATTGGTTTG 1920 

GTACTGATCA GTTGGGTCGA GATTTATGGA CAAGAACATG GAAAGGTGCT CAAATTTCAT 1980 

TGTTTATCGG TGTTGTTGCA GCGATGTTAG ATATTTTTAT TGGTGTTGTA TATGGTGCGA 2040 

TTTCTGGATT CTTCGGTGGA CGTGTCGATA CX3ATTATGCA ACGTATACTT GAAGTCATAG 2100 

CATCTATTCC GAATTTAATT GTCGTAATTT TATTTGTATT AATTTTTGAA CCATCCATTT 2160 

GGACAATTAT ATTOGCTATG TCTATCACAG GCTGGTTAGG CATGAGCAGA GTTGTACGTG 2220 

GAGAATTTTT AAAATTAAAA AATCAAGAGT TTGTCATGGC TTCGAAAACA TTGGGGGCTT 2280 

CAAAATTCAA ATTGATATTT AAGCATATTT TACCTAATAC ATTAGGTGCT ATCGTGGTTA 2340 

CATCAATGTT TACAGTACCT AGTGCTATTT TCTTCGAAGC ATTTTTAAGT TTCATTGGTA 2400 

TAGGTGTACC CGCACCTCAA ACATCGTTAG GGTCATTAGT AAATGATGGG CGCGCAATGT 2460 

TATTAATTTA TCCACATGAA TTATTTATAC CAGCAATGAT TTTAAGTTTA TTAATTCTAT 2520 

TCTTTTACTT ATTTAGTGAT GGATTACGTG ATGCATTTGA TCCGAAAATG CGTAAATAAA 2580 

AAGGGGGCAT AGCATATGAC TGAAAGAATA TTAGAAGTAA ATGATTTGCA TGTTTCCTTT 2640 

GATATTACAG CAGGGGAAGT GCAGGCAGTG AGAGGCGTAG ATTTTTATTT GAACAAAGGG 2700 

GAAACATTGG CAATTGTTGG TGAATCAGGT TCAGGTAAAT CTGTAACAAC AAAAGCAATT 2760 

ACAAAATTAT TCCAAGGGGA CACAGGAAGA ATTAAAAAGG GAGAAATTTT ATTTTTAGGG 2820 

GAAGfiTTTAG CAAAAAAACC TGAAAATGAG TTGATTAAAT TACGTGGCAA AGATATTTCA 2880 

ATGATCTTTC AAGATCCAAT GACATCTTTA AACCCAACGA TGCAAATTGG TAAACAAGTC 2940 

ATGGAACCAT TAATTAAGCA CAAAAATTAT AGTAAAGCAC AAGCTAAAAA GCGCGCATTG 3000 

GAAATACTAA ATCTTGTAGG TTTACCAAAT GCAGAAAAAA GATTTAAAGC ATATCCTCAT 3060 

CAATTTTCAG GTGGACAAAG GCAAAGAATT GTTATTGCAA CCGCATTAGC TTGTGAACCT 3120 

AAAGTGCTCA TTGCTGATGA ACCAACGACT GCATTAGACG TAACGATGCA GGCACAAATT 3180 

TTAGATTTAA TGAAAGAACT ACAACAAAAA ATCGATACAG CAATTATTTT TATAACGCAT 3240 

GATTTAGGGG TTGTTGCGAA TATTGCTGAT AGAGTGGCAG TTATGTATGG TGGTCAAATG 3300 

GTTGAAACAG GAGATGTTAA CGAAATATTT TATGATCCAA AGCATCCATA TACATGGGGA 3360 
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GGAGCGCCAC CTGATTTATT ACACCCACCT AAAGGTGATG CATTTGCGAG ACGTAGcAAT 3480 

ATGCATTAGA TATTGATTTT AAAGTAGAAC CACCGTGGTT TAAAGTTTCA CCGACACATT 3540 

TTGTGAAATC TTGGTTATTA GACGCACGTG CACCAAAAGT TGAACTACCC GAGCTGGTAA 3600 

AACAACGTAT GAAACCGATG CCTAATAATT ATGAAAAACC ACTCAAGGTA GAAAGGGTGT 3660 

CGTTCAATGA AAAATGATGA AGTGCTATTA TCTATTAAAA ATTTAAAGCA ATATTTTAAC 3720 

GCAGGAAAGA AAAACGAAGT GgaGCGATTG AAAATATTTC GTTTGATATA TACAAAGGGG 3780 

AAACATTAGG TTTAGTAGGA GAATCGGGGT GTGGTAAATC TACAACTGGT AAATCAATTA 3840 

TTAAACTTAA TGATATTACA AGTGGAGAAA TTTTGTATGA GGGTATTGAT ATACAAAAGA 3900 

TTCGTAAACG TAAAGATTTG CTTAAATTTA ATAAAAAGAT ACAGATGATT TTTCAAGACC 3960 

CATATGCGTC TTTAAATCCT AGGTTAAAAG TAATGGATAT AGTAGCTGAA GGTATTGATA 4020 

TCCATCATTT AGCAACTGaT AAGCGTGACC GAAAAAAACG TGTCTATGaT TTACTTGaAA 4080 

CTGTTGGATT AAGTAAAGAA CATGCCAATC GCTATCCTCA TGAATTTTCA GGTGGaCAAC 4140 

GCCAACGTAT TGGaATTGCC CGTGcATTAG CCGTTGaACC AGAATTCATT ATCGCGGACG 4200 

AACCAATATC GGCATTGGAT GTTTCAATCC AAGCTCAAGT AGTTAAT7TA TTATTAAAAT 4260 

TACAACGTGA AAGAGGGATT ACGTTCCTAT TTATAGCTCA TGATCTATCA ATGGTGAAGT 4320 

ATATTTCAGA TCGTATTGCA GTCATGCATT TTGGGAAAAT AGTTGAAATT GGACCGGCAG 4380 

AAGAAATTTA TCAAAATCCA TTACACGATT ATACTAAGTC TTTATTATCA GCCATTCCAC 4440 

AACCTGATCC TGAATCAGAA CGCAGTCGCA AACGATTTAG TTATATTGAT GATGAAGCAA 4500 

ATAATCATTT AAGACAATTA CATGAAATTA GACCGAATCA CTTTGTCTTT AGTACTGAAG 4560 

AAGAAGCGGC ACAACTACGA GAAAATAAAT TGGTGACACA AAATTAAGGG GAAGGGGGAA 4620 

atgCaatgac GAGAAAATTT AGAACACTTA TTTTAATTTT GATTGCTACA ATTGCATTAA 4680 

GTGGTTGTGC TAATOACGAT GGTATTTATT CAGATAAAGG TCAAGTATTC AGAAAAATTT 4740 

TGTCATCAGA CTTAACATCC CTTGATACAT CATTAATAAC GGATGAAATA TCTTCTGAAG 4800 

TGAcTGCGCA AACATTCGAA GGTTTATACA CATTAGGAAA AGGTGACAAA CCGGTGTTAG 4860 

GTGTTGCGAA AGCTTTTCCT GAAAAGAGTA AAGATGGTAA AACTTTAAAG GTTAAATTAA 4920 

GAAGCGATGC TAAATGGAGC AATGGTGACA AAGTGACTGC ACAAGACTTT GTTTATGCTT 4980 

GGAGAAAAAC AGTTGACCCT AAAACAGGTT CTGAATTTGC ATACATTATG GGGGACATTA 5040 

AAAATGCGAG TGATATTAGT ACTGGTAAGA AACCTGTAGA GCAATTAGGT ATCAAAGCAT 5100 

TAAATGATGA AACATTACAA ATTGAATTAG AAAAGCCGGT TCCATATATT AATCAATTAT 5160 

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ACGGTACGGC AGCTGATAGA GCGGTATACA ATGGTCCaTT TAAAGTTGAT GATTGGAAAC 5280 

AAGAAGATAA AACCTTACTA TCTAAAAATC AGTATTATTG GGATAAAAAG AATGTAAAAT 5340 

TAGATAAAGT GAATTATAAA GTTATTAAAG ACTTACAAGC CGGTGCATCA TTGTATGATA 5400 

CTGAATCAGT AGATGACGCA TTTATTACTG CAGATCAAGT AAATAAATAT AAAGACAACA 5460 

AAGGATTAAA CTTTGTGTTA ACGACTGGGA CATTTTTTGT AAAAATGAAT GAAAAACAAT 5520 

ATCCTGATTT TAAAAACAAA AATTTAAGAT TGsTATCGCA CAAGCAATAG ATAAAAAAGG 5580 

ATACGTTGAT TCAGTGAAAA ACAATGGCTC AATTCCTTCC GATACACTAA CAGCCAAAGG 5640 

AATTGCGAAA GCGCCTAATG GCAAAGATTA TGCGAGTACC ATGAATTCGC CTTTAAAATA 5700 

TAATCCTAAA GAAGCAAGAG CACACTGGGA CAAAGCTAAA AAAGAGTTAG GTAAAAATGA 5760 

AGTGACATTT TCAATGAACA CAGAAGATAC ACCAGATGCA AAAATATCTG CTGAATATAT 5820 

CAAATCGCAA GTTGAGAAAA ATTTACCAGG AGTTACTTTG AAAATTAAGC AATTACCGTT 5880 

TAAACAAAGA GTATCACTAG AACTGAGTAA CAATTTTGAA GCATCACTTA GTGGTTGGTC 5940 

TGCAGATTAC CCTGATCCTA TGGCTTATTT AGAAACAATG ACCACAGGTA GCGCACAAAA 6000 

TAATACAGAC TGGGGTAATA AAGAATATGA TCAATTACTT AAAGTAGCAA GAACCAAATT 6060 

GGCACTTCAA CCGAAC6AAC GATATGAAAA CTTGAAAAAA GCAGAAGAAA TGTTCCTAGG 6120 

AGATGCACCG GTAGCACCAA TTTATCAAAA AGGTCTtGCA CATTTaACAA aTCCTCAAGT 6180 

AAAAGGATTA ATTtACCATA AATTTGGTCC AAATAACTCA CTTAAACATG TATATATTGA 6240 

TAAATCGATA GATAAAGAAA CAGGTAAGAA GAAAAAATAA TATGCTTTGT AAATTAGGCT 6300 

GGAGACATAT CTCCAGTCTT TTTGTGTTGG ATAAAAaCTT TGGGAATAAA AATTTAAAAT 6360 

AAGTCGTTTT TTAAATTACT GAAATTGATT AAATGCATAA ATAACTGAAT ATTCTAAAAA 6420 

TAA/CtTTGTA ATAATTTTTT CTATGAGTAA ACTAAAAAGA AAAAATTAGA TTGAAAGTAG 6480 

GAGGCATATG TATGGGGAAG CTAATTAAAT ATATTTCAAT ACTTCTTATT GTCGTTTTAG 6540 

TGTTGAGTGC TTGCGGAAAA AGCAGTAATA AAGATGAAGG AGTAAAAGAT GCTACTAAAA 6600 

CGGAAACCTC AAAACATAAA GGTGGTACCT TAAATGTAGC ATTAACAGCA CCX3CCAAGTG 6660 

GTGTTTATTC TTCGTTATTA AATAGTACAC ATGCAGATTC TGTAGTTGAG GGATATTTTA 6720 

ACGAAAGCrr 6730 


(2) INFORMATION FOR SEQ ID NO: 52:







(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6482 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
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10 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

AATTTTTGTC ATTATTAAAA ACCTCGCTTT TAAAAGATTG AAAAGTAAAT GAGTGAAATT 60 

AAAGATTATG CACATTAAAA TCACGCCACA ATTTAATTGT GAAAAATATC ACAAATATAT 120 

TATAACACTA AATTTCCCAA AATTCAAAAG TGTGTTTTAT TGCAGAAAAC TTATAACAyG 180 

TGCACAAGTT ATAGTGAATT GCAAACGGAT TACTTTAGTC TTTTTAAAAC ATGAAGTATA 240 

ATTTGTATAG CAATAAATAT AAAAATGGGA GGCTATGTTC AATGAGCAAT ATGAATCAAA 300 

CAATTATGGA TGCATTTCAT TTCAGACATG CGACTAAGCA ATTCGATCCA CAAAAGAAAG 360 

TTTCGAAAGA AGATTTTGAA ACAATATTAG AGTCAGGTAG ATTGTCTCCA AGTTCTCTTG 420 

GGTTAGAACC TTGGAAGTTT GTCGTGATTC AAGATCAAGC GTTACGTGAT GAATTAAAAG 480 

CGCACAGTTG GGGCGCAGCA AAACAATTAG ATACAGCGAG CCATTTTGTG CTAATTTTTG 540 

CGCGTAAAAA TGTAACGTCA AGATCACCGT ATGTACAACA TATGTTAAGA GATATTAAAA 600 

AATATGAGGC ACAAACGATT CCAGCTGTTG AACAAAAATT CGATGCATTC CAAGCAGATT 660 

TCCATATTTC TGATAATGAT CAAGCCTTGT ATGACTGGTC AAGTAAACAA ACGTATATTG 720 

CATTAGGCAA TATGATGACG ACAGCCGCAT TGTTAGGTAT TGATTCATGT CCGATGGAAG 780 

GTTTTAGTCT GGATACAGTG ACAGACATTT TAGCAAATAA AGGGATCTTA GATACTGA6C 840 

AATTTGGTTT ATCAGTGATG GTCGCATTTG GCTACAGACA ACAAGAGCCA CCGAAAAATA 900 

AAACACGCCA AGCTTATGAA GATGTTATTG AATGGGTTGG ACCAAAAGAA TAAATAGAAT 960 

ACCGTATGTC TAAATATATA AAATTAAAAA GTTAGCAATA AAAAAGCCTG CGATTACATA 1020 

AATGAATCGC AGGcTTTTGC GTGAAAAAAT TGTATTAATA AAGTATGGAT GATTATTTTT 1080 

CTGGfiACAAG GTCAGTATTT GAATGAACTG TGATGTCAAA CCCTTCTGGT GCCGTAAATG 1140 

TATGTGTTGA GGCGTCGGGT TGATAAATAT CAACATGTGT TAATCCATAA CTTTGTGAAT 1200 

TGTTTTGTCT TGCTTGATTG GATTGCCAAG TATTAGCAGC AATATGATGG TGATAATGAT 1260 

TCGTTGACAT AAATAGCGCA CGTGGAAAAT CAGACACATG TTGGAATCCT AATTGTTCAA 1320 

TGTAACATTG ATATGCTGCG TCTAAATCAT GTGTTTTTAA ATGTAAGTGT CCAATCATGC 1380 

CTTTTGCTGG CATTCCTTGC CAACCTTCAT CAGTACGATG TGTTAATAAG GTTTGGCTAT 1440 

CAACTTCTAA AGTATCCATT TTAACTTTGC CATTTTGCCA TTCCCATGAA GATGAAG6TC 1500 

TATCGCGATA GACTTCAATA CCATTACCTT CGGGGTCGTT GAAATATAAA GCTTCACTTA 1560 

CTAAATGATC ACCAGCGCCG ATGCCCATAT TTTTTTGTGC CAOGAAATAT AAGAAGTTAG 1620 
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aAGTCTGACG GcCGTCTTCT AATAAATGTA ACGTTAGAGT ATGGcCACCA GTCCCAACAG 1740 

ATAATACGGT TGTATTATCG TCAGAACTTT TAACGGATAG TCCTAAAATG TTTTTGTAAA 1800 

ATGTTGTCAT TAAGTCTAAG TCTCTTACGT TCAGTACAAT GTTTGTCACT TGTGTTGCTG 1860 

TTTTATCGTG AAATGCCATT ATGCATCGCC TCTTTTTCTA TTTTTCTATA AGTTAGTATA 1920 

AAAAGTATAC CAGAAAAGAA AATGAATTGA TAGCATAAAG TTTGAAATGC AAAATAACTA 1980 

GTCGTTTTGC AATTTTAtAT TGATGCGAAC AAAAAAGCGA TGGTACAGTT GCACCATCGC 2040 

AAAATTTATT TAACCAAGAT ATACATCTTG ATATGAATCT TCTTTTTCTA ACATATGTTT 2100 

GGCAAATGAA CATGAGGCAA TAATTTTCAA ATTATTTTCT CGAGCGTGTT CAACAACTGC 2160

TTTAAGTAGT TTTTTGCCAA CACCTTGACC ACCAAGTTCA TCAGATACGC CTGTATGATC 2220 

AATGTTAATT TCATTATTAT CCACAAAACG GTATGTGATT TCAGCTAAAG CATTATTTTC 2280 

ATCATCACCA ATATAGAATT TGTTCTCGCC TTGTTTGATT TCAAGGTTAC TCATACATAT 2340 

CAACTCCTAT CATGATTGAT TATAGTATTT CCCTATTCTA TTTTAACTTA AACGAAGTCA 2400 

AAGGTGCATG ACAGTCATGT GACGACATTG CCACATCTAT GTAGTCGTTT TTATTAAGCA 2460 

CAGTTTGAAA TGAAGATGAA AACACGTATC TTGACATTAA ATCTATTCAG CTATATAATT 2520 

TATCTCGAAA TCGAAATAAA ATAAAAAAGT TGGTGATCAT ATGGATCGAA CGAAACAATC 2580 

TCTCAATGTT TTTGTCGGAA TGAATAGGGC GTTAGACACA TTAGAGCAAA TTACAAAAGA 2640 

AGACGTAAAG CGATATGGCT TAAATATTAC TGAATTTGCA GTGCTCGAGT TGCTTTATAA 2700 

TAAAGGTCCG CAACCAATTC AACGTATTAG AGACCGCGTA TTAATTGCAA GTAGCAGCAT 2760 

TTCATAT6TT GTAAGTCAAT TAGAGGACAA AGGTTGGATT ACACGTGAAA AGGATAAAGA 2820

TGATAAAGGT GTATATATGG CTTGTTTAAC TGAAAAAGGT CAAAGTCAAA TGGCAGATAT 2880 

TTTGCCTAAG CATGCTGAGA CATTAACAAA AGCGTTTGAT GTGTTAACAA AGGATGAATT 2940 

AACAATCTTA CAACAAGCGT TTAAGAAACT AAGTGCACAA TCTACAGAAG TGTAAGGCGT 3000 

GCACTAAAAA TTTACATTAA AGTATCTCGA TTTCGAGATA AATGCACTAA AAATATAAAG 3060 

AGGGTATATA AAATGATAAA TAATCATGAA TTACTAGGTA TTCACCATGT TACTGCAATG 3120 

ACAGATGATG CAGAAOGTAA TTATAAATTT TTTACAGAAG TACTAGGCAT GCGTTTAGTT 3180 

AAAAAGACAG TCAATCAAGA TGATATTTAT ACGTATCATA CTTTTTTTGC AGATGATGTA 3240 

GGTTCGGCAG GTACAGACAT GACGTTCTTT GATTTTCCAA ATATTACAAA AGGGCAGGCA 3300 

GGAACAAATT CCATTACAAG ACCGTCTTTT AGAGTGCCTA ACGATGACGC ATTAACATAT 3360 

TATGAACAGC GCTTTGATGA GTTTGGTGTT AAACACGAAG GTATTCAAGA ATTATTTGGT 3420 
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TTAAATGAAG GGGTAGCACC TGGTGTACCT TGGAAGAATG GACCGGTTCC AGTAGATAAA 3540 

GCGATTTATG GATTAGGCCC CATTGAAATT AAAGTAAGTT ATTTTGACGA CTTTAAAAAT 3600 

ATTTTAGAGA CTGTTTACGG TATGACAACT ATTGCGCATG AAGATAATGT CGCATTACTT 3660 

GAAGTTGGCG AAGGAGGCAA TGGTGGCCAG GTAATCTTAA TAAAAGATGA TAAAGGGCCa 3720 

GCaGCACGTC AAGGTTATGG tGAGGTACAT CATGTGTCAT TTCGTGTGAA AGATCATGAT 3780 

GCAATAGAAG CGTGGGCAAC GAAATATAAA GAGGTAGGTA TTAATAACTC AGGCATCGTT 3840 

AATCGTTTCT ATTTTGAAGC ATTATATGCA CGTGTGGGGC ATATTTTAAT AGAAATTTCA 3900 

ACAGATGGAC CAGGATTTAT GGAAGATGAA CCTTATGAAA CATTAGGCGA AGGGTTATCC 3960

TTACCACCAT TTTTAGAAAA TAAAAGAGAA TATATTGAAT CGGAAGTTAG ACCTTTTAAT 4020 

ACGAA0CX5TC AACATGGTTA ATTGGAATGA GGAGGATTTG TGATGGAACA TATTTTTAGA 4080 

GAAGGACAAA ATGGTGCGCC AACACTAATA TTATTGCATG GTACAGGTGG TGATGAGTTC 4140 

GATTTATTAC CGTTAGGCGA AgcATTGAAT GAAAATTATC ACTTGTTAAG TATTAGAGGA 4200 

CAAGTTTCAG AAAATGGGAT GAACCGTTAT TTCAAACGTC TTGGTGAAGG TGTTTATGAT 4260 

GAAGAAGATT TGGCATTTCG TGGACAAGAA TTGTTGACGT TCATTAAAGA AGCTGCTGaA 4320 

CGTTATGATT TTGaTATTGA AAAAGCAGTA CTTGTTGGAT TTTCAAATGG ATCAAATATA 4380 

GCGATTAACT TAATGTTGCG TTCAGAAGCA CCATTTAAAA AAGCATTGTT ATATGCACCG 4440 

TTATACCCAG TTGAAGTAAC GTCAACAAAG GATTTATCAG ATGTCAGTGT GTTGCTTTCT 4500 

ATGGGGAAAC ATGATCCAAT TGTGCCATTA GCTGCAAGTG AACAAGTCAT TAACTTGTTT 4560 

AATACACGTG GGGCACAAGT CGAAGAAGTT TGGGTGAAGG GCCATGAAAT TACAGAAACT 4620

GGATTAACGG CTGGTCAACA AATACTTGGG AAATAACAGT TCTATTAAGA AGGGGACAGA 4680 

TGGAAAAGAT TTTrACTTTT CATCTGCCCG CTTTTTTGAT TTTGAAGTGC TGTACTAAAT 4740 

TTTACAATAG TATAGATATT TTAATCGATA TGAGATTTGC CGGTAATACG CTTAATTAAA 4800

CCTTTATAGA GTACAGGTAT GAGTAAGATG AAACCGAACA ATCCCATAAT AGGGAATACT 4860 

TTTCCAATTA ATGAAATGAa ACCGATAAAT GTACTAATAT AAGTGATGAC AGCCATTGTA 4920 

ATAATAATGA TGAAGTAACG TCTGCTGAAT GGAACX3CTGA AACGTGACGC AAATGCATAC 4980 

ATTAATCCAA CAACAGTATT GTAGATGACA AGTATCATAA TGACAGACAT AATAATACCA 5040 

ATTGACGGAG ACATTTGTGT CGCTAATTTT AATGTAGGTA GATCTACGTG TTTAATTTTA 5100 

TCGAATTGAG AAATTAAACC TAGATTAATC ATCATGAGTA AAAATGTAAT GATTAAACCG 5160 

CCAATCAAGC CCCCGTATAA CGTTGAGTCA CGATATTTAA CTTTACTACC CATCACTGAT 5220 
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10 



CCAGGTGATA ATGATTTCTG CTTATGAATC TGAGCATCAT TATTAGCGGC AGTAAAATCA 5340 

AGATGACTTG TTGTGAAATA GTAGACCGCA ATCATAATGA CAATCGCAAT TAAAAATGGG 5400 

GTAACACCGC CAAGCACAGC AATTAAACGA TCGAATTTTA GAAACAGTGT TGCTAAAATA 5460 

AAGGCX3ACTA ATATGAGTGC GCTCAGCCAA TACGGTAAGT TGAAACTTTG ATGAATGGTT 5520 

GACGCACCAC CTGCAGTCAT AATAATAGCT AAAGACAACA TAAACATTGT TAAAATAATA 5580 

TCAAAACCTC TTGCAATAGA GGGGTATAAG AAATAGTTAA TTGAATCAGA ATGATTTCTG 5640 

GACTTTAGAT OATOACCTGT ATGCATGACA ACCATTCCAC CTAAAGTAAT CAATAGTCCT 5700 

GTTACAATAA TGCCTGAAAT GCTATATGOG CCATGACTTG TGAAAAACTG GAAAATTTCT 5760 

TGACCAGTAG CAAAOCOGGC ACCAACGACA ACACCAACAA AGGCAAATGC CACAATAATG 5820 

GACTCTTTTA AGATACX3CAT GATTTAAAAA TGTCCCTTCG TAATTTTAAG TAATATAGAA 5880 

AATGTAACAT ACATGTTAAT GAAAAATATA GTACTAATAT AGTATTTTGT TAAATTGGAG 5940 

TAGAAGCGAG GGTGTCX3GTC ATTTCATTAA TTTATTAGTT GATTTTGCAT TTTTTTGCTG 6000

TAAAGTTGTT ATAATACAGT TAACAGGAAT TAGCATAGAT ACACCAATCC CCTCACTACT 6060 

CGCAATAGTG AGGGGATTTT TTTCGGTGTA GCTAGGTCGC CTATTTATCA TCGTGTTTGC 6120 

GTAgCaATGC GTAAACACAG TACCACTAAA TAAGTGCACG ATACATGCAT CAAATGTCGT 6180

CTTTAGTCTA AGTAACX5ATC ATGCATTAAC ATTTTCAAAA TATCTATTTG AGCTTGAAGA 6240 

TCTTTACCAA TATTGGTATC ACGAATCTTC TTACGTTGTA ATTCTTTATC TACGACGCGC 6300 

TTTATAGAAA GTTCATCGAT ACCTTCGGAA AGTATTTTTn CTTTAGCGTT AAATTGTTGG 6360 

TGTGCAACGA GTTCCATACC GAATGAATTA TACAATAGTG TATAGCCTGC AATGCCAGTn 6420 

GTTCACTGAT AAGCTTTTGA AAAGCCACCA TCAATGACAA GCATCTTTCC ATCAGCCTTG 6480 

AT 6482

(2) INFORMATION FOR SEQ ID NO: 53:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 16592 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 
ATTTAAGGCG ATTGCTTGTG TATTTCTCTC TTTTGTAGGC AAACCTGCAC TCGTTCCAAA 60 
AAATGTAACT TCCATATATG CCCCTCCTTT TCTTCAATTC ATTTTATCAT AAAATTTGTA 120 
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AATTTTTCTA ACTTTAACGT AGACATAACT ATATAAATTT TGATAATTAC GTTATACTTA 240 

TCATTAATAA GTATCACATT AAACATGATA CATGAATCGA TATTTCATTT AAGACACTGC 300 

ATACAGTCGA GCATATTGTA TGACCTACTG AATGGATTAT CTTATAATAA TAAATCATAT 360 

ATCTAATTAA GAATTGAGGT TTTAATCTTG AGTACTAAAA ACAAACACAT CCCATGTTTA 420 

ATCACAATCT TTGGTGCACT GCGTGACTTA AGCCATCX3TA AGTnGTTTCC ATCAATATTC 480 

CATCTCTACC AACAAGACAA TTTAGATGAA CATATTGCCA TcATCgGTAT TGGACGTCGT 540 

GACATkwnTA ATGATQATTT CCGTAATCAA GTAAAATCAT CAATTCAAAA GCACGTAAAA 600 

GATACAAACA AAATTGACGC GTTTATGGAA CATGTCTTCT ATCATAGACA TGATGTTAGT 660

AATGAAGAAA GCTATCAAGA ATTACTAGAT TTTAGTAATG AATTAGATAG CCAATTTGAA 720 

TTAAAAGGTA ATCGACTATT CTATTTAGCA ATGGCACCAC AATTCTTTGG CGTTATTTCT 780 

GATTATCTAA AATCTTCTGG TCTTACTGAT ACAAAAGGAT TTAAACGCCT TGTTATCGAA 840

AAACCATTCG GTAGTGATTT AAAATCAGCC GAAGCATTAA ACAATCAAAT TCGTAAATCA 900 

TTTAAAGAAG AAGAAATTTA TCGTATTGAC CACTATTTAG GAAAAGACAT GGTTCAAAAT 960 

ATCGAGGTAT TACGTTTTGC GAATGCGATG TTTGAACCAT TATGGAATAA CAAATATATT 1020 

TCAAACATCC AAGTTACATC TTCTGAAATA CTAGGTGTTG AAGATCGTGG TGGTTATTAT 1080 

GAATCAAGTG GCGCGCTAAA AGATATGGTG CAAAACCACA TGTTACAAAT GGTTGcATTA 1140 

TTAGCTATGG AAGCACCTAT TAGTTTAAAT AGTGAAGATA TCCGTGCTGA GAAAGTAAAA 1200 

GTACTTAAAT CACTGCGTCA TTTCCAATCT GAAGATGTTA AAAAGAACTT TGTTCGTGGT 1260 

CAATATGGCG AAGGCTATAT CGATGGTAAA CAAGTTAAAG CATACCGTGA TGAAGATCGC 1320

GTTGCAGATG ACTCTAACAC ACCTACCTTT GTTTCAGGTA AATTAACAAT TGATAACTTT 1380 

AGATCGGCTG GTGTACCATT CTATATTCGT ACTGGTAAAC GTATGAAATC TAAAACAATT 1440 

CAAGTTGTCG TTGAATTTAA AGAAGTACCA ATGAACTTAT ACTATGgAAA CTGaTAAACT 1500 

GTTAGATTCA AACCTATTAG TAATCAATAT CCAACCTAAT GAAGGTGgTA TCTTTtACAT 1560 

CtAAATGcTA AGaAAAATAC ACAAGGTATC gAAACAGrAC CTGtCCmATT GtCTTACTCm 1620 

ATGaGCGcTC aAGaTAAAAT GaATACTGTA GATGCATATG AAAATCTATT ATTTGATTGT 1680 

CTTAAAGGTG ATGCCACTAA CTTCACGCAC TGGGAAGAAT TAAaATCAAC ATGGAAATTT 1740 

GTTGATGCAA TTCAAGATGA ATGGAATATG GTTGaTCCAG AATTCCCTAA CTATGAATCA 1800 

GGTACTAATG GTCCATTAGA AAGTGATTTA CTACTTGCTC GTGATGGTAA CCATTGGTGG 1860 

GGACGATATT CAATAATTGA ATTAAAACGC ACATGTTAAA CAAAAATAAA TGAGCGAATG 1920 
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TATATTATGA AATTATATTT TACAATGCCC 
AAAACTATTT TAATAATCAT TGAACAAATG 2040

GGTGTATAAT TTATAGAAAT AATGTAGAAT AAAAATAAAT GATTGAATTA ATTGGAGTGA 2100

AAGTTTTGGA CGTTATCAAG CAAATACAAC AGGCAATTGT TTATATTGAA GATCGTTTAT 2160

TAGAGCCTTT CAATTTGCAA GAATTAAGTG ATTACGTTGG TCTTTCGCCA TACCATCTTG 2220

ATCAATCATT TAAAATGATT GTCGGCTTAT CTCO^GAAGC TTATGCACGC GCGCGTAAAA 2280

TGACACTCGC TGCAAATGAT GTGATTAATG GTGCTACACG ACTTGTAGAT ATCGCTAAAA 2340

AATATCACTA TGCAAATTCA AATGATTTTG CAAATGATTT TAGTGATTTT CACGGCGTAT 2400

CACCTATTCA AGCCTCTACT AAAAAAGATG AATTACAAAT TCAAGAGCGA TTATATATCA 2460

AATTATCAAC TACTGAGAGA GCACCTTATC CATACAGATT AGAAGAGACA GATGATATTT 2520

CATTGGTTGG ATATGCACGA TTTATAGACA CTAAGTATTT GTCACATCCT TTTAATGTTC 2580

CGGATTTTTT AGAAGACTTG CTCATTGATG GTAAAATTAA AGAGTTACGA CGATATAATG 2640

ACGTTAGTCC ATTTGAACTA TTTGTTATTA GTTGTCCTCT TGAAAATGGT TTAGAAATAT 2700

TTGTAGOTGT ACCAAGTGAA CGTTATCCTG CACACTTAGA AAGTCGATTT TTACCTGGCA 2760

AACATTGTGC GAAATTCAAT TTACAAGGTG AAATTGATTA TGCAACTAAT GAAGCTTGGT 2820

ACTATATTGA ATCAAGTTTG CAGTTAACAT TGCCATATGA ACGAAATGAT TTATATGTTG 2880

AAGTGTACCC TCTCGATATT TCATTTAATG ACCCATTCAC TAAAATTCAG CTTTGGATTC 2940

CTGTTAAACA GAGTCCTTAT GACGAAGATT AAATAATAAA AAACAAAGAA GCCCCCTAAT 3000

ATATCTATAG GTCTACAAAT GGCCTTAGAT TCTATTAGGG GGCATATTAA TATGTTAATT 3060

TAGTTCGATA ACACATGCTT CATATGGACG TAACTGTTTT AAATTAACTT TGGCATCATA 3120

ATTAAATAGC TTTACTTCTC CATGGCTTAA ATCAAATGGT ACAGTTAATT CTGCTTCGTG 3180

GTTAGTAAGA TTACCTACAA TAAGAACTTG CTTTTCATTT AATGTTCTCG TGTACGCAAA 3240

AACTTGTGAA ttttcagcat ctactaaatc AAATTGACCA TATACGTATA CATCATTAGA 3300

CTTTCTTAAT TGAATTAAAT CTTTATAAAA TTGTAATACT GAATGCTCAT CTTCTAATTG 3360

TTGTGCAACA TTGATAGTTT TATAATTCGG ATTCACTOGG AACCACGGTT CACCATTTGT 3420

AAATCCTCCA TTTAACGTAT CATCCCATTG CATTGGTGTG CGAGAATTAT CTCGGTTCTC 3480

ATCTTTATAT TTCGCAAGTA AAGCGTCTAC ATCTCCACCT TGAGCTTTCA CTATTTGATA 3540

GTCATTTTTA ACAGCAACAT CGTTAAACGT TTCAATACTT TCAAATGGAT AATTCGTCAT 3600

ACCAATTTCT TGACCTTGAT AAATGAATGG CGTACCTTGT TGCAAGAAAT AAACAGCTGC 3660

ATGACTTGTT GCTGATTCAT ACCAATACTT GTCATCGTCA CCCCACGTCG ATACACGTCG 3720
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CCATCTATTT AATACAGATT TATACGAATT TACATCAAAG TGAGAATCAC CACTATTCCA 3840 

CAGTCCCAAA TGTTCAAATT GGAATATCAT ATTAAATTTA CCATTTTCTT CCCCGACCCA 3900 

GTCATCAGCA TCATCAGGGC TTACACCATT CGCTTCACCA ACAGTCATAA TGTCATACTT 3960 

ACTTAATGAG CGATCTTTCA TCTCTTGTAA CCAAGTTTGT ATACCTGGCT GATTCATATC 4020 

TACATCAAAT GCTGGGGCAT ATGTTTTACC CTCAGGTACA GGTAAGTCAC CCGCTTCAAA 4080

CGTCTTCTTA ATATGCGTAA TTGCATCTAC TCTAAATCCA TCAATGCCTT TATCAAACCA 4140 

CCAGTTCATC ATTTCAAATA CAQCATCTCT AACTTCCGGA TTACCCCAAT TCAAATCAGG 4200 

TTGTTTTTTA CTGAATAAAT GGAAATAATA TTGCTCAGTA TTAGCATCAT ATTCCCATGT 4260 

AGATCCATTA AATATACTTT CCCAGTTGTT AGGTTCAGAG CCATCTGGCT TTGGATCTTG 4320 

CCAAATGTAC CAATCACGTT TGGGATTGTC TTTACTAGAT TTGGATTCTA TAAACCAAGG 4380 

ATGTTCATCA GATOTATGAT TTACAACTAA ATCTAAAATA AGCTTCATGC CTCTATCATG 4440

AACACCTTTT AATAAACGAT CAAAGTCTTC CATCGTTCCA AATTCATCCA TAATCTCTTG 4500 

GTAGTCACTA ATATCATAAC CATTGTCATC ATTAGGTGAT TTAAACATTG GACTGAGCCA 4560 

AATGACATCG ATACCGAAAT CTTTTAAGTA GTCCAATTTA TCAATCATTC CAGGTAAATC 4620 

CCCAATACCA TCGTGATTAC TATCATTAAA ACTTCTTGGA TATACTTGAT ATGCTACTGC 4680 

TTCTTTCCAC CATTGCTTAT TCATTTTAAA ACTCCTTTGC TATCGCTGTG TTGATTTTCT 4740 

TATTTTTAAT TCTGTATCTA TAATGACGAG TTCAATAACA TCCTGTGCTT TGTTTTTCAA 4800 

TATATTTAAA ATTGCTGCAC CAGCCTGTTG ACCTAACATT CGAGGCTTGA TGTCAATACA 4860 

GGTTTGTGGT GGTGACGCAA TTTCGGTTAA ATAAGAATCA TTGAACGTTG CTGTCATTAC 4920 

ATCTTTCGGA ATTTCAATAT TAAGTTCATA TAGGACACTT AAAATCGCTA AATGTAACAT 4980 

AGCATCTAAC GAAATGATTG CCTGTTTAAT ATTTGGGTCC TTCAAACGCG TATGTAGATT 5040 

TTGCATGTAA TTAAAAATAA CTTCTCTTTC ATTACTAGTC TCAATAATTT GATAATTAAT 5100 

TTTATTTTGA GAAGCTATCG TTTCAAATCC TTGAATTCTA TCTTTTQAAA CTTCAAAATT 5160 

TCCTTTTTCT GTAATAAATA TTAATTCATC TACACCTTGT TCAATAACAT GTCGTGTCAA 5220 

ATTTTCAGAA GCTAATATAT TATCATTATC TATATGTGTA AATTGATGAT CTATATCCGA 5280

TGTAGGCTTA CCAATCACAA TAAATGGCAT GCTTTCATCA ATTAACATTT GTTTAATCGG 5340 

ATCATTTTCT TTTGAATAGA GCAGTATAAA CGCATCAACC ATTCGTTGTT TAATCATTTT 5400 

ATAAACTTCA TCCATTAAAT CATTCATATT ATTTGAGACT GTCGTTTGTO TACCATAGCC 5460 

ATGCTGGTTA CACGTTTCAG AAATTCCTA6 CAATACATTG ATGTAGAATG GATTCAGTCG 5520 
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AGTTCTAGCA GCGGTATTAG GAAAATAATT 
CAATTCTTCC ATAACTTTCT TCACTTTTGA 5640

AATTGTCGCT TCGCTAATAC GTTGATTTCC TTTTATAACT CTTGAAACTG TCGAAGGAGA 5700

AACACCGGCT TTTAGTGCAA CATCTTTAAT CGTAACCATT TAATCACCTC CTGTTAATTT 5760

CTGCATCGGA AAACGCTTCC AACCACTGTA TAATACCAGT TTAGTCACAC TTTCTAAAAA 5820

AGTCAAAAGA TTTGTGCAAA CGATTGCATA AAACGATAAA AATAAAACCT TCATACTGAA 5880

ATTCAATCCG AAAATCAATA TAAAGGTTTG TATAAATATT AAAATCGATT GTTTAGTCAC 5940

TAACTGCAAA ATAGTTACCT TGGCCATCTT GAAAATTAAA TACACGTTGA CCATTCATTT 6000

CTACTATATC ATGCCCAGTT AAACCTAAAT CATTTAATTT TGAGTATAAT GCATCAAAGT 6060

TTTTCrCTTT AAACATTAAA GATGGTGTTC CTAGGTTCAC TTCCGGGCTA TGCTTTTCAA 6120

TAAATTCTTT TGCCATAATC GTCAATGACG TTTCAGCATC TTTGGTAGGT GATACTTCAA 6180

CTGCAACATA GTCCTCAGCT AACGGTGTTT CACTTACAAC AACAAATTCT AAAGTTTCTG 6240

TCCAAAATGC TTTCGCTTTT TOGACATCAT CAACATATAA CATAACTTGA TTTAACTTTT 6300

CCATAAAATA GTACCTCTAT TTCTCTATAG TACATGCTAT CATAACACAG TAAATATTTT 6360

ATTACTTCAC AAAATGCTTA AAAATATGGC GGGATGCTTT TAAGGTCAAG GATAATACTT 6420

GTGTAATTTT TTATAGGTTG TAGCTACTCT ATCACACTCT CTTTTATATT TATCAAAAGA 6480

TATAAAAAAG GATAGTATCT TTCAACTATC CTTTAATCAA TATTATTCTT CAATCCATTG 6540

TGTATGGAAT ACGCCtTCTT TATCTTTTCT TTCGTACGTA TGAGCACCGA AGTAGTCACG 6600

TTGTGCTTGA ATTAAGTTTG CAGGTAAATC AGCAGCACGG TAACTATCAT AGTAATTAAT 6660

ACTTGATGAG AAACCAGGTG TTGGTACACC ATTTTGAACA CCAGTTGCGA CAACATCACG 6720

TAACGCATCT TGATATTCAG TAACGATGTT TTTAAAGTAA GGATCTAGCA ATAAGTTTTG 6780

taatcctgga ttattatcgt aagcatcttt GATCTTTTGT AAGAATTGTG CACGGATAAT 6840

gca;Cccttct CTCCAAATCA TAGCTAAATC ACCAAGTTTT AAATTCCATT CATTATCTTC 6900

acttgcttta cgcatttgcg cgaaaccttg TGCATAAGAA CAAATTTTAC TCATATATAA 6960

tgctttacga attttttcta aaaagtcttt CTTGTCACCA TCAAATGATG CTTTTGGACC 7020

ATTTAATTCT TTAGAAGCAT TTACGCGCTC TTCTTTGaTT GAAGAGATAA AACGTGCAAA 7080

TACAGATTCA GTAATGATTG TTAATGGAAT ACCTAATTCT AATGCGTTAA TTGAAGTCCA 7140

TTTTCCTGTA CCTTTTTGaC CTGCAGTATC AAGAATTTTT TCAACTAATG CTTCTTTATT 7200

TTCATCTAAT TTCATGAAAA TATCACCAGT GATTTCAATT AAATAACTTT CTAATTCACC 7260

AGCATTCCAG TCTTTGAACG TTTGAGCAAT GTCTTCATGA GACATGCCTA ATAATTCTTT 7320
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CATTTTCACA TAGTGTCCAG CACCATTAGG TCCAATATAA GTAACACATG AAGCACCX3TC 7440 

TTTTGCCTTT GCAGCAATTG CATCAAGAAT ATCTGCAACT TTGTTATAAG CTTCTTCTTG 7500 

TCCACCCGGC ATTAATGACG GACCAGTTAA CGCTCCAATT TCACCACCAG AAACGCCCAT 7560 

ACCAATAAAG TTGATTGCAC TTTGTGywAA TGCTTTATTA CGTCTGATAG TATCTTGATA 7620 

GTTTGTATTA CCACCATCAA TTAAAATATC TCCATCATCT AATAAAGGTA ACAAACTATC 7680

AATCGTTGCG TCCGTAGCTT TACCTGCTTG AACCATTAAT AAAATTTTAC GTGGTTTTTC 7740 

TAAAGAATTA ACAAATTCTT CCAATGAATA CGTTGGATGA ATATTTTTCC CTTTTGATTC 7800 

TTCAACCATT AAATCAGTTT TTTCACTTGA GCGGTTAAAT ACAGATACAC TATATCCGCG 7860 

TGATTCAATA TTCCAAGCTA GGTTTTTACC CATAACGGCT AAACCAATAA CTCCAATTTG 7920 

TTGTGTCATA TTACTTACCT CACTTGTTGA TTTTTCATTA GTATTGTATC ACAAAATAGA 7980 

CATACACTAC ACTAAATCAT TTCGAATGTC GCGCAACTAT TTTGATTATT TCTAACACTT 8040

GACTTGCAAG CAAGTTCAAT GATTTAATCG GCATTCTCTC ATTTGTTGTA TGGATTTTTT 8100 

CATAACCCAC TCCTAAAATG ACTGAAGGAA TACCAAATGT ATTAATAATA CTGCCGTCTG 8160 

AACCGCCACC AGAAATAATT GTATTPGCAG ATAATCX^'AA ATTACGAGCA CTTTCTTGTG 8220

CAATTTTAAC AACCGCTTCA TTATCATTAA TTTTAAATCC TGGATAACTT TGCTCCACTG 8280 

TAACTACTGC TTTCCCACCT AATTCTGATG CAGTAGTTTC AAACACATCA GTCATATGTT 8340 

TGACTTGTGT TTTTATTCTT TCTGGATCGT GAGAACGTGC CTCTGCTTCT AAAATGACTT 8400 

CATCTGCAAC AATATTCGTA GCTGAACCGC CATGAAACTT ACCAATATTG GCAGTAGTTA 8460 

TTTCATCAAC TTGTCCTAAT TTCATTCGAC TAATTGcTTT CGCCGCAATA TTAATAGCAC 8520 

TAACACCCTC TTTTGGCGTA CTTGCATGAG CCGTTTTGCC AAAAATTTTA GCTGAAATTA 8580 

ACATTTGCX5T CGGTGCACCT ACAACCGTAG TACCGACATC AGCACTTGCA TCAATAGCAT 8640

AACCAAAGTC CGCGTCCAAC AACTCTGAAT TTAATTCTTT AGCACCAATT AAACCTGATT 8700 

CTTCTCCAAC AGTAATCACA AATTGAATTT GTCCATGTGG GATTTGTTGT TCCTTTATCA 8760 

CTTGCAAAAC TTCAAGCATC GCTGATAATC CTGCTTTATC ATCTGCACCT AGAATAGTCG 8820 

TACCATCAGA GTATATGTAG CCGTCATCTT TTACAATTQG CTTTACATTA ATTGCGGGTA 8880

CAACAGTATC CATATGGCTC GTC/^TATA ATTTAGGTAC TTCGCCTTCT TCGATAGTAC 8940 

TATTCATTGT ACACACTAGA TTATTGGCAC CTAATTTAGG ATGTTTAGCC GCTTCATCTT 9000 

CTTTAACATC TAACCCTAAT GCTATGAATT TTTCTTTTAA AATAGGTTGG ATTGTTGATT 9060

CATTCCCTGT CTCAGAATCG ATTTGTACAA GTTCAAAAAA CGTATTAAGT AATCTTTGCT 9120 
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GATGAAATAA AATGTTACAG TAATTGACGT 
TACACAGATT TATCAG6TTT GTAAATTGTG 9240

TCATATTATT TTCAATTTAT TATATATAAT TATTGTAACT CAAACTAAGC TTTGTCAAAA 9300

ATATATTGAT TGATTTTTCA AAGATATCGT ATAATGAGGA AAATGACATA AGCAAACTTA 9360

CTCATGTTTT TTATTATATT CCTTTATGAT GATTGCTAGT TATATCGTCT CAAGTTAAAA 9420

GTTTTATATC TTATGTCGTA ATTATTAATA CAAAGGTTAT TCATTTGGAG GCACACAAAA 9480

TGCAAAATAA AGTTTTAAGA ATTATCATTA TCGTTATGCT TGTATCAGTT GTATTAGCAT 9540

TGTTATTAAC GAGTATCATT CCAATTTTAT AAACTATATC TCAACTACCT ATACAAAATC 9600

ATACAATTAA AAATCCATCC ATTATAAACG CATGTATTAA TAAGTTATCG TATTGCAACG 9660

ATTACTTTCA AACATGGGTC ATACGGATGG ATTATTTTTT AAGCTACTTC ACTATGCATT 9720

TTCAATGAAC CAAATTGCX3A TTTGATTTGT AAATATTCTT CTAATTCATT TAATATTTGA 9780

ATAATACTTG CTCTCGAGTT AAGCGCTTTG TGTGTTGTTG GCAATGGCAG TTCATCCAAT 9840

TTCAAACGCG TCTCATACAA ATTGTGT7UA CGCATTGCTG TATAGTCATT ACTATTCACA 9900

TTTAGACCAA TTTCTTTCAG CAGTGACGCA ACATCATTTA AAAGCGGATC TTTATGACAG 9960

ATACTTTOGA TGAGCGGTTT CATTCTCATT AACAATTCCA CTTGCTCTTC TCGCATATCA 10020

AAATAATGAT AGTATGAATT TTCGTTTCTA ACAAAATGAT TTTTAACATC TCGGAACGCG 10080

ATAGACTtCG CCTTTTTAAT ATTTAAAAGT AACACTTCAA ATTCAATCGC AATGGTATCT 10140

TCATATTTTT CACAAATATA ACTATATTTA CTAAAAATAT CAGCAATTTG TTGCTCAATT 10200

TTACATTTGT ATTCGTCtAG TTGTTTGTCT AAACTTGGCA TCATTAAATT CaTTGTAAAT 10260

GCAATGCTTA GTCCAATTAA CAGTAATAAT GTTTCATTAA CAATTAAATG TGCATCAATT 10320

GATTTTGCAT TAAAAACATG AAGTAATATA ACGCAACTCG TAATGACACC TTCTTGTACT 10380

TTTAATACGA CAGTTAATGG TATAAATAAC AATACGATAA TACCGAGTAC AATTGGACTC 10440

TGACCTAATA AACTAAATAT TGCTGAACCT AAAAACAATA CTAAAAAACA TGATACTAAT 10500

CTTGAAATAA TCGCTTGTAG CGAATGTACT TTTGTATGTT TAATACATAA TACGACTAAT 10560

ATGGCGCTTG AAGCATAATT ATCTAAACCT AACAGCTTAC TAATAATTAC ACCTAAAGTC 10620

ATACCCACTG CTGTTTTTAT TGTTCTAAAT CCAATCTTGT AAGGATTTAA CTTTAACATG 10680

GGTTAGCGCC TCTTATCTTT CTTCACAATA TTTATTGAAT AATGTTTGTA ATTGATTAAT 10740

TACGTTCATC ACATCATGAC CTTCGATTTG ATGTCTTTCA ATCATTTCTG TAATCTTTCC 10800

ATCTTTTACT AATGCAAATG ACGGACTTGA AGGGGCATAA CCTTCGAAGT ATTCACX3CGC 10860

TCTTTGTGTC GCTTCTTTAT CTTGTCCAGC AAATACTGTC ACTAGACX^AT CAGGTAATAC 10920
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AGAATTGATC ATAACTAGTG TTGTACCATC 
TTGTTTAAGA ACTTTGTCAA CATCTTCTGC 11040

AGTAGTTAAT TGCTCATATC CCGCAGATTC AATTTCATTC CTTGCTTGTT CTACAACACC 11100

GTTCATGTAT AAATCGAAAT TCATGnCCAT AAGTTCAATC ACCTATCCCT TTATATTTAA 11160

ACTAtCCTCA TTCTACTAAT TAATAACATA TTGTTCAATA AACTAATCTG AATCACACCT 11220

ATATTTAGAC ACAATTTTAA CAATATACCA AACATTATTG TGCTTAAAAT CATGGTAACT 11280

AATTTGTTCA CATGTTTTCA TTAATATGTT TCAAGTATGA TGTCTTATTT TGACTTTACT 11340

GCAAAAATGC ATTCAACCAT GTTGATTATT GTTCTTTATC TTTTTTGAAT ATATTGCACA 11400

TATTTTAGTG CCAAAAAATA ATACATCCAT CGACAAGAAC AAGATAAAAC AAGTTGTCGA 11460

TAGATGCATC TATGTTATCA CTAATATATA TTTGTATTTT CTAAAGTATA CTGTTCGATA 11520

CGCTGTTTAA TATGATTCAT ArATTTACCT GTTTGTAAAC CATCTAAAAT ACGAT6ATCA 11580

ATT6AAATAC ATAAATTAAC CATGTTACGA ATTGCAATCA TATCATTAAT TACTACTGGC 11640

TTTTTAACOA TTGATTCTAC TTGTAAAATC GCTGCTTGTG GATGATTTAT AATACCCATT 11700

GATGATACTG AACCAAATGT ACCAGTATTA TTTACCXyTAA ATGTACCGCC CTGCATATCT 11760

TCAGCTGTCA ATTGCTTATT ACGCGCTTTC GTTGCTAAAG TATTAATTTC TCTAGCTATA 11820

CCTTTGATTG ACTTTTCGTC TGCATGCTTA ATCACAGGTA CGTATAATTT ATTTTCATCA 11880

GCAACAGCAA TTGAAATATT AATGTCTTTA TGTAAGACAA TTTCATTTCC TTGCCAGCTA 11940

CTATTTAATA AAGGATATGC TTTTAAAGCA TCTGCTACAG CTTTTACAAA GAAAGCAAAG 12000

AACGTTAGAT TATATCCTTC TTTATTTTTA AAGCTGTTTT TATAATGATT TCTCGTATTC 12060

ACAAGATTTG TAGCATCTAC TTCAATCATC ATCCATGCAT GTGGAATCTC TGTTACACTA 12120

TTAACCATAT TTTGCGCAAT TGCTTTACGC ACACCATTTA CTGGTATTGT GCTGTTTTCA 12180

CTATTCTCTT CAGATGATTG GTTACTTGAT GTATCTACTG ATGTTGATTT TGTTTGAACT 12240

TGTTTGTCAG ATTGAGCTGT GGTACCACCA TTTTCAATAA CTGACATTAT ATCCTTCTTA 12300

GTTACACGAC CTTCAAATCC ACTACCTACA ACTTGTGATA AATCAATGTC ATGCTCTGAA 12360

GCGAGTTTAA ATACAACAGG TGAAAAGCGA CCATTATTAC GTGGTTGATT TTGTTTAGCA 12420

GTAGATGTCT GTTCCACTGT TGCACTAGCT TTTTTAGTAG ATTTCTGAGT ATGCTCATCC 12480

ACTTTTGCTT GTATCTCTTC AGTTGTTTCA TTTGTCTTTT CATCAGCAGT TTCAATTTTA 12540

CAGATAATTG TATCAATAGC TACTGTCTGC CCCGCTTCAA CTAAAATTTC TGTAATTGTT 12600

CCTGATATCX5 TGGAAGGGAC TTCAGCTGTC ACTTTATCTG TAATAACTTC ACATAATGGT 12660

TCATATTCAT CAATATGATC ACCAACAGAA ACTAACCATT GTTCAATGGT GCCTTCATGA 12720
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AATTCACGCA TTTTATTTAA GATTTTTrCT GGATTCATCA TAATTTCATT TTCTAATACA 12840 

GGAGAAAATG GCATAGATGG TACAtCTGGA GCAGCTAAAC GCATGATTGG TGCATCTAAA 12900 

TCGAACAAGC AATGCTCTGC AATAATCGCT GACACTTCTG ACATAATACT ACCTTCTAAA 12960 

TTATCTTCAG TTACAAGTAA AACTTTACCT GTATGTTTAG CACGATCAAT AATTGTTTCT 13020 

TTATCTAATG GATAAACAGT TCGTAAATCA ACGACTTCAA CATTGATACC GTCTGCAGCT 13080 

AAAATATCCG CTGCTTGTAA ACAATAATTG ACCATTAATC CATAACAAAA TACTGTTAAA 13140 

TCTTCACCTT CACGTTTCAC ATCTGCTTTT CCTAAAGGTA CAGTGTAATA TTCTTCTGGC 13200 

ACTTCTTCCT TTAAGAAACG ATAAGCTTTT TTATGCTCAA AGTACAATAC TGGATCATTT 13260

GATTCGATAG ATGATAATAA AAGCCCTTTA GCATCATACG GTGTGGAAGG AATAACAATT 13320 

GTTAAACCTG GCGATGAAGC AAATATACTT TCAATACTTT GTGAATGATA TAGTCCTCOG 13380 

TGAACACCX3C CACCAAATGG TGCACGAATC GTTAATGGGC ATTGCCAATC ATTATTTGAA 13440

CGATAACGCA TTTTCGCAGC TTCACTAATA ATTTGATTTG TCGCAGGTAA AATAAAATCT 13500 

GCAAATTGAA TTTCTGCAAT TGGTCTTTTA CCTACCATAG CTGCACCAAT GGCAGTTCCA 13560 

ACAATATTTG ACTCAGCTAA TGGCGTATCG ATAACTCTGT CTTCACCATA TTTTTGTTGC 13620 

AGTCCTTGAG TAGTACCAAA TAOGCCACCT TTTCTACCAA CATCTTCACC AAGAATAAAC 13680 

ACATCTTTAT TTTGTTGTAA TX5CTAAGTCT TGTGCCtGcG TATCGCCTCT AAATAAGATA 13740 

ATTTAGCCAT TAGTTAAGAC TCCCTTCTTC GTACACAAAT GCATAGGCTT CTTCGACACT 13800

TGGATATGGC GCGTCTTCAG CAGCCTTTGT CGCTTTATTG ATGATGTCTT TnATgTCCGC 13860

TTCTATTTCT GCCAACCAAG CATCATCGAT AATGCCAGCT GAAAGCAACT CTTTTTTGAA 13920

CTTTTCATTG CAGTCTGCTT TTTTAAGcGT TTCACGCTCT TCTTTCGTAC GATATTGGTC 13980 

GTOCrCATCT GATGAATGAG CTGTCATACG ACTTGTTACT GCTTCAATCA AAGTTGAACC 14040 

TTGACCAGAA ATAGCTCGAT CTCTTGCTTC TTTCATCGCT TTATACATTG CTAATGGATC 14100

ATTACCATCT ACTTGTTCAC CATGTATACC GTAACCAAGT GCTCTATCCG ATAATTTTTC 14160 

AGCTGCGTAT TGTAATGAAT CAGGTACTGA AATTGCATAT TTATTATTTA TAATGACACA 14220 

TACAAAAGGA AGTTTGTGTA CACCCGCGAA GTTTAAACCT TCATGGAAGT CACCTTGGTT 14280

TGAGCTACCT TCACCAACAG TTGCTGTTGC AATTTrCTTC TTACCATCCA TTTTTAAAGC 14340 

TAAAGCAGCA CCAACAGCAT GGGGTATTTG AGTTGCTACC GGTGAACTTT GAGACAAAAT 14400 

ATTCTTAGCT CTACTACTAA AGTGTGATGG CATTTGTTTT CCACCAGAGT TAACATCGTC 14460 

TTTCTTTCCA AACGCTGATA AAAACGTATC ATACGCTGAG ATACCCATAT AAGTAACGAA 14520 
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10 






AATCTGAGTT GCTTCTTGTC CTTGACCACT TACAACAAAT GGAATTTTAC CTGCACGGTT 14640 

CAATAACCAC AGTCTTTCAT CTATTTTTCT ACCTAAATCC ATCCATTTAT ATATTACTTT 14700 

TAGGTCTTCT TCGCTAAGGC CTAATGATTT ATAATCAATC ATGTTAAATC CTCCTATTTA 14760 

TACGTGAATA GCTCTACTTT CTGCTTTCAA TCCTAATTCC ATCAACACTT CAGAGATGGA 14820 

AGGATGTGCG TGTGTTGTTA GTCCTAATTC TAATGCCGAG CCATTCATGA ACTGTAACAG 14880

TGATGCCTCA TTAATCAATT CTGTTACATG TGGACCAATC ATATTAATAC CCACAATTTC 14940 

TTCAGTTGAT TGATCAATCA CCATTTCGCT ATACCCTTCG TTTGTGTCAT GGCTATCAAT 15000 

CACTGCTTTA CCAATTGCTT TAAATGGTAC TTTAAAACTT TTAACTTTCA TTCCCTCTGC 15060 

CTTTGCTTGT TCAATOTTTA AACCGATAGA AGCAATTTCA GGTTGTGAAT AAATACACTT 15120 

AGGCATCATG TTATAGTTTA CTGGGATTGG GTTCCCCTCA AACATATGAT CAACAGCCAC 15180 

AACACCTTCT TTPGATCCAA CATGTGCCAA TTGTAATTTT CCTATACAAT CACCAGCTGC 15240 

ATAAATATGT TTATCTTCAG TTTGTTGAAA TTCGTTCGTT AAAATATGTC CTGATGTTGa 15300 

AAGtTTTATT TTAGTGTTGT TTAAACCAAT ATCTGATGTG TTAGGTTTTC TACCAATCGA 15360 

TAGCAACACT TTATCTACTT TAATTATGTC TGAGGAAATT TCAAACGTAA CACCATCTTC 15420 

GTTAACATTT ATATCATTTT CAGAAAGTTT TATTCCCTCA TAGAATTTAA CACCACGTGC 15480 

TGACAATGAT TTTTTTAATA GTTGTGAAGC TTGTTTACTT TCAGTTGGTA AAATTCTTTC 15540 

ACCTGCTTCT ATAACTGTTA CGTCAACACC TAAATCTATC ATCAATGATG CAAATTCCAT 15600 

TCCGATAACA CCACCACCAA TAATACCAAT ACTTGATGGT AACGTCTTTA ATGATAATAT 15660 

ATCATCGCTA GATAAAATTT TATCATGATC AAATGATAAG AATGGCAACT CTGCAGGCGA 15720 

AGAACCAGTT GCAATTAATA CAAATTGGTT GGGTAATAAG TCTGATTCAC CATCTTCATA 15780 

TTCGACAGAA ATTGTGCCAC TTTGAGGTGA AAATATAGAT GTACCTAGAA TACGTCCCGT 15840 

GCCATTATAA ATGTCAATGT GATTGTGTTG CATTAAATGC TTTACACCTT GATACATTTG 15900 

ATTAATAATG TCTTCTTTTC GTGCCAACAT ATTTTCAAAA TTAACATTAG CATCTTTGAC 15960 

ATCAACGCCA AACATTGCTG CCTGTTTTAC TGTTTGAAAT ACTTCAGCAG ATTTAAGCAO 16020 

CGATTTAGTA GGAATACAAC CTTTATGGAG ACAAGTACCT CCTAATAGTT GTCGTTCTAC 16080 

TATTGCCACT TTTTTACCTA ATTGAGACGC ACGTATCGCA GCAACATATC CTGCAGTACC 16140 

TCCACCGAGA ACGACTAAAT CATATTGTTT CTCTGACATG TTCTTACTCC TAACTAATGA 16200 

TATATATCCA TTGAAAATTT ATTAATACAT AGTTTTCATG TCCATTAATT ACCTATTTTA 16260 

CATGATTGTC TATTTAGTTT GAATGCACAT AAATAAATCC ATAAATGAGT ATTCAACACA 16320 
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TAAATCAGTA ACACTTGCAC CTGAAATCAT TCGTGCAATT TCATCTACTT TATCATCGCT 16440 

AATTAACTCT TGAACTTGTG TTGTTGTACG ATCATCTTTT GATGATTTCG AAATTAATAA 16500 

ATGATGGTCG CTCATCGATG CAACTTGTGG TAAGTGAGAG ATACAAATAA CTTGTATATA 16560 

TTCTGCTaTA TCTCGCATTT TCTCTGCCAT TT 16592 
(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13794 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54:

CCAATACAAC GTAAAAAGAT TGCTTGTGTT ATTAATGAGT TAGATAAAAT AATTAAAGGA 60 

TTTAATAAGG AAAGAGACTA CATAAAATAT CAATGGGCTC CAAAATATAG CAAAGAnTTT 120 

TTTATACTTT TTATGAACAT TATGTACTCA AAAGATTTTT TAAAATATCG ATTTAATTTA 180 

ACATTTCTTG ATTTATCTAT CTTATATGTA ATATCATCTC GAAAAAATGA GATACTAAAT 240 

TTAAAAGATT T6TTTGAAAG TATTAOATTT ATOTATCCTC AAATTGTTAG GTCAGTTAAT 300 

AGATTAAATA ATAAAGGTAT GCTAATCAAA GAACGATCCC TTGCAGATGA AAGGATTGTG 360 

TTAATCAAAA TAAATAAAAT ACAATATAAC ACTATTAAAA GCATATTCAC AGATACTTCC 420 

AAGATTCTCA AACCAAGAAA ATTTTTCTTT TAAATTTAAA CAGATTTACC TCTTGATAAA 480

ATAAATAAGC AATCATACTA CTTCTCAATT TAGTATAAAT AAAAATACAT AATTAACTTT 540

CTTTTGTTTT TATATTATTT CAATACCCTA CTATATATCA CAACACATAA ATTAAGCATG 600

ACACTCATTC AATTTAGTTC ACCATTTCGT GTTCCAATTT TACTGAGTAT CATGCTTTTA 660 

ATGTTATAAA CCTAATGCTT TAATAAATCG TGTTAATTCT TCTCGCATAC TGTCATCTTT 720

CAATGCATAT TCTATGGTAG TTTTAACGAA GCCTAATTTT TCTCCAACGT CATAACGTTC 780 

GCCTTCGAAG TCATATGCAT ACACTTGGTT ATCATTATTC ATACGTTCAA TCGCATCTGT 840 

TAACTGAATT TCGTTACCTG OGCCTTCTTT TTGCGTTTTT AAATAATCGA AAATTTCAGG 900 

CGTTAATACA TAACGTCCCA TAATAGCTAG GTTTGATGGT GCCGTACCTT GTGCTGGCTT 960 

TTCAACAAAC TTTTTCACTT CATACTGACG TCC6TTTTTA GTTAATGGGT CAATAATTCC 1020 

ATAACGAT6A GTATCTGCTT CCGGAACTTC TTGGACACCT ATAACTGAGT GCCCTGTTTC 1080 

TTCATAAACG TCAATCAACT GTTTCACTGC TGGCACTTCA GATTCAACAA TATCGTCACC 1140 
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TCTTCGATTG TACCAGTCAA TTCATGCTTC TCCACTGGCG GCTCTGATTT AAATTCAAGT 3060 

TCGATAGGAG TACTATGTTC TATAATAGGT TCCTTTAGTT TATCTTTGCC GTCGCCTTGA 3120 

GCGTTATTAG AGTAAAATGC AACGCCATTT TTCCaAGTTA AATTACTTGT ATAATAATAG 3180

TTATAATATC CAAAAAGGTG TGTTTGAAAT TCTAAGTTGC TAGCATTTGA ATCATAATAC 3240 

CCTTCATATT TTATTACATA ATTTTTACTT TGGTCTAAAT TATTAAAGTT TAAAGAATAA 3300 

CCACCATTAG TATCAAAATC TAAACTCATA TTATCAGTCA CATCTTCAAA TTTGCTGACA 3360 

TCATCAAGCT TTGCATAnTn AgctTTCAGC TAAATCGTCT GAACCAATGT GTTTATATAC 3420 

CTTAACTGTT GGATTATTAA CCCCTGGTTT ATTTCCTTTA GTTACTTGAC CAGTTACTGT 3480

CACAGAGCTT AACGACTGGT TGTTAGGTTT CATGTACGCA AAATGACTAA ATTTCCCATC 3540 

TACTTTATTT AAAGTATCAA TTCGACCATT AGCTGTTACT CCCCAATTAT CTCTAACTCC 3600 

ACCTAAATAT TGAATATTAA ATATTTTGCT AACCGTAGTC TCACCCAATT TAACTTCAAC 3660

ATTTTGGTTA CCTTTnGCG TCACTGTTGT AGGATCAATA AATAGATTTA AAGATAATTC 3720 

AGCAGTTAAA TCTTTCTTTT CTTGTACATA TTCTTTAAAC GTATATCTAA CTTTTCTTTC 3780 

TCCAATTATT TCTGCTGTCG CCATAACTTG ACCATCTGTA CTTTTTATCT CCGGAACTTT 3840 

ACGCAGTGTT GAGATACCAT GAGTTTCAAC ATTATCGCTT AATGTGAAAT CAAAATAATC 3900 

TCCCGCCTTA ATTCCTTCTC CAAATTTCCA TTTATATTTC AAGGTTACTC TTTCTGCGTT 3960 

ATGAGGATTT ACAACATTCG TATCTTGTTT ATGTCCTACA ATTTCACTAC CTTCTTCTAC 4020 

TTCCACTTTA TTTGTTACAT CTGTACCTGT CGCTTTAGTT TCTTCCACTA CTTCTTTCTC 4080

TGCAACTGCT GTAACGTCAt TGatCTTTTC ATTCTTGGTT TAATTTCTGA GACGTTACTT 4140

GGTTGAGCTA TGTCAACTTG AGTTCCTGTA GTTTCCTTAT CAGCAACTTT TTCCGATGGC 4200 

AAATCAACTC GCGAAgTTTC TACTTTTGGT GCTTGCAcAG TTTTCGGTGC TTCTTCTGTT 4260 

GTTACTTGTG TTGATTGTGA TGGTTGCTCA GTTGATGTCX; CGCTGTATGA TTGTGTTTCA 4320

TCTATTGTAT TAACGTTATT TGTAGTTGTT TGTGTTTCGC TTGCTTTACT TTCAGTAGCT 4380 

GAACTCCCAC TTTCCTCTAC TGTAGTATTG TTTTGTTCCG ATGCTGCAGC TTCTTTTTCT 4440 

TGTCCCATTC CAACAACGAT CATTGTTCCT AAGAATACTG AGGCCGCTCC CAATTTGTGT 4500

1TTCTTATGC CGTATCTAAG ATTGCTTTTC ACTATAATAT TCTCCCTTAA ATGCAAAATT 4560 

CATTTATTTT TAAAACTCAA TAAATGCAAT TCTATATTGT TCGGTTTTTA AAAGCAATGA 4620 

AAAAAAGCGA GTTAATAAAA AGTTAAGATT GTTGTTAACT TTATGTATAA TGAGTTTTTT 4680 

ATTATTTGAA ACTCACATAT ATATTGCATA CAAAGCTCTT GAACACCTTG ATATAACAGG 4740 
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TACTAAACCA TACATAATAA TCGCCTGTAC AATGCATCAT TAACAAGTCA CTGAAACGCC 4860

TTTCATTGTA TTAATAACGT CACTATAATT TTTATATCGT TCGGTTTTTG TTTGATTTTA 4920

ATGATTATTT ATACAAAAAC AGCCGTATTT CAAGCCGACA TTTTAAATTT AACTAAATTT 4980

GCATCTAGTT AATAATTGCA TTTATCAAAT TTGTCTTATT GATCCAATCT AATTTGTACT 5040

CACAAACTAG TTTAAAATTC TAACTTTATC TCTCAGTTCG TTATCAATCA TCAGACATAA 5100

ACCAATGAAG CAATCAGAAA ACACTCTAAT TTTCTATTAG AAATTTGATT TAATATAAAA 5160

AAACAGGCTT ACTTCATATA ATTTATGAAA TAAACCCGTC AATTTTTGTT TAATTATGCT 5220

TTGTGATTCT TTTTATTTCT GCGTAATAAT GCTAAACCTA GAATGCTGAA TAATCCGCCG 5280

AACAACATAC CTTTGTTTGT TGATTCTTCT CCACCTGTTT CAGGTAGTTC AGATTTCTTA 5340

GATTGTGGTT TTTTAGTTGG TGCCACTGCT TTAACCTTTT CATTGATTTC AATAACAGGT 5400

GTTACTACTT TACCTTGTTC CACTGGTTTA GAAGGCTTTT TAGGTTCTTC TTTGGCAGGT 5460

GGTACTGGTT TACCAGGTTC AGCTGGTACC TCTGGTGTTG GCGGTGTTGG AGTTTCTGGC 5520

TCACTCX3GCA CTTCTGGTGT CGGTGGTGTT GGTGTTTCCG GCTCACTTGG TACTTCTGGT 5580

GTTGGTGGCG TTGGTGTTTC CGGCTCACTT GGTACTTCTG GTGTCGGTGG CGTTGGTGGC 5640

ACGATTGGAG GTGTTGTATC TTCTTCAATC GTTTGTTQAC CTTCATTTTG GCCGCTTACT 5700

TTTGGAAGTG TATCTTCTTC AAAGTCAACA CTATTGTGTC CACCGAATTG ATAACTTGGT 5760

TTATCTTTAT TTGTATCTTC TTCAATAATT TCAGTGTGCT TATTGAATCC GTGAATATGT 5820

GGCACACTGT CGAAGTCGAT ATCAATGATG TTACCGCCAT GTTCATACTT AGGTTTGTCT 5880

TTTTCTGTAT CTTCCTCGAA TGACTGATTA CCTTTATTTT GACCATGAAT TTGAGGTACA 5940

CTATCAAAAT CGaTATCTAC GATATTGCCA CCTTGTTCAT ATTTAGGTTT GTCTTCTTCT 6000

GTGTSTTCCT CGAATGACTG GTTACCGCTA TTTTGGCCAC CTTCATAACC TAATTCACTC 6060

TTAATATCAA CGTGGCTATT TTCTTCXaATT TCTTCAATCA CGTCATAATT CCCGTGACCA 6120

TTTTCAGTTC CTAAACCAGA ATGAGAAATA TGATGATTGT TTTTAGTAAT TTCCTCGACT 6180

GGTCCTTGTG CTTGACCATG CTCTTCAGGT AATTCATCCA CTAATTCAAT CAGATTACTT 6240

tCAGTTGTAT ATTCTTTCGT ATCTTCAACT GTTGTATGAT CGCTCACtGC GCCAGTTACA 6300

ATACCTTTTG TAGACTCTTC GTCAAATTCA ACTAAGTTAG ACTCAGTAGT AACCTGACCA 6360

CCACCTGGGT TTGTATCTTC TTCATATTCA ACAACATCAG CGTGATGTTT TGAATTTTCA 6420

TGTGTAGATT CTTCAAAGTC AATTGGATTT GATTCCTCAG AGGACTCAGT GTATCCTCCA 6480

ACGTGACCTG CtTCGCTATC CACAGCAGTA TGGTAATCGA TATCAATAGC TGATGAATCC 6540
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
TGGTAATCAA TGTCAAGAGT TGATGAATCA TATTCCTCTT CAACAGTAGT TACTAAATTC 6660 

TTATCATATT GACCTGTAAG AGTTTCTTTA ATTGTATCTT CTTTATATTC AAATTTATTA 6720 

TTTTGAATAA TCGGACCATT TTTCTCATTT CCGTTCGCTT TATTACTGTA TAAAACTAAA 6780 

CCATTATCCC AAGTTAAGGT ATATCCTCTA TCATAATAAT ACTTATAAAG TTGCTCTGGA 6840 

TGTCCTACCA TTTGTGTTCT AAAATCAACT TCATCAGTAC CATTTAAATA CTCTCCATCA 6900 

TAGTGAACAA CATAAGTTTT ATCTAGATTT TCTATATTCA ATGAATAGCT TCCATTATTT 6960 

TGTAAATTCA AATTCCCACT CATATTACTT GTGACTTCTT TAAATTTAGA AGTATCTGTC 7020 

GTATTTGCAT ATACACTCTT CGCTATGTCT TCATTATTAC CCAAGTATTC AAATATCCTA 7080 

ACTTTTGGTT GATTTCCATT CTGATTACTA CCTTTCATTA AAGTTCCAGT AACAGTCACA 7140 

CTTGTCGTTT TACCATTATT AGGTTTAATA AATGCAACAT GCGAAAATCT ATTATTCGCT 7200 

TTATTAAATG TCTCAATCGA TCCATTTAAA TTGGCATAAT AATTCCCAAT ACCATCTTTA 7260

TATTTAACAT CTAATTCCTT TGAAGTTTGT TCTTCATTTA GTGTTGAAGT TATAGTTTGA 7320 

TTTCCATTAG TTTGTACAGT TTTAGGATCA ATAAATAAAT TAATTTCTAG TTCAGCCGTT 7380 

ACATCAACCT TATCTTCAAT ATCATTTGTA AATGTATATC TAATCTTTCC ACCTTCTAAA 7440 

ACTTCACCTO TCGCCATTAC GACTGAACCA TTTTTAATTT CTGQTACTTT TCTAGCAGTT 7500 

GATACGCCAT GCGTATTTAC ATTATTTGAT AAAGTAAAGT CAAAGTAGTC ACCTTGATGT 7560 

AAACCATTCT CAAATTTCAA CTTATATTTT AGTACCGCTC GTTGTCCTGC ATGAGGTTCT 7620 

ACTTTATTTG TATTGTTATG CCCCTCAATA GAACCAATTT CTACTGTAAC TTTACTTGTT 7680 

ACATCTGTAC CCGTrXCCAC TTTCGCGTTA CTAGCTTCCT TAGCTTCCGC TACATCTGCT 7740 

GATCTTGTCA CACGTGGCTT ACTTTCTGAT GCCGTTCTTG GCTGTGCCAC TTCAACTTGT 7800 

GTTUrrGCGA CTTGATTTTG TGTAGCCTTT TTAGGTGTTA AATCTACTTG TCTTTGATCT 7860 

CCGCTATTGT CTTGAGATTG TGTTGTTTCC TTAACTTGAG GTTTCGCTTC TTCCTTAACT 7920

ACCTCTTCTT TAACTGTTTC TATATTTGCT GGTTGTGCAG TTTGTGGTGC TTGTACTGCT 7980 

TTTGGTGCTT CTTCAGTTGT TACTTGTGTT GCGTTTGACG GTTGTTCTGT TACTGTTGCG 8040 

TTATATGArr GAGTTTCTTC TATATOATTA ACGTTAGTTG CAGTTGTTTG TGTTTCACTT 8100

GTTTTATTAT CAGTAGCTGA ATTCCCATTT TCTTCTACTG TAGTTGTCTT TTGTTCTGAT 8160 

GCTGCAGCTT CTTTGTCTTG TCCCATCCCA ACAACGATCA TTGTTCCTAA GAATACTGAT 8220 

GCTGCTCCCA ATTTATGTTT TCTAATGCCG TACCTAAGAT TGTTTTTCAC TATAATATCT 8280 

CCCTTTAAAT GCAAAATTCA TTAATTTTTT AAACTTAATA AATGCAAGTC TATATTOTTC 8340 
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
ATGTTAATTG ATAATTTTAT TATTTGAAAT ATACCTATAA ATTGTATTCA AGTCATCAGA 8460 

AACCCTTGTC ACACAAGGCT TGTATTTTTT ATACTTATTT TTTAAATTAA ATTCATCATT 8520 

ATCTAATTTA AAACAATATA CTAAACGTTT CATAATTATC GCCTGTACAA TACGCACAAA 8580 

AACATGTCTT GAAACGCCTT TCATTACTCT AAAATACCCA ATATACTTTT TATATCGTTC 8640 

GGATTCTGAG TATTTCAGAC GATTTTCTGC ATAAAAATAA ACGTGTTTCA AGGCAATATA 8700 

TTGCAATTAC CTAAAAACAC GTTTACTTAA TATTTAGTTA AACAAATAAG CTAATGAATA 8760 

AAATGAAGAT GATACCTGAA ACGGAAATAA TCGTTTCTAA TAATGACCAT GTTAAGAATG 8820 

TTTCTTTTAC AGTTAAACCA AAATATTCTT TAAACATCCA AAATCCTGCG TCATTTACAT 8880 

GAGACAAAAT CACACTACCT GCACCTATCG CAAGTACAAC TAATGCAACA TTTACATCTG 8940 

ATGATTGTAA TAATGGTAAG ACAATACCTG TAGTTGAAAT CGCAGCTACT GTAGCCGAAC 9000 

CTAATGCGAT ACGTAGCACA GCTGGAACAA TCCATGCTAG TAAAATCX3GA GACATCTCTG 9060 

TACCTTCAAA CATTTTAGCA ATTGTATTTC CGACACCGCC GTCAATTAAT ACTTGTTTAA 9120 

ATGTACCGCC ACCGCCAATA ATCAATAACA TCATTCCGAT TGGATAAATC GCATTCGTCA 9180 

CTGATTCCAT AATATGATTC ATCTTACGCT TTCTCATTAA TCCCATCGTA ACGATTGCAA 9240 

ATAATACTGC TATTAGCATG GCTGTCCCTG CTGTTCCTAT CATATAAATG ATAGATTCAA 9300 

ATAGATTTGT AGGTTTGTCA TGCCCAGTTA CAAGTTGCGT TATCGTAGAC ACTAACATTA 9360 

ATATGACTGG TAATGTTGCT GTTAATAAAC TCATACCAAA TCCTGGCATC TCTTGATCCG 9420 

TAAATTCTTT TTGTGCACCT AACGCTGAAA TATCGCCTTC TCGTGTATAC GCAGACGGAA 9480 

TCATTTTTTG TGCAcTTTGT TAAATATAGG CCCTGCAATG AGTGTAACTG GaATGGCAAT 9540

AATCATACCA TACAGTAATA CATCTCCAAC ATTTGCCTTT AATTCTTTTG CGATGACTAC 9600 

CGGTCCTGGA TGTGGTGGTA AAAAGCCATG TGTCACTGAT AAAGCTGTTA CCATAGGTAG 9660 

TCCTAGTTTT AACACTGAAA CATTTGCGCG TTTTGCTACT GTAAATACTA ATGGAATCAG 9720 

TAAGACTAAA CCTACTTCAA AGAACAATGC AATACCGACG ATAAATGCTG CAACAAGCAT 9780 

TGCCCATTGT ACATGTTTTT GACCAAATTT TTGAATCAAC GTGTCTGCGA TTCGAGTTGC 9840 

ACCACCACCA TCAGCAAGCA ATTTCCCAAG TATGGCACCT AAACCGAATA TCAGTGCAAT 9900 

GTGGCCGAGC GTACTGCCCA TTCCTTTCTC AATCGTCTCC ATAATTTTAG TCAATGGTAT 9960 

ACCTAGCATT AACGCTGTAA TCATCGATGT GATAATTAAT GAAATAAATG TATTTAATTT 10020 

AAACCGAATA ATTAATACTA ATAAAATAAC GATACCTAAA ACAACACTGA TTAACGGCCA 10080 

TATTTCGTTA AACATGACAT TCCCCTCTTT CTCTTTTCAA TAGAATGTAA CACCGTCGTC 10140 
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GAGTGACGTA TTTATTGTGT TTTATTTTCA GCGATATGTT GGCGTTGAAA ATCTGCAATT 10260

TGTTCATAAT TCTCTGTTAA AGAACGACTT AAATTGATAA AAATGGATAC GATCTCTTGG 10320

TAAACAGTGA CATTTTCTTC AATCGGCGTA TGATTGTTTG TGGCACCGAC CATCGATGAA 10380

ACGATTGAAA AATCTTCAAT GTCACCTACA GCTTTAAGTC CGAGCACGCA GGCACCTAAG 10440

CATGAACTTT CATAACTTTC AGGAACCACT AACTCTGTGT CAAATATATC TGACATCATT 10500

TGACGCCATA CTTCACTTTT CGCAAAACXIA CCTGTTGCTT TTATCATCTT AGGTGTTTCA 10560

TTCATTACTT CAATAAGCGC AAGATAGACG GTATACAAAT TGTAAAGAAC ACCTTCTAAT 10620

GCAGCGCGAA TCATATGTTC TTTTTTATGA GATAAAGTTA AACCGAAGAA TGAACCTCTT 10680

GCATTTGCGT TCCAAAGCGG CGCACGTTCT CCTGCTAAAT AGGGATGGAA TATTAAACCA 10740

TCTGCACCTG GTTTAACACG CTTTGCAATT TGAGTTAAGA CATCATAAGG ATCAACACCG 10800

AGACGTTTCG CAGTTTCGAC TTCACTCGCT AGCAACTCGT CGCGCAACCA TCTCAATACX3 10860

ACACCACCAT TATTTACAGG ACCTCCGATG ACGTAGTGGT CCTCTGTTAA GACATAACAA 10920

AATATTCTAC CTTTGTAATC AGTACGCGGT TTATCTATCA CAGTACGAAT CGCCCCAGAT 10980

GTACCGATTG TGACAGCAAC TTCTCCTTTA CCAACACTAT TGACACCTAA ATTAGAAAGG 11040

ACCCCATCAC TCGCACCAAT AACAAACGGT GTATCTTTAT TAAGCCCCAT TAATGTTGCA 11100

TAACGTTCTT TCATACCTTT CAtCACATAC GTTGTTGGAA CTAATTCCGG CAACATTTCC 11160

TTGGAAATAC CCAGCAGTTC TAATGCCTCA ACATCCCAAT CTAATGTTTC TAAATTAAAC 11220

ATCCCTGTTG CGGAAGCCAT TGAATAATCA ATGATATATG TATCAAATAA ATGATAGAAA 11280

ATGTATGTTT TAATATCTGC AAACTTAGCA GTACGTTGAA ATACATCTTG CCATTCATGT 11340

TTCATCCAAA AAATCTTCGC TAATGGCGAC ATAGGATGAA TCGGTGTGCC TGTTCGCTGG 11400

TAAATCGCAT TGCCATCATG CACTTCATTT ATTACTGTTG CATATTTTGC AGCGCGGTTA 11460

TCTGCCCAAG TAATATTATT TGTTAATCTT TGATGTTGCT GATCCATCGC AATCAAGCTA 11520

TGCATTTGCG CACTAAATGA CACAAACTTA ATGTCGTCTT TATTAACTTT GGATTCTCTC 11580

ATAACATATT TAATAGTCAT TAGTACTGCA TCAAATAATT GATCTGGGTT TTCTTCTGAG 11640

ACATCAACGT TTGGTGTGTG TAAATCATAO CCTATTTGAT GTTTCATGAT AAAAGTTCCA 11700

TTTTCATCAT ATAAGACTGA CTTGGTACTC GTCGTTCCAA TGTCGACACC AATCATATAT 11760

TTCATGATAA ATCCTTCTTT CTTTCATTTT AATTCAACCA AAATCCTTCA ATATCTTTAC 11820

CAACATCGTC GAAATTTAAA TGAAACGCTT CTTTCAAAAT TTGACTGTCG TATTGTTCCA 11880

CTGCATCAAT AAACACTTGA TGATTATGAT GTATGCGTTC AAAATCTTGC GGGTTCTGTT 11940



AAAATGAGTT TAAATATTGA TGATTAGATG CTTTGATTAA TGTTTCATGA AATTCAAAGT 12060

CATGCTTCGT AAATGATTCT GCATCCTCAA ATTTTACTGC CACTTTCATC ATTTCAAGTT 12120

GTTTCTTCAT TTCTTTTACG ATAGGTAGTC GCTCTTGATT TTTAACTCTT GAAAATGCAA 12180

ATGACTCTAA CATCAGTCGC AAATCATACA TTTCTTTCTT TTCTTGTTCC CCAAACGGCA 12240

ACACATGTGC ACCCATTCTT TCTAATTGGA TGAGTTGATT TTGTTGCAAT AATTTAAATG 12300

CATCTCGAAT TGGCGAACGA CTCACATTAA ATTGCTTTGC CATTTGATTT TCAGTGAGTA 12360

ACX3TACCTTC AGCTATGTGA CCATTCACAA TGCCTAAGCG TAATTCTGCC GCGATACCTT 12420

CTCCAGTTGT CATACCTTCC AACCATTTCT CTGGATATCC ATACATCATC AAAGTCACrC 12480

CTTCATTACA CGACATACTT GTATACAAGT ATGTTAATAT AGTTATTATG AGTTTGCAAG 12540

CGCTTTCTTT ACGAGCACTA AAATAGTGAC CACCCCTTTT CGATTTAAAT TTAAAGGAAA 12600

TGGTCACTAT CACACGAATG ATTTAATTGT TATGTTGTAT GTGGGATATT TCTAATTGTT 12660

CTGTACTCAT ATGCGCTTTA GGTACTTCAA TGCAATAATG CGTTTCATGA CAGTTTGGAC 12720

ATTCGAATCG ACGTGTTGTC GCTGTATGTT TCGCTTTGAT AACTGCCCAC AAAGATGGTG 12780

AGAATATATG CTGGCAGTTA GGACATAAAT AGGCAACCTT TTGTTGGTAA TAAAAAGTAA 12840

CACCAATGCC ATAACCAATC ATAAATGGTA AAGCAATTAA AAACGGCCAT TTATTTTTCA 12900

TCAAAATTGC ACTTATAATG CTAGAATATT GAATTATTCC TATAATACCA GCACTAATCC 12960

AAATGTTACG ACGAATACTT TTCATTTCAG CTGATTTACT CATGACATGC TCTATGTCTT 13020

TTAAGTGTGT GATTGGAGAC GTCGACGCTT CATTTACGTA ATATTGAACA TTTTTAATTT 13080

TGTTTAATAC CGCTTGTTGC TGTTTAACTT GTTGGTTAAT TTCTTGTTGT TTCATAGTTA 13140

GTAAAGTATT GAGCGTCTTC AAAGTACCTT CACCTTTTAG CAACATATCT ATATCGCTTA 13200

ACGC&CAACC TAAATCTTTA AGCAATAAGA TTAACTCTAA TGTTTGTCGC TGTTGTTCTG 13260

TATACACACG ACGCTTTCCT TCTGTAAATC CTTGTGGTTT CAAAATACCT TTGCGATCAT 13320

AATATTGAAT OGTTCGTGTT GTCACATTGC ATAATTTTGC GAGTTCTCCA GTCGAATAGT 13380

TAGACATAGA TTCCACCTCC TATAATTACC ATAGTTGATG ACCCGAOGTC ACGAGCAAGT 13440

ACAATTTCCA CATTTTAAAQ AAATTTATTA TACTAGGCGT CTTATTTTTA TGATTTCGTA 13500

CCATGTTGAT TTACAAACTC ACTCAAACTA AGTAACACAC CTACTAAACA TCTACTCTGT 13560

TATTTCAGAA TGAATTTGTT GTAATTTATC TTCAACTTCA GTAATCTCTG TCGCACATTC 13620

TTTCAGTAAA TCTCGATACT TTTCCGTCTC TGCATTGTTT TTATAACGTA TTTTATGTTC 13680

TAAACTTGcC CACATATCCA TACCTATCGT TCTAATTTGA ATTTCAACAG GCAATACCTC 13740
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(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1059 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 









AX 1 IV. 11X11 


CAATTTTGAT 


TTTCATTGTT 


X Vv^W^wV XXX 


60 




TAAAATAAA,\v 


TTAGTTGCTT 


CTGTTCCTCA 


TATTCCAAAT 


CACTTTGCTT 


xaxaxaxux X 


120 

X A w 




TCAAGCTCTT 




AAATGTCTTT 


TTCACACCTT GCCAACCTGG 


^AV\M% X't X OA 


180 






AATAAGT6CU 


AX 1 xAwx A<JA 


TGGATATGTG 


CCACTCGTTC 


VvX XAlW^XVyA 








JL JlAw%lV-V 


AAAuAAl XVjA 


TTTAGGTATT 


CTTTACGCGC 


WV> X AX WX w X^ 


300 






CTCCTTTTAA 


V.AAX lAViVa^^A 


GACCAAACGA 


CATGCATTCG 


X X AX AVXV* X 


360 




CTTCATTACT 


TATGCTTGCC 


X XAXAwX XXX 


CAATCACATT 


GCTAACTTCT 


X XAXUAv. X ^■■•A 


420 








WXAXAX X X X X 


CGCAGTCTTC 


TAATTCCAGT 


TGCTCCTGTA 


480 




ATGACATCAC ATATTCAACT TGTCTTTGGG TTGCCATCGT TAACCCTCCC ACAAGTCAAA 540

AGCTCTTTGG ACGTAAAACT TCGCCTTTGC TAAATCCTCA TGACCATTCT TTAACGGTGC 600

TCTAGACATG TATTTGATTG CATTACCTAT TGCGAATGCT AGTTGAGGTG GATACTGTGC 660

CGTAACCTGT TCGATAAAAT CTATAATTTC AATGTCGCCG TATGTGTAGT GCGCTGGTTG 720

CTTAACATTG TCTTGCGCTT CGTTCATATC TACTTTTCTG TTACTGATTA CGCTCATTAT 780

GCTTCACTCC ATTTCTTGAA CATTTGGTTA TAAGTGACAT CGAACCAGTA CGGATCACGT 840

GAAT&TTTTT GTGGCGTTCC ATCATAAAGC CATGGTCTTA ATCTTCTCTT TCTTTCCTGT 900

TCATATTCCG CTCTCACATT TCGTTGGTAT CGGTTCAAAA TCGCTTTTTT TCTGATTTTT 960

TCTCTCCCTT TTTCTTCATC TTTnATtTGA CTCTnCATAT ATTCAACTTC TTCTGTAGAT 1020

nTTGAGTCCT TTCTTCCACA CAATAATTCA nCGCCGCGC 1059



(2) INFORMATION FOR SEQ ID NO: 56: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30246 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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GAAGTAAAAG AAGAATTAAA TTTAACATTA ACAATGGATG AAATTGAATA TGTCGGGACA 60 

ATTGTAGGTC CTGCATATCC ACAACAGGAT ATGTTAACTG AGTTAAATGG ATTTCGCGCA 120 

TTAACCAAAA TCGATTGGGA AAACGTAACT ATCAATAATG AAATTACGGA TATACGCTGG 180 

ATTGATAAAG ATAATGATGC GTTGATTGCG CCTGCTGTCA AAGTTTGGAT TGAAACTTAT 240 

GGTGGTAAAC ATGACAAATA ATGACACCAT CATGTTACGA CATTATGTCC CACAAGATTA 300 

TTCGAT6TTA GAAGCTTTTC AATTAAGTGA AAGTGATTTG AAGTTTGTTA AAACGCCAGA 360 

GGAAAATATT ACAGCTGCAA TGTCTGATAA TGAAAGGTAT CCCATCGTTG TAATGGATGG 420 

CAGGCAATGT GTGGCCTTTT TTACATTACA TCGTGGAAAA GGGGTCGCAC CATTTAGCGA 480 

TAACCAAGAT GCAGTATTTT TCAGGTCATT TAGTGTTGAT CAACGTTATC GTAATAGAGG 540 

AATAGGTAAA GTGGTAATGG AAAAATTG6C GTCATTTATC ACTTCAACAT TTCAGGATAT 600 

TAATGAGATT GTGTTAACGG TTAATACTGA CAATCCACAT GCCATGGCAC TTTATCGCCA 660

ACAAGGATAT CAATATATGG GAGATA6TAT GTTCGTC06A AGACCTGTTC ATATTATGGC 720 

6TTAACTATA AAATAAATTA AATTTAAAAG CATCTTTACT CATCGTCGAC CACAACAATT 780 

AATGATGAAT AAAGGTGCTT TTTGTTATAG ATCATCXX5AC AATTTACTAT AGTAAAAAGC 840 

GACCTAGTGA ACAATTGACA TATATCCACA GGTCX^CTTAA CTTAAGTTAT ATTGCTAGTT 900 

GCGATTAATT GATAGACTCA TCATTTTTGC GCTGTOGAGA TGGTCTTTTT ATTAAAAATG 960 

CCGTAATCCA AGCCGTAATC GGAATACTGA TTGCAACGGC AATACCGCCT AAAATAATAG 1020 

AAATAAATTC TTGGGCAAAT ATTTTCGAGT TTATAATATG ACCAAATGAA TATTTAAGTT 1080 

TGAAAAACCA AATAAATAAA GCAAGTTGGC CACCAAAAAA GGCAAGGTAA ATCGTGTTCG 1140 

CAGATGTCGC TAAAATTTCT CTACCAACAC GCATGCCAGA TTGGAATAAT TCGTATTGCG 1200 

TAAOSTTgGA TTCACTTGAT GCAATTCATA AATGGGTGAA CTAATGGTAA TTGTTAAATC 1260 

TATCACAGCT GCAATAACAG CAAGAATAAT AGTGAACACC ATAAAITGAA CCATATCAAT 1320 

GCCAATATTC ATTGAATACA CATATGTTTC ATCTTGTTGT TCGGTTGaAA AGCCTTGTAG 1380 

ATGACCGAAG TAGACCGATA AATAAATGAG TGTAATCAAC AATATTGTTG TAACGATAgT 1440 

GCtG^ATAAA TGCaGCTTGT GTTTTAACAT TGTAACTATT GAGTACXSAAT AAATTACAAG 1500

CGCCAATAAT AATGCAGAAA AAGAATGTGA CGACATAAAT CGGTACGCCA AAAATAATCA 1560 

ATACAATACT AATAATTAAA ATAGCGAAAT TTAAAAATAG GGTTAAATAA GAGATGAATC 1620 

CCTTTTTACC TCCGAAAATT ATCATCAGAA AGAGGAGCAA TAACGCCAAT ATAAATACAG 1680

CATTCATTGT TTCGCCCTCC TTAATGTTTC AAATATTTCC ATAAACAATA TTGTGATAGG 1740 
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CATCGAAATA GTATAAGTCA CTGTATTGGC ATTTTTTAAA AAGATTAAAA ACATAGGTAG 1860 

TGCACCGGAT AAATATGAGA ATAATAAGAT GTTAGTCATT GTTCCCATAA TATCTTGGCC 1920 

GATGTTTCGC CCAGCAAGCG CCCATCTCCT CATTGAAATG TGTGGCXjTAC GCTGTAAAAT 1980 

TTCATGCATA CCACTAGCAA TTGTAATTGC AACATCCATA ATAGCGCCAA GTGAACCTAT 2040 

TAACACTGAG GCTAGGAAGA TATCTTTCGG TGGTAATGAT AAAAAGTTCA TCGTTTCATA 2100 

TTTAATGCCT TTACCATCTG TCATATATAT GATTAATTCT GTTAAACCTA TACTCAAAAA 2160 

AGTTCCGATA ATTGTACTGG CTATGGTAAT GAGTGTACGC ATATGCCAGC CTGTAACGAG 2220 

CAATAAAGTG AGTATTGTTG AACAGATCAT GGCAATGGTC ATGAGTAAGA ATAAATTAAT 2280 

ATTGCTATGT TGAATATGAA TGTAAATTGC GATTAATATG GCAATAGAAT TCAAGATTAA 2340 

CGATAAAATC GATTGCAGTC CGACTTTGCG ACCAACCAAT AATACAGTTA ATAAGAACAA 2400 

ACCAGTGATG ATAACCGTTA AGGTATCACG CTTCTTTTCT ATAATATAAG CATCACTCGG 2460 

CTTGTTAGAA ATATGTAATA ATACTTTTTC GTGTGTGCGA AATGCCTCAG AATCTGCTTG 2520 

CGATTTGACX3 TACTGATGAT TAATCGTCGT CGTTTCTCCA GCAAATTGAC CATTTAATAT 2580 

TTTGACTTTT AATTGATTTT TATATTTAAT ATCAOGATTA TTTTGTGCAT CTTTTGTAGG 2640

TGTCGAAGAA ACATGTTTGA CATCTATAAT TTGACCAATT GGTTTGTTGT AAAAGTTCTC 2700 

ATTATTGAAT GTAAATAAAA TAGCACCAAT GAATGCGATG CAGAACAAAC CTAAAATTAT 2760 

ATTAAATGGC TTTGTAAATA AATTTCTATA TTTCAAAAAC AAAACCCCAA TTCTATGAAT 2820

GAATTAATAT GGTGATTATA CGCCCTTAAT TTTTTATTTT CAAAGATATT ACTGCTAAGT 2880 

GTAAAACQAA AATCATCATT GATAGCATCQ AATTACTTAA TGGAATGTAG ACGTTTTAGT 2940 

CATTAATTGC TGAATAAGTG TTAATAATAT GCCAATATCA CTCTTTGTAT AAGGCTCCTT 3000 

TGTi^ATAGCA CATATCGTTC TTTTTAATTC AGTATGATCT AATTTTATAT CTATCCATGA 3060 

TTTAGATTCT GGTAAATGTA TATTTTGTGA TGAAATGATG TAACCTTCTT TTTGACGAAG 3120 

GAGATACTGC GCAAGTGGTT GGCTACTGAT TGTGTATACA TCTGATTTAG TAATCTTGCG 3180 

CAATTGTTTT TTTACAGTTT CGGCAAATGG TGCCAAGCAA TAAATATGAC TATGCTCAAA 3240 

CTGAATTAAT GGTGGGTGTG TCGCCATCGT AATTGGATCG TCTGAAGGCG CATATAAATG 3300 

ATAGTGCTCT TCGAATAAAG GTAGCATATG TAATTGTTTG TGTTTACGTA TTTCTGGTGT 3360 

AAGTTCCGTG AAACCAATGT CTATATTCCC ATTTAATACG CTATTTATAA TTGTGTCATG 3420 

TTCTAATAAG CTCGGTATGA CATGTGTATC ATTTTGTAAA TGAAACGTTT GGATAAGTGG 3480 

TAGTAACATG TGGGATACGT CACTCTCATC ATAGCCAATG TAGATACTTT TATTTTTAGT 3540 
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TTCATTAAAT AATAATTTCC CTTCAGATGT GAGCGTAATA TTGCGTCCTT GCTTTTTAAA 3660 

TAAAGACACA TTAAGTTCTT GTTCTAATAA TGTAATTTGA CGGCTTATCG CTGATTGAGC 3720 

AATGTTTAGT TCAAGTGCTG TTTCGGAGAT ATGTTCTCTT TTAGCGACCT CGATAAAATA 3780 

TCTTAATTGT TTAATTTCCA TAGCGATATA GGCACCTCCA AAAATGAGTG TTTTGTAACT 3840 

ATTATAGCAA TATTATTGAT AAATGTTCTA TTTTTTAGAT GAATATCTTC TATTTTATAT 3900 

ATTGAACAGA TAAATTTTTT AGATTATAGT AATTATCATT AATAACTAAT ATCAGAATAT 3960 

TCTAAAAAAG GGGTGTGCAT CATGCACAAT GAGAAATTAA TTAAAGGCTT ATATGACTAT 4020 

CGTGAGGAAC ATGATGCGTG TGGTATTGGT TTTTATGCGA ATATGGATAA TAAAAGGTCT 4080

CACGACATCA TTGATAAATC GCTTGAAATG TTGCGACGCT TAGATCACAG GGGCGGGGTC 4140 

GGCX;CAGATG GCATCACTGG TGATGGCGCA GGTATTATGA CTGAAATACC TTTTGCATTT 4200 

TTCAAACAAC ATGTAACGGA CTTTGATATC CCAGGTGAAG GTGAATATGC CGTGGGGTTA 4260

TTTTTTTCCA AAGAACGCAT TTTAGGTTCT GAACATGAAG TAGTTTTTAA AAAATATTTT 4320 

GAAGGCGAAG GGTTATCAAT TCTTGGTTAT CGTAATGTAC CAGTTAATAA AGATGCCATT 4380 

GCTAAACATG TAGCAGATAC GATGCCAGTC ATTCAACAAG TGTTTATTGA TATTAGGGAC 4440

ATTGAAGATG TTGAAAAGCG TTTGTTTTTA GCGAGAAAAC AATTAGAGTT CTATTCGACT 4500 

CAGTGCGATT TAGAATTGTA TTTTACGAGC TTATCACGCA AAACAATTGT ATATAAAGGT 4560 

TGGTTACGAT CAGACCAAAT TAAAAAACTA TATACAGATT TATCGGATGA TTTATATCAA 4620 

TCAAAGCTAG GGTTAGTGCA TTCGAGATTT AGTACGAATA CATTCCCGAG TTGGAAAAGG 4680 

GCACATCCTA ACCGTATGTT AATGGATAAT GGTGAGATTA ACACGATTAA AGGTAATGTA 4740 

AACTGGATGC GAGCACXKTCA ACATAAATTA ATCGAAACAT TATTTGGCGA GGATCAACAT 4800 

AAAGTGTTTC AAATTGTCGA TGAGGATGGT AGTGACTCTO CCATTGTAGA TAATGCGCTA 4860 

GAGTTCTTAT CGTTAGCCAT GGAGCCAGAA AAGGCAGCGA TGTTACTCAT ACCTGAACCT 4920 

TGGTTATATA ATGAAGCGAA TGATGCAAAT GTACGTGCGT TTTATGAATT TTATAGTTAT 4980 

TTAATGGAAC CGTGGGATGG TCCTACAATG ATTTCGTTCT GTAACGGTGA CAAACTTGGC 5040 

GCGCTTACAG ATAGAAATGG ATTACGTCCA GGTCGTTATA CGATTACTAA AGATAACTTT 5100 

ATTGTCTTTT CATCTGAAGT GGGTGTTGTG GACGTACCTG AAAGTAATGT TGCTTTTAAA 5160 

GGTCAATTGA ATCCTGGAAA GTTATTGCTT GTTGATTTTA AACAGAATAA AGTCATTGAA 5220 

AATAATGATT TAAAAGGTGC GATTGCTGGA GAATTACCAT ATAAAGCGTG GATTGATAAC 5280

CATAAAGTTG ACTTTGATTT TGAAAATATA CAATATCAAG ATTCGCAATG GAAAGATGAG 5340 
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CAGGAACTTG TAGAAGGTAA GAAGGATCCT ATCGGTGCAA TGGGATATGA TGCGCCAATT 5460 

GCAGTG1TGA ACGAGCGACC AGAATCACTA TTTAATTACT TTAAACAGCT GTTTGCACAA 5520 

GTTACGAATC CACCAATTGA TGCX5TATCGT GAAAAAATCG TAACGAGTGA ACTTTCTTAT 5580 

TTAGGTGGCG AAGGTAACTT ACTAGCACCT GACGAAACGG TTTTAGATCG TATTCAATTG 5640 

AAAAGGCCGG TATTGAATGA ATCACACTTA GCAGCGATTG ATCAGGAACA TTTTAAATTA 5700 

ACTTATTTAT CAACGGTATA TGAAGGGGAT TTGGAAGATG CGTTAGAAGC ATTAGGCCGA 5760 

GAAGCAGTGA ATGCTGTAAA GCAAGGCGCT CAAATTCTAG TGTTAGATGA TAGTGGATTA 5820 

GTTGATAGCA ATGGCTTTGC AATGCCX3ATG TTACTCGCAA TAAGTCATGT GCATCAATTA 5880

CTTATTAAAG CAGATTTACG TATGTCTACA AGTTTAGTCG CTAAATCTGG TGAGACACGA 5940 

GAAGTGCATC ATGTTGCTTG TTTACTCGCA TATGGCX3CGA ATGCAATTGT GCCATACCTA 6000 

GCGCAACGTA CAGTTGAACA ACTGACATTG ACAGAAGGGT TACAAGGCAC CGTTGTCGAT 6060

AATGTTAAGA CATATACGGA TGTATTGTCA GAAGGTGTCA TTAAAGTAAT GGCTAAGATG 6120 

GGAATTTCGA CAGTGCAAAG TTATCAAGGG GCACAAATAT TTGAAGCGAT TGGCTTGTCT 6180 

CATGATGTGA TTGATCGTTA TTTTACTGGG ACACAGTCTA AGTTATCTGG TATTTCGATT 6240 

GATCAAATTG ATGCTGAAAA TAAAGCACGT CAACAAA6TG ATGATAATTA TCTTGCATCA 6300 

GGTAGTACAT TCCAATGGAG ACAACAAGGT CAACATCATG CTTTTAATCC GGAATCTATT 6360 

TTCTTATTGC AGCACGCATG TAAAGAAAAT GACTATGCGC AATTTAAAGC ATACTCTGAA 6420 

GCGGTGAACA AAAATAGAAC AGATCACATT AGACATTTAC TXXSAATTTAA AGCATGTACA 6480 

CCGATTGACA TCGACCAAGT TGAACCGGTA AGTGACATTG TCAAACGCTT TAATACAGGG 6540 

GCGATGAGTT ATGGATCGAT TTCAGCGGAA GCACATGAAA CGTTAGCACA AGCCATGAAC 6600 

CAAtTAGGTG GAAAGAGTAA TAGTGGTGAA GGTGGCGAAG ATGCAAAACG TTATGAAGTA 6660 

CAAGTTGATG GAAGCAACAA AGTAAGTGCG ATTAAACAAG TTGCTTCTGG GCGTTTTGGT 6720 

GTAACTAGTG ATTATTTACA ACATGCCAAA GAAATTCAAA TTAAAGTTGC GCAAGGTGCA 6780 

AAGCCTGGTG AAGGTGGTCA ATTACCTGGT ACTAAGGTAT ATCCGTGGAT TGCGAAGACA 6840 

AGAGGGTCAA CGCCAGGTAT CGGTCTGATT TCACCACCGC CACATCATGA TATTTATTCA 6900

ATAGAAGATT TAGCGCAACT GATACATGAT TTGAAAAATG CGAATAAAGA TGCAGATATC 6960 

GCGGTAAAAT TAGTTTCGAA AACAGGTGIT GGTACCATTG CATCTGGGGT GGCAAAAGCA 7020 

TTTGCAGATA AAATTGTCAT CAGTGGTTAC GATGGTGGTA CAGGGGCTTC ACCCAAAACG 7080

AGTATTCAGC ATGCCGGTGT TCCTTGGGAG ATTGGTTTAG CAGAAACACA TCAAACATTA 7140 
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AAAGATGTAG CGTACGCATG TGCGCTTGGA GCGGAAGAAT TTGGATTTGC AACTGCACCA 7260 

TTAGTGGTGT TGC3GCTGTAT TATGATGCGT GTATGCCATA AAGATACATG TCCAGTAGGA 7320 

GTTGCAACTC AAAACAAAGA TTTACGTGCT TTATATAGAG GTAAAGCACA TCATGTTGTT 7380

AATTTTATGC ATTTTATTGC ACAAGAATTA AGAGAAATTT TAGCATCTTT AGGTTTGAAA 7440 

CGTGTAGAAG ACTTAGTTGG AAGAACTGAT TTATTACAAC GATCATCAAC ATTAAAAGCG 7500 

AATAGCAAAG CGGCTAGTAT TGATGTTGAA AAACTGTTAT GTCCTTTCGA TGGGCCAAAC 7560 

ACAAAAGAAA TTCAACAAAA TCATAATCTT GAGCATGGAT TTGATTTAAC AAA1TTATAT 7620 

GAAGTAACGA AGCCATATAT TGCTGAAGGG CGTCGCTATA CAGGTAGCTT TACAGTAAAT 7680 

AATGAACAAC GTGATGTAGG GGTTATTACA GGTAGTGAGA TTTCGAAACA ATATGGAGAA 7740 

GCAGGACTTC CTGAAAATAC AATTAATGTT TATAOGAATG GTCATGCTGG TCAAAGTCTT 7800

GCAGCATATG CACCX5AAAGG CTTAATGATT CATCATACTG GAGATGCGAA TGACTATGTT 7860

GGTAAAGGAT TATCTGGTGG TACX»TCATT GTCAAAGCAC CTTTTGAAGA ACGACAAAAT 7920 

GAAATTATTG CTGGTAACGT CTCATTCTAT GGTGCGACAA GTGGTAAGGC ATTTATTAAC 7980 

GGTAGTGCAG GAGAAAGATT CTGTATTAGA AATAGTGGTG TAGATGTTGT CGTTGAAGGT 8040

ATCGGCGACC ATGGATTAGA GTATATGACT GGTGGACATQ TCATTAATTT AGGTGATGTA 8100 

GGTAAGAACT TCGGTCAAGO TATGAGTGGT GGTATTGCTT ACGTTATCCC GTCTGATGTA 8160 

GAAGCTTTTG TTGAAAATAA TCAACTAGAT ACGCTTTCGT TTACAAAGAT TAAACACCAA 8220

GAAGAAAAAG CATTCATTAA GCAAATGCTG GAAGAACATO TGTCACACAC GAATAGTACG 8280 

AGAGCGATTC ATGTGTTAAA ACATTTTGAT OGCATTGAAG ATGTCGTCGT TAAAGTTATT 8340 

CCTAAAGATT ATCAATTAAT GATGCAAAAA ATTCATTTGC ACAAATCATT ACATGACAAT 8400 

GAAGATGAAG C6ATGTTAGC TGCATTTTAC GATGACAGTA AAACAATCGA TGCTAAACAT 8460 

AAACCAGCCG TTGTGTATTA AGGAAAGGGG GAGATACGAT GGOTGAATTT AAAGGATTTA 8520 

TGAAGTATGA CAAACAGTAC TTAGGTGAAT TATCACTGGT AGACCGTTTG AAGCATCATA 8580 

AAGCATATCA ACAACXSATTT ACTAAAGAAG ATGCCTCTAT CCAAGGTOCA CGATGTATGG 8640 

ATTGTGGAAC GCCGTTTTGT CAAACCGGAC AACAGTATGO TAGGGAAACA ATAGGTTGTC 8700 

CAATTGGAAA CTACATTCCT GAATGGAACG ACTTAGTGTA TCATCAAGAT TTTAAAACTG 8760 

CTTATGAACG CTTAAGCGAA ACAAATAACT TTCCTGACTT TACAGGGCGT GTATGTCCTG 8820 

CACCATGCGA AAGTGCTTGT GTGATGAAGA TTAATAGAGA ATCGATTGCG ATTAAAGGTA 8880

TTGAACGCAC AATTATTGAT GAAGCTTTTG AAAATGGTTG GGTAGCGCCG AAAGTTCCGA 8940 
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CTGAAGAACT TAATCTACTA GGATATCAAG TAACTATTTA TGAACGTGCT AGAGAATCAG 9060

GCGGTrTATT AATGTATGGT ATTCCGAATA TGAAACTTGA TAAAGATGTG GTTCGACGTC 9120

GTATTAAGTT AATGQAAGAA GCGGGCATTA CTTTCATTAA TGGTGTTGAA GTCGGTGTTG 9180

ATATTGATAA AGCAACGTTA GAATCTGAGT ATGATGCCAT TATATTATGT ACTGGTGCAC 9240

AAAAAGGTAG AGATTTACCT TTAGAAGGAC GCATGGGTGA TGGTATACAT TTCGCTATGG 9300

ATTATTTAAC TGAACAAACG CAGTTGTTAA ATGGAGAAAT TGATGATATA ACAATAACTG 9360

CAAAAGATAA GAATGTCATT ATCATTGGTG CTGGTGATAC AGGGGCAGAC TGTGTAGCGA 9420

CAGCATTAAG AGAAAATTGT AAATCGATTG TTCAATTTAA TAAATATACG AAATTGCCAG 9480

AAGCAATTAC ATTTACAGAA AATGCATCAT GGCCTTTAGC AATGCCGGTG TTTAAAATGG 9540

ACTATGCGCA CCAAGAGTAC GAAGCTAAGT TTGGTAAGGA ACCACGTGCA TATGGTGTTC 9600

AAACAATGC6 TTACGATGTT GACGATAAAG GACACATACG TGGTTTGTAT ACTCAAATTT 9660

TAGAGCAAGG CX3AAAATGGT ATGGTCATGA AAGAAGGACC TGAAAGATTT TGGCCTGCTG 9720

ACCTTGTATT ATTATCAATC GGCTTCGAAG GTACAGAACC AACAGTACCG AATGCTTTTA 9780

ACATTAAAAC GQATAGAAAT CGAATCGTGG CGGATGATAC AAACTATCAA ACTAATAATG 9840

AAAAGGTATT T6CTGCTGGA GATGCTAGAC GTGGTCAAAG TTTAGTTGTA TGGGCAATTA 9900

AAGAAGGTAG AGGCGTAGGG AAAGCAGTAG ATCAGTATTT AGCTAGTAAA GTTTGTGTAT 9960

AATCTTTGTA TGGAAATGGT GGTTACGTTG ACGTTGTGAC ATGCTGAATC GAGTTTGAAA 10020

AAATCTAGTA TCTATCAACG TCACATGCCA TCTTTGTAAC CTAAAAACAA AGGTTTGTAA 10080

GACAACAAAT AGATTAATTA TAAGTAGTGA TTTTTTACAT TCGTTTATAG GTCAACTGTA 10140

GTGGAAGACA ATGATTTGTG GTAATCATGT AATGCTTAAA AACAATATTG ACTTTTACAG 10200

AACCHTCATA TATGATAAAT ATTGTGTTTA GGAGGAATAC CCAAGTCCGG CTGAAGGGAT 10260

CGGTCTTGAA AACCGACAGG GGCTTAACGG CTCGCGGGGG TTCGAATCCC TCTTCCTCCG 10320

CCATCAATAT TTATATTAAA TTCTATATAT AATGAAGGTA AGTGCTCAAA TTTTGAGTAT 10380

TTACCTTTTT TATTTGTCTT TGAATGGCTC GTAATTTTTG ATAATAGAAA TGATAAGGCA 10440

TTGAGATTGG AAGGGCATTT GGCTTGTGCA ATATACATAG CTAAATGTCT TTTTTGTTTT 10500

GTGAAATATG ATGGATGGCT TGTGTGGACA AGTTTGCTAT TTATAGATAT GCATTTTTCA 10560

ATTTAGGAGT TGGCCATGCA TCTACACTTT ATAATGGTGA GAGCGTGGTG AGGTATTGTT 10620

AATAACGCAA TTGTAGCGAG GAGTTATTGC TACATATGTC GTTATGGCTC ATTGATTTTC 10680

TGAAATGGCT ACCCCAGATA ATTGTGACAA AATAAAAATA TTTTGTTGAA AGCCTTTACA 10740
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TAAAAAGAGA AGATGTAAAA GCCATCGTAA CCGCTATTGG GGGAAAAGAA AATCTTGAAG 10860

CTGCAACGCA TTGTGTAACA CGATTACGTT TAGTGCTGAA GGATGAAAGT AAAGTTGATA 10920

AAGACX^CATT AAGTAATAAC GCGTTGGTCA AGGGGCAGTT TAAAGCAGAC CATCAATATC 10980

AAATTGTCAT TGGTCCAGGA ACAGTCGATG AAGTGTATAA GCAGTTTATT GATGAAACAG 11040

GTGCTCAAGA AGCTTCGAAA GATGAAGCGA AACAAGCAGC TGCACAAAAA GGGAATCCAG 11100

TACAACGTTT GATCAAATTG TtGGGGGATA TTTTTATACC AATATTACCT GCGATTGTGA 11160

CAGCTGGTTT GTTAATGGGA ATCAATAATT TACTTACAAT GAAAGGTTTA TTTGGTCCAA 11220

AAGCACTTAT TGAGATGTAT CCACAAATTG CTGATATTTC AAACATCATT AATGTGATTG 11280

CGAGTACGGC ATTTATTTTC TTACCAGCAT TAATTGGTTG GAGTAGTATG CGTGTATTTG 11340

GTGGTAGTCC GATTCTAGGC ATAGTCTTAG GTTTGATTTT AATGCATCCG CAATTAGTAT 11400

CTCAGTATGA TTTGGCAAAA GGGAATATTC CGACGTGGAA CTTATTTGGC TTAGAGATTA 11460

AGCAGTTGAA TTACCAAGGT CAAGTGTTGC CAGTtTTAAT TGCAGCTTAC GTTCTAGCTA 11520

AAATTGAAAA AGGATTAAAT AAAGTCGTTC ACX5ATTCGAT AAAAATGTTG GTCGTTGGAC 11580

CCGTAGCGCT TTTAGTTACT GGATTTTTAG CATTTATTAT CATTGGACCA GTTGCGTTAT 11640

TGaTTGGTAC AGGTATTACA TCTGGTGTTA CATTTATATT CCAACATGCA GGATGGCTTG 11700

GCGGAGCAAT ATATGGATTG TTATATGCAC CACTT6TAAT TACAGGACTA CACCATATGT 11760

TTTTAGCAGT AGATTTCCAA TTGATGGGTA GCAGCTTAGG CGGTACGTAT TTATGGCCAA 11820

TTGTTGCGAT TTCCAATATT TGTCAGGGCT CTGCAGCATT TGGAGCATGG TTTGTCTATA 11880

AACGTCGTAA AATCGTTAAA GAAGAAGGCT TGGCATTAAC ATCTTGTATT TCTGGTATGT 11940

TAGGTGTTAC TGAACCAGCC ATGTTGGGTG TGAACTTACC TCTGAAATAT CCATTTATCG 12000

CTGQGATATC AA0GTCTT6T GTATTGGGGG CAATCGTTGG TATGAATAAC GTACTTGGAA 12060

AAGTTGGTGT TGGTGGCGTG CCAGCATTCA TTTCAATTCA AAAAGAATTT TGGCCAGTAT 12120

ATCTTATTGT GACAGCTATT GCTATTGTTG TACCATGTAT ACTAACAATT GTGATGTCTC 12180

ATTTTAGTAA ACAAAAAGCG AAAGAAATTG TTGAAGATTA ATAAAATAAA AAAGGGGCGT 12240

TCGTTATTTG GACGTCCTTT ATTACGTTAT AAGGTGGTAA TTGTGTGTCG AAAGAAATAG 12300

ATTGGAGAAA ATCCGTTGTA TATCAAATTT ATCCTAAGTC GTTTAATGAT ACGAGGGGGA 12360

ATGGTATAGG AGATATCAAT GGAATTATAG AAAAATTGGA TTATATCAAG TTATTGGGTG 12420

TTGATTATAT TTGGTTAACA CCAGTGTATG AATCACCGAT GAATGATAAT GGCTATGATA 12480

TCAGCAATTA TTTAGAAATC aATGAAGACT TTGGAACGAT GGATGATTTT GaAAAGTTAA 12540
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CX^ACGGAGCA TGaATGGTTT AAAGAAGCCC GTAAATCTAA AGATAACCCy TATAGAGATT 12660

ATTACTTTTT CAGATCATCT GAAGACGGGC CGCCAACAAA TTGGCATTCT AAATTCGGTG 12720

GTAATGCATG GAAGTATGAT TCTGAGACAG ATGAATATTA TTTACATTTA TTTGATGTCA 12780

GTCAAGCTGA TTTAAATTGG GATAATCCGG AAGTACGTCA ATCGTTATAT CGCATAGTCA 12840

ATCATTGGAT AGACTTCGGC GTTGATGGTT TTCGATTTGA TGTCATTAAC TTAATTTCTA 12900

AAGGTGAATT TAAGGACTCT GACAAAATAG GTAAAGAATT TTATACGGAT GGTCCTAGAG 12960

TGCATGAGTT TCTGCATGAA TTAAATCGTC AAACGTTTGG TAACACTGAC ATGATGACTA 13020

TAGGAGAAAT GTCTTCGACG ACGATTGAAA ATTGTATTAA GTATACACAA CCAGAACGCC 13080

AAGAATTGAA TAGTGTTTTT AATTTTCATC ATCTAAAGGT TGATTATGTT GATGGTQAAA 13140

AGTGGACAAA TGCGAgcTTG nATTTTCATA AGTTAAAGGA AATTCTGATG CAATGGCAAC 13200

GAGGTATTTA TGACGGTGGC GGATGGAACG CGATTTTCTG GTGTAATCAT GATCAGCCAC 13260

GGGTAGTGTC TAGATTTGGT GATGATACGT CGGAAGAGAT GAGGATACAA AGTGCTAAAA 13320

TGTTAGCTAT cgcactgcat ATGTTGCAAG GGACGCCATA TATTTACCAA GGTGAAGAAA 13380

TTGGTATGAC GGACCCACAT tttacatcaa TAGCACAATA TCGTGATGTT GAATCGATTA 13440

ATGCCTACCA TCAGTTGTTA agtgaagggc ATGCTGAAGC GGATGTGTTA GCGATTTTAG 13500

GACAGAAGTC ACGAGACAAT tcgagaacxk: CTATGCAATG GAGTGATGAT GTTAATGCTG 13560

GATTTACAGC TGGTAAnCCT tggattgata TTTCGGAAAA TTATCATCAG GTCAACGTTA 13620

GACAAGCACT TCAGAATAAA GAGTCTATTT TCTATACGTA TCAAAAATTA ATACAATTAA 13680

GACATACGCA TGATATTATT ACGTATGGAG ACATTGTGCC ACGTTTTATG GATCATGATC 13740

ATTTATTTGT TTATGAACGT CATTATAAGA ATCAACAATG GCTAGTAATT GCGAATTTCT 13800

CAGCATCGGC TGTTGATTTG CCAOAAGGAT TGGCTAGAGA AGGTTGTGTT GTGATTCAAA 13860

CAGGCACAGT GGAAAATAAT ACGATAAGCG GGTTTGGTGC AATTGTAATC GAAACAAACG 13920

CGTAAAATAA ATTGAGTGGA TGCGTTTATA TGGCGAAACA AAAAAAGTTT ATGAAGATTT 13980

ATGAGGCGTT GAAAGAAGAT ATATTAAACG GGCAGATTCA ATATGGTGAA CAAATTCCGT 14040

CTGAACATGA TTTGGTGCAA TTGTACCAGT CATCTCXSAGA GACCGTGCGT AAGGCATTAG 14100

ATTTGTTGGC ATTAGACGGC ATGATTCAAA AGATTCATGG TAAAGGGTCA CTTGTCATTT 14160

ATCAGGAGGT TACAGAGTTT CCATTTTCTG AACTTGTTAG TTTTAAAGAA ATGCAAGAAG 14220

AAATGGGCGT CGCATATTTA ACTGAAGTTG TTGTGAATGA GGTTGTTGAA GCGCATGAAG 14280

TTCCAGAAGT TCAACATGCT TTAAACATCA ATTCTAGTGA ATCACTCATT CATATTGTTA 14340
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TTGTTTCAGA TATAGGTAAT GATGTTGCGA 
TATTAAATCT TAATATTAGT TATTCAAGTA 
AAGCATATCA ATTGTTTGGT GATGTATCGG 
TGTATTTAGA AAATACAATG CCGTTTCAAT 
TTAAATTTAA TGACTTCTCA AGACGTCGTA 
CTTGCAATTA ACTATTAAAA TATAGTAATA 
CGGTTCCCTG TACTCGAAAT CCGCTTTATG 
TTTTGCGAAG TCTGCCCAAA GCACGTAGTG 
CCCATGAACC ATGTCAGGTC CTGACGGAAG 
AGGgTAGCCG AGATTTAGCT AACGACTTTG 
GGTGCACGGT TTTTTATTTT TTAAATATTA 
TTATAGAAGC TACTTTCTTG AAGACAATTC 
AAGTAGCTTT TTTATATGTG AAGTTTGATT 
TTTTGTGTCA AT6AAAAGTA AGAAGTTATA 
AGGGGGAGTA TCTTACAATA GAATTATTAA 
TGCCTACGGA GGACATATGC AAATATATTT 
ATCTTTAAAT AGTATTGAAG AAAGTTTTGA 
TGCGAAAGTA AAACATTTAA GAAAATCTCC 
GAAAAATGAA AATAACGATG TCGTTGGACA 
TGATGATAAG ACGTATTATG GTTTGGCGAT 
TGGACAAAAA TTAGGTCGTG GCTTGGTTCA 
GTATAGTAC6 GTTGTTGTAG ACCATTGTTT 
TGCTGCTGAG CATGACATTA AATTAGAATC 
ATGGGATAAT TTGACGGATG CACCACACGG 
ATTGTTCAAT TAAGAAGTAA AGGTATTATC 
GGTGCTAACT TGAATTATCA AGCCTTATAT 
GTCGTCGGAC AAGAACATGT CACGAAGACA 
TCGCATGCTT ATATTTTTAG TGGTCCGAGA 
TTTGCTAAAG CAATCAACTG TCTAAATAGC 



GTGATTCTAT TTATGATTAT TTGGAAAAGG 14460 

AGTCTATTAC TTTTGAACCG TTTGATGAAC 14520 

TGGCTTATTC AGCAACAGTT CGAAGTATTG 14580 

ATAATATTTC AAAACATCTT GCAAATGAAT 14640 

TAAAGTAAAC AATGATATAA ATGATTTATA 14700 

TATATCTTGC CGTGCTAGGT GGGGAGGTAG 14760 

CGAGGCTTAA TTCCTTTGTT GAGGCCGTAT 14820 

TTTGAAGATT TCGGTCCTAT GCAATATQAA 14880 

CAGCATTAAG TGGATCATCA TATGTGCCGT 14940 

GTTACGTTCG TGAATTACGT TCGATGCTTA 15000 

AACCGATTAT TAAGAGTTGA AAATATATAA 15060 

AGCGTATTAT ACGTGGAACA TGTTTGTGGG 15120 

CAAGTGAACT CGATGTGCAG TTTGAATGAT 15180 

ATTTGATGAT AAAGAAATGA TGGTGAAATG 15240 

TGAGATACX5T TATGATTATT GACAATCAAA 15300 

AAGTACTTTA ACAGAGTTAG ATTATGATAA 15360 

TGATAATCCT GAAACGAGTT GGCAAGCACG 15420 

TTGCTATAAT TTTGAATTAG AAGTAATAGC 15480 

CGTTTTATTA ATTGAAGTAG AAATTAATAG 15540 

TGCCTCTTTA TCAGTTCATC CTGAATTACSG 15600 

AGCAGTAGAA GAGCGTGCCA AAGCACAAGA 15660 

TGACTACTTT GAAAAGTTGG GTTATCAAAA 15720 

TGGTGATGCA CCGTTACTTG TAAAATATTT 15780 

AATGGTAAAA TTTCCAGAAC ATTTTTATTA 15840 

ATGCTATAAT GAGAGGTAAT TGTTTATGGA 15900 

CGTATGTACA GACCCCAAAG TTTCGAGGAT 15960 

TTGCGCAATG CGATTTCGAA AGAAAAACAG 16020 

GGTACX3GGGA AAACGAGTAT TGCCAAAGTG 16080 

ACTGATGGAG AACCTTGTAA TGAATGTCAT 16140 
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AATAATGGCG TTGATGAAAT AAGAAATATT 
TCGAAATATA AAGTTTATAT TATAGATGAG 
GCCCTTTTAA AGACGTTAGA AGAACCTCCA

GAACCACATA AAATCCCTCC AACAATCATT 
ATTAGCCTAG ATCAAATTGT TGAACGTTTA 

TGTGAAGATG AAGCCTTGGC ATTTAtcgCT 
TTAAGTATTA TGGATCAGGC TATTGCATTT 
TTGAATGTCA CAGGTAGCGT ACATGATGAA 

CAAGGTGACG TACAAGCATC mTAAAAAA 
GTOAATCGCC TAATAAATGa TATGATTTAT 
TCTGAGAAAG ATACTGAGTA TCGAGCACTG 

ATGATTGATC TTATTAATGA TACATTAGTG 
CATTTTGAAG TGTTGTrAGT AAAATTAGCT 
GCGAATGTAG CTGAACCAGC ACAAATTGCT 
CGTATGGAAC AGTTAGAGCA AGAACTAAAA 
CCTGTTCAAA AATCTTCGAA AAAGCCTGCG 

TCAATGCAAC AAATTGCAAA AGTGCTAGAT
AAAGATCATT GGCAAGAAGT GATTGATCAT 
AGTTTATTGC AAAATTGGGA ACCTGTGGGG 

GAGGAAGAGA TCCATTGTGA AATCGTCAAT
AGTCTTGTAT GTAATATGGT TAATAAAAAC 
TGGCAAAGAG TTCGPAOGGPi ATATTTACAA 
AAGCAACAAG CACAACAAAC AGATATTGCT 
ACTGTACATG TGATAGATGA AGAGTGATAC 
AAAGAAACAT CATTTTATTG ATAAATATTT 

GCGGTGGCGG AAACATGCAA CAAATGATGA 
CTCAAGAACA AGAAAAACTT AAAGAA6AGC 
TTGCAGTTAC TGTAACTGGT CATAAAGAAG 

TAGACCCAGA GGATATTGAA ATGCTACAAG 



AGAGACAAAG TTAAATATGC ACCAAGTGAA 16260 

GTGCACATGG TAACAACAGG TGCTTTTAAT 16320 

GCACACX3CTA TTTTTATATT GGCAACGACA 16380 

TCTAGGGCAC AACGTTTTGA TTTTAAAGCA 16440

AAATTTGTAG CAGATGCACA ACAAATTGAA 16500 

AAAGCGTCTG AAGGGGGTAT GCGTGATGCA 16560 

GGTGATGGTA CGTTAACATT GCAAGATGCG 16620 

GCGTTGGATC ACTTGTTTGA TGATATTGTA 16680 

TACCATCAGT TTATAACAGA AGGTAAAGAA 16740 

TTTGTCaGAG ATACGATTAT GAATAAAACA 16800 

ATGAACTTAG AATTAGATAT GTTATATCAA 16860 

TCGATTCGTT TTAGTGTGAA TCAAAACGTT 16920 

GAGCAGATTA AGGGTCAACC ACAAGTGATT 16980 

TCATCGCCAA ACACAGATGT ATTGTTGCAA 17040 

ACACTAAAAG CACAAGGAGT GAGTGTCGCT 17100 

AGAGGCATAC AAAAATCTAA AAATGCATTT 17160 

AAAGCGAATA AGGCAGATAT CAAATTGTTG 17220 

GCCAAAAATA ATGATAAAAA ATCACTCGTT 17280 

GCAAGTGAAG ATCACGTACT TGTGAAATTT 17340 

AAAGACGACG AGAAACGTAG TAGTATAGAA 17400 

GTTAAAGTTG TTGGTGTACC ATCAGATCAA 17460 

AATCGTAAAA ACGAAGGCGA TGATATGCCA 17520 

CAAAAAGCAA AAGATCTTTT CGGTGAAGAA 17580 

ATGACAAGCG ATATAATCGT ATGTATAATG 17640 

ATTGATTTTC AAGGAGGAAA TGGAATATGC 17700 

AACAAATGCA AAAAATGCAA AAGAAAATGG 17760 

GTATTGTAGG AACAGCTGGC GGTGGCATGG 17820 

TTGTCGACGT TGAAATCAAA GAAGAAGCTG 17880 

ACTTAOTGTT AGCAOCTACT AATGAAGCGA 17940 
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TCCCTGGaAT 
TTATGAAATT 
ATATGAAAGA 
TAACATATTG 
ATAAGCAAAG 
TGGAAAAAAT 
TGGATGGCAT 
ATGAAGTTAG 
TGTATATTTC 
TATCGGTAGG 
GTAGAACAGA 
AAGTCACAGT 
GCGGTGTGGC 
CAAGCCGTTG 
TAGATCTATG 
CAAATQAAAT 
GGTCTAAAGC 
TAAATTTATT 
ATATGGTTAG 
TTTTTAAGTT 
ACTGXTGTTA 
CTCAACTTAA 
AAATGTTAAT 
AATGACAATA 
GTATTTATGA 
GAACGCTGGC 
CTGATGTTAG 
ACTTCGGGAA 
AGACGGTCTT 



GTGATCATAG 
GCCAGGCATT 
AGACGATGTT 
TAGCGTATGT 
AGATCGTTCA 
GAGAGAATAC 
TGGACCAGAA 
CGAATTAATC 
TAGATTAGTT 
TGGCGATTTA 
AATGTAATkT 
GTAATCATTG 
GGTGGTATGG 
GTTGTGATTT 
GTTATGGTGT 
TCTTTTGTAA 
TTATTAAATC 
TTTAATTTCT 
AGAAAAATCT 
CQATTTTTAO 
AGCAGTTTGA 
GAAATAACTT 
AAAATGTATA 
TGTCAAOGTT 
GCTAATCAAA 
GGCGTGCCTA 
CGGCGGACGG 
ACCXSkAGCTA 
GCTGTCACTT 



ATGCATTATC 
GGTCCAAAGA 
GTTCAGTTTG 
GGTCACATTA 
GTTATTTGTG 
AAAGGTTTAT 
GATATTAATA 
TTAGCTATGA 
AAGCCTATAG 
GAGTATGCTG 
CTTCTATTAA 
TGGCTTTTTT 
TTTACCTAGT 
GTTACTTCTA 
GTTGGTGCTA 
TTGAAATGAT 
AGCCTGTATA 
GGTAAAAAAA 
GTTTCTTGTT 
GATAAGGGCG 
AAGCCTGTAT 
GAATTACTAA 
ATTAATTCTT 
AATTCCAAAA 
CATCATAATT 
ATACATGCAA 
GTGAGTAACA 
ATACCGGATA 
ATAGATGGAT 



CAGAACCTAT 
CAGCCCAACG 
CCAAAGCATT 
CTGAAAATGA 
TTGTGGAAGA 
ATCACGTTTT 
TTCCTTCATT 
ACCCGAACTT 
GTATCAAAGT 
ACGAAGTAAC 
ACATTTTTGA 
TATGGTGTGG 
TTTACTGAGG 
ATAGTAATGA 
TTAATTTGAT 
AGATGCTGGC 
GCGGTGTTTT 
TAACGTTCTG 
CTAAAAAACG 
TTCAGTACAG 
AGTATTTATT 
CGAAAATTAA 
GTCGGTAAGA 
AACGTAACTA 
TTTATGGAGA 
GTCGAGCGAA 
CGTGGATAAC 
ATATTTTGAA 
CCGCGCTGCA 



ATCAAAACTT 
TCTGGCTTTT 
AGTAGATGTT 
TCCATGTTAT 
TGACAAAGAT 
ACATGGGTCT 
GATTGAACGC 
AGAGGGGGAA 
GACGAGATTA 
ATTATCTAAA 
TTTTAATACT 
TGTGATGTAC 
GATGGGTAAT 
TGTGAATTGG 
AAATGCX^GTT 
TTAGTAAGTT 
GAGAGATTAT 
TTTTGCGTTT 
TACTATTTAT 
ATGACAAAGG 
TGTTGAGGCA 
TTTTAAAAAG 
AAAATGAACA 
TAAGTTACAA 
GTTTGATCCT 
CGGACGAGAA 
CTACCTATAA 
CCGCATGGTT 
TTAGCTAGTT 



ATTGATAGCT 
CATACCTTAG 
AAGAGAGAAT 
ATTTGTGAAG 
GTCATAGCTA 
ATTTCGCCTA 
TTGAAAAACG 
TCTACAGCCA 
GCACAAGGGT 
GCAATCGCAG 
ATAGTAAGAA 
TACTTTATTT 
CTTTAGGAAG 
ATTATCGAAT 
AATGACTATQ 
GTACTTCTTT 
TTAAAACTTG 
TTTTTGATTG 
AAGTGGGGAT 
TGTAATTTTT 
AACAAAACAA 
TTATTGACTT 
TTGAAAACTG 
ACATTATTTA 
GGCTCAGGAT 
GCTTGCTTCT 
GACTGGGATA 
CAAAAGTGAA 
GGTAAGGTAA 



18060 
18120 
18180 
18240 
18300 
18360 
18420 
18480 
18540 
18600 
18660 
18720 
18780 
18840 
18900 
18960 
19020 
19080 
19140 
19200 
19260 
19320 
19380 
19440 
19500 
19560 
19620 
19680 
19740 
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GAGACACGGT CCAGACTCCT ACGGGAGGCA GCAGTAGGGA ATCTTCCGCA ATGGGCGAAA 19860 

gCtGaCGGAG CAACGCCGCG TGAGTGATGA AGGTCTTCGG ATCGTAAAAC TCTGTTATrA 19920 

GGGAAGAACA TATGTGTAAG TAACTGTGCA CATCTTGACG GTACCTAATC AGAAAGCCAC 19980 

GGCTAACTAC GTGCCAGCAG CCGCGGTAAT ACGTAGGTGG CAAGCGTTAT CCGGAATTAT 20040 

TGGGCGTAAA GCGCGCGTAG GCGGTTTTTT AAGTCTGATG TGAAAGCCCA CGGCTCAACC 20100 

GTGGAGGGTC ATTGGAAACT GGAAAACTTG AGTGCAGAAG AGGAAAGTGG AATTCCATGT 20160 

GTAGCGGTGA AATGCGCAGA GATATGGAGG AACACCAGTG GCGAAGGCGA CTTTCTGGTC 20220 

TGTAACTGAC GCTGATGTGC GAAAgCGTGG GGATCAAACA GGATTAGATA CCCTGGTAGT 20280 

CCACGCCGTA AACGATGAGT GCTAAGTGTT AGGGGGTTTC CGCCCCTTAG TGCTGCAGCT 20340 

AACGCATTAA GCACTCCGCC TGGGGAGTAC GACCGCAAGt TGAAACTCAA AGGAATTGAC 20400

GGGGACCCGC ACAAGCGGTQ GAGCATGTGQ TTTAATTCGA AGCAACXJCOA AGAACCTTAC 20460 

CAAATCTTGA CATCCTTTGA CAACTCTAGA GATAGAGCCT TCCCCTTCGG GGGACAAAGT 20520 

GACAGGTGGT GCATGGTTGT CGTCAGCTCG TGTCGTGAGA TGTTGGGTTA AGTCCCGCAA 20580 

CGAGCGCAAC CCTTAAGCTT AGTTGCCATC ATTAAGTTGG GCACTCTAAG TTGACTGCCG 20640 

GTGACAAACC GGAGGAAGGT GGGGATGACG TCAAATCATC ATGCCCCTTA TGATTTGGGC 20700 

TACACACGTG CTACAATGGA CAATACAAAG GGCAGCGAAA CCGCGAGGTC AAGCAAATCC 20760 

CATAAAGTTG TTCTCAGTTC GGATTGTAGT CTGCAACTCG ACTACATGAA GCTGGAATCG 20820 

CTAGTAATCG TAGATCAGCA TGCTACGGTG AATACGTTCC CGGGTCTTGT ACACACCGCC 20880 

CGTCACACCA CGAGAGTTTG TAACACCCGA AGCCGGTGGA GTAACCTTTT AGGAGCTAGC 20940 

CGTCGAAGGT GGGACAAATG ATTGGGGTGA AGTCGTAACA AGGTAGCCXTT ATCGGAAGGT 21000 

GCGC^^TGGAT CACCTCCTTT CTAAGOATAT ATTCGGAACA TCTTCTTCAG AAGATGCGGA 21060 

ATAACGTGAC ATATTGTATT CAGTTTTGAA TGTTTATTTA ACATTCAAAT ATTTTTTGGT 21120 

TAAAGTGATA TTGCTTATGA AAATAAAGCA GTATGCGAGC GCTTGACTAA AAAGAAATTG 21180 

TACATTGAAA ACTAGATAAG TAAGTAAAAT ATAGATTTTA CCAAGCAAAA CCGAGTGAAT 21240 

AAAGAGTTTT AAATAAGCTT GAATTCATAA GAAATAATCG CTAGTGTTCG AAAGAACACT 21300 

CACAAGATTA ATAACGCGTT TAAATCTTTT TATAAAAGAA CGTAACTTCA TGTTAACGTT 21360 

TGACTTATAA AAATGGTGGA AACATAGATT AAGTTATTAA GGGCGCACGG TGGATGCCTT 21420 

GGCACTAGAA GCCGATGAAG GACGTTACTA ACGACGATAT GCTTTGGGGA GCTGTAAGTA 21480 

AGCTTTGATC CAGAGATTTC CGAATGGGGA AACCCAGCAT GAGTTATGTC ATGTTATCGA 21540 
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6AGGAAGAGA AAGAAAATTC GATTCCCTTA GTAGCGGCGA GCGAAACGGG AAGA6CCCAA 21660 

ACCAACAAGC TTGCTTGTTG GGGTTGTAGG ACACTCTATA CGGAGTTACA AAGGACGACA 21720 

TTAGACGAAT CATCTGGAAA GATGAATCAA AGAAGGTAAT AATCCTQTAG TCGAAAATGT 21780 

TGTCTCTCTT GAGTGGATCC TGAGTACGAC GGAGCACGTG AAATTCCGTC GGAATCTGGG 21840 

AGGACCATCT CCTAAGGCTA AATACTCTCT AGTGACCGAT AGTGAACCAG TACCGTGAGG 21900 

GAAAGGTGAA AAGCACCCCXS GAAGGGGAGT GAAATAGAAC CTGAAACCGT GTGCTTACAA 21960 

GTAGTCAGAG CCCGTTAATG GGTGATGGCG TGCCTTTTGT AGAATGAACC GGCGAGTTAC 22020 

GATTTGATGC AAGGTTAAGC AGTAAATGTG GAGCCGTAGC GAAAGCGAGT CTGAATAGGG 22080 

CGTTTAGTAT TTGGTCGTAG ACCCGAAACC AGGTGATCTA CCCTTGGTCA GGTTGAAGTT 22140 

CAGGTAACAC TGAATGGAGG ACCGAACCX3A CTTACGTTGA AAAGTGAGCG GATGAACTGA 22200 

GGGTAGCGGA GAAATTCCAA TCGAACCTGG AGATAGCTGG TTCTCTCCGA AATAGCTTTA 22260 

GGGCTAGCCT CAAGTGATGA TTATTGGAGG TAGAGCACTG TTTGGACGAG GGGCCCCTCT 22320 

CGGGTTACCG AATTCAGACA AACTCCGAAT GCCAATTAAT TTAACTTGGG AGTCAGAACA 22380 

TGGGTGATAA GGTCCGTGTT CGAAAGGGAA ACAGCCCAGA CCACCAGCTA AGGTCCCAAA 22440

ATATATGTTA AGTGGAAAAG GATGTGGCGT TGCCCAGACA ACTAGGATGT TGGCTTAGAA 22500 

GCAGCCATCA TTTAAAGAGT GCGTAATAGC TCACTAGTCG AGTGACACTG CGCCGAAAAT 22560 

GTACCGGGGC TAAACATATT ACCGAAGCTG TGGATTGTCC TTTGGaCAAT GGtAGGAGAG 22620 

CGTTCTAAGG GCGTTGAAGC ATGATCGTAA GGACATGTGG AGCGCTTAGA AGTGAGAATG 22680 

CCGGTGTGAG TAGCGAAAGA CGGGTGAGAA TCCCGTCCAC CX^ATTGACTA AGGTTTCCAG 22740 

AGGAAGGCTC GTCCGCTCTG GGTTAGTCGG GTCCTAAGCT GAGGCCGACA GcGTAGGCX^ 22800 

TGGAfAACAG 6TTGATATTC CTGTACCACC TATAATCGTT TTAATCGATG GGGGGACGCA 22860 

tAGGATAGGC GAAgcGTGcG ATTGGATTGC ACGTCTAAGC AGTAAGGCTG AGTATTAGGC 22920 

AAATCCGGTA CTCGTTAAGG CTGAGCTGTG ATGGGGAGAA GACATTGTGT CTTCGAGTCG 22980 

TTGATTTCAC ACTGCCGAGA AAAGCCTCTA GATAGAAAAT AGGTGCCCGT ACCXSCAAACC 23040 

GACACAGGTA GTCAAGATGA GAATTCTAAG GTGAGCGAGC GAACTCTCGT TAAGGAACTC 23100 

GGCAAAATGA CCCCGTAACT TCGGGAGAAG GGGTGCTCTT TAGGGTTAAC GCCCAGAAGA 23160 

GCCGCAGTGA ATAGGCCCAA GCGACTGTTT ATCAAAAACA CAGGTCTCTG CTAAACCGTA 23220 

AGGTGATGTA TagGGcTGAC GCCTGCCCGG TGCTGGAAGG TTAAGAGGAG TGGTTAGcTT 23280 

CTGCGAAgCT ACGAATCGAA GCCCCAGTAA ACX3GCGGCCG TAACTATAAC GGTCCTAAGG 23340 
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TGTCTCAACG 
AGGACGGAAA 
TACAGGATAG 
ATACTACCCT 
TCAGGCGGGC 
TTCCCTCAGA 
GAGACCTACA 
AAGGGCCATC 
AGTTCACATC 
AGTCGGTCCC 
CGTCGTGAGA 
CTTAGTACGA 
ATAGCTGGGT 
CCTCAAGATG 
GGTTCGAGGT 
AATCAAAATA 
ATAAATTACA 
AACACA6AAG 
AACGTTGCCA 
TTTATGTCTA 
AAACGTTTGA 
TAAdyGAGAG 
TAGCX3ASGAT 
AATGTACACT 
AAATGATATC 
AATTGAAAAT 
TTATTATACA 
CTTTATAATT 
ACCGCCAAAG 



AGAGACTCGG 
GACCCCGTGG 
GTAGGAGCCT 
AGC7GTGTTG 
AGTTTGACTG 
ATGGTTGGAA 
AGTCGAGCAG 
GCTCAACGGA 
GACGGGGAGG 
AAGGGTTGGg 
CAGTTCGGTC 
GAGGACCGGG 
AGCTATGTGT 
AGATTTCCCA 
GGAAGCATGG 
AATGTTTTGC 
TTCATATGTC 
TTAAGCTCCT 
GGCAAAAAAT 
AAACGTCAAA 
ATCTGACGAA 
CCGAAGrAGA 
GGTAGCCAAC 
TTCGATTGTC 
ATCGAAAACA 
GATCTTACTG 
ATAGACAAGC 
AATGATTTTA 
CCTAATATAA 



TGAAATCATA 
AGCTTTACTG 
TTGAAACGTG 
GCTTTCTAAC 
GGGCGGTCGC 
ATCATTCATA 
GGTCGAAAGA 
TAAAAGCTAC 
TTTGGCACCT 
CTGTTCGCCC 
CCTATCCGTC 
ATGGACATAC 
GGACGGGATA 
ACTTCGGTTA 
TGACATGTGG 
GAAGCAAAAT 
TGGTGACTAT 
TAGCGTCGAT 
GGATGCGAT6 
ATAAAAAGCA 
ACGAGAAAAG 
GGAAAGAAGC 
TTACGTTCCG 
TAAGTATGTA 
AAATATTGTA 
CTCTTTTATA 
TATTGCATAA 
TTAGAGCGTC 
ATTTAGGGGT 



GTACCTCTGA 
TAGCCTGATA 
AGCGCTAGCT 
CCGCACCACT 
CTCCTAAAAG 
GAGTGTAAAG 
CGGACTTAGT 
CCCGGGGATA 
CGATGTCGGC 
ATTAAAGCGG 
GTGGGCGTAG 
CTCTGGTGTA 
AGTGCTGAAA 
TAAGATCCCT 
AGCTGACGAA 
CACTTTTACT 
AGCAAGGAGG 
GGTAGTcGAA 
AGCCGCATTG 
AACACAAAGA 
ArCGCAACGA 
AAGCOATTGT 
CTAGAGTAGA 
CAACTTTAAT 
TAAATAGAGA 
TACTTTATTG 
GTAACACTAA 
TACATGCGGT 
TTTCTTATAG 



AGATGCAGGT 
TTGAAATTCG 
TACGTGGAGG 
TATCGTGGTG 
GTAACGGAGG 
GCATAAGGGA 
GATCCGGTGG 
ACAGGCTTAT 
TCATCX5CATC 
TACGCGAGCT 
GAAATTTGAG 
CCAGTTGTCG 
GCATCTAAGC 
CAAAGATGAT 
TACTAATCGA 
TACTATCTAG 
TCACACCTGT 
CTTACGTTCC 
AGACCGCAAG 
AAAATGGCTT 
GTTTAGTAGA 
CACAAGTCAA 
ACTGGAAATG 
TTTGTGTTTA 
AGAGCAGTAA 
AAATACAAAA 
CTTTTATCAA 
TTTAAAGCAT 
TCTTQATCAT 



TACCCGCGAC 
GCACAGCTTG 
CGCTGGTGGG 
GGAGACAGTG 
CGCTCAAAGG 
GCTTGACTGC 
TTCCGCATGG 
CTCCCCCAAG 
CTGGGGCTGT 
GGGTTCAGAA 
AGGAGCTGTC 
TGCCAACGGC 
ATGAAGCCCC 
GAGGTTAATA 
TCGAAGACTT 
TTTTGAATGT 
TCCCATGCCG 
GCTAGAGTAG 
GTCTCTTTTT 
GGCGAAGTGA 
GCTAAATGAG 
GAAAGGTTCT 
ATAATTTAAT 
TATAAATTTA 
GACGGTATCT 
AGGAAATTAA 
AGAAGTGTTA 
CATCGTCTAT 
CATCAAAATT 



23460 
23520 
23580 
23640 
23700 
23760 
23820 
23880 
23940 
24000 
24060 
24120 
24180 
24240 
24300 
24360 
24420 
24480 
24540 
24600 
24660 
24720 
24780 
24840 
24900 
24960 
25020 
25080 
25140 
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TCCATTTTTT ACTGTAATTG TAAAATGCAT ACCCGTTTCA GCACCTTGAA TATCAAGCTG 25260 

CTCTTTGTAA GGTTTCAATC TTTTTAAAAT ATAGGTTAGT TTTCTAOGAT AAATTCGTCT 25320 

CATTTTATTT AAATGCCTTT CAAAACCACC GGAAGATATA AACGTTGCAA TAAGGTTTTG 25380

CATATGAACA GGTACAGTGT TGCCTTCAAT GTGATTTTGA GAATGATATT TTTTCATTAT 25440 

AGAATAGGGT AACACCATAT ATGCAACTCG ACAGCTAGGA AAAATAGACT TTGAAAATGT 25500

ACTGATATAA ATCACTTTTT CTCCTCTTGA ATATAGACCT TGAATTGCTG GAATGGGTTT 25560 

GCCX5AAATAT CTAAACTCGG AATCATAATC ATCTTCTATA ATAAATCGTT CTTCTTTTTC 25620 

TTGAGCCCAT TGTATTAATT GAGTTCGTTT TTTTAAGTCC ATCACATATC CAGTTGQAAA 25680 

TTGATGGGAA GGCGTTATAT ATACTATATT TTTTTGTGAT TTAATAACTT CATCTACGTT 25740 

TATTCCATTA TCTTCAACTT CAATTTGITC ATATTCAACT TGTTTTTTAT CTAAAATATT 25800 

TTTGATTGGT GGATAACTAG GTTTTTCGAT AATAAATGTT GAAGTATAAA GTAAATCGAC 25860 

TAATTGATTT ACTAATTGTT CGGTAGATGA GCCAATTATA ATTTGATTAG GATCACAAAT 25920 

TACGCCACGA TTAGTAAATA AATAAAATGC CAGTTGAAAC CGCAAATGTA ATTCTCCTTG 25980 

AAAATGTCCT CTACGTAATT GATTTAAATG ATTTGTATCA TAAAGATCTT TGGAATACTT 26040 

TCTGAAAAGT TCTATAGGGA AATGTTTCGT ATCTATTTCA TCCAAATTAA AAGCATAATC 26100 

ATAAGCTTCA TCACTCGCTT TTGGTTTATA TGAATCATCA TCAAAAAGAG AGGGGATAGO 26160 

TTGATTGTTT AAAATTGTTA AAGATTCAAT TTCGGACACA AAATATCCAG AGCGAGGTCT 26220

TGAATAAATG TAACCTTCGT CTAATAGAAG TTGATATGCA TGCTCTACGG TTGTTTGGCT 26280 

AATAGATAAA TGTrrGCTTA ATTGTCTTTT AGAATAAAAT TTATCGCCTT CTTTAAATTG 26340 

ACCTTCAATT ATTTGTTTTT TTAATTTTTC ATAAAGTTGA TGGTATAAAG TGTTTTTCAA 26400

TTTTRTAACT GACCTCCTAA ATTTATCTTA TTTTGTACCT TTTTAAATAT CAGTTTATAC 26460 

ATTACAATGT ATTTAATCAA CTTQAAAAGG GGTTTTATGT ATAATGAGTA AAATTATTGG 26520 

ATCAGACAGA GTCAAAAGAG GTATGGCTGA AATGCAAAAA GGCGGCGTTA TTATGGATGT 26580

CGTTAATGCT GAGCAAGCAA GAATTGCAGA AGAAGCTGGC GCGGTAgCAG TTATGGCATT 26640 

AGAACGAGTA CCTTCTGATA TTAGAGCTGC TGGTGGTGTT GCACGTATGG CAAACCCTAA 26700 

AATTGTAOAA GAAGTAATGA ATGCTOTTTC TATTCCAGTC ATGGCTAAAG CACGTATTGG 26760

TCATATCACT GAAGCAAGAG TATTAGAGGC GATGGGTGTT GACTATATTG ATGAATCAGA 26820 

AGTGTTAACA CCAGCAGATG AGGAATATCA CTTAAGAAAA GATCAATTTA CAGTACCATT 26880 

TGTATGTGGA TGTCGTAATT TAGGTGAAgm TGCGCGTAGA ATTGGTGAAG GTGCTGCTAT 26940 


EP 0 786 519 A2 




ACAAGTTAAT 
TGCGAAAGAT 
ACCGGTAGTT 
GGAATTAGGT 
AAAATTTGCT 
AAGATTAGCA 
AGAAGAACGT 
AGGTGCAGTA 
TAAAAAAGTT 
AACGTTACGT 
ACCTATGTTT 
AGGATACCTT 
CAGCTTTGAA 
AAGAGCCCCA 
GAAAATTGTA 
AGATGACTAT 
TGTATGCTAA 
TCAAACTTAG 
GCTATACATA 
TAATTCGATG 
ATACTAGTGT 
TGTCATTAAG 
ATACTAAGAA 
GTGCTTGCAT 
GAACTGCATC 
TTAATGTTTG 
CAATTCCATT 
CAGCTACTTT 
TTTCTTCAGT 



TCAGAAGTTA 
ATCGGTGCGC 
AACTTTGCAG 
GCTGACGGTG 
AAAGCAATTG 
AGTGAACTTG 
ATGCAAGAGC 
CGTGAACATA 
GAACAATTAG 
CGATTAATGA 
GGTACATGCG 
AACAAGTTGA 
ACA6AATTAG 
CATATTGAAA 
GCTGTTCAGC 
AGAGTAACTG 
ATCAACGAAT 
CTTTGGAGGA 
AGAAAAAAAC 
TTGATGTATT 
tGCACCGAAT 
TGATTTAATC 
TACAGATGTA 
TGCTACAAAT 
TTGCCATGGC 
GAAATCCCAA 
TAATAGAGCG 
AAATCCATCT 
TTCTTCAACT 



GTCGATTGAC 
CTTATGAAAT 
CTGGTGGCGT 
TATTCGTTGG 
TTCAAGCAAC 
GCACTGCTAT 
GTGGTTGGTA 
TTAGACATAT 
AAGAAATCGA 
ATTTATATGG 
CAGGATTAAT 
ATATTACTGT 
ATATTAAAGG 
AAGTAGGTCA 
AAGGTAAATA 
ATTACTTTAT 
TATTGATATT 
GTrATTTTTT 
CCTTCAAAGA 
TGTTAAATAA 
AATAATTTCA 
GCACCTGAAA 
ACACCTTTTG 
TCGTTAGATA 
ACACCGACTA 
GAAATAGCGC 
ATAATGGCAA 
AAAATATATT 
AATAATTTGT 



TGTAATGAAT 
TTTAAAACAA 
TGCGACTCCT 
ATCAGGTATT 
AACACATTAC 
GAAAGGTTTA 
AGATATGAAA 
TGAATTAAGT 
GGGCTTAATA 
ATTTAAAGAG 
AGTTCTAGCG 
ACAACGAAAC 
TATCGCTACA 
AGGCGTAGAT 
TTTAGGCGTA 
TAATCATATT 
TATAGATTTG 
ATGTCAAAAT 
GACTGAGAAT 
AGAATCcAGC 
AACCAAAGCG 
TAATACCGAT 
CGTGTTCAGA 
ATAGTTTTGT 
AGAATGCAAA 
CACCTGAAAC 
TGTATCCGAT 
CTCCTAGCAT 
CATCTTCTTC 



GATGATGAGA 
ATTAAAGACA 
CAAGATGCTG 
TTTAAATCAG 
CAAGACTATG 
GATATCAATC 
ATAGGTGTAT 
GGTCATGAAG 
TTACCTGGTG 
GCTTTACAAA 
CAAGATATAG 
TCATTCX3GTA 
GATATTGAAG 
ATCCTATGTA 
TCATTCCATC 
GTAAAaAAAG 
TTGAGAAGAA 
TAAAAATGAT 
AGTCAAAATT 
GATTGCAGCT 
GGCAACTGTA 
AGAGCTAAAG 
TAAATCACTA 
CGCCATAACT 
TGGTGCAAAG 
TGTACTAAAG 
TAACATTGCG 
TTCGAAGAAT 
ATTAACTTTA 



TTATGACTTT 
ATGGTCGTTT 
CTTTAATGAT 
AAGATCCAGA 
AACTAATTGG 
AATTATCATT 
TAGCATTACA 
GTATTGCAGT 
GCGAGTCTAC 
ATTCAACTTT 
TTGGTGAAGA 
GACAAGTTGA 
GTGTCTTTAT 
AGGTTAATGA 
CTGAATTAAC 
CATAGCTTAA 
AATATCTCCT 
AAAAAATAAA 
TTGAAGGGGT 
GAAATGAAAG 
TCTCCTTTTT 
TTAGCAAATG 
AGTTTACCAA 
GAACCGGCTT 
ACAAAACCAA 
ATATTGCTTA 
CCTACAATGA 
GATTGTTGTC 
TAAGGGTTAA 



27060 
27120 
27180 
27240 
27300 
27360 
27420 
27480 
27540 
27600 
27660 
27720 
27780 
27840 
27900 
27960 
28020 
28080 
28140 
28200 
28260 
28320 
28380 
28440 
28500 
28560 
28620 
28680 
28740
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TAGGTTCAAT TAAGGTAAAG TATGCACCGA TAATTGAAGC AGAAACAGTC GACATTGCTG 28860

AAGCTGTTAA TGTGTATAAA CGTTGCTTAG GTATGTATGG TAATTGTTTT TTAATTGAAA 28920

TAAATACTTC AGATTGTCCC AAAATTGCTG CAGCAACTGC ATTGTATGAT TCTAAACGTC 28980

CCATACCATT AATTTTAGAA ATTAAGAATC CTAAAACATT AATGATTAAA GGTAAAATCT 29040 

TTGTGTATTG AAGGATACCG ATAATCGCTG AAATAAATAC GATAGGTAAT AATACACTGA 29100 

AGAAGAATGG TGGTTGCTTA GGATCGATAT ATTGAATACC ACCGAATACA AAGTTAACAC 29160 

CATCTGCTGC TTTTAATAAT AAGTAGTTAA AACCGTTTGA AATACCACCA ATAACCTTGA 29220 

TTCCCATTGT AGTTTTAAGC AAGATAAATG CAAAGATAAG CTGAATTGCA AGTAAAATTC 29280 

CTACATATTT CCAGCGAATA TTTTTCCTGT CTGAGCTAAA TAGAAACGCA AGTGCTAAAA 29340 

AGAAGATAAT TCCGATAATC CCAATTAGAA TATGCATATA TTTCTCATTC CTTTAGTTTT 29400 

TTCTACaATc TATCATACAA TAAAATGGAA GGGCTAACAT CATAAATTTT TGAAAATATA 29460

AAAACAAATT AATTGAAAAA GGTCAAAATA GGTCATATAA TATAGTCAAA GAAGGTCAAA 29520 

AAGGGGTGAT ATACATGCAC AATATGTCTG ACATCATAGA ACAATAaTCA AACGTTTATT 29580 

TGAAGAGTCX3 AATGAAGATG TCGTTGAAAT TCAGAGAGCG AATATCGCAC AGCGTTTTGA 29640 

TTGCGTACCA TCACAATTAA ATTATGTAAT CAAAACACGA TTCACTAATG AACATGGTTA 29700 

TGAAATCGAA AGTAAACGTG GTGGTGGTGG TTACATCCX3A ATCACTAAAA TTGAAAATAA 29760 

AGATGCAACA GGTTATATTA ATCATTTGCT TCAGCTGATT GGACCTTCTA TTTCTCAACA 29820 

ACAAGCTTAT TATATTATTG ATGGGCTTTT AGATAAAATG TTAATAAATG AACGTGAAQC 29880 

TAAAATGATT CAAGCAGTTA TTGATAGAGA AACGCTATCA ATGGATATGG TTTCTAGAGA 29940 

TATTATTAGA GCAAATATTT TAAAACGTTT GTTACCAGTT ATAAATTATT ACTAAATGAA 30000 

ATGAGGTGTT GAAGTGCTTT GTGAAAATTG TCAACTTAAT GAAGCGGAAT TAAAAGTTAA 30060 

AGTTACAAGT AAAAATAAAA CAGAAGAAAA AATGGTGTGT CAAACTTGTG CT6AGGGGCA 30120 

CCATCCGTGG AATCAAGCTA ATGAACAACC TGAaTATCAA GAACATCAAG ATAATTTCGA 30180 

AGAAGCATTT GTTGTTAAGC AAATTTTACA ACATTTAGCT ACXjAAACATG GAATTAATTT 30240 

TCAAGA 30246

(2) INFORMATION FOR SEQ ID NO: 57:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14333 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57:

TATTCCCCCA TCGGTTTATT AAATCGTCCA TTTCAATACT GTTTTTCCCC AAGATGTCGA 60 

TAAATCCATT TCAAACGCTT GGACGATATC TTGCATCGTA CATACATTAA TTTCATGTCC 120 

TTTTAATAAT GCTAACTTTT CAACTATGTC TGGGTACTTA CGATATAAAT CAACAACTTG 180 

CTCAAAATCT TTAGAGCCGC TTCGACTACT ACCAATCAAC GTTAATCCTT TTTCAAGTAC 240 

TAATCGTGTA TTCACTTCCA CGGGTAATTC ACTTACGCCT AACAAAGCAA TACTGCCTTC 300 

TGGTGAAATA TGTTCAACTA TTTGTTGAAG TGCAACTTGA CTTCCTTTAC CTCCAACACA 360 

TTCAAATGCA TGATCAATTT TAAGATCATC TGGTATTTGA TTTACTGTAA AGATGTCATC 420 

TACAAATGAA AAATGACTTA ATTTATAGTC TGTCTTACCA AATACATAAG TTTTAGCTTC 480

TGGGTACAAC TTACGTAGCA AAATAGCAGT AATATAACCT AAGTTACCAT CACCCCAAAT 540 

ACCAAAGCTG GTTTTCAAAG GTATAGATTT ACGTTCAAAT CGITGTATAG CATGATAACT 600

TACTGACACT AACTCTGTGT ATGAAATCGT ACTCAAATCA ATGTCATTAG GCAGCGGAAC 660 

GATACGATCA TGTGCCATCA CAACGTAGTC TTGCATAAAA CCATCATAAC CACTAGATCT 720 

AAAATAACTA GAGGCTAAGT AATTCTCCGC AATAATATGA TGTTGCTCTG TAGGTGTATT 780

CGGTACCATT ACTACTTTCG TACCTTTTTC AAATACCCCT TTACTATCAA ATACAACTTC 840 

ACCAACAGCT TCATGAACTA ATGACATTGG TAATTTTTTG CGTAGTACAT TTTCATCTCT 900 

TCGACCTGTG TAATACCTTT GATCAGCTGC ACAAATAGAC AAGTATAAAG GTCTTACGAT 960 

GACATGATTA CCATAAATAT CAACATTATT ATATGTGACG TCGAACTGTC TCGGTGCAAC 1020 

GAGTTGATAT ACTTGATTAA TCATCGGCAA TATCACCTTG AATAATGGCA TTTGCTACTT 1080 

TTAAATCATA CGGTGTTGTC ACTTTAATGT TGTATAGTTC TCCaCGTACC AATTTAACTG 1140

CATgrCCAGA TTCGACAATG ATTTTACATG CATCTGATAA GATTTCTTTT TGTTCACTAC 1200 

TTAAGGCGCG ATAACTATCT TGTAATAATT TAATATTAAA TGATTGTGGT GTTTGGCCTT 1260

GATACATTTC ATTCCTTACA GGGATACTGT GTATGTTCTG TTTATCTTTA GACATTACAA 1320 

TCGTATCAAT TGCTTCAATG ACTGTATCTA CTGCACCATA TTTTGCTGCT ACTTCAATGT 1380 

TCTCTTTAAT AATACGTTGA GTTAAAAATG GTCTTACGGC ATCATGAGTT ACAATCACAT 1440

CATCATTATT AATTCCATTT ACATTGCGAA TATGGTCGAT AATGTTCATA ATTGTTTCAT 1500 

TTCGATCCGT ACCACCTGCA ACTACTTTGA CACGTTGATC TGTAATGTTA TATTTTTTTA 1560 

AAATATCCTG TGTATGGGAA ATCCACTGTG CTGGCGTTGC GATAATAATC TCATTAAATT 1620 

CACTCACTAA AATGAACTTC TCAATTGTAT GGATTAAAAT CGGTTTATTA TCAATATCTA 1680 
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CTGCATAAAT CATGTTGTCC TCCATTCTGT CATTACATCA TTTCCATTTA TACATTACTG 1800 

ACCTATGCCC GCACATAAGC CTAACCTATT GCTCACTTGC CTCTTTTATT AATCCAAAGA 1860 

TAGTTGTCAC AATAGTGTGA TAATTTTTTA TAAAAATGTA TTTTTGTAAC TGACCATTCT 1920 

AAGTTGTTTT GCCATGCAGT TAATCATTAA CTCTGACGAT ATTAAATTGT TAAAGGTATT 1980 

AATGTTTACT CTTTTTCAAA TTCATTATTA CTGCCATCAT TTTACCATAT ATTATAATAA 2040 

ATTTATCTTA TTAAGTGGCT GTACTTGATT TTCACTTTAA AAATTATCAA ATATTGCCAT 2100 

CTCATTTTAA GTATACAAAA TGCAAAACAA CCGATTCACA AGCATATTTC ACACAAGTAA 2160 

ACCGGCTATT TATCAACGTA TATTCGAAGA TGAATTATTT CXaVTAGTATC TATAGACCAG 2220 

ACXXSCATTCG CACTTTCATA GCTATAACTA TACCAGCGTT TTCGTCCTCA AAGGTGCATA 2280 

CTAATAAATC GTAAACATGA CTTTATCAAA TCGTTCTTTC TTGTTAACTA ATTTATCAAA 2340 

TGTCTCCGGG CCTTTTTCTA ACGGTAAAAA ATGAGAAATA ATAGGCTTTA CATTAATATC 2400

TTTCGTCTTC ATATAATGTA AGGTTGCCGT CCACTCTTTO CCCGGAAAAT TACTGGACAA 2460 

ACAGTTCCAA GAGCCACATA CTGTCAACTC GTTACGCAGA ATTTTTTCAA AATGAACGCG 2520 

ATCAATCTCA ATATCATCAT ATGGTATTCC GAGTAATACC ACCTCGCCAC CTTTTTTAGG 2580 

TAGCGTCAAT ATTTGACCAA TCGTAACTTT AGCACCTGAT GATTCTATAG CTAAATCGAT 2640 

TTGATTGGCG TAATGATTTT C6ATGAATTT CTCAAGATTT TCTTCTTTTG AATTGATTGT 2700 

TTGATGTGCG CCCAATGATG TTGCAATATC TAGTTTATGC GCATCTATAT CTATAGCGAT 2760 

GATATGTGCA GCACCAAATA TTCGTGCCCA TTGAATAGCT AACAAACCTA TACTGCCACA 2820 

CCCCATTACT GCAACAGTCA TACCAGGTTG TATATTCGAT TTATAAAACC CATGCGCAAC 2880 

AACGGCTGAT GGCTCAACCA TTGCTGCTTC AATGTAATCA ACATTGTCTG GAACCTTTAA 2940 

AACATTTTGC GCTGGCAATT TGACATATTC CGCGAACGAT CCAGGTTCAT ATGAGCCAAT 3000 

GACGAATAAC TTTTCAGATC GTGCATATTC ACCTTTTAAA CAATACTCGC ATTGATAACA 3060

AGGTATTGCT GGGCAACCTG TCACrTTGTC GCCCACATTA ACATGCX5TAA CATCACTTCC 3120 

AATGGCATCT ACTACACCTG AAAATTCATO ACCAAATGGC ATACCTTTAA TGTATGGCCC 3180 

CATTTTTTTG TATCGTGACG TGTCTGAACC ACATATGCCA GTCGCTCGTA CTTTAATAAT 3240

AACGTCATTC GCACTTTCAA TGACTGGCTT TTCATTATCC TCATACCGTA AATCTTCCAC 3300 

GCCATATAAT TTCAATGCTT TCACTTGTAA ATCACCTCAA ATTTGATTTA ATTCACAACT 3360 

TTTTTCTTTT TAAAAATACC TGTCGCAAAA TAACCTGCAA TGACAATGGA ATTACTTACG 3420 

AGTAAATGTT CCATATAAAA ATCAGTGATT TGTCTTAATG GCCCAAGCAT AAAAGTTAGC 3480 
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TGCTTTAATA CCTTCGCCGG ATTTTAAATG TTGATACGCC TCGTCCCATT TCGAAATATC 3600 

ATATATTTTT GTCACCAAAG CTTCAGCATT TACTAAACCA TCCGCCATAA GTTGCAATGA 3660 

AGGTTCCCAA TCTGCTGGCT TTTGACTTCT ACTACCAACA ACTGTTATTT CTTTTTGAAT 3720 

CACTTTTTCC ATATCAAATG GAATTTCAGC ATCCTTAAAA ATACCTATTT GACTGTAGAA 3780

ACCTTTTTTG CGTAAAATAT CCAAACCTTG TCGTGCTGCT GGAACTGCAC CTGAACATTC 3840 

AACAACAACA TCTGCACCGT AACCGTCTGT AATTCCATTG ATATACGTTT TTAAGTCTGT 3900

TTGTTGTAAA TTGACTACAT AATCCATGTG CAATGCTTCT GCTTTATCTA ATCTGACTTT 3960

GTCATTGTCC AATCCAGTTA CCACAACAGT TGCGCCTTTA CTTTTTAACA CTTGTGCTAC 4020 

AAGTAATCCG ATTGGCCCAG GTCCCATTAC AACTGCTACA TCGCCTGAAT TGACTTGAAT 4080 

CTTAOAAACG CCATGATGTO CACATGCTAA TGGTTCTGTC ATAGCTGCAG ACTGATACGA 4140 

TAtTOGTCTG GAATATGATG CAAACTTTCT TCACGTGCAA TGACATAATT AGTAAATGCG 4200

CCATCAACTT GTGTTCCAAT ACCTTTTCGA TGGTTGCATA AATTATAGTC TTTTGATTTA 4260 

CAGTATTCAC ACTCATTACA AACATAGAAT GTCGTTTCAO aTGtGACACG GTCACCAACT 4320 

TTAAAATCTT TAACGTCTGC TCCAACTTCA ACGATTTCAC CAGAAAATTC ATGACCTAAT 4380

GTCACTGGAA AATTAACTTT ATAATGACCT TCATAAGTAT GAATATCTGT GCCACAAATT 4440 

CCTGCATAAT GTACTTTAAT CTTTACTTTA TCATCTAGCQ GTQTTGCAAC TTCTTTATCA 4500 

AGAAGTTCTA AGTTGCCATG TCCTTCTCTT GTTTTTACTA AAGCTTTCAC CACAAACACC 4560 

TCGATTTTTA ATTGAATAGA CTAAATAGTT TAAAGATAAG ATAGTTAACG ATATTACCAC 4620 

CTTGATCAAT ACTTGAAATT TCAGATGAAC CTTTTGGCAT TTGTACATTC GTACCTTTCG 4680 

CCATATCTGT GAAAATGGGT GCTACGTCTG TTGCAATATA TAGTGAAATT GCAATCATAA 4740 

TCGTACCCAC AATGACAGAA TGAATAATGT TTCCTCTTGC TGCACCAACA ATAAACGCGA 4800 

CAACAAATGG TATCGTTGCT AAGTCACCAA AAGGTAGTAC TTGGTTTCCT GGTAAAATAA 4860 

CGGCTAATAA AACAGTGATA GGTACTAAAA TTAATGCTGT CGAAATAACT GCTGGATGAC 4920

CTAATGCTAC AGCCGCATCC AATCCAATAT AAATTTCACG TTCGCCAAAA CGTTTATTTA 4980

GCCATGTTCT TGCAGACTCT GAAACTGGCA TTAAACCTTC CATTAAGATT TTTACCATTC 5040

TAGGCATTAA TACCATTACT GCAGCCATTG ACATTCCTAA ATTAATGATG TCTCCAGGTT 5100 

TGTAACCTGC TAACACACCA ATACCTAAAC CTAAAATTAA GCCGACAAAT ATAGACTCTC 5160 

CAAATGCGCC AAAACGTTTT TGAATTGTTT CAGGATCAGC ATCTAACTTA TTCAGACCGG 5220 

GTACTTTTTG TAACAATTTA ACTAAGTAAA TACCTGGTGC ATAAGAAATT GTACTTCCTG 5280 
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CTACTTTCAA ACAGATAATT TGGAAAATAA CTGCTGCTAA TAACGCTTGC CAAATACTGC 5400

CTGATACGGC ATAAACCATT GCTGCTGTAA ACGTATAATG CCAAAAATTC CAAATATCTA 5460 

CATTCATCGT CTTTGTCACT TTAGTTACTA GCAATACAAC GTTAACTATG ATTCCGAGTG 5520 

GAATAATAAA TGCTGCGACA GATGATGCCC AAGCGATAGA TGATGTTGCT GGCCAACCTA 5580 

CATCAATCAC ATTCAGACTG ACGCCTAAAT TTTTAACCAT CX3CTTGTGCT GCTGGCCCTA 5640 

AATTTTTAAC TAATAAATCG ATGACTAAGA AAATCCCTAC AAAAGCCACA CCTATTGTTA 5700 

AACCAGACCT AAATGCCGCT CCAATTTTCT GCCTAAAGAA TAGGCCAAGC AAGAATATGA 5760 

CAACCGGTAA AATAACAGTt GCACCTAAAT CTAAAAATCC CCTTACAAAA TCAGTGAAGT 5820

AACTCATATT TAAACCCTCC CTGTTATATA TCCATTGTCA CGATACTTTC CGATTGTGAT 5880 

TACATTTOAC GTTACAGTCA TTTCAACGAC AACCCTTGCT AAATTCGACT GCAGTCCTTT 5940 

TGAATTACAG tCACTGCGTT TCTATGTCAT CAACAATCAT TTGTCGTGAT AGTCATTTAT 6000

ATGCAATTTG CATATATTAA TATGTTATCG ACCCACGTTA CATATCAATT CCGTTATTTT 6060 

TGTAACTCTG TTAAGATTTG TTGTTTTGTT TCTTCAATAC CAATACCAGT TAAGAAATTA 6120

CGTGCGTTGA TAACTGGGAA TTTATATTCT TTTTTTGtCA TTGCAGTTGT AACTAATAAA 6180 

TCTGCAGTGT CTTCATAAGG TCCAACTTCT GTAATTTTGA TTTGTTTT^T ATCTACTTTA 6240 

ATATTGTGTT CCTTTGCCAT TTCTTCAATT GCATTATTTA CTACTGTTGA CGTTGCAATA 6300 

CCTGCACCAC ACGCTACTAA TACTTGTTTC ATTTTCAATT CCTCCAATTA ATTTTTAGTT 6360 

ATATTCCAAA TAATCATTGA TTAGTGTTGC TAAAATTGTT TCATCTTTCG TTCGTAGAAT 6420 

CTGCTCCAAT TTTTCTTCAC TTTGAAAAAT TTGCATCAAC TGTTGTAACA GCTTAAGTTG 6480 

ATCATCTACT TTATCCATTG CTAACATAAA AACGATTTTC ACTTCTGTCT GTTGATCAAG 6540 

TGTTCCCATT TCAATAAACG GCACTTCTTT TTCTAGAACA GCCACACCTA TCGTTCTATG 6600 

GTTATATGT TCGACATCTG TATGCGGTAT AGCGACCGAA CATAGATGCG TTGGTAAACC 6660

AGTAGCAAAT TCTTTTTCTC TGTCGATGAC TGCATCTTTA AACGTTGACT TCACGAACCC 6720 

ATTTTGAAAT AACACATCTG ACATTTGTGA CAATACGGAT TCTTTATCAG TTGCCGACAA 6780 

ATTGAGCATT ATATTTTCTT TATGCACTAA TTGCTGTCCC ATCCATTTTC CCTCGCTTCT 6840

TTATTTGAAT AATTTTTTAA AATCTCATTT ACATCAGAAT TTTTGCGACT TTGTATGATG 6900 

CGCTTAATTG CGTCATTGTC TTGCGCCACA TCTCTCAATT GTAGTAACGC TCTTAAGTGT 6960 

GTCACTTTAT CAACAGCAGC AATAGGTACA ATAATATGGA TTGCTGTGCC ATCTGACATG 7020 

TATATTGGTT CTTGTAATAT CAACATACTC ATCGCTGTTT TATGTACATG CTTTTCAGAG 7080 
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TGCATCTCAT GAATATATTT AATATCAATA AAATGATTAG CAACTAACAC ATCACTTGCT 7200 

TTAGCAATAG CTTCATCAAT ATTTTCAACA TGATGCATTC TTTTCACGTG CCTTGCCGGT 7260 

ATCAAGTCAG CTAAATCTAA TGyCTwATTT tGTGtGACaA TCGATCCATT AATGGTTGAA 7320 

ATTGAATTAT AATTOGCAAT AAAATCTTCT AAACCATCAC GTAGTcTGTA ATGTCATTAA 7380 

CTGTCGTTGT GCGTTCAATT AATGCCATTA ACTTGTTTAT TTCCTTATCA ATGTCAGCCG 7440 

ATTCCTTATT AATGTACTTC ATCACTTCTT TACGTAACTT TCGTTGCTCA TTTTCAGATA 7500 

AAGCTACTTT TGTGATAAAT AATTTTTTAT GTGTTAGGAC AAACATTGGT GAAAAGACGA 7560 

TGTCATAATC TAATGTGTAA TTTTCAAATG TTCTAAGTGA AATCGCATCT AAGAAAATAA 7620

TTTCTGGAAA TAAGTTTCGC AACTCGTATA ACATCATTTG TGATACTGAC GTGCCTTGTG 7680 

TACACACGAT AATAGCTTTT ATCTTGCCAT CGAAGTTTTC ATCTTGACGT CTCAAACTAC 7740 

CTCCGAACAA CATGGTTAAA TATGCTATTT CATTATCAGG CAACGATTTT CCGAAATATT 7800

CAGTTAACGA TTQACATGAT TGTTTCACCA TATGAAATAA GGATTGATAA TTTCCTTGTA 7860 

AAGGATTTAT TAATTCATCA CGATCCGTTA AGTTATATTT AATCCTATAA AAAGCAGGCG 7920 

TTAAATGTAA CAAGAGTTGC TGTGATAATT TCTCCTTATC TTCAATGTTA ATAAAAGTGA 7980 

TTTGTTCAAA ATGGTGAATC ATTTGAGCGA TGGCCATCGT TAAATTCGAT ATGCTATCTG 8040 

ATTCTTGCAA ATCAGTCCAT TGCACACTTG TTGAAAGTAA GTGTAATGTC AAATATAACT 8100 

TTTCCGCTTC TGGCAAATCC GGCTCATGTT GCGTCATAAT CTCCGTTGCT TGATATTCTT 8160 

TCGTATCCCT CAAATACTGA TAATTAATAT TTAATGGATT CATCACATGA CCACTTTGAA 8220 

TTCGTCTACG AATCACACAA AGGACATAAG GCAATGAACT AAGTGATTTG TCTATAAAGC 8280 

GACTCTTCAA AAATTGTTCT ACCTGTTTGA TCTTGTCTTT TTGATATGCG ATATCTTCGA 8340 

ATG'&AAGTT GAGCGCCTTT AAAACTTCAC TTTTAGTAAT ATCATGATTC AACCTTTGAT 8400 

CAAtCAACTT AATGAAGAAA CGGCGAACTT CAAATTCATC ACCAACAATT TCATAACCAT 8460

GTTTTCGAGA ATACTTAAGT GACAAACCAT GATTTTCCAA TTGCTCTTTC ACATGATTTA 8520 

TATCGTGAAT GACAGTATTT TTACTGACTT GTAAATCAAT TGAAAAATGG TTTAGAGACA 8580 

TTGCGTTTTC CTTACTAAAA AGCATGAGCA TTAAATAATA ACGACGTGTT TCTATGCTAA 8640

AAATGACATT GTTGCCGTTT AACATTTGCT GCTCCGATAC ATCTCGCTTG AATAACGTCA 8700 

TGATTTCAGA ACTTACAATA AAATTTCCTT GGCTTGTTCT TTCAAGTTTT GGATAACCCT 8760 

CTTGTTCAAG CCACAAATTG ATTTTTTGAA TGCGATATCC TAGTTGTCTA CGAGACAAAC 8820 

CAAATATCGA TTCAAGTTCT TTACCATGAA TAGTAGGATT CAATACAATT TCTCTGAGTA 8880 
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TCAATCGTCA CACCGATGTA CACACTTTGA ACACATATTT TCAAAATGAG CATGTACATC 9000 

ATTGTGATGT TTTAACAACA TTTCAATTAT ATCTATATTT TTTGTGATTT TAATCTTTTA 9060 

AAATAAAGCA ATTGAAATTT TTGCATATAT TTTTGTGTTT TGTGTTTTTT TGAAGCATTT 9120 

TTAACATACA TATCTCAATC ATTATCAAAT TGTCATGACC ATTGTAACCC AATACAAAAA 9180 

CCCTAAGGAC GCTTATATCA GGCGCCTTAG GGTTAACTGT ATCTATTTAA TTAAGTATTA 9240 

TTATTCGTAT GTACGTAACT TATGGTCTAT CAAGTTCCAC ACTTCTTCAA CATCAACTGC 9300 

TGTAGCAAAA TAAGCATTGG CAGGCTTACC TGTAACATGA TTTAAATCGA CAGCCATAGT 9360 

GCCATAAGTT AGTGGACTTT GATGTTCAAT GTCGATATTA ACGGGTACCA TTGTAAACAA 9420 

TTCTGGTTGT AACAAATACA AAATTGTACA AGCATCATGT ATTGGACCAC CATCCATATT 9480 

AAAGTGAGTC TT6TATGTCT TCTTAAAGAA TTGCAATAAT TCTACGACXSA ACTGTGCAAC 9540 

AGGATTATTG ATACTTTCAA AGCGTTCAAT CACGTGATCG TCGGCTAAAA CTTGATGTGT 9600

TACATCTAAA CCAAACACAT TTATAGTAAT CCCACTTTCA AAAACACGCT TCGCTGCTTC 9660 

AGCATCTACC CAAATATTGA ATTCTGCTGT AGGCGTCCAA TTTCCAAATG TACCACCACC 9720 

CATCAAAGTA ATAGATTCAA TATGCTCAGC GATTCTTGGC TCACGAATCA ATGCCGTTGC 9780 

TACATTCGTA AGAGGACCTG TCGCTACAAT TGTTACAGGT GTATCACTCG TCATCACTTT 9840 

GTTTATAATC ACATCTGATG CTGGCATTGC AACTGCTTGA CX3TGATGGTG TCGACGGTAG 9900 

TTTCGGACCA TCTAATCCAG ATTCCCCATG TATTTCAGAA GCAAAGGCAG CTGGTTTAAT 9960 

TAACGGCCTA TCCGCACCTT TCGCTACTGC TATATCTTGG CGTCCCATAA TATCCAATAC 10020 

GTTCAAGGCG TTTGTCGTAT TCTTGTCAAC TGATTGATTA CCTGCGACTG TTGTTACAGC 10080 

TAATATCTCT AGTGGACTGT CAATTGCCCC CGCTAAAATT AATGCTATTG CATCATCGTG 10140 

TCC1BGATCA CAATCCATAA TAATCTTTCT TTTCATTTAT ATATCCACCT TTCTTAAGTT 10200 

GTTATCGATA GCTTATGTAT ATTTATTTAT GTGGTGAATC ATGTTTATTT TGAAAAATAG 10260 

TTTTAACTTT CTCATATTTT TGGATACAAA CACTATTTAT CTATTTTATG GCTTATAAAT 10320 

TTATCCGATA TGCCTTATCA ACCTACCTCG CTAAAAATAG GATGTCTACA TATCTATACC 10380 

GACTTTTGTC AACTCATTTT CACAACAATA TAAACAGCAA TTTATATGAT TGTTACATGA 10440

TTCAAACAAT TTTTATGAAA AATATTTTCA TACACAGAAT ATATATTGAT ATTAAATTTC 10500 

TCAAAAGCTA TATTGAGAAT AATTAGGAGG GATGTTGATG AAATCTTTAT TTGAAAAAGC 10560 

ACAGCAGTTC GGCAAGTCCT TTATGTTACC TATCGCAATC TTACCAGCTG CAGGTCTATT 10620 

GTTGGGTATC GGTGGTGCAT TAAGTAATCC AAACACCGTT AAAGCATACC CTATTTTAGA 10680 
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AAATTTACCG GTCATCTTTG 
TACTGCAGGT tTAGctGCGC 
ATTAACTATC ACGGGCACAT 
GCTCGGTATA CAAACGGTTG 
CGCAATACTT CACAACAAAT 
TGGCTCTAGA TTTGTCCCTA 
GTTTTTCATT TGGCCAAGCA 
AACAGGTGCC ATCGGTACTT 
TTTACACCAT ATTTTTTACT 
CAAAGGGCAC TTAGTTCAAG 
TGTGACGAAG TATTATTCAG 
GTTCX3GCTTA TGTGGTGCCG 
AGTTGTCGQC GGTTTAATGT 
ACCTTTAGAG TTTAGTTTCT 
TGATGGATTA GCATTTATGA 
TGGAGGCTTT ATCGATTTCT 
CCTATACGTC ATACCTATTG 
CTTAATTACG AAATTTAATT 
AGTTGAGOCT ACTGAAAGAG 
TGAAATCGTT GACTGTTGTG 
CGATAAAGTA TTACTCGAAA 
GCAAGTAATT TATGGGCCTC 
GGATTAAGAC TAACCGAAAT 
GACATCGTTG CTTTTATT7T 
GTGAGCCCGC AAATCGCCTC 
GCCAGATTTG AATTTAATTT 
CACATATAGT TGTATCAAAT 
TGGGATGGTA TAGATTTTAT 
TTTAATX5TTT TAGCTGAATO 



CAATTGGTGT CGCAATCGGA TTATCTAGAA GCGATAAAGG 10800 

TGCTCGGTTT CTTAATTATG AACGCAACTA TGAATGGCTT 10860 

TGGCAAAAGA TCAGCTTGCA CAAAATGGAC AAGGCATGGT 10920

AAACCGGTGT TTTTGGCGGG ATTATCACAG GTATTATGAC 10980 

ATCACAAAGT GGTATTACCA CCGTATTTAG GTTTCTTTGG 11040 

TTGTCACAGC ATTTGCCGCA ATCTTTTTAG GTGTATTGAT 11100 

TACAAGCCGG CATTTATCAT GTTGGTGGAT TTGTAACGAA 11160 

TTGTTTATGG CTTCATCTTA AGATTGTTAG GTCCACTCGG 11220 

TACOGTTTTG GCAGACGGCA CTTGGTGGTA CTTTAGAAGT 11280 

GTACGCAGAA CATCTTCTTT GCTCAACTTG GTGATCCAGA 11340 

GTGTGTCACG CTTTATGTCA GGCCGTTTTA TTACGATGAT 11400 

CACTTGCAAT TTATCACACA GCTAAACCTG AACATAAAAA 11460 

TATCCGCTGC ACTCACTTCA TTTTTAACAG GTATTACCGA 11520 

TGTTTGTCGC ACCTATTCTT TATGTAATCC ATGCCTTCTT 11580 

TGGCAGACAT TTTCAACATT ACAATTGGTC AAACCTTCAG 11640 

TACTCTTTGG TGTGCTACAA GGTAATAGTA AAACAAACTA 11700 

GAATTGTGTG GTTCTGTTTG TATTACATCG TTTTCAGATT 11760 

TCAAAACACC TGGTCGAGAA GATAAAGCTG CAGCACAACA 11820 

CACAAACTAT TGT7GCTGGT TTGGGAGGCA AAGATAACAT 11880 

CAACGAGACT ACGCGTCACA CTTCATCAAA ATGACAAAGT 11940 

GTACTGGTGC CAAAGGTGTA ATCCAGCAAG GCACTGGTGT 12000 

ACGTTACAGT TATCAAAAAT GAAATTGAAG AATTGCTCGG 12060 

ATCAACAGAA CTAATGGCAA CGATGTACGA AGTAAGAAGT 12120 

TAATGTTACA TTTGAAGCAT TAAGTTCATC ATGCACTGTA 12180 

TGCTAGACAA TCATCTTAAT GCTATGATTA AAGCTTAAGT 12240 

CAACAACGAC TTTCACTACA TTAAAAATAG GGCCACTCGA 12300 

AGCCCTTTAT ACAATTTTTT GGGTAAGGTT TTACAATTTT 12360 

AAAAAGTTAT TTAAGTTCTT CTGCTTCAGC CATAATATCT 12420 

TOCGAACTTG CTTTGTTCTT CGTCGTTTAA TGGGATTTCT 12480 
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TCCTCATATT CGCCTTCTAA TAATGCTGAT ACAGTCAATA CGGCATCTTC ATTTCTGAAA 12600 

ATCGCTTCAG TAATTCTAGC TAATCCCATT GCAACACCAT AATAAGTGGC ACCTTTAGCT 12660 

TGAATAATGT CATATGCTGC ATCACGTGTT TGAACAAAAA TTTGTTCAAT TTGCGCTTTG 12720 

CCCTCAGGAC GTTGTTCAAG TAATGTCTTC AAAGGTTGAC CCGCAATATT AGCGTGTGAC 12780 

CATACTGGTA ATTCAGTGTC ACCATGTTCA CCAATAATTT GAGCATCGAC 6CTACGTGGC 12840 

GCAACATCGn AcgyTcGCTT AACAATAATC TAAAGCGTGC AGAGTCTAAA ATTGTACCAG 12900 

AACCTATAAC ACGTTCTTTA GGTAAACCAG AGAATTTCCA TGTTGCATAC GCTAAAATAT 12960 

CAACAGGATT TGTAGCTACC AAGAAAATAC CATCAAATTT TGATGCCATT ACTTCACCAA 13020 

CAATTGATTT GAATATTTTC AAGTTTTTAG ATACTAAATC TAAACGTGTT TCTCCAGGTT 13080 

TTTGTGCAGC ACCAGCACAG ATGACAACTA GATCCGCATC ATGACAATCA CTGTATTCGC 13140 

CAGCTTTCAC ACGAACTGTT GTTGGMAAT ATGGTGTGGC ATGTTTTAAA TCCATAACAT 13200 

CTCCTCGAAC TTTTTCAGTG TCTAAATCAA TGATGACTAA TTCATCAACA ATGCTTrGGT 13260 

TCACTAATGA AAATGCGTAG CTTGAACCTA CTGCACCATT ACCTATTAAT ACAACTTTGT 13320 

TCCCTTTAAA TTTGTTCATT ACAAAAACTC CCTTATGATT AATTCACTAA CATACATGTA 13380 

GCTTCAAATA TGTTAGTTTA ATGCTGCTTA TTGAOGATAC AAAAGCAAAT AAACATCTCT 13440 

TTTATTTTCA ACGCATAACT TAAAAGGTCA TGTGTCATCC GCTTTTAAGT TTGTGATTTA 13500 

TTTCACATAT AAAATGTAAC ATGCATTAAG TACTGGGTCA ATATTAAATT GTGATTTATT 13560 

TCACATTTTA TTTTAATTTT TACACCTTTT TAATTTGTAT tnCGATTACAT CTTAGATGTC 13620 

TTTAGTCTTC GTACTTCGCC AGTGATTATT TACACTTTCA CATTTTTATT ATCATGTTTA 13680 

CTTTTTTCTA GGAAAACAAC AATGTTTTTT GAATTAGTCA AATAAATGCG CTCAATCGTC 13740 

GGTGTGCAAA CAGACAATTG TACACAATGC TTATTGATAA GTATTTAAAA AATTAAAAAT 13800 

GTCATACAAT TATCAAATTT GCCATTTTAT TTATATTTTC TCAAACCAAT TAATTGAATA 13860 

TCGAAATTTT TAGTAGAATA ATCAAAATAT ACAGATTAAA GGAG6AGTAT CATGCTTACA 13920 

GAACAAGAGA AAGACATTAT CAAACAAACG GTGCCTTTAC TTAAAGAGAA AGGGACAGAA 13980 

ATTACGTCAA TCTTTTATCC AAAAATGTTT AAAGCGCATC CTGAACTTTT AAACATGTTT 14040 

AATCAAACGA ACCAAAAACG AGGCATGCAA TCTTCAGCAT TAGCACAAGC TGTAATGGCC 14100 

GCAGCGGTTA ATATCGATAA CTTAAGTGTT ATTAAACCAG TCATTATGCC AGTCGCATAT 14160 

AAACACTGCG CACTACAAGT TTATGCTGAA CATTATCCAA TTGTGGGGAA AAATTTATTA 14220 

AAAGCCATTC AAGACX5TGAC AGGATTAGAA GAAAATGACC CTGTCATTCA AGCTTGGGCA 14280 
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(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 8779 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
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GGTATmTnG 


GAnGGGTACC 


TAAAGC^AATT 




n*Pn & A'Pr'f* Aft 


PTAPPPHAUT 
V31 1 AC k.ksnnjr\ X 




GGACTTCCCG 


TTATCfSATAA 






A6TAGATTTT 


A iAjKjAuAi 1 i 


12U 


AGGCTTACTT 


TTACTTGTfSA 


TAATTV3CATG 




ft/^THTTTJl ^ 
(AJxAi 1 lACV* 


(.axcaAxAwi 




TATCATTCAA 






i X V> i 1 1 X X 1 V. 




L.GCjTGaCGTT 


240 




ACACTCGGT6 


XVJX XAwX XA\7 


Aft AT AC A C A A 


Ai\>l lAAxAj^ 






AAGAATATTA 


TTTTAfTTTT 

X A A * ^ A A 


v>nv»v«jmx x i x 


SJ 1 Vj\3v. Xn^ 




V7 1 A X AVyTvsCj 


"3 £ A 


TTTAATTCAT 


GAAATGATGA 


AATATAATPC 


X X Xnw XXX 




V-A i AUVA9 JTv?^ 




AGCAATTTTA 


TATCACGAAT 


GGTA'IITCAT 


ftftATPATTVSG 
uwnxwnx XwVi> 


AAATT21ATY1T 
MnnX XAAXuX 


1 AX 1 X 


4 tSU 


CGGTATTGTT 


GCCATTTTCT 


TTGCAATTGQ 


'rofYi'T*A<'*l*rA 


PAP ATVIA A AT 


ATZkftllftlkTPa 




ATTTXjCAGAC 


TTCTTGTAAT 


ATATTTATAT 


GACGAAACCC 


CGrTAAPPAT 

wVJVv X /V\w X 


TAATAAATftft 

X inA X AAA X VJ^J 




AAGTGGGGTT 


CATTTTTGTT 


TATAATTTAA 


GTAAATAACA 


TATTAAGTTG 
xnx x^wwjx xur 


uXValAX lAXVj 


o o u 


aacgttttaa 


TAAAGAAATT 


TTATCATTTG 


GTAGTTCGAA 


TACTTTCTAA 


AATGATTAPG 

AA X vj A X X AVv\J 


720 


CCTCAAGTGA TTGATAAACC GCATATCGTA TTTATGATGA CTTTTCCAGA AGATATTAAG 780

cctatcatca AAGCATTAAA TAATTCGTCG TATCAGAAAA CTGTTTTAAC AACACCAAAA 840

caagCgcctt ATTTATCTGA ACTTAGCGAC GATGTTGATG TGATAGAAAT GACTAATCGA 900

acattggtaa AACAAATTAA GGCTTTGAAA AGCGCGCAGA TGATTATTAT CGATAATTAT 960

tacctattgc TAGGTQOATA TAATAAGACT TCTAATCAAC ACATTGTTCA AACGTGGCAT 1020

gcaagtggtg CATTAAAAAA CTTTGGCTTA ACAGATCATC AAGTCGATGT GTCTGACAAG 1080

gcaatggttc AGCAGTACCG TAAAGTTTAT CAAGCGACGG ATTTTTACTT AGTGGGTTGT 1140

gaacaaatgt CACAATGTTT TAAACAGTCT TTAGGTGCAA CAGAAGAGCA AATGCTGTAT 1200

tttgggcttc CGAGAATTAA TAAATATTAC ACAGCTGATA GAGCAACG6T TAAGGCAGAG 1260

ttaaaggata AATATGGAAT TACAAATAAG TTGGTATTAT ATGTACCAAC ATATAGAGAA 1320

gataaagcag ATAATAGGGC TATTGATAAA GCTTATTTTG AAAAAT6TTT ACCAGGATAT 1380
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ATCGACACGT CTACATTAAT GCTAATGTCA GATATAATTA TTAGCGACTA TAGTTCGCTG 1500 

CCAATAGAAG CTAGCTTGTT AGATATTCCA ACTATATTTT AT6TGTATGA TGAAGGAACA 1560 

TATGATCAGG TGAGAGGCCT GAATCAATTT TACAAAGCAA TACCGGATAG CTACAAAGTG 1620 

TATACTGAAG AAGATTTAAT AATGACGATA CAAGAAAAAG AACATCTATT AAGTCCGTTA 1680 

TTTAAAGATT GGCATAAGTA TAATACTGAT AAAAGTTTAC ATCAGCTCAC AGAATATATA 1740 

GATAAGATGG TGACAAAATG AGGTTTACGA TAATCATACC TACATGTAAT AATGAGGCAA 1800 

CAATTCGACA ATTGTTAATA TCTATTGAGA GTAAAGAACA CTATAGAATC CTTTGTATTG 1860 

ATGGTGGTTC TACTGATCAA ACAATTCCTA TGATTGAACG GTTACAAAGA GAACTCAAGC 1920 

ATATTTCATT AATACAATTA CAAAATGCTT CGATAGCTAC GT6TATTAAT AAAGGTTTGA 1980 

TGGATATCAA AATGACAGAT CCACATGATA GTGACGCATT TATGGTCATA AAACCAACAT 2040 

CAATCGTATT GCCAGGTAAA TTAGATAGGT TAACTGCTGC TTTCAAAAAT AATGATAATA 2100 

TTGATATGGT AATAGGGCAS CGAGCTTACA ATTACCATGG TGAATGGAAA TTGAAAAGTG 2160 

CTGATGAGTT TATTAAAGAC AATCGAATCG TTACATTAAC GGAACAACCA GATTTGTTAT 2220 

CAATGATGTC TTTTGACGGA AAGTTATTCA GTGCTAAATT TGCTGAATTA CAGTGTGaCO 2280

AAACTTTAGC TAACaCATAC AATCACGCAA TACTTGTCAA GGCGATGCAA AAAGCTACGG 2340 

ATATACATTT AGTTTCACAG ATGATTGTCG GAGATAACGA TATAGATACA CATGCTACAA 2400 

GTAACGATGA AGATTTTAAT AGATATATCA CAGAAATTAT GAAAATAAGA CAACGAGTCA 2460 

TGGAAATGTT ACTATTACCT GAACAAAGGC TATTATATAG TGATATGGTT GATCGTATTT 2520 

TATTCAATAA TTCATTAAAA TATTATATGA ACGAACACCC AGCAGTAACG CACACGACAA 2580 

TTCAACTCGT AAAAGACTAT ATTATGTCTA TGCAGCATTC TGATTATGTA TCGCAAAACA 2640 

TGTTJGACAT TATAAATACA GTTGAATTTA TTGGTGAGAA TTGGGATAGA GAAATATACG 2700 

AATTCTGGCG ACAAACATTA ATTCAAGTGG GCATTAATAG GCCGACTTAT AAAAAATTCT 2760 

TGATACAACT TAAAGGGAGA AAGTTTGCAC ATCGAACAAA ATCAATGTTA AAACGATAAC 2820 

GTGTACATTG ATGACCATAA ACTGCAATCC TATGATGTGA CAATATGAGG AGGATAACTT 2880 

AATGAAACGT GTAATAACAT ATGGCACATA TGACTTACTT CACTATGGTC ATATCGAATT 2940 

GCTTCGTCGT GCAAGAGAGA TGGGCGATTA TTTAATAGTA GCATTATCAA CAGATGAATT 3000 

TAATCAAATT AAACATAAAA AATCTTATTA TGATTATGAA CAACGAAAAA TGATGCTTGa 3060 

ATCAATACGC TATGTCGATT TAGTCATTCC AGAAAAGGGC TGGGGACAAA AAGAAGACGA 3120

TGTCGAAAAA TTTGATGTAG ATGTTTTTGT TATGGGACAT GACTGGGAAG GTGAATTCGA 3180 



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TAAAATCAAA CAAGAATTAT ATGGTAAAGA 
CTAAACGATA AATTAACTTA GGTTATTATA 
CTTTATAATG TGCAACTTGT CCXSTTTTTAG

ATTGATTATC ATATGAACAA TAAGTGCTAA 
AGTGAATAAT GGATGTTCTT CCCACATACT 
TGGCTGATGA ACTTCTACAG TTGGAGGTCC 
GTCCGCGTGT ACTTTACCAT CTTCGACTAC 
TGGTAAAACA TCGTAAAGAT CATTTTCAAC 

ACCTTTAGAC AATATTTTTA CATATTTATA 
CCCCATCATA TTACGTTGCT TCTCGCCACC 
AACTTGATTA ATTCTAAATT TACCTCGTTT 

ATCACTTGAT CCAGTTTTTA AACCATCTGT 
TGAAAAGTTG AATGTGTAAT ACGTAACTGC 

GTCTAATATT TTAGGTGTCT CTTTAATCAC

AGTCGTTACA GTACGTTCTT GGTCTTTATA 
ATTTTCAGCA CCCGTTGGAT TGACGAAATG 

GTTATTCATT AAATCAACGA AATCGCTGGT

TGCCGCGGCA TTACTAGAAT TAGATACTGT 
TTGTCCAGGA TATAGTTTCG TATTACTCAA 

GTTCGTCATT GTGACTGTGT CATCAAGTGA 
TAACTACATT GTCATTAATT TAGTCATAGA 
TTGATACAGT AATTGTCCAG TTTGACTTAC 

CX3ACAAACCT GCATAACCAT ATTGATTTGC 
AGCTTGTGCA TATGGTGTCA TAATACTTAA 
TAAATTTTTC ATAAAGCGTT AATCTTCCCT 

ATGGTTATTC CTACTTACGG AAATCATTGC 
AAATAAAGTT TTCTGCAGTT AATTTGAAAA 
AGGTGTTATA ATGTTTGTAC GAAGAGCAAA

TAATATAAAG ACTTTGAGAA GTTATTACAA 

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TGCTAAATAA ATTATATAGA ACTATCGATA 3300 

AAATAAATAT AAAACGGACA AGTTTCGCAG 3360 

TATGTTTTAT TTTCTTTTTC TAAATAAACG 3420 

TCCAGCGACA AGGCATGTAC CACCAATGAT 3480 

TTTAGCAACA GTATTTGCCT TTTGAATAAT 3540 

ATAATCTTTA TTAATAAATT CTCTTGGATA 3600 

AAGTTTATAA TCTTTTTTAC TAAAATCACT 3660 

ATAATATTTC TTACCATTTA TCCTTTGCTC 3720 

CTGATCAAAT GAGCGTTCCA TTAATGCATT 3780 

AAGGTTTTTA TAGTCTCCTG CACCCATGAT 3840

GGTAGTAATC GTATGGTTGT AATTTGCTGT 3900 

ACCCGGCAAA CTCATTTTTG CACCTTCCAA 3960 

ATGCGTTGTT GGTGCTAACT GCTTTGTAAA 4020 

GTGTAAATCT AAAATGGCAT AGTCTCTAGC 4080 

CTTTGTTGGT GCAAATGTAC GTAATCTTGA 4140 

TGTATTTTTC ATTCCGATAG CTTTAQCTTT 4200 

GTTTTTTGAA ACCTTCTTAG CTAAAATTAA 4260 

AATTTGTAAT AGGTCTGCGA TTGTCCATAC 4320 

CTCAGGTAGT GTAGACATAA TATATTCTTT 4380 

AAGCTGCCCC TTATTTACAG CTTCCAATGT 4440 

CGCTGGAtTC CACTTAGTAT CGATATTGTA 4500 

ATTAACAGCA CTCX5TCGGTT CGTATGCAGC 4560 

TGCTTGTACA GGGGTTACGT CACTGTTAGT 4620 

TGTTAAACAT AAAATGATGA TAATAGATAT 4680 

TTTCCAATTC TTAAATATTC CCTAAAAGCA 4740 

TAATTCACTT CACCTTAATT AAATTGTTGA 4800

ATAATGCAAA TATATTACX3T GTGTAGCTAA 4860

CTTACTCAAA AGCGATTAAT TTTCATGTTT 4920

AAAATGCAAT AGAAATATTC TATCATATAA 4980 
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AAGTATATGA TAGAAATGCA TGTATCTATC TAAATGAATT AACTATAAAT TTCAAACAGA 5100 

AGAGGTAAAA CTATGAAACG AGAAAATCCA TTGTTTTTCT TATTTAAAAA ACTATCATGG 5160 

CCAGTGGGTC TTATCGTTGC AGCTATCACT ATTTCATCAC TAGGGAGCTT AAGTGGACTA 5220

TTAGTGCCAC TGTTTACTGG ACGAATTGTA GATAAATTTT CCgTGAGCCA TATCAATTGG 5280

AATCtAATCG CATTATTTGG TGGTATCTTT GTCATCAATG CTTTATTAAG CGGATTAGGT 5340

TTATATTTAT TAAGTAAAAT TGGTGAAAAG ATTATTTATG CGATACGCTC AGTTTTATGG 5400 

GAGCATATCA TACAATTAAA AATGCCATTC TTTGACAAAA ATGAAAGTGG TGAATTAATG 5460 

AGTCGATTAA CTGACGATAC GAAAGTGATA AATGAATTTA TTTGACAAAA GCTACCTmAC 5520 

TTATTACCAT CAATCGTTAC ATtAGTTGGG TCACTAATCA TGTTATTTAT TTTAGATTGG 5580 

AAAATGACAT TATTAACATT TATAACGATA CCGATATTCG TTTTaATTAT GATTCCTCTA 5640 

GGTCGTATTA TGCAAAAGAT ATCGACAAGT ACACAATCTG AAATTGCAAA CTTCAGTGGT 5700

TTGTTAGGGC GTGTCCTAAC TGAAATGCGT CTTGTTAAAA TATCAAATAC AGAGCGTCTT 5760 

GAATTAGATA ATGCACATAA AAATTTGAAT GAAATATATA AATTAGGTTT AAAACAGGCT 5820 

AAAATTGCGG CAGTTGTACA ACCAATTTCA GGTATAGTTA TGTTGCTAAC AATTGCAATT 5880

ATTTTAGGTT TTGGTGCATT AGAAATTGC6 ACTGGTGCAA TCACTGCAGG TACATTAATT 5940 

GCAATGATAT TTTATGTTAT TCAGTTATCT ATGCCTTTAA TCAATCTTTC CACGTTAGTT 6000 

ACAGATTATA AAAAGGCAGT CGGTGCAAGT AGTAGAATAT ACGAAATCAT GCAAGAACCT 6060 

ATTGAACCGA CAGAAGCTCT TGAAGATTCT GAAAATGTAT TAATTGATGA CG6TGTATTG 6120 

TCATTTGAAC ATGTAGACTT TAAATATGAT GTGAAGAAAA TATTAGATGA TGTGTCGTTC 6180 

CAAATCCCAC AAGGTCAAGT GAGTGCTTTT GTAGGCCCTT CTGGGTCTGG TAAAAGTACG 6240 

ATATTTAATC TGATAGAACG TATGTATGAA ATTGAGTCAG GTGATATTAA ATATGGCCTT 6300 

GAAAGTGTCT ATGATATCCC GTTATCTAAG TGGCGACGCA AAATTGGATA TGTTATGCAA 6360

TCAAATTCGA TGATGAGTGG TACAATTAGA GACAATATTT TATACGGAAT TAATCGTCAT 6420 

GTTTCAGATG AAGAACTTAT TAATTATGCT AAATTAGCGA ACTGTCATGA TTTTATCATG 6480

CAATTTGATG AAGGATATGA CACGCTTGTA GGTGAACGAG GATTGAAACT GTCTGGCGGA 6540

CAACGTCAAC GTATTGATAT TGCTAGAAGT TTTGTTAAAA ATCCTGATAT TTTGTTACTT 6600 

GATGAAGCAA CAGCTAATCT CGATAGTGAA AGTGAATTGA AAATTCAAGA AGCTTTAGAA 6660 

ACATTGATGG AAGGTAGAAC AACGATTGTC ATTGCXJCATC GTTTGTCTAC AATTAAAAAA 6720

GCCGGTCAAA TTATATTCTT AGACAAAGGA CAGGTAACAG GTAAAGGTAC GCATTCAGAA 6780 


TTTTATATAT ATAAGTAAGC TTGGAGCAAA 
TGGCACATTG ATGGATATAG ATGTTAATAA 
ATTTGAGAAG TTACGACATA ATAATTCTTA

ATGCTCATGG TATAATACAA GTTATAAGCA 
AGTCATAATT CTTCATATTT TACATAGCAA 
AGTTTGATAT TTCAATGTTT CTAAATTTTT 
GTTTTTATTA ATCAACATTA TTGGTCTAAT 
AAGAGATCGC AAAAATATCC AATGGCAATC 

TTTAGCATGG TTCTTTATTT ATTTTGATTG 
TGGTATCGCT TGGGTAGTTC AGTCAGCGCA 
GACAAATGTT AAAATGATGG ATATGGCTGT 

GCCATTATTT GATATCTTAA TGTACTTTAA 
TTGGTTACTA GCTAAAGTAA CAAGACAACC 

GATGTTCTTA GGAAATACTG AAGCATTAGC

TGAAATGCGT GTATTAACAA TCGCAATGAT 
AGGTGCGTAT GTACAAATGG TACCAGGAGA 

CGTTAACGCG ATTATTGTGT CATGCTTGTT

TATTATTTAC AGTCTTAAAA ACAATGAAGT 

AGATTCTGTA TTAGCAGCAG GTAAATTAGT 

TGTAGCGTTA GCTGATCTAT TTGATCX5TTT 

ATGGATAGGC ATAAAAGGTA GTTTCGGTTT 

ATTTGCGCTA TTACTCGGTT TACCTTATGA 

TAAGAAAATT GTTACAAATG AATTTGTTGT 
TTATACACCA CACCATCGTG CGGTTATTAC 

AACX5ATTGGT ATGATTATCG GTACATTGAA 

TGTATCTAAA TATGTACCTA TGATGCTATT 
AGCTTTCGTT GGTTTATTTG CATGGTAATA 
AACTAATAAA TATGTCCAGG CATGTCGTCT

TATTATTGAT ATAAAGGTAT ilTAAATATTT 

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TACACATATA CCATCGAGGA AATTAAAGTG 6900 

ATTGCTTCAA GCTTTTGTCT ATTTTAAATC 6960 

AATTAATGAA ATCGATATTT TAAGAAAAAA 7020 

AACATACATA TATTAAATAC TGTAGCCACG 7080 

TTTAACTGAT TTTAGAGTCC ACGGTACAGA 7140 

AAAAAATTAA ATCATAGGTG GGTGCCAAAT 7200 

TGTATTTCTT GGTATTGCGG TATTATTTTC 7260 

AATTGGGATC TTAGTTGTTT TAAACCTGTT 7320 

GGGTCAAAAA GCAGTAAGAG GAGCAGCCAA 7380 

TGCTGGTACA GGTTTTGCAT TTGCAAGTTT 7440 

TGCAGCCTTA TTCCCAATAT TATTAATAGT 7500 

TATTTTACCG AAAATTATTG GAGGTATTGG 7560 

TAAATTCGAG TCATTCTTTG GGATAGAAAT 7620 

CGTATCAAGT GAGCAACTAA AACGTATGAA 7680 

GTCAATGAGC TCTGTATCGG GAGCTATTGT 7740 

ACTGGTACTA ACGGCAATTC CACTAAATAT 7800 

GAATCCAGTA AGTGTTGAAG AGAAAGAAGA 7860 

TGAACGTCAA CCATTCTTCT CATTCCTTGG 7920 

ATTAATCATC ATCGCATTTG TTATTAGTTT 7980 

TATCAATTTG ATTACAGGAT TGATAGCAGG 8040 

AAACCAAATT TTAGGTGTGT TTATGTATCC 8100 

TGAAGCGTGG TTGGTAGCAC AACAAATGGC 8160 

TATGGGTGAA ATTTCTAAAG ATATTGCATC 8220 

AACATTCTTA ATTTCATTTG CAAACTTCTC 8280 

AGGCATTGTT GATAAAAAGA CATCAGACTT 8340 

ATCAGGTATC CTAGTTTCAT TATTAACAGC 8400

TGTCGAAGAG TGACTATGAT AATACATTTT 8460

ATTQATATAG GTGAGATGCT TGGACTTTTT 8520 

rrAAAGTTAC CGAAATTGAA GCATTATAAA 8580 
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GACAGTAAGG ACTAGGTACA GTCATAGTAC TTCGAGCAAA ATTTGTTTTG TTATTATAAA 8700 

CAACACAAAG GAGATAACTT CTCTAnTGAA GAAGTTAAAA ACATTATAGC AGACAATGAA 8760 

ATGAAAGTAA ATTAAAAAT 8779

(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 31096 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 





GTTGCAGTAG TCAAAGAATT AAACAAGGTG AAGGcGTGTA GCTTGCACAC CCGAAAATGT 60

GCGTAAGTTA aCGGATGCAG GACATAAAGT AATTGTTGAA AAAAATGCTG GCATTGGTTC 120

AGGATlTi'd' AACGATATGT ATGAAAAAGA AGGCGCTAAG ATCGTAACTC ACGAACAAGC 180

ATGGGAAGCT GATCTTGTTA TCAAAGTAAA AGAACCTCAT GAAAGCGAAT ATCAATATTT 240

CAAAAAGAAT CAAATTATCT GGGGATTTTT ACATCTAGCA TCTTCAAAAG AAATAGTAGA 300

AAAAATGCAA GAAGTTGGTG TAACTGCGAT TAGTGGTGAA ACCATTATAA AAAATGGAAA 360

1 VdAlv I \7U TA r CGCTCAGCAA TTATGGGAGC 420

TTACTACTCT GAAGCACAAC ATGGTGGTCA AGGTACTTTA GTGACTG6TG TACATGAAAA 480

TGTGGATATA CCTGGTAGTA CATATGT6AT TTTCGGTGGT GGAGTAGCAG CAACAAATGC 540

AGCAAATGTT GCCTTGGGAC TAAATGCTAA AGTAATCATT ATCGAGTTAA ACGATGACCG 600

CATTAAATAT CTTGAAGATA TGTATGCAGA AAAAGATGTC ACAGTAGTCA AATCAACACC 660

AGAipU^TTTA GCAGAACAAA TTAAGAAAGC AGATGTATTT ATTTCTACAA TTTTAATTTC 720

AGGTGCGAAA CCGCCAAAAT TGGTTACTCG TGAGATGGTT AAATCAATGA AAAAAGGTTC 780

AGTATTAATC GATATAGCTA TTGACCAAGG TGGAACTATT GAAACAATTA GACCAACTAC 840

AATTTCTGAT CCAGTGTATG AAGAAGAAGG TGTGATTCAT TATGGTGTAC CAAATCAACC 900

AGGAGCAGTC CCAAGAACTT CAACAATGGC ATTAGCACAA GGAAATATTG ATTATATATT 960

AGAAATTTGT GACAAAGGCT TAGAACAAGC AATTAAAGAT AATGAAGCCT TAAGTACTG6 1020

TGTAAACATT TACCAAGGAC AAGTGACAAA TCAAGGATTA GCTTCATCAC ATGACCTAGA 1080

TTATAAAGAA ATATTAAATG TTATCGAATA GATAGTAATT TAAATGAAAT TGAGTGAAAT 1140

GAATATTTTA AATATAGCAT TATAGTTTGG ACTAAAAATT TACAAAACGG AAGGATGTAA 1200
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TCGAAGAAGC TAAAGCAAGC ATTAAACCAT 
TGTATTTAAG CCAAAGTATA ACTAAAGGGA 
TCACAGGATC TTTTAAATTT AGAGGCGCTA

CAAAAAGAAA AAGGCATTAT CGCAGCATCT 
AACAGCTAAA TTATTAGGCA TTGATGCAAC 
GAAACAACAA GCAACAAAAG GCTATGGGGC 
CGAAACTAGA CTTTATATGG AAGAATTAGC 
ATATGACGAT AAGTTTGTAA TGGCAGGCCA 

TATTTGGAAT GTGAATACAG TCATCGTACC 
TGCCACCGCA TTAAAATCAT TTAACCCTTC 
TGTTCATGGT ATGGCTGAGT CTTTCTATAA 

CACAATAGCA GATGGTTGTG ATGTAAAAGT 
ACATTTAGTA GATGAATTTA TTCTTGTTAC 

TTTAATGCAG CGTGCCAAAA TTATTACTGA
TTTAAGTGGA AAAATAAACA ATAAATGGCT 
AGGCGGGAAT GTTGACTTAA CTAGAGTTTC 

AGATACAAGC AAGGGTGTGG TAGGTTAAAA
GTCAAATGGT AAAGAATTAC AAAAAAATAT 
GGGGACAGTT ATTGGTTCAG GAGTATTCTT 

AACAGCAGGA ATGGCCTTGT TTGTATGGTT 
GTTAACAGCA GCAGAACTTG CTGCTGCAAT 
AGAATATACA TACGGTGATT TCTGGGGCTT 

TTTTCCAGCT AACGTAGCAG CATTGTCTAT 
CCATTTATCT ATAGGTTCGT TAATACCAAT 
GATAAATTTC CTAGGTTCAA AAGCAGGCGG 

ACTGATTCCA ATCATCGTTA TTGTAATTTT 
TTCATTAATT CCAACTACAG GTAATTCaGG 
TTTATTAGCA ACTATGTTTG CATATGATGG
ACTTAAAAAT CCTAAACGCG ATTTACCTTT 

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TTATTCGTCG AACACCTCTA ATTAAATCAA 1320 

ATGTATTTCT AAAATTAGAA AATATGCAAT 1380 

gCAATnAAAA TTAATCACTT AACAGATGAA 1440 

GCTGGGgAAC CATGCACAAG GTGTTGCTTT 1500 

GATTGTAATG CCTGAAACAG CACCACAAGC 1560 

AAAGGTTATT TTAAAAGGTA AAAACTTTAA 1620 

GAAAGAAAAT GGCATGACAA TCGTTCATCC 1680 

AGGAACAATT GGTTTAGAAA TTTTAGATGA 1740 

AGTTGGCGGT GGAGGATTAA TTGCAGGTAT 1800 

AATTCATATT ATCGGTGTTC AATCTGAfiAA 1860 

GAGAGATTTA ACTGAACATC GAGTGGATAG 1920 

TCCTGGTGAA CAAACATATG AAGTAGTTAA 1980 

TGAAGAAGAA ATTGAACATG CTATGAAAGA 2040 

AGGTGCAGGC GCATTACCAA CAGCTGCAAT 2100 

TGAAGATAAA AATGTTGTTG CATTAOTTTC 2160 

AGGTGTCATT GAACATGGAC TGAATATTGC 2220 

CATTTAATCT TAAAAATGAG GIGTAATTAT 2280 

AGGTTTCTTC TCAGCGTTTG CTATTGTTAT 2340 

TAAAATATCA AACGTAACAG AAGTAACAGG 2400 

CCTAGGCGGC ATCATTACCA TTTGTGCGGG 2460 

CCCTGAAACA GGTGGCTTAA CGAAGTATAT 2520 

CCTATCAGGT TGGGCGCAAT CATTTATTTA 2580 

CGTATTTGCX3 ACACAGCTAA TTAATTTATT 2640 

AGCAATCGCA TCTGCGTTAT CTATTGTGTT 2700 

AATTTTACAA TCAGTTACTT TAGTAATTAA 2760 

TGGTATTTTT CAATCTGGAG ATATCACTTT 2820 

AAATGGCTTC TTTACAGCAA TTGGTAGTGG 2880

TTGGATTCAT GTAGGAAATG TTGCGGGGGA 2940 

AGCGATTTCA GTTGGTATCG GTTGTATTAT 3000 
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TGGTAATTTA AATGCAGCTT CAGATACATC AAAAATATTA TTTGGTGAAA ATGGCGGTAA 3120 

GATTATTACA ATCGGTATAT TAATTTCTGT TTATGGTACG ATCAATGGCT ATACTATGAC 3180 

TGGTATGCGC GTACCATATG CAATGGCTGA AAGAAAATTA TTGCCATTTA GCCATTTATT 3240

CGCAAAATTA ACAAAATCTG GCGCACCATG GTTTGGCGCA ATTATACAAC TTATAATCGC 3300 

TATCATCATG ATGTCAATGG GAGCATTTGA TACAATTACA AATATGTTAA TCTTTGTTAT 3360 

TTGGTTGTTC TATTGTATGT CATTTGTTGC GGTAATAATT TTAAGAAAAC GTGAACCAAA 3420 

TATGGAACGA CCATATAAAG TACCGTTATA TCCGATCATA CCTTTAATTG CTATTTTGGC 3480 

AGGATCATTT GTATTAATTA ATACACTGTT TACACAATTT ATATTAGCAA TCATTGGAAT 3540 

TCTAATAACA GCACTTGGTA TACCAGTTTA TTACTATAAA AAGAAACAAA AAGCAGCATA 3600 

AGGTAAGATA ACTAGCATTG AGAATAAATG GATGGACTAC TAATAAATTT AAAGTTTTAC 3660 

ACATTAAAAT CAAAAACCAT TCAATTATTC TATGGAACAG ACAAATTTCT GTTATGGAAT 3720

TTGTCTGTTT TTCAAAAGTA TAGGGAGGCA AATAGAGATG GAAAAGCCGT CAAGAGAGGC 3780 

ATTTGAAGGC AATAATAAGT TGTTAATAGG AATTGTTCTA AGTGTAATAA CGTTTTGGCT 3840 

ATTTGCACAA TCATTGGTTA ATGTTGTACC AATACTTGAA GATAGTTTCA ATACAGATAT 3900

TGGAACGGTT AATATCGCCG TTAGTATAAC TGCTTTATTT TCAGGAATGT TTQTAGTAGG 3960 

AGCAGGTGGT CTTGCTGATA AATATGGCAG AATTAAACTC ACGAACATTG GTATTATCTT 4020 

AAATATATTA GGTTCATTAT TAATCATTAT TTCAAATATT CCTTTATTAC TTATTATAGG 4080

AAGATTAATT CAAGGACTTT CAGCAGCATG TATTATGCCT GCAACTTTGT CTATTATTAA 4140 

GTCATATTAC ATTGGGAAAG ATAGACAACG CGCTTTAAGT TATTGGTCAA TTGGCTCATG 4200 

GGGCGGCTCT GGTGTTTGTT CATTTTTTGG AGGTGCAGTT GCAACGCTTT TAGGTTGGCG 4260 

TTQGATTTTC ATCCTATCAA TTATAATTTC ATTAATTGCA CTGTTTCTTA TTAAAGGCAC 4320 

ACCTGAAACT AAATCTAAAT CGATTTCTCT AAATAAATTT GACATTAAAG GTCTGGTTCT 4380

TTTAGTCATT ATGCTCCTCA GTTTAAATAT TTTAATTACT AAAGGATCAG AATTAGGTGT 4440 

AACCTCACTT CTTTTTATTA CTTTATTAGC TATTGCAATT GGATCTTTTA GTTTATTTAT 4500 

AGTTCTTGAA AAGCGTGCTA CAAATCCTTT AATCGATTTT AAATTATTTA AAAATAAAGC 4560

TTACACAGGT GCAACAGCTT CAAACTTTTT GTTAAATGGT GTTGCAGGAA CATTAATAGT 4620 

AGCCAACACA TTTGTTCAAA GAGGTTTAGG ATATTCTTCA TTGCAAGCAG GAAGTTTATC 4680 

AATCACTTAT TTAGTAATGG TACTAATTAT GATTCGTGTT GGTGAAAAGT TACTTCAAAC 4740

ACTCGGATGC AAGAAACCAA TGTTAATTGG AACAGGAGTT CTTATTGTCG GAGAATGTCT 4800 
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10 



ATTCTTTGGT TTAGGACTAG GGATATAT6C TACACCATCA ACAGATACAG CAATTGCAAA 4920 

TGCACCGTTA GAAAAAGTAG GCGTTGCTGC AGGTATCTAT AAAATGGCTT CTGCATTAGG 4980 

TGGAGCATTT GGCGTCGCAT TGAGTGGTGC AGTATATGCA ATCGTATCAA ATATGaCAAA 5040 

CATTTATACA GGTGcAATGa TTGnCATTAT GGTTaAATGC AGGTATGGGa ATATTATCaT 5100 

TCGTTATCAT TTTGtTACTT GTGcCTAAAC mAAACGACAC TCAATTATGA TAATTGAGAA 5160 

TTAAATTGAA ATCATACAAG TCGCTACAAT ATTAAACAAA AATATAAACC GATTCTTATG 5220 

TGTCATTATT TTAAATGAAC ATAGGGATTG GTmTTATT ACTCTTTTAC GCTACTTTAT 5280 

TTATAATTAT TATAAATTGT CACAAATTCA ATTTACCTTA CAATATATTT TGTGTTATTA 5340 

TATTCTGGAG CATAAATAAA TTGTTCAACA CATAGTTGTA ATGTGTTTCA ATACTTTTTG 5400 

GATAGATTGC GAAATTGTAT TOAATCGTCA TCGTTTTAAA TTTTTAAATG AGAATGGAAT 5460 

GAGCATTACA ATACACAAGC AATCAAAAGT AAATACATTC ACAACACAAC AGAGACATAA 5520

CAACAAGATA AGGAGTGAAC AATAGCTGTG AATTATCGTG ATAAAATTCA AAA6TTTAGT 5580 

ATTCGTAAAT ATACAGTTGG TACATTTTCA ACTGTCATTG CGACATTGGT ATTTTTAGGA 5640 

TTCAATACAT CACAAGCACA TGCTGCTGAA ACAAATCAAC CAGCAAGCGT GGTTAAACAG 5700

AAACAACAAA GTAATAATGA ACAGACTGAG AATCGAGAAT CTCAAGTACA AAATTCTCAA 5760 

AATTCACAAA ATGGTCAATC ATTATCTGCT ACTCATGAAA ATGAGCAACC AAATATTAGT 5820 

CAAGCTAATT TAGTAGATCA AAAAGTAGCG CAATCATCTA CTACTAATGA TGAACAACCA 5880 

GCATCTCAAA ATGTAAATAC AAAGAAAGAT TCGGCAACGG CTGCGACAAC ACAACCAGAT 5940 

AAAGAACAAA GTAAGCATAA ACAAAACGAA AGTCAATCTG CTAATAAAAA TGGAAACGAC 6000 

AATAGAGCGG CTCATGTAGA AAATCATGAA GCAAATGTAG TAACAGCTTC AGATTCATCT 6060 

GATJ^TGGTA ACGTACAACA TGACCGAAAT GAATTACAAG CGTTTTTTGA TGCAAATTAT 6120 

CATGATTATC GCTTTATTGA CCGTGAAAAT GCAGATTCTG GCACATTTAA CTATGTAAAA 6180 

GGCATTTTTG ATAAGATTAA TACGTTATTA GGCAGTAATG ATCCAATAAA CAATAAAGAC 6240 

TTGCAACTTG CATACAAAGA ATTGGAACAA GCTGTTGCTT TAATTCGTAC AATGCCTCAA 6300 

CGTCAACAGA CTAGCCGACG TTCAAATAGA ATTCAAACXX: GTTCGGTTGA GTCAAGAGCT 6360

GCAGAGCCTA GATCAGTATC AGACTATCAA AATGCAAATT CATCATATTA TGTTGAAAAT 6420 

6CTAATGATG GTTCGGGCTA TCCTGTTGGT ACATATATCa ATGCTTCTAG TAAAGGGGCG 6480 

CCATATAATT TACCAACTAC ACCATGGAAT ACATTGAAGG CCTCTGACTC AAAGGAAATT 6540

GCTCTTATGA CAGCGAAACA AACTGGAGAC GGGTACCAAT GGGTTATTAA GTTTAATAAA 6600 
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30 




GTAGGAAGAA CTGACTTTGT AACAGTTAAT 
GGAGCAGGAG CAGGTGCAAA TAAACCACTT 
CCTCATCGTT CACATGACTT TAAAATAAGA

CCAACTGTCC ATATTTATTC TTTAGAAGAT 
GCTGGAGCGA CACCTGCTAC TAAAGCTTTT 
CAAAAACCTG CTGAATCACC GGGTGTTCCT 
GCAAGTTATA CAATTTCATT TAAAACACAA 
GCAGGTGGGC GTGCTTTAGA GTACAATCAA 

TC7ACGCAAG ACCATCAACA ACGTCTTAAT 
CGCATAGGTA CAACTAAACG TGTAGAAGTG 
TTAGAAAGTA CAAACCTAAA TATAGATGAT 

ACGCCGAGTA ATAAAGTGTT AGGATTTTAT 
CCGGGTGGAG CCCAACAATT AAATGAATAT 

TTACAAGAAG CAGCAAGAAC TAGAAACCCA
GATGCTTATG GTAATAGTGA AcTTTAGTTC 
AACATAATAt TaAATTCTTT AAAAATGACG 

CAAAACAAGC TGGGCATCCA GTTTTCTATG
TGAATTTAGG TGGTAGCGTA ACATCTATTC 
AGAATTTTAC AGATAAAGAT TGGCAAATTA 

ACTCGACAAA TAGACCTAAT AATGCGAGAG
TACC5GGGGA TTACTTTGGA ACGATACGTT 
GTGTTAAACC ACATACACCA ACAATTACAA 

TACAAAAAGT GCCTGTTAAT AITTCGGGAA 
TAGTTGCACC AACAAATCAA ACTACGAATG 

GTTATACGAT ACTTGCGACT GGTACACCTG 

CGCAAGATTA TGTTGTATTC ATACCACCTG 
ATAATAAAGT AGTTGCATCT AATATGAGTA 
CACCAACAAT CAATAATCCT GTTGGAATAA 
CTTTACAATG GGTGTCTCTG ATAGACATTC 
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TCAGATGGAA CAAATGTACA ATGGAGTCAT 6720 

CAACAAATGT GGGAATATGG AGTAAATGAT 6780 

AATAGAAGTG GCCAAGTAAT ATATGACTGG 6840 

TTATCTAGAG CGAGTGATTA TTTTAGTGAA 6900 

GGTAGACAAA ATTTTGAATA TATTAATGGT 6960 

AAAGTTTATA CTTTCATCGG TCAAGGTGAT 7020 

GGTCCAACTG TTAATAAATT GTACTATGCA 7080

TTATTTATGT ACAGTCAACT ATACGTCGAA 7140 

GGTTTAAGAC AAGTGGTTAA TCGTACATAT 7200 

AGTCAAGGAA ATGTACAAAC GAAAAAGGTA 7260 

TTTGTTGATG ATCCTTTAAG TTATGTTAAG 7320 

TOGAATAATG CAAATACTAA TGCTTTTAGA 7380 

CAATTAAGTC AATTATTTAC TGATCAAAAA 7440 

ATAAGATTAA TGATTGGTTT CGACTATCCT 7500 

CTGTTAACTT AACGGTATTA CCTGAAATCC 7560 

ATACTCAAAA TATTGCTGAA AAACCATTTT 7620 

TATATGCAGG TAACCAAGGG AATGCTTCCG 7680 

AACCATTACG TATTAATTTA ACAAGTAATG 7740 

CAGGTATTCC GCGTACATTA CACATTGAAA 7800 

AACGCAATAT TGAACTTGTT GGTAACTTAT 7860 

TTGGACX3TAA AGAACAATTA TTCGAAATTC 7920 

CGACAGCTGA GCAATTAAGA GGTACAGCAT 7980 

TACCGTTGGA TCCATCGGCA TTGGTTTATT 8040 

GTGGTAGTGA GGCAGATCAA ATACCATCTG 8100 

ATGGGGTGCA TAATACAATT ACTATACGAC 8160 

TAGGTAAACA AATTAGAGCA GTAGTTTATT 8220 

ATGCTGTTAC TATTTTGCCA GATGACATTC 8280

ATGCCAAATA CTATCGAGGC GACGAAkCAA 8340 

TGGTATAAAA AATACAACTA TTACGACATT 8400 
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TACAGGTAGA GTGAGTATGA ATCAGGCATT 
GACAGaCAAT GTCAATAATA CGACAAATGA 
AGGTAAAATT AGTGAAGATG CTCATCCGAT

AGTCAATCCG ACTGCTGTAT CTAATGATGA 
TAAAAACCAA AATATAAGAG GATATTTAGC 

TGGTAATGTC ACATTACATT ACCGTGATGG 
GATGACATAC GAACCAGTTG TGAAACCTGA 
AACGGTAACG ATTGCTAAAG GACAATCATT 

TTTAAGTAAT GGACAACCTA TTCCAAGTGG 
TATTCCAACT GCACAAGAAG TTAGTCAAAT 
TGCTACAAAT GCGTATCATA AAGATAGTGA 
TGTGAAACAA CCAGAAGGCG ATCAACGTGT 
TGATGAAATC TCAAAAGTAA AACAAGCATT 
TGCCGAAGGT GATATTTCAG TTACAAATAC

AGTAAATATT AATAAAGG7C GATTAACGAA 
TTTCTTGCGT TGGGTTAATT TCCCACAAGA 
TGCAAACAGA CCAACAGATG GTGGTTTATC 
TCGTTATGAT GCTACATTAG GTACTCAAAT 
AGCAACAACT ACAGTGCCTG GATTGCGAAA 

AGAAGCTGGC GGAAGACCTA ACTTTAGAAC 
TGATCGTCAA CGTCAATTTA CGTTGAATGG 
CCCTTCAAAC GGTTATGGTG GGCAACCTGT 

TAACTCAACT GTTGTTAACG TAAACGAACC 
TGACCACGTT GTAAAAAGTA ATTCTACACA 

GTTATACTTA ACGCCATATG GTCCAAAACA

AAATACTACT GACGCTATTA ACATTTATTT 
TTCAGTAGGT AATTACACTA ATCATCAAGT 

TACAGCGAAT GATAACTTTG GTGTGCAATC

AGGTACTGTT GATAATAACC ATCAACATGT 

TAACAGTGAT ATTACATTTA AAGTGTCAGC 8520 

TAGTCAATCT AAACATGTTT CAATTCATGT 8580 

TGTATTAGGA AATACTGAGA AAGTTGTAGT 8640 

AAAGCAAAGC ATAATTACTG CCTTTATGAA 8700 

ATCAACTGAT CCAGTAACTG TCGATAATAA 8760 

CTCATCGACA ACGCTTGATG CTACAAATGT 8820 

ATACCAAACT GTCAATGCTG CTAAAACAGC 8880 

TAGTATTGGT GATATTAAAC AATATTTTAC 8940 

CACATTTACA AATATTACAT CTGATAGAAC 9000 

GAACGCAGGC ACGCAGTTAT ACCATATAAC 9060 

AGACTTCTAT ATTAGTTTGA AAATCATCXSA 9120 

ATATCGTACA 7CAACATATG ATTTAACTAC 9180 

TATTAATGCA AATAGAGATG TAATTACX3CT 9240 

ACCTAATGGT GCTAATGTAA GTACTATTAC 9300 

ATCATTCGCG TCAAACCTAG CTAATAT6AA 9360 

TTATACAGTG ACATGGACGA ATGCAAAAAT 9420 

ATGGTCTGAT GACCATAAAT CTTTAATTTA 9480 

TACGACGAAT GATATTTTAA CAATGTTAAA 9540 

TAACATTACT GGTAATGAAA AATCACAAGC 9600 

GACTGGTTAT TCACAATCAA ATGCGACAAC 9660 

TCAAGTGATT CAAGTGTTAG ACATCATCAA 9720 

TACAAATTCA AATACTCX3TG CAAACCATAG 9780 

GGCAGCTAAT GGTGcTGGCG CATTTACAAT 9840 

TAATGCAAGT GATGCAGTTT ATAAAGCACA 9900 

ATATGTTGAA CATTTAAATC AAAATACAGG 9960 

TGTACCAAGT GACTTAGTGA ATCCAACAAT 10020 

GTTCTCAGGT GAAACATTTA CAAATACTAT 10080 

TGTAACTGTA CCAAATACAT CACAAATTAC 10140 

TTCTGCAACG GCACCAAATG TGACATCAGC 10200 
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GTTCAATGTA ACAGTGAAAC 


CTTTGCGTGA 


TAAATATCGA 


GTTGGTACTT 


CATCAACGGC 


10320 




TGCTAATCCT GTGAGAATTG CCAATATTTC GAATAATGCG ACAGTATCAC AAGCTGATCA 10380

AACGACAATT ATTAATTCGT TAACGTTTAC TGAAACAGTA CCAAATAGAA GTTATGCAAG 10440

AGCAAGTGCG AATGAAATCA CTAGTAAAAC AGTTAGTAAT GTCAGTCGTA CTGGAAATAA 10500

TGCCAATGTg CACAGTAACT GTTACTTATC AAGATGGAAC AACATCAACA GTGACTGTAC 10560

CTGTAAAGCA TGTCATTCCA GAAATCGTTG CACATTCGCA TTACACTGTA CAAGGCCAAG 10620

ACTTCCCAGC AGGTAATGGT TCTAGTGCAT CAGATTACTT TAAGTTATCT AATGGTAGTG 10680

ACATTGC?iGA TGCAACTATT ACATGGGTAA GTGGACAAGC GCCAAATAAA GATAATACAC 10740

GTATTGGTGA AGATATAACT GTAACTGCAC ATATCTTAAT TGATGGCGAA ACAACGCCGA 10800

TTACGAAAAC AGCAACATAT AAAGTAGTAA GAACTGTACC GAAACATGTC TTTGAAACAG 10860

CCAGAGGTGT TTTATACCCA GGTGTTTCAG ATATGTATGA TGCGAAACAA TATGTTAAGC 10920

CAGTAAATAA TTCTTGGTCX; ACAAATGOGC AACATATGAA TTTCCAATTT GTTGGAAGAT 10980

ATGGTCCTAA CAAAGATGTT GTAGGCATAT CTACTOGTCT TATTAGAGTG ACATATGATA 11040

ATAGACAAAC AGAAGATTTA ACTATTTTAT CTAAAGTTAA ACCTGACCCA CCTAGAATTG 11100

ACGCAAACTC TGTGACATAT AAAGCAGGTC TTACAAACCA AGAAATTAAA GTTAATAACG 11160

TATTAAATAA CTCGTCAGTA AAATTATTTA AAGCAGATAA TACACCATTA AATGTCACAA 11220

ATATTACTCA TGGTAGCGGT TTTAGTTCGG TTGTGACAGT AAGTGACX3CG TTACCAAATG 11280

6CGGAATTAA AGCAAAATCT TCAATTTCAA TGAACAATGT GACX5TATACG ACGCAAGACG 11340

AACATGGTCA AGTTGTTACA GTAACAAGAA ATGAATCTGT TGATTCAAAT GACAGTGCAa 11400

CAGTAACAGT GACACCACAA TTACAAGCAA CTACTGAAGG CGCTGTATTT ATTAAAGGTG 11460

GCGA&GTTT TGATTTCGGA CACGTAGAAA GATTTATTCA AAACCCGCCA CATGGGGCAA 11520

CGGTTGCATG GCATGATAGT CCAGATACAT GGAAGAATAC AGTCGGTAAC ACTCATAAAA 11580

CTGCGGTTGT AACATTACCT AATGGTCAAG GTACGCGTAA TGTTGAAGTT CCAGTCAAAG 11640

TTTATCCAGT TGCTAATGCA AAGGCGCCAT CACGTGATGT GAAAGGTCAA AATTTGACTA 11700

ATGGAACGGA TGCGATGAAC TACATTACAT TTGATCCAAA TACAAACACA AATGGTATCA 11760

CTGCAGCATG GGCAAATAGA CAACAACCAA ATAACCAACA AGCAGGCGTG CAACATTTAA 11820

ATGTCGATGT CACATATCCA GGTATTTCAG CTGCTAAACG AGTTCCTGTT ACTGTTAATG 11880

TATATCAATT TGAATTCCCT CAAACTACTT ATACGACAAC GGTTGGAGGC ACTTTAGCAA 11940

GTGGTACGCA AGCATCAGGA TATGCACATA TGCAAAATGC TACTGGTTTA CCAACAGATG 12000
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20 






TGAATAAACC GAATGTGGCT AAAGTCGTTA ACGCAAAATA TGACGTCATC TATAACGGAC 12120 

ATACTTTTGC AACATCTTTA CCAGCGAAAT TTGTAGTAAA AGATGTGCAA CCAGCXSAAAC 12180 

CAACTGTGAC TGAAACAGCG GCAGGAGCGA TTACAATTGC ACCTGGAGCA AACCAAACAG 12240 

TGAATACACA TGCCGGTAAC GTAACGACAT ACGCTGATAA ATTAGTTATT AAACGTAATG 12300 

GTAACGTTGT GACGACATTT ACACGTCGCA ATAATACGAG TCCATGGGTG AAAGAAGCAT 12360 

CTGCAGCAAC TGTAGCAGGT ATTGCTGGAA CTAATAATGG TATTACTGTT GCAGCAGGTA 12420 

CTTTCAACCC TGCTGATACA ATTCAAGTTG TTGCAACXSCA AGGAAGCGGA GAGACAGTGA 12480 

GTGATGAGCA ACGTAGTGAT GATTTCACAG TTGTCGCACC ACAACCGAAC CAAGCGACTA 12540 

CTAAGATTTG GCAAAATGGT CATATTGATA TCACGCCTAA TAATCCATCA GGACATTTAA 12600 

TTAATCCAAC TCAAGCAATG GATATTGCTT ACACTGAAAA AGTGG6TAAT GGTCCAGAAC 12660 

ATAGTAAGAC AATTAATGTT GTTCGTGGTC AAAATAATCA ATGGACAATT GCGAATAAGC 12720 

CTGACTATGT AACGTTAGAT GCACAAACTG GTAAAGTGAC GTTCAATGCC AATACTATAA 12780 

AACCAAATTC ATCAATCACA ATTACTCCGA AAGCAGGTAC AGGTCACTCA GTAAGTAGTA 12840 

ATCCAAGTAC ATTAACTGCA CCX^GCAGCTC ATACTGTCAA CACAACTGAA ATTGTGAAAG 12900 

ATTATGGTTC AAATGTAACA GCAGCTGAAA TTAACAATQC AGTTCaAGTT GCTAATAAAC 12960 

GTACTGCAAC GATTAAAAAT GGCACAGCAA TGCCTACTAA TTTAGCTGGT GGTAGCACAA 13020 

CGACGATTCC TGTGACAGTA ACTTACAATG ATGGTAGTAC TGAAGAAGTA CAAGAGTCCA 13080 

TTTTCACAAA AGCGGATAAA OGTGAGTTAA TCACAGCTAA AAATCATTTA GATGATCCAG 13140 

TAAGCACTGA AGGTAAAAAG CCAGGTACAA TTACGCAGTA CAATAATGCA ATGCATAATG 13200 

CGCAACAACA AATCAATACT GCGAAAACAG AAGCACAACA AGTGATTAAT AATGAGCGTG 13260 

CAAaCCACA ACAAGTTTCT GACGCACTAA CTAAAGTTCG TGCAGCACAA ACTAAGATTG 13320 

ATCAAGCTAA AGCATTACTT CAAAATAAAG AAGATAATAG CCAATTAGTA ACGTCTAAAA 13380 

ATAACTTACA AAGTTCTGTG AACCAAGTAC CATCAACTGC TGGTATGACG CAACAAAGTA 13440 

TTGATAACTA TAATGCGAAG AAGCGTGAAG CAGAAACTGA AATAACTGCA GCTCAACGTG 13500 

TTATTGACAA TGGCGATGCA ACTGCACAAC AAATTTCAGA TGAAAAACAT CGTGTCGATA 13560 

ACGCATTAAC AGCATTAAAC CAAGCGAAAC ATGATTTAAC TGCAGATACA CATGCCTTAG 13620 

AGCAAGCAGT GCAACAATTG AATCGCACAG GTACAACGAC TGGTAAGAAG CCGGCAAGTA 13680 

TTACTGCTTA CAATAATTCG ATTCGTGCAC TTCAAAGTGA CTTAACAAGT GCTAAAAATA 13740 

GCGCTAATGC TATTATTCAA AAGCCAATAA GAACAGTACA AGAAGTGCAA TCTGCGTTAA 13800 
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CTGU^TAATAG 

TAACTACTGA 

GTCAAACAGA 

AAATTGCCGC 

CTGGATTAAC 

TTGATCAGCC 

AACTTTCAGC 

ATGTTGCGAC 

AAGCACGTAA 

AACATA6TAT 

ACAATGCGAA 

GTTCACCGAC 

ATTTAGATCA 

CGCAATTAGA 

TAAATGCGTA 

TGTTGAATGG 

CTAAGGATCA 

CAACATTACA 

TTAATGCTGC 

ATACTGCXa^T 

AAAA3TACAC 

CTAAAGGTGT 

AAGCAGCATC 

CGAAAACAGA 

ATGCATTAAC 

AAACGACTCA 

ACCAAGTCGT 

ACAATGCATA 

CACCAAGTGA 



TGCTTTAAAA 
TGGTATGACA 
ATCAACAAAT 
AGAAAAAACA 
TCCAGACTTG 
AACGAGTACG 
AGCTAGAACT 
AATACGTCAA 
TGGCTTAACA 
TGACACGCAA 
GTTAACAGCT 
TGTAGAACAA 
TGCACGTCAA 
ACAAAGCATT 
CAACCAAAAA 
CAACCCAACT 
ATTAAATACA 
TGGTGCATCT 
TCAAAATcAT 
GACGAAATTA 
TGACGCAACA 
CATTGGAGAA 
TGTTAAATCG 
AGCAACAAAT 
ACAACAAGTG 
AAGCTTAAAT 
ACAAAGTGAT 
CAACCATGCG 
TGTTAACAAT 



ACTGCTAAGA 
CAATCATCAA 
GCACAAAATG 
AAAGTAGAAG 
GCACCATTAC 
ACTGGTATGA 
AAAATTCAAG 
AACGTGACAG 
GTCGATAAAG 
ACAAGTACAA 
GCACGTAATA 
ATTAATACAA 
GCTTTAACAC 
AATCAACCAA 
TTACAAGCAG 
GTCCAAAATA 
GCACGTCAAG 
AACTTAAACC 
GctGCGCTTG 
AAAGACAGTG 
CCAGCTAATA 
ACGACTAATC 
ACGAAAGATG 
GCGATTACGC 
AATAGTGCAC 
ACTGCTATGA 
AATTATGTCA 
AATGACATTA 
GCTTTATCAA 



CGAAACTTGA 
TCCAAGCATA 
TTATTAACAA 
AAAAATATAA 
AAACTGCAAA 
CAAGCGCATC 
AAATTGATCG 
CAGCGAATGC 
CX3CCTTTAGA 
CTGGTATGAC 
AGATTCAACA 
ATACGTCTAC 
CAGATAAAGC 
CGGATACAAC 
CGCGTCAAAA 
TCAATGATAA 
GXTTAACATT 
AAGCACAACA 
AAACAATTAA 
TTGCGGATAA 
AACAAGCGTA 
CAACGATGGA 
CTTTAGATGG 
ATGCAAGTGA 
AAAACGTGCA 
CAGGTTTAAA 
ACGCAGATAC 
TTAATGGTAA 
ATGTCACAAG 



TGAAGAAATC 
TGAAAATGCT 
TGGTGATGCG 
TAGCTTAAAA 
AACTCAGTTG 
TATTGCAGCA 
TGTATTAGCC 
CGCTAAATCA 
AAATGCGAAA 
ACAAGACTCT 
AATCAATCAA 
AGCAAATCAA 
GCCGCTTCAA 
AGGTATGACG 
GTTAACTGAA 
AGTGACAGAG 
AGATAGACAG 
AAATAATTTC 
GTCTAACATT 
TAATACAATT 
TGATAATGCA 
TGTTAACACA 
TCAACAAAAC 
TTTAAACCAA 
AGCAGTAAAT 
ACGTGGCGTT 
7AATAAGAAA 
TGCACAACAT 
TAAAGAACAT 



AATAAATCAG 
AAACGTGCGG 
ACTGACCAAC 
CAAGCAATTG 
CAAAATGATA 
TTTAATGAAA 
TCACATCCAG 
GCACTTGATC 
AATCAACTAC 
ATAAATGCAT 
GTATTAGCAG 
GCTAAATCTG 
ACTGCGAAAA 
ACCGCTTCGT 
ATTAATCAAG 
GCAAACCAAG 
CCAGCGTTAA 
ACGCAACAAA 
ACGGCTTTAA 
AAATCAGATC 
GTTAATGCGG 
GTGAACCAAA 
TTACAACGTG 
GCACAAAAGA 
GATATTAAAC 
GCTAATCATA 
AATGATTACA 
CCAGTTATAA 
GCATTGAATG 



13920 

13980 

14040 

14100 

14160 

14220 

14280 

14340 

14400 

14460 

14520 

14580

14640 

14700 

14760 

14820 

14880 

14940 

15000 

15060 

15120 

15180 

15240 

15300 

15360 

15420 

15480 

15540 

15600 
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ATTTAAATAA TGCACAACGT CAAAACTTAC AATCGCAAAT TAATGGTGCG CATCAAATTG 15720 

ATGCAGTTAA TACAATTAAG CAAAATGCAA CAAACTTGAA TAGTGCAATG GGTAACTTAA 15780 

GACAAGCTGT TGCAGATAAA GATCAAGTGA AACGTACAGA AGATTATGCG GATGCAGATA 15840 

CAGCTAAACA AAATGCATAT AACAGTGCAG TTTCAAGTGC CGAAACAATC ATTAATCAAA 15900 

CAACAAATCC AACGATGTCT GTTGATGATG TTAATCGTGC AACTTCAGCT GTTACTTCTA 15960 

ATAAAAATGC ATTAAATGGT TATGAAAAAT TAGCACAATC TAAAACAGAT GCTGCAAGAG 16020 

CAATTGATGC ATTACCACAT TTAAATAATG CACAAAAAGC AGATGITAAA TCTAAAATTA 16080 

ATGCTGCATC AAATATTGCT GGCGTAAATA CTGTTAAACA ACAAGGTACA GATTTAAATA 16140 

CAJcCGATGGg TAACTTGCAA GGTGCAATCA ATGATGAACA AACGACGCTT AATAGTCAAA 16200 

ACTATCAAGA TGCGACACCT AGTAAGAAAA CAGCATACAC AAATGCGGTA CAAGCTGCGA 16260 

AAGATATTTT AAATAAATCA AATGGTCAAA ATAAAACGAA AGATCAAGTT ACTGAAGCX3A 16320 

TGAATCAAGT GAATTCTGCT AAAAATAACT TAGATGGTAC GCGTTTATTA GATCAAGCGA 16380 

nCAAaCAGCA AAACAGCAGT TAAATAATAT GACGCATTTA ACAACTGCAC AAAAAACGAA 16440 

TTTAACAAAC CAAATTAATA GTGGTACTAC TGTCGCTGGT GTTCAAACGG TTCAATCAAA 16500 

TGCCAATACA TTAGATCAAG CCATGAATAC GTTAAGACAA AGTATTGCCA ACAAAGATGC 16560 

GACTAAAGCA AGTGAAGA7T ACGTAGATGC TAATAATGAT AAGCAAACAG CATATAACAA 16620 

CGCAGTAGCT GCTGCTGAAA CGATTATTAA TGCTAATAGT AATCCAGAAA TGAATCCAAG 16680 

TACGATTACA CAAAAAGCAG AGCAAGTGAA TAGTTCTAAA ACGGCACTTA ACGGTGATGA 16740 

AAACTTAGCT GCTGCAAAAC AAAATGCGAA AACGTACTTA AACACATTGA CAAGTATTAC 16800 

AGATGCTCAA AAGAACAATT TGATTAGTCA AATTACTAGT GCGACAAGAG TOAGTGGTGT 16860 

TGAT3^CTGTA AAACAAAATG CGCAACATCT AGACCAAGCT ATGGCTAGCT TACAGAATGG 16920 

TATTAACAAC GAATCTCAAG TGAAATCATC TGAGAAATAT CGTGATGCTG ATACAAATAA 16980 

ACAACAAGAG TATGATAATG CTATTACTGC AGCGAAAGCG ATTTTAAATA AATCGACAGG 17040 

TCCAAACACT GCGCAAAATG CAGTTGAAGC AGCATTACAA CGTGTTAATA ATGCGAAAGA 17100 

TGCATTGAAT GGTGATGCAA AATTAATTGC AGCTCAAAAC GCAGCGAAAC AACATTTAGG 17160 

TACTTTAACG CATATCACTA CAGCTCAACG TAATGATTTA ACAAATCAAA TTTCACAAGC 17220 

TACAAACTTA GCTGGTGTTG AATCTGTTAA ACAAAATGCG AATAGTTTAG ATGGTGCTAT 17280 

GGGTAACTTA CAAACGGCTA TCAACGATAA GTCAGGAACA TTAGCGAGCC AAAACTTCTT 17340 

GGATGCTGAT GAGCAAAAAC GTAATGCATA CAATCAAGCT GTATCAGCAG CCGAAACCAT 17400 
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TGTTAATAAT GCGAAACATG CATTAAATGG TACGCAAAAC TTAAACAATG CGAAACAAGC 17520 

AGCGATTACA GCAATCAATG GCGCATCTGA TTTAAATCAA AAACAAAAAG ATGCATTAAA 17580 

AGCACAAGCT AATGGTGCTC AACGCGTATC TAATGCACAA GATGTACAGC ACAATGCX3AC 17640

TGAACTGAAC ACGGCAATGG GCACATTAAA ACATGCCATC GCAGATAAGA CGAATACGTT 17700 

AGCAAGCAGT AAATATGTTA ATGCCGATAG CACTAAACAA AATGCTTACA CAACTAAAGT 17760 

TACCAATGCT GAACATATTA TTAGCGGTAC GCCAACGGTT GTTACGACAC CTTCAGAAGT 17820 

AACAGCTGCA GCTAATCAAG TAAACAGCGC GAAACAAGAA TTAAATGGTG ACGAAAGATT 17880 

ACGTGAAGCA AAACAAAACG CCAATACTGC TATTGATGCA TTAACACAAT TAAATACACC 17940 


TCAAAAAGCT AAATTAAAAG AACAAGTGGG ACAAGCCAAT AGATTAGAAG ACGTACAAAC 18000 

TGTTCAAACA AATGGACAAG CATTGAACAA TGCAATGAAA GGCTTAAGAG ATAGTATTGC 18060 

TAACGAAACA ACAGTCAAAA CAAGTCAAAA CTATACAGAC GCAAGTCCGA ATAACCAATC 18120 


AACATATAAT AGCGCTGTGT CAAATGCGAA AGGTATCATT AATCAAACTA ACAATCCGAC 18180 

TATGGATACT AGTGCGATTA CCCAAGCTAC AACACAAGTG AATAATGCTA AAAATGGTTT 18240

AAACGGTGCT GAAAACTTAA GAAATGCACA AAACACTGCT AAGCAAAACT TAAATACATT 18300

ATCACACTTA ACAAATAACC AAAAATCTGC CATCTCATCA CAAATTGATC GTGCAGGTCA 18360 

TGTGAGTGAG GTAACTGCTA CTAAAAATGC AGCAACTGAG TTGAATACGC AAATGGGTAA 18420 

CTTGGAACAA GCTATCCATG ATCAAAACAC AGTTAAACAA AGTGTTAAAT TTACTGATGC 18480

AGATAAAGCT AAACGTGATG CGTATACAAA TGCXK3TAAGC AGAGCTGAAG CAATTCTGAA 18540 

TAAAACGCAA GGTGCAAATA CGTCTAAACA AGATGTTGAA GCGGCTATTC AAAATGTTTC 18600 

AAGTGCTAAA AATGCATTGA ATGGTGATCA AAACGTTACA AATGCGAAGA ATGCAGCTAA 18660

AAATgCATTA AATAACTTAA CGTCAATTAA TAATGCACAA AAACGTGACT TAACAACTAA 18720 

AATTGATCAA GCAACAACTG TAGCTGGTGT TGAAGCTGTA TCTAATACGA GTACACAATT 18780 


GAAtACAGCG ATGGCTAACT TGCAAAATGG TATTAATGAT AAAACAAATA CACTAGCAAG 18840 

TGAAAACTAT CATGATGCTG ATTCAGATAA GAAAACTGCT TATACTCAAG CCGTTACGAA 18900 

CGCAGAAAAT ATTTTAAATA AAAATAGTGG ATCAAATTTA GACAAAACTG CCGTTGAAAA 18960 


CGCGTTGTCA CAAGTTGCTA ATGCGAAAGG TGCCCTAAAT GGTAACCATA ATTTAGAGCA 19020 

AGCTAAATCA AATGCAAACA CTACTATAAA CGGACTTCAA CATTTAACAA CTGCTCAAAA 19080 

AGATAAATTG AAACAACAAG TGCAACAAGC ACAAAATGTT GCAGGTGTAG ATACTGTTAA 19140

ATCAAGTGCC AACACATTAA ATGGTGCTAT GGGTACGTTA AGAAATAGCA TACAAGATAA 19200 
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TAACAATGCT 
TGCTAATGCA 
TACACATAAT 
CTTAAATAAA 
AAATGTAACA 
ACATGGTATT 
AAGTAAGAAA 
AACAGGTTCA 
GAAAGATGCA 
CTTAGGCACT 
TCAAGOGACG 
CGCAATGAAT 
TTATCTTGAT 
AGGCATTTTA 
AAATGCAGTT 
AACTTCAGCA 
CTTGAAGCAT 
AGGTAATACG 
GACGAAAACA 
TGCTGTAAAT 
TGCGATTAAT 
AAACTTAGCT 
CCAAAAACAA 
AAATAACGTT 
TATTGCTGAT 
ACXSTCAAGCG 
TGCGACATTA 
AGCATTAAAT 
CGGCTTAGCA 



GTTGATAGTG 
ATTAACCAAA 
TTAACGCAAG 
GCGCAAAAAG 
AGTATCCAAC 
GATGATGAAA 
ACTGCTTATG 
AATTCAGATA 
TTGAATGGTG 
TTAAACCATA 
ACTGTTGATG 
AGCTTACAAG 
GCGGAT6AAT 
AATAAACAAA 
ACAAGAGCGA 
ACAAATACGA 
CAAGTTGAaC 
TTAAATACTG 
AGTCAAAATT 
AATGCAAATG 
GGCATGGCAA 
CAAGCTAAAA 
AAAGATGCAT 
CAACACACTG 
AAAGAAAGAA 
TATGATTCAA 
ACAGTCAATG 
GGTGATAACA 
CAATTGAATA 



CTAATGGTGT 
TCGCTACACA 
CGAAACAAAC 
ATGCGTTAAA 
AAACTGCAAA 
ATGCAACAAA 
ATCAAGCTGT 
AAGCAGCAGT 
ATGCAAAACT 
TTACGAATGC 
GCGTTAATAC 
GTTCAATCAA 
CAAAACGAAA 
CTGGTGGTAA 
AAGcGgCTTT 
TTGATGGTTT 
AAGCGCAAAA 
CCATGGGTGC 
ATCTTGATGC 
GTGTTATTAA 
ATCAAGTCAA 
CAAATGCGAC 
TAAAAACACA 
CAACTGAATT 
CAAAAGCAAG 
AAGTGACTAA 
ACGTAAATAG 
ACTTACGTGT 
ATGCACAAAA 



CATTAATGCA 
AGTGACATCA 
AGCAACAAAT 
AGCACAAGTT 
TGAACTTAAT 
ACAAACTCAA 
AGCTGCTGCG 
TGACCGTGCA 
GGCAGAAGCG 
ACAACGTACT 
TGTAAAAACA 
TGATAAAGAT 
TGCATATACG 
CACATCTAAA 
AAATGGTGCT 
ACCTAACTTA 
TGTAGCAGGT 
ATTACGTACA 
ATCTGACAGC 
TGCAACGAAC 
TACAACAAAA 
GAACACAATT 
AGTTAACAAT 
GAACAGTGCG 
CGGTAATTAT 
CGCTGAAAAT 
TGCGGCATCA 
AGCGAAAGAG 
AGCAAAATTA 



ACAAGCAATC 
ACGAAAAATG 
GCCATCGATG 
ACAAGTGCGC 
ACAGCTATGG 
AAATATCGTG 
AAAGCAATTT 
TTACAACAAG 
AAAGCGGCAG 
GACTTAGAAG 
AATGCCAATA 
GCGACATTAA 
CAAGCTGTCA 
GCAGACGTTG 
QACAACTTAA 
ACACAATTAC 
GTAAATGGTG 
AGTATCCAAA 
AACAAAAATA 
AATCCAAATA 
GCAGCGTTAA 
AACAACGCAC 
GCACAACGTG 
ATGACAGCAC 
GTCAATGCTG 
ATCATTAGTG 
CAAGTCAATG 
CATGCCAACA 
AAAGAACAAG 



CAAATATGGA 
CATTAGATGG 
GTGCTACTAA 
AACGTGTTGC 
GTCAATTACA 
ACGcTGAACA 
TAAATAAACA 
TAACAAGTAC 
CTAAACAAAA 
GCCAAATCAA 
CATTAGACX3G 
GAAATCAAAA 
CAGCGGCTGA 
ATAATGCATT 
GAAATGCGAA 
AAAAAGACAA 
TTAAAGATAA 
ATGATAATAC 
ATTACAATAC 
TGGATGCTAA 
ATGGTGCACA 
ATGACTTAAA 
TATcTGATGC 
TTAAAGCAGC 
ATCAAGAAAA 
GTACACCGAA 
CGGCTAAAAC 
ATACAATTGA 
TTCAAAGTGC 



19320 
19380 
19440 
19500 
19560 
19620 
19680 
19740 
19800 
19860 
19920 
19980 
20040 
20100 
20160 
20220 
20280 
20340 
20400 
20460 
20520 
20580 
20640 
20700 
20760 
20820 
20880 
20940 
21000 
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GAAAGGCTTA AGAGATAGTA 
TGACGCAAGT CCAAATAATC 
CATTAATCAA ACATCGAACC 
AGTGACAACT AAA6AACAGG 
TGCGAAAAAC AACTTGAATA 
GCGTAgcATT GATGGTGCAA 
AGAATTAAAT AACGCAATGC 
ACAAACTCAG AAATACCTAG 
AAATGCAGCG AAAGCAATTT 
TGAACAAGCA TTGCAAAATG 
AAATGAAGCT AAAGCAGCTG 
ACAACGTACA GCGTTAGACA 
AGTTAAAGCC AAAGCGCAAC 
TGATAAAGAC ACGACGTTAC 
TGCTTATTCT CAAGCAGTAA 
TACACCTAAA GCAGATGTTG 
AAACGGTATT CAmAACTTAG 
TTCGGACTTA AATACAAAAC 
TGTATCTGCA GCAAATGGTG 
TTTAAAGCGT GCCATTGCTG 
TGAXaCGAAT AAACGTCAAG 
TGGTACACCA ACACCAACQT 
GAATGCTAAG ACGCAGTTAA 
CACTGCAATT GATGGTTTAA 
AGTGGGTCAA GCGACGACGT 
AAACACTGCA ATGAAAGGTC 
TCAAAACTAC ACAGATGCAA 
AGCAAAAGCA ATCATTGGTC 
AGC6AAAGAC CAAGTGACAG 



TTGCGAATGA AGCAACAATT AAAGCAGGTC AAAACTACAC 21120 

GTAACGAGTA CGACAGTGCA GTTACTGCAG CAAAAGCAAT 21180 

CAACGATGGA ACCAAATACT ATTACGCAAG TAACATCACA 21240 

CATTAAATGG TGCGCGAAAC TTAGCTCAAG CTAAGACAAC 21300 

ACTTAACATC AATTAACAAT GCACAAAAAG ATGCGTTAAC 21360 

CAACAGTAGC TGGTGTAAAT CAAGAAACTG CAAAAGCAAC 21420 

ATAGTTTACA AAATGGTATC AATGATGAGA CACAAACAAA 21480 

ATGCAGAGCC AAGTAAGAAA TCAGCTTATG ATCAAGCAGT 21540 

TAACAAAAGC TAGTGGTCAA AATGTAGACA AAGCAGCAGT 21600 

TGAACAGTAC GAAGACGGCG TTGAACGGTG ATGCXy\AATT 21660 

CGAAACAAAC GTTAGGTACA TTAACACACA TTAATAATGC 21720 

ATGAAATTAC ACAAGCAACA AATGTTGAAG 6TGTTAATAC 21780 

AATTAGATGG TGCTATGGGT CAATTAGAAA CATCAATTCG 21840 

AAAGTCAAAA TTATCAAGAT GCTGATGATG CTAAACGAAC 21900 

ATGCAGCAGC AACTATTTTA AATAAAACAg CTGGCGGTAA 21960 

AAAGAGCAAT GCAAGCTGTT ACACAAGCAA ATACTGcATT 22020 

ATCGTGCGAA ACArGCTGCT AACACAGCGA TTACAAATGC 22080 

mAAAAGAAGC ATTAAAAgCA CAAGTAACAA GTGCAGGACG 22140 

TTGAACATAC TGCGACTGAA TTAAATACTG CGATGACAGC 22200 

ATAAAGCTGA GACAAAAGCT AGTGGTAACT ATGTCAATGC 22260 

CATATGATGA AAAAGTTACA GCTGCC6AAA ATATCGTTAG 22320 

TAACACCAGC AGATGTrACA AATGCAGCAA CGCAAGTAAC 22380 

ACGGTAATCA TAATTTAGAA GTAGCGAAAC AAAATGCTAA 22440 

CTTCTTTAAA TGGTCCGCAA AAAGCAAAAC TTAAAGAACA 22500 

TGCCAAATGT TCAAACTGTT CGTGATAATG CACAAACATT 22560 

TACGAGATAG CATTGCGAAT GAAGCAACGA TTAAAGCAGG 22620 

GTCAAAACAA ACAAACTGAC TACAACAGTG CAGTCACTGC 22680 

AAACAACTAG TCCATCAATG AATGCGCAAG AAATTAATCA 22740 

CTAAACAACA AGCGTTAAAC GGTCAAGAAA ACTTAAGAAC 22800 
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AGATGCAGTG 

AAATAATGCG 

GAATACGATT 

TACAAATGCA 

AAAAGACGGT 

TAATCAAAAT 

AATTAATAAT 

AGGTGTAAAT 

AAATGGTATT 

AAAAGCTAAA 

AAAAGAGGCA 

TGAGCAAAAA 

TAATGCGACA 

GTACAATAAT 

AGATCCGACT 

TGGTAATGAA 

TAACTTAAAT 

TGATGAAGCA 

GAAACAAGCG 

TCAAGCAAAA 

AGCTAATGGC 

TGCAAAACAA 

ATTAATTAAT 

TCAAAATGCA 

CAATGCTATG 

TAACTTTGTC 

TGAAGCATTA 

GTTAAATAAA 

GAAACAAAAT 



AAACGTCAAA 

GATGCaTTAA 

AAGCAAGGTG 

GTGACGCAAG 

GTCGAAACTG 

GTTGCGAACG 

GCACAAAAAG 

CAAGTGTCTA 

AATGATGAAG 

CAACACGCAA 

TTAAAACAAT 

GCAAACAATG 

ACAAAACAAA 

GCTGTCACAA 

GTTATCAATC 

AACCTAGAGG 

AATGCGCAAA 

AATCAAATTA 

ATAGcTGACA 

CAACAAGCAT 

GGCAATGCAA 

GCATTAAATG 

AGCTCTAATG 

ACTACTGTAG 

ACACAATTAA 

AATGCAGATC 

ATTAGTGctA 

GTTACGCAAG 

GTTCAACATG 



TCGAAGGTGC 
ATACAGCTAT 
TTAACTTCAC 
CTGAACAAAT 
CGTTAGAaAA 
CTAAGACAAC 
AAGCATTGAA 
CAACGGCATC 
CAGCTACAAA 
ATACAGCAAT 
TGGTACAACA 
TTGATGCAGC 
ACCAAAATTA 
CTGCACAA6G 
AAGCTGCTGG 
CAGGGAAACA 
AACAAACAGT 
AGCAAAATGC 
AAGATGCTAC 
ATAACaCTGC 
CACAAGCTGA 
GTAATGCCAA 
ACCTTAACCA 
CTGGTGTAAA 
AACAAGGCAT 
CTGATAAGCA 
CGCCTGATGT 
CTAAAAATGA 
CTATTGATCA 



AACGCATGTT 

GACGAACTTG 

TGATGCCGAC 

TTTAAATAAA 

TGTACAACGT 

TGCGAAAAAT 

ATCACAAATT 

TGAATTAAAT 

AGCAGCGCTT 

TGACGGTTTA 

ATCXIACTACT 

AATGGACAAA 

TACTGATGCA 

TATTATTGAT 

ACAAGTAAGC 

ACAAGCGTCA 

TACTGATCAA 

GCAAAACTTA 

GAAAGCGACA 

TGTTACAAAT 

AGTTGAACAA 

CGTTCAACAT 

AGCACAAAAA 

CAATGTTAAA 

TGCAGATAAA 

AAATGCATAT 

TGTCGTTACA 

TTTAAATGGT 

ATTGCCAAAC 



AATGAAGTAA 

AAAAATGGTA 

GAAGCGAAAC 

GCACAAGGTC 

GCTAAAAACG 

GCATTGAATA 

GAAGGTGCGA 

ACAGCAATGA 

AATGGTACTC 

AGCCATTTAA 

GTTGCAGAAG 

TTACGTCAAA 

AGTCAGAATA 

CAAACTACAA 

ACAACTAAAA 

CAATCATTAG 

ATTAATGGCG 

AATACAGCX^A 

GTTAACTTCA 

GCTGAAAATA 

GCAATCAAAC 

GCAAAAGACG 

GACGCATTAA 

CAAACAGCAC 

GAACAAACAA 

AATCAAGCAG 

CCTAGCGAAA 

AATACAAACT 

TTAAACCAAG 



CACAAGCACA 

TTCAAGATCA 

GTAATGCATA 

CAAATACTTC 

AATTGAACGG 

ACCTAACATC 

CAACAGTTGC 

GCAACTTACA 

AAAACCTTGA 

CAAATGCACA 

CACAAGGTAA 

GTATTGCAGA 

AAAAGGATGC 

GTCCAACTTT 

ATGCATTAAA 

GTTCATTAGA 

CGCATACTGT 

TGGGTAACTT 

CTGATGCAGA 

TCATTTCAAA 

AAGTTAATGC 

AAGCAACAGC 

AACAACAAGT 

AAGAGTTAAA 

AAGCTGATGG 

TAGCGAAAGC 

TTACTGCAGC 

TAGCAACGGC 

CGCAACGTGA 



22920 
22980 
23040 
23100 
23160 
23220 
23280 
23340 
23400 
23460 
23520 
23580 
23640 
23700 
23760 
23820 
23880 
23940 
24000 
24060 
24120 
24180 
24240 
24300 
24360 
24420 
24480 
24540 
24600 
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AGCGGCGACA ACGCTTAATG ACGCGATGAC ACAATTGAAA CAAGGTATTG CGAATAAAGC 24720 

ACAAATTAAA GGTAGCGAGA ACTATCACGA TGCTGATACT GACAAGCAAA CAGCATATGA 24780 

TAATGCAGTA ACAAAAGCAG AAGAATTGTT AAAACAAACA ACAAATCCAA CAATGGATCC 24840

AAATACAATT CAACAAGCAT TAACTAAAGT 6AATGACACA AATCAAGCAC TTAACGGTAA 24900 

TCAAAAATTA GCTGATGCCA AACAAGATGC TAAGACAACA CTTGGTACAC TAGATCATTT 24960

AAATGATGCT CAAAAACAAG CGCTAACAAC TCAAGTTGAA CAAGCACCAG ATATTGCAAC 25020 

AGTTAATAAT GTTAAGCAAA ATGCTCAAAA TCTGAATAAT GCTATGACTA ACTTAAACAA 25080 

TGCATTACAA GATAAAACTG AGACATTAAA TAGCATTAAC TTTACTGATG CAGATCAAGC 25140 

TAAGAAAGAT GCTTATACTA ATGCGGTTTC ACATGCAGAA GGTATTTTAT CTAAAGCAAA 25200 

TGGCAGCAAT GCAAGTCAAA CTGAAGTGGA ACAAGCGATG CAACGTGTGA ACGAAGCGAA 25260 

ACAAGCATTG AATGGTAATG ACAATGTACA ACGTGCAAAA 6ATGCAGCGA AACAAGTGAT 25320 

TACAAATGCA AATGATTTAA ATCAAGCAAT GACACAATTG AAACAAGGTA TTGCAGATAA 25380 

AGACCAAACT AAAGCAAATG GTAACTTTGT CAATGCTGAT ACTGATAAGC AAAATGCTTA 25440 

CAACAATGCG GTAGCACATG CTGAACAAAT AATTAGTGGT ACACCAAATG CAAACGTGGA 25500 

TCCACAACAA GTGGCTCAAG CGTTACAACA AGTGAATCaA GCTAAGGGTG ATTTAAACGG 25560 

TAACCATAAC TTACAAGTTG CTAAAGACAA TGCAAATACA GCCATTGATC AGTTACCAAA 25620 

CTTAAATCAA CCACAAAAAA CAGCATTAAA AGACCAAGTG TCGCATGCAG AACTTGTTAC 25680 

AGGTGTTAAT GCTATTAAGC AAAATGCTGA TGCGTTAAAT AATGcAATGG GTACATTGAA 25740 

ACAACAAATT CAAGCGAACA GTCAAGTACC ACAGTCAGTT GACTTTACAC AAGCGGATCA 25800 

AGACAAACAA CAAGCATATA ACAATGCGGC TAACCAAGCG CAACAAATCG CAAATGGCAT 25860 

ACCAACACCT GTATTGACGC CTGATACAGT AACACAAOCA GTGACAACTA TGAATCAAGC 25920 

GAAAGATGCA TTAAACXjGTG ATGAAAAATT AGCACAAGCG AAACAAGAAG CTTTAGCAAA 25980 

TCTTGATACG TTACGCGATT TAAATCAACC ACAACGTGAT GCATTACGTA ACCAAATCAA 26040 

TCAAGCACAA GCGTTAGCTA CAGTTGAACA AACTAAACAA AATGCACAAA ATGTGAATAC 26100 

aGCaATGAGT AACTTGAAAC aAGGTATTGC aAACAAAGAT ACTGTCAAAG CAAGTGAGAA 26160 

CTATCATGAT GCTGATGCCG ATAAGCAAAC AGCATATACA AATGCAGTGT CTCAAGCGGA 26220 

AGGTATTATC AATCAAACGA CAAATCCAAC GCTTAACCCA GATGAAATAA CACGTGCATT 26280 

AACTCAAGTG ACTGATGCTA AAAATGGCTT AAACGGTGAA GCTAAATTGG CAACTGAAAA 26340 

GCAAAATGCT AAAGATGCCG TAAGTGGGAT GACGCATTTA AACGATGCTC AAAAACAAGC 26400 
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AGCAACGAGC CTAGATCAAG CAATGGATCA ATTATCACAA GCTATTAATG ATAAAGCTCA 26520 

AACATTAGCG GACGGTAATT ACTTAAATGC AGATCCTGAC AAACAAAATG CGTATAAACA 26580

GGCAGTAGCA AAAGCTGAAG CATTATTGAA TAAACAAAGT GGTACTAATG AAGTACAAGC 26640 

ACAAGTTGAA AGCATCACTA ATGAAGTGAA CGCAGCGAAA CAAGCATTAA ATGGTAATGA 26700 

CAATTTGGCA AATGCAAAAC AACAAGCAAA ACAACAATTG GCGAACTTAA CACACTTAAA 26760 

TGAK3CACAA AAACAATCAT TTGAAAGTCA AATTACACAA GCGCCACTTG TTACAGATGT 26820 

CACTACGATT AATCAAAAAG CACAAACGTT AGATCATGCG ATGGAATTAT TAAGAAATAG 26880 

TGTTGCGGAT AATCAAACGA CATTAGCGTC TGAAGATTAT CATGATGCAA CTGCGCAAAG 26940 

ACAAAATGAC TATAACCAAG CTGTAACAGC TGCTAATAAT ATAATTAATC AAACTACATC 27000 

GCCTACGATG AATCCAGATG ATGTTAATGG TGCAACX3ACA CAAGTGAATA ATACGAAAGT 27060 

TGCATTAGAT GGTGATGAAA ACCTTGCAGC AGCTAAACAA CAAGCAAACA ACAGACTTGA 27120 

TCAATTAGAT CATTTGAATA ATGCGCAAAA GCAACAGTTA CAATCACAAA TTACGCAATC 27180 

ATCTGATATT GCTGCAGTTA ATGGTCACAA ACAAACAGCA GAATCTTTAA ATACTGCGAT 27240 

GGGTAACTTA ATTAATGCGA TTGCAGATCA TCAAGCCGTT GAACAACGTG GTAACTTCAT 27300 

CAATGCTGAT ACTGATAAAC AAACTGCTTA TAATACAGCG GTAAATGAAG CAGCAGCAAT 27360 

GATTAACAAA CAAACTGGTC AAAATGCGAA CCAAACAGAA GTAGAACAAG CTATTACTAA 27420 

AGTTCAAACA ACACTTCAAG CGTTAAATGG AGACCATAAT TTACAAGTTG CTAAAACAAA 27480 

TGCGACGCAA GCAATT6ATG CTTTAACAAG CTTAAATGAT CCTCAAAAAA CAGCATTAAA 27540 

AGACCAAGTT ACAGCTGCAA CTTTAGTAAC TGCAGTTCAT CAAATTGAAC AAAATGCGAA 27600 

TACGCTTAAC CAAGCAATGC ATGGTTTAAG ACAGAGCATT CAAGATAACG CAGCAACTAA 27660 

AGCAAATAGC AAATATATCA ACGAAGATCA ACCAGAGCAA CAAAACTATG ATCAAGCTGT 27720 

TCAAGCOGCA AATAATATTA TCAATGAACA AACTGCAACA TTAGATAATA ATGCGATTAA 27780 

TCAAGCAGCG ACAACTGTGA ATACAACGAA AGCAGCATTA CATGGTGATG TGAAGTTACA 27840 

AAATGATAAA GATCATGCTA AGCAAACGGT TAGTCAATTA GCACATCTAA ACAATGCACA 27900 

AAAACATATG GAAGATACX3T TAATTGATAG TX3AAACAACT AGAACAGCAG TTAAGCAAGA 27960 

TTTGACTGAA GCACAAGCAT TAGATCAACT TATGGATGCA TTACAACAAA GTATTGCTGA 28020 

CAAAGATGCA ACACGTGCGA GCAGTGCATA TGTCAATGCA GAACCGAATA AAAAACAATC 28080 

CTATGATGAA GCAGTTCAAA ATGCTGAGTC TATCATTGCA GGATTAAATA ATCCAACTAT 28140 

CAATAAAGGT AATGTATCAA GTGCGACTCA AGCAGTAATA. TCATCTAAAA ATGCATTAGA 28200 
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TCAATTAACA CCAGCTCAAC AACAAGCGCT AGAAAATCAA ATTAATAATG CAACAACTCG 28320 

TGATAAAGTG GCTGAAATCA TTGCACAAGC GCAAgCATtA AATGAAGCGA TGAAAGCATT 28380 

AAAAGAAAGT ATTAAGGATC AACCACAAAC TGAAGCAAGT AGTAAATTTA TTAACGAGGA 28440 

TCAAGCGCAA AAAGATGCTT ATACGCAAGC AGTACAACAC GCX3AAAGATT TGATTAACAA 28500 

AACAACTGAT CCTACATTAG CTAAATCAAT CATTGATCAA GCGACACAGG CAGTGACAGA 28560 

TGCTAAAAAC AATTTACATG GTGATCAAAA ACTAGCTCAA GATAAGCAAC GTGCAACAGA 28620 

AACGTTAAAT AACTTGTCTA ACTTGAATAC ACCACAACGT CAAGCACTTG AAAATCAAAT 28680 

TAATAATGCA GCAACTCGTG GCGAAGTAGC ACAAAAATTA ACTGAAGCAC AAGCACTTAA 28740 

CCAAGCAATG GAAGCTTTAC GTAATAGCAT TCAAGATCAA CAGCAAACGG AAGCGGGTAG 28800 

CAAGTTTATC AATGAAGATA AaCCaCmAAA AGrTGCTTAC CAAGCAGCA6 TTCAAAATGC 28860 

AAAAGATTTA ATTAATCAAA CTAACAATCC AACGCTTGAT AAAGCACAAG TTGAACAATT 28920 

GACACAAGCT GTTAACCAAG CTAAAGATAA CCTACACQQT QATCAAAAAC TTGCAQACGA 28980 

TAAACAACAT GCGGTTACTG ATTTAAATCA ATTAAATGGT TTGAATAATC CJGCAACGTCA 29040 

AGCACTTGAA AGCCAAATAA ACAACGCAGC AACTCGTGGC GAAGTAGCAC AAAAATTAGC 29100 

TGAAGCAAAA GCGCTTGATC AAGCAAT6CA AGGATTACGT AATAGTATTC AAGATCAACA 29160

ACAAACAGAA TCTGGTAGCA AGTTTATCAA TGAAGATAAA CCGCAAAAAG ATGCTTACCA 29220 

AGCAGCAGTT CAAAATGCAA AAGATTTAAT TAACCAAACA GGTAATCCAA CACTCGACAA 29280 

ATCACAAGTA GAACAATTGA CACAAGCAGT AACAACTGCA AAAGATAATC TACATGGTGA 29340 

TCAAAAACTT GCTCGTGATC AACAACAAGC AGTAACAACT GTAAATGCAT TGCCAAACTT 29400 

AAATCATGCA CAACAACAAG CATTAACTGA TGCTATAAAT GCAGCGCCTA CAAGAACAGA 29460 

GGTT8CACAA CATGTTCAAA CTGCTACTGA ACTTGATCAC GCGATGGAAA CATTGAAAAA 29520 

TAAAGTTGAT CAAGTGAATA CAGATAAGGC TCAACCAAAT TACACTGAAG GGTCAACTGA 29580

TAAAAAAGAA GCAGTAGATC AAGCGTTACA AGCTGCAGAA AGCATTACAG ATCCAACTAA 29640 

TGGTTCAAAT GCGAATAAAG ACGCTGTAGA CCAAGTATTA ACTAAGCTTC AAGAAAAAGA 29700 

AAATGAGTTA AATGGTAATG AGAGAGTCGC TGAAGCTAAA ACACAAGCGA AACAAACTAT 29760 

TGACCAATTA ACACATTTAA ATGCTGATCA AATTGCAACT GCTAAACAAA ACATTGATCA 29820 

AGCGACGAAA CTTCAACCAA TTGCTGAATT AGTAGATCAA GCAACGCAAT TGAATCAATC 29880 

TATGGATCAA TTACAACAAG CAGTTAATGA ACATGCTAAC GTTGAGCAAA CTGTAGATTA 29940 

CACACAAGCA GATTCAGATA AACAAAATGC TTATAAACAA GCTATTGCTG ATGCTGAAAA 30000 
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TGCAAAACAA GCATTAAATG GTGATGAACG TGTAGCACTT 
TGACATCGAC CAATTGAATG CATTAAACAA TGCTCAACAA 
CGATCAATCA AACGATTTAA ATCAAATCCA ACAAATTGTA 
TCGTGCAATG GATCAATTGT CACAAGAAAT CACTGACAAT 
CACGAACTAT GTCAATGCAG ATACACAAGT CAAACAAGTA 
AGCGAAACAA GCACTTGATA AATCGACTGG TCAAAACTTA 
ATTAAATGAT GCAGTCACTG CAGCTAAGAA AGCATTAAAT 
TCGTAAAGCT GAAGCATTAC AAAGATTGGA TCAATTAACA 
ACAATTAGCA ATCCAACAAA TTAATAATGC TGAAACGCTA 
TAATAGAGCA ACTAAATTAG ATAATGCAAT GGGTTCAGTA 
GCACCTTGGT GTTATCAGCA GCACAAATTA CATCAATGCA 
TTATGATAAT GCAATTGCGA ATGCAGCACA TGAGTTAGAT 
TGCaAAAGCT GAAGCAGAGC AATTGAAACA AAATATTATC 
TGGAGACCAA AACCTTGCAA ATGCCAAAGA TAAAGCAAAT 
TGGATTAAAT CAACAGCAAC AAGATCTTGC ACATAAAGCA 
ATCAGATGTA ACAGATATTG TTAATAATCA AATTGACTTA 
GAAACATTTA GTTGACAATG AAATTCCAAA TGCAGAGCAA 
TGACGATAAT GCTAAA 

(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2243 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



GCTAAAACAA ATGGTAAACA 30120 

GATGGATTTA AAGGTCGCAT 30180 

GATGAGGCTA AGGCACTTAA 30240 

GAAGGACGCA CGAAAGGTAG 30300 

TATGATGAAA CGGTTGATAA 30360 

ACTGCAAAAC AAGTTATCAA 30420 

GGTGAAGAAA GACTTAATAA 30480 

CATCTAAACA ATGCTCAAAG 30540

AATAAAGCAT CTCGAGCAAT 30600 

CAACAATATA TTGACGAACA 30660 

GATGACAATT TGAAAGCAAA 30720 

AAAGT6CAAG GTAATGCAAT 30780 

GATGCTCAAA ATGCATTAAA 30840 

GCGTTTGTTA ATTCGTTAAA 30900 

ATTAACAATG CCGATACTGT 30960 

AATGATGCAA TGGAAACATT 31020 

ACTGTCAATT ACCAAAACGC 31080 

31096 









(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

ATGACAGAAT GGGAGCGAGG ACTTAGAATG TTTCCTAAAT CAGGTTTATT AAATTTTGAG 60 

TTAGCGATAG mAAATCGTTC ATTAAATGAT GATGAAAAAG CATTAAAATA TGTGCGTAAA 120 

GCATTAAATG CAGACCCTAA AAATACAGAT TATATTAACT TAGAAAAAGA GTTGACTAAA 180 

TCAAATGAGT CGAAAAATAA ATAACTTTTA TGATGTACAA CAGTTATTGA AAAGTTACGG 240 

ATTTCTAATA TATTTTAAAA ATCCAGAAGA TATGTACGAA ATGATTCAAC AGGAGATTTC 300 
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TAATCAC5AGA AGGAATGAAC AGAAATGACA AAAATTATTT TAGCAGCTGA TGTAGGCGGG 420 

ACGACTTGTA AATTAGGTAT TTTCACACCT GAATTAGAAC AATTACATAA ATGGTCTATT 480 

CACACTGATA CATCTGATAG TACAGGATAT ACACTTTTGA AAGGAATTTA TGATTCGTTT 540

GTTGAAAAAG TAAATGAAAA TAATTATAAT TTTTCAAATG TACTTGGCGT AGGTATTGGT 600 

GTACCAGGTC CTGTTGACTT TGAAAAAGGT ACAGTAAATG GAGCAGTAAA CTTATATTGG 660 


CCAGAAAAAG TTAATGTACG TGAGATTTTT GAACAATTCG TTGATTGTCC AGTGTATGTA 720 

GATAATGATG CTAACATAGC TGCTTTAGGG GaGAAACACA AAGGTGCTGG TGAAGGTGCC 780 

GATGATGTTG TTGCCATCAC ACTTGGTACA GGTCTAGGTG GAGOAATTAT TTCCAAATGG 840 

TGAAATCGTA CATGGTCATA ATGGCTCtGG CGCAGAAATA GGTCATTTTA GAgCAGACTT 900 

CgATCAACXSA TTTaAATOTA ATTGTGGTCG TTCTOGATGT ATTGAAACAG TTGCTTCaGC 960 

GACAOGCOTT GTTAACTTAG TTAACTTCtA CTATCCGAAG TTGACGTTTA GATCTTCTAT 1020

ATTAGAATTG ATTAAAGAAA ATAAGGTtAC aGCAAAAGCT GTmTGATG CGGCAAAAGC 1080 

TGGTGACCAA TTCTGTATTT TCATTACTGA AAAGGTTGCA AACTATATT6 GATATTTATG 1140 

TAGTATTATT AGTGTTACAA GTAATCCGAA ATATATCGTT CTAGGTG6AG GAATGTCTAC 1200

TGCAGGACCT ATTTTAATTG AAAATATTAA AACAGAATAT CATAATTTAA CATTTGCACC 1260 

TGCTCAATTT GAAACTGAAA TTGTACAAGC GAAATTAGGT AATGATGCAG GTATTACAGG 1320 


AGCAGCAGGA TTAATCAAGA CCTATGTATT AGATAAAGAG GGGGTAAAAT AATGGCTATT 1380 

GTTGATGTGG TTGTTATTCC AGTTGGAACG GAAGGTCCGA GTGTTAGTAA ATATATTGCA 1440 

GATATTCAGA AAAAACTTCA AGAATATAAA GCAATGGGTA AAATTGATTT TCAATTAACA 1500 


CCAATGAATA CTCTAATTGA AGGTGAATTA AGCGATGTAT TAGAAGTTGT GCAAGTGATA 1560 

CATGMTTAC CTTTTGATAA AGGTTTAAGT AGAGTTTGTA CAAATATCCG TATTGATGAC 1620 

CGACGAGACA AATCTAGAAA AATGAATGAT AAACTAACAT CAGTACAAAA ACATTTAGAA 1680


AATAGTGGTG AAAACCTATG AGGATTTCAA GCTTAACTTT AGGCTTAGTT GATACTAATA 1740 

CGTATTTCAT CGAAAATGAC AAAGCTGTTA TTCTGATTGA CCCTTCAGGT GAAACTGAAA 1800 

AAATTATTAA AAAATTAAAC CAAATAAATA AACCGTTAAA AGCTATTTTA TTAACACATG 1860

CACACTTTGA TCATATCGGA GCAGTCGATG ATATAGTTGA TCGATTCGAT GTCCCGGTTT 1920 

ATATGCATCA AGCAGAGTTT GATTTTCTAA AAGATCCCGT TAAAAATGGG GCAGATAAAT 1980 

TTAAGCAATA TGGATTACCA ATTATTACAA GTAAGGTAAC TCCTGAAAAG TTAAmCGAAG 2040 

GTAGCACAGA AATAGAAGGA TTTAAGTTliT nAyrTGTaCA CACACCTGGA CATTCACCAG 2100 
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TO 



GAATCGGACG TACAGATTTA TATAAAGGTG ATTATGAAAC GCTAGTTGAT TCTATTCAAG 2220 
ATAAAATATT TGAATTAGAA GGC 2243 
(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8009 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 









(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

TTGGnATCAT tyAcgGTAAA AAGAATAAaG CAAGATTtAT TTCATTAGTA CTAATTTGTG 60 

CAATGTTTGC AATTTGTTGG GTTGCATATA TTCAATGGGA GTCTACAATC GCTTCATTTA 120 

CACAATCTAT TAATATTTCa ATGGCACAAT ATAGTGTTTT ATGGACAATT AACGGAATAA 180 

TGATTTTAGT AGCACAACCA TTAATTAAAC CGATTCTCTA TCTGTTAAAA GGAAACTTAA 240 

AGAAGCAAAT GTTTGTCGGC ATCATCATTT TTATGTTGTC GTTCTTTGTC ACGAGTTTTG 300 

CCGAAAACTT TACAATATTT GTTGTCGGTA TGATTATTTT AACTTTTGGA GAAATGTTTG 360

TATGGCCAGC AGTTCCAACT ATAGCCAATC AGTTAGCGCC A6ATGGTAAG CAAGGACAGT 420 

ACCAAGGTTT TGTGAATTCA GCTGCTACAG TAGGAAAAGC ATTTGGTCCA TTTCTTGGTG 480 

GTGTATTAGT TGATGCGTTT AATATGCGCA TGATGTTTAT CGGTATGATG CTACTACTTG 540

TATTTGCATT AATATTATTA ATGGTTTTCA AGGAGAATAA TACGCAACCT AAAAAAATAG 600 

ATGCATAATG AGTAAATAGA ATTAACGTTA TAGACTTGAA ATAAATGTCG TTATAACATA 660 

ATATTAATTT GTATAATTTA ATTTCGTTTG GAGCTTTTCT ACAGAAAGCT AGTGATGCTG 720 

AGAGCTAGTG TTAAGGACTA AATGTAAATC GTATTAATTT TAAATTGAAT GAATGACATC 780 

TCTTACTATT AAAATGAGTG CACAATTTTT GTGAAATAGG GTGGTAACGC GGCAAATGTC 840 

GTCCCTATGT AAATAGAATA GTTAGAGGTG TCnTITTAT TGAATAGGAG GAAATGTGTT 900 

GAATTACAAC CACAATCAAA TTGAAAAGAA ATGGcAAGAC TATTGGGACG AAAATAAAAC 960 

ATTTAAAACA AATGATAACT TAGGTCAAAA GAAATTTTAT GCTTTAGACA TGTTTCCATA 1020 

TCCATCAGGT GCTGGTTTAC ATGTTGGACA TCCTGAGGGc TATACAGCAA CAGATATCAT 1080 

TTCAAGATAT AAAAGAATGC AAGGATATAA TGTATTACAT CCGATGGGGT GGGATGCATT 1140 

CGGATTACCA GCAGAGCAAT ATGCTTTAGA CACTGGCAAC GACCCACGTG AATTTACAAA 1200

GAAAAATATC CAAACTTTTA AACGACAAAT TAAAGAATTA GGGTTCAGTT ATGATTGGGA 1260 
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GTTATATAAC AAAGGTTTAG CATACGTTGA 
AGGCACTGTT TTATCTAACG AAGAAGTGAT 
AGTTTATCGT AAGCCGATGA AACAATGGGT

ATTAflCAGAT TTAGATGATT TAGATTGGCC 
GATTGGACGT TCTGAAGGGG CCAAAGTTTC 


AGAAGTATTT ACGACTAGAC CAGATACAAT 
TGAACATGCA TTAGTTAATT CAATTACAAC 
TCAAACAGAA GCTTCTAAAA AGTCAGATTT 


AGGTGTATTT ACTGGTGCAT ATGCAACTAA 
GATTGCTGAT TATGTATTAT CAACATATGG 

TGATGACAGA GATTATGAAT TTGCTAAAAA
AGGTGGAAAT GTTGAAGAAG CAGCATACAC 
ACTT6ATGGT TTAGAAAATG AAGCGGCAAT 

25 AGGTGCTGGC GAAAAGAAAG TTAATTACAA 
TTATTGGGGC GAACCAATTC CTGTCATTCA 
TGAAGAAGAG CTACCATTGT TGTTACCTGA 

^ TGAGTCTCCA CTAGCTAATA TTGATTCATT 
GAAAGGACGT CGTGAAACAA ATACAATGCC 

ACGTTACATC GATCCTAAAA ATGAAAATAT 

35 

GTTACCTGTT GATTTATATA TGGGTGGAGT 

AAGATTTTGG CATAAAGTCC TTTATGATTT 

AAAATTATTT AACCAAGGTA TGATTTTAGG 

40 

AGGAAATGTA ATCAATCCTG ATGATATAGT 
TTACGAAATG TTTATGGGAC CTTTAGATGC 

^ TGGGTCTCGT CGATTCTTAG ATCGCGTATG 
GAGTTCAAAA ATTGTAACTA CAAATAATAA 
TAAAAAGGTA ACAGAAGACT TTGAAACATT 

SO GGTATTTATT AATGAGTGTT ATAAAGTTGA 
CGTTAAAATG TTAGCACCTA TTGCACCACA 

55 



TGAAGTTGCA GTTAACTGGT GTCCAGCATT 1380 

TGATGGTGTC TCTGAACGTG GTGGACATCC 1440 

ACTTAAAATC ACAGAATATG CAGATCAATT 1500 

TGAGTCTTTA AAAGATATGC AGCGCAATTG 1560 

ATTTGATGTA GATAATACGG AAGGAAAAGT 1620 

CTATGGTGCA TCATTCTTAG TCTTAAGTCC 1680 

AGATGAATAT AAAGAAAAAG TAAAAGCTTA 1740 

AGAACGTACA GATTTAGCAA AAGATAAATC 1800 

TCCTTTATCT GGTGAAAAAG TACAAATTTG 1860 

TACTGGAGCA ATTATGGCAG TACCAGCGCA 1920 

GTTTGATTTG CCAATCATTG AAGTCATCGA 1980 

TGGTGAAGGT AAACATATTA ATTCTGGTGA 2040 

TACTAAAGCT ATTCAATTAT TAGAGCAAAA 2100 

ATTAAGAGAT TGGTTATTCA GTCGTCAGOG 2160 

TTGGGAAGAT GGAACAATGA CAACTGTTCC 2220 

AACAGATGAA ATCAAGCCAT CAGGGACTGG 2280 

TGTAAATGTT GTAGATGAAA AAACAGGTAT 2340 

ACAATGGGCA GGTAGTTGTT GGTATTATTT 2400 

GTTAGCAGAT CCTGAAAAAT TAAAACATTG 2460 

AGAACATGCG GTTCTTCACT TATTATATGC 2520 

GGCTATCGTA CCTACTAAAG AACCTTTCCA 2580 

AGAAGGTAAT GAGAAGATGA GTAAATCTAA 2640 

ACAGTCTCAT GGTGCAGATA CTTTGCGTCT 2700 

TGCAATTGCA TGGAGTGAAA AAGGATTAGA 2760 

GCGTTTAATG GTAAATGAAG ATGGGACATT 2820 

ATCTTTAGAT AAAGTTTATA ACCAAACTGT 2 380 

AGGATTTAAT ACTGCTATTA GTCAATTAAT 2940 

TGAAGTTTAT AAACCTTACA TTGAAGGCTT 3000 

TATCX^GTGAA GAATTATGGT CAAAATTAGG 3060 
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TGATGAAGTA GAAATCGTT6 TTCAAGTGAA TGGTAAATTG AGAGCTAAAA TTAAAATTGC 3180 

TAAAGATACA TCAAAAGAAG AAATGCAAGA AATTGCCTTA TCTAATGACA ATGTTAAAGC 3240 

GAGTATTGAA GGTAAAGACA TCATGAAAGT CATCGCTGTT CCTCAAAAAT TAGTCAATAT 3300 

TGTAGCTAAA TAATGTTTTA AGGAGGACTT TGAAATGAAG TCAATTACTA CAGATGAATT 3360 

AAAAAATAAA CTTTTAGAAT CTAAACCAGT TCAAATTGTT GATGTTCGTA CTGATGAAGA 3420 

AACAGCAATG GGATATATTC CTAATGCAAA GTTAATTCCA ATGGATACCA TTCCGGATAA 3480 

TTTAAATTCA TTTAATAAAA ATGAAATATA TTATATTGTA TGTGCTGGTG GAGTTCGAAG 3540 

CGCTAAAGTT GTAGAATATT TAGAGGCAAA TGGCATTGAT GCCGTAAATG TCGAAGGCGG 3600 

CATGCACGCA TGGGGCGATG AAGGTTTGGA AATAAAAAGT ATTTAAAGTA GTGACATAAT 3660 

TTAAAATAAT ATTACATTTG TAATGACACC AAGTAACGTT TCGGTTGCTT GGTCTTTTTT 3720 

GGTATGAATT ACTTTCTGTT ACAAAACAAT CTAAAGCX5TT CTTGTTATGT TTTATTAAGA 3780 

TTTTAATTAC AAAACGGAAA CTAAATTGTA ATAAAATAAA ACTTTATTTT ATAAAATGAT 3840 

GATGATAAAA TTGAGTGAAC TTAAAATATT GTACAAAATA ATATAGCTAT AAATATAATA 3900 

TAGCTATAAA TATAATATGA GGGAGCGTAT ATTTTTAGCA TAATTCTTAA CAACACAGCA 3960 

GAGAACAGAC AACCAGGAGG AAAATGAAAT GAATTTGTTA AAGAAAAATA AATATAGTAT 4020 

TAGGAAGTAT AAAGTAGGCA TATTCTCTAC TTTAATCGGA ACAGTTTTAT TACTTTCAAA 4080 

CCCAAATGGT GCACAAGCCT TAACTACGGA TAATAATGTA CAAAGC6ATA CTAATCAAGC 4140 

AACACCTGTA AATTCACAAG ATAAAGATGT TGCTAATAAT AGAGGTTTAG CAAATAGTGC 4200 

GCAGAATACA CCTAATCAAT CTGCAACAAC CAATCAAGCA ACGAATCAAG CATTGGTTAA 4260 

TCATAATAAT GGTAGTATAG TAAATCAAGC TACGCCAACA TCAGTGCAAT CAAGTACGCC 4320 

TTCAGCACAA AACAATAATC ATACAGA7GG CAATACAACA GCAACTGAGA CAGTGTCAAA 4380 

CGCTAATAAT AATGATGTAG TGTCGAATAA TACCGCATTA AATGTACCAA CTAAAACAAA 4440 

TGAAAATGGT TCAGGAGGAC ATCTAACTTT AAAGGAAATT CAAGAAGATG TTCGTCATTC 4500 

TTCAAATAAA CCAGAGCTAG TTGCAATTGC TGAACCAGCA TCTAATAGAC CGAAAAAGAG 4560 

AAGTAGACGT GCGGCACCGG CAGATCCTAA TGCAACTCCA GCAGATCCAG CGGCTGCAGC 4620 

GGTAGGAAAC GGTGGTGCAC CAGTTGCAAT TACAGCGCCA TATACGCCAA CAACTGATCC 4680 

TAATGCCAAT AATGCAGGAC AAAATGCACC TAACGAAGTG CTGTCATTTG ATGACAATGG 4740 

TATTAGACCA AGTACCAACC GTTCTGTGCC AACAGTAAAC GTTGTTAATA ACTTGCCGGG 4800 

CTTCACACTA ATCAATGGTG GCAAAGTAGG GGTGTTTAGT CATGCAATGG TAAGAACGAG 4860 




TCGTATACAT GGAACTGATA CGAATGACCA TGGCGATTTT AATGGTATCG AGAAAGCATT 4980 

AACAGTAAAT CCGAATTCTG AATTAATCTT TGAATTTAAT ACAAT6ACTA CTAAAAACX3G 5040 

TCAAGGCGCA ACAAATGTTA TTATCAAAAA TGCTGATACT AATGATACGA TTGCTGAAAA 5100 

GACTGTTGAA GGCGGTCCAA CTTTGCGTTT ATTTAAAGTA CCTGATAATG TGAGAAATCT 5160 

CAAAATTCAA TTTGTACCTA AAAATGACGC AATAACAGAT GCGCGTGGCA TTTATCAACT 5220 

AAAAGATGGT TACAAATACT ATAGCTTTGT TGACTCTATC GGACTTCATT CTGGGTCACA 5280 

TGTTTTTGTT GAAAGACGAA CAATGGATCC AACAGCAACA AATAATAAAG AGTTTACTGT 5340 

AACAACATCA TTAAAGAATA ATGGTAATTC TGGTGCTTCT CTAGATACAA ATGACTTTGT 5400 

ATATCAAGTT CAATTACCTG AAGGTGTTGA ATATGTGAAC AATTCATTGA CTAAAGATTT 5460 

TCCAAGTAAC AATTCAGGCG TTGATGTTAA TGATATGAAT GTTACATATG A7GCAGCAAA 5520 

TCGTGTGATA ACAATTAAAA GTACTGGAGG AGGTACAGCA AACTCTCCGG CAC6ACITAT 5580 

GCCTGATAAA ATACTCGATT TAAGATATAA ATTACGTGTA AATAATGTGC CGACACCAAG 5640 

AACAGTAACA TTTAACGAGA CATTAACGTA TAAAACATAT ACACAAGATT TCATTAATTC 5700 

AGCTGCAGAA AGTCATACTG TAAGTACAAA TCCATATACT ATCGATATCA TCATGAATAA 5760 

AGATGCATTA CAAGCCGAAG TTGACAGACG TATTCAACAA GCTGATTATA CATTTGOGTC 5820 

ATTAGATATC TTTAATGGTC TGAAACGACG CGCACAAACG ATTTTAGATG AAAATCGTAA 5880 

CAATGTACCA TTAAATAAAA GAGTTTCTCA AGCATATATT GATTCATTAA CTAATCAAAT 5940 

GCAACATACG TTAATTCGAA GTGTTGATGC TGAAAATGCA GTTAATAAAA AAGTTGACCA 6000 

AATGGAAGAT TTAGTTAATC AAAATGATGA ATTGACAGAT GAAGAAAAAC AAGCAGCAAT 6060 

ACAAGTTATC GAGGAACATA AAAATGAAAT AATTGGTAAT ArTGGTGACC AAACGACTGA 6120 

TGATGGCGTT ACTAGAATCA AAGATCAAGG TATACAGACC TTAAGTGGGG ATACTGCAAC 6180 

ACOGGTTGTT AAACCAAATG CTAAAAAAGC AATACX3TGAT AAAGCAACGA AACAAAGOGA 6240 

AATTATCAAT GCAACACCAG ATGCTACTGA AGACGAGATT CAAGATGCAC TAAATCAATT 6300 

AGCTACGGAT GAAACAGATG CTATTGATAA TGTTACX3AAT GCTACTACAA ATGCTGACGT 6360 

TGAAACAGCT AAAAATAATG GCATCAATAC TATTGGAGCA GTTGTTCCTC AAGTAACTCA 6420 

TAAAAAAGCT GCAAGAGATG CAATTAACCA AGCAACAGCA ACGAAAAGAC AACAAATAAA 6480 

TAGTAATAGA GAAGCAACTC AGGAAGAGAA AAATGCAGCA TTGAACGAAT TAACTCAAGC 6540 

AACCAACCAT GCTTTAGAAC AAATCAATCA AGCAACAACA AATGCTAATG TTGATAACGC 6600 

CAAAGGAGAT GGTCTAAATG CCATTAATCC AATTGCTCCT GTAACTGTTG TTAAGCAAGC 6660 
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TGATGCGACT CAAGAAGAAA GACAAGCAGC AATTGACAAA GTGAATGCTG CTGTAACTGC 6780 

AGCAAACACA AACATTTTAA ACGCTAATAC CAATGCTGAT GTTGAACAAG TAAAGACAAA 6840 

TGCGATTCAA GGAATACAAG CAATTACACC AGCTACAAAA GTAAAAACAG ATGCAAAAAA 6900 

TGCCATCGAT AAAAGTGCGG AAACGCAACA TAATACGATA TTTAATAATA ATGATGCGAC 6960 

GCTCGAAGAA CAACAAGCAG CACAACAATT ACTTGATCAA GCTGTAGCCA CAGCGAAGCA 7020 

AAATATTAAT GCAGCAGATA CGAATCAAGA AGTTGCACAA GCAAAAGATC AGGGCACACA 7080 

AAATATAGTA GTGATTCAAC CGGCAACACA AGTTAAAACG GATACTCGCA ATGTTGTAAA 7140 

TGATAAAGC6 CGAGAGGCGA TAACAAATAT CAATGCTACA ACTGGCGCGA CTCGAGAAGA 7200 

GAAACAAGAA GCGATAAATC GTGTCAATAC ACTTAAAAAT AGAGCATTAA CTGATATTGG 7260 

TGTGACGTCT ACTACTGCGA TGGTCAATAG TATTAGAGAC GAT6CAGTCA ATCAAATCGG 7320 

CGCAGTTCAA CCGCATGTAA CGAAGAAACA AACTGCTACA GGTGTATTAA ATGATTTAGC 7380 

AACT6CTAAA AAGCAAGAAA TTAATCAAAA CACAAATQCA ACAACTQAAG AAAAGCAA6T 7440 

GGCTTTAAAT CAAGTGGATC AAGAGTTA6C AACGGCAATT AATtnATATAA ATCAAGCTGA 7500 

TACAAATGCG GAAGTAGATC AAGCGCAACA ATTAGGTACA AAAGCAATTA ATGCGATTCA 7560 

GCCAAATATT GTTAAAAAAC CTGCAGCATT AGCACAAATC AATCAGCATT ATAATGCTAA 7620 

AITAGCTGAA ATCAATGCTA CACCAGATGC AACGAATGAT GAGAAAAATG CTGCGATCAA 7680 

TACTTTAAAT CAAGACAGAC AACAAGCTAT TGAAAGTATT AAACAAGCTA ACACAAATGC 7740 

AGAAGTAGAC CAAGCTGCGA CAGTAGCAGA GAATAATATC GATGCTGTTC AAGTTQATGT 7800 

AQTAAAAAAA CAAGCAGCGC GAGATAAAAT CACTGCTGAA GTGGcGAacG TATTGaAGCX? 7860 

GTTAAACAAA CACCTAATGC AACTGACGAA GAAAAGCAGG CTGCTGTTAA TCAAATCCAA 7920 

TCA/CTTTAA AGATTCAAGC AATTTAATCC AAATTTAATC CAAAACCCAA ACAAATGGAT 7980 

TCAGGGTAGG ACACCACTTA CAAATCCAA 8009 

(2) INFORMATION FOR SEQ ID NO: 62: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10953 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

ACCCACCCCn TGGGGATAnT TTACCTGGTG GGGCCTTCGA TTGCCTTTAG GTGAAACCaG 60 
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AGATGAATGC TAACCATATT CATTCTGCTA AAGATGGTCG TGTTACTGCG ACAGCTGAAA 180 

TTATTCATCG AGGTAAGTCG ACACATGTAT GGGATATAAA AATTAAGAAT GACAAAGAAC 240 

AATTAATTAC AGTTATGCGT GGTACAGTTG CTATTAAACC TTTAAAATAA AAGAACTGCT 300 

AGCTGAAATG TTATGAGATA TTCATAACTA CGGCTAGCAG TTTTTTTATG CXJCTATATTG 360 

TTGTAGTTTT AGAAATGCTT GTTCAATGCG TTCGGCAGCT TTACGGCCAC CCATAACATT 420 

TCTACCAAAT GGTCCTAATT CTAAGTCTGC AAAGCATCCT GCGACAAATA GATTTGGTAT 480 

CCATTCTAAT TTTTCGGAAA TAACAGGGTA ATTACATTCG TTGATAGGTG CATCATAATT 540 

TTGTATTAAT TGCTTAATAA GTGGTTGTGA CATAAAATCT TGTTCAAAAC CAGTTGCAAC 600 

CATAATCTGT TGATATGGAA CAGAATCATT TTCAGTGTTA ATTACACCAC CACTAATTTG 660 

AGTGATAGGT GTTTTATGCa CATTTATACG ACCATTTTTA ATATGTTTTT TAAGGCGTAA 720 

GTACAGTTCG TGAGGCATTG ATCCTTTATG ACGTTCGCGT TGTACAATGG CATTTCTTTC 780 

AGGCATGCTT TTAGTACTTA AAAATGAA6A CATATTTTTC GGACCTAACC AACCAGGATC 840 

AGCATCAAAG TCATGTATTT CAATATCTTT ATTTAGCCAT AAATGAATCT TTTTATCGTT 900 

ATCATGATTT AACAATTTAA GTGCAAGATG TGCAGCAGTa ATGCCGCTAC CAACGATATG 960 

ATCGGTCTTA TCATATACTA CTTGATCAAG TTCTTTCTCG AAGATATGAT TTACATTCTG 1020 

TTTGTCTTTT AAAATGTCAG GCATAAACGG AATATTTGTA CTGCCTATTG CAATAACGAC 1080 

GCAATCTGTA GTGATAAT7T GTCCATCTTC TAACTTGATA TGCCATTTGT CTTCTTGTTT 1140 

ATCTAAAGTT TGAACTAAAC CTTGAACCAA GCAATCCTCT AATTGATATT GTTTAGAAOC 1200 

ATGTGCAATA TGATCCATAA ACATTGTCAA TTCAGGTCGT TGATAAGGAC CATAAAAAGC 1260 

ATTTGTATAT TGGTGCTGTT TAGCGAATTG TTTTAGATGG AACGGTTGTG GATGTACGTG 1320 

ATGTACAATC GGTGATCTTA AATAAGGCAT TTCTATTCGA TTTGTATATG AGTTAAACCT 1380 

TTGGCAAAAA GTTTCGTGTG GGTCAATGAT TGTTAATCGG TCTGTTGTTA ATCCGCTTGA 1440 

TAATAGTTTT TGTGCGATTG CAGTTCCCTG TATGCCACCG CCGATAATTG TCCAATGCAT 1500 

AATAAAACCT CTCTCTTTTT AAAACGTAAT AGTTACGATT TATAATTATT ATTATCATAA 1560 

TACATAACGA CATGAAAGGC AATTAAATTA AAGAGATATA TGTAGATAGG GCGAATCTGT 1620 

AGTCAAAGAA AAAATCATTG AAAAAGAGGT AACAATGTCA AAAGAwAACA GCAGTAAAAT 1680 

CATTCCTAAT TTGGAATCAT CTTACTGCTG TTTGTTGTTG ATTTATATTC ATGATTTTGT 1740 

TATATAATCT ACAATTTTGT GTCTTTTAAG TCTTCCGAAA TTTCATCGAC TTTAGTCTTT 1800 

TTAGTATAAG GCGTTTTAAT ATTATATGCT GCTTTCATAA TCATATGACT TGAAAGAGGA 1860 
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GCAATAAAAT ATAAAAACGT ACCAAATAGT AATGACATTG CACCTAATGT TGATGCTTTT 1980 

CCGGCAGCAT GTGCACGTGA ATATACATCT TCAAGTCTCA ATAATCCTAT AGCTGCTAGG 2040 

GCGCTAATTA AAGCACCGAT GATAACAAAG ATAAGTGCAA GACTAATCAG TATGATTTTG 2100 

ATCATGTTCA ATCACCTTAC CTTTGTCCAT AAATTTAGAG AATACTGCAG TACCTAAAAA 2160 

AGCTAATATA CCAATCATCA TAATAACGAC AATCATGTAT TTAATATTTA ATAAAATACT 2220 

GAATAATGCT ATAACTGCCA TTAATTGAAG ACCAATCGCA TCTAATGCGA CAACACGATC 2280 

GGCAAGTGAT GGGCCTAGCA CAACGCGAAT GAGCATAGCT AACATAGAAA TGACAACTAT 2340 

GATTAATGCA ATAACGATAA TAACATTATG ATTCATTATA TTTCGCCCAC CTCTCTTACA 2400 

ATTTTCTCTA ATGATGTTTT AATACTTTCT ACTTCTTGCT CTTTAGTTGA AAAATCTATG 2460 

GCATGAATAT AAATTTTTGT ACX5ATCGTCA CTTACACCAA GCACTACAGT ACCAGGTGTT 2520 

AATGTAATTA AATTAGACAG CAAGACAATT TGCCAATCTT TTTTTAAATC TGTGTGATAA 2580 

ACAAAGAATC CTGGTTCATT TTTAATCGAA GGTTTAATAA TAATTTTCAA AACATCAAAA 2640 

TTAGCTTTAA TCAGTTCGAT TAAGAAAATA ATAACTAATT TAATAATACX5 ATATAGCGTG 2700 

ATGACATAAA ATCTACCTGG TAACACTCTG TGTAAGAGGT AAACAAGAAC TAGGCCAAAG 2760 

ATGAAACCTA ACACAAAGTT ATTTGTTGTG TAACTATTTG TCACAAACAA CCAAAACACT 2820 

GCGATAATAA AGTTTAATAC TAAirGTACA GCCATGTTAT TTACCTCCTA ATACAGCTTT 2880 

AACGTAGGTT GATGGATTGT AGAATGTTTC TGCACCAGCT TTTACCATTG GATATAAGTA 2940 

ATCTGCTGAC AATCCATATA AAACAGTTAT CACAACTGCA ACGATTGCAA TCGTAGTTAA 3000 

ATATTTGACG TCGACTTTGT TATTAAGATC ATATCCTTTT GGTTGACCGA AAAAGCCTTG 3060 

TAGGAATATG CGAATGACAG AATATAATAC 6ACTAAACTT GATAATAAGA CGATGACACC 3120 

ACTTAAATAA AATCCTCTTr CAAATGTTGA TTGGACAATA AAAAATTTTC CATAAAAGCC 3180 

ACTGAGTGGG GGAATGCCAG CTAAACTTAA TGCTGCGATA AAGAATGACC AACCAAGTAC 3240 

AGGATATCGT TTAATTAAGC CACCAAATTG TCTTAAATCA GCAGTGCCTG TAATTTTAAT 3300 

CATAATTCCG ATAAGCAAGA ATAATGCAAG TTTTACTAAC ATGTCGTGCA ATGTATAGTA 3360 

AATAGCCCCA ATCATACCTG ACTCTGTCAT CATTGCAACG CCGACTAAGA TCACACCTAC 3420 

AGCAATCATG ACATTGTATA GGATGATTTT TTTAATGTTG GCATATGCAA CAGCACCGAC 3480 

ACAACCAAAG ATGATCGTTA ATAGTGCTAA GAATAAAATG ACATAATGTG AAAAGCTTAC 3540 

ATTATCACTA AAGAATAGGC TCAATGTTCT AGCGATTGCA TAAACACCAA CTTTTGTTAA 3600 

CAAAGCACCA AAGAATGCAA TGATTGGAAT TGGTGGgCAT AGTATGCACT AGGTAACCAA 3660 
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ATATTGACTA AGCCACTGTC ATGCGCTGAA AGGTTAGCTA ATTTATTGCT TATATCTGCT 3780 

AGATTCAATG TTCCTACTAC TGAATATAAA ATCGCTACAC CCATTACGAA GAAGGATGAC 3840 

GATACAACGT TAACAAGAAC ATATTTTATT GTTTCTTGTA GTTGAATTTT TGTAGAACCA 3900 

ATTACTAATA AGAAATAAGA TGACATTAAA AATACTTCGA AAAATACGAA TAGGTTGAAA 3960 

ATGTCACCAG TTGTGAATGC ACCAATGATA CCTATTAACA TAAATAGTAC TGAAAAATAA 4020 

TAATAATATC TTTCACGTTC AATACCAATT GTTTGGTATG AATATAAAAT CACAATAGCT 4080 

GTAATAATAA TACTAGTAAT TATTAGTAGG GCACTGAATA TGTCTAATAC AAAGACAATA 4140 

CTGTATGGTG CTTTCCATGA ACCTAGCTCT ACGCGTATTG GTCCATGTTT AACAACATTT 4200 

GCTAAATTGA TAATTGCCGC GACCAAGGTT AATAATGTAC CGCCTAGTGC GACATAACGC 4260 

TTTATAATAG GACGCTTTCC AATAAAGACA AGTAATATGG CTGTAATTAC TGGAATAACT 4320 

AGCGTTAACA CAAGCATATT ACTTTCAATC ATCTTCTGGA ACTCCTTTCA TACTCTCAAC 4380 

GTTATCTGTG CCTAATTCTT TATATGTTCT AAATGCTAAT ACTAAGAAAA AGGCTGTTGT 4440 

CGCAAgGCGA TAACGATTGC TGTTAAAATA AGTGCTTGCG GGaTAGGaTC AACATAGCTT 4500 

TTTACGTTCG CTTCATAAAT TGGAACAGTA CCATGTTTAA GTCCGCCCAT AGTTATTAAA 4560 

AATAAATTTG CTGCATGTOT TAATAGTGTA GTTCCCATAA CAATTCGTAT CAGACTTTTA 4620 

GACAAAACGA GATAGACACT AATTGCTGTG AGAATACCAC TAACAAAAAT CATAATAATT 4680 

TCCACTATTC GTTCTCTCCA ATCGAAATAA TAATTGTCAT GACAGTACCA ACTACTGCAC 4740 

ATAAAACACC GAAATCAAAG AATACTGCTG TTGTCATATG AACAGGTTCT AATATAAATA 4800 

ACGGTATATC AAATGTGACA TGCGTAAAGA AATTTTTGCC TAAAAACCAA CTTGCGATAG 4860 

GCGTCGCAAT ACAAAAAACT AATCCGATAC CTATCAAGAT TTTAAAATCT AATGGGAAAA 4920 

rmScGCAT TGTTTCTATA TCAAATGCAA TCGTAATGAT AACAAGTGAA CTTGCGAATA 4980 

ATAATCCGCC GACGAAACCG CCACCAGGT6 TATAATGTCC TGCTAAGAAA AGTGAAAAAC 5040 

CAAAGACCAT TACCATGAAA AAGATAATAA CTGCAGCAAA TTGCAAAATT AGATCATTTT 5100 

GTTGTCTATT CATGATTTTT CACCTCGTTA CCTTGCGTTT GACGCTTTTT ACGTAATTTA 5160 

ATCATTGTAT ATACAGCTAA TCCTGCGATA CCAAGCACAG ATGACTCGAA TAAAGTATCC 5220 

ATACCACGGA AATCAACAAG TATGACGTTT ACCATGTTTT TACCGTGAGC tAAATCATAA 5280 

ACGTGCTCTT GATAAAACTT AGATATCGAT TCAAAATGTC TATTTCCGTA TGCAATTAAA 5340 

CCGATAATAA TGACGGACAA ACCAACACCA CCAGCAATTA AAGCATTAGT AAGCTGGAAT 5400 

GAGCGCTTTT CATTATAACG ATTTAAATTT GGTAAGTGGT AGAAGCATAA TAAGAACAAT 5460 
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ATAAACAATA CAGACACAGC ATATCCAACT GCACTTAACA TAATGATGCT AAATAATCTT 5580 

GATTTAGCC5A AAAGAATTAA AAAGGCAGCA CTTAATAATA AAATTACGAT ACAAACTTCG 5640 

AAAATTCTAA TCGGACTAAC GTCTTTAAAA TTAATGTTGA AAGGTACTGA GAATATAGTG 5700 

ACAAATGTTA ATAAAATTAA TGCACCAAAA ATGATAACTA AATTATTACG TGAATAATCG 5760 

GTAACATAGC TATTCGTCAT CTTTTCAGAG TAGTTTGGAA TAACATTTGC ACTTCTGTTG 5820 

TACCAATAAT TGAATGTTAG TTTACCAGGT TGTCGTTGCA ACAATTTCAC CCAATAACTA 5880 

AATGTCACAA TTAGTAAGAT ACCTAAAATA TAAATCACTA ATGTTGATAA AAAGGCAGGC 5940 

GTTAATCCAT GGAACATATG GAATTCAACA TCATCAATTA CCGTATGATT AATCGAAGag 6000 

TnAGCTGGlT CAATAATCGA ATTAGTTAAA ATGCCAGGGA ATAAACCAAA TACAATTACT 6060 

AATGTAGCTA AAATAGCTGG TGATAAAAGC ATTAATATTG ATACTTCGTG TGCTTTTTTA 6120 

GGTAATTGTT GAGGTTTATA TTGTCCGAAA AATATATGCA TTATAAATTT AATTGAATAT 6180 

ACAAAT6TGA AGACACTGCC CACTATACCA ATGATTGGGA ATAGGTAGCC TAATGTATCA 6240 

ACACTGAATA AATTTGCTTG GCTTGCTGTA AATGTTGTTT CTAAAAATGA TTCTTTTGAT 6300 

AAGAAACCAT TGAACGGTGG TACACCAGCg CATACTTAAT GCTGTAATAA CAGTGATTGT 6360 

AAATGAAATA GGCATAATTG TTAGTAAGCC ACCTAATTTC TTAACATCAC GTGTACCAGT 6420 

AGAATGATCC ACTGCACCTG TAATCATAAA TAGGGCACCT TTAAATGTTG CATGGTTGAT 6480 

TAAATGGAAT ATTGCAGCCG TAAATGCAGC AGCATATATT TTGCTATCAT CGCCTTGATA 6540 

GTGATAACTA ATGGCACCGA TTCCAAGCAT CGCCATAATC ATACCTAATT GGGATACTGT 6600 

TGAAAATGCC AGTATACCTT TCAAGTCTTG TTGTTTTGTT GCGTTTAGCG AAgCCCAGAA 6660 


TAATGTAATT AAACCAACGA GTGTGACAGT CCATACCCAA CCTTGCGATG CTGCGAAGAT 6720 

TGGTpTCATT CGAGCGATTA AATATAACCC TGCTTTAACC ATTGTTGCTG AATGAAGATA 6780 

AGCACTGACT GGTGTAGGTG CTTCCATTGC ATCTGGTAGC CAAATATAAA ATGGAAACTG 6840 

AGCAGATTTT GTAAAAGCAC CAATCATGAT TAAAATCATC GCAAAAATGA AGAATGGGCT 6900 

ATTTTGAATT TCAGAAGCAT GTTGAATCAT GTACTGAATG CTAAATGATT GTGTTGGTAT 6960 

AGCGAGTAAG ATGATACCAC CTAATAATGA TAGACCACCA AATACTGTGA TTATGAGCGA 7020 

TTTTTGAGCA CCATATATAG ATGCTTGTCG TTCGCGCCAG AATGAAATAA GTAAAAAACT 7080 

AGAAAATGAC GTTAGCTCCC AGAATAAATA TAGAATAATA ACATTATCTG AAAGTACGAC 7140 

ACCTAACATT GCACCCATAA ATAGTAATAA ATAACAATAA AAATTCCCTA GTTGTTCTGA 7200 

CTTACTTAAG TAGCCGATTG AATATAATAC TACTAAACT6 CCGATTCCTG AAATAAGCAA 7260 
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CCAATTTAAG GTTTTCATTA CA6TATTACC TGACATCGTC GTTTTAATTA ATGTAAGCAT 7380 

ATAAATAAAT ATGACGATAG GGACAGGTAA TACGAACCAT CCTAAATGTA TACGTTTAAA 7440 

AAATCTATAC AGGATAGGAA TAATGAGTGC GAATATTAAC GGTAATATCA CCGCAATATG 7500 

TAACAAACTC ACTATGTTGT CCTCCTTTAA AAAATATTTA TGTTATTCAT TATACATGAA 7560 

TGATATAGTT CTGAAAAACG TACACACTCC TTGTTGTGCT TTATTTTCAG AaGTATTTAA 7620 

ATAAGAAGAA ACACGTCATT TTTTATTTAA AATTTTCTTT GTATTGAAGT GAATAATCTT 7680 

CTTTTAAGCG TGCTAAACTA GCTAAAGACA TTTCAGCATG TTTTGTTTGC TGAGCTTTAA 7740 

GTTTAGTTTC TAAATCTGTA ATTGCTTGTT GAAGTGAATC TTCATAGCGC AATACATCAA 7800 

CATTGAAGTC GCGTAATTGT GAACGTTTCG TATAGCGTTT TTCAAAATGG CTTAATGCTT 7860 

TGCGGTCATG GAAAAATACA CCTTCAGTTT CAGTAGGGTT ATGTAAATCA CCTTGTTTCG 7920 

GGTGTTTGAT AACTTGTTCA ACTTTAACAA GGACATCGTC TCCATTTTCT TCAACAATCG 7980 

TGACACCATA GCTACCTGTT TTGTGTGAAA ATCGATATAG CTTCATGCTA TTTTCCTCCC 8040 

TTAAAAGTAT GTTAATATAT ATGTATCATA ACATGAATGG AGAATATAAA TGGCTAACTA 8100 

TCCACAGTTA AACAAAGAAG TACAACAAGG TGAAATCAAA GTGGTTATGC ACACAAATAA 8160 

AGGTGACATG ACATTCAAAT TATTTCCAAA TATTGCACCA AAAACAGTTG AAAATTTTGT 8220 

GACACATGCA AAAAATGGTT ATTATGATGG AATCACATTC CACCGTGTCA TTAATGACTT 8280 

CATGATTCAA GGTGGCGATC CAACAGCTAC TGGTATGGGT GGCGAAAGTA TTTATGGCGG 8340 

TGCTTTTGAA GATGAATTTT CATTAAATGC ATTTAACTTA TATGGCGCAT TATCAATGGC 8400 

TAACTCAGGA CCTAATACTA ATGGTTCACA ATTTTTCATT GTTCAAATGA AAGAAGTACC 8460 

TCAAAATATG TTAAGTCAAC TTGCAGATGG TGGCTGGCCT CAACCAATCG TTGATGCATA 8520 

TGGCSAAAAG GGTGGTACAC CATGGTTAGA TCAAAAACAT ACAGTATTCG GTCAAATCAT 8580 

TGATGGTGAA aCTACATTAG AAGATATTGC AAATACAAAA GTGGGACCAC AAGATAAACC 8640 

ACTTCATGAT GTTGTAATTG AATCTATTGA TGTTGAAGAA TAATATCTAA ACATAATTAA 8700 

CTACCAACAT TTTAAACTCG GATAAAGCTA ATTTATGAAT GGATTAGTAT ATATTCCAAC 8760 

gAAAATAAAT AAACTAATAT GATGAGCAAT CTCAATATAT TTATCaAGAA AGCACAGTTT 8820 

TTAAATAGAT GTGTATTTTA AAGATAATAG TTGAGGTTGC TTTTTATGTT TTTACAGAGA 8880 

ATTGCTATTC AAATAGTAAA TAAATTGAAA ACAAAGTAGC TGGATATCAT ATTGATTTAG 8940 

ATAGGAATTT GTTGCTAATT TTATTTGTAA ATCCAAGTTT GTAGAATTCT TATTCATTTA 9000 

TAAAATAATA TTCGTATGAT TTGATTTTTT AATTAGTCCA CCATTTCGAT TTGTGCTATG 9060 
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AATTTAATGA AGGGTTTTAT CCATTACCAC GTA 10953 
(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8155 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 
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20 
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TTTGATAnAA AACTGAATnA ATTAAATGTA TCGATTCAAC CTAATGAAGT GAATTTACAA 60 

GTTAAAGTAG AGCCTTTTAG CAnAAAGGTT AAAGTAftATG TTAAACAGAA AGGTAGTTTA 120 

GCAGATGATA AAGAGTTAAG TTCGATTGAT TTAGAAGATA AAGAAATTGA AATCTTCGGT 180 

AGTCGAGATG ACTTACAAAA TATAAGCGAA GTTGATGCAG AAGTAGATTT AGATGGTATT 240 

TCAGAATCAA CTGAAAAGAC TGTAAAAATC AATTTwCCAG AACATGTCAC TAAAGCACAA 300 

CCAAGTGAAA OGmAGGCTTA TATAAATGTA AAATAAATAG CTAAATTAAA GGAGAGTAAA 360 

CAATGGGAAA ATATTTTGGT ACAGACGGAg TAAGAGGTGT C6CAAACCAA GAACTAACAC 420 

CTGAATTGGC ATTTAAATTA GGAAGATACG GTGGCTATGT TCTAGCaCAT AATAAAGGTG 480 

AAAAACACCC ACGTGTACTT GTAGGTCGCG ATACTAGAGT TTCAGGTGAA ATGTTAGAAT 540 

CAGCATTAAT AGCTGGTTTG ATTTCAATTG GTGCAGAAGT GATGCGATTA GGTATTATTT 600 

CAACACCAGG TGTTGCATAT TTAACACGCG ATATCGGTGC 

CAGCCTCTCA            6CAGATAATG GTATTAAATT CTTTGGATCA GATGGTTTTA 720 

AACTATCAGA TGAACAAGAA AATGAAATTG AAGCATTATT GGATCAAGAA AACCCAGAAT 780 

TACCAAGACC AGTTGGCAAT GATATTGTAC ATTATTCAGA TTACTTTGAA GGGGCACAAA 840 

AATATTTGAG CTATTTAAAA TCAACAGTAG ATGTTAACTT TGAAGGTTTG AAAATTGCTT 900 

TAGATGGTGC AAATGGTTCA ACATCATCAC TAGCGCCATT CTTATTTGGT GACTTAGAAG 960 

CAGATACTGA AACAATTGGA TGTAGTCCTG ATGGATATAA TATCAATGAG AAATGTGGCT 1020 

CTACACATCC TGAAAAATTA GCTGAAAAAG TAGTTGAAAC TGAAAGTGAT TTTGGGTTAG 1080 

CATTTGACGG CGATGGAGAC AGAATCATAG CAGTAGATGA GAATGGTCAA ATCGTTGACG 1140 

GTGACCAAAT TATGTTTATT ATTG6TCAAG AAATGCATAA AAATCAAGAA TTGAATAATG 1200 

ACATGATTGT TTCTACTGTT AT6AGTAATT TAGGTTTTTA CAAAGCGCTT 6AACAAGAAG 1260 

OAATTAAATC TAATAAAACT AAAGTTGGCG ACAGATATGT AGTAGAA6AA ATGC6TC0CG 1320 
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TAATACTC5AA ATCGGTAACA ATGGTAATTT TGGTGCTTCA TTAAAAGCAG ATCAATTTAA 3240 

ATATGAAGTA ACATTACCAC AAGGTGTAAC TTACGTTAAT AATTCATTAA CTACAACATT 3300 

CCCTAATGGT AATGAAGACA GTACAGTATT GAAAAATATG ACTGTTAATT ATGATCAAAA 3360 

TGCAAATAAA GTTACATTTA CAAGCCAAGG TGTGACAACG GCACGTGGTA CACACACTAA 3420 

AGAAGTTTTA TTCCCAGATA AATCTTTAAA ATTATCATAT AAAGTTAATG TTGCGAATAT 3480 


CGATACACCT AAAAATATTG ATTTTAATGA AAAATTAACA TATCGTACTG CTTCAGATGT 3540 

TGTAATTAAT AATGCGCAAC CAGAAGTaCA CTAACTGCAG ATCCATTTTC AGTAGCGGTT 3600 

GAAATGAACA AAGATGCGTT GCAACAACAA GTAAACTCAC AAGTTGATAA TAGTCATTAC 3660 


ACAACAGCAT CAATTGCAGA ATACAATAAA CTTAAACAAC AAGCAGATAC TATTTTAAAT 3720 

GAAGATGCGA ATCATGTTAA AACTGCAAAT CGTGCATCTC AAGCGGATAT TGATGGTTTA 3780 

GTAACTAAAT TACAAGCTGC ATTAATTGAT AATCAAGCAG CAATTGCTGA ATTAGATACT 3840 

AAAGCTCAAG AAAAGGTTAC AGCAGCACAA CAAAGTAAAA AAGTTACGCA AGATGAAGTT 3900 

GCAGCACTTG TAACTAAAAT TAACAATGAT AAAAATAATG CAATCGCAGA AATTAATAAA 3960 

CAAACTACAG CACAAGGTGT CACAACTGAA AAAGATAATG GTATCGCAGT GTTAGAACAA 4020 

GATGTGATTA CACCAACAGT TAAACCTCAA GCGAAACAAG ATATTATCCA AGCAGTTACA 4080 

ACTCGTAAAC AACAAATTAA AAAGTCAAAT GCATCATTAC AAGATGAAAA AGATGTAGCA 4140 

AATGATAAAA TTGGTAAAAT TGAAACAAAG GCAATTAAAG ATATTGATGC A6CAACAACA 4200 

AATGCACAAG TAGAAGCCAT TAAAACAAAA GCAATCAATG ATATTAATCA AACTACACCT 4260 

GCTACAACAG CTAAAGCAGC AGCTCTTGAA GAATTTGACG AAGTTGTTCA AGCACAAATT 4320 

GATCAAGCAC CTTTAAATCC TGATACAACA AATGAAGAAG TAGCGGAAgC TATTGAACGT 4380 

ATTAATGCAG CTAAAGTTTC TGGTGTTAAA GCAATTGAAG CGACAACX3AC TGCACAAGAT 4440 

TTAGAAAGAG TTAAAAACGA AGAAATCTCA AAAATTGAAA ATATTACTGA CTCTACGCAA 4500 


ACAAAAATGG ATGCCTATAA TGAAGTTAAA CAAGCTGCAA CAGCTAGAAA AGCTCAAAAT 4560 

GCTACAGTTT CAAATGCAAC AAATGAAGAA GTAGCAGAAG CTGATGCAGC AGTAGATGCA 4620 

GCTCAAAAGC AAGGTTTACA TGACATCCAA GTTGTTAAAT CAAAACAGGA AGTTGCTGAT 4680 

ACAAAATCAA AAGTATTAGA TAAAATCAAT GCAATTCAAA CACAAGCAAA AGTTAAACCT 4740 

GCAGCTGATA CGGAAGTAGA AAACGCATAT AATACACGTA AACAAGAAAT TCAAAATAGC 4800 

AATGCTTCAA CTACAGAAGA AAAACAAGCT GCATATACAG AATTAGATAC TAAAAAGCAA 4860 

GAAGCAAGAA CAAATCTTGA TGCTGCAAAT ACAAACAGTG ATGTAACAAC AGCTAAAGAC 4920 
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GCGGAAATCG CTCAAAAAGC AAGTGAACGT AAAACAGCAA TTGAAGCAAT GAATGATTCG 5040 

ACTACTGAAG AACAACAAGC AGCGAAAGAC AAAGTGGATC AAGCAGTAGT TACTGCAAAC 5100 

GCTGATATAG ATAATGCTGC AGCAAACAAT GATGTGGATA ATGCAAAAAC TACAAATGAA 5160 

GCTACAATCG CAGCCATTAC ACCTGATGCA AATGTTAAAC CAGCAGCAAA ACAAGCAATT 5220 

GGUSATAAAG TACAAGCTCA AGAAACAGCA ATTGATGGAA ATAACGGCTC AACAACTGAA 5280 

GAAAAAGCAG CTGCTAAACA ACAAGTTCAA ACTGAAAAAA CAACAGCTGA TGCCGCAATA 5340 

GATGCAGCAC ATACAAATGC GGAAGTTGAA GCGGCTAAAA AAGCAGCAAT TGCTAAAATT 5400 

GAAGCX3ATTC AGCCAGCAAC AACAACTAAA GATAATGCGA AAGAAGCAAT TGCTACGAAA 5460 

GCGAATGAAC GTAAAACAGC AATCGCTCAA ACGCAAGACA TTACTGCTGA AGAAATTGCA 5520 

GCGGCTAATG CGGACGTAGA TAATGCTGTG ACACAAGCAA ATAGCAACAT TGAAGCTGCT 5580 

AATAGTCAAA ATGATGTAGA CCAAGCGAAA ACX3ACAGGTG AAAATAGTAT TGATCAAGTA 5640 

ACACCAACAG TTAATAAAAA AGCAACTGCA CGTAATGAAA TCACAGCAAT TTTAAATAAC 5700 

AAATTGCAAG AGATTCAAGc tACGCCAGAT GCAACAGATG AAGAAAAACA AGCAGCTGAT 5760 

GCTGAAGCAA ATACTGAAAA TGGTAAAGCA AATCAAGCCA TTTCAGCAGC AACTACTAAC 5820 

GCACAAGTTG ATGAAGCTAA AGCAAATGCA GAAGCAGCGA TTAATGCGGT AACACCAAAA 5880 

GTTGTGAAGA AACAAGCGGC TAAAGATGAA ATTGATCAAT TACAAGCAAC GCAAACAAAT 5940 

GTTATCAATA ATGATCAGAA CGCTACAACA GAAGAAAAAG AAGCAGCTAT TCAACAATTA 6000 

GCAACAGCAG TTACAGACGC GAAAAATAAT ATTACAGCTG CAACTGATGA TAATGGTGTA 6060 

GATCAGGCGA AAGACGCTGG AAAGAATTCA ATTCAAAGCA CGCAACCAGC AACAGCGGTT 6120 

AAATCAAATG CTAAAAATGA TGTTGATCAA GCTGTGACAA CTCAAAATCA AGCAATTGAT 6180 

AATAGAACTG GTGCTACAAC TGAAGAGAAA AATGCAGCAA AAGATTTAGT TTTAAAAGCT 6240 

AAAGAAAAAG CGTATCAAGA TATCTTAAAT GCACAAACAA CTAATGATGT TACGCAAATT 6300 

AAAGATCAAG CAGTTGCTGA TATTCAAGGT ATTACTGCAG ATACAACAAT TAAAGATGTT 6360 

GCGAAAGATG AATTAGCAAC AAAAGCAAAC GAAGAAAAAG CGCTTATTGC ACAAACTGCA 6420 

GATGCGACTA CTGAAGAAAA AGAACAAGCA AATCAACAAG TAGACGCACA ATTAACACAA 6480 

GGTAATCAAA ATATTGAAAA TGCACAGTCA ATCXSATGATG TAAACACTGC AAAAGATAAT 6540 

GCAATTCAAG CAATTGACCC AATTCAAGCA TCAACAGATG TTAAAACGAA TGCAAGAGCG 6600 

GAATTGCTAA CTGAAATGCA AAATAAAATA ACTGAAATAC TTAATAATAA TGAGACTACT 6660 

AATGAAGAAA AAGGTAACGA TATTGGACCA GTTAGAGCAG CATATGAAGA AGGTTTAAAT 6720 
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AAAGTTCAAC aacttcatgc AAATCCTGTT AAGAAACCAG CAGGTAAAAA AGAATTAGAT 6840 

CAAGCT6CAG ctgataagaa AACACAAATA GAACAAACAC CAAATGCATC ACAACAAGAA 6900 

ATTAATGATG CAAAACAAGA AGTTGATACT GAATTAAATC AAGCGAAAAC AAATGTCGAT 6960 

CAATCATCAA CAAATGAATA TGTTGATAAT GCAGTTAAAG AAGGAAAAGC TAAAATTAAT 7020 

GCA6TTAAAA CATTTAGTGA GTACAAAAAA GATGCITTAG CTAAAATTGA AGATGCATAT 7080 

AATGCTAAAG TAAACGAAGC GGATAACTCT AACGCATCGA CTTCAAGTGA AATTGCTGAA 7140 

GCGAAACAAA AACTTGCTGA ATTAAAACAA ACTGCGGATC AAAATGTTAA TCAAGCTACT 7200 

TCTAAAGATG ACATTGAAGT TCAAATTCAT AATGACTTAG ATAATATTAA CGATTACACA 7260 

ATTCCAACAG GTAAAAAAGA ATCAGCTACA ACAGATTTAT ATGCTTATGC AGATCAGAAG 7320 

AAAAATAATA TTTCAGCTGA CACTAATGCA ACACAAGATG AAAAGCAACA AGCAATTAAG 7380 

CAAGTTGACC AAAATGTTCA AACTGCATTA GAAAGCATTA ATAATGGTGT GGATAATGGT 7440 

GACGTTGATG ATGCATTAAC ACAAGGTAAA GCAGCAATTG ATGCTATTCA AGTAGATGCT 7500 

ACTGTTAAAC CTAAAGCGAA CCAAGCTATT GAAGTTAAAG CAGAAGATAC GAAAGAATCT 7560 

ATTGATCAAA GTGACCAGTT AACTGCTGAA GAAAAAACTG AAGCATTAGC AATGATTAAA 7620 

CAAATTACAG ATCAAGCTAA ACAAGGTATT ACTGATGCAA CAACAACTGC TGAAGTTGAA 7680 

AAAGCGAAAg CTCaAGGACT TGAAGCATTT GATAACATTC AAATCGACTC AACAGAAAAA 7740 

CAAAAAGCTA TCGAAGAATT AGAAACTGCA CTAGACCAGA TTGAAGCAGG TGTAAATGTC 7800 

AACGCTGATG CTACAACTGA AGAAAAAGAA GCGTTTACGA ATGCTTTAGA AGACATTTTA 7860 

TCAAAAGCAA CTGaAGATAT TTCTGATCAA ACTACAAATG CAGAAATCGC TACTGTCAAA 7920 

AATAGTGCGC TTQAACAACT TAAAGCACAA CGTATTAATC CTGAAGTTAA GAAAAAOXSCT 7980 

ttggAgcaa TCAGAGAAGT GGTTAACAAG CAAATAGGAA tAATTAAAAA TGCAGATGCA 8040 

GATGCATCGG CGGAAAGAnA TTGCACGTAC GGGATTTAGG TAGATATTTT GGACCGATTT 8100 

gctggataaa TTTAGGGTnA AACCCCAACC AATGCCGAAG TTGCCTGAAT TACCA 8155 



(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1630 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 



CAATTGGACa TCTTGTATGA AAAGGACAAC CTTGCGGCGG ATTACTTGGC GAAGGTAATT 60 

CTCCTTTTAA TATAATTCTA TTGTTATTAT GTTTATCAAT TTGTGGTATT GATGAAATCA 120 

ACGCTTTTGT ATATGGATGT TTGGGATTTT CATAAATTTC TTTATCAGAT GCGATTTCAA 180 

CTATATGACC TAAATACATA ACTCCAATGA CATCACTTAT ATGTTTTACT ACACTTAAAT 240 

CATGTGCGAT AAATAAATAG CTTAAGTTAA ATTGTTCTTG TAAATCTTTT AATAAATTCA 300 

GTACTTGAGA TTGAACAGAT ACATCTAATG CACTTACAGG CTCATCAGCA ACAATTAAAC 360 

TCGGACGCAA AGCCAATGCT CTTGCAATTC CCACTCTTTG TCTCTGTCCA CCTGAAAATT 420 

CATGTGCATA TTtATAATAT GCATCTTCAC TTAGGCCAAC ACATTTTAAT AAATATAGTA 480 

CTTCTTTTTT TATTTCTTCT TTTGGCAATT TTTTATAATT TAAAATAGGT TCTGAAATGA 540 

TATCTCCAAC CATTTGCATC GGATTCAATG ATGCATACGG ATCTTGAAAT ATCATCTGAT 600 

ATTGTTGTCG TGATTTTCTG AGri-iTlTAC CTTGTAATCT TGTTATATCT TCACCATTAA 660 

CAATTATTGA GCCTGAAGTT GCATCTTCAA GCCTGATAAT CACTTTACCT AACGTTGACT 720 

TACCACAACC CG 732 



(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5838 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 
AATATATTCA TATGTTTCAT CAACAATATT AGCTGCTTTT TGAATTAAA6 CAATTTCGTC 60 
AGCATCTTTG ACGTCTCTAA TTTTATCTAC AGTATTAGAA ATOCTTATTA ATGATATACG 120 


GCTTTTATTT AATTCAAGGT ATGTATCATA ACTTACATGA TGCCCCTCAA AACCTACATT 180 
TTCAAAATTT TCTTGGTGTA GCAATTCTTT AATCTCACCA ATAATAGTAG ATTTACGATT 240 

AATAATTTCA TAATTTGGCG CCTGCTTAGT TGCTTGATCA ATATATCTAA AGTCTGTTAT 300 
CAAATATTGT TTATCTTTAG ATATGATAAG TGCTCCACTG GTACCAGTAA AACCTGATAA 360 
ATATCTTCTA TTGTAATCCG AAAGAATGaT AATCGCATCT AAATGTTTTT GTTCTAAAAT 420 
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AAATGTATAA TATTTGATTC GCTAATTAAT 
TCTTTAGTGA AATATTTTGA TAATTTGACC 

^ TTAGGGAGGA TGCAAGATGA GTGCTAGTTT 
TATTGCTTAT ATGATTGTTC AACAAATTCT 
AAATGAATTC CATAATGGGA TTAGAAAAGC 

10 

CTATGACTAC GGTCACATTA ATGGGTCTCG 
ATTCCAAGGA TTAAGAAAAG ATCAACCGGT 
CTATAGAGCC GCTCGTATTT TGAAAAAGAA 
CGGCTATAAA AAATGGACTG GAAAAATAAA 
TACGATTTAA TAAAATCTGA GTGTTAATTG 

20 TiTIAACAAA GTCTGTTACT ATATTTCTCT 
CATAATAAAG AAAGTTCAGC TCATTTTCAA 
AAAAGACAAT TTCTATGAAA GAATAATACC 

25 ITi'Ci'l'GTTT AAATAGAAAT TGTCTTTTTC 
TTCATACAAA GTTAGAATAA TAATTCTCGG 
TTAAGATTTT CAAACTTTAA TATTGGTTTA 

30 

ATCACAGTTG TATGTGGTGC TTCTAGCacT 
ATACTAATTA ATGTATCGAT AAAATAATCA 
TCAATCATCA TACCTTCTTC AACATTTAAT 

35 

AAATCTAATA ATCGCTTAGC CATGTCTAAA 
CCACTTAACA CAAACTCGTG TTTACAATAT 

40 AAACGTGCTT TAATATAATT CGCATTAAGA 
GTTGCTCCCA TAGTTCGAAT ATACGTATAA 
AATGG'i'lTrA CACGTCCGAT AGAATTTTTA 

TCTTTAATAA CCATTGGCTT TGGTAAGTAA
CCTGAACCAG GACCGCCACC ACCATGTGGA 
ACAGCATCAA ATCCCATATC TCCTGGGCGA 

CCATCATAAT ATAATAGACC ACCAGCATTA 
TTTTCGAAAA TACCTAAAGT GTTTGGATTA 

CAATTTAACT AAATGAATAA TAATTGCAAT 2340 

TAACAGTCTT ATAATTATAT TATCGTTTAA 2400 

GTACATCGCA ATAATTTTAG TTATAGCAAT 2460 

TAACAAGCGA GCTGTTAAAG AATTAGATCA 2520 

TCAAGTCATC GATGTTAGAG AGAAAGTTGA 2580 

CAATATTCCT ATGACAATGT TCAGGCAACG 2640 

ATACTTATGT GATGCCAATG GGATTGCTAG 2700 

TGGATATACA 6ATATCTATA TGTTAAAAGG 2760 

GTCTAAAAAA TAGTTTTTGT AAATTTAATA 2820 

ATCATCAATA ACAATACTCA GATTTTAATT 2880 

AOCTTCACTG ATCATTAAAC TTAGTTTCAG 2940 

TACX3ATTCAA TTACCGCAAT CTAAAAAAT6 3000 

AAACCCTAAG AGTTATTACT TCGQTTTAGT 3060 

AATTGATTTT GAAACCATTA TCCTTAAATC 3120 

AATATGTGTT TAATACTTTA TTTTTCCTGT 3180 

CGAGCAGCTG TAGCTTCGTC TAATCGATCA 3240 

TTATCAGGAT CATTTTTAGC TTCTTCAGCA 3300 

AGTGTTTCTT TAGACTCTGT CTCAGTCGGT 3360

GGGAAGTATA TTGTTGGTGG ATGTACACCG 3420 

GTACGTACAC CAAATTCTTT TTGACGCACA 3480

TGTTTATAAG GTATTTCAAA GTGTTTAGAT 3540 

ACCGCTGCTT CAGAAACCTC TTTAAGTCCA 3600 

GCTCTTAAGT AAATACCAAA GTTACCATAA 3660 

ATGTCATTAT CATATTTAAA TTTGTCGCCA 3720 

CTTGCTAGTT CTTTTACTAC ACCGACTGGA 3780 

CCAGTAAATG TTTTATGCAA GTTTAAATGA 3840 

ACTTTGTCCA TAATAGCGTT TAAATTCGCA 3900 

TGQACGATTT CACGGATTTC CATAATATTT 3960 

GTTAACATAA TAGCTGCTGT ATTTTCATTT 4020 
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10 



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GATTTAAATC CTGCAAATGa AGCTGAGGCT GGaTTCGTAC CATGCGCAGA ATCTGGcACA 4140 

ATGACTTCAT CACGATGACC TTCACCATTA TTCTCATGGT AAGCTTTAAA TATCATCAAT 4200 

GCAGTCCATT CACCATGTGC GCCAGCAGCT GGTTGTAATG TCACCTCATC CATACCAGTA 4260 

ATTTCTTTTA ATTCTTCTTG CAAACTATAA ATAATTTCTA ATGAACCTTG AACTTGATCT 4320 

TCATCTTGTA ATGGATGTGA TTCACTAAAT CCTGGTATTC TAGCAACCTT TTCATTAATT 4380 

TTAGGGTTAT ACTTCATCGT ACATGAACCC AATGGATAAA ATCCGTTGTC TACACCGAAA 4440 

TmTATTTG AAAGTTCAGT ATAATGACGT ACTAAGTCTA GTTCAGCAAC TTCAGGAAAC 4500 

TCCGCTTTGT TTTTACGAAT AAATTTATCA TCTAACAATG ACTCAACAGA ATTTGTTTTA 4560 

ATATCACTTT TTGGTAATGA ATATGCATAT CTGCCTTCAC GAGATCTTTC AAAAATTAAT 4620 

GGACTTGATT TACTAGTCAT TTAACTCACC AGCCTTTTCT ACAAATGTAT CGATTTCATC 4680 

TTTTGTTCTT AATTCAGTTA CAGCTATTAA CATGTGATTT TTAAAGTCGT CTGAAACAAC 4740 

ACCTAAATCA AAACCACCX3A TAATATTGTA CTTCACTAAT TCCTCGTTAA CTTGTTGAAT 4800 

TGGTTTGTCA AATTTGACTA CAAACTCATT GmnAAGnTGT ACCATCTAAT ACTTCAAAAC 4860 

CTTTTTTAAT AAATTGTTGT TTAGCATAGT TAGCATGTTC TATATTTTQA ACTGCAATAT 4920

CATAGATACC TTGTTTACCA AGTGCTGACA TTGCAATTGA TGaCGcTAAA GCATTTAATG 4980 

CTTGGTTAGA ACAAATATTA GATGTCGCTT TATCGCGTCG AATATGTTGT TCACGTGCTT 5040 

GTAATGTTAA TACAAAGCCA CGATTACCTT CATCATCTTG TGTTTGACCX; ACTAATCTAC 5100 

CTGGCACTTT ACGCATTAAC TTTTTCGTCG TTGCAAAATA TCCACAATGT GGCCCACCGA 5160 

ATTGAGCAGG AATTCCGAAT GGCTGAGTAT CACCTACAAC AATATCTGCA CCAAATGAAC 5220 

CTGGAGGTGT AAGTAATCCC AATGCTAATG GATTTGCATA TACGATAAAT AATGCTTTTT 5280 

TATCrftCAAT AAAGCTATGA ATCTTTTCAA GATCTTCAAT TGAACCX3TAA AAGTTTGGAT 5340 

ATTGTACTGC AACAGCTGCT GTTTCATCAT CCACTGCTGC TTCTAATTTT TTCAAATCTG 5400 

TAACAGTGCC ATCTAAATCG ATTTCCACTA CTTCGAATTC CTTACGCGTC TTAGCATAAG 5460 

TATGAAGTAC TTGTAATGCT TGATAATGTA AACCTTTTGA GACTACAATT TTATTTTTCT 5520 

TTGTTTGACT AAATGCTAAG ATACATGCTT CAGCAAAGCT AGTCATCCCA TCATACATAG 5580

AAGAATTTGC TACATCCATA TCTGTTAATT CACAAATTAA AGTTTGGAAC TCAAAAATGG 5640 

CTTGTAATTC ACCTTGAGAA ATTTCCGGTT GATATGGCGT ATATGCTGTG TAAAATTCTG 5700 

ATCTTGAAAT CATAGCATCC ACAACTGATG GCGCGTAATG ATCATAAACA CCAGCACCCA 5760 

rAAATGATGT ATGCGTTTCT TTAGTGATAT tCTTGCTkGC AATGGGGATT TAAACnTCTA 5820 



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40 
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<2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18355 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67:



ATnATAATTG GCTTTGCTAA TAATTACTTC CCTGAATTAC aAGTATTAGC AAACGAAATA 60
AAATCTGATA TGGCTAGTTC ATTAAAACAA TGATATTTTT ATTTAAATTT TTaAAGCTTT 120
OTACGAAATT GTACAAAGCT TTTTTGGTGC GTATTGTATG GGCAACAACT TGACGATGAA 180
AATCCGTTAC AGGATTGGTA ATAGGAAATG TTAGCGAAAG ACAAGGGTAT CCATTGTAGA 240
TTAACAAAAG GACGTTTCCA CAAGTGTGGG TTATTCTCAC TAAAGCAATA CGCAGAGACA 300
ACTTACGTAA AATTTTGAAC TGACTAGAAC GGAACTTCTA CTCAATTATT GATAAAAATT 360
TTCAAAAAGA CTTGAATGTG CTGAGAATAC GAAGTTTATO GAAGGATTAT CAAAATATAA 420
ATGTGCATTC ATTTACAACC TTTATTGACA ATGATTCTCA ACTAATATAG TATATAATCA 480
AATCGTAATA GTTACGATTT GTTTTCTGCA ACTTTTTTGA AGTTTTAGTT GAGGTGAAAA 540
CAATAAAAGC ATCTAAGTGA ATGTAGTTAA CGGACAACTG CATTCGCTTG TAGAGCCACA 600
AGAAGCAACT TTAAATAAGG TTTACGGTTG CATTTTGATA CAACAACCGA TTACTAAGTC 660
ATGCTTTCCA CTTTGCGGGT TAGCATGACT TACCTAATAG ATAGAGCTAT TAGGTTCAGC 720
TTCTAAAAAA TTACAGTTTT AGAGGAATAC AGTTGcTTGc tTCGCAACAA CTGCATAAGA 780
GCCATGGTTT TCGCTTTTGC GAATTAGCAT GACTTACCTA CTAGATAGAG CTATTAGGTT 840
CATCTTCTAA AAAATTACAG GTTTAGAGGA ATACAGTTGT TTGcTTCGCA ACAACTGCAT 900
AAGAGCCTCT AGTAATTAAA ATTACAGAGG CTCTAAAAAT ACATCTAAAG GAGTGTCGTA 960
TGAATCGGCA GGTTATAGAA TTTTCTAAGT ATAATCCTTC GGGGAATATG ACGATACTTG 1020
TTCATTCAAA ACATGATGCT AGTGAATATG CATCTATCGC CAATCAGTTG ATGGCCGCAA 1080
CACATGTATG CTGTGAACAG GTAGGCTTTA TAGrATCAAC ACAAAATGAT GATGGTAATG 1140
ATTTTCACTT AGTTATGAGC GGTAATGAAT TTTGCGGTAA TGCGACGATG TCATATATAC 1200
ATCATTTGCA GGAAAGTCAT TTGCTTAAAG ACCAACAGTT TAAGGTGAAG GTGTCTGGCT 1260
GTTCGGATTT AGTGCAATGC GCAATTCATG ATTGCCAATA CTATGAAGTT CAAATGCCAC 1320
AAGCCCATCG TGTTGTGCCA ACAACAATTA ATATGGGTAA TCATTCATGG AAAGCAATAG 1380
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TTATGTGCAC CCACCACTAT TTATGAATGA 
AGATGTACCG GTTTATGTGT ATAAGTTATT 
CCGTGAAATG CGTTTAATGT GGAAGGAAAT

GTCAGTCAAC CTGCTTCAAT TTATGGTGAA 
GGATGAAGGT GATATTGAGC ATTTCGAAAT 

TTATGTAAGA TATACCGCAA TCCTCATTGA 
TTACTTTGAT TTTTCAGCTG TACCATTTAA 
TCAAATTCCA AGAATGCCAA GTGAAGATTA 

GAAAATGCTA GGTATCAAAA CGCCAATGAT 
TTGCCAGGCG TACAAGGATA TGCATCAAGA 
TCTATTTGAA GGAGATAAAG CACTCGTCAC

ATAATAAGGG TTTGAAGTTT TATAATAGAA 
ATAAAAATAA GCAAATAATT GAGAAAAATA 
TATCAATTTA GAAAGAGGAA AAGCAAATGA 
TTGCATCAGG GCTAATTTTA ACTGGTTGTG 
AAAACAAGCA ATTAACGTAT ACGACGGTTA 

ACGGTGGATC AATGTCTGCT GAAAGTATGA 
ATGGTATTAA GCCTTTACTA GCTAAAAAGT 
CGTTCCATTT GAGAGATGAC GTTAAATTCC 

GTTAAGAAAA ATATTGACGC AgTTCAAGAA 
TCGACATTAA TTGACAATGT TAAAGTTAAA 
GAAGCATATC AACCTGCATT GGCTGAATTA 
CCAAAAGACT TTaAAAACGG TACAAcAAAA 
CCATTTAAAT TAGGTGAACA CAAAAAAGAT 
TACTGGGGCG AAAAGTCTAA ACTTAACAAA

ACAGCATTCC TATCAATGAA AAAAGGTGAA 
ACAGATAGCT TAGACAAAGA CTCTTTAAAA 
AAGCGTAGTC AACCTATGAA TACGAAAATG 
GCTGTGAGTG ACAAAACAGT CAGACAAGCG 

CTTTTCATTG AAAGCCATTT TCGAAGGAAC 3300 

TCCTGAAGGA CCGATAACGA TGACACTAAT 3360 

GATGGTTATT TTACAAGCAT TTAGAGTGCC 3420 

GGAAAATTAT CCAGTACGTC CTGAAACTTT 3480

CTTGCCAGAT ATCTTACAAG AATATCTGCT 3540 

TCCATTTTCA CAGCCAGACG AAAACGGACA 3600 

GCAAGTCTAT AAAAATGAAC AGGATGTTGT 3660 

TTACAGAACG GCGATGATTC AGCATATTGG 3720 

TGATCAGTTC CTAACTCGCT ATGAAGCAAG 3780 

TCAACACTTA TCTTCTCAAT TTAATACAAA 3840 

AAAATTTTTG GAAATCAATA GAACGCTTTC 3900 

AAAAATTATT GAATTATGTT TGACATTTAC 3960 

ATCATTACGA TTTGATTAAG TAATGCAACT 4020 

GAAAACTAAC TAAAATGAGT GCAATGTTAC 4080 

GCGGTAATAA AGGTTTAGAG GAGAAAAAAG 4140 

AAGATATCGG TGATATGAAT CCGCATGTTT 4200 

TATACGAGCC GC7TGTACGT AACACGAAAG 4260 

GGGATGTGTC TGAAGATGGG AAGACATACA 4320 

ATGATGGTAC GCCATTTGca TGctGACGCA 4380 

AACAAAAAAT TGCATTCTTG GTTAAAGATT 4440

GATAAGTACA CGGTTGAATT GAATTTGAAA 4500 

GCGATGCCTC GTCCATATGT ATTTGTGTCT 4560 

GATGGCGTTA AAAAGTTCGA TGGTACTGGT 4620 

GAGTCTGCAG ACTTTAACAA AAATGATCAA 4680 

GTACAAGCAA AAGTAATGCC TGCTGGTX3AA 4740 

ACGAACTTTG CCTTCACAGA TGATAGAGGT 4800 

CAATTGAAAG ATACAGGTGA CTATCAAGTT 4860 

TTAGTTGTCA ATTCTGGTAA AAAAGATAAC 4920 

ATTGGTCATA TGGTAAACAG AGATAAAATT 4980 



ACAGACATTA ATTTCGATAT GCCAACACGT 
TTAGATGAAG CTGGTTGGAA GAAAGGTAAA 
AACCTTGAAA TGGCAATGTA CTATGACAAA

TACTTACAAG CAGAATTTAA GAAAATGGGT 
GATAAAATTG CTGAACGTCG TACTTCTGGT 

GGATTATTGT ACGATCCACA AAGTACTATT 
AGTGCAACAT CAGGCATTGA GAACAAAGAT 
AAAATCCAAA ACGGTAAAGA GCGTTCAGAC 

GATGAAGGTA TCTTTATCCC TATTTCACAC 
TTAGAAAAAG TATCATTCAC ACAATCACAG 

AAATAAAGGA 6CAATTAGAT GTTCAAATTT

TTGATGATTG TAGTAAGTTT TATGACATTT 
GCTGTGACAA TTTTACATGC ACAAGGGACG 

ACXIAATGAGA AGTACGGTTT CAATGATCCA

GAAGCGATGC AATTTAATTT TGGTACAAGC 
ATTGGTCCAG CATTTATGAA TACATTGAAA 

ATTACATCAA TTATTTTAGG TGTAGTTAGT

GCGATACGTT CAGTGGCTTT CTTTCTAACT 
CTTATTATTT ACGTTTCAGT GAAGTTAAAC 

GAAAGTTACA TATTGCCAGT GATCGTTATT 
AAT(HTAGAC GCTCGATGGT GGAACAATTA 
AGCGGTGTGA AATCTATCAC ATTAATGTTG 

GTATCAATCT TTTGTATGTC TATACCAATG 
ATCTTTGCAT GGCCTGGACT AGGTCAATTA 

CCAGTCATTC AAGCATATGT ATTAATTGTA

GCAGATATCA TTAATGCGCT ATTAAATCCA 
TAAAmCGATT ATTmCArGwT AAAGGTGCAG 

TCTTTTTAGG ATTAGCAGCA CCACTTGTGA
CAAACAAATT TGCTGGCATG AGTTTTCAAC 

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AAGTATGACC TTAAAAAAGC AGAATCATTA 5100 

GACAGCGATG TTCGTCAAAA AGATGGTAAA 5160

GGTTCTTCAA GTCAAAAAGA ACAAGCAGAA 5220 

ATTAAGTTAA ACATCAATGG CGAAACATCA 5280 

GAITATGACT TAATGTTCAA CCAAACTTGG 5340 

GCAGCATTTA AAGAGAAAAA TGGTTATGAA 5400 

AAAATATACA ACAGCATTGA TGACGCATTT 5460 

GCTTATAAAA ACATTTTGAA ACAAATTGAT 5520 

GGTAGTATGA CAGTTGTTGC ACCaAAAGAT 5580 

TATGAATTAC CATTCAATGA AATGCAGTAT 5640 

ATCTTAAAAC GTATTGCGCT CATGTTTCCA 5700 

CTA1TGACGT ATATTACAAA TGAAAATCCA 5760 

CCAAATGTAA CACCAGAGTT GATTGCAGAA 5820 

TTATTAATTC AATATAAAAA TTGGTTACTT 5880 

TACATTACAG GTGACCCAGT TGCTGAACGT 5940 

TTAACAATAA TTTCAAGTGT TATGGTGATG 6000 

GCATTAAAAA GAGGAAAGTT CACTGATCGT 6060 

GCATTACCAT CATATTGGAT AGCTTCAATA 6120 

ATATTGCCGA CTTCTGGATT AACAGGTCCA 6180 

ACGATTGCCT ATGCTGGTAT TTACTTTAGA 6240 

AATGAAGATT ATGTACTTTA TTTAAGAGCA 6300 

CATGTGTTGC GTAATGCTTT ACAAGTTGCG 6360 

ATAATGGGTG GACTAGTTGT TATCGAGTAT 6420 

AGTTTAAAAG CAATACTTGA ACACGATTTT 6480

GCGGTATTAT TTATTGTATT TAATACATTA 6540 

AGATTAAGGG aGGGCGCACG ATGATAATTT 6600 

TAATTGCTTT AGGCATTATT GTATTATATG 6660 

CATTTTATGA TCCTAACCAT ATCGATACAG 6720 

ATCTACTAGG TACTGACCAT TTAGGTAGAG 6780 
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TATTTGTTTC TGTACTTATT GGATCTATTT 
TTGTTGACGC CTTAATCATG CGTQCGTGTG 
TAACGTTAGC ATTAATTGCA TTGTTTGGAA

TTTTGACGCG TTGGGCATGG TTCTGTCGTG 
CTTCTGACCA TGTAAGATTT GCTAAAACAA 

AACATATTAT GCCATTAACA TTAGCAGATA 
CAATGATCTT GCAAATATCT GGCTTTTCAT 
CAGAGTGGGG CATGATGCTT AACGAaGCTA 

TGTTTGCGCC AGGTATTGCC ATAGTGATTA 
CTTTACAAAT TGCTATTGAT CCCCGCATCT 

AAGGAGTGGT GCAATCATGA CATTGTTAAC
GACAGATCAA CCACTCGTGA GTGATGTGAA 
CGTTATTGGA GAAAGTGGTA GTGGTAAATC 

TCCCGAACGA CTCGGGGTGA CAGGTGAAAT

ATCTGAATCG CAATTGAAAA AGTACCGTGG 
TAGTCGTGCC TTTGACCCAT CAACTACTGT 
ACATACGTCA ATGTCTACAC AAGAAATTGA 
AAGTTTGAAA GATCCTAAAC GTATATTAAA 
GTTACAGCGA TTGATGATTG CTTTAGCGTT 

TGAOCCGACA ACGGCTTTAG ATACAATTAC

TATT3UUUW^ CACTTTGACT GTGCGATGAT 
CAAGATTGCA GACCGTGTTG TTGTGATQAA 

TGAATCAGTC TTGCATCATC CAGAACATGT 
GAAGATTAAT GATCATTTTA AACATGTGAT 
ATGTTGAAAA GTCATATCAA AGCGCACATG 

AAGGTGTGTC ATTTGAGTGT CCAATCGGTG 
GCGGTAAATC GACGTTGAGT CktATGATAT 
TAACCTTAAA TGATCAACCG ATGCATAAGA
TATTTCAAGA TTATACGTCA TCATTACATC 



TAGGATTCTT ATCAGGATAT TTCCAAGGGT 6900 

ATGTTATGTT GGCATTCCCA AGTTATGTTG 6960 

TGGGTGCCGA AAATATTATC ATGGCATTTA 7020 

TTATACGTAC AAGTGTTATG CAGTACACTG 7080 

TCGGTATGAA TGATATGAAA ATTATTCACA 7140

TTGCTATCAT CTCTAGTAGC TCGATGTGTT 7200 

TTTTAGGATT AGGTGTCAAA GCGCCTACTG 7260 

GAAAAGTGAT GTTTACACAT CCTGAAATGA 7320 

TAGTGATGGC ATTTAACTTC TTATCCGATG 7380 

CTTCTAAAGA TAAACTTCGT TCTGTGAAAA 7440 

AGTTAAACAT TTGACGATTA CAGATACCTG 7500 

TTTTACATTA ACTAAGGGTG AAaCTTTAGG 7560 

AATCACTTGT AAATCGATTA TTGGTTTGAA 7620 

TATCTTTGAT GGTACAtCAA TGTTGTCATT 7680 

TAAAGACATT GCGATGGTCA TGCAACAAGG 7740 

CGGTAAACAA ATGTTTGAGA CTATGAAAGT 7800 

AAAGACATTG ATTGAATATA TGGATTATTT 7860 

ATCATACCCT TACATX3TTAT CAGGAGGAAT 7920 

AgcTTTgAAA CCAAAGTTAA TCATTGCTGA 7980 

ACAATATGAT GTACTGGAAG CATTTATAGA 8040 

TTTCATTTCA CATGATTTAA CX3GTTATTAA 8100 

AAATGGTCAG CTTATTGAAC AAGGGACACG 8160

TTATACGArt ATTkTATTAT CAACGAAGAA 8220 

GAGGGGTGAT GTACATGATT AAAATTAAAG 8280 

TTTTTAAGCG TCGTCX5AACA CCTATCGTGA 8340 

CGACGATTGC GATTATCGGA GAAAGTGGTA 8400 

TAGGTATTGA GAAACCGGAT AAAGGTTGTG 8460 

AGAAAGTGAG ACGTCATCAA ATTGGTGCTG 8520 

CATTTCAGAC TGTTAGAGAA ATCTTATTTG 8580 
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TGTTGGAAGA AGTCGGTCTA TCTAAGGCAT ACATGGATAA ATATCCTAAT ATGTTATCAG 8700 

GTGGAGAGGC GCAACGTGTT GCGATTGCGC GTGCAATATG TATTAACCCT AAATATATTT 8760 

TGTTTGATGA AGCCATTAGT TCACTCGACA TGTCAATTCA AACACAAATA TTAGATTTAT 8820

TGATTCATTT ACGTGAAACG CGTCAGTTGA GTTATATTTT TATCACACAT GATATTCAAG 8880 

CTGCCACGTA TTTATGTGAT CAATTAATTA TTTTTAAAAA CGGAAAAATA GAAGAACAAA 8940 

TTCCGACAAG CGCATTGCAT AAAAGTGACA ATGCTTATAC AAGAGAATTA ATAGAAAAAC 9000 

AACTATCATT CTAAGGAGTG AGATAATGAA AGGTGCAATG GCTTGGCCCT TTTTGAGATT 9060 

ATATATATTA ACATTGATGT TCTTTAGTGC CAATGCAATC TTAAACGTGT TTATACCTTT 9120 

ACGAGGGCAT GATTTAGGCG CAACX3AATAC GGTTATCGGT ATCGTTATGG GGGCATACAT 9180 

GTTAACAGCA ATGGTATTTC GACCATGGGC AGGACAAATT ATTGCTCGTG TCGGTCCCAT 9240 

TAAAGTATTA AGAATTATTT TGATTATCAA TGCCATAGCT TTAATTATTT ATGGTTTTAC 9300

TGGCTTAGAA GGTTATTTCG TAGCACGTGT TATGCAAGGT GTGTGTACGG CATTCTTTTC 9360 

TATGTCTTTA CAGCTAGGTA TTATTGATGC ATTACCAGAG GAACATCGTT CTGAAGGTGT 9420 

ATCATTGTAC TCGCTATTTT CAACX5ATTCC AAACTTAATC GGACCATTAG TTGCCGTAGG 9480

TATTTGGAAT GCAAATAATA TTTCACTATT TGCAAITGTC ATTATCTTTA TCGCATTAAC 9540 

AACAACATTC TTTGsTATCG CGTGACCTTT GCTGAACAGG AACCCGATAC GTCAGATAAG 9600 

ATTGAAAAAA TGCCGTTTAA CGCTGTAACT GTTTTTGCGC AATTTTTCAA AAATAAAGAG 9660 

TTGTTGAACA GTGGTATTAT CATGATTGTT GCATCGATTG TATTTGGTGC AGTTAGTACA 9720 

TTTGTACCGT TATACACAGT GAGTTTAGGA TTCGCGAATG CGGGAATCTT TTTGACAATA 9780 

CAGOCCATCG CAGTTGTTGC GGCAAGATTT TACTTAAGGA AATACATTCC GTCAGATGGT 9840 

ATG-ffiGCATC CTAAATATAT GGTATCTGTA CTATCATTAT TAGTAATCGC GTCATTTGTA 9900 

GTGGCATTTG GTCCGCAAGT AGGTGCAATT ATTTTCTATG GTAGTGCGAT ATTAATAGGA 9960 

ATGACGCAAG CAATGGTGTA CCCAACATTA ACATCATACT TAAGCTTCGT CTTACCAAAA 10020 

GTAGGTCGTA ATATGTTGTT AGGTTTATTT ATTGCCTGTG CAGACTTAGG TATATCGTTA 10080 

GGTGGCGCAT TGATGGGACC TATTTCCGAT TTAGTAGGAT TTAAATGGAT GTATCTAATT 10140 

TGTGGTATGT TAGTCATTGT AATAATGATT ATGAGTTTCT TGAAAAAGCC AACACCACGT 10200 

CCAGCGAGTA GTCTTTAATG AAGTGAATTA AAGCATATTA AGTTAATGAA TATTTAAATT 10260 

TTAAAAGGTA TATTGaGCAT GGCGATTCAT GTGCTTCATG CTAGGACATG AAACATTCTA 10320

TATGGCTCGT TTTTAGAACG ACAtATATCT AAATAAAGCA CGCTTArAAG TGAGTTTTGA 10380 
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TTACATGAAA ATATGCAAAA CGAGTATAAC TGCTAATTGA TAGAAATAGC TCACCATAAA 10500 

AITACGGTAT GATTTTAAAT ATAAGTAAGT CGCACTACCT GCTAGTATCA ATGCTGGAAT 10560 

GAATTCCCAC CATGTATTAA TGTATGGATA GTAGAACAGA GTTTCAAGGA TAATGGACAA 10620 

TACTATTGTA ATCTTTAAAG GTATTAATCT GCTTAATTCT TGAATTAAAA TATGACGGAA 10680 

AATAAGTTGA CAAATCAAAG TATTTAATAT AATGGTTAAC GAAAATATAG CTATTAAACT 10740 

GATGGAaCCA TACCCTTTAA TGAGCGGGTA AATGTCAAAG ACAGTAAAGG AATCTACATT 10800 

TAGTGCGAAA ATATTGAAAT GATTTAAAAG TAAAAAGAGT ACGACACTTA GTGTAAATGA 10860 

TATAAGAATA TGCCATTTAT ATTTAGCACT AGCAACGATT TGCGAACGTA TCATTGGAAT 10920 

AAACGCATCT TCATGCATCA GACGAAAAAT AGCTAGTGAA ATAATAACTG CGAGTAAATA 10980 

GCTAATGTTC ATTGAAATAG GAAAAGAOAA ACCCCACGGA GCTTGTTGAG TGAATACAGC 11040

TACTAACCCA AAAGTTAAAA AGACGATAAT GATCGGCAAG ATGTTAACCA AAAATATGTA 11100 

AAGGAAAATA AATCCAATAT CACX5TTTGAA AAAACGCGAT TGTTCGGTAG CGTATTCTTC 11160 

TTCTATGTAA TGTTTATTTG TATTTGACAT AGTATACCTC TTAAATAGTT GTATTATATA 11220 

GATACTTTAG CACATATTAC TTTGTATTGT ATGTTTTATA CATTAAAATT TAAAATGAAA 11280 

AACATATCAT AAAATTGTTT TATAAAATGA AGCGCTTCCA TTGTGTTTTG TTTTGTAAGG 11340 

TGTATCATAA ATATTGAATT GAAATTTTGG GGGGAGGTAT TGTAATGACG TTTCTTACAG 11400 

TCATGCAATT TATAGTTAAC ATTATCGTTG TAGGATTCAT GCTTACX3GTT ATTGTTATCXr 11460 

GGCTTATTTG GTTAATTAAA GATAAAAGAC AATCACAACA TAGTGTATTA AGGAATTATC 11520 

CTTTACTAGC ACGTATTAGA TATATTTCAG AAAAAATGGG ACCGGAAITA CGTCAGTATT 11580 

TATTTTCTGG GGATAATGAA GGGAAACCTT TTTCAOGTAA TGATTATAAA AATATCGTTT 11640 

TGGCfiSGAAA ATATAACTCT CGTATGACCA GCTTCGGTAC TACTAAAGAT TATCAAGACG 11700 

GCTTTTACAT ACAGAACACA ATGTTTCCGA TGCAACOTAA TGAGATTTCA GTAGATAATA 11760 

CAACATTGTT ATCAACATTC ATTTATAAAA TCGCGAATGA GCGTTTATTT AGTCGTGAAG 11820 

AATATCGTGT GCCGACAAAG ATTGATCCX5T ATTACTTAAG TGATGACCAT GCAATAAAAT 11880

TAGGTGAACA TTTAAAACAT CCATTTATTT TAAAACGTAT CGTAGGACAA TCTGGTATGA 11940 

GTTATGGCGC TTTAGGAAAA AATGCCATTA CAGCTTTATC TAAAGGTCTA GCTAAAGCGG 12000 

GCACTTGGAT GAATACAGGT GAAGGTGGCT TATCAGAATA TCATTTAAAA GGTAATGGGG 12060 

ATATCATTTT CCAAATTGGT CCCGGTTTAT TTGGTGTTCG TGATAAAGAA GGTAATTTTA 12120 

GTGAAGGTTT ATTTAAAGAG GTTGCACAGT TATCTAACGT ACGCGCATTT GAGCTGAAGT 12180 
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TTGCTAAAAT 
TTATTCATAA 
AACCAGTAGG 
CGATGGTGGA 
CTGGTGCAAC 
CTATTGTGTC 
CTGGTAAGTT 
TAAATATCGC 
TGAATACGTG 
TTGGAGAAAA 
ATATTGCAGC 
ATCGAAAAGT 
AACTTAATTA 
TATTAGTAGT 
TTAAGGTTTG 
CCATTTTATC 
TAATATCTTT 
CTTCTGTGTA 
ATTTTACGCC 
CGTCTTTATC 
CTTfATCATC 
ATGTTTTGGT 
CAATTAATGA 
GAAATTATAA 
TTGAAGATAT 
AAAATGATAA 
GAAATACAGG 
TGTAAACGGC 
GCCTGTCAGT 



CCGAAATGTT 
TGCTGAAGAT 
ATTCAAAATT 
ACTAGATAAG 
ATTCCAAGAA 
TGGCATGTTA 
AGTGACACCA 
ACGTGGGATG 
TCCTGTAGGT 
6CAATATCGT 
AGCTGTTGGC 
CGATGGTGAG 
TTTCGGGAAA 
GGATGCTGGT 
TAACCTTAGT 
TTTTTCTTTT 
ATTTTCTTTA 
ATCTATGTCT 
TTTAAGGTCT 
CATACCTAGA 
TTTATATGTG 
TTGTTCTTTA 
ACATAATTTT 
CATTTACTAA 
GAGTTTTTTT 
AGCGkTAGGG 
AGGATGAATA 
ATTGCTGGTT 
GATCAAATTA 



GAACCTTATA 
TTGATTCGTT 
GTAGTAAGCA 
TATCCAAGCT 
TTACAAGATG 
GAAAAATATG 
GATAAAATTG 
ATGATTAGTG 
GTTGCAACGA 
GTCACAAACT 
GTATCCAGTC 
TTACAAACGA 
TTGAAA6CAG 
CACACAAGAA 
CTTATCTGAG 
AAACCTCTGT 
GTCGAAACAC 
AAGTGyTCAA 
TT6AAAATAC 
TCGTCATATT 
ATAGAAGTTA 
CCACAAGCTG 
TTCAAAGTCA 
AAAATGATGT 
AAGCGGATTC 
AACGTTTTTC 
ACATGAATCA 
ATGAAATGCA 
TTGAAGATAA 



AAACAATCAA 
TCGTCGATCA 
AAGTTTCAQA 
TTATTACGAT 
GTGTTGGCTT 
GTATTCGAGA 
CGATTGCACT 
TCGGTTGTAT 
CAGATGCGAA 
ATGTAACAAG 
CTACAGAAAT 
TACATGATTA 
CGGATTTTAG 
CTTCAAATAT 
GQCATTTTTA 
GCTTTAATTG 
CAAGACGTTT 
TTGCTTTTTT 
TTTCAGATTT 
TAATTGTGTT 
GTACATGTTT 
ATAATGCAAT 
GTCGCCTTCT 
TATTCAAAAA 
CTCACAAAAT 
TGAAAGTTAG 
GTCAGTCAAA 
AGTTAAAGAA 
CTTGGGTGGC 



TTCACCTAAC 
GTTGCAGCAA 
AATTGAAACA 
TGATGGTGGT 
ACCGCTATTT 
TAAAGTGAAA 
AGGTTTAGGT 
AATGAGTCAA 
GAAAGAAAAA 
TTTGCATGAA 
TACTGCTGAT 
TAAATTAAAA 
CGTTACTGCA 
TAAAGCCCTC 
AGTTATAAAC 
CTTTTCAAGT 
ATTTAATTTT 
ATCTTTATAG 
GGCGAATAAC 
GATTGTAGAC 
ACCACTAACA 
GATACAAACT 
TTCGATATTT 
TTTAAATTTT 
TTTAAAAATA 
TGATACAATA 
TTACTTAAAC 
GCAATGCGTa 
ATTTTTGGAA 



C6TTACGAAT 
TTAGGTCAAA 
CTTGTACGTA 
GAAGGTGGTA 
ACAGCTCTAC 
TTGGCGGCAT 
GCAGATTTTG 
CAATGTCACA 
GCATTGATTG 
GGCTTATTCA 
CATATTGTAT 
CTCATTAGTT 
AATAATTTTA 
AGAATATGAA 
TATTTGTCGT 
TTTTCAAAAC 
TTCATGTCAA 
TCTACTTTGT 
TTTTTGGCTT 
TGTTTTAAAA 
TCACCWTCAT 
AATGCTACTA 
GTATTATAAA 
GTCATTTTTT 
TTTAAGCCTk 
GTTTTAAGTT 
ATTTAACAGA 
ACTATATAGA 
AGAAAAATGC 
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50 



AACAAAGATT GATAAACATG GTTTTATTTC ATTTACGCCA kTgGTGGATG GTGGAATCAA 14100
GTCATGCTAT CTCAAAAAGT AACGATTACA ACAGATTCGG GCAAAGAAAT TAGAGGTATC 14160
ATCGGTTCTA AACCGCCACA TGTCTTAACG CCTGAAGAAC GTAAAAAGCC AATGGAAATC 14220
AAAAATATGT TTATAGATAT TGGTGTTAGT AGCAAGGAAG AAGCTGAAGA AGCTGGCGTT 14280
GAAGTAGGCA ATATGGTTAC GCCATATAGT GAATTTGAAG TGCTTGCAAA TGATAAATAT 14340
TTAACTGCGA ArCATTTGAT AATCGCTATG GCTGTGCATT AGCTATTGAG GTATTAAAAC 14400
GTTTAAAAGA TGAAAATATT GGCATTAACT TATACAGTGG TGCCACAGTG CAAGAAGAAG 14460
TTGGTTTGCG TGGTGCGAAA GTGGCAGCGA ATACGATTAA ACCAGACTTO GCGATAgcTG 14520
TcGATGTAGG TATTGCTTAT GATACCCCAG GTATGTCAGG TCAAACGAGC GATAGTAAAC 14580
TAGGCXjGTGG TCCAGTTGTC ATTATGATGG ATGCTACAAG TATTGCTCAC CAAGGTTTGC 14640
GAAAgcATaT TAAAGATGTA GCTAAGGAAC ATAACATCGA AGTACAATGG GATACGACAC 14700
CAGGTGGAGO TACAGATGCG GGAAGTATTC ATGTCGCAAA TGAAGGTATT CCAACGATGA 14760
CAATCGGTGT TACGCTGCGA TACATGCATT CTAATGTTTC AGTGCTCAAT GTAOATGATT 14820
ATGAAAATTC TATCCGTCTT GTTACTGAAA TTGTCCGTTC ATTGAATGAT GAAAGTTATA 14880
AAAATATCAT GTGGTAATCA AATCCATAAA TAATAAAGAA TCCTTTTAAT ATGGTAGGTT 14940
GTTAAACAAT TGTCTAATTT TAATTCTTAG TCATTAGACA GTATCCATGT TAATAGGATT 15000
rrri ' GrmT AATTTAAATG CTGAAAATCA ATTATGCCTA AATTTTGATA TTACAAGAAA 15060
ATGATTTTTT CTTAAATGTA ATTGCACTAA AAACCAAAAA AACGGGAATA ATATACCTGA 15120
TATATTACAT GAGGAGCGGT GCAAATGTTG TTAGAAATTA AAGATTTAGT GTATAAAGCG 15180
AGCGATAGAA TCATACTAGA TCATATCAGT CTAAAAGTAG ATAAAGGCGA GAGTATTGCC 15240
ATTATAGGTC CATCAGGTAG TGGTAAAAGT ACATTTCAAA AGCAAATATG TAATTTGTTT 15300
AGTCCAACTA GTGGAGAACT TTATTTTAAA GGTAAACCCT ATAATGATTA TGACCCGGAA 15360
GAATTGCGTC AACGAATCAG TTATTTGATG CAGCAAAGTG ACTTGTTTGG TGAAACGATT 15420
GAAGATAACA TGATATTCCC ATCACTTGCA CGTAATGATA AATTTGATAG AAAACGTGCA 15480
AAGCAATTAA TTAAAGATGT CGGTTTGGGA CATTATCAAT TAAGTTCGGA AGTGGAAAAT 15540
ATGTCGGGTG GTGAGCGGCA AAGAATTGCT ATAGCGCGCC AACTGATGTA TACACCGGAT 15600
ATTCTTTTAT TAGATGAATC GACCAGTGCA TTAGACGTTA ATAATAAAGA AAAGATAGAA 15660
AATATCATTT TTAAATTAGC AGATCAAGGC GTGGCAATTA TGTGGATTAC CCACAGCGAT 15720
GACCAAAGTA TGCGACACTT TCAAAAGCGT ATAACAATTG TTGATGGTCA AATTTCTAAT 15780
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CATTCCGATT ATCATTTCAT ATAAAGAAGG TTTACATATT ATTAAAGATT TAATTGTTGC 15900 

GACATTACGA GCAGTTGTGC AATTAATCAT TTTGGGATTT TTGCTGCATT ATATTTTTAA 15960 

AATAAACGAT AAATGGCTGC TTATTTTATG TGTATTGGTC ATTATTATTA ATGCATCATG 16020

GAATACAATT AGTCGAGCAT CACCAGTGAT GCATCATGTG TTTTGGATAT CATTTCTAGC 16080

TATCTTCATT GGAACGGCAT TACCGCTTGC AGGTACTATT GCGACAGGGG CCATTCAATT 16140

TACCGCAAAT GAAGTTATAC CTATCGGCGG CATGCTTGCA AATAATGGCT TGATTGCAAT 16200 

TAATTTAGCT TACCAGAATT TAGATOGTGC ATTCGTACAA GATGGTACTA ATATTGAATC 16260 

TAAATTATCA CTTGCAGCTA CACCTAAATT GGCTTCTAAA GGTGCAATAC GTGAAAGTAT 16320 

TCGTTTAGCT ATAGTGCCAA CTATTGATTC GGTTAAAACA TATGGGCTTG TGTCGATTCC 16380 

TGGTATGATG ACAGGCTTAA TTATTGGTGQ CGTACCACCT TTACAAGCGA TTAAATTTCA 16440 

ATTGTTAGTC GTGTTTATTC ATACAACTGC GACCATTATG TCTOCTTTGA TTGCGACATA 16500

TTTAAGCTAT GGTCAATTTT TCAATGCAAG ACATCAATTA GTAGCACGAA ATACTGATGT 16560 

TAAGAGTGAA TCATGATAGA TTTTACTGCA TCAGATTTAG GCATTAGTTT TAATTGGAAA 16620 

TGAAGTGACG CGCACATATA GTATCX3CTAT TCATTAGCGC AGCGAAAATA TTCATAAAGG 16680

CACGCATACT TTGTAGTCAG TTATCTQTTC TGACATATAA AGCGTGOGTG CTTTTTTGGA 16740 

GTTATTGTTG AAACTGAAGT AATTATACAT AATTATTAAA TGACATACTT GTGTTAATTT 16800 

TTCAAATACT GAAAAACAAT TTCaATAATT TTCCaATTAA GCACAGAAAA TTAAAGCAAA 16860

ATATTATATA ATAGAACGGT TATATATaAA nATTngTgCA CACATTTTTT AATAAATCGT 16920 

TATTCTAAGG GAAATGAATA TCGGAAATTT TGTTTGAAAG GAGTTTTAAA TTGTCAATCA 16980 

TGCGACTATT TACATTCATT TTAAGTATTT TTATCGTAGG AATGGTTGAA ATGATGGTTG 17040 

CAG(^TTAT GAACTTGATG AGTCAGGACT TACATGTATC AGAAGCTGTC GTTGGTCAAT 17100 

TAGTGACAAT GTACGCTTTA ACATTTGCGA TATGTGGACC TATTCTGGTT AAATTAACXJA 17160 

ACCGTTTTTC ATCAAGGCCT GTATTATTAT GGACATTACT TATATTTATC ATTGGTAATG 17220 

GCATTATTGC TGTAGCGCCA AATTTTTCaA TATTAGTAGT TGGTAGAATT ATCTCATCTG 17280 

CAGCAGCAGC ACTAATTATC GTAAAAGTAT TAGCTATTAC AGCGATGTTA TCAGCACCTA 17340

AAAATCGTGG TAAAATGATT GGACTTGTCT ATACAGGGTT TAGTGGTGCT AATGTTTTTG 17400 

GTGTACCAAT TGGAACGGTT ATCGGCGATT TAGTAGGTTG GCGCTATACA TTTCTATTCT 17460 

TAATTATTGT GAGTATTATT GTTGGCTTCT TGATGATGAT CTATTTACCG AAGGATCAGG 17520

AAATACAACG AGGCCCTGTG AATCATGAGA CACCATCTCA TGAAAATCAT GTTACTTCGA 17580 
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CAAACTCAGT GACATTCGTC TTTATAAATC CACTTATTTT ATCTAATGGT CATGATATGT 17700 

CATTCGTTTC ATTAGCACTT CTAGTAAATG GAATCGCTGG CGTTATTGGA ACATCATTAG 17760 

GTGGTATATT CTCCGATAAA ATTACAAGTA AGCGTTGGTT AATGATTTCT GTTTCTATTT 17820 

TTATCGTCAT GATGTTACTT ATGAATTTAA TCTTACCTGG TTCAGGTCTA TTGTTAGCAG 17880 

GACTATTTAT TTGGAATATC ATGCAATGGA GTACTAATCC AGCAGTGCAA AGCGGTGTGA 17940 

TTCAACATGT TGAAGGCGAC ACAA6CCAAG TAATGAGTTG GAACATGTCT AGTTTAAACG 18000 

CTGGTATTGG TGTTGGAGGC ATTATTGGAG GCTTGGTCAT GACACATGTT TCTGTTCAAG 18060 

CTATCACATA TACGAGTGCC ATCATTGGCXi CATTAGGATT AATCGTTGTT TTCACATTGA 18120 

AAAATAATCA TTATGCTAAA ACATTTAAAT CATCATAATT CTCATATGAm AAGCACGCCT 18180 

GCTATCAAAT TCAGGTGTGC TTTTTTAOAT GCGATAACGT TATTCATATG TGCX5ATAATA 18240 

GCGACGTTCA TTATGATACA TCGGCCAAGG CATTTTACOG CTTTTAGCAA AATTAGCTAA 18300

ATCATTTTGC ATTTGTCGAC TTAAAAATTT AAGGTOaGCA GTTGTTGGaT ATgAT 18355 
(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1192 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 
CGCAAAGAAG TACAAAAAAT GTTTTTACAA GAAGGTATTA AAACACCTCA ACCAATTATG 60 

ACTGCTTATA ATCATAGTGA AAACGgTGTT TAGTAGTTTA TAATACATGG AGGTCATATT 120 
TAATGGCGTC AAAATATGGA ATAAATGATA TAGTAGAAAT GAAAAAACAA CATGCGTGTG 180 
GAACAAACCG TTTTAAGATT ATTAGAATGG GTGCAGACAT AAQAATTAAA TGTGAAAATT 240 

GTCAAAGAAG TATTATGATT CCACGTCAAA CGTTTGATAA AAAACTTAAA AAAATCATCG 300 
AATCTCATGA TGATACACAA AGATAGGAGA ATGATTAATG GCTTTAACAG CAGGTATCGT 360 

TGGATTGCCA AACGTTGGTA AATCAACATT ATTTAATGCA ATAACAAAAG CAGGTGCTTT 420

AGCAGCGAAC TATCCATTCG CTACGATTGA TCCTAATGTA GGGATAGTAG AAGTGCCAGA 480 
TGCTAQATTA CTTAAATTAG AAGAAATGGT TCAACCTAAA AAGACATTGC CGACTACATT 540 

TGAATTTACA GATATCGCTG GTATTGTGAA AGGTGCTTCA AAG6GAGAAG GGTTAGGTAA 600

TAAATTCTTA TCACATATTA GAGAAGTAGA TGCGATTTGT CAGGTCGTTC GTGCATTTGA 660 
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TAATATGGAA TTAGTACTAG CGGACTTAGA ATCTGTTGAG AAACGTTTGC CTAGAATTGA 780 

AAAATTAGCA CGTCAAAAAG ATAAGACTGC TGAAATGGAA GTACGTATTT TAACAACTAT 840 

TAAAGAAGCT TTAGAAAATG GTAAACCCGC TCGTAGTATT GACTTTAATG AAGAAGATCA 900 

AAAATGGGTG AATCAAGCGC AATTACTGAC TTCTAAAAAA ATGCTTTATA TCX3CTAATGT 960 

TGGTGAAGAT GAAATTGGTG ATGATGATAA TGATAAAGTA AAAGCGATTC GTGAATATGC 1020 

AGCGCAAGAA GACTCTGAAG TGATTGTTAT TAGTGCAAAA ATTGAAGAAG AAATTGCTAC 1080 

ATTAGATGAT GAAGATAAAG AAATGTTCTT AGAAGaTTTA GGTATCGaAG AACCAGGATT 1140 

AGATCgrTTA ATTAGGAmCA CtTATGAATT ATTAGGnTTA TCCACCATAA TT 1192 
(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7494 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69:

AATATAGCTG CAATAGCATC TCGTTTCATT TGTATAATCA ATTCCGGTTT TUUITATCAGT 60 

GTGAACGTAA GCACGACACA GATTAAAAAT AACACTGCCG GAATGAGTCG TTTCAATCGT 120 

CGCTtCCAAA ACTCTAGCAA ATCGATTTTT TGCGTCCGAT AATACTCACT TATCAACAAA 180

CTTGTTATTA AATAACCTGA AATAACGAAG AATGTATCTA CTCCTAAAAA GCCCCCACTT 240 

AACCATTGTG CATTCAAGTG ATAAATAATG ATTCCTATAA CTGCGAATGC CCTCAATCCA 300 

TCTAATCCAG GTAAGTATCG CGGGGAATAC ATTTTTTCTA AACGTTTAAA GTCTTTTGTA 360 

TCCAfcTTAA TAAACGCCCC ATTTATTTTT CTCTATTTTG TAGTATATCA CAATATTTTT 420 

GAAAATAAAA TATTGCACTG aTTTTCATTA ATTGATTTAA CCCTTAATTA AGATAGTTTT 480

AAATTTTTTA TTAAGTAGAA AACAATTATT ACAGTTGATT TCATTACTGC AAACCACATA 540

TAAATTTGTC GATTTTACTA CATAACATAG ATTATCATAG ATTCTTGAAT TTTTAGCAAA 600 

ATAACTGTTA TTTTCATTAT ATTTTTACAA AAAAAGGTTC GTTTTATATT TTATGCATCT 660

TACTGTAACA GAATCATTAA GATATGCTAT TCGAATATAC TTTTTCAAAA TTTATATAAT 720 

GAATAAATTA ACATGTATTG AAAAAAAAGC GAAATGCAGC CTATCCTCTA ATGTAAACCA 780 

AACGATATAT CTCGTCAGAC TTTATATTTA AACGCTATGT GTCACTTTTA AAATGAATAT 840

TACTAAGATT GTCATATCAA TTATTATTQC ATCGAATTAA TCTTTTAAAT TTCTGTAATA 900 
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ACGGAAGTCA TTATTAGAAT AAAAATACTG TGCACTAATA AATTTATCAA TTGTTCCTAA 1020 

ATAAATACCA TCGATATTTT GTTCTTTACA TGTCATTATA ACTTTATCTA AAAGTTTTTT 1080 

ACCTATTTTT AAATTCCTAT AACCTTTATC AACAAACATT TTTTTAAGTG CAGACATATT 1140 

ATTATCTAGT CTAATCAAAC CTATAGTACC AACAATATTT TGaTGATTGT TTATTGCAAG 1200 

CCAAAATgCC CTCCATTATT CAAATAGTTA TGTTCGATGT TCTCCAAATC AGGTTGATCA 1260 

TCTCTATCAA TTTTTATATa AATTCATTTT TTTGAATCGA TAAAATAAAC TCGATTAGCT 1320 

CTTCCTTATA AGACCTATTA TATTCAATTA TGTTTATAGC CATTTTTATC TCCTTTTTCA 1380 

TTTAATTTAA TTATAAAATG TGCGTTTAGT TTGTATCTAG TGTACTCAGT ACAGCCTCAA 1440 

ATGAAGTTTC ATTCCACTTG GCACTTAATA AAGACAAGTA TTTTAGCAGT AATACAATAA 1500 

AGTCCAATAA ATTTCCCTAA CTTCAATATC C^ICTTTTTAA AAAATGTATT TTTAATTAAT 1560 

AAAAAAACTC TCCCCAATTT CTATGGGAAG AGCTATATAT TTAATGTCTA AACATTACTT 1620

TTATTTATTA TGAAGGAATT AGAATCCCCA AGCACCTAAA CCTTGTGCTT TGTATGCTTT 1680 

AACAGCTGCXv TTGATTTGTT GGTCAACAGT GTTTGTTGGA CCCCAACCTG GCATAGTTT6 1740 

GAATAAACCT GAAGCACCTG ATGGGTTGTA AGCATTTACT TGACCATTTG ATTCACGAGC 1800

GATGATTGCA GCCCATGTAG AAGCTGAAAC ACCAGTACGT TGAGCCATGA TTTGAGCTGC 1860 

TGATGAACCA GTAGCACCTG CAGTATTACC ATTGCTTAAT CTCACTGAAC TTGAAGTAGT 1920 

TGAAGTGCTG TAGTTATGGT AAGTTGGAGC TGAAACAGCT TCAACGTtTG AGTTACTTGA 1980 

TTGTGCATTG TAGCTTACTG ATTGTACATT TGAACCTTGG TTGTATGAAG TAGTGTAGTC 2040 

TGCACCTGCA ACGTTTGAGA AACCAGCAGT TTGACCATTA GCTGCTTCAT AGCTCCATGA 2100 

CCATGTAGTA CCATTTGAAG TGAAGTTATA TTGGAAACCA TCTTTTACAA AGTGGATGTC 2160 

ATATCCACCA TCTTTGATTG GAGCTGCATT TAATTGATCT TGGTGATTAT GCGCTAAGTC 2220 

AACTAAGTGT GCTTGATCAA CGTTTACTTC AGCAGCGTGT GCTTGATGTC CTGTACCTGC 2280 

TGCGTAACCT GTTACACCTA ATGCCACTGC TAATGATGAT GCCATAATTG TCTTTTTCAT 2340

AGTAAAAAAT CCTCCAGTAA TAATTGTnAG TTTATGTTTT TAGTAATTAT AtTTTGaATT 2400 

TGAATGTCGT AGTgCAAGTT TAAATTGTCT TTTATTTCTT TCaACXSGTAC TCACTATATC 2460

ACAaAAAACC AGCCAGTAAA TTACACTTTC TTTACAAAAC ATTACAATAT CAAGTGTTAT 2520 

TTGtAATGTT GAAATATGGC TGTTTTATAC TGTAATGTGA AATATGTGCC CTTTAGAATC 2580 

CAATCAACCC TTGAAATAGT CTTTAACACA TAAGATTTTT ACTATATTTA GCTCAACTAT 2640

TACAGCTTTC GTAATATTAC AGATTGTATT TTTGTTACAT AGCTGTAATA TATCTGACAT 2700 
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TACACATGTA TTGATTGCTA TTATTGTTGT ATATTCAAAG TTTTAAAACA CACATCTTTT 2820 

GTGAATTGTC TTATCTTTTA TTAGCGCAAA TAAACTGCAG CTCAATTATA TTGTTCAACT 2880 

TCATTCTCGC AATTCACAAT AACATTAAAT AATTTTTGGT CTCATATTTT CAAAAAACAT 2940 

ACTGTTATTA TCCCATGAAT TTAAAAATAT CATTAGTATA TAAACGAAAC ACTTTACGAT 3000

AAATGATATC TGCAAGCCAA GCTGTTACAA ATGGTACAAC AAAGAACGCT ACTACAATTA 3060 

GTAAGACACT CAACCAAGCA GAATCAACCT CCATAAATTT AAATGCATTA ATCGGTCCTA 3120 

CCATTCCTAT AAAACCAAAT CCAGCTGACT CTTTCXnTCC ATC5AATACCT ACTAATGCTG 3180 

ATACCAAACC TGATACAATG GCTGTCGTTA ATATTGGTAA CATAAGAATT GGATATTTCA 3240 

CCATATTAGG TATCATCATT TTAACGCCTC CAAAGAAGAC GGATAACGGC ACCCCTAAAC 3300 

GATTCACTTT ACTTGTACCA ATTATCAATA CTGCTTCAGT CGCGGAGATA CX»ATTGACG 3360 

CTGATCCAGC TGCTAAACCT GTAATACCTA TCXSCAAAGGC AATGGCCACA GTTGATAGTG 3420

GCGAAATAAT AATAAGACTA AATACCATTG AAATCAAAAT ACTCATGACA ATCGGTTGTA 3480 

ATTCTGTAAA ACCATTAACC ATATTACCGA TGGCTGTTGT AATCATTTTC GTATACGGCA 3540 

ATATTAAAAC ACCAATTGCA CCTGAAATAC CGCCAACAAC TGTTGGGAAT ACAATCAATG 3600

CCATACTACC TACGCGATGT TGAATAAGTA AAATGAATAA CACTGCAATC GCTGCTGTAA 3660 

TCATTGTATT AATTAAATCA CCAATACCCG TAATCATCCA AGCACCATTT TTAAACTGOG 3720 

CTGCACCGCT TCCTACATAT GCTGCACTTG CCACAACAGC AATTGCTAAT GGCGATAGGT 3780 

CAAATTTCAT GGCAACCAAT GCACCAATCA AAGCAGGTAC TGTAAATTGA ATTGCAACGA 3840 

CAACGCCTAA TAACGTTTTA AAAATCGGAT GATAATCCAT AAAGTATTTA AAAATTTCTC 3900 

CAAGTATCGC ATTAGGAACT AAACCCGCAA CAATACCTAT GGCGACACCT GATAAAACTC 3960 

TAAMATAAA ATCTTTGGGT GTAATTGTTT TAATTGATGT CATAATATCA TCCTTCCATT 4020 

TATGTATATA CATCTGTATG CAAATAATAA AGAGCCTTAA GTTATAAGCT GCCACTAGCT 4080 

TAAATTCTAA GATGTGCATG CCGATGTTGT TATATTTAGG CTAGCAGTAT CATCTATAAC 4140

TCAAGACTAT GAAAAATAGT ATATCACAAA ATTCTGAATT TTTAGATAAA TAAATTGGCA 4200 

ATTTTTCAAA CATATTGTTA CAATACACTT TTATTTTATC TTCATTTTTA AAATGCATTA 4260

ATACAATAGA AGAAAGACAT TCAAATGCTT ACCAAAAAGG TACATTATTT GTTAGGAGCG 4320 

TATCAGCaCT TACATATCAT CAACACAATT GACAATATAA TAGAAGATAC TGATAATAAG 4380 

TGTTAAAACA ACAGATGTTA GGTAGTGAAC AAATGATGGA AAGTAAATCC ATAGATCCAA 4440

GAATCGTTAG AACCAAACAA TTGCTTGTCG ATGCTTTTCT TAAAATTTCT AGAGAAAAGA 4500
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TTTACGCTCA TTTCGCTGAT AAAGAAGACC TCCTAGACTA CACATTATCT GTAACCATTT 4 620 

TAAAAGACTT GAATGATAAT TTGAGCATTT CTAATGTCAT TAATGAAAAG GTTCTGCGTA 4680 

ATATTTTCAT TTCAATTGCG AGTTATATCA AAGATGCTGC AAAGTCTTGC GAATTAAATA 4740

GTGAAGCATT TTGCAACAAA GCACATCAAC GTATTAATAA TGAATTAGAA GATATTTTTG 4800 

CGATTATGTT AGAAAACAGC TATCCGGAGC ATCAACGAGA TATCATTGTA AATAGTGCGA 4860 

GTTTTTTAGC AGCTGGTATC TCAGGCTTAG CATTACATTG GTTTAACACG AGTCAAGAGA 4920 

CAGCCGATGT GTTTATCX5AT CGCAACCTTC CATTTTTAAT TCATCATATA GCACATTTTT 4980 

AATAAAACTT GGTATTTAGT CATGCATCTT GAAATCACTA TGTGACTTAG 6TTCATACTT 5040 

GTACACACAA TAAAATTTAA CGTATTACGA TTGATTAGCC GTGTCTAGGA CATAAATCAA 5100 

CGTCCTATAC TCTACAATGT CATATTAGCA GTCGTTAACT 6AATGAAAAT AAGCTTGTCA 5160 

20 TTAAAACATA TAGATTTTAG TGACAAGCAT yiT i U ' lTm ' GCGTACTTAA ACAACACTTC 5220 

AGGCAATATG TTGTTTAGGC AACAAATGAT ATGTGCGTGT TTATTGGCAA ACGTACGACT 5280

TAGTAGTATA GTATGTCTAA ACAACATATG TTGCATAGTT GATATGCX3TT GTTTAAATAC 5340 

TAAGATAGGA GGGATTGACG TGAGCGAGAC AGATGAACCT CAGGGGTTTG AACGCACGCA 5400 

TAATATATTA AATATTAATC AGAGTAGTCT GGGTGTAGTG ACATACATTA CAAATAAATT 5460 

AAAGTCGACG TTGAAGCAAC ACATAATAAT TGCTCGTGGT AAAAAGCGAA TCGACTATCX5 5520 

ACTGTCGTAT AACTTTTACA TACGTATTAT GATAATGTAG AAATCAAGAA AATCGACTGT 5580 

GAATATACCT ATGCTATGCC CATTGCAATT TTAATAAGAC ACACGATGTC ATTCGACAAT 5640 

GCTCATTTCT TTGCTCAGTT ACGTCATCCT GTCTTATAAA ACAACATTGC AGACATGTAT 5700 

ATCAAACGAC ACTTCAATAA CATCACTTTG CCcATCGTAC TACTAGTAAA ATCGTGTCTC 5760 

AAATCCCTTA TTTTAATTCC AAAAAtCTGC TGGTCAAAAG ACCGAGAAAC TAAAAACATT 5820 

ACTTAATGTG TTGATAAATT ACCATATAAA AATAATCTCA AAATATATCA ACACTTGATT 5880 

CTAAGGAGGA TATGACAATA TGAAAATTTT AGATAGAATT AATGAACTTG CAAATAAAGA 5940 

AAAAGTACAA CCACTTACTG TAGCTGAAAA ACAAGAACAA CATGCATTGC GTCAAGAcTA 6000 

CTTAAGCATG ATCCGAGGAC AAGTATTAAC AACATTTTCC ACAATAAAAG TGGTT6ATCC 6060

AATCXJGTcAG GATGTCACAC CAGATAAAGT TTATGATCTT CGCCAACAAT ACGGTTATAT 6120 

TCaAAATTAA tATTTGCTCA CX3AGGTATTG CACTTAAGGT GCCAACTGAC CTCATAAACA 6180 

AAGCCCATAC TGATTQAAGA CACTAATGTG tCsaCCATGG TGCACATTAC GCTTCATCTC 6240 

TGTATGGGCT TTTTATTTAT TCTTTTGAGA ATTTCATTTT AGCAGACCAA AAAATTAAAA 6300 
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25 



TGAACGACTG TGCCACCCGC TTCTTTCACT TTATTCACCA ACTGGTCAAC TTCTTCATTT 6420

GTGTTCACAC CTAGAGAAAT CATCACTTCA TTTGGTTCAG TATTAAGGCT TTGCTGACTT 6480

ACATTTTGAA AATGCTTGTn TTCTATTAAA ATTACGGkTG tTTGACCTAT tTGAATGCCG 6540

ACCATTTTAT CTAACATTTG TGGGTTTCTA TTTATTTTAA ATCCTAACGC TTTATAAAAC 6600

TGTGCGCTCT TTTCTAAATC TTGCACATGC AAATTAAACC ACATTGATTG AATCATQATT 6660

GCACCCCATT CATTACTTAT TATAGTTTTG GACTTTAAGC CAATCACTTA ATGATAATCT 6720

           ATTTCAGCCA TTAATTCAAA GTCTACTTCA TAACCTTTTT CTTCCAACCA 6780

TTGCTTTTCr GCAACACCAC TAACAAATTC TCCTTCTATA ACAGTAGATT TACCTGTCAC 6840

TTCACTAAAA ATTGTTGCTG CTTCACTTAA TGTAACTTCA TCGGAACCAA TCTCTATTGA 6900

TTGATGOGTA AAGCTTTGTG GATGTGCAAA AATATACGAT GCAATTTTAG CTATATCAAT 6960

AGAAGAAATC ATTGTGAATT TTATATTCGG ATTAATAAAT TCTGGTAATG TAATACXSTTC 7020

ATCTTCXjACT TTAGCAATGC GTAAAAAATT ATCCATAAAG AATGATGGTT TGATAACTGT 7080

TGCATTTATA TTAGATTCCA TTAATCTATT TTCTATTTTT GCTAGTACTT CAAAGTGTGG 7140

gccagttcga TTTCGATTAA CCCCTCCCGC AGTACTATAC ACAATATGTT GAATATTTTC 7200

TTGCTCAGCT ATTTCAATTA TCTTCATACC TTGTCTTAAT TCTTCGCTAA CATCATCTTT 7260

AACGATTGGC TGAATACTGT ATAAGCCATA CTTACCTTTC ATCGCTGATT GCAAACTAAC 7320

ATTATCACTC AGATCACCTT CArCGATTGA TAAATGCGGA TGTCCTATGT CTGAAAGTTT 7380

ACXSATTATnC TTATTTCTAG TTAATGCACT TACATACCAT CCATCCTCTA ACAACTGTTT 7440

TACAACTGCA TTACCTTGCT TCCCTGTTGC GCCTATTACn AAAATATCTT TCAT 7494

(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 11802 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70:



AATTTATTTC GCCGTCCCAC CCCAACTTGC ATTGTCTGTA GAAATTGGGA ATCCAATTTC 60

TCTTTGTTGG GGCCCCGCCC CAACTCGCAT TGCCTGTAGA ATTTCTTTTC GAAATTCTCT 120

GTGTTGGGGC CCCTGACTAG AATTGAAAAA AGCTTATTAC AAGCGCATTT TCGTTCAGTC 180

AATTACTGCC AATATAACTT CGTAGATCAT AGAACATTGA TTTATTTCCC AGCCTATTCT 240
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AGCAAAGGTA ATAATGATAT TAATAATGTA 
TAAAACATCA GAACCACTAA AAACAAAAAA 
^ GACCACTTTT CAAAAAAATC TCtTTTCaTa 

TTATATTCTC TTTTAAGTTT ATTATTCAAA 
ATAAACATTT CAACTACTTT TAAAAACCAA 

10 

ATAAGTGAAC ATAGTTCTTT AGTTATAATA 
GCAATTGGTT TTCATTTCCT CTTAAAGATA 

15 CTATATTTTT CAACTTATCT CTATATTTAT 

CCTdTCTTC GTGAGTTAAT AAATGAAGCA 
TTAAATTCG6 TTTTAAAATA TGCAAATCAT 

^ CTCGTTTTAA TTCAATTTCC ACACGCCATA 
TATCTTTACX5 TTCTTGTTTT TATTATAAAT 
AAAATATTTT GTTTCTGGTT TTACATTACG 

25 

ATCTGACAAT GCATAATAGT CATTTAAATC 
CGTAAAACTA ACATCGTCCA AATAACTGAT 
ATGCGAAAGC TTATTAGGAT TAAATTCAAC 

30 

TTTATTTTGT CATATTCAAT ATAAACTTTT 
TGTAAAATAT CCCAAAGCCG AATTTCAGGA 
35 GCGTTAGACA TGCTAAGATT CCCAACAATC 
GCTAGTGACA TCCTATGTCX5 ATTTAACCGG 
ACAAATGGAT GAAACGAAAT TCAAAACACT 

40 

TACCATTATG TTCCTACTAA AAAACyAAAA 
TAGGATACTA TGTAATAAAA ATTTACAATA 
GmATACCCAT ACAAAGAGGA TAAAATAAAA 

45 

CTCGAGGTTT AAATATTGGT GCCTTATTTA 
TCATTAACmT AATCCTTAAA GAGTTTTAAA 
SO TCATCAACTT TTAAATAATT CAATAATTTT 
AACTTTAATA AACTATTCAT TTTGACAGGA 
AATACTTTCT CGCTTTAnAC AAAnACAAAA 
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CAAAAAATAT AAATCAAATC GACATCCTTA 360 

GCACAAAATA AAATTAAATT TAAAATAAAC 420 

TTTCCACCCC TAATTTTAAT AAGCATTATT 480 

AGGAAAACAG AAATATCTTT CaATATTATT 540 

GAAAAAAATA CTTATTTTAA GTAGATGAGC 600 

ATTAATTCAA CCAAAAGTCG ATTTGTTTTT 660 

TTTTCATTAA ATCTGTCAAA TCAATAGACG 720 

TTTTAGTACG TCTTTCTAAA TTTCCCCATT 780 

TTGCTCGTTC TTGTATATTT TCAATCATTT 840 

CAAAACAATC TTTCCAACAA TCAACCATAT 900 

GAAATGTTGA ATCAATTTCA ACATCTGCAT 960 

CCGAATAAAC CTATCACTAT TACGCACACC 1020 

TCCATAAAAT ATAGTTTTCT TTACCGACTT 1080 

AAATTCAAAA TCAAAAGCCA AATCTAATCT 1140 

GATATTTTGT TTTAACCAAA GCACTTCATC 1200 

GCGCATAtAC GTCTATTCCA AAGAGTTGCT 1260 

TCTTTAAGAG CTTTAGCTTT AAAGTTTGTT 1320 

TTAGTACTCA TAAAATGTGA AAGTCTCTCT 1380 

GTTATAGCGT CAAAAGACAA TTTTGGAATA 1440 

CTATTACCGG ATATTAGAGT ATCCAGTTTT 1500 

AAAAAATATG TTCCACTAAC AGCAAAAAAA 1560

ATACTGGAGA ACAAATGTCA GGATATAACT 1620 

AAAAAACAGG AAAACAAATT TCAAGTAAAA 1680 

AACCTCGAAC TGaAATGATG ATCTTTTCAG 1740 

TATAGATTCG TTATATTATA TTCTCTATTT 1800 

TTAATACCTG CTAGATGATT CAAAAATGTT 1860 

TGTGGTGTCA GTAAATnTCT ATCAAAATAC 1920 

CGTGACATTT CAATCACGTC GTCTAAAGAT 1980 

ACTTACCCGA TTAAAATCAA GTAAGTTTTA 2040 
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TATTTGATAA AAAATCAATA AGTAATTGTG CGCCTTCAAC TTGAATATCT TTTACAACTG 2160 

GCGCGTCGAT ATACATATCA TACTGACCAC CGCCTACTGC ACGATAATTA TTTACACAAA 2220 

TTGTATATGT CTGCTTTAAA TCAACTGCGT GACCTTGAAT CATCATATTG CTCACACGTT 2280 

GTCCCTTTGG TCTTCCAACA TGAATGGTAT AACTTACGCC ACCATATATA TCATAATTAA 2340 

AGTGTTGTGG TTTGGGTTCA AGGAAGTCTG CGCTCACACT AACTTCATCA TTTTTCACGT 2400 

CAAAATATTC TGCTGATCGT TCAATGGCTT CTTTAAGTTT GGCACCACTT ACAGCTAAAA 2460 

CTTTAAATGT ATTTGGAAAT GGGTAATTGT TAATAACATC TCGCATCGTC ACX3ACTTGCT 2520 

TGAAACGACT AGCAGAATCA AACAAAGCTG TACAGGCAAC ATCTGCGTCA CTTTTTTCTA 2580

ATAAAGCGTA ATTCATAAAA TTTGTAAAAG GATGCGGTGC CACACGTGCC TCAAATGCAT 2640 

GATTAATCGT CATATCATAT GGCAATGTAG TAATTTCGTA ATCTAACCAG TCCTCTAACT 2700 

GCTTTCGTAA ATGTTGGTCA TCTTCATCAA TAGTAAATGT GGAATCATCT ATAACAGGAA 2760 

GTAATTCACA TGATTCAACG GATAGATTTT CATATTCATC AGTACTCAAG ACTACTCTGC 2820 

CTACAGTTGT ACCTCTCGTA CCAGGTTGAA TCACAGCCGT TTGCTTAAAC CTTTCAGCAA 2880 

TTTGTCGATG TTGGTGACCC GTAATAAAGA TATCTATATC TTTAGAAAAC GCTTCTAACA 2940 

TGGCATATCC TTCATTTTCA CCCGTTAATA CTTCGGTCGG CGTACCACTT TCTAAATCCT 3000 

TTTCAAATCC ACCATGGTAA CAAACCACAA TGATATCTGC ATGTCGCTTC ATTTCAGGTA 3060

AGTATTGTTG AAGTATTTCA AAAGCACTAT GAAACXSTArT GnCnTGAATA TGCTCTGGTT 3120 

GTTCCCAATG GGGAATAAAT TGTGTCGTTA AACCTATCAC ACCAACAGTT TGATCTCCAA 3180 

CCTGAAAATA CTTCACACCG TTATCAGTCA ATGTACTATC ATTTTCATAT ATATTAGCX3C 3240

ACAAAACTGG ATAATTGAGT CTGCGTAAAG TGTCTTTTAA GTATGGTAAT CCATAATTAA 3300 

ATTCATGATT ACCAAGCGTA CCAAAGTCGA ATGCXATTCG ATTATAAAAA TCAACT7VAAG 3360 

GCTGGCTACT GCCGCTATGC GCGATTAAGT AATTACAAAA TGGTGACCCT TGCAAAAAAT 3420 

CACCATTATC TATTTTAAAA CTTTGGTCAT ACTGCCTTCT GTsTTGTTCT ATAACATGAT 3480

TCGCTAGTAA CAATCCCATA GGTTGATATT GATTTCTACT CGTAAAATCT GTTGGGAAAA 3540

TATAACCATG TACGTCACTC ACGACATAAA ATGCTATGTT TGACATCCTC ACTCACTCCT 3600 

TCAATCACAA ACATCTTTCT TATTTCTATT ATATATTTAT TTGAAGTCTG TTGTAATCAA 3660 

GGTTTTGTCA CCGAGTTTTA AACGAATCTT TGAACCTTCC ATACTTTCAA GTACTTTAGC 3720

ATTGACCTTA ATTGTGACAT TTCCGTTTTC ATCTGCTTTA ACTGTTGGCA AAGTACTGTA 3780 

ACCTGGTGGG TTATAATCGT TATCTTTACT TGAAAATTGT CCGATTTGAC GTCCGCCTTC 3840 
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TATTGTCATT 


TCAAATGGCT 


CATTTACAGA AACATTTTGC GGGATATCAA 


ATGTTACTTT 


3960 




TTCGTTCTGA TTTGGTGGTG TATGATCATC TGGTGTGTTT GGCTGAGGAT CTGCGCCTTT 4020

TTCGCTGCCA TAACTACCTG CTTTAAATGT TGTTGGATCA TACCATTTAT AACCACTCGG 4080

CGGTTGTGAC CATGGCTCTT TTTCAGGCTC AGTTGAACGC TCTGGTCGTT CAAAATCAAG 4140

CAACTTAGTC TTTGTATCTA ATGTTAGGCT ACTCGCCTTA AGTGATTTCC CATCATTATC 4200

TTTAGACATC CAAGCCGTTA TATTATTTAA TAGCTTACCG TTGTCTTGTT CTTTAAAACC 4260

ATCATATGTT TTCTTCmT CTCCATTATC TTCTCTTACA TATTTGGGCG AACTATCTTC 4320

CACAAGTGAT GAATCACCGA TAAATGCTGC TTTACCTTTT CCAACTTTAG AAATTGCTAC 4380

ATAGGGGCCT TCTGCTTTAC CX5CCCCCATT ATAAATACCT TGATCTACAG CATGTGACCA 4440

TTTACTTTTC GCTGGCAATT GTTCTGGTGT ATACACAATA CCTTTTGCTT TCTCTGGATT 4500

AGTAATTGCT AATGTCGATC CGGCATGCAT AGAGACAGAT TTCACACCTT CAGTAATACC 4560

GAAACTTTCT TTTGAAGAAA CAATATTGCT CGTATTTAAA TCACCTAGTG CATTATATCG 4620

AAAACGTACG CCAAAGTTTG TAGATAACCA ATCTGAACTT TTCACACCTT GCATTGCAGT 4680

AGAACTTTTT TCTTCTGCAT TCATACCTTT CGACATATCT TCATATGCTC CACGTCGATA 4740

ACCATTCATT GCCTCCGATG AATCAATACG ATTTAAATTT CGGTCAGCAT TGTAATGATC 4800

TGAAATAAAG ACAACATTGC CACCTTGTTt CACATATTTA ACAATTGCTG CCTGTTCTGA 4860

TTCTTTGAAA GGAATGTTAG CCTCAGGAAT TACAAATATT TTGGAACTTT TCAAACTTGC 4920

TTCTGTTATG TTCGAATGAC CATCAATAGC TTTAACGTCA TAACCTTGTT TTTGTATTGA 4980

ATCCGCATAA TCTGAAAATG CACCATCACT AACCCAATCT GCAGCACCAG CTGTTTGACC 5040

ATGAGAA06A TCGAATAATA CCGTTCGCTG TTGCTTTGTA OGTTGCGATT CATGCGTTAT 5100

AGC»AAGAT TGCGGTAAAG CACTTAATGA TACCGTTGCA ACAATTGCAG AGACAGTTAA 5160

TGACTTATAT ATTTTTTTCA TTTTGTGAGG CTCCTTTTAA AATAAATTTG TTCTTGAATT 5220

ATAGGATAAA AATTCGTTGC ATATGAGCAA TTTAACGAAA AATTTACAAA ATCTTATCAA 5280

ACTCTTAAAG AAAGTTATTA AAATTCATTT TTATAAAATA CTTTTTAACA TTTAAATGTG 5340

GTACGCTATA AGTGTAATTT CATTGCATAC ATATTACACG ATTAAGAATG TGAAGGGGAC 5400

AGTTATCAAA TGAAAAATTT TAAGTGTTTA TTTGTATTAA TGTTAGCAGT CATTGTTTTT 5460

GCAGCAGCAT GTGGAAACTC AAGTTCTTTA GATAATCAAA AGAACGCTAG TAATGATTCG 5520

GATTCTAAAT CAGGAGGATA CAAACCTAAA GAATTAACCG TTCAATTTGT ACCTTCGCAA 5580

AATGCTGGAA CATTAGAAGC TAAAGCAAAA CCATTAGAAA AATTACTATC TAAAGAATTA 5640
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TCTAAAAAAG TTGATGTTGG TTTCTTACCA 
AAAGCAGCTG ATTTATTATT ACAAGCACAA 

5 

AGTAAAGAAC TTGTAGATAG TTATAAATCA 
AAAAGCTTGA AAGATTTAAA AGGTAAGAAA 
GGATATACAT TCCCACTTGC GATGTTAAAA 

10 

ATGAAAATTG TGAATGTTAA AGGTCATGAC 
GTAGATGCTG CGGCTGTATT TAACGATGCA 
IS GTATTTAAAG ACACACGAAT TTTAAAATTA 

GTAAGACCAG ATATGGATAA AGATTTTCAA 
GCTAAATCAA AAGAAGGTCA CAAAATTATT 
GAAACGAAAG ATTCAAATTT CGACATTGTA 
AAATAATCAT TATTTAACAA ATGAATCATT 
ATAGATATAT TCTAGATTAA TATTGAAAAG 

25 

GGTGTCGCTG TACATTTGAA ACCATTTGTA 
TTTACCTTGG CTTTTCTTTA TTAAAGAAAO 
AACGTCAGTA AAGTCTATCC TAACGGTCAT 
GAAAAAGGTG AATTTGCAGT TATTGTCGGA 
AGATCTGTAA ATCX3TTTGCA TGATATCACG 
35 ATCACTAAAG CCCATGGTAA AGCATTATTA 
CAACATTTTA ATTTAGTTAA ACGGTCAAGT 
GGniTCACC CTACTTGGAA AATGGTATTA 

40 

GCAATGGATG CACTAGAACG CGTCAATATC 
TTATCAGGTG GCCAACAACA ACGTATATCT 
ATTATTCTTG CAGATGAACC AGTTGCTTCA 

45 

GATGATTTAA GAAAAATCAA CCAAGAATTA 
GTTGACTTGG CAAAAGAATA TGGCACACGC 
SO TATGATGGTC CTGCATCTGA AGCAACAGAT 
ATTAAAGAAG ATGAAAAGCT AGGAGTGAAC 
TGACTCCCTT TTAAAGAAAA AGGTTTCTTT 
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CCAACGGCAT ACACATTAGC ACATGATCAA 5760 

CGTTTCGGTG TAAAAGAAGA TGGTTCAGCA 5820 

GAAATTCTTG TTAAAAAAGA CTCAAAAATT 5880 

ATTGCCTTAC AAGATGTAAC ATCAACTGCT 5940 

AACGAAGCAG GTATTAATGC AACTAAAGAT 6000 

CAAGCAGTTA TCTCATTATT AAATGGAGAt 6060 

CGTAATACTG TGAAAAAAGA CCAACCAAAT 6120 

ACACAAGCTA TTCCGAATGA CACAATTTCT 6180 

GAAAAATTGA AAAAAGCTTT TATAGACATT 6240 

AGCGAAGTTT ATTCACATGA AGGATACACA 6300 

AGAGAGTACG AAAAATTAGT TAAAGATATG 6360 

AGCGAATTTG GTATTAAAAG CTTTCGTTCA 6420 

CTAGGCGCTA AACTGAAACA GATATAGAAA 6480

CACAGAAACC CAATGTCTAT GATATTTCAG 6540

GTGTCAAACA TGAGTCAAAT CGAATTTAAA 6600 

GTAGGCTTGA AAAATATTAA CTTAAATATT 6660 

CTATCTGGTG CTGGGAAATC CACGTTATTA 6720 

TCAGGTGAAA TTTTCATCCA AGGTAAATCA 6780 

GAAATGCGCC GAAATATAGG TATGATTTTC 6840 

GTATTACGAA ATGTACTAAG TGGACGTGTA 6900 

GGTTTATTCC CAAAAGAAGA CAAAATTAAG 6960 

TTAGATAAAT ATAATCAACG CTCTGATGAA 7020 

ATTGCACGTG CGCTATGCCA AGAATCTGAA 7080 

TTAGACCCAT TAACTACGAA ACAGGTTATG 7140 

GGCATCACAA TTTTAATTAA TTTACATTTT 7200 

ATCATTGGTT TACGTGATGG TGAAGTTGTC 7260 

GACGTATTTA GTGAAATATA TGGACGTACA 7320 

TAACATGCCT TTAGAAATAC CTACAAAGTA 7380 

AAAAACGAGT TTTACCTTCA TGTTAATCAT 7440 



EP 0 786 519 A2 



AATACCTCAA ATAGGTGATC TATTCAAACA AATGATTCCA CCTGATTTCG AGTATTTACA 7560 

ACAAATTACA ACGCCAATGT TAGATACCAT TCGAATGGcT ATCGTAAGTA CAGTATTAGG 7620 

TAGCATCGTT TCAATACCAA TTGCGTTATT ATGTGCTAGC AATATCGTTC ATCAAAAGTG 7680 

GATTTCAATA CCCTCGCGCT TTATTTTAAA TATAGTTCGT ACTATTCCAG ATTTGTTATT 7740 

AGCAGCAATC TTTGTGGCTG TATTTGGAAT CGGTCAAATT CCAGGGATAT TAGCACTGTT 7800 

TATTTTAACT ATCTGTATTA TTGGAAAATT ATTATATGAA TCATTGGAAA CGATAGATCC 7860 

AGGTCCAATG GAAGCAATGA CGGCTGTTGG CGCTAATAAA ATAAAATGGA TTGTTTTCGG 7920 

TGTTGTACCA CAAGCCATAT CGTCATTTAT GTCATACOTA TTATATGCAT TTGAAGTAAA 7980 

TATACGTGCT TCAGCTGTOC TTGGATTAGT CGGCGCTGGC GGTATTGGAT TGTTTTATGA 8040 

TCAAACACTT GGTTTATTTC AATATCCAAA AACAGCAACG ATTATTTTAT TTACTTTAGT 8100 

TATCGTCGTC GTCATTGATT ACATCAGTAC GAAAGTGAGG GCACATCTCG CATGACACAG 8160

GAAATAGCAA AATATAATGT TCACACAAAA GCACACAAAC GAAAATTGAT TAAAAGATGG 8220 

CTTATTGCAA TTGTCGTCTT AGCTATTATC ATCTGGGCAT TTGCAGGTGT ACCAAGTTTA 8280 

GAACTTAAAA GTAAATCATT AGAAATCTTA AAATCCATAT TCAGCGGATT ATTCCATCCT 8340 

GATATCAGCT ATATCTATAT ACCAGATGGC GAAGACTTAT TACGTGGTTr ACTTGAAACC 8400

TTTGCGATAG CCGTTGTAGG TACTTTCATC GCCGCAATTA TCTGTATTCC ATTAGCATTT 8460 

CTAGGTGCAA ATAATATGGT AAAGCTACGC CCAGTTTCAG GTGTTAGCAA ATTTATTTTA 8520 

AGTGTTATAC GTGTCTTCCC AGAAATTGTA ATGGCACTTA TATTTATCAA AGCTGTTGGC 8580 

CCAGGTTCAT TTTCAGGTGT ATTAGCTTTA GGTATCCATT CCXSTAGtATG CTTGGGAAAC 8640

TTTTAGCTGA AGATATTGAA GGTCTAGATT TCAGTGCTGT AGAATCATTA AAQGCCAGTG 8700 

GTGC35AATAA GATTAAAACA CTCGTATTTG CAGTCATACC ACAAATTATG CCTGCCTTTC 8760 

TATdACTCAT ACTTTATCGC TTTGAACTAA ACTTACGTTC AGCTTCTATA CTGGGGCTAA 8820 

TTGGGGCTGG TGGTATCGGG ACACCACTCA TATTTGCCAT TCAAACACGT TCTTCGGACC 8880 

GTGTAGGTAT TATATTAATC GGTTTAGTAC TAATGGTCGC AATTGTCGAT TTAATTTCCG 8940 

GTTCAATCCG AAAACGTATT GOTTAACATT AAATCAGGAT ACTCCTAAAT AAGAAGTCCT 9000 

ACCGTCTTAC GTTTCTCTAT TATAATAAAA ACAGCAGTGA AGAAAACTAT TGTTATAGTT 9060 

AACTTCACTG CTGTTTTTAT AATATCTAAA TTTATTCTAT TTCAATTCCT TTAAATAACT 9120 

TTTACCGAAC TCTGGTAATG TTACGTTGAA ATTATCTGCT ATAGTTGCAC CGATAGAACT 9180 

GAATGTAGTA TCACTTTCTA GTGCATGACC ACCTTTAAAT TTCGGACTGT ACATAATTAC 9240 

TGTAATAATT ACTAAATCGT CTTCTTTTAA 
GAAATCTTTA ATTGCTTGTG CATAACCTGG 
^ AAAGTCTACT AAGTTTAAGA AGCTAATACC 

TTGATCCATA CCGTCCATGT TACTCTTCGT 
ATAAATGTCA TTAATTTTAC CGATGGCAAT 

TO 

TAAGACAGTT TTACCAAAAG GTTTTAACGC 
GTTTCCTGGT TCACCAACAT ATGGACGTGC 
TTTTGTCAAC TCACGAACCT TTTCACAAAT 
TTCATGTGCA GCAATTTGCA ATACTGGGTC 
TTTCATTTGG TGCTCGCCCC ACTCATCGAT 
20 AACAACTTTA CGACCTGTCA TTTCTTCAAT 

AGGGTATACT TTAAAAGGTT GCATAATATT 

T6TATCTTTA CCAACTGAAG CTTCACTCAA 

25 

TGCATTTACT ACTGGTAATT TATCGATGTT 

AGTTTGATCG AAACCTTCTA AGGTATGTCT 

AGCTGCGTCT GGCGCTTCAC CAATACCTAC 

30 

AAATGGTCTT GTCATAGCTA TCACTCCCAA 
TTCTAAACCT TGCATAATTT GAACACCTGC 

^ AACCATTTTA TTGAAATCTT CTAAATTACG 

AGCACCTACT GTATCTTTCA TTAATTTAAC 
ACCt^TTGAA GITTTAACGA AGTCCGCACC 

40 AATTTCGTCA TGGTCCAACA ATACCGTCTC 

AGCTTTAACC ACTGCTTCAA TGTCTTGTTG 
GCCGATGTTG ATGACCATGT CAATTTCATC 

^ AAATGCTTTC GTTGCAGTTG TOGACGCACC 
CACCTCTGAA TCAGCTAGTC GCTCTGCTGC 
AGATTTAAAA TTGTATGctT TCGCTTCATC 

SO 

CTCAGGCTTC AATAAAGTGT GATCTATATA 
TGTTATATAA TCTCTTTATT TAATTTTACT 

55 



GTTGCTAAAC AGTTCTGGCA AGCGATCATC 9360 

TTTATCACGA CGATGACCGT ATAATGCATC 9420 

TGTGaAATCT TTCTTAACAA TTTTCATCAA 9480 

ACGAACCGCT TCTGTTACAC CTTCACCATC 9540

AACATCATAA CCACCGTCTT TCAAATGATC 9600 

ATAGTCATGT CGAITAGATG TACGTGTAAA 9660 

GATAATACGA CCAATTAAAT ATTTAGGGTC 9720 

ATCATATAAC TCTTCTAATG GGATAATOTC 9780 

TGCACTTGTA TAAACAATTA AGTCACCAGT 9840 

AATTTGCGTA CCCGATGCCG GTTTGTTAGC 9900 

TTGTTGAATT AACTCTTCAG GGAATCCATT 9960 

TAATCCCATA ATTTCCCAGT GACCAGTCAT 10020 

TTTAGTATAG TATGCTTCTG GTTGTTCAAC 10080 

CCCTAGACCT AACTTTTCAA GGTTTGGTAA 10140 

TAAAGTATGT GAACCTTCAT CTTTAAAATC 10200 

TGAATCCATT ACGATTAAAT GTACACGATT 10260 

AATTTATATA TATTAGTAAT CTGAATCTGC 10320 

GCTCGCACCA ATACGTGTCG CACCTGCTTC 10380 

TACGCCACCT GATGCTTTTA CTTCTACATC 10440 

GTCTTCTGCA GTCGCACCGC CACCTGCAAA 10500 

AGCCGCTTTT GTTAATTCAC TCGCTTTTAC 10560 

AATAATCACT TTTACTGTGT GACCTTTCGC 10620 

TACATCATCA AAACGTCCAT CTTTTAATGC 10680 

TGCACCATTT TGAATTGCAT CTTCTGTTTC 10740 

TAATGGGAAT CCTATTACCG TACAAACGAG 10800 

ATATTTAACA TGTGTTGGAT TCACACATAC 10860 

GATGATTTGA TCGATTTGCG TACGTGTTGA 10920 

TTTCTCAAAT TTCATACTTA CTACTCCTCG 10980 

ATAAATACGA ATATATCTCG CGAATTTATA 11040 
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5 



ATACTCATTA AACCTAAAAT AATTAAAATA ATACCGAAAT GTGAACTTAA TGCATCATTG 11160

CCTGGGAAAT TTAATGCTTT AAAATCGATT AGAGCCGCAG CAATCX3CAAT ACCTACAGAT 11220

ACCGCCACAT TAATAATTAA ATTATAAAAA CCAATAGCCA CACCTGTCAT ATTAAGATCT 11280

ATTGTTTTAA TGGCTTCGTT AAGTAAAGGT GCATACATTA AAGCAAAGCT ACCTGCAAAG 11340

AATATCATAG AAATGACGAA GATTGAAATG TGATTACCTA CTGCAAATGC AGGTAAAATC 11400

AAGCTCAGTG CTATTAAAAT AATTGCTGTG ATAATCGCTT GTTTTGAATT CAGATATTCG 11460

CCGATTTTAC CACTTAGTGC ACCAACAAT6 ACTGCTACTA TATAACCCGG TACTAATAAC 11520

           TGTCTAGTTG CAGATGATAA ATTTGCTCCA TTATGAATGG GAACGTAAAA 11580

ATATAACCCA ATTGGATAGC ATACATTACA AATACTATAA ATAAAAATGA AGCATAACXST 11640

TThTTTTGGA AAAATGATTT ATTTACTAAT GGACGTTGGG CATTTTTAAT ATATAGCGCA 11700

AAAACGATAA TCGCAATTAA GGCACCAATC ATATATAACC AATTAAAGTT CGTAATAAAC 11760

AGCATGACTG TTGTAGCAGG GGATCCTCTA GAGTCGAnCC TG 11802



(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1196 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71:

CTAAAGAAGA TGCGAAACAA GATGTTGATA AACAAGTTCA AGCTTTAATT GACGAAATCG 60

ATCAAAATCC AAATCTAACA GATAAGGAAA AACAAGCACT TAAAGATCGT ATTAATCAAA 120 

TAC^X:AACA AGGTCATAAC GACATTAACA ATGCGATGAC AAAAGAAGCA ATTGAACAAG 180 

CAAAAGAACG TTTAGCGCAA gCATTGCAAG ACATCAAAGA TTTAGTGAAA GCTAAAGAAG 240

ATGCGAAAAA TGATATTGAT AAACGTGTAC AAGCTTTAAT TGACGAAATC GATCAAAATC 300 

CAAATCTAAC AGATAAGGAA AAACAAGCAC TTAAAGATCG AATTAATCAA ATACTTCAAC 360 

AAGGTCATAA CGACATTAAC AATGCGCTGA CTAAAGAAGA AATTGAGCAG GCAAAAGCAC 420 

AACTTGCACA AGCATTGCAA GACATCAAAG ATTTAGTGAA AGCTAAAGAA GATGCGAAAA 480 

ATGCAATAAA AGCCTTAGCT AATGCGAAgc GTGATCAAAT CAATTCAAAT CCAGATTTAA 540

CACCTGAGCA AAAAGCAAAA GCGCTCAAAG AAATTGACGA AGCTGAAAAA CGAGCACTAC 600 

AAAACGTTGA GAATGCTCAA ACTATAGATC AATTAAATCG AGGATTAAAC TTAGGTTTAG 660 
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TTGAAGCAAC ACCTGAGCAA ATCCTAGTTA ATGGTGAACT CATTGTACAT CGTGATGACA 780 

TCATTACAGA ACAAGATATT CTTGCACACA TAAACTTAAT TGATCAGCTT TCAGCAGAAG 840 

TCATCGATAC ACCATCAACT GCAACGATTT CTGATAGCTT AACAGCAAAA GTTGAAGTTA 900

CATTGCTTGA TGGATCAAAA GTGATTGTTA ATGTTCCTGT AAAAGTTGTA GAAAAAGAAT 960 

TGTCAGTAGT CAAACAACAG GCAATTGAaT CAATCGAAAA TGCGGCACAA CAAAAGATTA 1020 

ATGAAATCAA TAATAGTGTG ACATTAACAC TGGAACAAAA AGAAGCTGCA ATTGCGilAAG 1080 

TTAATAAGCT TAAACAACAA GCAATTGGAT CATGTTnAAC AATGGCACCT GGATGTTCCA 1140 

TTCAGTTGAA GGAAATTTCA ACAACAAGGA ACAAGCGCCn GATTGGAACA ATTTGA 1196

(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1519 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS : double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72:



CAATCGTTTC AACGCTATTA TCTTTAGACA ACAATTGTAA GCGTGTATGT GCAGTTTCTA 60

AACAGTCTAT AATTCGAGTT CTTAATTCAG CTGGATCATC TTTAAAAATA AAATCCATCG 120

CTGCAACTTT GTAGACAAAT GTTAAATAGG TAAGTTCACT GTGACTCGTA ACGAAAATAA 180

TGTTACCAAC TGGGTCATGC TTACGAATTT CACTGCCTAA TTTGATACCA TTAATATCAG 240

TTGAAAGTTG AATATCTAAA AAGTAACAGC CTATGTCATT CATATTTTTA GCTTGCTCAA 300

GCACCTCATA AGGATTATCA GTTGCGAGGG CAATTTCCAT AGGCTTTTCT TCTATCATTA 360

TATAATTTTT AATAATGGTA ACCATGTTTT            TGGATOGTCT TCGCAAATGA 420

AAATTTTCAT ACATTCACAT CCTTATGGCT AGTTGTTAAT AATTTCAACT TTTTGAATAA 480

AGAAACCATT TTCGATAATT GTATCTAATA AGACATTGTC TGCATTATCA GCAATTTCTT 540

TTAAAGTTGA TAGACCTAAA CCAC6ACCTT CACCTTTAGT AGAAAAACTT TCTTGGAACA 600

ATTCATGAAT GCGTGGTATA TCATCAGCGC ATTTATTCAT AACAATAAAC GTTACTGAAT 660

TTTCACTTTC AATAAATGCA ACGCGAATGA TAGGGTCATC AATTTCAGTT GATGCCTCAA 720

TTGCATTATC AAGAATAATA CCAATACTGC GACTTAAATC GATCATATTC AAGTTAATGC 780

TACTTACTTC ATCGGGTATT TCGATACTAA TCGGAATATT CATTTCTTGT GCACGTAAAA 840

TTTTCGCAGT AATTAAGCCT TTAATTTCAC GTACTTTAAG ATTCTCGATA CCATTTAATT 900
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GTAGGCCAGG CATGTCATCT TCTGGAATGT ATTCTCAAAG TGTCGTTAAG ATATTGACAT 1020 

AATCATGACG GAACTTGCGC ATTTCGTTGT TGATAOCTTC AATCTTCAAT GTATATTCAT 1080 

AATAGGTTTC AATTTCTTCT TGATTACCTT TATATTTCAT CTCTTTAAGG AGAAATTGAG 1140 

AAATAACAAA TGTTAATATA CTTAAAAATA TAGTGATACC AATAAAAATA AAAGAATACT 1200 

GCCTTATTAC TTTAGCTTCA TCCGAGTTTA TTTGTGAATA AAAGAAAAAT AATGAAAAAG 1260 

TAAGCAGTAA GATAGTCGAA ATAACTATTA AAAATCCTTT GTTTAGTATT AGATATGGTG 1320 

TGCTAATTTT TTTGAGAACT CTATTTATTA TATATGAGAA TAGTATACTA ATAGTCACAT 1380 

AAACTACAAA AAAGCTAGGG AATATTACAA ATATACTATC AGAAATTTTG GTGGATATAT 1440 

GCATATATAA CTATATACCT GTi^GTTAGCA CnGTtlATAGG AATAATCnOG CGAGGTCCAT 1500 

AATCCACCAA AATAGAATA 1519 
(2) INFORMATION FOR SEQ ID NO: 73: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5445 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73:

GTAGGAATCT CTTTGTCTTT TTGGGAGGAC ATTTAATATG AATGTATATT TAGCAGAATT 60 

CCTAGGAACT GCAATCTTAA TCCTTTTTGG TGGTGGCGTT TGTGCCAATG TCAATTTAAA 120 

GAGAAGTGCT GCGAATGGTG CTGATTGGAT TGTCATCACA GCTGGATGGG GATTAGCGGT 180 

TACAATGGGT GTGTTTGCTG TCGGTCAATT CTCAGGTGCA CATTTAAACC CAGCGGTGTC 240 

TTTAGCTCTT GCATTAGACG GAAGTTTTGA TTGGTCATTA GTTCCTGGTT ATATTGTTGC 300 

TCAAATGTTA GGTGCAATTG TCGGAGCAAC AATTGTATGG TTAATGTACT TGCCACATTG 360 

GAAAGCGACA GAAGAAGCTG GCGCGAAATT AGGTGTTTTC TCTACAGCAC CGGCTATTAA 420 

GAATTACm GCCAACTTTT TAAGTGAGAT TATCGGAACA ATGGCATTAA CTTTAGGTAT 480 

TTTATTTATC GGTGTAAACA AAATTGCCGA TGGTTTAAAT CCTTTAATTG TCGGAGCATT 540

AATTGTTGCA ATCGGATTAA GTTTAGGCGG TGCTACTGGT TATGCAATCA ACCCAGCACG 600 

TGATTTAGGT CCGAGAATTG CACATGCGAT TTTACCAATA GCTGGTAAAG GTGGTTCAAA 660 

TTGGTCATAT GCAATCGTTC CTATCTTAGG ACCAATTGCC GGTGGTTTAT TAGGTGCAGT 720 

GGTATACGCT GTATTTTATA AACATACATT TAATATTGGT TGTGCAATTG CrATTGTTOT 780 

CGAATCAATT TACTAAAATA AAAAGAAACG 
TGGATTATGC TATTTTTTCG CCAAAATTTA 
5 TTTTTAATAA AGGAGATACT ACTAATGGAA 

ACAAGCTCAA GAGCGATTTT ATTCAATCAA 
GAGTTTAAGC AATATTTTCC ACAATCAGGT 

10 

ACATCTGTGT TAGCTGTAAT GACGGAAGTA 
ATTGCAGGTA TCGGTATTAC AAACCAACGT 
GGCCGCCCAA TTTATCACGC AATTGTTTGG 

IS 

GAATTAAAAC AACAAGGATA TGAACAAACA 
CC6TATTTTG CAGGTACAAA AGTTAAATGG 

20 AAAGCAGAAA ATGGCGATCT ATTATTTGGA 
TCaGGaAAAg CtGCGCATAT TACTGATTAT 
ATCCATGATT TAGAATGGGA CGATGAGTTA 

25 TGCCAGAAGT TAAAGCTTCG AGTGAAGTAT 

GTCAAGAAGT ACCAATCGCT GGAGTAGCTG 
CTTGCTTCGA ACGTGGTGAC GTGAAAAACA 
ATACAGGTGA CAAAGCGGTT AAATCTGAAA 
TTGATGGAAA AGTAAATTAT GCGCTTGAAG 
AATGGTTACG TGATGGATTA AGAATGATTA 

3S 

CACGAGTTGA CTCTACTGAG GGTGTTTATG 
CATMTGGGA TTCTGAAGCA CGTGGTGCGA 
AGCACTTTAT CCGTGCAACT TTAGAATCAC 

40 

CAATGTCAAA AGACTCTGGT ATTGATGTCC 
AAAATAACTT TATTATGCAG TTCCAAGCAG 

45 AAATTCAAGA AACTACAGCT TTAGGTGCTG 
GGGAGAGTAA AGATGATATC GCTAAAAACT 
TGGATGAAGG CGAAAGAGAA AAATTATATA 

^ AAGTTTTTAA AACAGAATAA ACTTGTAGAT 
CAATTTAAGT TAATATTTGA ATCGAGAAGC 

55 
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TAAATAGCAT AATTTAACAT GTTTGATTCA 900 

ACAGATTTTG TACAATGGGT TAGCGATTAT 960 

AAATATATTT TATCTATAGA CCAAGGAACA 1020 

AAAGGGGAAA TTGCAGGGGT AGCACAACGT 1080 

TGGGTTGAAC ATGATGCAAA TGAAATTTGG 1140 

ATTAATGAAA ATGATGTTAG AGCTGATCAA 1200 

GAAACAACGG TTGTTTGGGA CAAaCATACT 1260 

CAATCACGTC AAACACAATC AATTTGTTCA 1320

TTTA6AGATA AGACAOGATT ACTTTTAGAT 1380 

ATTCTAGACA ATGTTGAAGG TGCACGAGAA 1440 

ACGATTGATA CTTGGTTAGT ATGGAAATTA 1500 

TCaAATGCGA GTCGTACATT AATGTTTAAT 1560 

TTAGAACTAt TACAGTACCT AAAAATATGT 1620 

ATGGTAAGAC AATTGATTAC CACTTCTATG 1680 

GTGATCAACA AGCAGCATTA TTTGGACAAG 1740 

CATATGGAAC TGGTGGCTTC ATGTTAATGA 1800 

GTGGTTTATT AACAACAATT GCTTATGGTA 1860 

GTTCCATCTT TGTTTCGGGT TCAGCAATCC 1920 

ATTCAGCACC ACAATCAGAA AGTTATGCGA 1980 

TTGTTCCAGC TTTTGTAGGT TTAGGAACAC 2040 

TTTTCGGTTT AACACGTGGA ACTGAAAAAG 2100 

TATGTTACCA AACTCGTGAC GTTATGGAAG 2160 

AAAGTTTACG TGTCGATGGT GGTGCAGTTA 2220 

ACATTGTTAA TACTTCTGTT GAAAGACCTG 2280 

CATTTTTGGC AGGTTTAGCA GTTGGATTCT 2340 

GGAAATTAGA AGAAAAATTC GATCCGAAAA 2400 

GAGGTTGGAA AAAAGCTGTT GAAGCAACAC 2460 

TAGACTTTTG TATAAACATT GTGATACAAT 2520 

GAGAGATTTG TTCGAACATG TACAATTGAA 2580 
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GCATTGTCTA CTTTTAAGAG AGAACATATT AAAAAGAATT TAAGAAATGA TGAATATGAT 2700 

TTAGTAATTA TTGGTGGCGG TATTACAGGT GCAGGTATTG CACTAGACGC GAGTGAAAGA 2760 

GGAATGAAAG TTGCATTAGT TGAAATGCAA GACTTTGCAC AAGGAACAAG CTCAAGATCT 2820 

ACAAAATTAG TCCATGGTGG TTTGCGTTAC TTAAAACAAT TCCAAATTGG AGTAGTTGCC 2880 

GAAACTGGTA AAGAACGTGC GATTGTTTAT GAAAATGGGC CTCATGTTAC GACTCCAGAG 2940 

TGGATGCTTT TACCAATGCA TAAAGGTGGA ACATTTGGTA AATTCTCAAC ATCAATTGGT 3000 

TTAGGAATGT ATGATCGTTT AGCAGGTGTT AAGAAGTCTG AACX3TAAAAA AATGTTATCT 3060 

AAAAAAGAAA CTTTAGCTAA AGAACCATTA GTTAAAAAAG AAGGTCTAAA AGGCGGCGGT 3120 

TACTATGTTG AATATCGTAC TGACGATGCG CXyTTTAACTA TTGAAGTTAT GAAGCGTGCT 3180 

GCTGAAAAAG GCGCAGAAAT TATCAACTAT ACTAAATCTG AACACTTCAC TTATGATAAA 3240 

AATCAACAAG TAAATGGTGT TAAAGTTATA GATAAATTAA CTAATGAAAA TTATACAATT 3300

AAGGCTAAAA AAGTGGTTAA TGCAGCAGGT CCATGGGTTG ATGATGTTAG AAGTGGTGAT 3360 

TATGCACGCA ATAATAAAAA ATTACGTTTA ACTAAAGGTG TACATGTTGT TATTGATCAA 3420 

TCAAAATTCC CATTAGGTCA AGCAGTATAC TTTGATACTG AAAAAGATGG AAGAATGATT 3480 

TTTGCAATTC CACGTGAAGG AAAAGCGTAT GTAGGTACTA CAGATACATT CTATGACAAT 3540 

ATCAAATCTT CACCATTAAC TACACAAGAA GACAGAGACT ATTTAATCGA TGCGATTAAT 3600 

TACATGTTCC CTAGTGTTAA TGTTACAGAT GAAGATATTG AATCAACATG GGCAGGAATT 3660 

AGACCATTAA TTTACGAAGA AGGCAAAGAC CCTTCTGAAA TCTCTCGTAA GGATGAAATT 3720 

TGGGAAGGTA AATCAGGTTT ATTAACTATT GCAGGTGGTA AATTAACAGG CTATCGTCAC 3780 

ATGGCTCAAG ACATTGTTGA TTTAGTATCT AAACOCTTGA AAAAAGACTA CGGTTTAACA 3840 

TTTAGTCCAT GTAATACAAA AGGTCTGGCA ATTTCAGGTG GCGATGTAGG TGGTAGCAAG 3900 

AACTTTGATG CGTTTGTAGA GCAAAAAGTA GATGTAGCTA AAGGATTCGG CATTGATGAA 3960

GATGTTGCAA GACGTTTAGC ATCTAAATAT GGTTCAAATG TTGATGAATT GTTCAACATT 4020 

GCGCAAACAT CTCAATACCA TGATAGCAAG TTACCATTAG AAATTTATGT AGAACTTGTT 4080 

TATAGTATTC AACAAGAAAT GGTATACAAA CCTAACGATT TCTTAGTTCG TCGTTCTGGT 4140

AAAATGTATT TCAATATTAA AGATGTATTA GATTATAAAG ATGCTGTCAT CGATATTATG 4200 

GCAGATATGC TTGATTACTC TCCAGCTCAA ATTGAAGCAT ATACTGAAGA AGTTGAGCAA 4260 

GCAATTAAAG AAGCGCAACA TGGaAATAAT CAACCAGCAG TTAAAGAATA AtTAATTTGT 4320 

ACAATCATAA ACTGGTGTCC TGTTTTAAGG GCATCAGTTT TTTTATACQA QATACATTAG 4380 
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GTTATTAAAG GTOTGAGATG ATGACTGAAA AACAATTTAA ATTAACTGTA CAAGATAATA 4500 

CGAATATTGA AGTTAAAGTG AATTTTACAG ATGTAGATTC AAAAGGAATT ATTCATATAT 4560 

TTCATGGTAT GGCTGAACAT ATGGAACGTT ACGATAAATT AGCACATGCA CTTTCAAAGC 4620

ATGGCTTCGA TGT6ATACGT CATAATCATC GAGGACATGG TATTAATATT GATGAATCAA 4680 

CAAGAGGGCA TTACGATGAT ATGAAACX3AG TTATCGGTGA TGCCTTTGAA GTAGCGCAAA 4740 

CAGTGAGAGG CAATGTTGAT AAACCATACA TTATAATCGG ACATTCAATG GGATCCGTTA 4800 

TAGCTAGATT GTTTGTAGAA ACATATCCGC AATATGTTGA TGGTCTAATT TTAAGTGGTA 4860

CTGGTATGTA TTCATTATGG AAAGGTTTAC CAACCGTTAA AGTGTTACAA CTGATTACAA 4920 

AAATTTATGG TGCTGAGAAA CGAGTTGAAT GGGTTAACCA GTTAGTATCA AATAGTTTTA 4980

ATAAAAxmAT ACGTCCATTA CGTACACAAA QTGATTGGAT TTCTAGTAAT CCAATTGAAG 5040 

TAGATAaCTT TATTAAAGAT CCATATAGTG GaTTTAATGT GTCAAATCAA TTATTATATC 5100

AAACAGCCTA TTATATGCTA CATACATCAC AATTAAAAAA TATGAAAATG TTAAaTCATG 5160 

CCATGCCTAT ATTATTAGTT TCAGGATATG ACGATCCTTT AGGTGATTAT GGTAAAGGGA 5220 

TTTTAAAATT GGCGAATATA TATAGAAACG CTGGCATxiAA AAATGTTAAA GTGAATCTTT 5280

ATCATCATAA ACGTCATGAA GTGTTATTTG AAAAnGATCA TGACnAAATT TGGGAAGACT 5340 

TGTTTAAATG GTTGAATCAA TTTTATAAAA AATAAAGAAA GTGGAATTAA ATATGAATAA 5400 

AAATAAGCCT TTTATTGTAG TAATTGTGGG GCCAACTGCT TGCAG 5445 

(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2569 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

TGGCTTGAAC TACGCCAATA AGTCCCCCTA GTACAAGAAT GAATACCATG ATATCGACCG 60 
CTTCTATCGT ACCTTCAACC ATGCTACTTG TTATTTGTTC TGGTCCAGCT GGATGTTGCT 120

TTAATCTTTC ATAAGTATTC GGAATTGATA CCGGCTTATT AATTGCACCT GATTTAAATT 180 

GTTCAATCTT AATTTTAACC CCCATTTTGT CTAGTTCCTG TTGCGTACCC GGAACCTTTT 240 
TCACTTGGTT ATGAGGGTTA ACTATCTTTA GTTCTTGGGA TGAAGGTTCG TAAGAAAGTT 300

TAGAATATGC ACCAGCAGGA ATAACCCATG TTGCTATAAC TGCAACAACC GTTAAAATGA 360 
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TAATTGTATT TTCCACGGTT TCATCTCCTT 
GATTTTATAA ATATAAATTA AGAAAGTGCA 
^ GGGGGTGCAC ATAAATAATA AAAATCATGC 

AATTCAATTA CTTTTTAATC ACAGTACCTA 
TTAATGATGT TATAAGCACA CTTCCTTTTG 

10 

CAATTTTTGG TAACATACTT CCTTTTGCAA 
CATCAACATT TGTTGTTTTC AAAGGCTGTT 
AATCAATTGC TGTTAAAATA ATCAATTGAT 
TTGTTTTATC TTTGTCTATA ACTGCATCAA 
TTACTGGTAT ACCTCCACCA CCAGCAGCAA 
20 TAATACTCTC TAATTCAATA ATAGAGATGG 

CTCTTCCAGC ATCTTCAACA AATATAAATC 
CTTTGTTGTA AAATAACCCA ATTGGTTTTG 
CAACTTCAAC TTGTGTCACT AGTGTTACCA 
CATTTTGTAA GCTTTCTTGT AATTGATAGC 
CAGCAAATGG AAATGCCGGA CCTTGGTTAT 

30 

TGCTTCCAAC CTGTGGTCCA TTACCATGAC 
AyCCTACTAA TGATTtCGCA GTATTTTTAA 
CCTAAAGCAT TACCACCTAA TGCTACTACT 

3S 

CATTTAAAAT TCACCCAATG TAGCAACCAT 
AGCTTCTTGG AATACAACTG AAGCTTTACT 

40 TCGAATACCA TATTTTTCAA AAATTTGTTG 

TQGTAAGCAA TGCTCAAAAA TAACATTT6G 
TACTTGATAT GGTTTCAATA ATTCAAGTCG 

^ TGATACCCAA ACATCAGTGT AAATTACATC 

TGTGATTAAT ATGTTGCCaC CATTTTCaGC 

ATCTGTTGGA TTTAATTCTT TTGGACAAAC 

SO 

ACCTTGCATT AATGCATTTG CAACGTTATT 

ATCTGCATAA TCTTTTTTTA AGACTTCTTT 

SS 



CGACATTTAA CCTAGCATTT CTACCTTAAA 480 

CCCCGCATCA AAATAGAGGC ATTATTTTCA 540 

ATTTGACATA TAGTAATTGA AAAGCGTTTC 600 

CTTTACCCTC TAAGGCAGCA TCTAATTCAT 660 

GATTGTTTTC AATAAATGAT ATGGCTGCTT 720 

ATTGATTTTC GTCTATATAT CGTTTTAATT 780 

GGTTTTCAGT GTTAAAATTA ATATATACAT 840 

CGCATTGAAT ATTAGCACCC AACAACGCAC 900 

TACCTTTAAA ACCATCATGT TGCTCTCTAA 960 

TAACGAGT6T ATCATTTTTA ATAAGTGTTT 1020 

GTTGTGGTGA AGGAACAACG CGTCTATATC 1080 

CTTTTTCTTT TTGAATTTGT TCAGCTTCTT 1140 

AAGGATTGTT AAATGCCGGA TCATTTTCAT 1200 

CTTGTTTATC CATTCCAATA GAATGCAATT 1260 

CGATGTAAGC TTGACTCATT GCGCCACATT 1320 

GTTCTGCAGC ATAGTTAAGT CCCAAATTAA 1380 

TAATAACAAT CTCATGTCCT TTTGTnATTA 1440 

CAAGCTCGAG TtGgTyCTTG aGGTGATTTn 1500 

ATTTTCGCCA TCATATTCAC TTCCTTATAT 1560 

GaCTGCTTTG ATTGTATGCA TTCTGTTCTC 1620 

TTCGAATACT TCATCTGTAA CTTCCATTTC 1680 

ACCTATTTTC GTATCAGCAT TATGGAAAGA 1740 

ATTACCAGTT TTATCCATTA TTTCTTTATT 1800 

TTCTTTCCAT ACTTCATCAG GTTCACCCAT 1860 

CGAACCTTTT ACaCCTTGGT CaATATCATC 1920 

GGCAATATTT TTACAGCGAT TTAATAATTC 1980 

TAAATGGAAG TTCATACCCA TAATGGCAGC 2040 

ACGACCATCT CCAACATATG TAAAGTTAAT 2100 

TGCTGTTAAG AAATCAGCAA GAACTTGAGT 2160 
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TTCTACTGTT CTTTGTGAAA AACCACGGTA TTCAATGCCA TCATACATTC CACCAAGCAC 2280 

ACGTGCAGTA TCTTTAGTTG TTTCTTTTTT ACCGATTTGT GATCCAGTTG GGCCTAAATA 2340 

5 AGTTACATTT GCACCTTGAT CATGOGCTGC AACTTCAAAT GCACATCGCG TTCTTGTAGA 2400 

ATCTTTTTCA AATAACAGTG CAATATTTTT ATTTTTTAAC ATAGGCTTTT CAGTGCCAAT 2460 

ATATTTAGCA CGTTTTAAAT CCTCGGAGAG TGTTAATAAG GTTCTACCTC TTGTCGTGAA 2520 

10 

AAGTCTAATA AAGTTAAAAA ACTTCTGTTT CGTAnATTTT TCATTAAnA 2569 

(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1273 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 





CCTGGAACCA TCCaATCGtG CaAATCtTGa AAGaGAATAC GCAACAACAA TTAAATGTAT 60

TGGAACACTA TATTCCAAAT GACCATCCAG CACTCGTTGA ATTAAAAATA TGGGAACGTT 120

GGTTACATAA ACAAGGTTAC AAAGACATCC ATTTAGATAT TACTGCGCAC CACCTAGATC 180

CTATTACACA GGTTTATTTA TTCAATGTCA TTTTGCTGAA AATGAATCTC GAGTTTTAAC 240

AGGTGGTTAT TACAAAGGAA GCATCGAAGG GTTTGGATTA GGATTAACAC TTTAAGTAAG 300

GGAGTATGCA CAATGTTAAG AATCGCCATA GCCAAAGGAC GTCTAATGGA TAGTTTAATT 360

AACTATTTAG ATGTAATTGA ATATACGACA TTATCAGAAA CATTAAAAAA TAGAGAACGC 420

CAATTATTAT TAAGTGTAGA TAATATTGAA TGCATTTTAG TAAAAGGAAG TGACGTGCCA 480

ATCTATGTGG AACAAGGAAT GGCAGACATA GGCATTGTTG GTAGCGACAT ATTAGATGAG 540

CGCCAATATA ATGTTAATAA TTTGTTGAAT ATGCCTTTTG GAGCATGTCA TTTTGCGGTT 600

6CAGCGAAAC CTGAAACGAC CAATTATCGT AAAATCGCAA CGAGTTATGT TCATACTGCT 660

GAAACATATT TTAAATCAAA AGGTATTGAT GTCGAATTGA TTAAATTGAA TGGCTCTGTT 720

GAATTGGCCT GTGTTGTAGA TATGGTAGAC GGAATTGTCG ACATCGTTCA AACAGGTACT 780

ACGCTAAAAG CGAACGGACT GGTTGAAAAG CAACATATTA GTGATATCAA TGCAAGATTA 840

ATAACTAATA AAGCAGCTTA TTTTAAAAAA TCACAATTAA TAGAGCAATT TATTCGCTCT 900

TTGGAGGTGT CTATTGCCAA TGCTTAATGC ACAACAATTT TTAAATCAAT TTTCATTAGA 960

AGCACCATTA GATGA6TCAT TGTATCCaAT TATTCGCGAT ATTTGTCAGG AAGTTAAAGT 1020

TTTAGaAATT AGTCATGAmC AAATTAAAGC AGCATTTGAC ACATTAGATG AAAAAACAAA 1140

ACAAGCATTA CAACAAAGTT ATGAAAGAAT TAnAGCATAT CAaGAAaGTA TtaAACAGaC 1200 

GaATCAACAG TTAGAAGaAT CAGTGGaGTG tTrTGaAATA TACCATCCmC taGaAAGTGT 1260 

CGGTATTTAT GTG 1273 



(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1308 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76:


GTTQATAAAT TAAAAATGTT TTTATCAGAT ATTCAAAGTT ACCAACAATA TAGTAAAGAT 60

CATCCGGTGT ATCAGTTAAT TGATAAATTT TATAATGATC ATTATGTTAT TCAATACTTT 120

AGTGGACTTA TTGGTGGACG TGGACGACGT GCAAATCTTT ATGGTTTATT TAATAAAGCT 180

ATCGAGTTTG AGAATTCAAG TTTTAGAGGT TTATATCAAT TTATTCOTTT TATCGATGAA 240

TTGATTGAAA GAGGCAAAGA TTTTGGTGAG GAAAATGTAG TTGGTCCAAA CGATAATGTC 300

GTTAGAATGA TGACAATTCA TAGTAGTAAA GGTCTAGAGT TTCCATTTGT CATTTATTCT 360

GGATTGTCAA AAGATTTTAA TAAACGTGAT TTGAAACAAC CAGTTATTTT AAATCAGCAA 420

TTTGGTCTCG GAATGGATTA TTTTGATGTG GATAAAGAAA TGGCATTTCC ATCTTTAGCT 480

TCGGTTGCAT ATAGAGCTGT TGCCGArAAA GAACTTGTGT CAGAAGAAAT GCGATTAGTC 540

TATGTAGCAT TAACAAGAGC GAAAGAACAA CTTTATTTAA TTGGTAGAGT GAAAAATGAT 600

AAATCATTAC TAGAACTAGA GCAATTGTCT ATTTCTGGTG AGCACATTGC TGTCAATGAA 660

CGATTAACTT CACCAAATCC GTTCCATCTT ATTTATAGTA TTTTATCTAA ACATCAATCT 720

GCGTCAATTC CAGATGATTT AAAATTTGAA AAAGATATAG CACAAATTGA AGATAGTAGT 780

CGTCCGAATG TAAATATTTC AATTGTGTAC TTTGAAGATG TGTCTACAGA AACCATTTTA 840

GATAATGATG AATATCGTTC GGTTAATCAA TTAGAAACTA TGCAAAATGG TAATGAAGAT 900

GTTAAAGCAC AAATTAAACA CCAACTTGAT TATCGATATC CATATQTAAA TGATACTAAA 960

AAGCCCTCAA AACAATCTGT TTCTGAATTG AAAAGACAAT ATGAAACAGA AGAAAGTGGC 1020

ACAAGTTACG AACGAGTAAG GCAATATCGT ATCGGTTTTT CAACGTATGA ACGACCTAAA 1080

TTTCTAAGTG AACAAGGTAA AC6AAAAGCG AATGAAATTG GTACGTTAAT GCATACAGTG 1140

GATGGATTAA TCGATAAACA TATTATCGAA GCAGATGCGA AAAAAGATAT CCGTATGGAT 1260

GAAATAATGA CATTTATCAA TAGTGATTAT ATTCGATATT GCTGAAGC 1308

(2) INFORMATION FOR SEQ ID NO: 77:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1431 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77:



GATGCCATTn ATnnGTATGC AAGAAGTTGT TCCGGCTTCA GGTGGATTaC CAGTTGGTAC 60

TGGTGGTAAG AGGTTACTAA TGCTTTCAGG CGGTATAGAC TCACCAGTT6 CTGGGATGGA 120

AGTGATGAGA CGTGGCGTAA CAATTGAAGC GATTCATTTC CATAGTCCAC CATTTACAAG 180

TGATCAAGCA AAAGAAAAAG TTATTGAATT GACACGTATT TTAGCTGAAC GT6TTGGACC 240

AATTAAATTG CATATTGTAC CATTTACAGA ATTGCAAAAA CAGGTAAATA AAGTrGTACA 300

TCCAAGATAT ACAATGACTT CAACGAGACG TATGATGATG CGTGTTGCTG ATAAATTAGT 360

ACATCAAATA GGGGCTTTAG CTATTGTAAA TGGTGAAAAC CTAGGGCAOG TAGCCAGTCA 420

AACACTTCAT AGCATOTATG CAATTAATAA TGTAACTTCT ACTCCTGTAT TACGTCCTTT 480

ATTAACTTAC GATAAAGAAG AAATTATTAT TAAATCGAAA GAAATTGGTA CATTTGAAAC 540

ATCTATTCAA CCATTTGAAG ATTGTTGTAC AATTTTCACC CCTAAAAATC CAGTAACCGA 600

ACCAAACTTT GATAAGGTAG TCC/^TATGA AAGTGTCTTT GATTTTGAAG AGATGATTAA 660

TCGTGCTGTT GAAAATATTG AAACACTTGA AATAACTAGT GATTATAAAA CTATTAAAGA 720

ACAQPVAACA AACCAATTAA TAAACGACTT TTTATAAATA AAATCCTAGA GTAAATTTAA 780

ACATAAGGGG ATGTTAAACT ATGGATTTGA ACTTAACGAT GATTATAATC ATAATTTTAT 840

TTGGTTTTAT CGCGGCGTTT ATAGATTCGG TTGTAGGGGG TGGCOGTTTA ATTTCTACGC 900

CAGCATTATT AGCAATCGGT CTACCACCAT CTGTGGCTTT AGGTACAAAT AAATTGGCAA 960

GTTCGTTTGG TTCTTTAACT AGTACGATAA AGTTTATAAG GTCCGGTAAA GTGGACTTAT 1020

ATGTTGTTGC CAAATTATTT GGTTTTGTAT TTTTGGCATC TGCATGTGGC GCATATATTG 1080

CAAC6ATGGT TCCGTCACAA ATATTGAAAC CTTTAATCAT CATTGCACTT TCGTCGGTGT 1140

TTATATTCAC ATTACTTAAA AAAGATTGGG OCAATACACG CACGTTTACT CAATTTACAT 1200

TTAAGAAAGC CATAATATTT GCAGCACTTT TTATATTAAT CGGCTTTTAT GATGGATTTG 1260

TAAGTGCAGC AGGAAATGCT AAAGTTTTGA ACTTTGCTTC TAATATAGGT GCGCTTGTAT 1380

TATTTATGGT ATTAGGACAA GTAGATTATG TAATAGGTTT AATTATGGCT A 1431

(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4403 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78:

AATATTATTT TAAATTCAAT ATTTATTGGT GCATTTATTT TAAACTTATT ATTCGCCTTT 60 

ACCATTATTT TCATGGAAAG ACGTTCTGCC AATTCTATCT GGGCTTGGTT ACTAGTCTTA 120 

GTTTTCTTGC CTTTATTCGG CTTCATTTTA TACTTACTAT TAGGACGACA AATTCAACGT 180

GACCAAATrr TCAAAATTGA TAAGGAAGAT AAAAAAGGAT TAGAGTTAAT CGTTGATGAG 240

CAATTAGCTG CTTTAAAAAA TGAAAACTTT TCAAATTCCA ATTATCAAAT TGTAAAATTT 300

AAAGAAATGA TTCAAATGTT GTTATATAAT AACGCAGCAT TTTTAACAAC AGACAACGAT 360

TTArrrrtAT ACACAGACGG CCAAGAAAAA TTTGATGACC TAATACAAGA CATCCGTAAT 420

GCTACTGATT ATATTCATTT TCAGTACTAT ATTATTCAAA ATGATGAATT AGGTCGTACC 480

ATTTTAAATG AACTTGGTAA AAAAGCGGAA CAAGGTGTAG AAGTTAAAAT TCTTTATGAT 540

GACATGGGTT CTCGTGGACT GCGTAAAAAA GGCTTACGCC CGTTTCGCAA TAAAGGTGGA 600

CATGCTGAAG CATTTTTCCC ATCAAAATTA CCTTTAATTA ACTTGCGTAT GAACAATCGA 660

AACCATCGAA AAATTGTTGT AATAGATGGG CAAATTGGAT ATGTTGGTGG TTTTAATGTT 720

GGTGATGAGT ACTTAGGTAA ATCAAAAAAA TTCGGCTATT GGCGAGATAC GCATTTACGA 780

ATTGTCGGGG ATGCAGTGAA TGCATTGCAA TTACGATTTA TTCTAGATTG GAATTCACAA 840

GCCACACGTG ACCACATCTC CTATGATGAT CGTTATTTCC CAGATGTAAA TTCTGGTGGA 900

ACAATTGGCG TTCAAATAGC TTCTAGTGGT CCTGACGAAG AATGGGAACA GATTAAATAC 960

GGCTATTTGA AAATGATTTC ATCTGCTAAA AAATCGATTT ATATTCAATC TCCCTATTTC 1020

ATACCTGATC AAGCCTTTTT AGATTCTATT AAAATTGCGG CATTAGGTGG TGTTGATGTC 1080

AATATCATGA TTCCTAATAA ACCTGACCAT CCGTTTGTTT TTTGGGCTAC TTTAAAAAAT 1140

GCAGCATCCT TATTAGATGC CGGTGTTAAA GTATTTCACT ACGACAATGG CTTTTTACAC 1200

TCAAAAACAC TTGTTATAGA TGATGAAATT GCAAGTGTGG GAACAGCTAA TATGGACCAT 1260

AAATTAAAAC AAGCTTTTAT AGATGATTTA GCAGTATCTT CTGAATTAAC AAAAGCACGT 1380

TATGCTAAGC GAAGTCTTTG GATTAAATTT AAAGAAGGTA TTTCACAATT ATTGTCACCT 1440

ATCTTATAAA ATAGAAATAT GAGGAGTGTA aCTTTAATGC AACAATCAGA CGTCATTAGT 1500 

GCTGCCAAAA AATATATGGA ATCTATTCAT CAAAATGATT ATACAGGCCA TGATATTGCG 1560 

CATGTATATC GTGTCACTGC TTTAGCTAAA TCAATCGCTG AAAATGAAGG TGTTAATGAT 1620 

ACTTTAGTCA TTGAACTCGC ATGTTTGCTT CATGATACCG TTGACX3AAAA AGTTGTAGAT 1680 

GCTAACAAAC AATATGTTGA ATTGAAGTCA TTTTTATCTT CTTTATCACT ATCAACCGAA 1740 

GATCAAGAGC ACATTTTATT TATTATTAAT AATATGAGCT ATCGCAATGG CAAAAATGAT 1800 

CATGTCACTT TATCTTTAGA AGGTCAAATT GTCAGGGATG CAGATCGTCT TGATGCTATA 1860 

GGCGCTATAG GTGTTOCAOG AACATTTCAA TTTGCAGGAC ACTTTGGTGA ACCTATGTGG 1920 

ACAGAACATA TGTCACTAGA TAAGATTAAT GATGATTTAG TTGAACAGTT GCCACCATCT 1980

GCAATTAAAC ATTTCTTTGA AAAATTACTT AAGTTA6AAT CTTTAATGCA TACAGATACG 2040 

GCGAAGATGA TTGCTAAAGA ACGTCACGAC TTTATGATGA TGTACTTGAA ACAGTmTT 2100 

ACGGAATGGA ATTGTCACGA CTAGACATTG AAGTTGTAGT ATGATGATGC GATGTAATGG 2160 

CGTGTTGTTG TGGAAGCTTG GTGTCATGCC ATGTTACTTT GATGTGTTGT TGTGGGAGCT 2220 

TGGTGACATG TCATGCTACT TTGATGTGCT GGTACCACGA TGCX3TCTTGA TGTAGTGCTA 2280 

TGATGTGGCA TTGCGGTGTT ATGGTGTTAT AGACAGGTTT GGCGTTGATG CCATGTTACT 2340 

TTGATGTGCT GGTACCACGA TGCGACTTGA TGTAGTGCTA TGATGTGGCA TTGCGGTGTT 2400 

ATGGTGTTAT AGACCGGTTT GATGTTGATG CCATGTTACT TTGATGTGCT GGTGCTACGA 2460 

TGCGACTTGA TGTAGTGCTA TGATGTGGCG TTGCGCTGTT ATGGTGTTAT AGCCAGGTTT 2520 

GGTG3TGATG TCATGCCGTT ACGATTCTAT GATATGTTGT TGGGACGTTG CAATGTGTAT 2580 

TATGCCGTTG TGACGTTATT ATTTCACACT GTTACATGTA TAAGTGAATT GCTGTGGAAA 2640

TTTGCGACAT ATACTGCTAC ACTGATGAAT CATTGTGTCA AGATGACATT GCGATGAAGA 2700

ATGACAACTC TGTTATTAAC CACTTTTTAC ATACTGAAAA CTCGTTAATA TTATTTCAAA 2760

TAAAAACAGC AGTAGGATGA CTTTCACATT TGAAATCATC TTACTGCTGT TTCTATTTAT 2820

CACATATTGT ATAATGTGAC ACTAAGTTTC GCTATTGAAG CGAAAAATAA TGTGOGCCCT 2880 

ATAAAGTTAA AATTATCTTC AACTTTTAGG GTGCACATTA TTTGGACTTG CTAAGGTTAT 2940 

TTCTTTTTCT TTTTAGACAC AACTTGTGTG TTTTTGCCTT TTTTATTGCt GCCGCCGTTG 3000

TGCTCTCTTT CATACGCTTC AATGAAAGGT TGTACTTCTT TTTTAGCGAC TTTTTCATAA 3060

CCAAGTGCTQ ATGCTGAGCT TAATGAAATC CAGATAATCA TAATTGGTGA AATGACCATC 3180

ATCATGTAAC CCATTTGACG TTGTTCGTCT GGCATCGTTT TACTTGATAC ATATGCTTGG 3240

ATAAAGTATA AAACACCGGC AATAATTGTA ATCCAAATAT CAGGACGTCC TAAATCGAAC 3300

CATAAGAAGT GTGGATATTT AAACAAACCA TCTACAAGTT GGTCTTTAAG TACAAAGTAT 3360

AATCCCATQA TGATTGGTAA TTGGATTAGC ATTGGTAAAC AACCCAACAT ACTCTTAATC 3420

GGGTTCATGT CATACTTTTT ATATACTTGC ATTAATTCTT GGTTTGCAGC CATTTTTTCT 3480

TCTTGTGTAC GCGnCaCXSTT CACTTTTTCT TGAATTTTTT CAACTTCTGG CTTTGCAACT 3540

TTCATTTTTT GACGCATCAT ATGACTATTT TTATAGTTTG ACAACATGAA TGGTAATAAA 3600

ATAATACGAA TTACCAATAC AAGGATAATA ATAGCTAAAC CATAATTGTC GTTTAATAAG 3660

TTATTTCCCA ACCAATCCAA TACATTTTTC ATTGGATCTA CGAATGTATT GTAGAAAAAy 3720

cwCtACGTTT TTCAGGTTTA GAATAGTCAC AACCAGCCAA AAAGACCATA ATACCTAAAA 3780

ATAATGGTAG TAACGCTTTT TTCTTCATTT TTCCACCTCT ATCATTATAT TCACATAGGA 3840

TTTATTCTAT CACATTAATG AGTACGTATG AAACAATAAG TGGAAAAATT TAACTAATTA 3900

TTAAAAAAAT CTTTGAATCG ATTAACAGTC TTTTCAATAT TTTCACTTTT AGAAATGGCT 3960

GAAATGACTG AAATTCCATT GGCACCTGCT TCTACAATCG GCGCCACATT ATTAGTATTG 4020

ATACCGCCAA TAGCTACAAT CGGTAGTTGC GGATTCATTT CTTTAAACGT TGCAATCATT 4080

TCTGGACCTA CTGGTATATG CGCGTCATGC TTCGACGGCG TAGGATAGAT TGGTCCAACA 4140

CCTATATAAT CmACATGAGT TAAATCAGAT TTTGCATACT CATCTAAATC ACTAATACTA 4200

AGTCCAATAA TTTTATCAGT GAAATATTGT GCTATCTCTT TGACTTTCGC ATCATCTTGA 4260

CCGACATGTA TACCATCCX5C GTTAATTTCT TTTGCCAAGG ATACATCATC ATTAACGATA 4320

AAAGGCACAT CATATTGATG ACAGAGATGC TGTAATTCTT TAGCTAATAC AAGTTTATCG 4380

TTTCCTTTTA AAGCTGATTC ACC 4403



(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1808 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 
TGGAnCCAAT ATTAGAAATG ATTAAAACAT TAACAGGTAT TAATAGTCCT TCAGGAGnCA 60

TAACAAATAA AGGTGCGTTA TTAATAACAG TGCCAGGCAA AAATGATGAA GTACAACGCT 180

GTATTACTGC TCATGTTGAT ACTTTAGGTG CaATGGTTAA AGAAATTAAA GAAGATGGTC 240

GCTTaGCAAT AGAATTAATT GGAGGATTCA CGTATAACGC GATTGAGGGT GAATATTGCC 300

AAATTAAAAC TGATGCTGGT CAAATATATA CAGGAACAAT TTGTCTGCAT GAAACAAGTG 360

TTCATGTATA TAGAAATAAT CATGAAATAC CTAGAGATCA AAAGCATATG GAAATAAGAA 420

TTGATGAAGT AACTACATCA GAAGAAGATA CAAAGAGTTT AGGTATTTCA GTAGGTGATT 480

TTGTTAGCTT TQATCCACGT ACAGTTATCA CGTCATCAGG TTTTATTAAA TCTCGTCATT 540

TAGATGATAA AGCTAGCGTA CGgTtGATAC TACAATTACT AAAGAAATTA AAAGAAGAGC 600

AAATAATATT ACCACATACA ACGCAATTTT ATA'ITTTCTAA TAACGAAGAA ATAGGTTACG 660

GTGCAAATGC ATCAATTGAT TCGAAAATCA AAGAATATAT TGCATTAGAT ATGGGCGCGT 720

TGGGAGACG6 TCAAGCATCG GATGAATATA CAGTTTCTAT TTGTGCCAAA GATGCTTCAG 780

GTCCATATCA TAAGCAATTG AAATCGCACC TAGTTAATCT TTGCAAAATA AATAACATTC 840

CATATAAAGT AGACATATAT CCATATTATG GTTCAGATGC TTCAGCAGCT TTACATGCTG 900

GTGCGGATAT CAGACATGGT TTATTTGGCG CTGGCATTGA ATCATCTCAT GCAATGGAAC 960

GAACACATAT TGATTCTATT AAAGCGACAG AGAAATTACT ATATGCATAT TGCTTATCAC 1020

CAATTGAGTA AACAATTAGT GTTGACAAAT GTGaACGACC TATGTAATAT AATGAACTAT 1080

AAAAATAATT AGAATTTTCT AAAGAAATAG TAGCAGATAT GAAACGTAGC AAATAGAAAG 1140

CTAATGGGTG ATGGGAATTA GCACGCCATA TCTTGTGAAT TGGACTTTGG AAAACAATTG 1200

AATGAGTTTT GAAAGTGAAC ATGAATTATO TTAACTAAGG TGGCACCACG GTAACGCGTC 1260

CTTACAGGTA TATGC6TTAT GTGGTGTCTT TTTATTTAGA CAAAATGTAG TAGTTAATTA 1320

AAGCTAGCAA CAGAAAGTTA GTGGATQATG TGAACTAACA CCGAGATTAA TGAAATTGGG 1380

TTTTGTCTGC AACAGAAAAA TTATATATAG TAAAGAGTGA ACTATGAATA TTTCGAATAT 1440

TCGGTTAATT TAGGTGGTAC CACGCGTCAC nTCCTTTATA TTGATAAGGA TGCTGGCGCT 1500

TTTTTGAAAG GAGCGTATAG AATGGATATA TTTTATAAAA AAATAAAAGC AAATGTAACG 1560

CCCGAAGTTT TAGCACAACT TCATTCCAAG AAGaTCATTT TGGAAAGTAC AAATCAACAA 1620

CAAACTAAAG GTCGCTATTC AGTTGTTATT TTTGATATTT ATGGCACTTT AACTTTAGAT 1680

AATGATGTAT TATCAGTAAG TACTTTAAAA GAATCGTATC AAATCACTGA AAGACCGTAC 1740

CATTATTTAA CGACTAAnAT AAATGAAGAC TACCATAATA TTCCAAGATG AGGCAACTTA 1800

AGTCATTA 1808

(2) INFORMATION FOR SEQ ID NO: 80:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1320 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80:





TGGTCGTCAA TTTCTTGATT ATATCTATAA TCCTCATTTT CAATATTAGA GTCTGTAGAA 60

TCATCGATAT TATTATCATT CGCATGACTA GAAGCAGAAT CATTATTTTT ATCATTGCTT 120

TCTTCTTTTT TGAAGTCTTT ATTTATCAAG TAAATTTCTT CATCAAAATC AGCTTGTTGA 180

GATGTATCAT CTTTATTTTG ATTAGAAAAA TGTGTA6CCT TTGATCTTTT TCTTTGCCQT 240

CTTTTCTTAG ATGTATTCCT CGTAAATAAT TCTAATTCAT CTTTATCTTC ATTTGATTCT 300

TGTTGATCGT TCTTCGTTTT ATCATCCATC AATACTCACA CCCTTTAATA AGATGGTAAA 360

TGGGCACGGA ATCTTTCAAT AAATTTCTCT CCACGCTCTT CAAAAGTACT ATATTGATCC 420

CAACTCGCAC AAGCAGGTGA CAATAATACA ACATCATTTG GTTCTATAAT ATCTTGTACT 480

TTATCAACAG OGTCTTCGAC ATTGTTCGCT TCAATGACCG ATTTCCCTTG ACTATTACCT 540

AGTTTAGCAA ACTTAGCTTT CGTTTGTCCG AATACAACCA TCGCGCGAAC ATTTTCCATA 600

TAAGGAATGA GTTCGTCAAA TTCATTCCCT CGATCCAAAC CACCACATAA CCAAATGATT 660

GGTTGATTAA ATGAATTTAA GGCAAACTGT GTTGCTAGCG TGTTTGTTGC TTTGGAATCA 720

           TATTAGTTCT ATTAUTACCA ACATATTGCA ATCTATGCTC TATTCCTGAA 780

AATGTAGTTA AACTATCAAT AATTGCtTTA ATAGGTACAC CAGCanAATA CAAGCAAGCA 840

CAGCTGCTAA TATATTTcTA AATTATOTTC ACCAGGCAAT ACTAGAtCTT CAGTGTTAAT 900

AATadgAACA CCTTTATaAA CGATAAAACC ATCTTtAATA TAAaTACCAT CArCTtCTTG 960

TTGAGTTGAG AAATACAATG TCTTAGCTTT TAATTCTTCC GACTCTATCA CTTGTCTTTG 1020

ATGATAATTA CAAATCAAAT AATCCTCTTC CGTTTGATTT TTATATATTT GCTTTTTAGC 1080

ATTTTGATAG TTTTCTAAAT TTTCATG6TA ATCTAGATGC GCCGAATAAA TGTTAGTAAT 1140

TATAGCAATG TGTGGTTTAT ACTTTTCGAT TCCAAGTAAC TGGAATGACG ACAACTCTGT 1200

AACTAAATAA TCTGTAGGCT TTACTTCTTG TGCTACTTTA GATGCAACAT AACCAATATT 1260

GCCGGATAAT CTTCCAGTTA AGCGACTTTT TTTAAACATA TCTCCAATTA GAGAAGTAAC 1320



(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4280 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

TTTACACCAA TCAAAAAATC GAACTGATAT AAATAAGTAC AAAGCTTATC TATCAATCCG 60 

ATTTAGTTAT AAAACAAAAA AAGCCACAGT AATGTGGCTT TTTGTTATAT TCAGTATCAA 120 

AATGGTATCA ATAGCCATTT TCGGAAGTCA AGAATGGCTT AACAACGCGG TTTAAAGCTA 180 

TCCAATACTA CCTTCCATTT CGAACTTGAT TAAACGOTTC ATTTCGACCG CGTATTCCAT 240 

TGGAAGTTCT TTTGTAAATG GTTCGATGAA TCCCATAACA ATCATTTCTG TCGCTTCTTC 300

TTCAGAAATA CCACGACTCA TTAGATAGAA TAATTGTTCT TCAGAAACTT TTGAAACCTT 360

GGCTTCATGT TCTAATGATA TTTGATCGTT GAATACTTCG TTATATGGAA TTGTATCTGA 420

TGTTGATTCG TTATCTAAGA TTAATGTATC ACATTCAATA TTTGAACGAG CACCTTTTGC 480

TTTACGTCCA AAATGAACAA TACCGCGATA AATAACTTTA CCACCATTTT TAGAAATAGA 540

TTTAGAAACA ATTGTAGAAG ATGTATTAGG TGCTTTATGA ATCATTTTAG CACCGGCATC 600

TTGAACTTGT CCTTTACCAG CAAATGCAAT AGATAATGTA CTACCTTTTG CACCTTCACC 660

TAAAAGAACA CAGTTTGGAT ATTTCATCGT TAACTTAGAA CCTAAGTTAC CATCTACCCA 720

TTCCATATTT CCGTTTTCAT AAACAAAAGT ACGTTTTGTA ACTAAATTGT ATACATTGTT 780

CGCCCAGTTT TGAATCGTAG TATAACGAAC GTGCGCATCT TTATGCACAA TGATTTCCAC 840

AACAGCAGAG TGTAAAGAAC TAGTTGTATA AACTGGTGCA GTACAACCTT CTACGTAATG 900

TACAGAAGCA CCTTCATCAG CAATGATTAA TGTACGTTCA AATTGACCCA TOTTCTCAGA 960

OTTAATACGG AAATAAGCTT GTAGTGGCGT ATCTAGTTTG ATATTTTTAG GTACATAAAT 1020

GAAGC3AACCA CCTGACCATA CTGCTGAGTT TAACGCCGCA AATTrGTTAT CTGCTGCAGG 1080

TACTACAGAA GCAAAGTATT TTTTGAATAA TTCTTCATTT TCTTGTAAAG CACTATCTGT 1140

ATCTTTAAAG ATAATACCTT TTTCTTCAAG TTCTTTTTCC ATATTATGGT AAACAACTTC 1200

AGATTCATAT TGAGCAGAAA CACCAGCTAA ATATTTTTGT TCA6CTTCAG GAATTCCTAA 1260

TTTATCGAAA GTTCTTTTAA TTTCTTCTGG CACTTCATCC CATGAACGTT CAGCTTGTTC 1320

TGAAGGCTTT ACATAGTAAG TAATGTCATC GAAATTCAAT TCTGATAAGT CGCCACCCCA 1380

TTGAGGCATT GGCATTTTAT AAAACAATTT TAATGATTTA AGACGGAAAT CTAACATCCA 1440

TTCCGGCTCA TTTTTCATGT TAGAAATTTC TCTAACGATA TTCTCAGTTA AACCACGTTC 1500

TGATCTGAAA ATGGACACAT CATCGTCGTG GAATCCATAT TTATAATCCC CAACATCAGG 1560

TTTAATTCAT GATGTAAACC ATATTATAAC AATGACATGA CATCTTATAA AAAT-ITTTAT 1680

ACTTTTATAT GTCTAATATC AAAATTATCT ATGATTAACA GCATTCTATT CTTCTTCAGT 1740 

CGTACCTTCT GCTTTACCTT CTTTAGCAAC AGTACCTTTT TCCAATGCTT TCCAAGCTAA 1800 

TGTGGCACAT TTAATACGAG CTGGGAATTG AGATACACCT TGCAATGCTT CAATATCTCC 1860 

CATTTCTTCT GTAATCACAT AGTCTTCACC AAGCATCATT TTCGTAAATT CTTGGCTCAT 1920 

TTGCATTGCT TCTCCAAGTG AATGACCTTT AACAGCTTGT GTCATCATCG ATGCACTTGC 1980 

CATTGAAATC GAACAACCTT CACCTTCAAA CTTAGCATCT TTTATAATGC CGTCTTCTAT 2040 

ATCAAATGTT AGTCGTATAC GGTCACCGCA TGTCX3GGTTA TTCATATCTA CTGTCATAGA 2100 

CCCCTTATCT AATACACCTT TATTTCTAGG ATTTTTATAA TGATCCATAA TGACAGATCT 2160 

ATATAATTGA TCTAGATTAT TAAAATTCAT AAGAGAAAAA CTCCTTCGTT TGTTTCAAGG 2220 

CATTTATTAA CTGATCAAOG TCTTCTTTCG TGTTGTATAT ATAAAAACTC GCTCTAGCTG 2280 

TTGAAGACAC ATTTAACCAT TTCATTAACG GTTGCGCACA ATGATGCCCA GCTCTAACCG 2340 

CTACACCTTC TGTATCTACG GCTGTAGCAA CATCX3TGTGG ATGTACATCT TGTAAATTAA 2400 

ACGTTATTAC ACCTGCACGA CGATCCTTTG GCGGGCCATA AATTTCAATT CCTTCAATTG 2460

CAGACATTTG CTCATAAGCA TATATCXSTTA ATTCTTGTTC ATATTTATGA ATTGCATCAA 2520 

AACCTATGCG TTCTAAATAG CGAATAGCTT CTGCAAQCCC AATTGCTTGA GCAATTAATG 2580 

GAGTACCCGC CTCAAATTTA GTAGGTAAAT CAGCCCATGT TGCATCATAC TTACTTACAA 2640 

AATCAATCAT GTCGCCACCG AACTCAATCX5 GTTCCATTTT TTGTAGTAAC TCACGTTTAC 2700 

CAAATAATAC GCCAATACCT GTTGGTCCAA GCATTTTATG ACCACTAAAA CTATAAAAAT 2760 

CAGCATTCAT TTCTTGCATA TCAAGTTTCA TATGTGGTGC TGctTQCQCC CCATCAACAC 2820

TGATfiATTGC ACCATGTTGA TGAGCTATTT CTGCAATGGT TTTAACATCA TTAATTGTAC 2880 

CGAGCACATT AGATATATGT GCAATAGCAA CGATCTTTGT TTTATCATTA ATCGTTTGCT 2940 

TAATATCCTC GATGTTTAAT TCACCGTCAG CTGTCATTGG TATAAATTTC AATGTCGCAT 3000 

TTTTACGCTT TGCTAACTGT TGCCAAGGAA CAATATTGGC ATGATGTTCC ATTTCAGTGA 3060 

CAACAATTTC ATCGCCCTCT TCAACATTTG CATCACCATA GCTATGTGCT ACAAGGTTAA 3120

TCGACGCAGT TGTTCCX5CGT GTAAAAATGA TTTCTTCAAA ATACTTCGCA TTAATAAAAC 3180 

GACGAACGGT TTCACGGGCA TTTTCATAAC CATCAGTTGC CAATGATCCT AATGTATGAA 3240 

CACCACGATG AACGTTTGAA TTATAACGCT TGTAGTAATC TTCTAAAACA TTTAACACTT 3300

GCACAGGCXST TTGACTTGTC GCTGTTGAAT CAAGATATGC TAAACGTTTG CCATTGACTT 3360

CTTCATTCAC GACCTTa-CIT AAATAAAAAT CCTAATCATT TAAATACTGA CGTTGTATTA 3480

GTCTTATACC AATATCGACA GTCTATATCT ATTACAAACT TTTATTTTCA AAATATTATT 3540

TAGAAACTTT GCGTTCAATT ACTTCTCTCA ATTGACGTTT AACGTCTTCG ATAGGTAATT 3600

CACGTACTAC TGGATCTAAG AAACCATGTA TAACAAGACG TTCCGCTTCT CTTTGAGAAA 3660

TACCACGACT CATTAAATAG TAAAGTTGAT CTGGATCAAC ACGACCTACT GATGCAGCAT 3720

GACCAGCTTG TACATCATCT TCATCAATTA ATAAAATAGG ATTCGCGTCA CCACGAGCAT 3780

6TTCAGATAA CATTAATACA CGTGATTCCT GATTAGCAAT TGATTTAGTT CCACCATGCT 3840

TAAT6TAGCC GATACCATTA AATACA6ACG ATGCATGTTC TTTCATAACA CCATGTTTAA 3900

GGATATAACC ATCTGTTTCT TTACCATATT GTACGATTTT AGAT6TTAGA TTAAiVi-i'iT 3960

GTTCGCCTGT ACCTACAACT ACTGATTTAA GTGAACTTGT TGAACGATCA CCAAATAAAT 4020

TTGTTGTATT ATCAATAATT TGGCTACCCT CATTCATTAA ACCTAGTCCC CAATTAATTG 4080

AGGCATCCGC TTCAGTAATA CCACGTCGAA TGATATGACC TGTAAAGCCT TTATCCATAT 4140

AGTCCACTGA GCCATAT6TG ATATTTGAAT TTGCACCAGC AATCACTTCA GAAATAATAT 4200

TtAATTGATT TCCTTCACCA GATGCATTTO irtTAAGTAATT TTCAACATAT GTGACTTCGG 4260

CGCTTTCTTC AGTAACGATG 4280



(2) INFORMATION FOR SEQ ID NO: 82:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 15598 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82:


TCnGACTCGA ACGGTGmAAC TAttCCGTTG TaATTCCgGA GgAAsCAAGG TATGCCCATC 60

TGCaAAGAAA gaATGsAATG AACTTTTTGG AAATGTAGAA GTGGTAAATA AAGATAAAGG 120

ATATTACATT CTQAGAAGTA TAAAAGCTTG AAATGAAATG GATATTCTGT TATAGTTATA 180

TAATGTAAAA ATTTATGTTC AATAAGTGTG TACTTTTACG TTAAATAGAT AAGTTAATTA 240

AGAATAAATA TAGAATCGAA AATGGTGTCA TCATTAGTGT TGCCGTTTTC TTTTTGTCIT 300

TTTATTAATA TGCTTATGGT ATTTAGCTAA AAGCGGATCA CATAATTTTT GAGGGGTGAA 360

TCTGTTTGGC AGGTCAAGTT GTCCAATATG GAAGACATCG TAAACGTAGA AACTACGCGA 420

GAATTTCAGA AGTATTAGAA TTACCAAACT TAATAGAAAT TCAAACTAAA TCTTACGAGT 480



CTGGTAATTT GTCATTAGAG TTTGTGGATT ACCGTTTAGG AGAACCAAAA TATGATTTAG 600 

AAGAATCTAA AAACCGTGAC GCTACTTATG CTGCACCTCT TCGTGTAAAA GTGCGTCTAA 660 

TCATTAAAGA AACAGGAGAA GTTAAAGAAC AAGAAGTCTT TATGGGTGAT TTCCCATTAA 720 

TGACTGATAC AGGTACGTTC GTTATCAATG GTGCAGAACG TGTAATCGTA TCTCAATTAG 780 

TTCGTTCACC ATCCGTTTAT TTCAATGAAA AAATCGACAA AAATGGTCGT GAAAACTATG 840 

ATGCAACAAT TATTCCAAAC CGTGGTGCAT GGTTAGAATA TGAAACAGAT GCTAAAGATG 900 

TTGTATACGT ACGTATTGAT AGAACACGTA AACTACCATT AACAGTATTG TTACGTGCAT 960 

TAGGTTTCTC AAGCGACCAA GAAATTGTTG ACCTTTTAGG TGACAATGAA TATTTACGTA 1020

AXACTTTAGA GAAAGACGGC ACTGAAAACA CTGAACAAGC GTTATTAGAA ATCTATGAAC 1080 

GTTTACGTCC AGGTGAACCA CCAACTGTTG AAAATGCTAA AAGTCTATTG TATTCACGTT 1140 

TCTTTGATCC AAAACGCTAT GACTTAOCAA GCGTGGGTCG TTATAAAACA AACAAAAAAT 1200 

TACATTTAAA ACATCGTTTA TTTAATCAAA AATTAGCTGA GCCAATTGTA AATACTGAAA 1260 

CTGGTGAAAT TGTAGTTGAA GAAGGTACAG TGCTTGATCG TCGTAAAATC GACGAAATCA 1320 

TGGATGTACT TGAATCAAAT GCAAACAGCG AAGTGTTTGA ATTGCATGGT AGCGTTATAG 1380

ACGAGCCAGT AGAAATTCAA TCAATTAAAG TATATGTTCC TAACGATGAT GAAGGTCGTA 1440 

CGACAACTGT AATTGGTAAT GCTTTCCCTG ACTCAGAAGPT TAAATGCATT ACACCAGCAG 1500 

ATATCATTGC TTCAATGAGT TACTTCTTTA ACTTATTAAG CGGTATTGGA TATACAGATG 1560 

ATATTGACCA TTTAGGTAAC CGTCGTTTAC GTTCTGTAGG TGAATTACTA CAAAACCAAT 1620 

TCCGTATCGG TTTATCAAGA ATGGAAAGAG TTGTACGTGA AAGAATGTCA ATTCAAGATA 1680 

CTGAGTCTAT CACACCTCAA CAATTAATTA ATATTCGACC TGTTATTGCA TCTATTAAAG 1740 

AATtCTTTGG TAGCTCTCAA TTATCACAAT TCATGGACCA AGCAAACCCA TTAGCTGAGT 1800 

TAACGCATAA ACGTCGTCTA TCAGCATTAG GACCTGGTGG TTTAACACGT GAACGTGCTC 1860 

AAATGGAAGT ACGTGACGTT CACTACTCTC ACTATGGCCG TATGTGTCCA ATTGAAACAC 1920 

CTGAGGGACC AAACATTGGA TTGATTAACT CATTATCAAG TTATGCACGT GTAAATGAAT 1980 

TCGGCTTTAT TGAAACACCA TATCGTAAAG TTGATTTAGA TACACATGCT ATCACTGATC 2040 

AAATTGACTA TTTAACAGCT GACGAAGAAG ATAGCTATGT TGTAGCACAA GCAAACTCTA 2100 

AATTAGATGA AAATGGTCGT TTCATGGATG ATGAAGTTGT ATGTCGTTTC CGTGGTAACA 2160 

ATACAGTTAT GGCTAAAGAA AAAATGGATT ATATGGATGT ATCGCCGAAG CAAGTTGTTT 2220

CAGCAGCGAC AgcATGTATT CCATTCTTAG AAAATGATGA CTCAAACCGT GCATTGATGG 2280

CAGGTATGGA ACACGTTGCA GCACGTGATT CTGGTGCGGC TATTACAGCT AAGCACAGAG 2400

GTCGTGTTGA ACATGTTGAA TCTAATGAAA TTCTTGTTCG TCGTCTAGTT GAAGAGAACG 2460

GCGTTGAGCA TGAAGGTGAA TTAGATCGCT ATCCATTAGC TAAATTTAAA CGTTCAAACT 2520 

CAGGTACATG TTACAACCAA CGTCCAATCG TTGCAGTTGG AGATGTTGTT GAGTATAACG 2580 

AGATTTTAGC AGATGGACCA TCTATGGAAT TAGGAGAAAT GGCATTAGGT AGAAACGTAG 2640 

TAGTTGGTTT CATGACTTGG GACGGTTACA ACTATGAGGA TGCCGTTATC ATGAGTGAAA 2700 

GACTTGTGAA AGATGACGTG TATACTTCTA TTCATATTGA AGAGTATGAA TCAGAAGCAC 2760 

GTGATACTAA GTTAGGACCT GAAGAAATCA CAAGAGATAT TCCTAATGTT TCTGAAAGTG 2820 

CACTTAAGAA CTTAGACGAT CGTGGTATCG TTTATATTGG T6CAGAAGTA AAAGATGGAG 2880 

ATATTTTAGT TGGTAAAGTA ACGCCTAAAG GTGTAACTGA GTTAACTGCC GAAGAAAGAT 2940 

TGTTACATGC AATCTTTGGT GAAAAAGCAC GTGAAGTTAG AGATACTTCA TTACGTGTAC 3000

CTCACGGCGC TGGCGGTATC GTTCTTGATG TAAAAGTATT CAATCGTGAA GAAGGCGACG 3060 

ATACATTATC ACCTGGTGTA AACCAATTAG TACGTGTATA TATCGTTCAA AAACGTAAAA 3120 

TTCATGTTGG TGATAAGATG TGTGGTCGAC ATGGTAACAA AGGTGTCATT TCTAAGATTG 3180

TTCCTGAAGA AGATATGCCT TACTTACCAG ATGGACGTCC GATCGATATC ATGTTAAATC 3240 

CTCTTGGTGT ACCATCTCGT ATGAACATCG GACAAGTATT AGAGCTACAC TTAGGTATGG 3300 

CTGCTAAAAA TCTTGGTATT CACGTTGCAT CACCAGTATT TGACGGTGCA AACGATGACX3 3360 

ATGTATGGTC AACAATTGAA GAAGCTGGTA TGGCTCGTGA TGGTAAAACT GTACTTTATG 3420 

ATGGACGTAC AGGTGAACCA TTCGATAACC GTATTTCAGT AGGTGTAATG TACATGTTGA 3480 

AACTTGCGCA CATGGTTGAT GATAAATTAC ATGCGCGTTC AACAGGACCA TATTCACTTG 3540

tTACACAACA ACCACTTGGC GGTAAAGCGC AATTCGGTGG ACAACGTTTT GGTGAGATGG 3600 

AGGTATGGGC ACTTGAAGCA TATGGTGCTG CATACACATT ACAAGAAATC TTAACTTACA 3660 

AATCCGATGA TACAGTAGGA CGTGTGAAAA CATACGAGGC TATTGTTAAA GGTGAAAACA 3720 

TCTCTAGACC AAGTGTTCCA GAATCATTCC GAGTATTGAT GAAAGAATTA CAAAGTTTAG 3780 

GTTTAGATGT AAAAGTTATG GATGAGCAAG ATAATGAAAT CGAAATGACA GACGTTGATG 3840

ACGATGATGT TGTAGAACGC AAAGTAGATT TACAACAAAA TGATGCTCCT GAAACACAAA 3900 

AAGAAGTTAC TGATTAATAC GCAATTTACA AAACAGGCAA AAAGATACTA AGCTGAATTT 3960 

TATTGATGAT TCAGTTTAGT ACTTTAAGCC ATTTTAAATA AATGCAAATC AATCAAATAG 4020

CACAGCTAAT CTAAATTGAA GGAGGTAGGC TCCTTGATTG ATGTAAATAA TTTCCATTAT 4080

AAACCTGAAA CAATCAACTA CCGTACATTA AAACCTGAAA AAGATGGTCT ATTCTGTGAA 4200

AGAATTTTCG GACCTACAAA AGACTGGGAA TGTAGTTGTG GTAAATACAA ACGTGTTCGC 4260 

TACAAAGGCA TGGTCTGTGA CAGATGTGGA GTTGAAGTAA CTAAATCTAA AGTACGTCGT 4320 

GAAAGAATGG GTCACATTGA ACTTGCTGCT CCAGTTTCTC ACATTTGGTA TTTCAAAGGT 4380 

ATACCAAGTC GTATGGGATT ATTACTTGAC ATGTCACCAA GAGCATTAGA AGAAGTTATT 4440 

TACTTTGCTT CTTATGTTGT TGTAGATCCA GGTCCAACTG GTTTAGAAAA GAAAACTTTA 4500 

TTATCTGAAG CTGAATTCAG AGATTATTAT GATAAATACC CAGGTCAATT CGTTGCAAAA 4560 

ATGGGTGCAG AAGGTATTAA AGATTTACTT GAAGAGATTG ATCTTGACX3A AGAACTTAAA 4620 

TTGTTACX5CG ATGAGTTGGA ATCAGCTACT GGTCAAAGAC TTACTCGTGC AATTAAACGT 4680 

TTAGAAGTTG TTGAATCATT CC3GTAATTCA GGTAACAAAC CTTCATGGAT GATTTTAGAT 4740 

GTACTTCCAA TCATCCCACC AGAAATTCGT CCAATGGTTC AATTAGATGG TQGACGATTT 4800

GCAACAAGTG ACTTAAACGA CTTATACCGT CGTGTAATTA ATCGAAATAA TCGTTTGAAA 4860 

CGTTTATTAG ATTTAGGTGC ACCTGGTATC ATCQTTCAAA ACGAAAAACG TATGTTACAA 4920 

GAAGCCGTTG ACGCTTTAAT TGATAATGGT CX5TCGTGGTC GTCCAGTTAC TGGCCCAGGT 4980

AACCGTCCAT TAAAATCTTT ATCTCATATG TTAAAAGGTA AACAAGGTCX; TTTCCGTCAA 5040 

AACTTACTTG GTAAACGTGT TGACTATTCA GGACGTTCAG TTATTGCAGT AGGTCCAAGC 5100 

TTGAAAATGT ACCAATGTGG TTTACCAAAA GAAATGGCAC TTGAACTATT TAAACCATTC 5160 

GTAATGAAAG AATTAGTTCA ACGTGAAATT GCAACTAACA TTAAAAATGC GAAGAGTAAA 5220 

ATCGAACGTA TGGATGATGA AGTTTGGGAC GTATTGGAAG AAGTAATTAG AGAACATCCT 5280 

GTATTACTTA ACCGTGCACC AACACTTCAT AGACTTGGTA TTCAAGCATT TGAACCAACT 5340 

TTACTTGAAG GTCGTGCGAT TCGTCTACAT CCACTTGTAA CAACAGCTTA TAACGCTGAC 5400 

TTTGACGGTG ACCAAATGGC GGTTCACGTT CCTTTATCAA AAGAGGCACA AGCTGAAGCA 5460 

AGAATGTTGA TGTTAGCAGC ACAAAACATC TTGAACCCTA AAGATGGTAA ACCTGTAGTT 5520 

ACACCATCAC AAGATATGGT ACTTGGTAAC TATTACCTTA CTTTAGAAAG AAAAGATGCA 5580

GTAAATACAG 6CGCAATCTT TAATAATACA AATGAAGTAT TAAAAGCATA TGCAAATGGC 5640

TTTOTACATT TACACACTAG AATTGGTGTA CATGCAAGTT CGTTCAATAA TCCAACATTT 5700 

ACTGAAGAAC AAAACAAAAA GATTCTTGCT ACGTCAGTAG GTAAAATTAT ATTCAATGAA 5760 

ATCATTCCAG ATTCATTTGC TTATATTAAT GAACCTACX5C AAGAAAACTT AGAAAGAAAG 5820

ACACCAAACA GATATTTCAT CGATCCTACA ACTTTAGGTG AAGGTGGATT AAAAGAATAC 5880 
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GAAGTATTCA ACAGATTTAG CATCACTGAT ACATCAATGA TGTTAGACCG TATGAAAGAC 6000 

TTAGGATTCA AATTCTCATC TAAAGCTGGT ATTACAGTAG GTGTTGCTGA TATCGTAGTA 6060 

TTACCTGATA AGCAACAAAT ACTTGATGAG CATGAAAAAT TAGTCGACAG AATTACAAAA 6120

CAATTCAACC GTGGTTTAAT CACTGAAGAA GAAAGATATA ATGCAGTTGT TGAAATTTGG 6180 

ACAGATGCAA AAGATCAAAT TCAAGGTGAA TTGATGCAAT CACTTGATAA AACTAACCCA 6240 

ATCTTCATGA TGAGTGATTC AGGT6CCCGT GGTAACGCAT CTAACTTTAC ACAGTTAGCA 6300 

GGTATGCGTG GATTGATGGC CGCACCATCT GGJTAAGATTA TCGAATTACC AATCACATCT 6360 

TCATTCCGTG AAGGTTTAAC AGTACTTGAA TACTTCATCT CAACTCACGG TGCACGTAAA 6420 

GGTCTTGCCG ATACAGCACT TAAAACAGCT GACTCAGGAT ATCTTACTCG TCGTCTTGTT 6480 

GACGTGGCAC AAGATGTTAT TGTTCGTGAA GAAGACTGTG GTACTGATAG AGGTTTATTA 6540 

GTTTCTGATA TTAAAGAAGG TACAGAAATG ATTGAACCAT TTATCGAAC6 TATTGAAGGT 6600

CGTTATTCTA AAGAAACAAT TCGTCATCCT GAAACT6ATG AAATAATCAT TCGTCCTGAT 6660 

GAATTAATTA CACCTGAAAT TGCTAAGAAA ATTACAGATO CTGGTATTOA ACAAATGTAT 6720 

ATTCGCTCAG CATTTACTTG TAACGCACGA CATGGTGTTT GTGAAAAATG TTACGGTAAA 6780

AACCTTGCTA CTGGTGAAAA AGTTGAAGTT GGTGAAGCAG TTGGTACAAT TGCAGCCCAA 6840 

TCTATCGGTG AACCAGGTAC ACAGCTTACA ATGCGTACAT TCCATACAGG TGGGGTAGCA 6900 

GGTAGCGATA TCACACAAGG TCTTCCTCGT ATTCAAGAGA TTTTC6AAGC ACGTAACCcT 6960 

AAAGGTCAAG CGGTAATTAC GGAAATCGAA GGTGTC6TAG AAGATATTAA ATTAGCAAAA 7020 

GATAGACAAC AAGAAATTGT TGTTAAAGGT GCTAATGAAA CAAGATCATA CCTTGCTTCA 7080 

GGTACTTCAA QAATTATTGT AGAAATCGGT CAACCAGTTC AACGTGGTGA AGTATTAACT 7140 

GAATCTTCTA TTGAACCTAA GAATTACTTA TCTGTTGCTG GATTAAACGC GACTGAAAGC 7200 

TACTTATTAA AAGAAGTACA AAAAGTTTAC CGTATGCAAG GTGTAGAAAT CGACGATAAA 7260 

CACGTTGAGG TTATGGTTCG ACAAATGTTA CGTAAAGTTA GAATTATCGA AGCAGGTGAT 7320 

ACGAAGTTAT TACCAGGTTC ATTAGTTGAT ATTCATAACT TTACAGATGC AAATAGAGAA 7380

GCATTTAAAC ACCGTAAGCG TCCTGCAACA GCTAAACCAG TATTACTTGG TATTACTAAA 7440

GCATCACTTG AAACAGAAA6 TTTCTTATCT GCAGCATCAT TCCAAGAAAC AACAAGAGTT 7500 

CTTACAGATG CAGCAATTAA AGGTAAGCGT GATGACTTAT TAGGTCTTAA AGAAAACGTA 7560 

ATTATTGGTA AGTTAATTCC AGCTGGTACT GGTATGAGAC GTTATAGCGA OGTAAAATAC 7620

GAAAAAACAG CTAAACCAGT TGCAGAAGTT GAATCTCAAA CTGAAGTAAC GGAATAACAA 7680 
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ATGTTGACGA ATTCTCTTGT TCAATGTTAA TATATTAAAG GTTGATGCAA GCAGAACTTT 7800 

GGAGGATAAA TTATTGTCTA AGGAAAAAGT tGCACGCTTT AACAAACAAC ATTTTGTAGT 7860 

TGGTCTTAAA GAAACGCTTA AAGCGTTAAA GAAAGATCAA GTTACATCTT TGATTATTGC 7920

TGAAGACGTT GAAGTATATT TAATGACTCG CGTGTTAAGC CAAATCAATC AGAAAAATAT 7980 

ACCTGTATCT TTTTTCAAAA GCAAACATGC TTTGGGTAAA CATGTAGGTA TTAACGTCAA 8040 

TGCX3ACAATA GTAGCATTGA TTAAATGAGA ATTAGTAAGT GTTTTACTTA CTAAATTTTA 8100 

TTTAACCTAA AAATGAACCA CCTGQATGTG TGGGA1TAAA AAGTGAAGAG AGGAGGACAT 8160 

ATCACATGCC AACTATTAAC CAATTAGTAC GTAAACCAAG ACAAAGCAAA ATCAAAAAAT 8220 

CAGATTCTCC AGCTTTAAAT AAAGGTTTCA ACAGTAAAAA GAAAAAATTT ACT6ACTTAA 8280 

ACTCACCACA AAAACGTGGT GTATGTACTC GTGTAGGTAC AATGACACCT AAAAAACCTA 8340 

ACTCAGCGTT ACGTAAATAT GCACGTGTGc gTtTATCAAA CAACATCGAA ATTAACGCAT 8400

ACATCCCTGG TATCGGACAT AACTTACAAG AACACAGTGT TCTACTTGTA CGTGGTGGAC 8460 

GTGTAAAAGA CTTACCAGGT GTGCGTTACC ATATTGTACX3 TGGAGCACTT GATACTTCAG 8520 

GTGTTGACGG ACGTAGACAA GGTCGTTCAT TATACGGAAC TAAGAAACCT AAAAACTAAG 8580

AATTTAGTTT TTAATTAAAT CTTAAACTTA AAATATTTAA TATAAGGAAG GGAGGATTTA 8640 

CATTATGCCT CGTAAAGGAT CAGTACCTAA AAGAGACGTA TTACCAGATC CAATTCATAA 8700 

CTCTAAGTTA GTAACTAAAT TAATTAACAA AATTATGTTA GATGGTAAAC GTGGAACAGC 8760 

ACAAAGAATT CTTTATTCAG CATTCGACCT AGTTGAACAA C6CAGgtTCX3 TQATGCATTA 8820 

GAAGTATTCG AAGAAGCAAT CAACAACATT ATGCCAGTAT TAGAAGTTAA AGCTCGTCGC 8880 

GTAGGTGGTT CTAACTATCA AGTACCAGTA GAAGTTCGTC CAGAGCGTCG TACTACTTTA 8940 

GGTTTACGTT GGTTAGTTAA CTATGCACGT CTTCGTGGTG AAAAAACGAT GGAAGATCGT 9000 

TTAGCTAACG AAATTTTAGA TGCAGCAAAT AATACAGGTG GTGCCGTTAA GAAACGTGAG 9060 

GACACTCACA AAATGGCTGA AGCAAACAAA GCATTTGCTC ACTACCGTTG GTAAGATAAA 9120 

AGCTTTTACC CTGAGTGTGT TCTATATTAA TGAATTTTCA TTAAGCGTTC ATGCTTAGGG 9180 

CATCGCCATA TCTATCGTAT TTATTCAGTA ATATAAACTG GAAGGAGAAA AAATACATGG 9240

CTAGAGAATT TTCATTAGAA AAAACTCGTA ATATCGGTAT CATGGCTCAC ATTGATGCTG 9300 

GTAAAACGAC TACGACTGAA CGTATTCTTT ATTACACTQG CCGTATCCAC AArGknGGTG 9360 

AAaCACACGA AGGTGCTTCA CAAATGGACT GGATGGAGCA AGAACAAGAC CGTGGTATTA 9420

CTATCACATC TGCTGCAACA ACAGCAGCTT GGGAAGGTCA CCGTGTAAAC ATTATCGATA 9480 
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CAGTTACAGT ACTTGATGCA CAATCAGGTG TTGAACCTCA AACTGAAACA GTTTGGCGTC 9600 

AGGCTACAAC TTATGGTGTT CCACGTATCG TATTTGTAAA CAAAATGGAC AAATTAGGTG 9660 

CTAACTTCGA ATACTCTGTA AGTACATTAC ATGATCGTTT ACAAgCTAAC GCTGCTCCAA 9720

TCCAATTACC AATTGGTGCG GAAGACGAAT TCGAAGCAAT CATTGACTTA GTTGAAATGA 9780 

AATGTTTCAA ATATACAAAT GATTTAGGTA CTGAAATTGA AGAAATTGAA ATTCCTGAAG 9840 

ACCACTTAGA TAGAGCTGAA GAAGCTCX5TG CTAGCTTAAT CGAAGCAGTT GCAGAAACTA 9900 

GCGACGAATT AATGGAAAAA TATCTTGGTG ACGAAGAAAT TTCAGTTTCT GAATTAAAA6 9960 

AAGCTATCCG CCAAGCTaCt AcTAACXSTAG AATTCTACCC AGTACTTTGT GGTACAGCTT 10020 

TCAAAAACAA AGGTGTTCAA TTAATGCTTG ACGCTGTAAT TGATTACTTA CCTTCACCAC 10080 

TAGACGTTAA ACCAATTATT GGTCACCGTG CTAGCAACCC TGAAGAAGAA GTAATCGCGA 10140 

AAGCAGACGA TTCAGCTGAA TTCGCTGCAT TAGCGTTCAA AGTTATGACT GACCCTTATG 10200

TTGGTAAATT AACATTCTTC CGTGTGTATT CAGGTACAAT GACATCTGGT TCATACGTTA 10260 

AGAACTCTAC TAAAGGTAAA CGTGAACGTG TAGGTCGTTT ATTACAAATG CACGCTAACT 10320 

CACGTCAAGA AATCGATACT GTATACTCTG GAGATATCGC TGCTGCGGTA GGTCTTAAAG 10380

ATACAGGTAC TGGTGATACT TTATGTGGTG AGAAAAATGA CATTATCTTG GAATCAATGG 10440 

AATTCCCAGA GCCAGTTATT CACTTATCAG TAGAGCCAAA ATCTAAAGCT GACCAAGATA 10500 

AAATGACTCA AGCTTTAGTT AAATTACAAG AAGAAGACCC AACATTCCAT GCACACACTG 10560 

ACGAAGAAAC TGGACAAGTT ATCATCGGTG GTATGGGTGA GCTTCACTTA GACATCTTAG 10620 

TAGACCGTAT GAAGAAAGAA TTCAACGTTG AATGTAACGT AGGTGCTCCA ATGGTTTCAT 10680 

ATCGTGAAAC ATTCAAATCA TCTGCACAAO TTCAAGGTAA ATTCTCTCGT CAATCTGGTG 10740 

GTCGTGGTCA ATACGGTGAT GTTCACATTG AATTCACACC AAACGAAACA GGCGCAGGTT 10800 

TCGAATTCGA AAACGCTATC GTTGGTGGTG TAGTTCCTCG TGAATACATT CCATCAGTAG 10860 

AAGCTGGTCT TAAAGATGCT ATGGAAAATG GTGTTTTAGC AGGTTATCCT TTAATTGATG 10920

TTAAAGCTAA ATTATATGAT GGTTCATACC ATGATGTCGA TTCATCTGAA ATGGCCTTCA 10980 

AAATTGCTGC ATCATTAGCA CTTAAAGAAG CTGCTAAAAA ATGTGATCCT GTAATCTTAG 11040 

AACCAATGAT GAAAGTAACT ATTGAAATGC CTGAAGAGTA CATGGGTGAT ATCATGGGTG 11100 

ACGTAACATC TCGTCGTGGA CGTGTTGATG GTATGGAACC TCGTGGTAAT GCACAAGTTG 11160 

TTAATGCTTA TGTACCACTT TCAGAAATGT TCGGTTATGC AACATCATTA CGTTCAAACA 11220

CTCAAGGTCG CGGTACTTAC ACTATGTACT TCGATCACtA TGCTOAAGTT CCaAAATCaA 11280 
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10 



GCCTAGGTTA AAATACAAGG TGAGCTTAAA TGTAAGCTAT CATCTTTATA GTTTGATTTT 11400

TTGGGGTGAA TGCATTATAA AAGAATTGTA AAATTCTTTT TGCATCGCTA TAAATAATTT 11460

CTCATGATGG TGAGAAACTA TCATGAGAGA TAAATTTAAA TATTATTTTT AATTAGAATA 11520

GGAGAGATTT TATAATGGCA AAAGAAAAAT TCGATCGTTC TAAAGAACAT GCCAATATCG 11580

GTACTATCGG TCACGTTGAC CATGGTAAAA CAACATTAAC AGCAGCAATC GCTACTGTAT 11640

TAGCAAAAAA TGGTGACTCA GTTGCACAAT CATATGACAT GATTGACAAC GCTCCAGAAG 11700

AAAAAGAACG TGGTATCACA ATCAATACTT CTCACATTGA GTACCAAACT GACAAACGTC 11760

ACTACGCTCA CGTTGACTGC CCAGGACACG CTGACTACGT TAAAAACATG ATCACTGGTG 11820

CTGCTCAAAT GGACGGCGGT ATCTTAGTAG TATCTGCTGC TGACGGTCCA ATGCCACAAA 11880

CTC6TGAACA CATTCTTTTA TCACGTAACG TTGGTGTACC AGCATTAGTA GTATTCTTAA 11940

ACAAAGTTGA CATGGTTGAC GATGAAGAAT TATTAGAATT AGTAGAAAT6 GAAGTTCGTG 12000

ACTTATTAAG CGAATATGAC TTCCCAGGTG ACGATGTACC TGTAATCGCT GGTTCAGCAT 12060

TAAAAGCTTT AGAAGGCGAT GCTCAATACG AAGAAAAAAT CTTAGAATTA ATGGAAGCTG 12120

TAGATACTTA CATTCCAACT CCAGAACXSTG ATTCTGACAA ACCATTCATG ATGCCAGTTG 12180

AGGACGTATT CTCAATCACT GGTCGTGGTA CTGTTGCTAC AGGCCGTGTT GAACGTGGTC 12240

AAATCAAAGT TGGTGAAGAA GTTGAAATCA TCGGTTTACA TGACACATCT AAAACAACT6 12300

TTACAGGTGT TGAAATGTTC CGTAAATTAT TAGACTACXX; TGAAGCTGGT GACAACATT6 12360

GTGCATTATT ACGTGGTGTT GCTCGTGAAG ACGTACAACG TGGTCAAGTA TTAGCTGCTC 12420

CTGGTTCAAT TACACCACAT ACTGAATTCA AAGCAGAAGT ATACGTATTA TCAAAAGACG 12480

AAGGTGGACG TCACACTCCA TTCTTCTCAA ACTATCGTCC ACAATTCTAT TTCCGTACTA 12540

CTGA66TAAC TGGTGTTGTT CACTTACCAG AAGGTACTGA AATGGTAATG CCTGGTGATA 12600

ACGTTGAAAT GACAGTAGAA TTAATCGCTC CAATCGCGAT TGAAGACGGT ACTCGTTTCT 12660

CAATCCGTGA AGGTGGACGT ACTGTAGGAT CAGGCGTTGT TACTGAAATC ATTAAATAAT 12720

TTCTAATTTC TTAGATTTTA TATAAAAAGA AGATCCCTCA ATCGAGGGGt CTTTTTTTAA 12780

TGTGTAAATT TTGTAATGGC TATTCGATTT AGAAGAACAA TAATTGATGA AAGACTGACT 12840

AATAAAACTT ATAACTGATA ATACTGTTTA AATAAAATTG TTGAGTCTTG GACATTGTAA 12900

AATGCTCCCT TCAAAGTTTT CATTTTTTCa ATGTCTACTT TGAAGGGAGC ATTTCATTAG 12960

TTTATGTCTC AGATTCATAT CTTTCAATTA ATTTAAATGC TTAATTTGTT TTAAATACTT 13020

GCTCTAATTC TATGATTTTT AAAAATACAG CTACAGCGTA TTTTAATGAT TTTTCATCAA 13080



TCAGAAAGAA TGCACCTGGT CGTACTTTCA AATAATOTGA AAAATCTTCT CCAATCATCA 13200 

TTAAATCTGA TTCATTAAAG CGTACATGTA AGTCATTTGT TGCTTCTTTA ATAACTTGAT 13260 

ATGCTTTCTC OTTATTATGG ACAGGCAAAT ACCCTTTAAT ATAATTCAAA TCATAGTTAA 13320

TATCATTTGC TATTGCTAAA CCTTGTAGAA GCTTATCCAT TTTGTCCATT ACATGATTCT 13380 

GTATATCTGA ATCGAAAGTT CTAACTGTAC CTTTACAAAA TGCTTGATCA GGAATAACGC 13440 

TATCTGTGGT GCCTGCTTGA ATCATTCCAA ATGAAAGTAC AGCTTGTTTA ACTGOATCGA 13500 

TCGTACGTGA AATTATTTTT TGTGCACTTA AAATGAACTC TGCCATGATT ACTATTGGGT 13560 

CAATGGTTTC ATGAGGTTTG GCACCATGAC CACCACGACC TTTAAATGTG ACGCTAAATT 13620 

CATCTGGAGA GGCCATGATT GCCCCCGCAC GTGAATGAAT AGTTCCAGTA GGATAACCAC 13680 

TCCATAAATG TGTACCGTAA ATTCTATCTA CATTTTCCAG ACATCCAGCA TCTATCATTT 13740 

CTTGAGAACC ACCTGGCATG ATTTCTTCAC CQTACTOGAA TATTAATACA ACATTACCTT 13800

CTAATAAATG TTTATGTTCA TCTAAAATCT CTGCTACAGT AAGTAAAATT GCTGTATGAC 13860 

CATCATGCCC ACACGCATGC ATACATCCTG GATTTTTAGA CTTATAAGGC ACATCX5TTTA 13920 

ATTCCTCGAC AGGTAACX3CA TCAAAGTCAG CTCTTAATGC AATGGTAGGT CCTGTGCCCA 13980

AGCCTTTAAA TGTGGCTTTG ATACCATTGC GGCCGATAGG AGTTTCAATA TCACAAGATA 14040 

ACTGGCTTAA TTGGTTAACA ATATAATCAT GTGTTTGAAA TTCTTCAAAA QATAACTCAG 14100 

GATATTGGTG TAAATAACGT CTGAGTTGAA TTGTTTTATT TTCTTTATTA TTTGCTAGTT 14160

GGAACCAATC TAACACCCTT ATCACTACTT TCTAAAATAA TGTTTATAGT ATAACATTTT 14220 

ATGAAATTAT CGTACTAAAT GATTGCTTTG AGATATTTTA TCTATGAATG ATAAGGCTTT 14280 

CAAGTTATGT AGAATTACTG TATGATAAAG GTATTACCAA ACAATACTTA AGGGGGATTA 14340

TATACTGTGG TTCAATCATT ACATGAGTTT TTAGAGGAAA ATATAAATTA TCTAAAAGAA 14400 

AATGGT7TGT ATAATGAAAT AGATACAATT GAAGGTGCAA ACGGACCAGA AATCAAAATC 14460 

AATGGGAAAT CATACATTAA CTTATCTTCA AATAATTATT TAGGACTAGC AACAAATGAA 14520 

GATTTGAAAT CaGctGCAAA AGCAGCTATT GATACACATG GTGTAGGTGC AGGCGCTGTT 14580 

CGTACAATCA ATGGTACATT AGATTTACAC GACGAATTAG AAGAAACACT AGCAAAATTT 14640 

AAAGGAACAG AAGCTGCAAT AGCTTATCAA TCAGGATTTA ATTGTAATAT GGCTGCTATT 14700 

TCAGCTGTCA TGAATAAAAA TGATGCTATT TTATCAGATG AGCTTAATCA TGCATCAATT 14760 

ATTGATGGAT GTCGCTTATC TAAAGCTAAA ATTATTCGAG TTAACCATTC AGACATGGAT 14820

GATTTACGTG CGAAAGCAAA AGAAGCAGTT GAATCAGGTC AATACAATAA AGTGATGTAT 14880 
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5 



ATTGCAGAAG AATTrGGTTT ATTAACTTAT GTTGACGAOG CTCATGGTTC AGGTGTTATG 15000

GGTAAAGGCG CTGGTACGGT TAAACATTTT GGTTTACAAG ATAAAATCGA TTTCCAAATA 15060

GGTACGCTTT CTAAAGCAAT TGGTGTCGTT GGCGGTTATG TAGCAGGTAC AAAAGAGTTA 15120

ATAGATTGGT TAAAAGCACA ATCACGACCA TTCTTATTCT CTACATCATT AGCACCTGGG 15180

GATACCAAAG CAATAACTGA AGCAGTTAAA AAGTTAATGG ATTCAACTGA ATTACATGAT 15240

AAATTATGGA ACAATGCACA ATATTTAAAA AATGGATTGT CAAAATTAGG ATATGATACA 15300

ATTATTGGTG ATGAAAAAAC AACTCAAGAA

ITTAGTAAGC GTT7AAAAGA CGAAGGTGTC TATGT6AAAT CTATCGTTTT CCCAACAGTA 15420

CCAAGAOGTA CAGGAGGTGT AAGAAATATG CCTACAGCTG CACATACAAA AGACATGTTA 15480

GATGAA6CAA TTGCGGCTTA TGAAAAAGTA GGAAAAGAAA TGAAGTTQAT TTAATATTTA 15540

TTTATTCCCA CGGCAAATAT TGTCGTGGGC TTTTTTTAAT GTTTAGTTTA TTAACAGT 15598



(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 661 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83:

AAGTAAATCA ACTTACTGGG ATAAGAATAA AGG06ATTAT AGTAACAA6T TGATTTTATT 60 

CGAAAAACAT TTTGAACCGG TTCTGGGTAT CAAGATGCAA CATAGTGGAG GTCATAGCTT 120 

TGGCCACACG ATTATTACGA TTGAAAGTCA AGGAGATAAA GCAGTTCATA TGGGTGATAT 180 

ATTCCCAACT ACTGCACATA AAAATCCTCT ATGGGTAACG GCATATGATG ATTATCCTAT 240 

GCAATCGATT CGTGAAAAAG AACGCATGAT ACCATATTTT ATICAGCAAC AATATTGGTT 300 

CTTGTTTTAT CATGATGAAA ACTACTTTGC TGTAAAATAC AGCGATAATG GTGAAAACAT 360 

AGATGCATAT ATTTTACGTG AAACATTAGT TGATAATAAC TAAAATAAAG ATGTATTACT 420 

AAACAAATTT TCAAAAATAA AAAATTGAGC CACATCCAAT CTTACTAATT AGGGTGTGGC 480

TCATTTTTAA GTTTTACgAT CCAAATCAAA TATGGaTAAA ATTCgTATTA ACGCTCTACa 540 

ATGtTAATGA CTTCACCAGT ATATGCATCT GCATAAAAAT CATAATGAAT ATTTTGACCA 600 

TTTTTAATAG TTGTAATTCC ACCTTOATAA ACTAAACGGT ATTTATCAGT TTCAGGATGA 660

A 661 

(2) INFORMATION FOR SEQ ID NO: 84:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5738 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

GCAGACGGTA CAGCAGTTAA AGTCGCACCA AaACTGTAGT GAATcTAATC GGTGcATTCT 60 

TTTTAGGATT AGTTGTCGCG CTTATATATA TCTTCTTCAA AGTAATTTTC GATAAGCGAA 120 

TTAAAGATGA AGAAGATGTA GAGAAAGAAT TAGGATTGCC TGTATTGGGT TCAATTCAAA 180 

AATTTAATTA A6GATGGTTG CTACTTATGT CAAAAAAGGA AAATACGACA ACAACACTAT 240 

TTGTATATGA AAAACCAAAA TCAACAATTA GTGAAAAGTT TCGAGGTATA CGTTCAAACA 300 

TCATGTTTTC AAAAGCAAAT GGTGAAGTAA AGCGCTTATT GGTTACTTCT GAAAAGCCTG 360

GTGCAGGTAA AAGTACAGTT GTATCGAATG TAGCGATTAC TTATGCACAA OCAGGCTATA 420 

AGACATTAGT TATTGATGGC GATATGCGTA AgcCAACACA AAACTATATT TTTAATGAGC 480 

AAAATAATAA TG6ACTATCA AGCTTAATCA TTGGTCGAAC GACTATGTCA GAAGCAATTA 540

CGTCGACAGA AATTGAAAAT TTA6ATTTGC TAACAGCTGG CCCTGTACCT CCAAATCCAT 600 

CTGAGTTAAT TGGGTCTGAA AGGTTCAAAG AATTAGTTGA TCTGTTTAAT AAACGTTACG 660 

ACATTATTAT TGTCGATACA CCGCCAGTTA ATACTGTGAC TGATGCACAA CTATATGCGC 720

GTGCTATTAA AGATAGTCTG TTAGTAATTG ATAGTGAAAA AAATGATAAr AATGAAGTTA 780 

AAAAAGCAAA AGCACTTATG GAAAAAGCAG GCAGTAACAT TCTAGGTGTC ATTTTGAACA 840 

AGACAAAGGT CGATAAATCT TCTAGTTATT ATCACTATTA TGGAGATGAA TAAGTATGAT 900 

TGATATTCAT AACCATATAT TGCCTAATAT GGATGACGGT CCGACAAATG AAACAGAGAT 960 

GATGGATCTT TTAAAACAAG CGACAACACA AGGTGTTACA GAAATCATTG TAACATCACA 1020 

TCACTTACAT CCTCGATATA CCACACCTAT AGAAAAAGTG AAATCATGTT TAAACCATAT 1080 

TGAAAGCTTA GAGGAAGTAC AAGCACTAAA TCTAAAGTTT TATTATGGTC AGGAAATAAG 1140 

AATTACCGAT CAAATCCTTA ATGATATTGA TCGAAAAGTT ATTAACGGTA TTAATGATTC 1200

ACGCTATTTA CTAATAGAAT TTCCATCAAA TGAAGTTCCA CACTATACTG ATCAATTATt 1260 

TTTCGAATtA CAGAGTAAAG GCTTTGTACC GATTATTGCA CATCCAGAGC GGAATAAAGC 1320 

AATAAGTCAA AACCTTGACA TACTATACGA TTTAATTAAC AAAGGTGCTT TAAGTCAAGT 1380

GACAACGGcG TCATTAGCGG GTATTTCCGG TAAAAAAATT AGAAAATTAG CAATTCAAAT 1440 
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10 



GTTCTTAATG AAAGACTTAT TTAATGATAA GAAATTACGT GATTATTATG AAGATATGAA 1560 

CGGATTTATT AGTAATGCGA AGTTAGTTGT TGATGATAAA AAAATTCCTA AACGAATGCC 1620 

ACAACAAGAT TATAAACAGA AAAGATGGTT TGGGTTATAA ACAGCAAATG AGGGGTTTTA 1680 

TGGCACATTT ATCTGTGAAA TTGCGGCTTT TAATACTAGC ATTAATCGAT TCACTGATAG 1740 

TGACATTTTC AGTATTCGTA AGTTATTACA TTTTAGAACC GTATTTCAAA ACATATTCTG 1800 

TCAAATTATT AATATTGGCA GCTATATCAC TATTCATATC GCATCATATT TCaGCATTTA 1860 

TTTTTAATAT GTATCATCGA GCGTGGGAAT ATGCCAGTGT GAGTGAATTG ATTTTAATTG 1920 

TTAAAGCTGT GACX3ACATCT ATCGTTATTA CGATGGTGGT CGTGACAATT GTTACAGGCA 1980 

ATAGACCXSTT TTTTAGATTG TATTTAATTA CTTGGATGAT GCACTTGATT TTAATAGGTG 2040 

GCTCAAGGTT ATTTTGGCX3T ATTTATCGGA AATACCTTGG AGGTAAGTCA TTTAATAAGA 2100 

AGCCAACrrr AGTTGTTGGT GCTGGTCAAG CAGGTTCAAT GCTGATTAGA CAAATGTTGA 2160 

AAA6TGACGA AATGAAACTT GAACCGGTAT TAGCAGTCGA TGATGAGGAA CATAAACGCA 2220 

ATATCACAAT TACTGAGGGT GTAAAAGTCC AAGGTAAAAT TGCX^GATATT CCAGAACTAG 2280 

TGAGQAAATA TAAGATTAAA AAAATCATCA TTGCAATTCC AACTATTGGT CAAGAGCGTT 2340

TGAAAGAAAT TAATAATATT TGCCATATGG ATGGCGTTGA GTTATTGAAA ATGCCAAATA 2400 

TAGAAGACGT CATGTCTGGT GAGTTAGAAG TGAACCAACT TAAAAAAGTT GAAGTAGAAG 2460 

ATTTACTAGG CAGAGATCCT GTTGAATTAG ATATGGATAT GATATCAAAT GAATTGACGA 2520 

ATAAAACTAT TTTAGTTACG GGTGCAGGTG GTTCAATAGG ATCAGAAATT TGTAGACAAG 2580 

TTTGTAATTT CTATCCAGAA CGTATTATTC TACTTGGCCA TGGTGAAAAC AGTATTTATT 2640 

TAATCAATCG TGAATTQCGA AATCGCTTCG GwAAAAATOT TGATATCGTT CCTATTATAG 2700 

CGGATGTGCA AAATAGAGCG CGTATGTTTG AAATTATGGA AACGTATAAA CCATACGCAG 2760 

TTTATCATGC AGCAGCACAC AAGCACGTGC CGTTAATGGA AGACAACCCT GAAGAAGCAG 2820 

TACGTAATAA TATTTTAGGT ACGAAAAATA CTGCTGAAGC TGCTAAAAAT GCAGAGGTAA 2880 

AGAAATTCGT TATGATTTCT ACGGATAAAG CCGTTAATCC GCCTAATGTC ATGGGAGCTT 2940 

CAAAGCGAAT TGCAGAAATG ATTATTCAAA GTTTAAATGA TGAAACGCAT CGAACAAATT 3000 

TTGTTGCAGT GAGATTTGGT AATGTACTTG GATCGAGAGG ATCTGTGATT CCACTTTTCA 3060 

AAAGTCAAAT TGAAGAAGGT GGGCCAGTTA CTGTGACACA TCCTGAAATG ACACGTTACT 3120 

TTATGACAAT TCCTGAAGCT TCTAGACTAG TTTTGCAGGC AGGGGCATTA GCAGAAGGTG 3180

GCX^AAGTATT TGTGCTAGAT ATOGGAGAAC CAGTGAAAAT TGTAOATTTG GCACGTAATT 3240 
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ATTGAAAATG GTGTACCTAC AGTACCAAAC GTATTTAAAG TGACATTGGG AGAAATTGTA 5160 

GATTTATTAT ACAAGTTCAA ACAGTCACGT CTCGATCGAA CATTGCCGAA ATTAGATAAC 5220 

TTGTTTGAAA AAGATTTGTA TAGTACGTAT TTAAGCTATC TACCTAGTAC aGACTTTAGT 5280 

TAyCCCTTAC TTATGAATGT GGATGATAGG GGTTCTTTTA CAGAATTTAT AAAAACACCG 5340 

GATCX5TGGTC AAGTTTCTGT AAATATTTCT AAACCAGGTA TTACTAAAGG TAATCACTGG 5400 

CATCATACTA AAAACGAAAA ATTTCTAGTC GTATCAGGTA AAGGGGTAAT TCGTTTTAGA 5460 

CATGTTAATG ATGATGAAAT CATTGAATAT TATGTTTCTG GCGACAAATT AGAAGTTGTA 5520 

GACATACCAG TAGGATACAC ACATAATATT GAAAATTTAG GCX^ACACAGA TATGGTAACT 5580 

ATTATGTGGG TGAATGAAAT GTTTGATCCA AATCAGCCAG ATACX5TATTT CTTGGAGGTA 5640 

TAGCGCATGG aAAAACTGAA rTTAATGACA ATAGTTGGTA CAAGGCCTGA AATCATTCGT 5700 

TTATCATCAA CGATTAAAGC ATGTGATCAA TATtTTAA 5738 
(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9062 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 

ATCATCAACA AGAATGATAT TTTTCCCATC TACTATATCT TTTACCGCAG ATAACTTCAC 60 

TCTCACACCT TGCTCACGTA ATTCTTGAGT TGGTTGAATA AATGTTCTTG CAACATATTG 120 

ATTTTTAACT AGTCCCATTT CATATGGCAA ACCTATTTCT TCAGCATAAC CACTCOCAGC 180 

TGAT^GCGAT gAATTGGGTA CACCGATGAC CATATCAGCA TTTACAGGGC TTTCTTGGGC 240 

TAATTTTTTA CCAGAAGCTT TACGTACTGC ATGGACATTT TTACCAGCTA TTGTTGAGTC 300 

TGGTCTAGCA AAATAAATAT ATTCCATCGC AGAAATTGCA GTTGTCGTAT GATGTGTATA 360 

AGATTTAACT GTAATACCTT TATCGTTAAT CACGACATAT TCACCTGCAT GAATATCTTG 420 

AACAAATTCT GCACCTAACA CATCTATTGC ACATGTTTCA CTTGCAAGGA TGTATGTCCC 480 

ATCTTTCATT TTACCTACAA CAAGTGGTCT GATAGCATTT GGATCTACTG CGCCATATAA 540

CGCATCTTTA GITAAAATCG CAAATGTAAA ACCGCCTTTA ACTTTTCGCA AACTTTCTTT 600 

CAACGCTTCC TCAAAAGTAG GAGCTTTACT TOGAOGTATC AAATGCATAA TGACTTCAGT 660 

ATCAGAAGAC GAATGGAAGA TAGCACCTTG TTTTTCTAAA TTCTGACGCA ATGATTTAGC 720 
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CGGTTGAATA TTTTCAATAC CTTTATTACC TGAAGTAGCA TAACGGACGT GACCAATTGC 840 

ATGTTGATAT CCTTTTAATC GTTCCATTTG ATCATCTTTA ATCGCTTCAG TTAGTAAGCC 900 

TAATCCTCGC TCGCCTTTTA ATTCATTTTG ATCAGAAACA ACTATACCTG CACCTTCTTG 960

ACCACGATGT TGCAAACTAT GAAGTCCCAT ATAtGTTAGT TGCGCTGCtT CaGGATGATT 1020 

CCAAATACCA AACACGCCAC ATTCTTCGTT TAATCCTGAG TAGTTAAACA TTGaGCAATT 1080 

GCCCCtTCCC ATATTTGTTT AATATCTGAA ACATTTTCAC TAATCTCTGT aTATGGTGTT 1140 

GTTACCTTGr aATTATCACT ATCTGITAAA AGTCCAAITT CTATTGCATT ATCAATATTT 1200 

AAAGTTTTAC CTGATTTAAC AGAAACAACA TATCGGCCTT GCX5TCTCACT AAACAATTGT 1260 

GCATTTGTTA TATCTATTGA AGATTTTAAT CCTAAACCGT AATGCGCACT TAGTTTAQCT 1320 

AAGGTAATCA GTAAGCCACC TTTACCAACT GTTTGAACAT GTGATAATAG TCCTTCACGA 1380 

ATAGOGGTCT T6ATTGATTC ACCTTTTTCA ACTTC7GAAC TCAAATCTAA TGACTCAAAT 1440

TCATGATTAA CTTTGCCATA AATTAACTTT TCAAGTTGAC TACXACCAAA GTCGTCCTTA 1500 

GTATCACCGA TTAAATATAA TTTATCTCCA ACTTGAGGTT CAAAATCATT TAAATAATTT 1560 

ACATTTTCAA TCAAACCTAC CATTCCAACA ACTGGTGTT6 GGAAAATAGA AGTACCTTTC 1620

GTTTCGTTAT ATAAAGATAC ATTACCAGAA ACTACTGGTG TCTTAAGAAT GTCX3CATGCT 1680 

TCTGCCATAC CTTTCGTTGA ATCTATCAAC TGTTGATAGA ITTCrrrtTr TTCAGGAGAA 1740 

CCATAATTTA AACAATCTGT CATTGCTAAT GGTGTTGCAC CCACGGCAAT TAAATTTCGA 1800

TAAGCTTCAG CTACTACCAT CTTTCCACCT TCATATGGAT TGTTATATAC ATAACGCGCT 1860 

TCACCATCAA TTGTTGAAGC AATTGCCTTA TTTGTGCCTT CCACACGTAC TACCGATGCT 1920 

TGAAGTCCTG GCTTAATTAT CGTATTGGCA CCAACTTGTT GGTCGTATTG ATCATATAAA 1980 

TAGTCTTTAG ATGCTATAGT CGQATGCTTA AGTAATTTAA AOAAAGTATC TTTAACATCG 2040 

ATGTGTGTAT AATCATTTTT AGAAGTATTA TAATCTTTTT CTTCTCCTTC TAAAATATAT 2100 

ACAGGTGCrr CATCAGCTAG TGGTTCAACT GGAATGTCAG CATAAACTTC GTCATCATAT 2160 

GTTAAAACAA AACGATTTGT ATCTGTAACT TCACCTATAA CAGCACTATC CAATTCGTGC 2220 

TTATCAAATA AATCTAAGAA TTTTTGTTCA GTACCTTTTT CAACAACTAG TAACATACGT 2280

TCTTGAGTTT CTGAAAGCAT CATTTCATAA GGAGAAATAC CTGGCTCACG TGTTGGCACT 2340 

TGTTCTAATC TCAAATGTAA CCCACTACCA CCTTTTGCCG CCATTTCAGA CGATGAAGAT 2400 

GTTAAACCAG CAGCACCCAT ATCTTGAATA CCAACTAATT CATCAAATGT AATTGCTTCA 2460

AGTGTTGCTT CCATTAATTT TTTACCTACA AATGGATCAC CGATTTGTAC AGAAGGTCGT 2520 
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10 



CGACCAGTTT TCAAACCAAC ATAAATGACC GAATTACCTA CACCTTTTGC TGTGCCTTTT 2640 

TGAATCATGT CGTGATTGaT AACACCAACA CACATTGCAT TAACAAGTGG ATTGCCATCA 2700 

TAACGTTCAT CAAATTCGAT TTCACCAGCA GTTGTTGGaA TACCAATGCA GTTACCATAA 2760 

CCTCCGATAC CCTTTACAAC ACCTTTAAGT AATCTTTGGT TTTGTTTATT ATCTAATTCT 2820

CCAAATCTAA GACTGITTAA CAAATTAATA GGTCTAGCCC CAATAGAGAC AATGTCACGA 2880 

ATGATTCCAC CAACGCCTGT AGCAGCCCCT TGATATGGTT CAATTGCTGA TGGATGATTG 2940 

TGAGACTCTA CTTTAAATAC TACGGCTTGA TTATCACCTA TATCGACTAC CCCTGCACCT 3000 

TCACCAGGCC CCATAAGCAC ATGGTcACCT GACX5TAGGAA ATTGCTTTAA AAACGGTTTA 3060 

GAATGTTTAT AAflAGCAATG TTCACTCCAC ATAACAGAAA AGATACCTGT TTCTGTAAAG 3120 

TTAGGTTGTC TGCCTAAAAT ATCGCAAACT TTTTCATATT CTTGATCaCT TAATCCCATA 3180 

TCTTGATATA CTTTTTCAAG TTTAATTTCT TCAACGCTTG GTTCGATAAA TTTAGACATG 3240 

TTCTTCCCTC CAACTTTTTA CCATCGCTTC AAATAATTTC ACACCACTAT CAGTACCTAA 3300 

CAACGTTTCT AAAGCTCTTT CagGATGtGG CATCATGCCA CATACATTGC CTTTTTCGTT 3360 

AACAATTCCT GCAATATCAT CATATGAACC GTTCGGATTA TTCACATATT TCAGAATAAT 3420

TTGATTGTTA GCTTTTAATT GTTGATATAT TTCATCAGTA CAATAATAAT GACCTTCACC 3480 

GTGAGCTACA GGATATATAA CTTTTTCACC TTGTTCATAA AGATTTGTAA ATGCCGTTTG 3540 

ATTATTCACT ATTTCTAACT CTTCATTTCT ACTAATAAAT AAATGTGAAT CGTTATGCAA 3600

TAATGCACCA GGTAATAAGC CTATTTCAGT TAAAATTTGA AACCCATTAC AAACACCTAA 3660 

TACTGGCTTA CCTTCAGCTG CAAGACGTTT AACTTCCGAA ATAATCGGsG CTACACTAGC 3720 

CATTGCCCCA GATCTTAAGT AATCCCCGAA TGAAAATCCA CCAGGAATAA GTACGCCATC 3780 

AAATGCACTT AGTGATGTTT CTCTATAATC TACATATTCC GCTTCAACAC CACTTTTAAT 3840 

AGCAGCATTA AACATGTCTC TATCACAATT CGAACCTGGA AAAACAAGAA CCX3CAAATTT 3900 

CATTTTATGC ATTCTCCTTT TCATCATCTA ACACTTTATA GCTATATTCT TCAATCACTG 3960 

TATTTGCAAA CAATTTTTCA CTTAGAGTTG TAATAATGTT GTGTACCTTT TCATCACTAA 4020 

CCTCATCCAC TGTCATATAT AATACTTTTC CTACACGAAT ATCATTCACT TGTGCATAAC 4080 

CTAAGTCATG TACAGCTCGA GTAAGCGTTT GTCCTTGCGT ATCTAATACT TGTGGTTGTA 4140

ATGTCATATG TAGTTCAATT GTTTTCATTA TTTTAAATCC TCCAATTTGT TTAAAAATAT 4200 

TTGATATGTT TCAATCAGTG ATCCAGTGTT ATTTCTATAT ACATCTTTAT CAAAGTTTGC 4260

ATTGGTAGCT TTATCCCAAA TTCGACATGT ATCTGGAGAT ATTTCATCCG CTAACAAAAT 4320 
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547 



AAGAATCTAT ACCCTTTAAA CTTTTTGTTT CAATCGGCAC TCCAATAACT GGTAGCGTCXS 6240

TTAATGATGC AACCATACCT GGTAAATGTG CCX3CACCGCC AGCGCCTGCA ATGATAATGT 6300

TTATACCTCT TTCTCTCGCT TCAGAAGCAA ATTGAACCAT CATTTTTGGC GTACGATGTG 6360

CGGATACTAC TTGTTTTTCG TACGGAATTT CAAAATAATC CAACATGTTA CAACTCTCTT 6420

GCATAATTTT CCAATCGGAA GAACTGCCCA TAATGACTGC TACTTTCACT TTGTACACCC 6480

TTTCAAAAGT TTGAATTGTG AATTACTTTA GTTGTATATT ATAGATATAG CATAACAAGC 6540

AATTTCTGCT TTTTCAATCA AAAATCGAAC TTTATTTTGA TnTTTATTT GAATTTACGT 6600

CTTTTGCTAT GTAAATTAGT TTTATAAACT AACAAAGTTA GGATATTGAC AATAGGAGGA 6660

GAAGTTTTTA TGGTTGCTAA AATTTTAGAT GGTAAACAAA TTGCCAAAGA CTACAGACAG 6720

GGGTTACAAG ATCAAGTTGA AGCGCTAAAA GAAAAGGGTT TTACACCTAA ATTATCCGTT 6780

ATATTAGTTG GTAATGATGG CX3CTAGTCAA AGTTATGTTA GATCAAAAAA GAAAGCAGCT 6840

GAAAAAATTG GTATGATTTc AGAAATCGTA CATTTGGAAG AAACAGCTAC TGAAGAAQAA 6900

GTATTAAACG AACTAAATAG ACTAAATAAT GATGATTCTG TAAGTGGTAT TTTGGTACAA 6960

GTACCATTAC CAAAACAAOT TAGOGAACAG AAAATATTAG AAGCAATCAA TCCTGAAAAA 7020

GATGTGGACG GTTTTCATCC AATAAATATA GGGAAATTAT ATATCGATGA ACAAACTTTT 7080

GTACCTTGCA CACCGCTCGG CATCATGGAA ATATTAAAAC ATGCTGATAT TGATTTAGAA 7140

QGTAAAAATG CAGTTGTAAT TGGACGAAGT CATATTGTCG GACAACCAGT TTCTAAGTTA 7200

CTACTTCAAA AAAATGCATC AGTAACAATC TTACATTCTC GTTCAAAAGA TATGGCATCA 7260

TATTTAAAAG ATGCTGATGT CATTGTCAGT GCAGTTGGTA AGCCTGGTTT AGTAACAAAA 7320

GATGTGGTCA AAGAAGGAGC AGTAATTATC GATGTTGGCA ATACX3CCAGA TGAAAATGGC 7380

AAATTAAAAG GTGACGTTGA TTATGATGCG GTTAAAGAAA TTGCTGGAQC TATTACACCA 7440

GTTCCTGGTG GCGTTGGTCC ATTAACAATT ACTATGGTAT TAAATAATAC TTTGCTTGCA 7500

GAAAAAATGC GTCGAGGTAT TGATTCGTAA AGAGCCTGAG ACATAAATCA ATGTTCTATG 7560

CTCTACAAAG TTATAATGGC AGTAGTTGAC TGAACGAAAA TTCGCTTGTA ACAAGCTTTT 7620

TTCAATTCTA GTCAACCTTG CCGGGGTGGG ACGACGAAAT AAATTTTACG AAAATATCAT 7680

TTCTGTCCCA CTCCCTAATA ACTGAGTTTT AATGAAGTCT TTTAACCCAC ATTAAATATT 7740

ATTTTGCAAT TGCAATGAAT AACAAGAAAA ATCTGGGACA TTAATCGATC AAATGCTCCC 7800

TTCAAAGTAG ACATTGAATA AATGAAGGCT TTGAAGGGAG CATTTCACTT TGTACTTGGC 7860

TCAACAATTT TATATAGACA QTAGTTAATT GAATGAAAAT AAGCTTGTAA CAAGTTTTCA 7920
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TO 



GTTGGGGATG GGCCCCAACA CAGAAGCTGT GACTATGATA AAGTACTACT ACATAGTTAA 8040

TCATTAGTGG TTCTTTATCA TTTTCGCCTC CCTTTTCTTA TTGTTTTGAT ACACAAAAAT 8100

TTAAGTTCAA ACTGTCGAAT AAAGTTATAT TTGATTTCAA ATTATCCCTA AATTATTAAT 8160

TkTACAATTG TGGCAGATTT TCAAAATAAT AATTATTTCC TCATTATTTA TAAATTTATA 8220

TTTAAATTTC ATTCTTTATA GGGTAAGATT AGGACTATAG TATGATGTGT ArATAATATA 8280

AATTAAGGTA TAGTAAAGCT AACTCAGAAA TGACTTATCA TTCGGAGGTT ACATTATGAA 8340

TAAACTATTA CAGTCATTAT CAGCCCTCGG TGTTTCT6CT ACACTAGTAA CACCAAATTT 8400

AAATGCAiSAT GCAACGACX3A ATACTACACC ACAAATTAAA GGCGCTAATG ATATCGTTAT 8460

TAAGAAAGGT CAAGATTATA ACCTTCTAAA CGGCATAAGT GCATTTGATA AAGAAGATGG 8520

AGATTTAACC GATAAAATTA AAGTCX3ATGG CCAAATTGAT ACATCTAAAT CTGGTAAATA 8580

TCAAATTAAA TATCATGTCA CTGATTCAGA TGGTGCAATT AAAATTTCCA CTAGGTATAT 8640

TGAGGTTAAA TA6CCCTCAT CACTATACTG CAAATAAAAT GGTAGCAAAC GAACATGTTT 8700

TGCTACCATT TTATTTGTTA TTCTAACTTC ATCTGCAACT TTAACCCAAA TATTGTATTT 8760

TT1TCTC3TATA CCAAAGGACT ACCTATCAAA TTATTAAAAC TTAACTGCTC TTTTTAAAAA 8820

AATGTTTTGA TTTTGAACAA ACAAATTTCC ACTTTTCATT GTTTAACGAT AAATTACTTT 8880

TGGCAAATTC CTTATTAAAA TGTTTGCGCT TCCTTTCAAT CAACTAGCCA TCATTTTCAA 8940

TTTATTA<3AC AATTTCAAAC TTTTTTTATT TTCATTCAAT TAACCTTTAA TTGAAAGCTA 9000

TTCTCAACTT TCCTTTTAAA TATGAAGCAA TTTTTTCAAA AACGCTATTA GTCACAAAAT 9060

GT 9062



(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2738 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86:



AAATATTTTT TCAAAACTAT GTGAAAATGG aCCATGTCtA aATCATGTAA TAATGCAGyA 60

CATAATGCCA ACGGTCTmTC TTTATTGTCC CATGCATCAT GACCAATAAA TGACTCATCA 120

ATTAATCGTC TAACTATTTC ATACACACCT AAAGAATGTC CAAAGOGACT ATGTTCTGCT 180

GTGTGAAAAG ATAGGTACAG TGTTCCTAGT TGTCTAATTC GACGTAACCT TTGGAATTCC 240
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TCTTTAAAAA CTTTTTCTTC TACTAATTTT AAATCTACAT ATGCGTTAGT CATTATTCCC 360 

CTCCTTTTCG TTTAATATAA TATTTAATTT ACTTAAAATG CTTTGTACAT AAGTGCTAAG 420 

TCTAACTTTT CGCCATACAT TTCTGGCTCA TAAGAGCGTA AGATTGTAAA ACCTTGCTCT 480 

TTATAGTAAG CTACTGCTTC TTCATTTTTA TTATCTACTT CTAAGTAAAC ACCTTCAAAT 540 

TTATCTTCAA AACGTGATAA TCCTTCATTT AACAATGCTG TACCATAACC TGTATGTTGC 600 

GATTCTGGTT TAACATAATG AGCTGATAAA TATAATTCTT CACCGTAAAT AAAGTTAGCA 660 

AAGCCAACX3A TGTCATTACC TTCTTCAACG ACTAAGAATA ATTGTTCTTG AAGTCTTTTC 720 

TTTAAATGAT GTTCATTATA TGAAGCTtCT AACAAGTGAT TAACTGTTGT CGCAGCGTAT 780 

ATATTTAAGT ATGTATTAAA CCAAGCTTTA GTTGCGACAT CTCTAATTTG AACAACATCT 840 

TTTTCAGTTG CTTGTCTTAC CTTGAACATG ACTTTCTCCC CTTATTAACA AGTTTTAATA 900 

ACGGCATTAT ACCACAACTT GCTCAATACT TAATAAACAA TGATTGTCTA TTCAATTTAT 960 

ATATtTATAT TTTCCGTTAA AATTAAAAAT AAAAAATAAC GAAGCAAAAA AtCACTTCGT 1020 

TTAGTATGAG GTATGTCTTA TTGCAATATA CTATTCCACT CAGTTGCACG TGCTAAGGCA 1080 

TAGTTGTCTT TCATGATGTC ACCAGGCTTT TCAGCAGTTC CAATAATATA ACCATTTAAA 1140

GTGGCACCTA rAAAGTCTAA KCTKIKTTTQ ATri'GCGTAA TTGCTGGTTC GCTmkTTT 1200 

TTGGACAATC TCCACCAACT AAAATAACTC TAAAATCCTT TTCGGCCATT TGTGCCTTAA 1260 

AATTAGGATA TCGTTTATCT TGTAATGTTT CTGACCAATG TTCGATAAAT GCTTTCAATG 1320

GTGCTGAAAT GCTATACCAA TACACTGGTG ATGCAAAAAT AATTGTATCA CTAGCCAATA 1380 

TTTTATCTAG AATCGGCAAA TAGTCATCGT CATATGAAGT AATAGTCTCT GCTGTATGTC 1440 

TCACGTCACG TATCGGTTTA AACTGATGTT GTGTCACGTC AATCCATTGA TACTCTAAAT 1500 

CTTCCAAAGC GAATTTTGTT AATTGTGCAG TATTACCGTT TGGTCTACTC CCACCAAACA 1560 

AAACAGTAAT CATTTTAGCC TAACCTCACT TTTGATTAAT AAATATCTGT GTTTTTOGTT 1620 

ACCTAATTAT ACTATCATAA GCTTTGCCTA CCGAATAGTA AAACGCTTAC AACTTTTATA 1680 

TAAATTTGAC GAAATTTCGT CATGCCTTAT ATAACGTCGT TTGTGATACG GGGCTAATTC 1740 

ATGATGAAAT TAGATACATA TATCACCATT AAATACAATT CATTTAGTCT TCAATCGGAA 1800 

ACAGTTCATC GATATATTGA ATCTCATCAT CTGATAAAAC GATATCTGCA GCTTTAATAT 1860 

TTTCAACGAC TTGTTCTGCA CGTTTTGCAC CAGGAATAAT CACATCGATA GCTGGTCTCG 1920 

TTAAATAAAA TGCTAATACA ATGTTCGCAA TTGAAGTTTG ATGTGCTGCA GCTATGCTTT 1980

CCAAAGCTTT TACGCGACGC ACATTTTCTT CAAATACACC TGGTTTAAAA TCACGACX5TG 2040 
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GCTAATGGGA 


AATATGQAAT 


AAATGTGATT 


TGGTGATCAA 


CACAATATTG 


TAATACTGCC 


2160 


TCATTTTOGC GATGCAATAA ATTATATTCT AACTGTACAA CATCAACGTA ACCATCTTTA 2220

TTTGCTTCTT TAAGTTGATC TAATGTGAAA TTTGATACAC CAATTGCTTT AATCTTCCCT 2280

TGTTCCTTAA GCTCTTGTAA TGCTGCAACT GCTTGATCTT TCGGAGTGTT GTTATCCGGA 2340

AAATGAATAT AATATAAATC GATATAATCA GTTTGTAGAC GTTTCAAACT ATTCTCAACT 2400

TGTTGTTTTA AATATTCCGG TTGATTGTTC TGATGTACTT CTTGATTTTC ATCAAATTCA 2460






TTTAATTTGC 


TCTCGCGGAT 


ATTCTTTAAC 


AACTTCTCCA 


ft 


ACCAATTCTT 


CTGATCGTTC 


TGGCCCATAA 


ATATATGCCG 


TATCTAATAA ATTAATACCA 


2580 


TGATTAATGG CTTGACGAAC AACATCTTTT CCTTGTTCTT CATCTAAGTT CGGATATAAA 2640

TTATGCCCAa CCTAtGCGTT CGTCCCAA6T GCGATTGGAA ACACTTCAAC ATCAGATTTA 2700

CCTAAGTTTA CAAATTGCTn CATTAGACCC AGCnCCTT 2738



(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9425 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87:

GATTAGATGA TATTTAACGA AAATTAaGrT GmAATACTtG AATGTArGAa GTCTGATGTC 60

GAAAATAGCT ATTAAAATAG AGTAGACGTA ATGtAAATGA AAGCACCTAA AATAGAAAAA 120

TTTCAAAAAT AGCGTAATTA TTATAATAAA TAGACTGCCA ATAAAATGCA ATTTTTCACT 180

TATAACATTC TTCAAAAAAT AATAGCAAAA TTATGTAAAA AATATCTTGT CATGGCAAGA 240

TTGGCTGTGC TATAATCTAT CTTGTGCTTA AGAACGGCTC CTTGGTCAAG CGGTTAAGAC 300

ACCGCCCTTT CACGGCGGTA ACACGGGTTC GAGTCCCGTA GGAGTCACCA TTTTTTAGGT 360

CTCGTAGTGT AGCGGTTAAC ACGCCTGCCT GTCACGCAGG AGATCGCGGG TTCGATTCCC 420

GTCGAGACCG TACAAATGCC TATCCAAGAG GATAGGCATT TTTTTGCGTT TAATATTATA 480

TTAATAAAAG ATATATGGAC GAATGATAAT CATATTGATT TATCTGTTCG TCCATTTTCT 540

TTAAAATGTA TGAACCTCAA GTAACTTAGT GGTTGGATAT GAAA6ATAAA CGTAGACAAT 600

AAAATCTTTA TTAGACGTAC AAACATATGC TACTGTCAAC ATATTTCTTC GTTGTGATAT 660

GCCACCAGTC CTCCATAACA TCAATTGTTA AAGTAACGAA TAACGAATAA TGATATTTAT 720
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GACCTCATCA TTGTGTTAAA TATCATTGTC ACAATCCGCC GTGAGAAACT AATAAAAAAT 840 

AGTAATATAT AAGTTTATAT TGGAAAATAG AATTAATAGC TTATAAATGG TAAATTATAT 900 

AATAGGTTAC TATACGTTAT AAGACGGAAA ATGCGCACAA TAACAAAAAT AGTAAGCGAC 960

ATCCTGTGAT TTTTTACACA AACATAAACG ATAAAGAACA AAAAATGATA AAATAATATT 1020 

AATGATTTAA GAAAAGAGGT TTATGCAAAT GGCTAGAAAA GTTGTTGTAG TTGATGATGA 1080 

AAAACCGATT GCTGATATTT TAGAATTTAA CTTAAAAAAA GAAGGATACG ATGTGTACTG 1140 

TGCATACGAT GGTAATGATG CAGTCGACTT AATTTATGAA GAAGAACCAG ACATCGTATT 1200 

ACTAGATATC ATGTTACCTG GTCGTGATGG TATGGAAGTA TGTCGT6AAG TGCGCAAAAA 1260 

ATACGAAATG CCAATAATAA TGCTTACTGC TAAAGATTCA GAAATTGATA AAGTGCTTGG 1320 

TTTAGAACTA GGTGCAGATG ACTATGTAAC GAAACCGTTT AGTACGCGTG AATTAATCGC 1380 

ACGTGTGAAA GCGAACTTAC GTCGTCATTA CTCACAACCA GCACAAGACA CTGGAAATGT 1440 

AACGAATGAA ATCACAATTA AAGATATTGT GATTTATCCA GACGCATATT CTATTAAAAA 1500 

ACGTGGCJGAA GATATTGAAT TAACACATCG TGAATTTGAA TTGTTCCATT ATTTATCAAA 1560 

ACATATGGGA CAAGTAATGA CACGTGAACA TTTATTACAA ACAGTATGGG GCTATGATTA 1620

CTTTGGCX5AT GTACGTACGG TCGATGTAAC GATTCGTCGT TTACGTGAAA AGATTGAAGA 1680 

TGATCCGTCA CATCCTGAAT ATATTGTGAC GCGTAGAGGC GTTGGATATT TCCTCCAACA 1740 

ACATGAGTAG AGGTCGAAAC GAATGAAGTG GCTAAAACAA CTACAATCCC TTCATACTAA 1800

ATTTGTAATT GTTTATGTAT TACTGATTAT CATTGGTATG CAAATTATCG GGTTATATTT 1860 

TACAAATAAC CTTGAAAAAG AGCTGCTTGA TAATTTTAAG AAGAATATTA CGCAGTACGC 1920 

GAAACAATTA GAAATTAGTA TTGAAAAAGT ATATGACGAA AAGGGCTCCG TAAATGCACA 1980 

AAAAGATATT CAAAATTTAT TAAGTGAGTA TGCCAACCGT CAAGAAATTG GAGAAATTCG 2040 

TTTTATAGAT AAAGACCAAA TTATTATTGC GACGACGAAG CAGTCTAACC GTAGTCTAAT 2100 

CAATCAAAAA GCGAATGATA GTTCTGTCCA AAAAGCACTA TCACTAGGAC AATCAAACGA 2160 

TCATTTAATT TTAAAAGATT ATGGCGGTGG TAAGGACCGT GTCTGGGTAT ATAATATCCC 2220 

AGTTAAAGTC GATAAAAAGG TAATTGGTAA TATTTATATC GAATCAAAAA TTAATGACGT 2280 

TTATAACCAA TTAAATAATA TAAATCAAAT ATTCATTGTT GGTACAGCTA TTTCATTATT 2340 

AATgCACAGT CATCCTAGGA TTCTTTATAG CGCGAACGAT TACCAAACCA ATCACCGATA 2400

TGCGTAACCA GACGGTCGAA ATGTCCaGAG GTAACTATAC GCAACGTGTG AAGATTTATG 2460

GTAATGATGA AATTGGCGAA TTAGCTTTAG CATTTAATAA CTTGTCTAAA CGTGTACAAG 2520 
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GTGATGCTAT TATTGCAACA GACCGCCGTG GACGTATTCG TATCGTCAAT GATATGGCAC 264 0 

TCAAGATGCT TGGTATGGCG AAAGAAGACA TCATCGGATA TTACATGTTA AGTGTATTAA 2700 

GTCTTGAAGA TGAATTTAAA CTGGAAGAAA TTCAAGAGAA TAATGATAGT TTCTTATTAG 2760 

ATTTAAATGA AGAAGAAGGT CTAATCGCAC GTGTTAACTT TAGTACGATT GTGGAGGAAA 2820 

CAGGATTTGT AACTGGTTAT ATCGCTGTGT TACATGACGT AACTGAACAA CAACAAGTTG 2880 

AACGTGAGCX3 TCGTGAATTT GTTGCCAATG TATCACATGA GTTACGTACA CCTTTAACTT 2940 

CTATGAATAG TTACATTGAA GCACTTGAAG AAGGTGCATG GAAAGATGAG GAACTTGCGC 3000 

CACAATTTTT ATCTGTTACC CGTGAAGAAA CAGAACGAAT GATTCGACTG GTCAATGACT 3060 

TGCTACAGTT ATCTAAAATG GATAAT6AGT CTGATCAAAT CAACAAAGAA ATTATCGACT 3120 

TTAACATGTT CATTAATAAA ATTATTAATC GACATGAAAT GTCTGCGAAA GATACAACAT 3180 

TTATTCGAGA TATTCCGAAA AAGACGATTT TCACAGAATT TGATCCTGAT AAAAT6ACGC 3240

AAGTATTTGA TAATGTCATT ACAAATGOGA TGAAATATTC TAGAGGOGAT AAACGTGTCG 3300 

AGTTCCACGT GAAACAAAAT CCACTTTATA ATCGAATGAC GATTCGTATT AAAGATAATG 3360 

GCATTGGTAT TCCTATCAAT AAAGTCGATA AGATATTCGA CCGATTCTAT CGTGTAGATA 3420

AGGCACGTAC GCGTAAAATG GGTGGTACTG GATTAGGACT AGCCATTTCG AAAGAGATTG 3480 

TGGAAGCGCA CAATGGTCGT ATTTGGGCAA ACAGTGTAGA AGGTCAAGGT ACATCTATCT 3540 

TTATCACACT TCCATGTGAA GTCATTGAAG ACGGTGATTG GGATGAATAA TAAGGAGCAT 3600 

ATTAAATCTG TCATTTTAGC ACTACTCGTC TTGATGAGTG TCGTATTGAC ATATATGGTA 3660 

TGGAACTTTT CTCCTGATAT TGCAAATGTC GACAATACAG ATAGTAAGAA GAGTGAAACG 3720 

rAACCTITAA CGACACCTAT GACAGCCAAA ATGGATACAA CTATTACGCC ATTTCAGATT 3780 

ATTciTTCGA AAAATGATCA TCCAGAAGGA ACGATTGCGA CGGTATCTAA TGTGAATAAA 3840

CTGACGAAAC CTTTGAAAAA TAAAGAAGTG AAGTCCGTGG AACATGTTCG TCGTGATCAT 3900

AACTTGATGA TTCCTGATTT GAACAGTGAT TTTATATTAT TCX3ATTTTAC GTATGATTTA 3960

CCGTTATCAA CATATCTT6G TCAAGTACTG AACATGAATG CGAAAGTACC AAATCATTTC 4020

AATTTCAATC GTTTGGTCAT AGATCATGAT GCTGATGATA ATATCGTGCT TTATGCTATA 4080

AGCAAAGATC GCCACGATTA CGTAAAATTA ACAACTACAA CGAAAAATGA TCATTTTTTA 4140 

GATGCATTAG CAGCAGTGAA AAAAGATATG CAACCATACA CAGATATCAT CACAAACAAA 4200 

GATACAATTG ATCGTACGAC GCATGTTTTT GCACCAAGTA AACCTGAAAA GTTAAAAACA 4260

TATCGCATGG TATTTAACAC GATTAGTGTT GAGAAAATGA ATGCTATACT ATTTGACGAT 4320 
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GCAAACTATA ACGATAAAAA TGAAAAATAT CATTATAAAA ACCTGTCC6A AGAT6AAGCG 


4440 




AGTTCCAGCA AAATGGAAGA AACGATTCCA GGAACCTTTG ATTTTATTAA TGGTCATGGT 4500

GGTTTCTTAA ACGAAGACTT TAGATTGTTT AGTACGAATA ATCAGTCAGG CGAGTTAACA 4560

TATCaACGTT TCCtTAATGG TTATCCAACG TTTAATAAAG AAGGTTCTAA TCAAATTCAA 4620

GTCACTTGGG GTGAAAAAGG CGTCTTTGAC TATCGTCGTT CGTTATTACG CACCGACGTT 4680

GTTTTAAATA GTGAGGATAA TAAATCGTTG CCGAAATTAG AGTCTGTACG TTCAAGCTTA 4740

GCX^CAATA GTGATATTAA TTTTGAAAAA GTAACAAACA TCGCTATCGG TTACGAAATG 4800

CAGGATAATT CAGATCATAA TCACATTGAA GTGCAGATTA ACAGTGAACT CGTACCGCGT 4860

TGGTATGTAG AATATGATGG CGAATGGTAT GTTTATAACG ATGGGaGGCT TGaATAAATG 4920

AACTGGaAAC TGACAAAGAC ACTTTTCATT TTCGTGTTTA TTCTTGTCAA CATOGTGTTA 4980

GTATOGATTT ATGTTAATAA AGTCAATCGC TCACACATTA ATGAAGTCGA GAGTAACAAT 5040

GAAGTTAATT TTCAGCAAGA AGAAATTAAA GTACCGACTA QTATATTGAA TAAATCAGTT 5100

AAAGGTATAA AATTAGAGCA AATTACAGGG CXSATCAAAAG ACTTTAGTTC TAAAGCTAAA 5160

GGCGATTCGG ATTTGACCAC ATCAGATGGT GGAAAATTAT TGAATGCGAA CATTAGTCAA 5220

TCX3GTAAAGG TCAGTGACAA TAACTTAAAA GATTTGAAAG ATTATGTTAA CAAGCGCGTA 5280

TTTAAAGGTG CTGAATATCA ATTAAGCX5AG ATTAGTTCAG ATTCTGTAAA ATATGAACAA 5340

ACGTATGATG ATTTTCCGAT TTTAAATAAC AGTAAAGCGA TGTTAAACTT TAATATAGAA 5400

GATAACAAAG CGACTAGTTA TAAACAATCA ATGATGGATG ACATTAAGCC CACAGATGGT 5460

GCAGATAAGA AGCATCAAGT GATTGGTGTG AGAAAAGCAA TCGAGGCATT ATATTATAAT 5520

CGTTACTTGA AAAAAGGTGA TGAAGTCATT AATGCTAGAC TCGGTTACTA CTCAGTCGTG 5580

AATGAAACGA ATGTTCAATT GTTACAACCA AACTGGGAAA TTAAAGTGAA GCATGACGGT 5640

AAGGATAAAA CGAATACTTA CTATGTCGAA GCGACAAATA ATAACCCTAA AATTATTAAT 5700

CATTAATATG AATCGTAATA AGCTAGCATT GCAAGCTCAT CATATGTGAG AAGCXK5TGCT 5760

AGCTTTTTTG CTGGTACGGT TTATTATGGC TGATGTTTTT GCGTCTCCAA CGTGCGCATT 5820

TATTCATATT TTAAGTAGAA CCGCATTGTA AAATTAGTGT AACTGTTATT TTAAAAACTT 5880

TAGTATTTGT CTAATCATTG TTATAATAAT TAAGAAATTC ATTGCACGTG ATTATCAAAA 5940

TTTAAATATA AGAAACCGGT CX5ATGAACTA AAGTTACATA ATAGGAAAGG TATACAAAAC 6000

AGCTAATATA CTGATAGTTT CTGTAGGGAA AATCGTATAT TTGCACTQAT GTATATTGCA 6060

GTCATATAGA GAGATTGACT GTTTAAAGAG AAAGGATGAG CCGCTTGATA CGCATGAGTG 6120
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TAGTTGATGT TGGTTrGACT GGAAAGAAAA TGGAAGAATT GTTTAGTCAA ATTGACCGTA 6240 

ATATTCAAGA TTTAAATGGT ATTTTAGTAA CCCATGAACA TATTGATCAT ATTAAAGGAT 6300 

TAGGTGTTTT GGCGCGTAAA TATCAATTGC CAATTTATGC GAATGAAAAA ACTTGGCAGG 6360

CAATTGAAAA GAAAGATAGT CGCATCCCTA TGGATCAGAA ATTCATTTTT AATCCTTATG 6420 

AAACAAAATC TATTGCAGGT TTCGATGTTG AATCGTTTAA CGTGTCACAT GATGCAATAG 6480 

ATCCGCAATT TTATATTTTC CATAATAACT ATAAGAAGTT TACGATTTTA ACGGATACGG 6540 

GTTACGTGTC TGATCGTATG AAAGGTATGA TACGTGGCAG CGATGCGTTT ATTTTTGAGA 6600 

GTAATCATGA CGTCGATATG TTGAGAATGT GTCGTTATCC ATGGAAGACG AAACAACGTA 6660 

TTTTAGGCGA TATGGGTCAT GTATCTAATG AG6ATGCGGC TCATGCAATG ACAGACGTGA 6720 

TTACAGGTAA CACGAAACGT ATTTACCTAT CGCATTTATC ACAAGACAAT AACATGAAAG 6780 

AnTGGCGCG TATGAGTGTT GGCCAAGTAT TGAACGAACA CGATATTGAT ACX^SAAAAAG 6840 

AAGTATTGCT ATGTGATACX5 QATAAAGCTA TTCCAACGCC AATATATACA ATATAAATGA 6900 

GAGTCATCCX5 ATAAAGTTCC GCATTGCTGT GAGACGACTT TATCGGGTGC TTTTTTATGT 6960 

TGTTGGTGGG AAATGGCTGT TGTTGAGTTG AATCGGCTTG ATTGAAATGT GTAAAATAAT 7020

TCGATATTAA ATGTAATTTA TAAATAATTT ACATAAAATC AATCATTTTA ATATAAGGAT 7080 

TATGATAATA TATTGGTGTA TGACAGTTAA TGGAGGGAAC GAAATGAAAG CTTTATTACT 7140 

TAAAACAAGT GTATGGCTCG TTTTGCTTTT TAGTGTAATG GGATTATGGC AAGTCTCGAA 7200

CGCGGCTGAG CAGCATACAC CAATGAAAGC ACATGCAGTA ACAACGATAG ACAAAGCAAC 7260 

AACAGATAAG CAACAAGTAC CGCCAACAAA GGAAGCGGCT CATCATTCTG GCAAAGAAGC 7320 

GGCAACCAAC GTATCAGCAT CAGCGCAGGG AACAGCTGAT GATACAAACA GCAAAGTAAC 7380

ATCGAACGCA CCATCTAACA AACCATCTAC AGTAGTTTCA ACAAAAGTAA ACGAAACACG 7440 

CGACGTAGAT ACACAACAAG CCTCAACACA AAAACCAACT CACACAGCAA CGTTCAAATT 7500 

ATCAAATGCT AAAACAGCAT CACTTTCACC ACGAATGTTT GCTGCTAATG CACCACAAAC 7560 

AACAACACAT AAAATATTAC ATACAAATGA TATCCATGGC CGACTAGCCG AAGAAAAAGG 7620 

GCGTGTCATC GGTATGGCTA AATTAAAAAC AGTAAAAGAA CAAGAAAAGC CTGATTTAAT 7680 

GTTAGACGCA GGAGACGCCT TCCAAGGTTT ACCACTTTCA AACCAGTCTA AAGGTGAAGA 7740

AATGGCTAAA GCAATGAATG CAGTAGGTTA TGATGCTATG GCAGTCGGTA ACCATGAATT 7800

TGACTTTGGA TACGATCAGT TGAAAAAGTT AGAGGGTATG TTAGACTTCC CGATGCTAAG 7860

TAcTAACGTT TATAAAGATG GAAAACGCGC GTTTAAGCCT TCAACGATTG TAACAAAAAA 7920 
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TGAAGGCATT AAAGGC6TTQ AATTTAGAGA TCCATTACAA AGTGTGACAG CGGAAATGAT 8040 

GCGTATTTAT AAAGACGTAG ATACATTTGT TGTTATATCA CATTTAGGAA TTGATCCTTC 8100 

AACACAAGAA ACATGGCGTG GTGATTACTT AGTGAAACAA TTAAGTCAAA ATCCACAATT 8160

GAAGAAACGT ATTACAGTTA TTGATGGTCA TTCACATACA GTACTTCAAA ATGGTCAAAT 8220 

TTATAACAAT GATGCATTGG CACAAACAGG TACAGCACTT GCGAATATCG GTAAGATTAC 8280 

ATTTAATTAT CGCAATGGAG AGGTATCGAA TATTAAACCG TCATTGATTA ATGTTAAAGA 8340 

CGTTGAAAAT GTAACACCGA ACAAAGCATT AGCTGAACAA ATTAATCAAG CTGATCAAAC 8400 

ATTTAfiAGCA CAAACTGCAG AGGTAATTAT TCCAAACAAT ACCATTGATT TCAAAGGAGA 8460 

AAGAGATGAC GTTAGAACGC GTGAAACAAA TTTAGGAAAC GCGATTGCAG ATGCTATGGA 8520 

AGCGTATGGC GTTAAGAATT TCTCTAAAAA GACTGACTTT GCCGTGACAA ATGGTGGAGG 8580 

TATTCGTGCC TCTATCGCAA AAGGTAAGGT GACACGCTAT GATTTAATCT CAGTATTACC 8640

ATTTGGAAAT ACGATTGCGC AAATTGATGT AAAAGGTTCA GACGTCTGGA CX3GCTTTCGA 8700 

ACATAGTTTA GGCGCACCAA CAACACAAAA GGACGGTAAG ACAGTGTTAA CAGCGAATGG 8760 

CGGTTTACTA CATATCTCTG ATTCAATCCG TGTTTACTAT GATATAAATA AACCGTCTGG 8820

CAAACGAATT AATGCTATTC AAATTTTAAA TAAAGAGACA GGTAAGTTTG AAAATATTGA 8880 

TTTAAAACGT GTATATCACG TAACGATGAA TGACTTCACA GCATCAGGTG GCGACGGATA 8940 

TAGTATGTTC GGTGGTCCTA GAGAAGAAGG TATTTCATTA GATCAAGTAC TAGCAAGTTA 9000 

TTTAAAAACA GCTAACTTAG CTAAGTATGA TACGACAGAA CCACAACX3TA TGTTATTAGG 9060 

TAAACCAGCA GTAAGTGAAC AACCAGCTAA AGGACAACAA GGTAGCAAAG GTAGTAAGTC 9120 

TGGTAAAGAT ACACAACCAA TTGGTGACGA CAAAGTGATG GATCCAGCGA AAAAACCAGC 9180 

TCC^^TAAA GTTGTATTGT TgtAGCGCAT AGAGGAACTG TTAGTAGCGG TACAGAAGGT 9240 

TCTGGTCGCA CAATAGAAGG AGCTACTGTA TCAAGCAAGA GTGGGAAACA ATTGGCTAGA 9300

ATGTCAGTGC CTAAAGGTAG CGCGCATGAG AAACAGTTAT TTCATAATCA ACAGTCATTG 9360

ACGTAGCTAA GTAATGATAA ATAATCATAA ATAAAATTAC AGATATTGAC AAAAAATAGT 9420

AAATA 9425

(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3886 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 

AGTTGTAATG TCACATTTCC AGAGTCTGAA ATTATCTTTA TCACGTTACA TTTACTAGGC 60 

TCTAAAATGA CTGAACATAC AGCATCTTCA ATTACCTTTG AATACCATGA TTTATCGCAA 120

AATATACATG AATTGATCAC TTGTGTTAGC CAAGAATTAG GCATTGATAT GTCAAAAGAC 180

AACAAGTTAC ATACCAGTCT GATCACACAT ATCAAACCAG CTATACATCG TATTAAATAC 240

GATATGCTAC AACCTAATCC TTTGAGGCAA GAAGTTATGC GTCGCTATCC TCAAATCATT 300 

GAAGCCGTTA GCAAGCATAT TAGTCCAATT GAACAAGATG CTGCTATTCG CTTCAACGAA 360 

GATX3AATTAA CATACATTAC AATTCACTTC GCATCAAGTA TAGAGCGTGT TGCAACACAT 420 

AAACAATCAA TGATTAAGGT TGTCTTACTA TGTGGTTCTG GTATAGGCAC GTCACAACTT 480 

TTAAAATCAA AACTAAATCA CCTGTATCCT GaGTTnCACA TTTGGGAtGc CTATTcCATT 540 

TaTcAATTGG aAGaAAGTCG ATTATTGCAA GATAACATTG ATTATGTCAT TTCAACAGTA 600 

CCTTGTGAAA TATCAGCTGT ACCAGTTATT CATGTCGATC CATTTATCAA TCAACAATCT 660 

CGTCAAAAAT TGAATCAAAT TATCAATGAC TCAAGAGAAC AACGAGTCAT GAAAATGGCA 720 

ACTGATGGCA AGTCACTCGC AGATTTATTG CCTGAACATC GCATCATTAT AAATAAACAA 780

CCATTATCAA TTGAATCCGC AATTGCAGTG GCTGTGCAAC CTTTAATCAA TGATGGCATT 840 

GTCTATTCAA ATTATACAGC TGCAATTTTA AAACAATTTG AACAATTCGG GTCATATATG 900 

GTCATTAGTC CACATATTGC ACTTATTCAC GCTGGTACTG ATTATGTACA GAATGGTGTA 960

GGTTTCGCAC TAACATATTT CACTGAAGGG ATTATCTTTG GTAGTAAAGC TAACGATCCC 1020 

GTTCACCTTG TAATTACATT AGCAACGGAC CACCCCAATG CACATTTAAA GGCATTGGGA 1080 

CAGTTAAGCG AATGCTTAAS CAACGACTTA TATCGACAAG ATTTCTTAGA TGGGAATATT 1140

TTTfl;^AATTA AACAACACAT TGCTTTAACT ATGACAAAGG AGGCTTAATA ACGTGTCATT 1200 

AGACATTTTG TCAACAACAC GCATCATTGT AAAAGAACAA GTAAATGATT GGACTGAAGC 1260 

TATAACTATA GCTTCTCAGC CATTACTACA AGAACAAATT ATTGAACAAG GCTATGTTCA 1320 

AGCAATGATT GATAGCGTTA ATGAACTTGG ACCTTATATC GTTATCGCAC CTGAAATTGC 1380 

AATTGCACAT GCAAGACCGA ACAATGACGT ACATCAAGTT GGTTTAAGTC TATTAAAGTT 1440 

GAATCAACAT GTGGCATTTT GTGATGAAGA TCACTACGCA TCTCTCATTT TTGTATTGAG 1500 

TGCCATCGAC AATCATTCAC ACTTATCTGT ATTACAAAAT TTAGCAACCG TACTGGGCGA 1560 

TAACCAAACA GTCCAGCAAC TATTAACTGC AACAAATGCA CAAGACATTA AAAACATTTT 1620

AAAGGAGCAT GATTAATATG AAAATTTTAG TAG7ATGTGG CCACGGTTTA GGAAGTAGTT 1680 
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10 



AAGTTGAACA TAGTGACATT ATGACAGCAA GTCCAGAGAT GGCTGACTTG TTTATTTGTG 1800 

GTAGAGATTT AGCTGAAAAT GCCGAACGTC TAGGGGATGT CTTAOTTCTT GATAATATTT 1860 

TAGATAAAGC TGAATTACAA CAAAAGCTCT CAGAAAAATT ACAACAACTT AACATGATTT 1920

AAAGGAGGTA CGACCTATGC AAGCAATCCT TAATTTTATA GTCGATATTT TAAGTCAACC 1980 

AGCCATTCTT GTTGCACTGA TTGCCTTTAT AGGTTTAATC GTTCAGAAAA AACCTGCCGC 2040 

AACGATCACT TCAGGAACCA TTAAAACGAT ATTAGGCTTC TTAATTTTAA GTGCAGGTGC 2100 

TGATGTCGTC GTTCGATCTC TTGAACCATT CGGCAAAATA TTCCAACACG CATTTGGTGT 2160 

GCAAGGTATC GTACCTAACA ACGAAGCTAT CGTCTCACTA GCCTTAAAAO ATTTTGQAAC 2220 

AACAGCTGCA CTCATCATGG TCTGTGGCAT GATTGTTAAT ATTTTAATTG CCCGCTTCAC 2280 

TAATTTAAAA TATATCTTTT TAACAGGTCA TCATACATTT TACATGGCTG CGTTTTTAGC 2340 

AATCATTTTA ACAGTCAGTC ATATTAAAGG CTGGCTAACG ATTGTTATCG GCGCACTCGT 2400

ATTAGGATTA ATCATGGCAG TATTACCTGC ATTACTCCAA CCTACGATGC GAAAAATTAC 2460 

AGGGAATGAC CAAGTAGCTT TAGGTCATTT TGGCTCAATC AGTTACTTTQ CCOCAGTGCT 2520 

GTAGGTCAAT TATTCAAAGG TAAGTCTAAA TCAACGGAAG AGATTAAATT TCCAAAAGGC 2580

TTAAGTTTCT TACGAGAAAG TACAATTAGT ATCTCGATTA CGATGGCATT ACTTTACTTC 2640

ATCGCATCCT TATTTGCGGG CGTTAGTTAT GTACACGAAT CTATTAGTGA TGGTCAAAAC 2700

TTTATTGTCT TTTCATTAAT TCAAGGTGTG ACATTTGCTG CTGGTGTATT TATTATTTTA 2760 

ACGGGCGTTC GTTTAATCTT AGCTGAAATC GTCCCAGCAT TTAAAGGAAT TTCTGAAAAG 2820 

CTTGTACCAA ATTCTAAACC TGCATTAGAC TGCCCTATTG TGTTCCCTTA TGCACAAAAT 2880 

GCAGTATTAA TTGGATTCTT TGTCAGCTTT ATTACAGGTG TCATCGGTAT GTTTATCTTA 2940 

TTCTTATTTG GTGGCGTCGT CATTTTACCT GGCGTAGTTG CACACTTCTT CTTAGGTGCA 3000 

ACGGCTGCTG TATTCGGTAA TGCAAGAGGC GGTATTAAAG GTGCTATTGc TGGCGCCGCT 3060 

CTAAATGGTA TCCTAATCAC GTTTTTACCA TTATTATTCT TGCCATTTTT AGGCGAATTA 3120 

GGTGGTGCTG CAACAACATT CTCAGATACA GACTTTTTAG CTGTCGGTAT CGTGTTCGGT 3180 

AACXK:AGTAA AATATATGGG ATTATTTGGT GCQATTCTAT TTATTATTAT CGTAGGTGCG 3240

ACAACAATTT TATTAAAAGG CCGTCAAAAA GAACAGCAAT AGTGTTAACG TAGAAATATA 3300 

AAACACCGTC ACATATTGAG TGAATGCCCC TTTtATCAAG AGGAAAGCCA CTTACTTATG 3360 

GACGGTGTTT TGTATTATAT TAAATGATAC TTAGCCATAC TATCGACAGC TGCTAAAATT 3420

GCTTCTTCTT GTGTCGCAAT CGGTTCCCAA CCAAGTAATG TTTTTgCACG TTCGTTACTT 3480 
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CCTAGACTCA AAATAAAGTC TGGTAATTTT TTAGTAGAAA CTTTTTGAGC TATTTCAGGT 3600 

CTCTTTTCTT TAATTAATTT TGCAATTTCC AACAAATTAA TTTGTCCATC AGCCGTCGCA 3660 

ATAAATCGCT TGCCATTAGC TTGTTCATTT GTCATTGCCA AAATGTGCAG TTCAGCTACG 3720

TCTCTCACAT CAACAACATT TAACGGAATT TGCGGTACAC GTTTCATTGA ACCATTCAAT 3780 

AAATTTTCTA ATAAATGAAA GCTTCCTGAA ACGTGTGCAT CTAATGATGG CCCAAAAATT 3840 

GCAACTGGAT TGATTGTGGC AAATTCTACT GTTGTATTTT CATTCT 3886 

(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4879 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89:

GTCATCTATC AAAAATTTGG TATACAGACC GACAATTATT AATTAATAAT TTAATTTCCC 60 

AGGCAATACC AGTGATTAAA TATCCACAAA TACAACATAA AGAACAACCA TTAGAATCTA 120

TTTCACAACT TATATTGTCT AAGATGACAT CTAATCAATA GT6TTTAAAT TTCTCAGTGG 180 

CTGTGAATGA GGTTTAAAAG TACTATAAAA CGTAAACTTT GATACTTTAA AATACGCAAA 240 

AAACGGTAAA CCCTAATTCA TATTATAGAG TTTACCGTTT TATTTTTTAA CTTGCATCAT 300 

AGTTATATTA ACATTATTGT TGGTAGTTTG GATCAGTAAC CATTGCTTGT CCAGTATAAT 360 

CAACCGTTAC AATTGAATAT TTTCCaTTTG CATTTGGGTC TTTAAAACTA AACACATACT 420 

TATAGTTGCC ATTATGTTCT TCAATAGAAT AATCATTATA CACTTTATTA TTACTACCAA 480 

ATTTATTTGC TTCATTATTA GCCGCATTTA AAGCTGTTTQ GAAATTTGGC AATTGCTGTA 540 

AAGCfTTGATT TTTATTTCCA TTAAACGGAT AAATTTGACG TGCAACCGGC OCGGCATTTT 600 

GnCCATAATA TGGTGCAACG TAACTTGATT TTTGATTATT ATTCQCTTGG TTATTACTTG 660 

ATTGGTTATT ATTTGTTTGG TTTTGGTCAT TGTTTGTTGC ATTTGAATTA GATTGTTGCT 720 

GGTTATCGTT TGCACTATTA TCTTTATTAT CTTTGTTTAC GTCTTTACTA TCATCTTTAT 780

TATCTTTCTT ATCTTTAGAT GAATCATTTG TTTTTTTATC TTGTTGTTCA GTTTTCGCTT 840

TATCATCTTT TTCTTTATTA CCGTCTTTTT GTTGGTCACT ATCTTGACCA CATGCAGCTA 900

AAAATAATGA TAATGCTAGT AACCCTGTAA CTAATCTTTT CATACATATC TCCTCCTATA 960

ATtCGATATT CATTGAATAA TCTTGAAATA CATATCTACC ATGTGTATCT TTTCATGGCT 1020 
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TAAGGTTCTT TTTATTATAC CCTAATTTTT GTTCATTATT ATTTAATTTT TGTGAATTTT 1140 

ATGtTTkCTA TAAATTTAAT TATTTTACTT TAACAATTCA TTACGCATTT AGCATTTCAA 1200 

GGTATACACA ATATTTATTA CTATGATTTC ATTTTATCTG CTGCAAAAAC AATCATTATA 1260

ACTCTTTTTC CATAATTAAA TCTGTATCCG TTACATCACC TGTTTGAAAA TGATGTTCAC 1320 

CAACCACTTT AAATCCATGA CGTTTATAAA ATGCTTGAGC ACGAGGATTA TGCTCCCAAA 1380 

CTCCTAGCCA AATTTTATGT TTATTATGTT CTTGAGCAAT TTTTTCGGCC AATTCTATCA 1440 

ATTGTGAACC TCTTCCGCCA CCTTGAAAGT CTTTCAAAAA ATATATGCGC TGCACTTCTA 1500 

AATAGGTCTC CCCCATTTCT TCAGTTTGAG CACTATTAAT ATTCATCTTT ATATAACCAA 1560 

CATTCGCACC ATCTTCTTGa TAAAAATAAT QAAATQAATC TACATGGTTA ATCTCTTGTG 1620 

TAAATTTCTC TACA6TATAA TTGTCTTTAA AAAATTGATC AAAATCTTTG TCATCATAGT 1680 

AAGAACCAAA CGTGTCATAA AATGTTCTAG TTGCTAATTC AACTAATTCA CTAGCATTTT 1740

GTTCTGAAAT TTCTTTGATT ATCCCAGCCA TATAAATCCT CCAATAAACA GTGATCGAAT 1800 

CAAAATATTA CTTATGTTAT TTTTCAGCCA AAACTATTTA AAAATACATT AACACAAATC 1860 

AATTACAAAT TGTATTGATT GTGTGTAACA TCAATAAATG ATACATTTAT TCCAGTAAAA 1920

TG6CC6TATT TTCAAAAGAG AAAAAGAGAG GATGTATCGT TGTGATAGAA ACATTTAAAG 1980 

CGTTTGTAAT TGATAAAGAT GAGAGTGGTA AAGTGACACC AACTTTCAAA CAATTATCGC 2040 

CTACTGATTT ACCTAAAGGA GATGTGCTGA TTAAAGTACA TTACTCTGGT ATAAATTATA 2100 

AAGATGCTTT AGCGACTCAA GATCATAATG CAGTCGTAAA ATCGTATCCT ATGATTCCAG 2160 

GAATAGATTT AGCTGGAACA ATTGTTGAAT CCX3AAGCACC AGGCTTTGAa AAAGGAGAAC 2220 

AAGTAATTGT AACGAGTTAT GACCTAGGT6 TCAGCCATTA TGGCX^GTTTT AGTQAATATG 2280 

CGCGTGTAAA ATCAGAATGG ATTATCAAGC TTCCTGATAC TTTAACATTA GAAGAATCAA 2340 

TGATATATGG CACAGCTGGT TATACTGCCG GTTTAGCAAT TGAAAGACTT GAAAAAGTTG 2400

GAATGAATAT TGAAGATGGT CCTGTACTCG TTCGCGGTGC TTCAGGTGGT GTCGGTACTT 2460 

TAGCAGTACT CATGCTTAAT GAACTTGGTT ATAAAGTTAT CGCAAGTACA GGTAAACAAG 2520 

ATGTTAGCGA TCAATTACTT GAACTTGGTG CCAAAGAAGT TATCGATCGA CTTCCTGTTG 2580

AAGATGATCA TAAAAAGCCA CTCGCATCAT CAACTTGGCA AGCTTGTGTA GACCCTGTTG 2640 

GTGGCGAAGG TATTAATTAT GTTACAAAGC GTTTAAATCA TAGTGGGTCA ATTACAGTTA 2700 

TTGGTATGAC TGCCGGTAAT ACTTATACTA ATTCTGTATT CCCTCACATT TTAAGAGGTG 2760

TAAACATTTT AGGAATTGAC TCGGTATTTA CTGCTATGAA ATTAAGACAG CGC6TTTGGC 2820 
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TO 




TTGATGAACT TCCAGAiVCAA CTTAACAAAG TAATTAAACA TGAAAATAAA GGGCGCATTG 2940 

TTATCGATTT CGGTGTAGAT AAATAGTATT CATGAAAAAG ACATCCCGTT ATGCCSAGATG 3000 

TCTTTTTrAA TTTAGTATTT GATATACATA CCGCCTGAAT CTGGTTCGGT AGGTATAAAT 3060 

CCAAATTTTG TATATAATTT ATCCGCTGGG TAGTCTGCAA TCAGAcTAAC GTATGTACTC 3120 

TCAACAGCCA CACCTTTAAT ATATTGCATA ATATGCTCCA TAATTAGACT GCCGTAACCT 3180 

TGACCTTGGT AACTTTTCAA AACTGCAATA TCAACAATTT GAAAAACAGT TCCGCCATCG 3240 

CCAATCACTC TACCCATACC AATTAACCGA TCTTTATCAT ACAAGGTTAC TGTAAATAAG 3300 

GCATTAGGTA ATCCTTTTTC aGCTGTTCGC GCGTCTTTGG ACTCATACCT GCGTTAATCC 3360 

TTAATGCGCA ATAATCCTCG CAAGTCX^GAA TATCATATGT CACTTTAACC ATTATTTACC 3420 

CCACTTTTCA TCACACAATA TATCAACCTA GTATAAATGT TTATTTACAA TAGTCTTATT 3480 

CGCTTCTTTA AACACTTCAT GATGACTTGA AACATAACCC TCTGCATTCG CATCTGGTTG 3540 

GATATATGTT TTAGCAAGGT TCGCTGCATT TGCACCATCA CTAAATGCAC TTGCAATTA6 3600 

ATGTGATTTT GCATCATGAT AAACAATATC TCCACACGCA TAGATACCAG GTATACTAGT 3660 

TGTCGTATTA CCAAATCCTT TAACACGACA ATCATCATGC ATATCTAGCT TTGAAGATGT 3720 

TtCACTCAAT AATGTATTAC AACGATCAAA CCCATGACTA ATAATCACAT OGTCAAATTT 3780 

AACTGTATGC CTATCGCCAC TTTCAACATG TTCCAAAACA ACTTCACTTA TATGCGTTTC 3840 

ATCATCATTG CCGACCAAGT ATTTAATACG TGTTTTTGGG CATAGTTTCA CATTTAAATC 3900 

TGTCACCAAC GTTTTCATCG CTTCATGACC ACTTACATCT TCTTTTCGAT AAACAACTGT 3960 

CAOGCTTTTA GCAATCTTGO CAATATCATG CGCCCAATCT AATGCTGTAT TTCCTCCACC 4020 

TGATATTAAT ACATCTTTAT CTTTGAAACG TCTGTAACTT TGTACAACAT AATGTAAATT 4080 

AGTT5ATTGA TATCTCTCTA CACCTTTAAC ATCTAATTGT TTTGGATTAA TAATACCCGC 4140 

ACCAATTGCA ATGATAACTG CTTTCGATGT ATATATTTCT CCCGCTTCTG TTTCAACTTC 4200 

GAAATGACGT TCTGCCTTTT TCCTAATATC TACCACACGT TCATTCAAAT GAACTTCCGG 4260 

TTTAAAATAT AATCCTTGCT TAATTGTATC TTTTAAAATT TCATGACAAG GTTTTGGCGC 4320 

AATGCCGCCA ATATCCCAAA TAATTTTTTC AGGGTAAATT CTCATCTTAC CCCCTAATTC 4380 

AGATTGAACA TCTATCAATC TTACAGACAT ATCTCGCAAT CCAGCATAAA AGCTTGCATA 4440 

CAAACCAGAC GGACCGCCAC CAATGATTGT AACATCTTTC ATTATGTGCC TCCTATGACT 4500 

CTCTATATTC ATTTCTTTCA TTAACGTGCT CAAATTGATA ATTATTATCA TTTAAAGCCA 4560 

TTATACTATT AATATTTATA TTGTTAAAAT AAATCGCATA GTTAGCCATG AATTATCAAT 4620 
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GAAAGATGTG TATATTTTTT AGTTCTAGTT ATATTATTTT TTAAAAGACT CATCACGTGG 4740 
TTCTTTAAGA ATTGCTTGTC TTAAAAGGAA AAATAGCAAC AATAAACCTG CAAGCATACC 4800 
TGTGTGCCCA ATACCTGCAA AGCCTGCnAA TGCTTCTGGA GAGTATGATT TACCAGTGAC 4860

TTGGAAGAAT CCTTTTGTC 4879 
(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1560 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90:



ATAATGTCTT AGaTTGATTG GGAGTTTrrr TAArrrrriT GAAATTAAAT TAATCTGTAs 60

yTAATAAAAA ATTTGAATAA CTGACACAyT TTTTTGATCA TAGCTAyATA CTTTGTGAAT 120

TAATTCACAT TATAATAAGA GTGAAGATAA GAGTATTATA AATnATCTTT AAATAAATAT 180

ATGTGAAGTA AAAATTACAC GTTAGCATAT CGATTATGgT CATTTCkTTT AACATATTAA 240

CTgGGGaACG TTAAAAGTTA ACGGkTGATA TCyAACtAAA AACAAGGTCA CAGTAGTATG 300

TTTTAATCTG GCGTCTATTA CAAATAAAAA TTACATCTAT AATTATTCGT TTTCTTTTTT 360

GAAAGTAATA GCCAATTAAT ATCATACATA CTGGAGTGAC TATAAGGAGG ACATTATTAT 420

GAGAGCAGCA GTTGTAACGA AAGATCACAA agtaagtatt GAGGACAAAA AGTTAAGAGC 480

TTTAAAACCT GGTGAAGCGT TGGTACAAAC GGAATATTGT GGCGTTTGTC ATACCGATTT 540

ACATGTTAAG AATGCTGATT TTGGTGATGT TACAGGCGTT ACTTTAG6TC ATGAAGGTAT 600

TGC^AAGTC ATCGAAGTTG CGGAAGATGT AGAATCATTA AAAATTGGAG ACCGTGTOTC 660

TATCGCTTGG ATGTTCGAAA GCTGTGGAAG ATGTGAATAT TGTACAACAG GTCGTGAAAC 720

ACTTTGCCGT AGTGTGAAAA ATGCTGGTTA TACAGTAGAT GGTGCAATGG CTGAACAAGT 780

TATTGTTACT GCAGACTATG CTGTGAAAGT ACCTGAAAAA TTAGATCCAG CAGCAGCGTC 840

TTCTATTACA TGCGCAGGTG TGACAACTTA TAAAGCTGTA AAAGTAAGTA ATGTAAAACC 900

TGGACAATGG TTAGGTGTTT TTGGTATAGG TGGTTTAGGT AACCTAGCTT TACAATATGC 960

TAAAAACGTT ATGGGGGCTA AAATTGTTGC CTTCGACATC AATGATGATA AATTAGCATT 1020

CGCGAAAGAA TTAGGTGCTG ATGCTATTAT TAATTCTAAA GATGTTGATC CAGTTGCAGA 1080

AGTTATGAAA TTAACTGATA ACAAAGGATT AGATGCAACA GTGGTAACTT CAGTTGCTAA 1140
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TTTACCTGTT GATAAAATGA ACTTAGATAT CCCAAGATtA GTGCTTGATG GTATTGAAGT 1260 

AGTAGGTTCA CTTGTTGGTA CAAGACAAGA CTTACGTGAA GCGTTTGAAT TTGCTGCTGA 1320 

AAATAAAGTA ACACCTAAAG TTCAATTAAG AAAATTAGAA GAAATCAATG ATATTTTTGA 1380

AGAAATGGAA AATGGTACTA TAACTGGTAG AATGGTTATT AAATTTTAAA AATATCAACT 1440 

GACTATATAG ATAAAGAAGG TAGTGCTCTG AACACTATCA TTATTAATCA AACCCCGAGG 1500 

TTTTCCTGAA AAGATAGTGG llAAATCCCCG TGTTTTTTGG GTTTGAGGnG GTTGTnTGTA 1560 

(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11014 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91:





GTCCTGTnGC TGCAATGAAT ACGCCTAAAA ATCCAGGGAT GTAATGGATA CTTTGTG6TA 60

GTACTAATGA TA6AAATGAT AAAAATGAAA TCACAAAGGC TACGCTCGCA AAAGCTTGAC 120

ATGTACGCTT ATOGCCATAA TCTAACCCTG TACGTATATG TAATAAATAC TGTAATCCGA 180

TACTTAAATA CATAATTGCC ACGCATAAGA AGAATGGGAA GAATGTCTTT TCAAAGTCCO 240

GATATAGGCT GTTAGATAGG AAGACCATGA TGAACATATT AAACATCATA AACGAGACGT 300

CTTTGAATGT AACTTGACCA AATCGATTTG TAAAAAATGT TTGATGAGAC CACATTAACC 360

ATAAGAACAA ACTCATGACG ATGTATTTGA AAAATAAATC AGCTGAAATG GAACCGTTTT 420

GTGTTGTTAA AATCACATGT GCAATTTTTT GAATGGCATA GACGAAAATT AAATCAAAGA 480

ACAffiCTCATG GAATCCTGCA CGCTTTTCAG CTAAATGTTT TGGTGTTAAT GCATTAACCA 540

TAAAATTTTA ACTCCTTTAA GATGTGTAAT TAATTTACTA AGTATACTAT TTATTTTTTC 600

TAGTGAATAG GGGCAGATTT GGCGATGAAG TGGAAGGAGA GGTGACTGCA AGGTAATTGC 660

GGAATTAACA ATCATCAGCG ATTTAATATT TGACTGGAGA CGTCATGGTA ATAAAAAATT 720

GATGAGAAAT TGATGGTGAA ACCAGCTGTG AATAsCGaTG cAATGATrsA TAGaiATTTAA 780

TTAQAGTCAT TACGCGaAAT GATTAATGAT AATTTCTGGT AAATCAAAGC aTAATTTTGT 840

ACTATAGATG AGGATGATAG AGCATATTTA AGAGGGTGAA ATGTTAAAGT GAAACCGTTT 900

ACGTTTCCGA TTGCCCAAAC AAATTACATC ATTGTATAAT ATGATrTGTT AAATGCATAA 960

CAAGAATGAA AATGTAACAT ACGTAGCAAT TGGTTTCATA AATTGGATGT TAGTGGCGTA 1020
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10 



TGACGAGAGT CGTATTAGCA GCAGCATACA GGACACCTAT TGGCGTTTTT GGAGGTGCGT 1140 

TTAAAGACGT GCCAGCCTAT GATTTAGGTG CGACTTTAAT AGAACATATT ATTAAAGAGA 1200 

CGGGTTTGAA TCCAAGTGAG ATTGATGAAG TTATCATCGG TAACGTACTA CAAGCAGGAC 1260 

AAGGACAAAA TCCAGCACGA ATTGCTGCTA TGAAAGGTGG CTTGCCAGAm ACAGTACCTG 1320 

CATTTACGGT GaATAAAGTA TGTGGTTCTG GGTTAAAGTC GATTCAATTA GCATATCAAT 1380 

CTATTGTGAC TGGTGAAAAT GACATCGTGC TAGCTGGCGG TATGGAGAAT ATGTCTCAAT 1440 

CACCAATGCT TGTCAACAAC AGTCGCTTTO 6TTTTAAAAT GGGACATCAA TCAATGGTTG 1500 

ATAGCATGGT ATATGATGGT TTAACAGATG TATTTAATCA ATATCATATG GGTATTACTG 1560 

CTGAAAATTT AGTAGAGCAA TATGGTATTT CAAGAGAAGA ACAAGATACA TTTGCT6TAA 1620 

ACTCACAACA AAAA6CAGTA CGTGCACAGC AAAATGGTGA ATTTGATAGT GAAATAGTTC 1680 

CAGTATCGAT TCCTCAACGT AAAGGTGAAC CAATOGTAGT CACTAAGGAT GAAGGTGTAC 1740

GTGAAAATGT ATCAGTGGAA AAATTAAGTC GATTAAGACC AGCTTTCAAA AAAGACGGTA 1800 

CAGTTACAGC AGGTAATGCA TCAGGAATCA ATGATGGTGC TGCGATGATG TTAGTCATGT 1860 

CAGAAGACAA AGCTAAAGAA TTAAATATCG AACCATTGGC AGTGCTTGAT GGCTTTGGAA 1920

GTCATGGTGT AGATCCTTCT ATTATGGGTA TTGCACCAGT TGGCGCTGTA GAAAAGGCTT 1980 

TGAAACGTAG TAAAAAAGAA TTAAGCGATA TTGATGTATT TGAATTAAAT GAAGCATTTG 2040 

CAGCACAATC ATTAGCTGTT GATCgTGAAT TAAAATTACC TCCTGAAAAG GTGAATGTTA 2100 

AAGGTCGCGC TATTGCATTA GGACATCCTA TTGGTGCATC TGGTGCTAGA GTATTAGTGA 2160 

CATTATTGCA TCAACTGAAT GATGAAGTTG AAACTGGTTT AACATCATTG TGTATTGGTG 2220 

GCGGTCnAAC TATCGCTGCA GTTGTATCAA AGTATAAATA ATAAGAAAAC AGGTTATCAC 2280 

AACASTATTA ATtACATGTT GGCATAACCT GTTTTTATTT GTTTATGGAT TTATTGGGTA 2340 

ATATTAGTCA nTGATGGTT TAATTGCAAA TGCTCTAACA GGGAACCCAG GTGCATCTTT 2400

TGGTTTAGGG CTGATAGCGT AAATGATGGC GCCACGAGTT GGTAATTGAT CTAAATTAGT 2460 

TAATAACTCG ACTTGGTATT TATCCTGACC AAGAATATAA CGTTCGCCAA CTAAATCACC 2520 

ATTTTTTACA ACGTCCACAG ATGCATCGGT ATCGAATGTT TCATGACCAA CAGCTTCAAC 2580

ACGACGTTCT TCAATTAAGT ACTTCAAAGC ATCTAATCCC CAACCCGGTG CATGTTGTTG 2640 

TCCGTTCGCA TCTTTGTTTT CAAACTTTTC AATATTAGGC CAACGTTTTG ACCAATCGGT 2700 

ACGAAGTGCA ACAAAAGTGC CAGGTTCAAT AGTACCATGC TCTTTTTCCC ATGCTTCTAT 2760

ATGCGCACGT GTTACGATOA AATCATTGTT GTTCGCTACT TCTGTTGAAA AGTCTAATAC 2820 



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AAAGTGAATT GGTGCATCAA TGTGAGTACC 
ATATTGCGTT ACAATATTCC AACGTTGCAC 2940

ATAGAAACCA TGATCTTTAA CCGTGAATAA AGTTGAAACT TCGCCrrXTT CAAACTCACT 3000

AAAACGTGGT ATTTCCGGAT CAAATGTATG CXJTTAAATCA ACCCAAGTTG CTTGTTTTAA 3060

AGTATTTAAT TGTTGCCATA AAGGATATTG TGTCATAAAA TCACCCGTTT TTAGTTTATT 3120

ATATGATAAA TGCTGCGATT ATTCTTGGCG TTTAGCTTTA ACAGCATTCA CAAGCACAGT 3180

CAATGCATCT TTAACTTCTT CTTCTTTTCG CGTTTTTAAA CCACAGTCAG GGTTTACCCA 3240

GAATAATGAG CGGTCGATTT GTTGTAGTGA ACGATTGATT GCTGTAGTAA TTTCTTCTTT 3300

TGTTGGAATA GGTGGACTAT GAATATCATA TACACCTAGA CCAATACCTA AATCATAATT 3360

AATATCTTCA AAGTCTTTAA TTAAATCACC ATGGCTACGA GATGTTTCAA TTGAAATAAC 3420

ATCAGCATCT AAGTCATGAA TAGCATGAAT GATTTGACCG AATTGAGAAT AACACATATG 3480

TGTATGGATT TGAGTTTCAT CACGAACTGA AGACX5TTGCA AGTTTAAATG ATAAAACAGC 3540

ATCTTTAAGA TATTGTTCGT GATATTCAGA GCGTAATGGT AAGCXITTCAC GTAATGCAGG 3600

TTCGTCAACT TGGATAACTT TGATTCCTGC AGCTTCAAGT GCTAATACTT CTTCGTTGAT 3660

TGCTAAAGCA ATTTGATCTT GAAOGACTTT ACGTGGTAAA TCAACACGTT CAAATGACCA 3720

GTTTAGAATT GTTACAGGTC CAGTTAACAT ACCTTTAACT GGTTTATCTG TTAAGCTTTG 3780

TGCATAAACT GTTTCATCAA CAGTTAAAGG CGCTGTCCAT TTTACATCAC CATAAATGAT 3840

TGGTGGTTTT ACGGCACX5TG AACCATATGA TTCCACCCAA CCGAATTTAG TTACTAAGAA 3900

ACCTTGTAAT TTTTCTCCGA AGAATTCAAC CATGTCATTA CGTTCAAATT CACCGTGAAC 3960

TAATACATCT AAGCCAATGT CTTCTTGAAT TTTAATCCAT CGAGCAATTT CATTTTTTAA 4020

GAATGTTTCA TATGCTTCGT CTGTAATGCG TTTGTTCTTC CAATCTGCAC GGTATTTTCG 4080

AACTTCTCGG CTtTGTGGGA ATGATCCAAT AGTTGTTGTT GGTAAATCCXS GTAAGTTCAA 4140

ACGTTTTTGT TGTTGTTCAA TACGTTGCGC GAATGGTGAT TGTCTTGAAG TACGCACGCT 4200

TTCGAAATCA TAATCTAAGT TTTTGAATGA TTGATTTTGG AAACGCTCAT AACGTGCTTT 4260

TAATTTATCA TATTTAACAC TATCXSTTTTG ATTAAATAGG CGACGCAATG CATCTAATTC 4320

GTCTAATTTT TCAGTTGCAA AGCTTAAGCC TTCGCCAACA CTTGTATCTA ATGTTTCATC 4380

ATCTAAAGAT ACTGGAACAT GTAATAATGA AGATGATGGT TGAATGACAA GTTCATTAGT 4440

GTGTGCTAAC AATTTATCGA TTAAGACTTT TTTAGCTTCA ATGTCACTTG CCCATACATT 4500

ACGACCATCA ATAATTCCAG CGTATAATGT TTTTGATTTA TCAAAATCTC CAGCTTCAAT 4560

TTGTTTAAGG TTATAGCCAT TATCATGGAC AAAGTCTAAA CCTATACCAC CAACAGGTAA 4620
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IS 



AACACCAGCT TTTTCGAAAT AGTCATAAGC TTCACGTGTA ATATTTTCAT AGCTTTCGCT 4740

GTCGTCTGTA ACTAAGATTG GCTCATCAAC TTGAATGTAC TCAGCACCTG CATCAATTAA 4800 

TGATTCAAAC ACTTCTTTAT AAAGTGGTAA TAACGTTTTA ACTTTTTCTT CAAAAGTTTG 4860

GTGACCGCCT TTTGATAATT TAACAAAAGT AATCGGACCA ACAATGACAG GGTGAGCGTT 4920 

AACGTTTAAA GATTGGGCAT ATTTAAAGCG ATCTAATAAT ACATTGCGAC TCACTTTAGG 4980

CTCAACATTG TCCCATTCAG GTACGATGTA ATGATAGTTA GTGTTAAACC ATTTTATAAG 5040 

TGCACTTGCA ACATGGTCTT TATTACCGCG AGCAATATCA AATAATAAAT CATCATCAAT 5100 

AGTTCTTCCT TGGAAACGTT CAGGGATGAT GTTGAATAAT AATGAC6TAT CTAATATATG 5160 

GTCATATAAA GAGAAATCAC CAACTGGGAT GCTATCTAAG TGATAGTACT TTTGtAATAA 5220 

TAAATTTyCT TTATGTAGAT CAGTTAATGT TTGATCTAAT TCTTCTTTAG AAATCTTCTT 5280

TGCCCAATAA CTTTCGATGG CTTTTTTCCA TTCTCTTTTT CTACCTAATC TTGGGAATCC 5340

TAAGTTTGAT GTTTTAATTG TTGTCATAAT ATTGCCTCCT TGTGAGCAGT AATAQATTTT 5400 

GAGTATGCTG CAAGTTCTAA TGAATCTTCG ACATTTTGAA ACGGTGTGAT AATGTATAAA 5460 

CCATTAAAAT ATTCATGAAC AGTATCGATT AAATCCTTTO AAAGCTTAAG ACTTAGTTCT 5520

OGTGTTTTGG CTTTATCATC TTTAACTGCT TCAAATTGTT GTAAAATTTC ATCTGACATC 5580 

TTGATTCCTG GCACTTCATT ATGCAAAAAG AGTGCGTTTT TGTAACTTGC GATAGGCATA 5640 

ATGCCTATGA AAAATGGTTT GTTCAAGTGC TTAGTGGCAT GGTAAATTTC AATGATTTTC 5700 

TCTTTGCTGT ACACGGGTTG TGTTATAAAA TAAGACATTC CGCTTTCTAT CTTTTTCTCT 5760 

AATCTTTTGA CGGCACCATA TAATTTACGA ACATTAGGGT TAAAGGCGCC AgcGATGTTG 5820 

AAGTGTGTAC GTTTCTTCAG CGCATCACCG TCAGTGTTAA TACCTTGATT AAATCTTAGA 5880 

GCG^GTTCAG TTAATCCTTT AGAATTAACA TCATAGACAT TGGTTGCACC TGGTAAGTGA 5940 

CCAACTTTTG AAGGATCACC AGTTATGGCT AATATTTCGT TAACGCCAAT GAGCGATAAT 6000

CCAAGTAAAT GGGACTGCAA GCCGATTAAG TTTCGGTCTC GACATGTAAT ATGTACGAGT 6060 

GGTTCAATAT TGTAATATTG CTTAATTAAG CTAGCAGCAG CAATATTGCT AATTCTGACA 6120 

GTTGCCAATG AATTATCTGC GAGTGTTACC GCATCTACAT TAGCTTTATC AAGTTTAGCG 6180

ATATTTTCAA AAAATCTATC CGTGTCTAAA TGTTTCGGTG TATCCAATTC GATAATAACG 6240 

GTTGGACGTT CTTGAACCTT AGATGTTAAT GATTGTCTAA CTTTATTTTG AGATGGATTG 6300 

AAAAGTGCTT TC3GTTGGTAT CGGAATCACT TTTTTGTCAT TAACAGGTTT AAGTGTCTGA 6360

ATAGATTCTT TAATAAATTT GATGTGCTCT GGCGTTGTAC CACAGCAACC ACCAATTAAA 6420 
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TACTTAAATT CACTATTTTC AATATCTAAT AAGCTGGCAT TTGGATAACA AGATAAGAAT 6540 

GCGTGCTCTG GTAATTCAAT ATGTGTGAAA GACTCTTGCA TATGGTGCGG GCCATGATGA 6600 

CAATTGAGTC CCACGATGTT TGCACCACAT TGAACXjAGTT GTTTTAATCC TTCATTQATT 6660 

GCCTGACCAT TAACTAAGTA ATTTGTGTTT GAAGCXX3TTA ATTGAGCAAT GATTGGAATG 6720 

TCGTATTTCT TTCTCGTTCG TGAAATGACA TTTGTTAACT CTTCTAGGTC GTAATACGTT 6780 

TCGAAAAGTA GCGCGTCAAC GCCTTCTTCA ATTAAGGTGT CTATTTGAAT TTCAGTATGA 6840 

TAAAGAATAG TTTGTAAGCT GATATCCTCT TGTTTGATAC CTCTAAACCC ACCAACTGTG 6900 

CCTAATATAT ACXvTATCTTT ATTTGCTGCT TTTTTTGCGA T6CGAACG6C GGCTT6ATGT 6960 

ATTGCTTTAA CTTTATCTTC AAGACCGAAT CGTTTTAACT TTTCAAAATT TGCACCATAA 7020 

GTATTGGTTT GAATQACATC AGCACCGGCT TCAATATATG AACGATGGAT GCGTTCAACT 7080

TTATCTGGAT GGCTAAGATT ATATGCTTCT GGACAGGTGT CTAATCCTTC AGAGTATAAA 7140

ATGGTTCCTA TAGCGCCATC AGCTACTAAA ACATTATCTT TCAATTGTGT GAGGAATTGA 7200 

CTCATTGAAT GCCTCCTTTA ATGCGTATTT GATGTCTGCA ATGAGTTCAT CAGGATCTTC 7260 

GAGACCAACA CTTAATCGGA ATAGACCGAA AGTGATACCA CGTTCTTGTC TCACTTCTTC 7320

AGGTA6TGCA GCGTGAGACA TTGTTGCTGG ATGTGAAAGG ATCGTTTCAA CACCGCCCAG 7380 

ACTCACTGAA ACGAGTGGTA ATGTCAGTGC ATCGACAAAT TGTTGTGCTT TAGACTCATC 7440 

AGCTAAACGA AAGCCAATAA CGGCACCGCC ATTTTTAGCT TGTTCTAAAT GAGCAGTAGT 7500 

GAGTCCCGGA TAATAAACTT CTGAAATTTC ATCTTGCTTT ATTAAAAATG ACACGATTTT 7560 

TTGAGCGTTT TCGACAGATT GTTTAAATCT GATTGGAAAA GTTTTTAAAT GTTTAGCAAG 7620 

TGTCCAGCTA TCCTGAGCAG ATAACATATT GCCTGTACCA TTTTGTATTA AATAAAGAGC 7680 

GTCflCTAATT GCCTCATTAT TAGTTATGAC AGCACCAGCA ATTAAATCGC TATGTCCACT 7740 

TAAAAATTTT GTAGCACTAT GAATGACAAT ATCAGCGCCA AGTAATAAAG GTGATTGACc 7800 

TAACGGTGTC ATAAATGTAT TGTCCACAGC TACCAGTAGT TCATGCTTTT CGGCTATTTT 7860 

AGAAACAGCT TTGATATCAG TAATTTTAAA ACAGGGATTC GATGGTGTTT CGATATAAAT 7920 

TAATTTTGTG TTTGATTGAA TGGCACCCTC GATTTGTTCG AGCTTTGTAG TATCTACGGT 7980 

TGTAAATTCA ATATTAAATC GATTCAAAAT TTGCTCAGTG AGGCGAAAAG TACCGCCATA 8040 

TACATCATCG GGTAAGATGA CATGATCACC AGATTTGAAA GTCAAAAGTA CTGCTGAAAT 8100 

AGCAGCAATA CCTGATGCAA AAGCAAAAGC GAATTTTCCC TGTTCTAATC GTGCTAACTT 8160

CTCTTCTAAA AGTTCACGGT TAGGGTTGCC CTTCGTGCAT AATCATATTT AACATCGCCA 8220 



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TCCACACCTC 


TACGCCAATC GAATATCACT TCTGTCTCTT TTGAAAGTGT CATACAATCT 8340

CTCCAATCTG AGCTTTATCT AATGCTTGGA TGATATCGCG TTCGATGTCT TCATAATTTT 8400

CAACACCTAG TGATAAGCGG ATTAAATACT CATCAATGCC ACGTTTATCT TTTTCAGCAT 8460

CTGGCATATC AACATGTGTT TGGGTGTAAG GGAAGGTCAC TAATGTTTCA GTACCTCCTA 8520

AACTTTCTGC AAAAATGCAA ATGTCTAAAT TTTCTAATAA TTTAGCGACG CTATAGGCCT 8580

TGTTAAGTCT TAAACTAAGC ATGCCAGTTT GCCCGCTATA TAGTACTTCG TCAATTGCTT 8640

GAAGTGACTG ACATTTTTTA GCAAGTTTTC TAGCGTTTGA TTGCGCACX5C TCAATGCGTA 8700

AATGCAAAGT TTTAAGTCCA CGTAACAACA AATAACTATC TATTGGTGAA AGTGTTGCGC 8760

CAGTCATGTT GTGAAAATCA AACAACT6TT GCXSCGAGTGA TTCATCTTTG ACGGTTACGA 8820

CACCTGCTAG TACATCGTTA TGTCCGCCAA TATATTTC6T GGCTGAATGT AAGACTATAT 8880

CAGCACCTTC TGCTAGTGGT GTTGAAAGAT AAGGTGTTAA AAAAGTATTG TCGATAATTG 8940

ACAATAAGCC TTTAGCTTTA CAAAGTTGAT AGTATGGCTT TACATCAATA GCAATCATTT 9000

GTGGGTTAGA TATTGGTTCA ATGAATAATG CAACTGTTTT ATCAOTGATT TCTTTTTCAA 9060

CTTGTTCATA ATCTGTAAAA TCAACGTACT TAAATTTGAT ATCGTATTGT TGCTCGTAAA 9120

ATTCAAATAA TCTAAATGTG CCACCATATA AATCGAATGA AACTAAAATT TCATCATGAG 9180

GTTTAAATAG ATTACATATT AATTGAATGG CTGACATTCC ACTT6AT6TA GCGAATGATG 9240

CAATACCATG CTCAAGTTTG GCAAAACAGG TTTCAAATGT TGAGCGTGTA GGATTTTTAG 9300

TACGTGTATA ATCAAAACCT GTCGATTGTC CTAGTTTTGG ATGCTTGTAG GCAGTAGATA 9360

AATGGATTGG ATTCGCTATA GCACCGGTTG AATCATCGGT TAATGTGATT TGGGCTAACT 9420

GTGTATCCTT CATATTAAGA CCCTCCTATA AGAAAAAATA AAAAAAGCTT CCGTCCTTCG 9480

TACCCGAATG AATCGGATAA AAAGGACGAA AGCTTATGTT TCGCGGTACC ACCTTTATTT 9540

GTTATTCCAT CGCTGAAATA ACCTTATTCA GTACGCATTA AAAGTAAATA TGCTTACTGA 9600

ACAATTATCA CAATTAAAGT CAGTAAGTAA GGATATAGTA ATGTGCTATC CCATACTTAT 9660

TAACAAAAAA TCGTGCGTAA AGAATCCAGT ACGCCATTTA ACATCAATGT TAATACTGTA 9720

TCGCTATAAC GGGCGAACCC GTAGACACCT CATATTGGCA TCAACACTCC AAGGCCATTT 9780

TCAAACACGC TTTCAAAATC TTCTCTCAGC TACTAAAGAC TCTCTGTATA AGCAGGGTGT 9840

GTTTTACTTy CCTCTTTATT GTGTTTACGT TTCATTAAAC TGTTATAAGA TATTAATTAG 9900

CTTACAGAGT AAAAAAAGAT TTGTCAACAA TTATTCAGAA AATTTTGATT TAAAAGTTAA 9960

TTTGTTTGTG AAATTGTAAT TGGTATCTTG AAGTTGAAAA ATGAATTATT TTTTAAATAA 10020
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5 



TCAAATAAAA AGTGATGTGA GTGAATT6TC AAAAAGTGAA GATCAACGTA TTACTAAAAC 10140

AAAAGATGAA CAAATTAAGC AAATAGATAT ATCGGATATC AAACCGAATC CGTATCAGCC 10200

CCGAAAAACT TTCGATGAAA ATCATTTAAA TGATTTGGCA GATTCAATTA AGCAATATGG 10260

AATTTTGCAA CCAATTGTGC TTAGAAAAAC AGTTCAAGGT TATTACATTG TAGTTGGTGA 10320

AAGAAGGTTT AGAGCTTCGA AAATTGCTGG TCTAAAATAC GTATCAGCGA TTATCAAAGA 10380

TTTAACAGAT GAAGATATGA TGGAACTGGC GGTCATCGAA AATTTACAAC GAQAAGACTT 10440

AAATGCGATT GAAGAAGCTG AAAGTTATCA ACGTTTGATG ACAGATTTGA AAATTACACA 10500

ACAAGAAGTA GCGAAACGAT TGAGTAAGTC GCGCCC6TAT ATAGCGAATA TGTTGAGGTT 10560

ATTACATTTG CCGAAAAAGA TTGCTGACAT GGTAAAAGAT GGGCGACTGA CAAGTGCACA 10620

TGGACGAACG TTATTGGCAA TTAAAGATGA ACAACAAATG CTTAGGTTAG CGAAACGGGT 10680

TGTTAAAGAA AAGTGGAGTG TCAGATATTT AGAAAACCAT G1TAATGAAT TAAAAAATGT 10740

TTCGTCAAAG TCGGAAACAG ACAAAGTAGA TATAACTAAG CCTAAATTTA TAAAGCAGCA 10800

AGAACGACAG TTGCGAGAAC AGTATGGTAC CAAAGTAGAT ATATCAATAA AAAAATCGGT 10860

TG6TAAAATC TCATTTGAGT TTGATTCACA AGAAGATTTT GTGAGAATAA TTGAACAATT 10920

AAATCGTAGG TATGGTAAAT AGTTACACAA TTTTATATAA TAACTCTTTG T6CAAGTGTA 10980

AATAAATTGT AATCAGTGAC ATTTGATTCT AGAT 11014



(2) INFORMATION FOR SEQ ID NO: 92:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6022 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92:

TCCCCTTATG GAATTTCACA TTCTAGTTTA CATAATATAT ATTATAGGAA GTTATATGTG 60 

TGTAACGCAA AAgGTACCCT ACATCATAAT CATTATCTAA TATCGTCACA TAACTTACTT 120 

ATGCTATAAT CATGGTATTA TATTGTTTGG AGTGATTTGA TGAOATTTGT CTTTGATATT 180 

GATGGTACGC TTTGTTTTGA CGGCCGATTA ATTGACCAGA CTATTATTGA TACATTGTTA 240

CAATTACAAC ATGATGGTCA TGAACTTATA TTTGCATCAG CACGTCCGAT TCGTGATTTG 300 

TTGCCAGTTT TACCATCAGT ATTTCATCAG CACACATTAA TTGGCGCAAA TGGTGCTATG 360 

ATTTCACAGC AATCAAAGAT TTCTGTTATC AAACCAATTC ATACTGATAC ATATCATCAT 420 

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10 






GCTGCACAAC TTGACGCTGn AGAACGCGAT TTTTGAGCGT TTAGATCCAC ATAAGCTGGC 540 

CAGTTGTATT GATGTTGCAA ATATCGACAC GCCAATCAAG AJcTATTTTAT TAAATATAGA 600 

CCCGGCACAA ATTACAACTA TATTAGACGA GCTAGATAAA TACCATCAAG AATTGGAAAT 660 

GATTCACCAT TCAAATGAGT ATAACATTGA TATAACAGCG CAAAATATTA ACAAATATAC 720 

TGCATTACAA TATATATTTG ATGCAGATGT TAAATATATA GCATTTGGTA ATGACCACAA 780 

TGATATTGTC ATGTTACAAC ATGCTAGTAG TGGCTATATT ATAGGACCAT CAGAAGCATA 840 

CACACACGCA ATATTGAAAC TTGATAAAAT CAAACACATC AATAATAATG CACAAGCTAT 900 

TTGCAAAGTC TTAAAATCAT ATAAATAAAA ACACCCCTAT CAAATGATAA TCATTATCAA 960 

TCGATAGGGG CTATTTTAAT AAAATTCGTC CTCGAACATT TCTTCCTCTT CATCTAATCC 1020 

AAATAATTCT GCCATTTCTC CATGTTCAAT TAACATGTTT AAATATGCAT CGCGGAGTTC 1080 

TTCTTCACTC ATATCATTAA TCATTTCTTT AAGACTATCA ATCCACATAT TTCTGCGTAA 1140 

TTGATAGTCT TCTTCAACTT CGTTTAACAT CATTATATGT TTATTTGCTG CTTCTGGACT 1200 

AGCTGTAAAG AGTAATGCAA TCATATGTTT ACATATCACT CGTCTTCCAT CAGCATGAGG 1260 

ACAATTACAT ATGGATTTTC TAGGATGTTC CATATCAATA TAACAACGAT ATACTTTGTT 1320 

GCCACTGCCC TTTACTTCAG CCTCATGCTG CGTTTCTGAA AATGATTTTA AGTTAATGAC 1380 

GCATTCACTT TGATAATAAT TAAAGCCTCT TTCTATAGAA CGAATACTTG CAATATCAAG 1440 

TAATCCCATT AATGaTACTC CTTTTTATTA TTATTTTTAA ATAAAGJVaAA TAAAATAGAT 1500

AAGTGTCTAG ATTAAAATAC TTGATTTATC TATATTTTAT AACAAGTCTA GAATTATCGC 1560 

ATTCTTAAAT AACTAATATG AAAATGcTTG CACTAATTCt TTTGTATAAG GGTGTCTATC 1620 

AACATTAAAT AATTCCtCTA TTGCAAAATC ATCGACTATC ATGCCATCCT TAAGAACGAT 1680

AATTCTATTA ACTAAGCGTT GTAACACGGA TAAATCATGA QAAATAACGA TAAAATGATT 1740 

TAAGTTCGTA ATCGTTTGCG CTTTTAATAT ATT6ATTACA TTTTGTTCAG CTATAACATC 1800 

TAAATTTGAA GTTATCTCAT CACATATTAA AACGCGAGGC TGTGCTAATA ACGAACGCAT 1860 

GACATTAAAT CTTTGTAATT GTCCX3CCACT CACTTCGCTT GGTAATTTAO TCAATAATTG 1920 

CGC3GTTTAAC TCAAAAGTAG ATAAATGTTG TAATAATAAT TGATCCTGAG CAGTATTATC 1980 

AGTTAGACCT CTGTAATAAT ATAACGCTTC TTTTAATGAG GTCTCAATCG TCCAATCAGG 2040 

6TTAAAGCTA GTTAAAGGGT GTTGGAAAAT CGGTAACACA GCATTGTCAC TTAAGTAAAT 2100 

CTCTCCTTTA ACAGGTTTAA ACAAGCCAAG AACCAATGAA GCGAGCX5TAC TTTTACCACA 2160 

GCCACTTTCG CCTAAAATAC CAACATTTTC TCCATCAGGT ATAGTAATAT TGATATCTTG 2220 






CCCTCTTTAA TTGTCTTCTA TATTTAATTA GACGTTCAGT ATACGGATGC AAATGCTCAT 2340

ACTTGAAATG ATTAATATTA CCTCGTTCAA TGATTTGACC TTCTTTTAAA ACATAAATGT 2400

ACTGACAATA TTTCAATACA TGACTTAAGT TATGTGTGAT AATAAATAAT GTTTGACCAT 2460

GTTCTAATAC AATATGCTGT AATAAATCCA TCACTTGATT ACCGTTCAAA GCATCCAATG 2520

ATGCAACTGG TTCGTCTGCA ATGATTAATT TAGGCTCCAA CATGAGAACG CTTGCTATGT 2580

ATACGCGTTC AAGTTGGCCC CCA6AAAGTT GGAAACTATA TTTATTTAAT ATATCTTTGC 2640

TTTGTAAATT AACCCACGAC AAAGCCTTAT CAACTTTGGA CAAAGCCTCT TCTTTACTAC 2700

CTTTATAATG CTTACGATAA ATCGCAGTTA ACTGTTTACC TAATTTAGTA TGGTCGTTAA 2760

AACTTTCTGC ATAATTTTGA GAAATATAGC CAATTGTATG ACCATAATAT TGACTCAATC 2820

TACTAACATT TTCCCCATCA AATTGGTACG AATCATACGT GCAGCTTAAA TCAAATGGTA 2880

AATATTCAAG TAAAGCTTTA GCAATCAAAC TTTTTCCAGC GCCGCTCTCT CCAATCAAGG 2940

CATTAATCTG TTGACTAAAA ATTTTCAAAT CAATCCCTTT AATAAGAGAT TTCTCACTAG 3000

TATTCTTTAT TGTTAAATTT TGTATATCAA TGAGACTCAT CATATTCACC CCGTTGTTTC 3060

AGCAATCTAT CTCTTAGTGC ATCACCGGTT AAATTAAAAA TTAAAATAGT TATAGCAATG 3120

ACTGAAGCAG GTGCAATCAA CATAATTGGA TGAGACGAAA TAAAATCACG ACCTTGTTGC 3180

AACATAGCGC CCCaCTCTGG TGTTGGCGGT TGTGCACCTA ACCCAATAAA TGATAGTGAA 3240

CTTATATATA GAATGATTTT ACCGAAATCA AGGACCATCA AAACGATAAT AGCCGGTATA 3300

atmpiggtg ttaaatgacg tattaatatt GTTCTTGTTG GTACATGAAA TAATTGTGCC 3360

atttttatat aaggcttatt catttcgcta TTAACTATAC TTCTAGTCAA CCTTGTGTAA 3420

ttcatccatt ttattaatgt aattgagata ACTAAATTCC ATAAAGATGG TTGAAAAAAA 3480

cttgStaaag CAATCATGAT GATAAATTCT GGAATACTTA GACCAACATC AATAAACCTT 3540

AACACTAATC GTTCAATCCA CCCTTTTTTG TATCCGGCAA ATAGACCTAG TGTAACACCT 3600

ATGACAACGA TAGCTATTAA TGTTAAAACA GTAACAAACA ATGTTGAACG TGCACCGATA 3660

ataattcggg TAAATAAATC TCTCCCATAA TCATCAGTTC CTAATAAATG CAACCAACTA 3720

ATAGGTTCAA AAGTTTGTGA TAAATTGACT TTGGTTGCAT TTTCACTACT GACAAAGAAT 3780

TGCAGTACAA TTACCACAAA AATAAATGCA ACGAATACAA AAAATATCAG GTTATTCTTT 3840

GAAAATATTT TATGCATGAC GGTCACTACT TTCTGATATC AATGGTGTAT TGGTTTTGAT 3900

TTTTGGATTT CCTAATTGTA AACGCTGCTT CGGATCAAGT AATAACGTTA ATAAATCAGC 3960

AATCGTATTG ATAATAACAA CX3AAGAAGCC AATAAATAAC ACGCATCCTT GAATAACAGG 4020
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ATTTTCAATC ACTACAGTAC CACCTATTAG 
ACTGCCAAGT GAAATCCCTA GTAATGGGAT 4140

AATCGGCAAA ATTGTTGGTT TTAGTAAATC ATGAATTAAA ATATAACXTTT CATTCATACC 4200

GCGTAATCTT GATGCTTGTA CGATATTACT TTGCAATAAC ATCAATAAAT TAGAACGCAC 4260

TAAACGAATG ATGTATGCAC ACATACCTAA AGATAGCGTG ATTACAGGTA ATATAAACTG 4320

ACTTAGTATA ACGCTATCTA TATTCATTAA ATTTGTGACA ATAAATAATA AAATAATACC 4380

GATAAAGAAC GCTGGTAAAC TAATCGATAG TGTTGAGATC ACTCTAATCA CTTTATCCGT 4440

CCACTTATGA AATCGTTTGG CTGCTATAAT GCCGAGCGGT ATAGATATGC ATAACGACAC 4500

TACTAATGTT GAAAATGATA TGAGTAATGT TATGGGTGCA TAGTTGAATA ATATCTGTGT 4560

TACCGGTTCT TTTGATTCAA AACTTTTTCC TAAATTAAAA TGTAATAAAT GATTCATCCA 4620

ATGCCACCAC TGTACCAATA AAGAATCATT TAATCCCAAT TTATCTTTGG TTGCATTTAT 4680

TTGTTCCGTC GACACTTGTG CTACATCAAG ATGTAATATT TTATCAACAG GATTGCCTGG 4740

TGATAATTTC ATTAAAATGA ATGTAAGTGT AGAAATAACA AATAAAACAA CTATCATTTG 4800

CATCAGTCTA TACAACATAG ACTTTATTAT GAACATAATA GTCCCCCTCC TT6TGTAAGT 4860

TACTAACACT TTCTTTTTAC ATGAGAATGG CGCATGTATA TGCAACTTAC ATATTAAGAA 4920

CTAACGTTCA TTATAGTATT ATCCATAAAG AAATTGAAGT ATATTTAATT TTTTAACAAA 4980

ATCATTATAA AATATAATAT TTTGAATCAA GTCAACCATG TAAAATATAA AAAAGTCAAA 5040

ACAAAAACAA CTATAGCACT GTATTCCATC TCTTTOGAAA TAATTGTTAC TGCA6TGTAA 5100

CTTAAAAGTC GATGATTTTG TGCATATAGT TGTCGAATAT TATTTTTTAT CTTTACX3GCG 5160

AAGTTCAGCG CCCTCATAGC CGTATTTTTC AATTTOCTTT TCTAATTTAC GC6CTTTTCT 5220

TTCTTTACGC CAATTTCTAG TAAAATACCA TAATAGAAAA CTAATTAATA AACTCATAAT 5280

CGCTAAAAAT GCAGCGTATC CTAATAATGG TTGATATTTT ATATCTTGAA AATTTGGAAT 5340

AAAAAATGCA AGCACACCTA ATATAACAAA TGTAATTACT GCAGATACAA ACCATTTATT 5400

TAAAACTAAG CAACAGAATA TTGTTAATAA AATCATTATT AATGTTGTGA TCCATAAATA 5460

ATTAGGCATA TCGAATAATG TCATATTCAT TCTCCTTTTA TTTCATTACT TTCCTTGTAT 5520

ACATTTTATT ATAAATTTTT AAAAACTTAA ACAATAGCAG TCAGTTTCAA GCAATATTCT 5580

ATCTACTAAT AGAAAAATCA TTGTTCCTTG CGACATGGAA ATCGTAACAT TATCGTTTAG 5640

GAGACAAAAT TATGTATAAT GAATGTATTA TACCAAAGGA GTGATTATAT GTCTCAAGGT 5700

TTACCTTTAA GAGAAGATGT TCCTGTTTCA GAAACATGGG ATTTAGTAGA CTTATTTAAA 5760

GATGATCAAC AATATTATGA AAGTATTGAC GCTCTAGTAC AnCAAGCAAA TCAATTTCAT 5820



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GAAAATATTT TAATTGCCTT AGATOGCTTA AGTAATTATG CAGAACTAG6 TTTAAGTGTA 5940 
GATACTAGTA ATATCGAGGC ACAAGTATTG AGCGCTAAAT TATCTACTAC ATACGGTAAA 6000 
ATTGTTAAGC CAATTATCCT TT 6022 
(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 476 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93:

CCATCAATAA TGTATACATG ATTGGCATCA TATTCCCCTT TAATTAGAGA GCTACGTACA 60 

GTTTGTyTTA TTAAAGTAGA ACTAATAAAT AACCATCTCT TATGTGCACA AACACTTCCC 120 

GCAACAATTG ATTCAGTTTT ACCAACCCGT GGCATACCTC TAATGCCAAT CAACTTATGA 180 

CCTTCTTCTT TGAACAATTC AGCTAAAAAG TCTACTAACA AGCCTAAATC TTCACGCTCA 240 

AATCGAAAGG TTTTCTTATC TTTTGCATCT TGCTCAATAT ATCTTCCATG TCTTACTGCA 300 

AGACGGTCTC TTAATTCTGG TTTTTTAAGC TTTGTTATTT CAATTTCATT TATACCACGA 360 

GCTATTTGCT CAAAACGTTC AACTTTTTCA AGATTGTCTG TTTTAATTAA AAGGCCTCGT 420 

TTACCTTGAT CAACACCATT AATTGTAACA ATACTTATAC CTAACATACC TAATAA 476

(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3633 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 

AGAAATACAA CGAAGCATAT AAATATAACC GATCTTTTTT CTAATTGAAT ATTAAGTAAG 60 

TCTATGTACT TTCTGGAAGT AGCACCTAGT rGGATTGTtC CTCCTACAAC AGGCCAAAAA 120 

TTTTTATTTT TAACTGGCTT AACAGTGTTC AGTTTTTCAT ACTCTTCTCT ACTAATTTTG 180 

GCGCACCTTT TTGGAATGAA CCAATTAATA AATGGAAAAA AGTATACAAG CCAAGTTCTT 240 

ATTACATCGA CCATTAAATA CTCATCATCA TACTTAATAA CTCTGTATTT CGGATTTTTA 300 

TTGATAATTT CGGTTTCACA AAGCAATAAT TATCACTTCC TATTAATAAC AAATTCACAC 360 
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10 






TTATATGACC TTAAATATAT AACATGAATC TTTTTGTCTA TTATTGAAGA CATATTTATA 480 

AAGAAAAATA GCATTGTCAT AATAACCCAA GCAATAAATA CTATAATATT TTGGATAGAT 540 

AAACTAATCA rrACATCTAA GAACATGATT gATAATCCAC CACAGAAAAA ATAAGAAAAT 600 

AGTACAAAGC AAAGATTCTT GAATGATGGA AAAATCATAA TTTTTCCATT GCTACTCCGA 660 

TCATTATAGA TAGATAACTT TACTTTCTGA TTTAAATATA TATAAAACAC TAGAATACTT 720 

AATAATAAAA CCGAACAAAT GATAATAACG CAATTTTTTT CTAAATGAGA ATCAGGTATA 780 

TATATTTTAT CTCTAAACAT AGTGCCAAAT AAAAGTATGC TACCTATAGC TGGCCATAAA 840 

GCTTTaTTTT TAACTGGTTT GACAATATTT AAATTATCAA AATCTTCTCT GCTGATTTGG 900 

ACATATTTTT TTGGTATTAA CCAATTAATA AACGGAAAGA ACAAAACTAA CCAGGTGCTT 960 

ACTAAATCAA TCATCAGATA GTCX5TTTTTA TATTTAATAA TTCTATATCT GGGATTTTTG 1020 

TTTACAACTC TAACCTCGCA AAGCAATATC TCCACTTCCG TCTCGTTGGT TTTATATCTA 1080 

ATACACTTTC AGATACTTTA TAAGTGTTTT GTATTTTAGT AACATACTAT TTTCCTGTTT 1140 

ATTACTTAAC TTACGAACTA CAATCTAAGT TTAGTAATTT CTATTGCTTT TTAAGTTTGG 1200 

CATAAACCTT TTTATTACTA ATTGAGCCCA TGCTTATTAG AAAGAAAAAA ATTGTAATAA 1260 

TAATCCACAT AATAAATACC AGTAGATTTT GAGGTTTTAT AGTCATTAGC CATATTAAAA 1320 

ATAATATAGA ACAACCTCCT AATAATAGAT ATGTGAAAAC TATAAAACTT CCATCTTTAA 1380 

AAOTAGGCAC TAATATAACC CTATTTTCAT TATCTAGATT ATCATCATAT ATCTTTAGTT 1440 

TAAGCTTTTT ATTTAAGTAA ATGTAAAATG CTGCAATACC TATAAATCCT ATAAAACATA 1500 

AAGATATTAA AATCTTATTA TCTAATTGAA CTTCAAACGT ATGTACATAT TTCCGTAAAA 1560

TAACTACAAA TAAAAACGAA CTACCAGTAA CTGGCCAGAA AATATTATTT TTATTTTGTT 1620 

TATOG^CATT TAAATTTTCA AGTTCCTTCT CACTAAGTTT TGCATACCTT TTGGGAATGA 1680 

ACCAATTAAT AAAAGGAAAA AAGTATACAA GCCAAGTGCT TACTAAATCA ATTAACAAAT 1740 

ACTCATCATT ATATTGAACG ACTTTATATC TCGGATTTTT ATTAATAACC TTAATATTAA 1800 

AAAGCAAAAC TCACCACGCC CATTTCATTG GATTTATATG ATTGCTAATA ATATTTTTAG 1860 

CTTCACTAAC AGCATTCCCA ACACTATCCA TGGATTTTTC TGTAGTTTTT TTAACAACAT 1920 

CTATACTATT ATCGATTTTA TGCCCTACCC AGTCTACTTT ATCTTTTAAT CCAAAAATAT 1980 

TATTTTGATA AATTAAATCT GTTCCTAATG CAAATACTGT ACTCATAGCC AAACCTGCTA 2040

AAATCACCCA TCCTACTGGA TTACTTCCTA AAACAAAAGT CGCTAATCCA GCTCCAACTG 2100 

CTGTCCCTGC AGATCCAGCT GCAAGCGTgC ATACCATTAT GCGACAACGC CTCTCCAAAT 2160 



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3S 



CCTTTACCTA GGTATTTTCC GCCTTTTGCA AATTTACTAC CATTTTCTAT AAACACATTA 2280

CCTGATGTAC GTTTGACTTC CACAAATGAA TTTGGACCTG CTGGGCCTTT CACTCCACCT 2340

GCTGTATTGa TAAATACACC GAATTTACTT GcATTTATAC CGTCTTGCTC TAAAAGtGTT 2400

GACGTAATAT CTAATCCTAT ATCTCTTTTA ATACTGTCTT TATTGTCATT TATATATTTC 2460

AATATACTTT TCGGGATATC GTCTTCTGGA TGTTCTTTGG CATATGCCTT TATAACAGCA 2520

AAGTCTGCTT TATTTAAAGT TTCTTTCTCT GCTTTATGTT CAATTTTCCC CATAGCAACT 2580

TTCAAATATT TTTCATGACT TGCTTTGGCC CAATCAAGTT CTTTACCTGA AGGAATATTA 2640

AATTGATTTG TTGAAAAGTT CCAAAAATTC TGCGCTTGGG TAAGTCCTTG TTGGACAATT 2700

TTTTGAAATT CTTCAACTTC TTTAAATATT TCTGGTGATT TTTGATTAAA CrCACGCMiT 2760

TTGCGTAGCT TCTCTTCTAA TTCATGTTTT TGTTGACCTA ATGTTCGTAT TATTTGTTGG 2820

TTCGATGAAA TGGCTTGCTG ATTATCGGAA GCATGCTTTT TCAAATTGTT ATTCAAATTT 2880

TCATATCGCG TAATTTGTTG ACTTAATGAT CTGATATCTT CTTCAAGCTC TGATTCTTTT 2940

AAAGATATGC TATCAACCTC ACTCGTATAA CGTGACACAA AATTaTCGCA AGCTTGCTTC 3000

GTTAAATCAC TCAATGTTTT CATACTTGTT GATAATGGAA TTAACACCGT ACTAAAAAAT 3060

TGCTTAGCTG ACGTATACGC TTTCCCTTTA AGCGCATCAT CATTAATAAA TTGAGTAATT 3120

GCTTTTTCCA ACGCATCATA ATTTGAATTC ATTGTTTGAC TCAAATTCCC CACACTTGAA 3180

GCTTGGTTTC GAGATCTGTC TAAATACATG TCAATACTCA TCGGCATGCT CCTTTTTCAA 3240

AAATATATGA TTTTCAAACT ATTTAAAATC AAATGCTTTT TACATCTACA AAGTTGTAAA 3300

ATTTTAAAAC TCGGCGATGA TTATTTCTTA TGTAAAGGAG TCTAGATGCA GGTAAATTGA 3360

GATAACATGT CGCCXTri-iT CTTATTTTAG CATATGGATA TAATGGTGTC TTTGTATATT 3420

CGCAATTAAT CAATAAAAAT TATCTTTCAA TATTTTAATT TTATTGCGAC AACATCCTTA 3480

ACATTAAATA TATTAATATC TCAAAATATA TTCACTATTA AAATATGTCA TCAGTTGTTA 3540

AAAGTATTTC CTCATCATGC GAAATATCAA AACGTATCTA AAATACGAAT AAGTTTATAC 3600

AATCACACAA CATCATCATT CAAAATTTTA TTG 3633


(2) INFORMATION FOR SEQ ID NO: 95: 







(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2365 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear
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GATATACATT GAATGTGTTA TATGATCGTT ATCAGTTACC ACTTTTTATT GTGGAAAATG 1860 

GTTTTGGTGC AGTTGATGAA GTGGTAGATG GACATATTCa TGATGATTAT CGCATTGAAT 1920 

ATTTAAAAGC ACATATTACA GCAGCGATAG AAGCAGTTGA TCAAGATGGT GTAGATTTAA 1980 

TCX3GTTATAC ACCGTGGGGA ATCATTGATA TTGTTTCATT TACAACCGGT GAAATGAAGA 2040

AACGCTATGG TTTAATATAT GTTGATCGAG ATAATGATGG TCATGGCACG ATGGAACGCT 2100 

TGAAAAAAGA TTCGTTCTAT TGGTATCAAC AAGTGATAGC ATCAAATGGA GATAAATTAT 2160 

AAAGGTATAT TATAAGTATT TTAGGGTTAG AGCCCGAGAC ATAAATTAAT ATAGTAGGAC 2220 

CTACAGTGTT ATAATGGCGG gCCCCCAACA CAAAGAATTT CGAAAAGAAA TTCtAcAGGT 2280 

aATGCaAGtT GGCGGGGcCC AACACAGAGA AATTCGAAAA GAAATTCTAc AGGTAATGCA 2340 

AGTTGGGGAA GGACAGAAAT AAATT 2365 
(2) INFORMATION FOR SEQ ID NO: 96:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11050 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 

CTGCGATACG ATTTGTTGAA AGTGGGGAAA ACAAAAAAGT TATCATTACC AATTTAGAGC 60 

AGGCATACGA AGCTTTGATT GGTAATAAAG GTACACACAT TCACATGTAG CACTTTATCA 120 

CGCGACAAAA CATTAAATAT GTTTCTCCGT TGATTCAAAT GAAAAAGTTG TCTGCTGACA 180

CTTTGCAAGG TTTGAAGGAG TTTAACTTAT GACAGAAAAC TTTATTTTGG GTAGAAATAA 240 

TAAATTAGAA CATGAACTAA AGGCATTAGC AGATTACATT AATATACCAT ATAGTATATT 300 

ACAACCATAT CAAAGTGAAT GTTTTGTCAG ACATTATACG AAAGGCCAAG TTATTTATTT 360

TTCGCCACAA GAAAGTAGCA ATATTTACTT TTTAATTGAA GGTAACATTA TTAGAGAACA 420 

TTACAATCAA AATOGAGATG TATATCGTTA TTTTAATAAA GAGCAAGTAT TATTTCCAAT 480 

CAGTAACTTA TTTCATCCGA AAGAGGTTAA CX3AATTGTGT ACAGCATTAA CCGATTGTAC 540 

AGTTCTTGGA TTGCCTAGAG AATTGATGGC CTTTTTGTGC AAAGCTAATG ATGATATATT 600 

TTTGACACTT TTTGCATTAA TAAATGATAA TGAGCAGCAA CACATGAACT ATAACATGGC 660 

ATTAACAAGT AAATTTGCTA AAGATCGAAT TATCAAATTG ATATGCCATC TATGTCAGAC 720 

AGTAGGATAC GATCAAGATG AATTTTATGA AATCAAACAG TTTTTAACTA TTCAACtCAT 780 




TGAAAAACTT GTTGTTAAAG ATCATAAAAA TTGGTTAGTA AGCAAACATT TATTCAATGA 900

TGTATGTGTT TAATATACAA TGTAAAATGA ATAAGTTGAA CATGAGGTCT AACGTACATT 960

TATACGTTAG GCCTTTTTTG CTAGCATGAT GAATAATTTA AAATGTTAGT TAAATTTGAT 1020

TGTTGAAATT ACAGTAAAAT TTAAGGTGAT GAAAAATTTA GAACTTCTAA GTTTTTGAAA 1080

AGTAAAAAAT TTGTAATAGT GTAAAAATAG TATATTGATT TTTGCTAGTT AACAGAaAAT 1140

TTTAAGTTAT ATAAATAGGA AGAAAACAAA TTTTACGTAA TTTTTTTCGA AAAGCAATTG 1200

ATATAATTCT TATTTCATTA TACAATTTAG ACTAATCTAG AAATTGAAAT GGAGTAATAT 1260

TTTTGAAAAA AAGAATTGAT TATTTGTCGA ATAAGCAGAA TAAGTATTCG ATTAGACGTT 1320

TTACAGTAGG TACCACATCA GTAATAGTAG GGGCAACTAT ACTATTTGGG ATAGGCAATC 1380

ATCAAGCACA AGCTTCAGAA CAATCGAACG ATACAACGCA ATCTTCGAAA AATAATGCAA 1440

GTGCAGATTC CGAAAAAAAC AATATGATAG AAACACCTCA ATTAAATACA ACGGCTAATG 1500

ATACATCTGA TATTAGTGCA AACACAAACA GTGCGAATGT AGATAGCACA ACAAAACCAA 1560

TGTCTACACA AACGAGCAAT ACCACTACAA CAGAGCCAGC TTCAACAAAT GAAACACCTC 1620

AACCGACGGC AATTAAAAAT CAAGCAACTG CTGCAAAAAT GCAAGATCAA ACTGTTCCTC 1680

AAGAAGCAAA TTCTCAAGTA GATAATAAAA CAACGAATGA TGCTAATAGC ATA6CAACAA 1740

ACAGTGAGCT TAAAAATTCT CAAACATTAG ATTTACCACA ATCATCACCA CAAACGATTT 1800

CCAATGCGCA AGGAACTAGT AAACCAAGTG TTAGAACGAG AGCTGTACGT AGTTTAGCTG 1860

TTGCTGAACC GGTAGTAAAT GCTGCTGATG CTAAAGGTAC AAATGTAAAT GATAAAGTTA 1920

CGGCAAGTAA TTTCAAGTTA GAAAAGACTA CATTTGACCC TAATCAAAGT GGTAACACAT 1980

TTATGGCGGC AAATTTTACA GTGACAGATA AAGTGAAATC AGGGGATTAT TTTACAGCGA 2040

aGTTACCAGA TAGTTTAACT GGTAATGGAG ACGTGGATTA TTCTAATTCA AATAATACGA 2100

TGCCAATTGC AGACATTAAA AGTACGAATG GCGATGTTGT AGCTAAAGCA ACATATGATA 2160

TCTTGACTAA GACGTATACA TTTGTCTTTA CAGATTATGT AAATAATAAA GAAAATATTA 2220

ACGGACAATT TTCATTACCT TTATTTACAG ACCGAGCAAA GGCACCTAAA TCAGGAACAT 2280

ATGATGCGAA TATTAATATT GCGGATGAAA TGTTTAATAA TAAAATTACT TATAACTATA 2340

GTTCGCCAAT TGCAGGAATT GATAAACCAA ATGGCGCGAA CATTTCTTCT CAAATTATTG 2400

GTGTAGATAC AGCTTCAGGT CAAAACACAT ACAAGCAAAC AGTATTTGTT AACCCTAAGC 2460

AACGAGTTTT AGGTAATACG TGGGTGTATA TTAAAGGCTA CCAAGATAAA ATCGAAGAAA 2520

GTAGCGGTAA AGTAAGTGCT ACAGATACAA AACTGAGAAT TTTTGAAGTQ AATGATACAT 2580
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ACCAATTTAA AAATAGAATC TATTATGAGC 


ATCCAAATGT AGCTAGTATT AAATTTGGTG 2700

ATATTACTAA AACATATGTA GTATTAGTAG AAGGGCATTA CGACAATACA GGTAAGAACT 2760

TAAAAACTCA GGTTATTCAA GAAAATGTTG ATCCTGTAAC AAATAGAGAC TACAGTATTT 2820

TCGGTTGGAA TAATGAGAAT GTTGTACGTT ATGGTGGTGG AAGTGCTGAT GGTGATTCAG 2880

CAGTAAATCC GAAAGACCCA ACTCCAGGGC CGCCGGTTGA CCCAGAACCA AGTCCAGACC 2940

CAGAACCAGA ACCAACGCCA GATCCAGAAC CAAGTCCAGA CCCAGAACCG GAACCAAGCC 3000

CAGACCCGGA TCCGGATTCG GATTCAGACA GTGACTCAGG CTCAGACAGC GACTCAGGTT 3060

CAGATAGCGA CTCAGAATCA GATAGCGATT CGGATTCAGA CAGTGATTCA GATTCAGACA 3120

GCGACTCAGA ATCAGATAGC GACTCAGAAT CAGATAGTGA GTCAGATTCA GACAGTGACT 3180

CGGACTCAGA CAGTGATTCA GACTCAGATA GCQATTCAGA CTCAGATAGC GATTCAGACT 3240

CAGACAGCGA TTCAGATTCA GACAGCGACT CAGATTCAGA CAGCGACTCA GACTCAGATA 3300

GCGACTCAGA CTCAGACAGC GACTCAGATT CAGATAGCGA TTCAGACTCA GACAGCGACT 3360

CAGACTCAGA CAGCGACTCA GACTCAGATA GCGACTCAGA TTCAGATAGC GATTCAGACT 3420

CAGACAGCGA CTCAGATTCA GATAGCGATT CGGACTCAGA CAGCGATTCA GATTCAGACA 3480

GCGACTCAGA CTCGGATAGC GATTCAGATT CAGATAGCGA TTCGGATTCA GACAGTGATT 3540

CAGATTCAGA CAGCGACTCA GACTCGGATA GCGACTCAGA CTCAGACAGC GATTCAGACT 3600

CAGATAGCGA CTCAGACTCG GATAGCGACT CGGATTCAGA TAGCGACTCA GACTCAGATA 3660

GTGACTCCGA TTCAAGAGTT ACACCACCAA ATAATGAACA GAAAGCACCA TCAAATCCTA 3720

AAGGTGAAGT AAACCATTCT AATAAGGTAT CAAAACAACA CAAAACTGAT GCTTTACCAG 3780

AAACAGGAGA TAAGAGCGAA AACACAAATG CAACTTTATT TGGTGCAATG ATGGCATTAT 3840

TAGGATCATT ACTATTGTTT AGAAAACGCA AGCAAGATCA TAAAGAAAAA GCGTAAATAC 3900

TTTTTTAGGC CGAATACATT TGTATTCGGT TTTTTTGTTG AAAATGATTT TAAAGTGAAT 3960

TGATTAAGCG TAAAATGTTG ATAAAGTAGA ATTAGAAAGG GGTCATGACG TATGGCTTAT 4020

ATTTCATTAA ACTATCATTC ACCAACAATT GGTATGCATC AAAATTTGAC AGTCATTTTA 4080

CCGGAAGATC AAAGCTTCTT TAATAGCGAT ACAACTGTTA AACCATTAAA AACTTTAATG 4140

TTGTTACATG GATTATCAAG TGATGAAACG ACATATATGA GATATACAAG CATAGAAAGG 4200

TATGCGAATG AACACAAATT AGCTGTGATT ATGCCCAATG TGGATCATAG CGCATATGCT 4260

AACATGGCAT ATGGTCATAG CTATTATGAT TATATTTTGG AAGTGTATGA TTATGTTCAT 4320

CAAATATTTC CACTTTCCAA AAAGCGTGAT GACAATTTTA TAGCAGGTCA CTCTATGGGA 4380
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TTATCTGCTG TGTTTGAAGC GCAAAATTTA 
ATGGATCTAG AGTGGAATGA TTTTTCAAAA 4500

GAGGCCATAA TTGGCAATCT TTCAAGTGTT AAAGGAACTG AACATGATCC GTATTACTTG 4560

CTAGACAAAG CTGTAGCTGA AGATAAACAA ATTCCAAAAT TGCTCATTAT GTGTGGTAAA 4620

CAAGACTTTT TATATCAAGA CAACTTAGAT TTTATCGATT ATTTATCACG CATAAATGTT 4680

CCTTATCAAT TTGAAGATGG ACCAGGAGAT CATGATTATG CATATTGGGA TCAAGCGATT 4740

AAGCGTGCTA TAACATGGAT GGTGAATGAT TAATTATTTC TTGGAAAATA TGTGGCTGCA 4800

TTAAATACAC AGAGTGAGAG ATACAAACTA TTTACGCACG ACTAACATTT CTAAGTGTTT 4860

AAATTATTTT TGTATTAATA TGATTGGCGC AATTTGCTGA TACACAAAAA TGTTTCTCGT 4920

GAAACTTAGA TTTAGCTTAT AGTTTTATCA TCATTTGTAT GACTTACATT ATAAATTTTA 4980

TTATAATGAG GTTAACGCTT TGAAAGGAGT CATCATCATG TCGACCAATA AAAACGATTA 5040

TGAGCATATG TTGTTTTATT TTGCATATAA AACCTTTATT ACTACCGCTG ATGAAATTAT 5100

AGAGAAGTAT GGTATGAGTC GTCAGCATCA TCGTTTTTTG TTTTTTATCA ATAAATTACC 5160

TGGTATTACT ATTAAATCAT TACTAGAAAT ATTAGAAATT TCTAAmCAAG GATCACATGC 5220

AACACTTCAA AAATTAAAAG AGCAAGGTCT CATTATTGAA AAAGTTTTAG AGACTQATCG 5280

ACGTGTCAAA AAATTATATT CGACGGATAA AGGCGATCAA CTCATTGCTG AATTGAACAA 5340

GGCGCAAGAT GAATTATTGC AAAATATATA TCAACAAGTC GGTTCGGATT GGTATGATGT 5400

GATGGAAGCA TTGGCTAAAG GgCGACCTGG CTTTGATTTT ATTAAGCATT TGAAAGATGA 5460

AAAAGAAAGC TAGCATCAGA AATGTTAAAA ATCTTCGCAT TCTTAAATTT AAAAAATATG 5520

TCAAAAAGTG TATAATAAAA ACATATAATT TAATTGAACT CAGTTTCAAC ACATCTTAGA 5580

AAGGAGTTTG AATGATGAAA AAATTAGCAG TTATTTTAAC ATTAGTTGGC GGTTTATACT 5640

TCGCATTTAA AAAATACCAA GAACGTGTTA ACCAAGCACC TAACATTGAG TACTAAATTA 5700

AACCATAAAA AATTCCCXSAA CACCTTGTTA TAGTGCTCGG GAATTTTTTT ATGCTTTACT 5760

TGAATATATC AAATATTATT TTTGCGCTTT CTGTATTTTC GATATTACCA CTAAATGATT 5820

CTGATCTAGG TCCGTAAGCG TAgGTATTAA CATCCTCGCC TGTATGTCCA TCGGAAGTCC 5880

ACCCTGTATA AGATTTATCA TTTACT6GCT TCTGAATAGC GTGTTGTAGG GCTTTTGTTT 5940

GCGTTTCTAC TTCTGCGGAT TTTTCGTCTT TTTCTTTTTT AAGTAGTCTT TTTAGCTTTT 6000

TATTCTCTTT TTTAACCTTT TTCATATCAT CTTGTGAAAA TTCAAATCCA TAACCTTCAT 6060

TAATAACTTT TTCAGGGTCT TCACCTTTAG CCATTTTTTC TGTCATATAT GATCCAGAGT 6120

GTTTCATAGA TTTAATCGGT TGAGGATTCC ATTCGTATCC TTTATCTTTA CCAATTGTTA 6180



ACAAAACATA CAGCTATCTT TGACTGAATT 
AGaTAAATAT GTAAGTAGrA CTGCTTTAAT 
AGATGATAAC ACATTAGCTG TTAACATGAC

CGTTAATGAT TTTATCATTA CAAAGAAAAA 
TGACCTTTCT TAAAAGTATT ACTCAGGAAA 

TTGGCTTAAT GTTTTACCTG TATCATTTGC 
TTATTTTATT ATTATTACTC ATATTCATAG 
TAAGCCAACA ACAACAAATT GAAAACTTAG 

AAATTGAATA TAAAAATGAT GTAGAGAGCT 
CACCCATTAC TGCAGCACAA CTGTTACTTG 

TTCGTCAAGA GGTTATTCAA ATTGaTAACT
TATTAAATGA AACTTCTGaT ATTTCTGTCA 
CAATTATTAT GAAATATTCA ATACAGTTTA 

CTTGTCATCA CGAAGTATTA ACTGACGTTA
TAAATAATGC ACTTAAGTAT GCGAGAGGTA 

CCAATCAATT ACACGTAAAA GATAATGGTA 

TATTTGATAA GGGCTATTCA GGTTATAATG 

GTTTATTTAT CGTAAAACAA ATTTCAACAC 

AACAAAATGA GGGTACAACA TTTACGATTC 

TGTAAGTATA CTAGTAACAT TTTTTTACTA 
AATi^TAGAAC TAACAAAGAA ATGAGGTGCA 
AAAGGTTTAT GGTAAAGGTT TGAATGCTAC
TGGAGCTGGT GaATTTGTTG CaATTATGGG 
AAATTTAATT GCtTCTTTTG ATGGACTAAC 
TTTAAATAAT ATGAAAAATA AAAGTAAAGC 
TTTTcAAGAT TTTAATCTTT TACCAACAAT 
AATTTTAGCT GGTGCTAAAC GAAAAGATAT 

ATTACATTTA GAGGGATTCT TAAACAAGTA 
ACX5CATTGCC ATTGCACGTG CATTAGTTAC 

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ACAAATATTA AAGTTATTAT TTCAAAATGA 8100 

TGaAAAATGT TGGGaATCAG AAAACtTCAT 8160 

GCGCCTGCTG AAAAAATTAA ATACTATTGG 8220 

TGTCGGATAT AAAGTATAGG GTGAATGCAA 8280 

TAGCAATAGT CATAGTTATT TTTGCTTTGT 8340

CATTAGAAGC ATATTTACTA GCACTTGGCG 8400 

GTATTAAATA TTTAAGTTTT GTAAAAACTA 8460 

AAAATGCGTT GTATCAGCTT AAAAATGAAC 8520 

ACTTTTTAAC ATGGGTACAT CAAAT6AAAA 8580 

AAAGAGATGA GCCTAATGTT GTTAATCGTG 8640 

ATACAAGTTT AGCACTTAGT TATTTAAAGT 8700 

CTAAAATTTC GATTAATAAT ATCATTCGCC 8760 

TTGATCAAAA AACAAAAATC CATTATGAAC 8820 

GATGGACCTC TTTAATGATA GAACAATTAA 8880 

AAGATATATG GATTGAATTT GATGAGCAAT 8940 

TCGGTATTAG TGAAGCGrAC TTGCCTAAAA 9000 

GCCAGCGCCA AAGTAACTCA AGTGGGaTTG 9060 

ACACAAACCA TCCTGTTTCA GTCGTATCTA 9120 

AATTTCCAGA TGAATAAAAA CTTTCAATAT 9180 

ATTTAAATGT TATTAGTATT TTTTTGTTTT 9240

TGCCATGTTG CTAGAAGTGn AACATGTAAA 9300 

GACAGCACTT AATCAAATGA ATTTATCAGT 9360 

TGAGTCTGGG tCAGGGAAGT CTACACTACT 9420 

TGAAGGTGAC ATTATTGTGG ATGGCGCACA 9480 

ATTGTATC6T CaACAAATGG TAGGTTTTGT 9540 

GACGAATAAA GAAAATATAA TGATGCCATT 9600 

AGAACAAAGG GTACATCAGT TGGCAGTACA 9660 

TCCTTCTGAA ATCTCTGGGG GTCAGAAGCA 9720 

TAAGCCGACG ATTTTACTAG CCGATGAACC 9780 
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5 



TCAATTGGAA CAGACAATTT TAATGGTAAC TCATTCAAAT ATCGATGCGT CTTATGCAGA 9900

GCGAGTCATT TTTATTAAAG ATGGGCGTCT ATATCATGAA ATATATCX5TG GTGAAGAAAG 9960

TCAATTAGCT TTTCAACAAC GAATAACAGA TAGCTTAGCA CTTGTGAATG GAGGAAGTGT 10020

CAATATATGA AGTTAAGATT GTTATGnACA TAGTGCGACG TCAATTTATT ACGCAGCGAC 10080

TTGTAATCAT TCCATTCATT TTAGCGGTAA GTGTACTATT CATGATTGAA TATACGCTTG 10140

TGTCAATTGG GTTAAATAGC TACATAAAAC AGAAGAATGA CTTCCTAGTA CCATTTATTA 10200

TCATAGCTAA 'rrrrriTATG GCGCTTTTAA CTTTTATTTT TATTTTCTAT GCAAATCACT 10260

TTATGATGTC ACAAAGACGA AAAGAGTTTA GCATTTTTAT GACATTGGGC ATGACCAAGA 10320

AAAGTATGC6 TTTAATTGTA GTGATGGAAA CTATCTTACA ATTTGTGATA ATTTCAGTCG 10380

TTA6TATTGC CGGCGQATAC TTACTTGGTG CGATATTTTT CTTGTTTATA CAGAAAATAA 10440

TGGGCA6TCA AGTTGCGACG TTAAOGTATT ATCCATTTGA CTCTGTAGCG ATGTTTATTA 10500

CTTTGATTAT CATTGCTGTA TTAATGGGCA TGCTACTTAT ATTCAACTTG TTTAGTATTA 10560

ATTTTCAA06 GCCGATAACT TATCAACATC GTTCOGATTC TAGTGTCATA TCACGATGGT 10620

TGCGTTACXjT TTTAATTGTT ATAGGAAGCG CAnACTATAT TTAGGTTACT TTATTGCATT 10680

ACAACAAGAT ACGACGTTTG GTGCCTTTTT TAAAATATGG ATTGTCATAG GATTA6TTAT 10740

TATCGGTACT TATGCATTTT TTGTAGGTAT AAGTGAAATA ATTATTAGTA TATTGCA6CA 10800

GGTATCAAAA GTTTACTATC ATCCACGGTA TTTTTTTGTG GTAGTTGGGA TGCGTGTACG 10860

TCTTAAAATG AATGCAGTCA GTCTTGCAAC AATCACTTTG CTGTGTACAT TTTTGATTGT 10920

AACGCTCACA ATGACATTAA CAACCTATCG TGATATGAAT CATACCATTA CGAAATTGAT 10980

TACGAATGAT TakGATTTGT CATTTAGCGA CAATTCTAAG TCACAAaTAG AACGTCAACA 11040

AACAATTGAG 11050 
(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 983 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97:

CGACATAACG AGGCAAGGGT ACATGATACT TTAGCCTCGT TTTTGATATG TATTTTTCTG 60 
AATATAAGGG CAATAGATGG TATTTTATAw TTTTTTTAAG GTAGTGATTA ACATAGATAT 120 

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TCAAGCGGAA 
CAGCATTATG CACCAGTATT AACGCATTTT TTAGATCCAA GAGGGCAATA 240

TATATTGGAA GTGATTTGTG GCAGTTATGA AGATTTAAAC GTATCTTTTT ATGGTGGACC 300

TAATGCTGAA AGAAAAAGAG CAATCATTTC GCCGAACTAT TATGAACCTA AAGAAAGCGA 360

CTTTGAATTA ACTTTAATGG AAATAGATTA TCCTGAAAAA TTCGTCACTT TAAAACATCA 420

ACATATTTTA GGGACATTAA TGTCTTTAGG TATCGAACGC GAACAAGTTG GAGATATAAT 480

TGTGaATGAA CGAATTCAAT TTGTTTTGAC AAGTAGATTG GAATCATTTA TTATGTTAGA 540

ATTACAACGT ATTAAAGGCG CATCAGTTAA ACTTTATACT ATTCCAGTAA CAGATATGAT 600

ACAATCTAAT GAGAATTGGA AAAATGAAAG TGCaCAGTTA GTTCTTTAAG GTTAGATGTT 660

GTTATTAAAG AAATGATACG TAAATCAC6T ACGATTGCGA AACAACTAAT CGAAAAAAAA 720

CGTGTTAAAG TGAATCACAC TATTGTTQAT TCAGCAGATT TTCAATTACA AGCAAATGAT 780

TTAATATCCA TCCAAGGTTT TGGTAGAGCA CACATTACTG ACTTAGGTGG TAAAACTAAA 840

AAAGATAAAA CGCACATTAC CTATAGAACA TTATTCAAAT AGTAATQATT TAAGGAGGAT 900

AACAAATGCC TTTTACACCA AATGAaATTA AGAATAAAGA GTTTTCACGT GTaAAGAATG 960

GTTTTAGAAC CTACTGnAGT TGG 983



(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10322 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 
TTTTGCAAAG CTTATTTTAT GTCAAACAGA TAGTCAATOT GAAACAAAGG TTAGTACATA 60 
TAATCATCCA GACTTTATGT ATATATCAAC AACTGAGAAT GCAATTAAGA AAGAACAAGT 120
TGAACAACTT GTGCGTCATA TGAATCAACT TCCTATAGAA AGCACAAATA AAGTGTACAT 180 
CATTGAAGAC TTTGAAAAGT TAACTGTTCA AGGGGAAAAC AGTATCTTGA AATTTCTTGA 240 
AGAACCACCG GACAATACGA TTGCTATTTT ATTGTCTACA AAACCTGAGC AAATTTTAGA 300 
CACAATCCAT TCAAGGTGTC AGCATGTATA TTTCAAGCCT ATTGATAAAG AAAAGTTTAT 360 
AAATAGATTA GTTGAACAAA ACATGTCTAA GCCAGTAGCT GAAATGATTA GTACTTATAC 420 

TACGCAAATA GATAATGCAA TGGCTTTAAA TOAAGAATTT GATTTATTAG CATTAAGGAA 480 
ATCAGTTATA CGTTGGTGTG AATTGTTGCT TACTAATAAG CCAATGGCAC TTATAGGTAT 540 

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10 



GAATGGTTTC TTCGAAGATA TCATACATAC AAAGGTAAAT GTAGAGGATA AACAAATATA 660 

TAGTGATTTA AAAAATGATA TTGATCAATA TGCGCAAAAG TTGTCGTTTA ATCAATTAAT 720 

TTTGATGTTT GATCAACTGA CGGAAGCACA TAAGAAATTG AmTCAAAATG TAAATCCAAC 780 

GCTTGTATTT GAACAAATCG TAATTAAGGG TGTGAGTTAG ATGCCAAATG TAATAGGTGT 840 

TCAGTTTCAA AAAGCGGGAA AATTAGAATA TTATACACCT AATGATATAC AAGTAGATAT 900 

AGAAGACTGG GTAGTTGTCG AATCTAAAAG AGGCATAGAG ATAGGTATTG TTAAAAATCC 960 

ATTAATGGAT ATTGCTGAAG AGGATGTTGT GTTACCTCTT AAAAATATTA TTCGCATTGC 1020 

TGATGACAAA GATATTGATA AATTTAATTG TAATGAACGA GATOCTGAAA ATGCATTAAT 1080 

ACTATGTAAA GACATTGTAA GAGAACAAGG TTTGGACATG CGTTTAGTCA ATTGCGAATA 1140 

TACATTAGAT AAATCGAAAG TTATTTTTAA TTTTACGGCG GATGATCGTA TTGATTTTAG 1200 

AAAATTAGTA AAAATATTAG CGCAACATTT AAAAACACGT ATGGAGTTGA GACAAATTGG 1260 

TGTAAGGGAT GAAGCCAAAT TGCTTGGCXX5 TATCGGACCT TGTGGTAGGT CGTTATGTTG 1320 

TTCTACATTT TTAGGGGATT TTGAACCAGT ATCGATTAAG ATGGCTAAGG ATCAAAATTT 1380 

ATCATTAAAT CCAACTAAAA TTTCTGGTGC ATGTGGTCGT TTGATGTGTT GTTTAAAATA 1440 

TGAAAATGAC TATTATGAGG AAGTACGTGC ACAATTACCT GATATTGGTG AAGCAATTGA 1500 

AACGCCTGAT GGTAACGGGA AAGTAGTTGC TTTAAATATA TTAGACATTT CTATGCAGGT 1560 

GAAGCTTGAG GGACATGAAC AGCCACTTGA ATATAAATTA GAAGAAATAG AAACTATGCA 1620 

TTAAGGAGGC ATTATTACAT TTGGATCGCA ATGAAATATT TGAAAAAATA ATGCGTTTAG 1680 

AAATGAATGT CAATCAACTT TCAAAGGAAA CTTCAGAAIT AAAGGCACTT GCAGTTGAAT 1740

TAGTAGAAGA AAATGTAGCG CTTCAACTTG AAAATGATAA TTTGAAAAAG GT6TTGGGCA 1800 

ATGATCAACC AACTACTATT GATACTGCGA ATTCAAAACC AGCAAAAGCT GTGAAAAAGC 1860 

CATTACCAAG TAAAGATAAT TTGGCTATAT TGTATGGAGA AGGATTTCAT ATTTGTAAAG 1920

GCGAATTATT TGGAAAACAT CGACATGGTG AAGATTGTCT GTTCTGTTTA GAAGTTTTAA 1980 

GTGATTAATC AAGCACACTC AAATAGTGTT ATAATTATAA ATGAATATGG TTTGGATAAG 2040 

TCTGAGACAA TGCATGTTTC AGGCTTTAAT TGTGTATAAA GTTTTGGTGA TTGCATAAGA 2100 

GATGGCGGTA CTAAATGTTA TTATTAAGTG TGCACGCAgT ATCaTTAGTT ATAAAATGTA 2160 

GCTGTTAAAA GTCAAAAATA CATCGAATGT AGTTAGGCAT ATAATATAAA AAGAGTTTTC 2220 

AATTACTCAA TAGAAAAAGG TTGTCTTCAT AGGAGTTAAA AATGTTAAAA GAGAATGAAC 2280 

GATTTGATCA ACTAATCAAA GAAGATTTTA GTATTATTCA AAATGATGAT GTTTTTTCAT 2340 
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TGGACTTATG TTCAGGCAAT GGGGTGATAC CCTTGTTATT GTTTGCGAAA CATCCACGAC 24 60 

ATATAGAAGG TGTTGAGATT CAAAAAACAC TTGTCGATAT GGCGCGACGC ACATTTCAAT 2520 

TCAATGATGT TGATGAATAT TTAACAATGC ATCACATGGA TTTGAAAAAC GTTACTAAAG 2580 

TATTTAAACC TTCACAATAT ACTTTAGTAA CGTGTAATCC GCCTTATTTT AAAGAGAATC 2640 

AGCAACACCA ACATCAAAAA GAAGCACATA AGATAGCGAG ACATGAGATT ATGTGTACAC 2700 

TTGAAGATTG CATGATTGCA GCCCGTCATT TATTAAAAGA AGGTGGCAGG CTAAACATGG 2760 

TACATCGTGC AGAGAGACTA ATGGATGTCT TGTTTGAAAT GAGAAAAGTG AATATTGAAC 2820 

CTAAGAAAGT CX S TTTTT A TA TATAGTAAAG TAGGGAAATC AGCACAAACG ATAGTAGTAG 2880 

AAGGTCGAAA AGGTGGAAAT CAAGGTTTAG AAATCATGCC CCCATTTTAT ATTTATAATG 2940 

AAGATGGTAA TTATAGCGAA GAAATGAAGG AAGTATATTA TGGATAGTCA TTTTGTATAT 3000 

ATTGTAAAAT GTAGTGAT6G AAGTTTATAT ACAGGATACG CTAAAGACGT TAATGCACGT 3060

GTTGAAAAAC ATAACCGAGG TCAAGGAQCC AAATATACGA AAGTAAGACG TCCGGTGCAT 3120 

TTAGTTTATC AAGAAATGTA TGAGACAAAG TCTGAAGCAT TGAAGCGTGA ATATGAAATT 3180 

AAAACTTATA CCAGACAAAA GAAATTGCGA TTAATTAAGG AGCGATAGTA TGGCTGTATT 3240 

ATATTTAGTG GGCACACCAA TTGGTAATTT AGCAGATATT ACTTATAGAG CAGTTGATGT 3300

ATTGAAACGT GTTGATATGA TTGCTTGTGA AGACACTAGA GTAACTAGTA AACTGTGTAA 3360 

TCATTATGAT ATTCCAACTC CATTAAAGTC ATATCACGAA CATAACAAGG ATAAGCAGAC 3420 

TGCTTTTATC ATTGAACAGT TAGAATTAGG TCTTGACGTT GCGCTCGTAT CTGATGCTGG 3480 

ATTGCCCTTA ATTAGTGATC CTGGATACGA ATTAGTAGTG GCAGCCaGAG AAGCTAATAT 3540

TAAAGTAGAG ACTGTGCCTG GACCTAATGC TGGGCTGACG GCTTTGATGG CTAGTGGATT 3600 

ACCn'CATAT GTATATACAT TTTTAGGATT TTTGCCACGA AAAGAGAAAG AAAAAAGTGC 3660 

TGTATTAGA6 CAACGTATGC ATGAAAATAG CACATTAATT ATATACGAAT CACCGCATCG 3720

TGTGACAGAT ACATTAAAAA CAATTGCAAA GATAGATGCA ACACGACAAG TATCACTAGG 3780 

GCGTGAATTA ACTAAGAAGT TCGAACAAAT T6TAACTGAT GATGTAACAC AATTACAAGC 3840 

ATTGATTCAG CAAGGCX3ATG TACCATTGAA AGGCGAATTC GTTATCTTAA TTGAAGGTGC 3900 

TAAAGCGAAC AATGAGATAT CGTGGTTTGA TGATTTATCT ATCAATGAGC ATGTTGATCA 3960 

TTATATTCAA ACTTCACAGA TGAAACCAAA ACAAGCTATT AAAAAAGTTG CTGAAGAACG 4020 

ACAACTTAAA ACGAATGAAG TATATAATAT TTATCATCAA ATAAGTTAAT CACTTTATCG 4080 

ATTaTATGAA ATTTTAAACG ATTTTATAAA CGCAAGCTGT AATTTTAAAT GGTAAGTTAT 4140 
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GTTmTAAT GTAAAATAAA TACATTGAAA GTAATAAATA CCTTAACATT GAATAAGATG 4260 

AAAATGAGAT GACGAGATAA ATGTTCGCGT CCGTTGAAAT GCATAGAAAT CTTAGATATT 4320

ATTTGAAGTG AGACATTACG AGGAGGAACA GTTATGGCTA AAGAAACATT TTATATAACA 4380 

ACCCCAATAT ACTATCCTAG TGGGAATTTA CATATAGGAC ATGCATATTC TACAGTGGCT 4440 

GGAGATGTTA TTGCAAGATA TAAGAGAATG CAAGGATATG ATGTTCGCTA TTTGACTGGA 4500 

ACX3GATGAAC ACGGTCAAAA AATTCAAGAA AAAGCTCAAA AAGCTGGTAA GACAGAAATT 4560 

GAATATTTGG ATGAGATGAT TGCTGGAATT AAACAATTGT GGGCTAAGCT TGAAATTTCA 4620 

AATGATGATT TTATCAGAAC AACTGAAGAA CGTCATAAAC ATGTCGTTGA GCAAGTGTTT 4680

GAAOGTTTAT TAAAGCAAGG TGATATCTAT TTAGGTGAAT ATGAAGGTTG GTATTCTGTT 4740 

CCGGATGAAA CATACTATAC AGAGTCACAA TTAGTAGACC CACAATACGA AAACGGTAAA 4800 

ATTATTGGTG GCAAAAGTCC AGATTCTGGA CACGAAGTTG AACTAGTTAA AGAAGAAAGT 4860

TATTTCTTTA ATATTAGTAA ATATACAGAC CGTTTATTAG AGTTCTATGA CCAAAATCCA 4920 

GATTTTATAC AACCACCATC AAGAAAAAAT GAAATGATTA ACAACTTCAT TAAACCAGGA 4980 

CTTGCTGATT TAGCTGTTTC TCGTACATCA TTTAACTGGG GTGTCCATGT TCCGTCTAAT 5040 

CCAAAACATG TTGTTTATGT TTGGATTGAT GCGTTAGTTA ACTATATTTC AGCATTAGGC 5100 

TATTTATCAG ATGATGAGTC ACTATTTAAC AAATACTGGC CAGCAGATAT TCATTTAATG 5160 

GCTAAGGAAA TTGTGCGATT CCACTCAATT ATTTGGCCTA TTTTATTGAT GGCATTAGAC 5220 

TTACCGTTAC CTAAAAAAGT CTTTGCACAT GGTTGGATTT TGATGAAAGA TGGAAAAATG 5280 

AGTAAATCTA AAGGTAATGT CGTAGACCCT AATATTTTAA TTGATOGCTA TGGTTTAGAT 5340

GCTACACGTT ATTATCTAAT GCGTGAATTA CCATTTGGTT CAGATGGCGT ATTTACACCT 5400 

GAAGCATTTG TTGAGCGTAC AAATTTCGAT CTAGCAAATG ACTTAGGTAA CTTAGTAAAC 5460 

CGTACGATTT CTATGGTTAA TAAGTACTTT GATGGCGAAT TACCAGCGTA TCAAGGTCCA 5520 

CTTCATGAAT TAGATGAAGA AATGQAAGCT ATGGCTTTAG AAACAGTGAA AAGCTACACT 5580 

GAAAGCATGG AAAGTTTGCA ATTTTCTGTG GCATTATCTA CGGTATGGAA GTTTATTAGT 5640 

AGAACGAATA AGTATATTGA CGAAACAACG CCTTGGGTAT TAGCTAAGGA CGATAGCCAA 5700 

AAAGATATGT TAGGCAATGT AATGGCTCAC TTAGTTGAAA ATATTCGTTA TGCAGCTGTA 5760 

TTATTACX5TC CATTCTTAAC ACATGCGCCG AAAGAGATTT TTGAACAATT GAACATTAAC 5820 

AATCCTCAAT TTATGGAATT TAGTAGTTTA GAGCAATATG GTGTGCTTAA TGAGTCAATT 5880 

ATGGTTACTG GGCAACCTAA ACCTATTTTC CCAAGATTGG ATAGCGAcGG AnAATTGCAT 5940 
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AACCTCAAAT TGATATTAAA GACTTTGATA 
ATGCTGAACA TGTTAAGAAG TCAGATAAGC 
AACAAAGACA AATTGTATCA GGAATTGCCA

AAAAAGTAGC AGTTGTTACT AACCTGAAAC 
GTATGATATT ATCTGCTGAA AAAGATGGTG 

TTCCAAATGG TGCAGTGATT AAATAACTGT 
TAATCGATAC ACATGTCCAT TTAAATGATG 

TTACACGTGc TAGAGAAGCA GGTGTTtSATC

CAATTGAACG CGCGATGAAA TTAATCGATC 
GGCATCCAGT TGACGCAATT GATTTTACAG 

CTCAGCATCC AAAAGT6ATT GGTATTGGTG

CTCCTGCAGA TGTTCAAAAG GAAGTTTTTA 
AGTTACCAAT TATCATTCAT AACCGTGAAG 

AGOAGCATGC TGAAGAGGTA GGCGGGATTA 
CAGATATTGT AACTAATAAG CTGAATTTTT 
AAAATGCTAA ACAGCCTAAA GAAGTTGCTA 

AAACCGATGC ACCGTATCTT TCGCCACATC 
GAGTAACTTT AGTAGCTGAA CAAATTGCTG 

GCGAACAAAC AACTAAAAAT GCAGAGAAAT

GAGAAAGATC ACCGCCATAA ATGTAAACGA 
TTCTCACTTT TTTAAATTAA AATATCGTGC 

AGCTTTGAAA TTAAGAATTG TAGGAAGGCG

QTAGAAGGAC GAGATGATAC TGAGCGTGTT 
ACGAATGGTA GTGCCATCAA CGAACAAACT 

CGAGGCGTTA TTGTATTAAC AGATCCAGAT 
ACTGAACATG TCAAAGGTGT TAAACATGCX3 
AAAGGGAAAA TTGGTGTTGA ACATGCCGAC 

GTTAGTTCAC CCTTTGATGA AGCTTATGAA 
GGGTTAATTG TTGGGAAAGA TGCAAGGCGC 
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AAGTTGAAAT TAAGGCAGCA ACGATTATTG 6060 

TTTTAAAAAT TCAAGTAGAC TTAGATTCTG 6120 

AATTCTATAC ACCAGATGAT ATTATTGGTA 6180 

CAGCTAAATT AATGGGACAA AAATCTGAAG 6240 

TATTAACCTT AGTAAGTTTA CCAAGTGCAA 6300 

ATTTTTAAAA ATTAGGAGAG ATAATTATGT 6360 

AGCAATACGA TGATGATTTG AGTGAAGTGA 6420 

GTATGTTTGT AGTTGGTTTT AACAAATCGA 6480

AGTATGATTT TTTATATGGC ATTATGGGTT 6540 

AAGAACACTT GGAATGGATT GAATCTTTAG 6600 

AAATGGGATT AGATTATCAC TGGGATAAAT 6660 

GAAAGCAAAT TGCTTTAGCT AAGCGTTTGA 6720 

CAACTCAAGA CTGTATCGAT ATCTTATTGG 6780 

TGCATAGCTT TAGTGGTTCT CCAGAAATTG 6840 

ATATTTCATT AGGTGGACCT GTGACATTTA 6900 

AGCATGTGTC AATGGAGCGT TTGCTAGTTG 6960 

CX3TATAGAGG GAAGCGAAAT GAACCGGCGA 7020 

AATTAAAAGG CTTATCTTAT GAAGAAGTGT 7080 

TGTTTAATTT AAATTCATAA AGTTAAAAGT 7140 

TGCTATATTC GTTTAATATG CTATGGTTCT 7200 

ATGTGGAATA CGTGCGATAO AGATGGTTAG 7260 

TTTTAAATGA AAATCAATGA GTTTATAGTT 7320 

AAACX5AGCTG TTGAATGTGA TACGATTGAA 7380 

TTAGAAGTAA TTAGAAATGC TCAACAAAGT 7440 

TTCCCAGGAG ATAAAATTAG AAGTACAATT 7500 

TATATTGATA GAGAAAAAGC TAAAAATAAA 7560 

TTAATTGATA TTAAAGAAGC GTTAATGCAT 7620 

TCAATTGATA AATCTGTGCT AATAGAGTTG 7680

CGTAGAGAAA TTTTAAGTAG AAAATTGCGA 7740 
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GCGGATGTAA GGCAAGCTTT AGAAGATGAA 
TATTGCAACA CCATCAAGAA CGCGAGCGTT 

AAGTTTAGGA CAGAACTTTT TGATAGATGT 
TGATATTGAT GCACAAACTG GGGTGATTGA 
ACAATTGGCC AGACATGCTA AAAGAGTATT 

TGTATTAAAT GATACACTAT CACCTTATGA 
AAAA6CGAAT ATTAAAGAAG CTGTTGAAAA 
TGTTGCAAAC CTGCCGTACT ATATTAOGAC
TATACCAATT GATGGCTACG TGGTGATGAT 
TGAAGTAGGT TCAAAAGCAT ATGGTTCGTT 
TAGTAAAGTA TTAACGGTAC CTAAATCTGT 
AGTTGTAAAA CTGATGCAGA GAACTGAACC 
CTTTAAGTTA GCAAAAGCAG CATTTGCACA 

AAATTATTTT AAAGATGGTA AACAACACAA 
AGGTATTGAT CCAAGACGTC GCGGTGAAAC 
TGAAGAAAAG AAAAAATTCC CTCAATTAGA 

TTGTTAAAAT TTAAATTTTG TTTGACGAAA 
AGCGAGGTGG AGCAATATGC CAAAATCAAT 

TGTAGGAAAT CGTATTGTAC TGAAaGCCAA
TGGAATTTTA AAAGAAACAT ATCCGTCAGT 
CAACjTTGAG AGAGTATCTT ATACATACAC 

ATTTGAAGAG GATAATCATC ACGAATCAAT
AGACGTTTCT TAGTATAAGA AGTAAATATT 
TTCAATACTC TTTTTATTTA CAAAATGTTT 

GTAAATGGAT AATTGTATTT ATAAACACAA 
GAAAATTACT TTTTTATTAA AAAAACACTA 
CTCTTTATGT TAAAATCATC ATATTAAGAT 

AACGGCACCA GCCAAAATTA ATTTTACGCT 
TCATGAGATT GAAATGATAA TGACAACAGT 



TGAGGAAGTG AAAATGTTGG ATAATAAAGA 7860 

GTTAGATAAA TATGGCTTTA ATTTTAAAAA 7920 

GAATATCATT AATAATATCA TTGATGCAAG 7980 

AATTGGTCCA GGCATGGGGT CATTGACAGA 8040 

GGCATTTGAA ATTGATCAAC GTTTAATACC 8100 

TAATGTGACG GTGATTAATG AAGATATTTT 8160 

TCATTTACAA GATTGTGAAA AAATAATGGT 8220 

GCCAATTTTA TTAAATTTGA TGCAACAAGA 8280 

GCAAAAAGAA GTGGGCGAAC GCTTAAATGC 8340 

ATCAATTGTC GTACAATACT ATACAGAGAC 8400 

ATTTATGCCA CCACCTAATG TTGATTCAAT 8460 

GTTAGTAACA GTAGATAACG AGGAAGCATT 8520 

AAGAAGAAAG ACAATTAACA ATAACTATCA 8580 

AGAAGTGATT TTACAATGGT TGGAACAAGC 8640 

GCTATCTATT CAAGATTTTG CTAAATTGTA 8700 

AAATTAAATG ATTGACAAAG CAAAGCACTA 8760 

ACGTTGCAAA TATGGTATTA TGTAACTTGT 8820 

TTTGGACATC AAAAATTCTA TTGATTGTCA 8880 

TGGAGGCCXrr AAGAaAACAA TAAAACGTTC 8940 

TTTCATTGTT GAGTTAGATC AAGACAAACA 9000 

TGATGTGTTA ACTGaAAATG TTCAAGTTTC 9060 

TGCACACTAA ATAAGACATA TAGAGATGTT 9120 

ATGATAATTA TTTGAGTGTT GGGCATTATG 9180 

AACACTGATG TTTCGCTTAT AGATTTTTCA 9240 

ATACAAGTAA ATACTAAGTA ATTAGATGGA 9300 

AAAAACAAAT TAAAATGTCA AATATTAATT 9360 

AACGAAAAGA GGGCGGAAAA TGATATATGA 9420 

CGATACACTT TTTAAAAGAA ATGATGGCTA 9480

TGATTTAAAT GATCGTTTAA CTTTTCATAA 9540 
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AAATCTCGCA TATCGTGCAG CGCAACTATT TATTGAGCAA TATCAACTAA AGCAAGGTGT 9660 

AACAATTTCT ATCGATAAAG AAATACCTGT TTCTGCTGGC TTAGCTGGAG GTTCGGCTGA 9720 

TGCAGCAGCA ACGTTAAGAG GATTGAATCG ACTTTTTGAT ATAGGGGCGA GTTTGGAAGA 9780

ATTGGCTCTA CTAGGCAGTA AAATCGGGAC AGATATTCCG TTTTGTATTT ATAATAAAAC 9840 

TGCACTATGT ACTGGAAGAG GAGAGAAAAT CGAGTTTTTA AATAAACCAC CTTCAGCTTG 9900 

GGTGATTCTT GCTAAACCAA ACTTAGGCAT ATCATCACCA GATATATTTA AGTTGATTAA 9960 

TTTAGATAAG CGTTACGACG TACATACGAA AATGTGTTAT GAGGCCTTAG AAAATCGAGA 10020 

TTATCAACAA TTATGTCAAA GTTTGTCTAA TC6ATTAGAG CCAATTTCTG TTTCAAAACA 10080

CCCACAAATC GATAAATTAA AAAATAATAT GTTGAAAAGT GGTGCAGATG GTGCGTTAAT 10140 

GAGTGGAAGC GGACCTACTO TGTATGGGCT AGCACGAAAA GAAAGCCAAG CAAAAAATAT 10200 

TTATAATGCA GTTAACGGTT GTTGTAATQA AGTGTACTTA GTTAGACTAT TAGGATAGAA 10260

GGGTTGAAAA GATGAGATAT AAACX5AAGCG AGAGAATTGT TTTTATGACG CAATATTTGA 10320 

TG 10322 

(2) INFORMATION FOR SEQ ID NO: 99: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5614 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 
GATTGATTAA ATGTTTTAAT CCACTTCAAT GCCTTCGATA AACTCTACAA TCGCGCTATT 60 

CATATAATTA TTCGATTTCA TTTGTTCAGC ATATGTCTCA TTAAATCCAG ACATAACTTT 120 

TTTAAAWGCG AAAATTGAAA TTGGTATCGT TACTAATAAG GCACTAGCCA TACGCCAATC 180 

AATGAGCATT ATGTATAAAA AGATAGCAGC TGACAAAAGT AAGTTrCCTA TAACTTCAGG 240 

AATCATATGT GCTAAAGGTA ATTCTATTGT TTCAACCTTA TCGACAAATA TATTTTTTAA 300 

TTCACCTATT TTCTTAGATT CCaCTACGCC TAAAGGGAGA CGCATTAATT TTTGAGCTAA 360 

TTTTTTACGA ATTTCAGATA AAATTTCATA TGCCGTAATA TGTGATAGCA TCGTTGACGC 420 

TCCAAAACAA CACACTTGTG AAATATAAGC GATTAAAGCA ATAAAGATAT AAACCATAAT 480

CGAATTAATC GTATATGTAT TGTTAATCAT CATTAAAATA ATTTTAAATA CTGCCCAATA 540 

AGGAACTAAT CCAGAAAAGA CACTGATGAT AGACAACAAA ATTGATAACA TAATTTTCCA 600 
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ATATGTAACT CCTkTCAATT AATAATCTAA ATTAAGCCGC TTATATTATT TATTTCACTG 720 

GATGATATAC ATAATATAAA TTTGTTATTT GTTAAAAATT AATACTTATT ACAAGTACAT 780

CATATATTAG TTGATAACX3A TTATCAATGT CGCGTGGATT TGTGACACAT TTCTTTTAAA 840

AATTCACAAG GTTATGGGGC AGAAATGATA AAGAGCCACT AATGATTTAT TATGTAGTGG 900 

TTCTGGGAGT GGGACAGAAA TGATATTTTC ACAAAATTTA TTTCGTCGTC CCACCCCAAC 960 

TTGCATTGTC TCTAGAAATT GGGAATCCAA TTTCTCTTTG TTGGGTCCCT GAATATAGCC 1020 

TTGTAGAGTC TAGTACATTG ATTTGTATCC CAATGTCCCT ATAATTGATT ATTCGCTTTA 1080 

TCTAATGATC CTATGACTCA ACTATTAAAT CATTTTTCGA AATACTTAAT TCTAATATAA 1140 

TTAAATTCAT TTATTGTAAT ATTGCAAAAA TACATTGCAC ACCTTGTTCA TCAATGCTAT 1200 

AATTAATTAC ATAATAAATT GAACATCTAA ATACACCAAA TCCCCTCACT ACTGCCATAG 1260 

TGAGGGGATT TATTTAGGT6 TTGGTTATTT GTCACCTTTT TTATTGTTGC GCGTTOGTAA 1320

CCAATGTGCA AAAAACGCAA CAAGACAGCC GCTTATAGCT GAAGTCATGA TGTTAATTAA 1380 

TAAATTGAAC ATCCGTCATA CACCTCCTCT CT6CGTTAAA GTAACGCCCG AGATGTTAGG 1440 

CGACCATCAT ATTATATCAT TTATTTATTA TATTTCACGC AATATTAAGG CTTAAGTAAA 1500 

GTTTTTTTTA GTGGTTTACG CTACTTTAAT TGCTATCTTT TAAAATCCAT TTAGATAATA 1560 

TAAATGTGAT GGGTATCGTA ATAATTAAAC CAGCAAATGG TGCAATTTCT GCTGGCAAAT 1620 

TTAGCCAGGA TACAAATACA TATAATAAAA CTGTTTGTAA GCTTACGTTG ACAATCTGCG 1680 

TAATTGGAAA ACTAATGAAT TTTCTCCAAG TAGGTTTTAC CCTGTAAACA AAATAACAAT 1740 

TCAAATAATA TGAAATCACA AAAGCGACTA GAAATCCGGT AATATGACTA ATCATATATT 1800

CAATGTGTAA TAATTTTAAC AGCAATAAAT AGACAACATA ATAATTTAAC GTATTAATGC 1860 

CGCC^CAAT GATAAATTTT AAAATTTCAG CATGCGTTTG TGTTAGTTTC ATATGTGTAc 1920 

TCCTCAACAT CAAAATATAT GCATAACTAC GTTCTCXSAAC ATACTCGAAT ATGCGAGCCA 1980

ATCCGCTTCA CTTCAAATAT GCTTATTTCA ATCTTTATAC CCTTTCACAG CAAATTTAGT 2040 

CTCTTTCCCC TCATCCTTAT ACGCCATTAT AATGTAACTG ATTTATCGCG TGACTCATTA 2100 

GCACTATAGA GATTACTTTA GTTCACTAGT AATTTTATAT ACAATAAGAG CGACAACAGT 2160 

AATGAGAGGA TGTCTACTAT GCAATTACAA AAAATTGTCA TCGCTCCTGA CTCATTTAAG 2220 

GAAAGTATGA CCGCACAGCA AGTTGGCAAT ATTATAAAAC AGGCTTTTAC TAATGTTTAT 2280 

GGGAATACCC TTCATTATGA TATCATTCCG ATGGCTGATG GTGGTGAAGG TACCACAGAT 2340

GCTTTAATGC ATGCAACAGG TGCCACTAAG TATACAGTCA TCGTTAATGA CCCTTTAATG 2400 
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GCGGCAGCGT CAGGTTTGGA TTTATTAGAA AAAGAGGAAC GTAATCCTTT ATACACATCA 2520 

TCATATGGTA CCGGTGAACT AATTAAAGAT GCATTAAATC ATGGTGCTAA GACCATTATT 2580 

TTAGGGATTG GTGGCAGTGC AACAAATGAT GGTGGTACAG GTATGCTAAG TGCACTAGGC 2640 

GTAAAGTTTA CTGATGTAAA CGGGGACTTA TTACAAATGA ATGGTGCTAA TCTTGCTCAC 2700 

ATTGCACAAA TCGATATAAC CAATCTAGAT TCGCGATTAA AAGAGGTGAC CTTTAAAGTG 2760 

GCCTGTGATG TTTCAAATCC TTTATTGGGT GAAAATGGTG CTACCTATAT TTATGGTCCT 2820 

CAAAAAGGCG CTGATGCAAA GATGATACCA AAGTTGGATT TCGCAATGTC GCATTATCAT 2880 

GATAAGATAA AAATGTGCAC AGGAAAGTCC GTTAATCAAA TACCAGGTTC TGGTGCAGCT 2940

GGCGGTATGG GCGCAGCATT ATTAGCGTTT TGTGAGACAA CTTTAACAAA AGGTATTGAT 3000 

GTCGTCTTTG ACATTACAGA TTTTCATCAA AGAATTAAAG ATGCAGACCT CGTTATTACT 3060 

GGAGAAGGAC GCATGGATTA TCAGACCATC TTTGGTAAAA CACCCGTAGG CGTTGCGTTA 3120 

GCTGCAAAAC AATATCATAT TCCTGTCATC GCGATTTGTG GCAGTCTAGG CX3AAAATTAT 3180 

CAACATGTTT ACGATTTCX5G TATTGATAGT GCCTATTCTA TAATCTCTTC ACCTAGCACT 3240 

TTAGAAGATG TCCTACAAAA TAGCGAACAA AATTTATTAA ACACTOCAAC TGACATTGCT 3300 

CGTATTCTGA AATTACAATA ATGTCAAAGT AAATCATCAG CTTTATTATT TGCAGTTAAA 3360 

ACTTGAATGA GGTGAAACCC ATGAAAAGAA CTGATAAATA CCGTGATTCA TATCAATACG 3420 

ACAATCAAAA CCAAAATCAT CGTCGTCAAT CTGAAGACX5C ATCGTATAGA CAACAATATG 3480 

CTAAAGGCGA TCCTGAAGAA CACCCGGAAC GATACTATAA TGGTAGAGAT TATCGAAGAfi 3540 

AACAAATTCT TGAAGAAGAA AACGAGAAAT CCCGCCGTTC AAAAAAATGG TTATATATCA 3600 

TTATTGCCAT TCTCTTAATT ATTGTCGCTA TTTTTGTCAC ACX3CGCCTTA CTTAACAATG 3660 

ATAGCGATAA AGTTAGTAAT GACCCTAAAG TCTCTCAAAA TTATAAAAAA CAAGTTGAAA 3720 

ATCAAGACGG CCAAATTAAC CAGCAAGTAG ATAATGCTAA AGAAAATATT AAAAACAACC 3780 

AAAAAACTGA TGACATTATT AAAAATTTAC AAAATCAAAT CGACAACTTG AAGCAGCAAG 3840 

AACAAAACAA AGCTGATTCT AAGCTAACTC AATTTTATCA AGACCAAATC AACAAATTGA 3900

CAGAGGCAAA TAATGCACTT AAAAACAATG CAAGCCAAGG TAAAATTGAA AGCATGTTAA 3960 

ATGATATTAA TACAAAATTC GACAGTATTA AATCTAAATT AGAAAGCTTA TTTAAAGATG 4020 

ACAATGGTGG CGCTAATTAA TTATTACACC TGCTTTGATG ATAAACATTA ATTCCCTATA 4080 

CTTTATCTGT ATCACTACGT TATTCGTGAT GATGCATTAA GAGTATAGGG ATTTTTTATA 4140 

TAAACTTGTA TTCTAACTAC ATACAAATAC ACACAAAACG TATATAATTT ATATAATTAT 4200 
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TTATTGCTAA TTACGTTAGG CGTCATGACC GCTTTTGGCC CACTAACTAT AGATATGTAC 4320 

GTACCATCAT TACCTAAAGT GCAAGGTGAT TTTGGTTCTA CTACATCAGA AATTCAATTA 4380 

ACATTATCAT TCACAATGAT TGGTCTTGCA CTAGGCCAAT TTATCTTTGG ACCTTTATCC 4440 

GATGCTTTTG GTCGCAAACG GATTGCTGTA TCCATTTTGA TCATTTTCAT TTTGGTATCA 4500 

GGTTTGTCTA TGTTTGTTGA TCAATTGCCA TTATTCTTAA CTTTACGATT TATTCAAGGT 4560 

TTAACTGGTG GTGGCGTCAT CX3TGAITGCA AAAGCCTCTG CTGGTGATAA ATTTAGTGGC 4620 

AACGCACTCG CTAAATTTTT AGCATCTTTA ATGGTAGTTA ATGGCATCAT CACTATTCTT 4680 

GCACCATTAG CCGGTGGATT AGCTTTATCC GTAGCAACAT GGCGTTCTAT TTTCACAATT 4740 

TTAACTATTG TGGCACTCAT CATTTTAATT GGCGTCGCTT CTCAATTACC TAAAACATCT 4800 

AAAGATGAAT TAAAGCAGGT GAATTTTAGT AGCGTCATTA AAGATTTTGG AAGTCTTTTG 4860 

AAAAAACCAG CATTTATTAT TCCAATGCTA TTACAAGGwT TAACTTATGT AATGCTATTT 4920 

AGTTATTCAT CTGCATCGCC ATTTATTACT CAAAAATTGT ATAATATGAC ACCCCAACAA 4980 

TTTAGTATCA TGTTTGCTGT TAACGGTGTA GGTTTAATCA TTGTCAGTCA AGTCGTTGCT 5040 

TTATTAGTAG AAAAATTACA TCGCCACATA TTATTAATCA TTTTAACTAT TATACAAGTG 5100

GTAGGTGTTG CTTTAATTAT CCTGACACTT ACATTCCATT TACCACTTTG GGTCTTACTC 5160 

ATCGCATTCT TCTTAAATGT GTGTCCTGTG ACGTCAATTG GACCX5CTTGG TTTCACAATG 5220 

GCTATGGAAG AACGAACAGG TGGCAGTGGT AACGCATCAA GTTTACTTGG CTTATTCCAA 5280 

TTTATCTTAG GTGGCGCTGT TGCACCATTA GITGGCTTAA AAGGCGAATT TAATACATCA 5340 

CCATATATGA TTATTATCTT CATTACAGCC ATTCTATTAG TCAGTCTACA AATCATTTAC 5400 

ITTAAAATGA TTAAAAAGCA ACATGTCGCA TAACACTTCA ACATAATTAG AACCCTAGCA 5460

AAGMATCTA TCTTTGTCAG GGTTCTTCTT TATGAATTAT GAGATCGAAT CTTCAACTAA 5520 

AATTACGCCT TCATAGCAAG GACATTTCTA TTCAATCACC CTTTAACAGG CATCCAAATT 5580 

TcTGTAATAT ATTTTTCACT TGTAGTATCA CCAT 5614 
(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 9179 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 
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10 



AAAGACAATG ATATGAAGTA TATGGATATC ACAGAaAAAG TGCCAATGTC GGAATCTGAA 120 

GTTAACCAAT TGCTAAAAGG TAAGGGGATT TTAGAAAATC GAGGGAAAGT TTTTCTAGAA 180 

GCTCAAGAAA AATATGAGGT TAATGTCATT TATCTTGTTA GCCATGCATT AGTAGAAACA 240 

GGTAACGGCA AATCAGAATT AGCAAAAGGC ATTAAAGATG GGAAAAAACG CTATTACAAC 300 

TnTTTGGTA TAGGAGCATT CGATAGTAGT GCTGTTCGTA GTGGGAAAAG TTATGCTGAA 360 

AAGGAACAAT GGACATCACC AGATAAGGCG ATTATTGGTG GTGCAAAGTT CATTCGTAAT 420 

GAATATTTTG AAAACAATCA ACTGAATTTA TATCAAATGC GATGGAATCC AGAAAATCCT 480 

GCGCAACATC AATATGCGAG TGACATTCGC TGGGCAGATA AAATTGCCAA ATTAATGGAT 540 

AAATCCTATA AOCAGTTTGG TATAAAGtAAA GATGATATTA GACAAACATA TTATAAATAA 600 

GACATCGGTG CTTAAAGGAG CTGGAACAAT TTATTGTTTC GAGCTCCTTT AGCGCATTCT 660 

GAGTGTGTTA GTTAAATGGA TTTTAACCTA ACAAAAAACG CTATATAGCA TCAAATATGC 720

TATATCCCAC ATCATTGTTA CAAATGTACA TGATGTAAAT GAATATTGCT GTCTAAATGT 780 

GCATGTAATA TACAATGGTG CAGATAATAC ACTTAAGTCC TTAAAAATCA AACGTTAgTT 840 

CCAAGAGTCA TTTTTAAACA ATAGTGCATG TGATAAAATA GAAAAGAATG AAAAATATAG 900

AGGTGACAAT ATGAAGATAG CAATTATAGG TGCAGGCATC GGTGGATTAA CAGCTGCTGC 960 

ATTATTACAA GAACAAGGTC ATACTATTAA AGTCTTTGAA AAAAATGAGT CAGTTAAAGA 1020 

AATTGGCGCT GGGATTGGTA TCGGAGATAA TGTGCTTAAA AAACTAGGTA ATCATGACTT 1080 

AGCTAAAGGT ATTAAAAATG CTGGGCAAAT CTTATCTACA ATGACAGTGT TAGATGACAA 1140 

AGATCGCCTG TTAACTACTG TTAAATTAAA AAGTAATACA TTGAATGTGA CGTTACCACG 1200 

CCAAACATTA ATTGACATTA TTAAATCTTA TGTAAAAGAT GACGCAATAT TTACAAATCA 1260 

TGAAGTCACG CATATAGATA ATGAGACAQA TAAAGTTACC ATACATTTCG CGGAACAAGA 1320 

AAGTGAAGCA TTTGATTTAT GTATTGGTGC TGATGGAATT CATTCTAAAG TGAGACAATC 1380 

TGTAAATOCT GACAGTAAAG TATTATATCA AGGGTATACA TGCTTTAGAG GTTTAATTGA 1440 

TGATATTGAT TTAAAGCATC CGGaTTGTQC AAAAGAATAC TGGGGaAGAA AAGGaAGAGT 1500 

AGGTATTGTT CCGTTATTAA ATAATCAAGC ATATTGGTTC ATTACAATTA ACTCGAAGGA 1560

AAACAATCAT AAATATAGTT CGTTTGGTAA ACCTCATTTG CAAGCATACT TTAATCACTA 1620 

TCCAAATGAA GTTAGAGAGA TCTTAGACAA ACAAAGTGAA ACAGGTATCT TATTGCATAA 1680 

TATTTATGAT TTGAAACCAC TCAAATCTTT TGTTTATGGT CGTACTATTT TACTAGGAGA 1740 

7GCAGCACAT GCGACAACGC CTAATATGGG GCAAGGTGCT GGACAAGCAA TGOAAGATCC 1800 
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TAAAATACGT GTCAAACATA CTGCAAAAGT AATTAAGCGT TCTAGAAAAA TCGGTAAAAT 1920 

TGCCCAATAT CGTAGTCGTT TATTTGTTGC AGTTAGAAAT CGTATTATGA AAATGATGCC 1980 

AAATGCATTA GCAGCTGGAC AAACTAAATT CTTATATAAA TCGAAAGAAA AATAATACAA 2040 

CAATATGAAA ACCCCCGTAT GTTGAAACX3A GAGCTCAACA TATGGGGGTT CTTGTTTTTA 2100 

TAATGTTATT ATAATAAATT CAATTATTAG TTAACGACAA ATTGTGGTTT CTCACCTTGA 2160 

ACGGCACTAA TTGCAGCATT AGCAACAATT TTAGACATCA TGTCACGT6C TTCAAATGTA 2220 

GCATTACCAA TATGCGGTGT TAATACTACA TTATTAAGTG ATTTTAAGTC ATCGGTAATA 2280 

TCTGGTTCAA ATTCATATAC ATCAAGTGCA GCACCTTCAA TTTCATTATC TTTCAATGCT 2340

TGCACTAGTG CTTGTTCGTC CACGATTGGA CCACGAGAGG CATTGATTAA ATACGCOGTA 2400 

GATTTCATCA TTTTAAATTG TTCTGTATCA ATTAAATGAT GCATTTTAGG ATTATAAGCA 2460 

GCGTTGATAG TGATAAAATC TGCATTCTTT AATAGTGTAT CTAAATCTAC ATATTTTGCA 2520

CCGATTTCTC QTTCTTTTTC TTCTTTGCGA TTAGGTCCAG TGTATAGCAC ATCCATGTCA 2580 

AATGCTCTTG CACGACGAGC TACTGCACTA CCAATTTCAC CTAAACCGAT AATGCCGATT 2640 

GTTTTCCCAG ATACTTCTCT ACCTCTGAAA AATAAAGGTG CCCATCCATC AAATCCAGTT 2700 

GTACGTGATA ATTGGTCCCC TTCAACAATA CGACGCGCTA CTGCAAGTAC TAATCCAATT 2760 

GTTAAATCAG CAGTCGCGTT TGTTGATGCT TTAGGTGTGT TTGTAACATC TATACTTTTT 2820 

TCTCGGGCAT ACTCGATATC AATATTATTA AAACCAGCGC CATAGTTGGC AATGATTTTT 2880 

AAGTCTTTAC CAGCATCGAT AACATCTTTA TCAACGTTTG TAGATAATAA ACTAATTAAG 2940 

GCAGTCGCGT TTTTAACACC TTTAATTAAA GTGTCTTTAT CGACTAATCC TTTACCTTCA 3000 

TACATTTCAA CTTCAAAATG TTCTTGTAAA AGTTTTAAAC CTACTTCTGG TATtGCACCA 3060 

gCAAGATAAm CTTTTtCCAT AAAAGAtCAC TCCTTTTATC TTAGTATAGT AGAAGATTAG 3120 

ACAGTATACA ACTATGTCAT GATGTCTTGT GTATCAATGA TGTAAGCGCG TACTTTTGAT 3180

GGAGGCGATA TAACTTAGGC ACTGTAGAAC TATGAATATT GTAATGTGGA AAAACTGGAT 3240 

CAATTAAATT AGATAACGTA GTTTTAAAGT TAATAGTATT AGAAAAAATT AATATTTTGA 3300 

ATATGGGAGG AAATATAAAT AAGTAGGTGG CAACXAAAAA TAGCAAAAAA AGAGCTTCTC 3360

CTATAAAGGA AAGCTCAAAG TTTTTTGATG ACATATGTAC TAGAATTAAG TTTCAAGACA 3420 

ATATGTATCA TCGTGTTTAT ATTAAATATG GATGTAGTTG TAGTTACCTG CTTCACTTGC 3480 

AGAAATAGTT CTAGTUICTTA CTGAGAAAGG TCCGCCACTA TAATTCATTT CTGAAATTGT 3540 

AACTGAACCA TCACTGTTTA CACTTTCTAC ATATGCAACG TGACCAAATG GTCCTTCAGA 3600 
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AGCAGCAGCC CAATTATTAG CATTTCCCCA 
ATATACATAC CAAGTACATT GTCCTGCAGT 

TGTAGTTGTC GTAGTTGTCG TAGTCGTTGT

GTAATTTGTA TAATTTTCAG CAGCATCTGC 
GATTCCTGCT GTTAACGTAG TTGCTGTTAC 

TTCTATATCT TTTTTTATAA ATAAAACGTA 
ATGACAATAG TTACTTTAAC AAAATtAATG 
AAGAATAAAA AAACTTT6AC TAATTTTGTA 
AACAGATAAT TAATAGGAAA TATTTATTTG 
GGTATTATAT ATTCTTGGCC ATTATAATAT 
GATAATATTG AGAAAGCGAA TATGGATAAA

ATAATAATGA AATCAATATC TGTAGCAATT 
AAAACGATAG ACCAAATAAT ATAAGAAATC 
TCAACTAGTT TCGATTCATC TTTTTTCAAT 
GTGAATAAAC TTAATAAATA GATAAGCATC 
TTCGGTTGAT GATTTGTTAC GTCGTTCATT 

ATTGTAATAT TATCTTTAAC TATAACAAAA 
AATTATTAAA AATAAAAATA ATTGGTGGAC 
ATATATACTT AACATTTATA ATGATGCGTA 

CGTATAATTT GTTTTTAATT TTAACCAAAG 
ATTGTAGGAT CAGGAAATGG CGCAGTTACG 
GATGTTAAAT TATATTGTCG TAATCAATCT 
GGCGGATTTG ATTTTAATAA TGAAGGTGAT 
GATGATATGG AATATGTTTT AAAAGATGCT 
TACATAGAGT ATTATGCTGA TGTAATGGCA

TTCAACATGG CTGCAGCAAT GGGGTCAATT 

ATTGAAACAA AACCACAACT AGCGGAAgcT 

TTTGAAAATG CAGCAGTTGA TTTATCTCTA 

GATAGAAGCT GTCTAAATGA TTGTTATGAC 


AGTAGAACCG ATTTCTCCGC CAACTTTATC 3720 

GTATAAGTTA CCAGAATGTG AAATTGATGA 3780 

AGTTTGAGTC GTGTTGTAGT TATAGTTGTT 3840

ATGATGTGCT TGACCTACTA ATGCTGTGCC 3900 

TAATTTTTTC ATGAATAAAG TCCTCCAAAG 3960 

QCGACTGTTT TATTCTCACA TCTCGAATTG 4020 

CTTCTTGTGG GGAATGTTAT TGATTTGTAA 4080 

ATAAAAATTA GTCAAAGTTA CAATGAGATT 4140 

TAATATGTTT AAATAAATC6 AATTGTTAAA 4200 

TTGACACACG CAATAATTGT GAATACAAAA 4260 

ATACCGATAA ACXSTAATGAT GAAACCTATA 4320 

AGGAAAACGC CTATTAAAGT GATAACGACT 4380 

GTATAGTTAA GATAATTTTt TCCAGCACGA 4440 

AACCATATTA TCAGTGGACC AATAATAGAT 4500 

GCCATAATGT TCTCATCATT GGATTTGCGA 4560 

TCAGTTGTCA TATTAGACAC TCCTTTGAAA 4620 

TATAATCAAA AATAAACATG TTTATTAAAC 4680 

GTCGGCGTTT AAATAGGTTA ATTTAAGGTT 4740 

ATGAATTCGC ATCATTTTTA TATTGTCTTA 4 800 

ATAGAAAGAG GGTTGTTTAT GAAAATAGCA 4860 

GCAGCAGTAG ATATGGTGA6 CAAAGGCCAC 4920 

ATAAGTAAGT TTCAAAACGC AATCGAAAAG 4980 

GAACGTTTCG TAAAATTCAC TGATATTAGT 5040 

GAAATTGTTC AAGTGATTAT TCCATCTTCA 5100 

GAGCATGTAA CTGATAATCA GTTGATATTC 5160 

CX3TTTTATGA ATGTTTTAGA AGATAGACAT 5220 

AATACGTTGA CGTATGGTAC GCGTGTCGAT 5280 

AATGTACGTC GTATCTTCTT TTCAACATAT 5340 

AAAGTTTCAA GTATTTATGA TCATTTAGTA 5400 
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CCAACATTAT TGAATGTCGG TCGCATTGAT TATGCTGGCG AGTTCGCTTT ATATAAAGAA 5520 

GGAATTACTA AACATACAGT TAGATTACTT CATGCAATCG AATTAGAACG TTTGAATTTA 5580 

GGCCGTAGAT TAGGTTTTGA ATTATCAACA GCTAAAGAAT CACGTATTGA ACGTGGTTAT 5640 

TTAGAACGTG ATAAAGAAGA TGAACCATTA AATCGTTTGT TTAATACAAG CCCAGTATTT 5700 

TCACAAATTC CAGGACCAAA TCATGTAGAA AGCAGATATT TAACTGAAGA TATTGCATAT 5760 

GGTTTAGTAC TATGGTCAAG CTTAGGTCGT GTTATTGATG TACCGACACC AAATATAGAT 5820 

QCAGTAATTG TAATTGCATC AACCATTTTA GAGAGAGACT TCTTTGAGGA AGGCTTAACA 5880 

GTTGAAGAAA TTGGTTTAGA TAAGCTTGAT TTAGAAAAAT ATTTAAAATA AATGATGGCT 5940 

TGAAGATAGA AAAGGATATA GCATTATGCA AAAGCAATAA ATTGAA6AAA AGAGGTTTCT 6000 

CATCAATAAG CGnAGGGGAC GATAGATGAT GAAAAGAAAA CCCACCTTTT TAGAATCAAT 6060 

TTCGACAATG ATTGTAATGG TTATTGTTGT TGTAACAGGC TTTGTGTTTT TTGATATTCC 6120 

AATTCAAGTA TTATTAATTA TTGCCTCAGC ATATGCCACA TGGATTGCAA AACGTGTAGG 6180 

CTTAACATGG CAAGATTTAG AAAAAGGCAT TGCAGAACX3T TTAAATACTG CAATGCCTGC 6240 

AATTTTAATT ATACTAGCGG TAGGAATTAT AGTAGGCAGT TGGATGTTTT CTGGCACAGT 6300 

GCCAGCCTTO ATTTATTATG GCTTAGATTT ATTGAATCCA AGCTATTTTT TAATATCAGC 6360 

CTTTTTTATA AGTGCTGTTA CATCTGTAGC AACTGGTACA GCATGGGGCT CTGCATCAAC 6420 

TGCAGGGATT GCACTTATTT CTATTGGTAA TCAATTGGGG ATTCCTCCAG GGATGGCAGC 6480 

GGGTGCTATT ATAGCAGGGG CTGTGTTTGG CGATAAAATG TCACCATTAT CAGATACAAC 6540 

TAATTTAGCG GCGCTTGTTA CTAAAGTTAA TATATTTAAA CATATACATT CGATGATGTG 6600 

GACGACGATA CCTGCATCAA TCATAGGTTT ATTAGTATGG TTTATTGCTG GATTTCAATT 6660 

TAAAGGGCAT TCAAATGATA AACAGATTCA AACTTTGTTA TCAGAGCTTG CACAGATTTA 6720 

TCAAATTAAC ATATGGGTCT GGGTTCCCTT AATTGTGATC ATTGTTTGTT TGCTATTTAA 6780 

AATGGCTACA GTGCCAGCTA TGCTAATATC AAGCTTTTCT GCCATTATAG TGGGGACTTT 6840 

TAATCATCAT TTCAAAATGA CAGATGGTTT CAAAGCAACA TTTAGTGGTT TTAACGAATC 6900 

AATGATACAT CAGTCTCATA TTTCATCCAG TGTGAAAAGC TTGTTAGAAC AGGGTGGTAT 6960 

GATGAGTATG ACCCAAATAT TAGTAACGAT ATTTTGCGGA TATGCATTTG CAGGTATTGT 7020 

AGAAAAAGCA GGATGTTTAG AAGTCTTATT AACTACTATT TCTAAAGGCA TCCATTCTGT 7080 

AGGAAGTTTA ATATGTATTA CTGTTATTTG TTGTATTGCG CTTGTATTCXJ CTGCAGGTGT 7140 

TGCTTCGATT GTAATTATTA TGGTCGGTGT GTTAATGAAA GATTTGTTCG AAAAATACCA 7200 
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AATACCATGG GGAACATCAG GTATTTACTA TACGAATCAA CTTCATGTCT CTGTTGAAGA 7320 

ATTTTTCATA TGGACAGTAC CATGTTATTT ATGCGCAATT ATAGCAATTA TCTATGGTTT 7380 

TACAGGGATA GGTATTAAAA AGTCATCGAA TTCACGTTTA ACTTAATGTG AGCX3TGGAAT 7440 

ATATATAATA TGTTGAAACA CTTTAATCAT TTATAATTGT AGCGGTTATA ATTTGAAAAG 7500 

GTTTTAACTT AGAATAAATA TCCTCTATGC ATATACTGAA TATGTTTTGT AGCGGAACAT 7560 

GTTGATATAT GTAATGTAAG TTTTATGTCA TGATTTGTAA TGACTAAATT AATTGAGAAT 7620 

TTGAAGGCAA GTATATTTGT AAGTACTTTA ACTAAAAATT TATCAATGTA TAGCCGATTT 7680 

GACATGCCTA AATTTGGGTG TGTCAATGGC TGTATGTTGT TTATTCTTTA TTACAGAGTG 7740 

AATCGGATTG GTGAAAATCG AAATTTTGAG ATTTTTACCA ATTCGATTTT TTTCATAGAA 7800 

ATTAAAAAAG CCAACAAGGC TCITGAAACC TTGITGGCGT AAACATAGCC ATCACTAATT 7860 

AGTGAATGAA GTTATAACCA GCAGCTTGGC TAGCTGAGAT TGTACGTGAA GTTACAACAC 7920 

CTGGGCCATA ACCATAGTTC ATTTCTGAAA CTCTTACTGA ACCATTGCTG TTAACACTTT 7980 

CAACGTATGC AACGTGACCG TATGCACCTT GAGTTGTTTG CATAATTGCA CCAGCTTTTG 8040 

GTGTATTGTT CACTGTGTAA CCAGCTCTTG CAGCTGCGTT AGCCCAGTTA CTTGCATTGC 8100 

CCCAAGTTGA ACCGATTTTA CCACCTACAC GATCAAATAC GTAGTATGTA CATTGACCAG 8160 

AAGTGTATAA GTTACGTCCT GAAGTATAAC CACTTGAGAT TGAACGGCCA TTTGATGATG 8220 

GAGCCATAGT TGTAGTTACT TGAACATTGT TGCTTGAAGT GCTGTAGCTT GCACCTAAAC 8280 

CACCAGTACG GTAGCTGTTT GTGTTGTAAC TATTATAGTT ATTGTAGTTA TATGATTGAT 8340 

TATTATTTGA GTAGTTGTTG TAACGGCTGT AGTTATTGTA GCTATAACCG TTGTTGTAAT 8400 

TGTTATAGTT ATTGTAACCA TTGTAGTAGT AATAGCTGTA GTAGCCATTA TCTTGGTTTA 8460 

ATTGACTTGG ATGCCAGTTA CCTTTCCATG TGTAATGGTA GTTACCTTGT GCATCAATAG 8520 

TGTAAGTATA GCTATATGAT GTTGGGTCGT TTGGATTATA ACCGTAGTTA TCTTGCTCAG 8580 

AAGCATGAGC TTGATTTCCT GATGCAATTG CGATTGTAGC GAATCCTGCA GTTGCX3ATAG 8640 

TAGCTGTAGC GATTTTCTTC ATTTTAAAAA TATCCTCCTA AAAATTTTAA ATCTAAAATA 8700 

TTTTCGTAAT GTCCGTGTGA CAAAATTAAT GTTATAAGTT ATCTCTCGTA ATTAAACGAC 8760 

AAGAAAGACT ATAACAGAAA TTAGCGTCCT TGTGTGCTTT GTTAACGTTT TGTAATTTTT 8820 

TGCTAATATC TTGACACAAT AGAATTTTAA AAGTATAGAA ATTTGCATTT TGCAAAACTT 8880 

ATAACTACGG CATTCTTTGT GAAAACTGAA TGTTTCGAAA ATAAGTCTGT TACAAATTTG 8940 

TAATATTACT GAAAATTCTA AATGTATATT TTGTGCATAA TATAGGACTT TTAATCAGAA 9000 
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GGATGAAAAT GTATATTTAA TGGATAAAAT ATCCTAATTT AGCATAAAAA AATGTTTTAA 9120 
TAAAAGTATT ATTTGATATA ATCGATTTAT GTTTTGTTAC TGCTAAAAAA CATGTGGCG 9179 
(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1868 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 

CCTTCAGCCA TTTGACTTCG ACATGAGTTG CCTGTACATA TAAAATAAAT TGTTTTTTTA 60 

GTCATAACAA TCTCCTAATT AATTAAAATA TGATAAGTGT TAGATACAAC CCTATGAGGG 120 

TTATAAATAG TACTGGAATT GTAATGATGA TACCAGTTTT AAAGTATGTG CCCCAAGAAA 180 

TCTTAACATC TTTTTGtGTT AAGACGTGTA ACCACAGTAA TGTAGCTAAA GAGCCTATCG 240 

GTGTAATTTT TGGACCTAAA TCAGAACCGA TAACATTCGC ATAAATTAGG CCTTCTTTTA 300 

ACATGCCAT6 GACATTTGAT TGACCAATAG CAATCGCATC TATTAAAACT GTAGGCATAT 360 

TATTCATTAT TGATGATAAA AACGCTGAAA TGAAGCCCAT TCCCAAAATA GTGCTAAATA 420 

GACCGTAATT GGAAATATAT TCTAATATTT TAGCCAATAT TAAAGTAATG CCAGCATTTC 480 

TTAAGCCGAA TACGACGATA TACATACCAA TTGAAAATAA TACTATATTC CAAGGTGCGC 540 

CCTTAATSAC TTGCTTAATA TTTACAGCAT TTGATTTACG AGCCAACATT AGAAAAATAA 600 

AAGCAATGAT TCCAGTGAAA ATTGATACCG GAATTTTAGT AAATTTACTG ATTAGATAGC 660 

CGAAAAGTAA TATAACTAGA ACAATCCaTG AAATTTTAAA TAGCTTTAAA TCATTAATGG 720 

CATCFTTAGG ATGCTTTATA TTATTATCAT CAAACGTTTT AGGTATCGCT TTTCTAAAAT 780 

ATAACCACAA TACTATAATA CTTGCTAAAA GCGAGAATAA ATTAGGTATA ATCATTCTAC 840 

TAAAATATCG AACGAATCCT ACATGAAAAT AATCAGCAGA TATAATATTC ACTAGATTGC 900 

TCACGATTAA AGGTAAAGAA GTTGTGTCAG CTATAAAACC ACTCGCAATA ATnAAAGGGA 960 

ATATGGCCCG CTTACTAAAA CCTATATTTT TAACCATCGC TAATACAATA GQCGTTAAGA 1020 

TTAAcGTGCG CCATCATTTG CGAAAAATGC AGCAACAATG GCACCCAATA ATATGATATA 1080 

AACGAACATT TTTAAACCAT TGCCTTTTGA AGCATGAAGC ATGTGAATAG CTGACCATTC 1140 

GAATAATCCA ACTTTATCTA ATATTAATGA AATAAGAATG ACTGAGACAA AAGTCAAAGT 1200 

AGCATTCCAA ACAATACCTG TTACTTCGAA AACATCGGAA AAACTTACAA CACCAGTAAT 1260 

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TAATACAAAT AATAAAGTTA CTAGAAAAAT GAGTGTCGCT AAAGTTGTCA TCATTAGCAT 1380 

TCACCAGTCT TAAGGTTATG ACAAATACAT CGTTGGTTAG AGGTATGAAC CTTAGACAAG 1440 

TTATTAATTA CGGACTCAAA AATATTATGA TTgAGCTGGT ATAAATGTTT ATTTCCGATT 1500 

TTTCGTGTCG TAACTAAGTT GGTTTTTACT AATGCTTTCA TATGrTAGCT AAGTGTAGGT 1560 

TGAGAGAATT GAAAATGTGC TAACAAATCA CAAGCGCATA ACTCTCCACA AGAAAGTAAA 1620 

TCTAGTATTT CTAATCTGCT TGAATCTGAT AAAACTTTTA AAAATGTTGC TAGTTCTTTA 1680 

TACGTCATAA CATACCTCCT AGACXHTAAA TAGATTATCA TCTATATAGA TGAATGTCTA 1740 

TGTTCCTTTG GTATATTACA CGATATGACT ATGTAATTTA AATTTGGTTT TAGTATTAAA 1800 

AGGGTATTAA AGATAAATTA TAGATATTGA TTTTGCAAAA TATACTCTTT GTTCTGCATT 1860 

GAAAAAGG 1868 
(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15249 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 

ATTTATGAAA TCCATAGCnA TAAACATTAT TCTTGCATCG GCTATACAAA CAGTTACCGC 60 

AAGCAAATTT GTATATCAAC CTGGAATTGT GTTCACGTCA ATGGCaAATG CCGATGATGT 120 

GTTATCAGGC GATAGTTATT TTATGGCTGA ATTAAAATCT ATTAAGCGTA TTGTTGAAAT 180 

TCCAGATAAT CAAAAAATAT ACTGCTTTAT AGATGAAATT TTTAAAGGTA CCAACACAAC 240 

TGAACGAATT GCCGCTTCAG AATCAGTACT ATCATTTTTA CATGAAAAAT CTAACTTTAG 300 

AGTTATTGCA GCAACACATG ATATTGAGTT AGCTGAACTC TTAAAACAAC GTTATGAAAA 360 

TTACCATTTC AATGAGGTAA TAGAAAATAA TAACATACAT TTTGATTACA AAATTAAGCC 420 

TGGCAAAGCA AATACACGTA ATGCCATCGA ATTATTAAAA ATCACTTCAT TTCCAGCAAA 480 

AATATATGAA CGAGCAAAAG ATAATGTCCC GAAAATTTAG CATTTAACTT TAAACATAAA 540 

AACGTCAGCT ATCACAT6AC AGAAGACTAT 6AACAGTTTC AATAATGTTC ATAGTAATCA 600 

TGTTAATAAC TGACGTTTAT TTTATTCTGC AGAATACTCT TCTAAATCTA TATTGCTGTG 660 

CCCATTTAAT GCTAAATCAG CAAATCGACC TTGCTGATAC AAATAGTGGC CGGCAACGCC 720 

TATCATTGCA GCATTATCTG TGCATAATTT AGGACTTGGG ATAGTTAATT GAATGTCATT 780 

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AACAATTAAT CGCTGAACAC CATATTCTTT ACAAGCTTGA ATAGCTTTAA ACGTGAGCAC 900 

CTCTACAACA CTGTnTGAA AGCTCGTTGC TACXITTAGCT TCAATGATTG GaATATTTTT 960 

TTGTCGTTGA TTGTGAAGTT GATTGATTAC GGCACTTTTC AACCCACTAA AACTAAAATC 1020 

ATAACTATCT TTATCCAACC AAACACGAGG GAATGAATAA GTATCTTCAC CTTCAGCAGC 1080 

CAACCGATCA ACTTGTGGAC CACCTGGATA ATTTAAACCA ATTGTTCGTG CCACTTTATC 1140 

ATAAGCCTCA CCTACTGCGT CATCTCGTGT TTCACCAATG ACTTCAAATG ATAAATGATC 1200 

CTTCATATAA ACTAATTCAG TATGTCCACC TGAAACAATA AGTGCAATTA GCGGGAATGT 1260 

TAATGGCTCT TCTATGTGAT TAGCATATAT ATGTCCTQCA ATATGATGAA CAGGAATAAG 1320 

TGGCTnATCG TAAGCAAATG CCAATGCTTT GGCTGCATTA ACACCTATTA GTAACGCACC 1380 

AATTAGTCCA GGGCCTTCTG TAACCGCTAT GGCATCAATA TCTTCTATTG ATACATCGGC 1440 

ATCCCCTAGA GCCTCGTTTA TTGTTGCTGT TATACCTTCA AC6TGATGTC TACTTGCCAC 1500 

TTCGGGAACG ACACCGCCAA ATCGTTTATG ACTTTCAATC TGACTTAAAA CTGTATTTGA 1560 

TAAAATATCT CTGCCATTTT TTATAACACT AACGCTTGTT TCATCACAAC TTGTTTcAAC 1620 

AGCTAGTATT AATATATCTT TAGTCATTTA AATTCACCCA CAT7ACCATT 6CGTCCTCAC 1680 

CTTCACCATA ATAATTTTTA CGTTTACCAC CATATTGAAA TCCTAAATTT TCATATACAT 1740 

GTTGTGCCAC TTTATTATTA ACTCTTACTT CTAAACTCAT CACATCACAA GTGTGACTTG 1800 

CATAGTTTAT TCCGTATTTT AAAAGCATTT GACCTAAACC ATAGCCTCTA TAATTATCAT 1860 

CGATTGCAAC TGTTGTAATT TGAGCTTGAT CGATAACAAT CCATAAACCT AAATAACCAA 1920 

TAATTTGTTG TTCAAATTCt AAGACAAAAT ATTTCGCAAA GTTATTTTGC TCTATTTCAT 1980 

GATAAAATGC GTCAATTGTC CAAGAACTGT CATTGAAACT CCGACGCTCA AGATCAAAGA 2040 

CTTQTGGCAC ATCTTCTTTA GTCATCTCTC TAATGTTTAA TTGTTCTTTT GACTGTTGAT 2100 

CCAATTTCGT TCCGCCTCAG CTAATTTATG GTATTTAGGA GTAAATGTAT GTACGTCTGA 2160 

AGGTTTATCT AGCAATTGAT ACATGACTGA TGCATTTGGT AGctGCGCAA TCACTTCACC 2220 

TTGTAATTCA TCTTGTAATT TTACAGTATC TTTCCCAATA TAAATAAATG 6TTGGTTTAA 2280 

ATCTTCTAAA AAAGCTCGCA ATGCCTCTAT CGACATATAT TGATCTTCTA AAATAGTCAC 2340 

TAATTGACCA TTTTGCCACT GGAATATGCC TGTATAAACT GCTTGTCGTC TTGCATCAAA 2400 

CACAGGAACC AATAATTTAT CAGTATGATC GATTGTTGCT GCCAATGCCT TTAATGATGA 2460 

AACACCATAT AATTTAACAT CTAACGCATA OGCTAATGTT TTAGCAACAG TAACACCGAT 2520 

ACX3TAAGCCA GTATATGAAC CAGGACCTTC AGCAACAATA ATCGCATCTA ATTGCTGTTT 2580 






TTGTTTAGAA TCCGTAGTTA TTTCAGCTAA AACTTCATCG TTTTGCATCA ATGCTACTGA 2700 

TAATGGTTGA TTCGATGTAT CAATGAGCAG CGAATTCATG GATAATTGCC TCCTTAATTT 2760 

GTTCATAATG TTCTCCTTGC GCGAACAACT CAATTTGTCT TGTATTTTCA GATATTGTTG 2820 

AAATGTTAAT AGATAAATGC GTCGCTGGAA GTAAATCTTT TATAAATTGA CTCCATTCAA 2880 

TAACAGTAAT TGCCTGATCT TCGAAAAATT CATCAAATCC TAAATCTTCA TCAGAATCTT 2940 

CTAAGCGATA ACAATCCATA TGATGCAATT TTAAATTTTT ACCCCTATAT GATTTAATGA 3000 

TGTTAAATGT CGGGGAATTA ATCGTACGTC TTACACCAAG AGCTTTTCCT ATAAATTGCG 3060 

TTAACGTTGT TTTACCTGCT CCTAAATCTC CGTTAAGTAA AATCAAATCA CXACTTTTCA 3120 

ATTGCTCAAC TAAAAATATA GCAAATT6AT TCATTTCATC TAAATTATTT ATCTTTATCA 3180 

ATGTTGATTC TCCTATATTA TGCTTTTCAT TCATAAAAAT GATTATCCAT TGTTCAATCG 3240 

TATCTAACTT TATATTTAAC CTTTATATTG TAACAAATTT CAACTTAAAT TTCTTATCTT 3300 

TGAAACAGAT TATCTATTCA AAGTTAATTG TAAGAAAATT TAAAATATTT GTTGACATAC 3360 

TAAAGCAGAT ATAGTAAATT AAATTTATCA AATTTTTAGA CAATTCTAAC TATTAAAGTG 3420 

ATATATACCA TTCACGGAAG GAGTATAATA AAATGCTTAA TCAATATACT GAACATCAAC 3480 

CGACAACTTC AAATATTATT ATTTTATTAT ACTCTTTAGG ACTCXIAACGT TAgTAAATAT 3540 

TTACTAAACG CTTTAAGTCC TATTTCTGTT TGAATGGGAC TTGTAAACGT CCCAATAATA 3600 

TTGGGACGTT TTTTTATGTT TTATCTTTCA ATTACTTATT TTTATTACTA TAAAACATGA 3660 

TTAATCATTA AAATTTACGG GGGAATTTAC TATGCGAaCG AgcATGATCA AAAAAGGAGA 3720 

TCACCAAGCA CCAGCAAGAA GTCTTTTACA TGCCACGGGC GCGCTAAAAA GTCCAACTGA 3780 

TATGAACAAA CCATTTGTAG CTATTTGTAA CTCTTATATT GATATTGTTC CTGGACATGT 3840 

TCAQTTGAGA GAGCTTGCAG ATATAGCTAA AGAAGCAATT AGAGAAGCCX3 GTGCCATTCC 3900 

ATTTGAATTC AATACAATTG GTGTTGATGA TGGAATAGCT ATGGGACATA TCGGAATGCG 3960 

ATATTCTCTA CCATCACGTG AAATTATTGC AGATGCAGCT GAAACTGTAA TTAACGCTCA 4020 

TTGGTTTGAC GGCGTATTTT ACATTCCTAA TTGTGACAAG ATTACACCCG GTATGATTTT 4080 

AGCAGCCATG AGGACAAACG TACCAGCTAT CTTTTGCTCT GGTGGACCAA TGAAAGCTGG 4140 

CTTATCTGCA CATGGAAAAG CATTAACACT TTCATCAATG TTTGAAGCAG TCGGCGCATT 4200 

TAAAGAAGGA TCGATTTCTA AAGAAGAATT TTTAGATATG GAACAAAATG CCTGCCCTAC 4260 

TTGTGGTTCA TGTGCTGGGA TGTTTACTGC AAATTCAATG AACTGTTTGA TGGAAGTTTT 4320 

AGGTCTAGCA TTACCATACA ACGGTACTGC ACTTGCAGTC AGTGATCAGC GACGAGAAAT 4380 
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

TATCGTTACT CGCGAAgCAA TTGATGATGC ATTTGCACTT GATATGGCTA TGGGTGGTTC 4500 

AACAAACACG GTACTGCATA CGTTAGCCAT TGCCAATGAA GCTGGTATTG ATTATGACTT 4560 

AGAGCGCATT AATGCTATTG CCAAACX3CAC GCCATATTTA TCAAAAATAG CACCTAGTTC 4620 

ATCGTATTCA ATGCATGATG TGCATGAAGC TGGTGGCGTC CCAGCAATTA TTAATGAATT 4680 

GATGAAGAAA GATGGCACGT TACACCCAGA TAGAATCACA GTTACTGGCA AAACGTTACG 4740 

TGAAAATAAC GAAGGCAAAG AAATTAAGAA CTTTGATGTC ATTCACCCTC TTGATGCACC 4800 

ATATGATGCA CAAGGCGGTT TATCTATCTT ATTTGGTAAT ATCGCCCCTA AAGGCGCAGT 4860 

TATTAAAGTT GGCGGCGTTG ATCCATCTAT CAAAACATTT ACTGGGAAAG CAATTTGTTT 4920 

CAATTGGCAT GATGAAGCTG TTGAAGCAAT AGACAATCGT ACCGTTCGTG CAGGCCACGT 4980 

CX3TTGTCATT AGATATGAAG GACCTAAAGG TGGACCAGGT ATGCCTGAAA TGTTAGCACC 5040 

TACTTCCTCT ATTGTTGGTC GCGGCTTAGG TAAAGATGTT QCATTAATTA CTGATGGGCG 5100 

TTTTTCOGGT GCCACAAGAG GTATTGCAGT TGGTCATATT TCCCCTGAAG CTGCATCTGG 5160 

TGGACCAATT GCCTTAATTG AAGATGGTGA TGAGATTACT ATTGATTTAA CAAATCGTAC 5220 

ATTAAACGTA AACCAGCCTG AAGATGTTCT AGCGCGTCGC CGAGAATCTT TAACACCATT 5280 

TAAAGCGAAA GTAAAAACAG GTTATCTAGC TCGTTATACT GCCCTAGTAA CTAGCGCAAA 5340 

TACAGGTGGC GTCATGCAAG TCCCTGAGAA TTTAATTTAA TTTATTTTTA TATTGGAGAT 5400 

GGTTAAAATG TCTAAAACTC AACATGAAGT AAACCAAAAT ATTGACCCTT TAAAAATGGC 5460 

TGAATCACTT GAACCTGAAC AACTAAATGA AAAAACTTTA AATGATATGC GTTCAGGATC 5520 

AGAAGTGCTA GTAGAAGCTC TACTTAAAGA AAATGTGGAT TATTTATTCG GTTATCCTGG 5580 

TGGTGCCGTA CTACCTTTAT ATGACACGTT TTATGATGGT AAAATCAAAC ATATTTTAGC 5640 

AAGAt^ACGAA CAAGGTGCTG TTCATGCTGC AGAAGGTTAT GCACGTGTAT CTGGTAAamT 5700 

GGCGTCGTTG TAGTTACAAG CGGTCCaGGT GCAACTAATG TAATGACAGG TATTACGGAT 5760 

GCACATTGCG ACTCTTTACC TCTAGTTGTA TTCACTGGAC AAGTTGCTAC ACCAGGCATT 5820 

GGTAAAGATG CATTCCAAGA AGCGGATATT CTATCTATGA CTTCACCAAT TACAAAACAA 5880 

AATTATCAAG TGAAACGTGT TGAAGATATC CCTAAAATCG TACACGAAGC TTTCCATGTA 5940 

GCTAATTCTG GACGCAAAGG TCCTGTAGTG ATTGATTTTC CAAAAGATAT GGGTGTTTTA 6000 

GCTACAAATG TGGATTTATG CGACGAAATC AATATTCCAG GTTATGAAGT TGTTACAGAA 6060 

CCAGAAAATA AAGACATTGA CACTTTCATC TCACTTTTAA AAGAAGCGAA AAAGCCTGTC 6120 

GTATTAGCCG GOGCAGGTAT TAATCAATCA AAATCAAATC AATTATTAAC ACAGTTTGTT 6180 

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

GATACACTAT TTTTAGGTAT GGGAGGAATG CATGGTTCTT ATGCTAGTAA CATGGCATTA 6300 

ACTGAGTGTG ATTTACTCAT TAATTTAGGT AGCCGCTTCG ATGATAGATT AGCAAGCAAA 6360 

CCTGATGCCT TTGCACCTAA CGCCAAAATT GTACATGTAG ATATTGATCC TTCAGAAATC 6420 

AATAAAGTTA TTCATGTAGA TTTAGGTATT ATTGCAGACT GTAAAAGATT TTTAGAATGT 6480 

TTAAATGATA AAAATGTTGA GACTATAGAA CACAGTGACT GGGTTAAACA TTGTCAAAAT 6540 

AATAAGCAGA AACACCCATT TAAACTTGGT GAAGAAGATC AAGTATTTTG TAAGCCACAA 6600 

CAAACAATCG AATATATGGG CAAAATTACA AATGGTGAAG CAATTGTTAC TACAGACXSTG 6660 

GGACAACATC AAATGTGGGC AGCTCAATrT TATCCATTTA AAAATCACGG ACAATGGGTT 6720 

ACAAGCGGTG GTTTAGGAAC AATGGGATTC GGTATTCCTT CGTCAATTGG TGCCAAATTA 6780 

GCTAATCCTG ATAAAACAGT CGTATGTTTC GTCGGTGACG GTGGTTTCCA AATGACAAAC 6840 

CAAGAAATGG CACTTTTACC CGAATATGGT TTAGATGTCA AAATCGTACT AATCAATAAT 6900 

GGAACATTAG GTATGGTTAA ACAATGGCAA GATAAGTTCT TTAATCAACG CTTCTCACAC 6960 

TCAGTATTTA ATX:»TCAACC TGATTTTATG AAAATGQCAG AAGCATATGG CGTCAAAGGT 7020 

TTCTTAATCG ATAAGCCAGA ACAACTGGAA GAACAATTAG ATGCAGOGTT TGCTTATCAA 7080 

GGACCAGCTT TAATTGAGGT TCGTATTTCC CCTACTGAAG CTOTAACCCC AATGGTTCCG 7140 

AGTGGCAAAT CAAATCATGA AATGGAGGGC TTATAATGAC AAGAATTCTT AAATTACAAG 7200 

TTGCGGATCA AGTCAGCACG CTAAATCGAA TTACAAGTGC TTTTGTTCGC CTACAATATA 7260 

ATATCGATAC ATTACATGTt ACACATTCTG AACAACCTGG GATTTCTAAC ATGGAAATTC 7320 

AAGTCGATAT TCAAGATGAT ACATCACTTC ATATATTAAT TAAAAAATTA AAACAACAAA 7380 

TTAATGTTTT AACGGTTGAA TGCTACGACC TTGTTGATAA CGAAGCTTAA TTTTAAGACA 7440 

aaggCaatga TGCGCTAATT AGTTATAGAT ATATCATAGG CXGCTAGTTA ACATCTGCCA 7500 

CTATTACAAA GTTATATTTC AGAATTTTCG AAACACAAAA TATTTAATTA TTTGGAGGAA 7560 

TTTATTATGA CAACAGTTTA TTATGATCAA GATGTAAAAA CGGACGCTTT ACAAGGCAAA 7620 

aaaattgcag TAGTAGGTTA TGGATCACAA GGTCACX3CGC ATGCACAAAA CTTAAAAGAC 7680 

aatggatatg ATGTAGTCAT CGGCATTCGC CCAGGTCGTT [illegible] AGCTAAAGAA 7740 

GATGGATTTG ATGTGTTCCC TGTTGCAGAA 6CAGTTAAGC AAGCTGATGT AATTATGGTG 7800 

CTATTACCTG ATGAAATTCA AGGTGATGTA TACAAAAACG AAATTGAACC AAATTTAGAA 7860 

AAACATAATG CGCTTGCATT TGCTCATGGC TTTAACATTC ATTTTGGTGT TATTCAACCA 7920 

CCAGCTGATG TTGATGTATT TTTAGTAGCT CCTAAAGGAC CGGGTCATTT AGTTAGACGT 7980 



CAAGCACGTA ATATTGCTTT AAGTTATGCA AAAGGTATTG GTGCAaCTCG TGCAGGTGTT 8100 

ATTGAAACAA CATTTAAAGA AGAAACTGAG ACAGATTTAT TTGGTGAACA AGCAGTACTT 8160 

TGCGGTGGTG TATCGAAATT AATTCAAAGT GGCTTTGAAA CATTAGTAGA AGCGGGTTAT 8220 

CAACCAGAAT TAGCTTATTT TGAAGTATTA CATGAAATGA AATTAATCGT TGATTTGATG 8280 

TATGAAGGCG GTATGGAAAA TGTACGTTAC TCAATTTCAA ATACTGCTGA ATTTGGTGAC 8340 

TATGTTTCAG GACCACGTGT TATCACACCA GATGTTAAAG AAAATATGAA AGCTGTATTA 8400 

ACTGATATCC AAAATGGTAA CTTCAGTAAT CGCTTTATCG AAGACAATAA AAATGGATTC 8460 

AAAGAATTTT ATAAATTACG CGAAGAACAA CATGGTCATC AAATTGAAAA AGTTGGTOGT 8520 

GAATTACGCG AAATGATGCC TTTTATTAAA TCTAAAAGCA TTGAAAAATA AGATAGACCT 8580 

ACAATGAGGA GTTGTTAAAT ATGAGTAGTC ATATTCAAAT TTTTGATACG ACACTAAGAG 8640 

ACGGTGaACA AACACCAGGA GTGAATTTTA CTTTTGATGA ACGCTTGCGT ATTGCATTGC 8700 

AATTAGAAAA ATGGGGTGTA GATGTTATTG AAGCTGGATT TCCTGCTTCA AGTACAGGTA 8760 

GCTTTAAATC TGTTCAAGCA ATTGCACAAA CATTAACAAC AACGGCTGTA TGTGGTTTAG 8820 

CTAGATGTAA AAAATCTGAC ATCGATGCTG TATATGAAGC AACAAAAGAT GCAGCGAAgC 8880 

CGGTcGTGCA TGTTTTTATA GCAACATCAC CTATTCATCT TGAACATAAA CTTAAAATGT 8940 

CTCAAGAAGA CGTTTTAGCA TCTATTAAAG AACATGTCAC ATACGCGAAA CAATTATTTG 9000 

ACCyTTGTTCA ATTTTCACCT GAAGATGCAA CGCGTACTGA ATTACCATTC TTAGTGAAAT 9060 

GTGTACAAAC TGCCGTTGAC GCTGGAGCTA CAGTTATTAA TATTCCTGAT ACAGTCGGCT 9120 

ACAGTTACCA TGATGAATAT GCACATATTT TCAAAACCTT AACAGAATCT GTAACATCTT 9180 

CAAATGAAAT TATTTATAGT GCTCATTGCC ATGACGATTT AGGAATGGCT GTTTCAAATA 9240 

GTTT^KSCTGC AATTGAAGGC GGTGCGAGAC GAATTGAAGG CACTGTAAAT GGTATTGGTG 9300 

AACGAGCAGG TAATGCAGCA CTTGAAGAAG TCGCGCTTGC ACTATACGTT CGAAATGATC 9360 

ATTATGGTGC TCAAACTGCT CTTAATCTCG AAGAAACTAA AAAAACATCG GATTTAATTT 9420 

CAAGATATGC AGGTATTCGA GTGCCTAGAA ATAAAGCAAT TGTTGGCCAA AATGCATTTA 9480 

GTCATGAATC AGGTATTCAC CAAGATGGCG TATTAAAACA TCGTGAAACA TATGAAATTA 9540 

TGACACCTCA ACTTGTTGGT GTAAGCACGA CTGAACTTCC ATTAGGAAAA TTATCTGGTA 9600 

AACACGCCTT CTCAGAGAAG TTAAAAGCAT TAGGTTATGA CATTGATAAA GAAGCGCAAA 9660 

TAGATTTATT TAAACAATTC AAGGCCATTG CGGACAAAAA GAAATCTGTT TCAGATAGAG 9720 

ATATTCATGC GATTATTCAA GGTTCTGAGC ATGAGCATCA AGCACTTTAT AAATTGGAAA 9780 
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AAGAGGGTCA TATTTACCAG GATTCAAGTA TTGGTACTGG TTCAATCXSTA GCAATTTACA 9900 

ATGCAGTTGA TCGTATTTTC CAGAAAGAAA CAGAATTAAT TGATTATCGT ATTAATTCTG 9960 

TCACTGAAGG TACTGATGCC CAAGCAGAAG TACATGTAAA TTTATTGATT GAAGGTAAGA 10020 

CTGTCAATGG CTTTGGTATT GATCATGATA TTTTACAAGC CTCTTGTAAA GCATACGTAG 10080 

AAGCACATGC TAAATTTGCA GCTGAAAATG TTGAGAAGGT AGGTAATTAA TTATGACTTA 10140 

TAACATTGTT GCCCTACCTG GTGATGGAAT CGGTCCAGAA ATTTTGAACG GATCTCTATC 10200 

ATTGCTTGAA ATTATAAGTA ATAAATATAA CTTTAATTAT CAAATAGAGC ACCACXaUVTT 10260 

TGGTGGTGCC TCTATTGATA CATTCX3GCGA GCCTTTAACT GAGAAAACCT TAAATGCGTG 10320 

TAAAAGAGCA QATGCTATTT TACTGGGTGC AATCGGTGGA CCTAAATGGA CAGATCCTAA 10380 

CAATCGACCA GAACAAGGAT TATTAAAATT GCGTAAATCC TTAAATTTAT TTGTAAATAT 10440 

ACGCCCCACT ACCGTTGTCA AAGGCGCTAG TTCTTTATCA CCTTTAAAGG AAGAACGCGT 10500 

TGAAGGCACA GATTTAGTTA TAGTCCGTGA ATTGACAAGT GGTATTTATT TTGGAGAACC 10560 

TAGACATTTT AATAATCACG AGGCCTTAGA TTCTCTTACT TATACAAGAG AAGAAATAGA 10620 

ACXSCATTGTT CACGTAGCAT TTAAATTGGC CGCTTCAAGA CGAGGAAAAC TAACATCAGT 10680 

TGATAAAGAA AATGTATTAG CTTCTAGTAA ATTGTGGCGC AAAGTCGTAA ATGAAGTAAG 10740 

TCAATTATAT CCAGAAGTAA CAGTAAATCA CTTATTTGTT GATGCTTGTA GTATGCATTT 10800 

AATCACAAAT CCAAAACAAT TTGACGTCAT CGTATGTGAA AACTTATTTG GCGATATTTT 10860 

AAGTGATGAA GCTTCAGTGA TTCCTGGTTC ACTTGGTTTA TCACCTTCTG CTAGTTTTAG 10920 

TAACGATGGT CCAAGATTGT ATGAGCCTAT TCATGGATCA GCACCAGATA TTGCAGGTAA 10980 

AAACX3TTGCC AATCCATTTG GAATGATTCT ATCTTTAGCG ATGTGTTTAC GTGAAAGCTT 11040 

AAATCAACCA GATGCTGCAG ATGAATTAGA ACAACATATT TATAGCATGA TTGAACATGG 11100 

GCAAACGACA GCAGATTTAG GCGGCAAATT GAATACTACT GATATTTTCG AAATTCTATC 11160 

TCAAAAATTG AATCACTAAG GGGGAGATGT AAATGGGTCA AACATTATTT GACAAGGTGT 11220 

GGAACAGACA TGTGTTATAC GGGAAATTGG GCGAACCGCA ACTATTATAC ATTGATTTAC 11280 

ACCTTATACA TGAAGTTACT TCTCCTCAAG CATTTGAAGG ACT^EAGGCTT CAAAACAGAA 11340 

AATTAAGACG CCCAGATTTA ACATTTGCAA CACTCGATCA CAATGTTCCT ACTATTGATA 11400 

TATTCAATAT TAAAGATGAA ATTGCAAACA AACAAATCAC AACATTACAA AAAAACGCCA 11460 

TAGATTTTGG GGTGCATATT TTTGATATGG GTTCTGATGA ACAAGGTATT GTTCACATGG 11520 

TAGGACCTGA GACAGGACTT ACACAGCCTG GCAAGACAAT CGTTT6TGGT GACTCTCACA 11580 






ATGTTTTCGC AACTCAAACG CTATGGCAAA CAAAACCCAA AAACTTAAAA ATCGATATTA 11700 

ATGGTACCTT ACCAACAGGC GTCTATGCTA AGGACATTAT TCTGCATTTA ATTAAAACGT 11760 

ATXKjTGTTGA CTTTGGTACA GGCTATGCTT TGGAATTTAC TGGCGAAACA ATTAAAAACC 11820 

TTTCAATGGA TGGTCXSAATG ACTATTTGTA ACATGGCTAT CGAAGGTGGT GCCAAATACX5 11880 

GCATAATCCA ACCTGATGAT ATAACATTTG AATATGTTAA AGGGAGACCA TTTGCCGATA 11940 

ACTtCGCTAA ATCAGTTGAT AAGTGGCGTG AgCTATATTC TGATGACGAC GCGATATTTG 12000 

ATCGTGTAAT TGAACTTGAT GTTTCAACAT TAGAACCACA AGTGACATGG GGAACTAATC 12060 

CTGAAATGGG TGTTAATTTC AGTGAACCAT TCCCTGAAAT CAATGATATC AACGATCAAC 12120 

GTGCGTATGA TTATATGGGG TTAGAACCAG GTCAAAAAGC TGAAGACATC GACTTAGGGT 12180 

ATGTTTTTCT CX3GTTCATGT ACAAATGCTA GACTATCAGA TTTGATTGAA GCTAGTCATA 12240 

TT6TTAAAGG AAATAAAGTT CATCCAAATA TTACAGCTAT TGTCGTACCA GGTTCTCX5TA 12300 

CAGTAAAAAA AGAAGCAGAA AAATTAGGTC TAGATACTAT CITTAAAAAT GCAGGATTTG 12360 

AATGGCGTGA ACCAGGATGT TCAATGTGTT TA6GCATGAA TCCTGACCAA GTACCTGAGG 12420 

GCGTACATTG TGCATCTACA AGTAATCGAA ACTTTGAAGG ACGACAAGGC AAAGGTGCAA 12480 

GAACACATTT AGTATCCCcT GCTATGGCAG CAGCAGCAGC TATTCATGGT AAATTTGTGG 12540 

ACGTAAGAAA GGTGGTTGTT TAAATGGCAG CAATCAAACC TATTACAACA TATAAAGGTA 12600 

AAATAGTCCC TCTCTTCAAC GACAATATCG ATACAGACCA AATCATTCCT AAGGTACACT 12660 

TAAAGCGTAT TTCAAAAAGT GGCTTTGGTC CATTTGCTTT TGATGAATGG CGGTACTTAC 12720 

CTGATGGTTC AGATAATCCT GATTTCAATC CTAACAAACC ACAATATAAA GGGGCTTCTA 12780 

TTTTAATTAC TGGAGATAAT TTTGGATGTG GTTCAAGTCG TGAACATGCT GCTTGGGCTC 12840 

TTAX^GACTA TGGTTTTCAT ATTATTATTG CAGGAAGTTT CAGTGACATA TTTTATATGA 12900 

ATTGCACTAA AAATGCGATG TTGCCTATCG TTTTAGAAAA AAGTGCCCGT GAACATCTTG 12960 

CACAATATGT TGAAATTGAG GTCGATTTAC CAAATCAAAC TGTGTCATCA CCAGACAAGC 13020 

GTTTCCATTT TGAAATTGAT GAAACTTGGA AGAATAAACT TGTAAATGGC TTAGATGACA 13080 

TTGCAATCAC CCTACAATAT GAATCATTAA TAGAAAAATA TGAAAAATCa CTTTAAGGGA 13140 

GTTGAATATT ATGACAGTCA AAACAACAGT TTCTACGAAA GATATCGATG AGGCATTTTT 13200 

AAGACTTAAA GATATTGTCA AAGAAACACC TTTACAATTA GACCATTACT TATCTCAAAA 13260 

GTATGATTGT AAAGTCTATT TAAAACGAGA AGATTTACAA TGGGTACGTT CTTTTAAATT 13320 

AAGAGGTGCT TACAACGCTA TTTCTGTTTT ATCAGATGAA GCTAAAAGTA AAGGTATTAC 13380 
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

AAACGCTGTT ATCTTTATGC CAGTCACTAC ACCTTTACAA AAGGTAAATC AAGTAAAGTT 13500 

CTTTGGAAAT AGTAACGTTG AAGTTGTACT CACTGGTGAT ACATTTGATC ACTGTTTAGC 13560 

TGAAGCTTTA ACTTATACAA GTGAACATCA AATGAACTTT ATAGATCCAT TCAATAATGT 13620 

TCATACAATT TCTGGACAAG GTACGCTTGC TAAAGAAATG CTAGAACAAG CAAAGTCTGA 13680 

CAATGTTAAC TTTGATTATC TATTTGCCGC AATTGGTGGT GGOSGTTTAA TTTCAGGTAT 13740 

TAGTACTTAC TTTAAAACCT ATTCACCTAC CACGAAAATT ATAGGTGTTG AACCTTCAGG 13800 

TGCAAGTAGT ATGTATGAAT CTGTTGTGGT AAATAATCAG GTAGTCACAT TGCCTAATAT 13860 

CGATAAATTT GTGGACGGTG CATCTGTAGC TAGAGTTGGC GATATTACAT TTGAAATTGC 13920 

AAAAGAAAAT GTAGATGATT ACGTTCAAGT AGATGAAGGT GCAGTTTGTT CTACGATTTT 13980 

AGATATGTAT TCAAAACAAG CAATTGTAGC AGAACCTGCT GGCGCATTAA GTGTAAGTGC 14040 

GCTTGAAAAC TATAAAGATC ATATTAAAGG TAAAACAGTG GTTTGTGTCA TTAGTGGTGG 14100 

TAATAATGAT ATTAATCGAA TGAAAGAAAT TGAAGAACGT TCATTACTAT ACGAAGAAAT 14160 

GAAGCATTAC TTTATCTTAA ATTTCCCTCA ACGTCCAGGT GCATTGAGAG AATTTGTAAA 14220 

TGACGTATTA GGACCTCAAG ACGATATTAC TAAATTTGAA TACTTAAAAA AATCTTCTCA 14280 

AAATACAGGT ACTGTCATTA TTGGTATTCA ACTTAAAGAT CATGATGATT TAATACAACT 14340 

CAAACAACGT GTAAAtCATT TCGATCCTTC CAATATTTAT ATTAATGAAA ATAAGAT6TT 14400 

ATATTCATTG TTAATTTAAC ACATAGTAAG AAAAACAGTC ATAAATTGAT TTCTAATTGA 14460 

AATCATCTTA TGACTGCTTT TTATTATACT TTACATTTCT CGTTTCGTCA GATTCAAACG 14520 

TTTTCACTTC GCCAAGCCAT CTTTCTTTGT GTTTGCTTTT aTTTTGACGT TTTAGACATA 14580 

AAAAAaGAGA CCTTGCGGTC TCAATGCGGC TCATCGCATC CACTTTTTGC CTGGCAACGT 14640 

TCTACTCTAG CGGAACGTAA GTTCGaCTAC CATCGACGCT AAGGAGCTTA ACTTCTGTGT 14700 

TCGGCATGGG AACAGGTGTG ACCTCCTTGC TATAGTCACC AGACATATGA ATGTAATTTA 14760 

TACATTCAAA ACTAGATAGT AAGTAAAAGT GATTTTGCTT CGCAAAACAT TTATTTTGAT 14820 

TAAGTCTTCG ATCGATTAGT ATTCGTCAGC TCCACATGTC ACCATGCTTC CACCTCGAAC 14880 

CTATTAACCT CATCATCTTT GAGGGATCTT ATAACCGAAG TTGGGAAATC TCATCTTGAG 14940 

GGGGGCTTCA TGCTTAGATG CTTTCAGCAC TTATCCCGTC CACACATAGC TACCCAGCTA 15000 

TGCCGTTGGC ACGACAACTG GTACACCAGA GGTATGTCCA TCCCGGTCCT CTCGTACTAA 15060 

GGACAGCTCC TCTCAAATTT CCTACGCCCA CGACGGATAG GGACCGAACT 6TCTCACGAC 15120 

GTTCTGAACC CAGCTCGCGT ACCGCTTTaA TGGGCGAACA GCCCAACCCT TGGGACCX3AC 15180 
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GTGGAACTT 

(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14051 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 

GTGGCAATAT TTCTAGTTCT CGTTTTGATA AGATTTTAAA AGGATCTGTT GTGTTTGCAG 60 

TGTCCTGATT TGAATTAGAT ACAAATTCAT TCACTAAAGA TGTTOTAAOT TTCATATCTA 120 

CATATGTTTC ACCTTTATAT ACAGTTCGAA TAGCTAACAA TAATTGTTCA TCAGGTGCAT 180 

TTTTCAATAT GTAACCTTTC GCACCATTAC GCAACACATG GAACAAATAC TCCTCATCAT 240 

CAAACATTGT TAATATTAGT ATTTTAGTTT CAGGAAAACT GTCAGCAATT TTACTCGTAG 300 

CGATAAGACC TGACTCACCT GGTGGcATAC TTAAATCCAT TAGTAACACA TCAGGTTTAt 360 

ATTCCATTAC TTTTTQGTAA GCTTCGACGC CATCTGCAGC CGTTGCAACA ACTTCCATAT 420 

CATTTTGATA ATTTAAAATC ATAGAGAACC CCGTACGGAC AACAGCGTGA TCATCGGCAA 480 

TGACTATTTT CAATTTTATT CCCCCAATGT ATGTTTCAAA TTGGAATGTT CAATGTAACA 540 

TTGGTACCCT CACCAATTTT CGTTTCAATA TTGACGCTAC CGCTGACTAA CTCAGCTCGC 600 

TCATTCATTC CATATAAACC GAGTCCAGAA CCTTTAGGCT TAGAACTTGG ATCAAAACCA 660 

TTTCCCGCAT CTATCACTTC TGCTACCAAA TGGCGCCCAG TTTGACGGAT ACCTACATTT 720 

ATTTCATTTA CATCAGCGTA TTTCAACGCA TTTAAAATAG CTTCTTGCAC TACTCGATAA 780 

ACAACCGTTT CAATATCACT ATCAAAGCGA GTATTTTTAA TATTTGATGT ATATATGATT 840 

TTTATTCCAT AATTTTCTTC AAACTGTTTA AAATATGATT TAAAAGCTGC TTCAAGGCCT 900 

AGATCATCCA AAGAAGCGGG TCTTAATTCA ACCGACATAT TACGTATATC ATCAATTAAT 960 

TTAGCGACAA TATATTCAAT ATTTTCTGCG TCTTCCAAAA GCTTAGTTGT ATCTTCTTGA 1020 

TATTTTAATA ATCTCAATTG AACATCTACA TTGAGCATTT CTTGAATCAC ACTATCATGT 1080 

AACTCTCTAG AAATTCGCTT TCTTTCATTT TCTTGGGCTG AGATTGTTTT ACGCATCATA 1140 

CGTTGTTGAT GCAATTTCTC TTGCTGTTCA ATTTGTGATG AAACATTTTG AAGCGTAAAT 1200 

GCATGAATTC CCCTGTCTTG ATCAATCAAC TGATATGTTG CTGTAAATGG CATCACTTTT 1260 

TGATCTTTCO TCTTCATAAA TACTTGGAAA TTCGTAGCTT GTACTTGCAT CGATTCTAAG 1320 
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ATCGCATTCG CCACAGCACT GTAATTATCT TCTTCAGATA ATATATCTTT AGCAGCATCA 1440 

TTCATTGCAA TAATTTTACC GTTATCATCA GCAAAAACTA TCTTTTCGAT TGAATGCTCA 1500 

TAATATTTTT TCAATAAAGT ATCTAACTGT ATACTGTCCT CATTAATCAT GACTTACACC 1560 

CTAATTCATC TCATTATTTA TCATCATTGA AAATACCAAA CTTACGTTGA ATATCATCAT 1620 

TATCAAATAT TTTTGGTAAA GGACGACCAT CTCTTTGACC AAATAATAGT ACGCCATACA 1680 

CTTGATTCTT ATACCAAAGC GGCACTGCTA AAACTGCTGT TAATGATTCG CTCAATAAAA 1740 

TTGGATAGTC AATCTTTTCT TCAGGCCCTA AAGCTAAACC AACATTGGCT ATTACCATAC 1800 

GCTTTCCTGT TTTCATAACA GTTCCAGCTA ATCCACXaVCC TTTTCTTAAA ATAATCAATT 1860 

TAAATCGATT ATTTTTATTA CCTGAAACAT AGTGCCATTT TATTGGAGAT GATGGTTTGT 1920 

TAGATTCATA GAAAGCGATT GCCGCAAAAT CATAACCCTC TTCTTTGCGT ATTTTATCTA 1980 

ATGTCTCTTG AAATCTACGA TCTTCAATTA TTGCTTCTGG TGTCAAATCC TTTCACCTCT 2040 

TATGCTTACA CTTTATTCTT ACGGTAAATA ATATATCTGC GATTTATATA TGTCAAAGGT 2100 

ACACTCCAAA CATGCACCAA ACGTGTAAAT GGCCAACAAG CCATAATAGT GAAACCTAAC 2160 

AATATATGCA TTTTAAATGC AATCGGCACA CCACTCATCA ATGACGCATC TGGTTTTAAC 2220 

ATAAATAATT GTCTAAACCA AATTGATAAT GAAGTTCTGT AGTTAAAGTC TGGATGTTGT 2280 

ATATTTGTTA CTAATGTTGC GTAACATCCC ATAAATACGA TAAGTAATAA TAAGAAATTT 2340 

ACAAATATAT CCGACGCTGA ACTTAATCTT CGAATACTTT TCGTAGTAAC ACGTCTCGCT 2400 

GTTAATAAAA ACATCCCTAT CAAAGTTATT ATACCAAAGA TGCTACCAAT ATAAACAGCG 2460 

CCTATATGAT ATAAATGCTC AGACACACCC ACTGCATCCA TCCATGGTTT CGGTATTAAC 2520 

AATCCAACTA CGTGTCCAAA AAACACTGGA ATAATACCTA AGTGAAATAA TAAACTTCCC 2580 

CACATCAACC TTTTTCTTTC TATTAATTCA CTAGATTTAG CTGTCCAAGA AAATTTATCA 2640 

TAACGATAAC GTGCAATATG ACCTGCGACA AAGACAACTA AACATAAATA CGGAAATATA 2700 

ACCCATAAAA ACTGATTAAG CATGATGTTT CACTCCTTTT GGTGATGTCA AACATAATTT 2760 

CAATGTTTTT CTAAGTGCTT GAATCACATA GGCATATGGA TTGTTATCTT CACCAAGTGC 2820 

ATTCGCCATC ACATATGTTC CATCCTCAAT AATCATAATG ATTAATTGAA TATTCTCTTC 2880

AGCTCTTGGA TCATTTCGCC ATTCTGCCAC TTGCAAAAAT TGAAGCATCA ACX3GTAGATA 2940 

ATCAGAAAGT TCATTATCTA CCATTTCTAG TCCAAACATT TCATATAATA CCTTTAATTT 3000 

AGCTAACATT TGCCCACGTT CTTTTTGCGT ATCAAATTTG TTATACGTCA TATATAATGG 3060

TGCTTTTTTC GTAAAATCAA ATGTATCTGT ATAAATCGCT TTGATTTCTG ATAATGAAAA 3120 
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TGTrrCTTCA AAAGTTTTTG GATGAAAAGT TAATTTTTCT GGAAAACATA ACTGTTGTGC 3240 

CATATATCCA AAACTTTCTT GATATTTTTT AAAATTATCG AAATTAATCA CGGAAAATCC 3300 

CTCCATAGAA ATTCTCATTA TAAATTTCTT GACCAGTTTT CCCTGAACCT ACTGCAACGC 3360 

CACAGCCTTC ACAGTTATCT CCAAAATGCT CX;CCGCCGTA ATTGTATCCT GTACTACCTT 3420 

GTGCGTGATA CGTATCTAAA TAGGTTTCTT TGTGTGATGT TGGAATAACA AATCGATCTT 3480 

CATATTTGGC TAGTCCTAAT AAACGATACA TGTCTTTAGT TTGGCGCTCG GTTATACCTA 3540 

ATCGCTCTAA TCGAGACGTG TCAAATGGCT GTTGAGTAAC TTGAGATCTC ATATAACTTC 3600 

TCATCATTGC CATACGTTGT AGGGCTCCTT TTACTQGCTC TGTATCTCCT GCAGTGAAAA 3660 

TATTAGCTAA GTATTCAATA GGTAAACGCA TTTCTTCAAT GGCTGGGAAA ATCGCATCTG 3720 

GATTTTGAGT TGTATTTTTA CCTTCAAAAT AGCTCATAAT TGGGCTAAGT GGTGGGCAAT 3780 

ACCAAACCAT CGGCATCGTT CTAAATTCAG GATGTAACGG AAATGCAAGT TTATATTCAA 3840

TTGCTAACTT ATAAATTGQA GAGTTTTGTG CAGCTTCAAT CCAATCGTAA CCAATACCAT 3900 

CTTTTTCAGC TTGAGCAATG ACTTCTTCGT CAAATGGGTT TAAGAATATA TCTAATTGTT 3960 

TTTCATATAA ATCTTTCTCG TCTACTGCTG AAGCTGCTTC ATGAACTCGA TCTGCATCAT 4020

ATAATAAAAC ACCTAAGTAA CGCATACGTC CTGTACAAGT TTCAGAGCAT ACCGTAGGCA 4080 

TACCCGCCTC GATTCTCGGG AAACAGAAAG TACACTTTTC AGCTTTGTTC GTTTTCCAAT 4140 

TGAAGTAAAC TTTCTTATAT GGACAACCTG TCATACAGTA ACGCCATCCA CGACATGCGT 4200 

CTTGGTCAAC TAATACAATG CCATCTTCAT CACGTTTATA CATAGCACCT GAAGGACACG 4260 

ATGCAACGCA ACTTGGATTC AAGCAATGTT CACATAAACG TGGTAAATAC ATCATAAAAG 4320 

TTTCGTCAAA TTGGAATTTA ATATCTTCTT CTATTTTTTG GATGTTAGGA TCTTTTGGAC 4380 

CTGTAACATG ACCACCTGCT AAGTCATCTT CCCAGTTAGG TCCCCATTCA ATTTCAATGT 4440 

TATCCCCCGT AATTTCTGAA TACGCTCTAG CAACTGGCGA ATGCTTCCCT GATTTCX5CAG 4500

TTGTTAAATG TTCATAATTA TAGTTCCATG GCTCATAATA ATCTTTAATT AATGGCATAT 4560 

CTGGGTTATA AAAAATTTTA CCTAAAGCAA TTTTTGAAAT TCTACTTCCA GATTTTAATT 4620 

CAAGTTTCCC TTTAC6ATTT AGTACCCAAC CACCTTTGTA GTGTTCTTGG TCTTCCCAAC 4680

GTTTCGGATA CCCTACACCT GGCtTCXSTTT CTACGTTGTT GAACCACATG TACTCAGCAC 4740 

CTGGACGATT TGTCCaAGTG TTTTTACATG TCACACTACA CX3TATGGCAT CCTATGCATT 4800 

TATCTAAATT TAATACCATC GCAAcTTGCG CTTTAATCTT CAAGOCAATT AACCTCCTTC 4860

ATCTTTCTAA CTGCTACATA TAAATCCCTT TGGTTCCCAA TTGGTCCATA ATAATTAAAG 4920 
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GGCGCGTTGT GTGAACCACC ACGTGTATCT 
TTATCTTGTG CATGATACAT AAACATTGTA 
CTTGCCGTrA CAACACCATT ACGGTTATAC 
TGTTTTTCAG CATCTTCATT TGATATCCAA 
ATATGCTTAT TATCTTGATA CATTGAGTGT 
CGCAgTACCA AAGCATCTGT ACCACCTTTA 
GGCGGCAATG TCGGTTTATA TACTGGTAAG 
TCCACATAAT AACTTTGACG TCCTGTTAAT 
GTTGTAAATG GTGAATATCG TCGACCTTGT 
ATTACTTCTC GTGGTTGTGA AGTTATATTT 
CTAGAAATAT CTTTTAACGG CATTCCAGTT 
GATAATTTAC CATTCGTAGC AGATGAAATA 
GTATCAATAC GTGGACGATT CGCTCTCACA 
AACA7ACTTT TTAATTCTTC ATATTGTTCA 
ACTTTCCCTT TTTCAAGTAC AGGACCAAGC 
TCGOGTTCTA CAATTGCAAA GTTAGGCATT 
TTCGACCAAT CTTTCACTAC GCCGTATGGT 
AGTGGAGTTG TCACAACATC TTTAAACX5TT 
AATGCTTTTG CCAACGTTTT ATAAATATCC 
ATGGCAGGAT TGAAAGGATG TACATATGGA 
TCATACCAAG TCGCTGCCGG CAAAACAATG 
AAGTCTAAAG AGACCACTAA ATCTAACTTA 
TCTGGCTTTT CATCTTCATT TGGTGTAGCT 
TTCATAAAGT ATTCTTGACC TTTTGCAGAA 
AATGATTTTG GATGATTCTT TTTCAAATCA 
GATTTCACTT CATCAATTGC ACGTTGCAAA 
TTAGCTTCTT CTGCAAACAA CAAACTATTT 
CCAAGTCTAG CTGCTAAAAC ATTATAATCA 
TTAGCTAATG GAGATTTTAA ACGATCTACA 



GTAATTTCTG ACCCAGGCGT TTGAATATGT 
CCTTTAGGCA TACGATGCGA AATAACTGCT 
ACTTCTAGCC AATCATTATC TTGGATATCG 
ACCGTTGGAC CACCTCTAAA TAGTGTCAAG 
ATATTCCATT TTCCATGAGG CGTTAAATAA 
ATTTTCTTAT CTCTATTCCC AAATACCATT 
CTCTCCCCAA ATTGTTGGAA AACTTCGTGA 
GTTCTAAAAG GTACTAGACG TTCTATATTC 
TTATTTGAAC CTGGGAATAC TGCTGTCGGT 
AAAAACGAAA TTTTCTCAGC AGCGCGTTCG 
TGTTCTTCGA GATCTTCATA TGATTTTTGT 
CTTAGTATTG CATCAGCTAC ATTACGTGCT 
GAATCATCAT TTGTATCACT CCACGTACCT 
CTGACACCGA AACTTACACC ATGTGCTCCA 
GTGACATATT TGTCGTAAAT TTTAGTGTAG 
GTACGTCCAG GTACCGCTTC AATTTCACCC 
GTTGAAATTT CTTGCTTTGT ATCATGACTA 
CCAGGTAAAT AGTCTTTTGC CATTTCTGAA 
CAGTCTGAAC GCGATTCCCA TAACGGATCA 
TGCATATCOG TTGATGATAA ATCATGTTTT 
TCAGAATATA ACGGTGTTGC CGTCATTCTG 
CCTGTTGTTT CTTCACGCCA CGTAATTTCT 
AATAACCCTG ATTTTGTGCC AAGTAAATGC 
CTTGAAATTA AGTTTGAACG CCATATAAAT 
GGATCTTCTA TTGCAAATTG TGTTTGTTTT 

ATCGCTTCAT TTGAATCTAT ACCTTCATCT 


TTATTAAATT GTGGATATGA TGGTAACCAA 
GCTGGATGTT GATGCTTTAA CTCCTCTGTT 


TTTGACTCTT CATATTTCCA TTGGTCTGTT 
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AATGCGACAG TACTCCATCC TTCAATCGGA 
CAACCGCCAC CATTCACACC TTGACAGCCA 
TAAATCGTAT CTGAGTTAAA CCAATGGTTA

CCTTCAGTAT CGATAGCGTT TTGCGCAAAT 
TTTACGCCTG AAATGGCTTC TTGCCAAGCA 


CCTTTTGATT CTAATTTATG ATCAAAACGA 
AAAATTGTAG CAATACGGAC TTTGTCACCA 
GGACGATTGA ATATCCCATC TCCATCACTA 
TCGTATCCAC CrrCTGTCAT TGATAATGTA 
AGTTTTAAGT TCCACTTCTT ACCTTCTTCC 
ACTAAACTAT CGCTGATTGC ATCATGAATA

TGACCTAAGT CACTCQCTCT TAAAAATCGA 
AGCATGATAA GAAACGGCAT ATCTGTATAT 
GGTTGATTAA CATAATGTTC TTGTAAAATA 
TCTGAACCAG GATTCGGTGC TAGCCAGTTA 
GGTGCTACTG AAATGACTTT TGTACCTTTA 


GGAGTACGTG TTAAAGGTAC ATTAGAGCCC 
TCACTTGATT CAGGCACATC TGTTTGCTCT 
TCTGCATACC AGTCATAAAA ACTAAGCATT 


CCTGCTGCAT AACTAATCAT TGACATCGCT 
GGACCATATT TTTTTATTGT ATACAGTAAT 

CAATTTGAAC GCACGTGCCC TCCCATACCT

TCATTTTCAA CAATAGACGC CCATGCAGCA 
TCAGTCCATA AATCCCAGAG TTTTCCACGA 

TATTCATACC AAGAGAATGA CGCACCTCGT

TCCGGACCAC AACTTGGATA GTCAGTTTGT 
ACAAATACTT TCCAAGAACA TGAGCCTGTA 


TTATCGTGGC TCCAACX5TTC TCTGTACATT 
ATCGACCAAT TCCCATTAAA TTTTTCTGTT 




CGACATTTTT CTTGTCCCAC ATAGTGAGCC 6840 

CATAACATAA CTAAGTTTAA GATTGAACGA 6900 

ATACCCGCAC CCATGATAAT CATTGAACGC 6960 

TCTTTCGCTA CTTGAATGAC AACACTTTGT 7020 

GGTGTATATT TTGATTCTGC ATCGTCGTAT 7080 

CGCACGCCAT ATTGACTTGC CATTAAGTCA 7140 

TTTGCTAAAG TGACTTGTCG AGTTGGAATT 7200 

TCAAAGTATG GGAATTGAAT TGTTTCTAAT 7260 

GGGTTAATTT TAGAACCATC TTCTGTTTCT 7320 

CAACGTTGAC CCATTGTGCC ATTAGGTACT 7380 

ACTGGCTTCC ATTCGCCTTG CTCTGTTGTT 7440 

CCCGCTTTAT ATCCATTTTC ATCTTCATCC 7500 

TGTTTAGCGT AATTTATAAA GCGTTCATTA 7560 

ACATGCGTCA TTGCTTGTGC AATTGCA6CA 7620 

TCTGCAAATT TCACATTTTC TGCGTAATCT 7680 

TAGCGGACTT CAGTCATAAA ATGTGCATCC 7740 

CACATAATAA TGTATGATGC GTTATACCAG 7800 

CCCCAAATTT GTGGAGAGGC AGGTGGTAAA 7860 

TCACCACCAA GCAAATTGAT GAATCGAGCA 7920 

GGAATAGGTG TAAATCCTGC GATTCGATCT 7980 

TGTGCTGCGA TTATCTCTGT AACGTCTTTC 8040 

CGGGCTTGCT TATATTGTTT GGCTTTGTCT 8100 

ACGCGATTAC CATTGTTTTC TTCTAATGCT 8160 

ATATATGGAT ATTTGATTCG AAGCGGACTG 8220 

GGACATCCTC TCX3GTTCATA TTCAGGCATA 8280 

TGATTTTCCC AGGTAATCAC ACCATTTTTC 8340 

CAGTTAACAC CATGTGTTGT TCTTACTTCT 8400 

TTTTCCCATT CTCTACTTTT ACTTTCTAGG 8460 

GGCTTAAAGA AATTCAATCX: AAATTTTCCC 8520 
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TAAAATGCCC 
TAGGGAATCA 
ATACACAATC 
GACAAATAAG 
TTTTTATAAA 
TTACATTTTC 
TGCCATTCGA 
CACAGAGCAT 
CATATGATTT 
GTTTCGCCTC 
CAACAAACTC 
TGTATTGTTG 
ACGTGCCACC 
GACCTGTAGA 
TAGCAACTGc 
CATTTAGTAT 
GTCTAACAAC 
GCTGAATATG 
CATATTGTAA 
TTAAACGCTC 
AAACeTTGCC 
ATAAACATCT 
TTGATCGTGG 

CCCTACCTCT 
AGTTGTCACT 
TCTACGTGCT 
AGCATCCATA 
TACATTTTCA 



AAGACTATTG 
CCTAATTACT 
ATGTATGGTC 
GCTTCAACAC 
TTAGTGACAT 
TTGCTGCTTA 
TGTATCATTT 
AAGGGCGTCA 
AGGAGTGTTT 
AATTGAATAG 
GTTAAAACAC 
TATCTGTGTG 
ATTCACTAAG 
AAATGTCACA 
tGCGCTCGCT 
GTCGACTTCT 
CTTGTTATAT 
TTTTGCATAC 
AATTAACGGA 
AGCCTTTTTC 
ATATTCCTCT 
ACACAACCTT 
AGCGGGCAAA 
GGACAGATAT 
TTGCCTTTGA 
TTAATTTTTT 
TGAGCATTGC 
ATGCGTTCAA 
AATCCAAGTC 



CTTTAATTAG 
TAAGGAATTT 
ATGCTTATTG 
GAATATATTC 
AACACTGTAT 
ATAAATGCAT 
GGGTTTAGCA 
TGTTTAGAAC 
TCAGTATAAT 
TGTTGCAATT 
GATGCTTGAA 
ATAATTTTCC 
GAATTGACAT 
CTTTTAGCAA 
GATGTCACCC 
TCTTGCACAC 
CGACGCGCTG 
GGCTTTTTAC 
TTCACTAATC 
GTCAAATAAT 
ACAGACATAT 
CATCAGGTTC 
ATACATATTC 
TGTGAATCGC 
CAATAACCTT 
CTTTTGTTTC 
TAGTTATTGC 
ATAGTTCATT 
TTCTTAACCA 



ATTGTACATT 
CCCTATCAAT 
CCAATCTAAA 
TCTCGGTTGA 
TAGCATCTGC 
CATAGTAATC 
AACAGCCATA 
CACTTACTAC 
CAACAACTTC 
TTGAAATAAT 
AGACAATCGC 
CTAAACGTTT 
CCACTTCATT 
CTGTACGCAT 
CTGGTACAAT 
GACCAAATAT 
CTTCCACGAT 
CAACATCGAT 
GATCATATAG 
TCGGATTACC 
ATATACX3TTC 
TTGAACAATA 
CCCACTCACT 
ATGAATTTTG 
TTTTCCAATT 
CATGTATTAC 
TTCCCAAGGT 
TTGTCTTTCT 
TGGCGCTGTT 



TTTTCACAAA 
AACGGGATTT 
TCGTTCAAAT 
AACCTTACTT 
ACGATCGGTT 
ATATTGCGAC 
ACCTTCGTCA 
ATAAAATTGC 
CCCTATAATA 
ATTACTTAAA 
TATCGGGTAA 
TACCCCCATA 
TTCTTCTQAA 
TGTCAAACCT 
TTCAAACGCA 
CGCTGGATCG 
ACAGTCATTT 
AATTTCAGTA 
AATGACATCC 
TGGACCCGCA 
CCGTCTGTAA 
CCTGTATTTA 
GTCCCTTCAG 
CCACTTTCTG 
AGGGGTGTTA 
ACCTTCTCCA 
TCAGCTTCGA 
GGGTCAAGTA 
CTTTCAGCAT 



CATAAAATAT 
CATTGAAATA 
TTGGCACAAC 
ATTCATTTAT 
GAAATATATG 
GAATGATATG 
TATAAATGTT 
TTCATAGGAT 
CATATACCT6 
CGCCCCTTAA 
TCAATATCTG 
TAAATTGCTA 
TCTTTAAAGT 
GTCTGCATAG 
ATATGATGTT 
CCACCTTTAA 
ATTTTTTCTT 
GTCAAATTCG 
gCTTCACGTA 
CCTATCAAGT 
CTTCTACCTC 
AATCAATTTT 
ACAATGGTCC 
TTAAAAACAA 
ATTCATCTAT 
CTTCAAAAAT 
CTGCTTTTTT 
AGACTTCTTT 
ATATACCTGT 
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AGTTGTTAAA 
GAATCCATTT 
TGGGCAGCCT 
TTCTAAACGA 
AACACAGCTT 
GTCTTCCCAT 
ACCTGTCACT 
TTGGTCTGCA 
TTGAATATTC 
AT6TGGATAA 
TTTATTTTTA 
TATTTGCGTT 
TGCGGCAACA 
TTTATCACAT 
ATTTTCCGTA 
TTCATCATCA 
TGATACAAGT 
ACCATCATCG 
TTTTTTATAG 
TACAATTTGA 
TGATCCCTTG 
TTCATATAGT 
TGCATATACA 
ATCTAGACCT 
TAAGTCTGCC 
TAAGATTTCA 
TTTAAGCATA 
TAACACCGTT 
TCCTCCACCA 



AATTCAGCTT 
TCAACTGAGA 
GATACACCCA 
ATGCCAAGTC 
TTAACTGAGC 
ATATTTGGTA 
TTAACTAGTG 
TCTGTAACAC 
GCATGGTAAC 
ACCATGTTTA 
AAGTTTAAAA 
ACTATTTGAT 
AAGTCATCTC 
GAATTCCCCG 
ATCGCATTTA 
GCCATATCAG 
GTGTAATCTT 
ATATCACCAT 
TTATTATCAA 
CCAGCACTAT 
TATCCGTTCG 
GGTGCAACGA 
TTGCTATCAC 
GATTCTTTGG 
GGAATCTCGC 
GTTGTGTTGG 
TTTCCAGCTT 
ACTTCCATAC 
ATTACAATTG 



TTTCAACTTC 
TAATACCAAA 
TTTTGAATTT 
GTGTCGTGTA 
GTGTTTTCTT 
ATTCTTCTTT 
GCACATGATA 
CCCCACGCAT 
GTTCGTTAGC 
AATAATAGTT 
CATGTCGAAC 
CGCGTGTCAA 
CTAAGGTGTG 

crrrmTrrr 

CTATAGTACC 
CAATTGATAG 
CAGTGGATTC 
ATAGTACTGC 
CACTATTAAA 
ACAAGTCACA 
TTTCTTTATT 
GTCCATAAAC 
TTGTTTGCAT 
CTACTTCTGT 
GTCCATCAGC 
CTTGCATTTC 
TACXK3TCTAG 
CTTGATCTAA 

cmx-TriTT 



TGTACCACCA 
ATCTTTAACA 
ATGAGGTGTA 
TTGCGTACCA 
ACCATAAGCT 
TTTAACTCCA 
TTTCTTAGCC 
TTGAGGTATA 
AAATCTTGAT 
GATTGCTGGT 
TTCTTTAGAT 
ATCAGTACAA 
CTGCAATATT 
AGCCGTTACT 
TTTATCAACA 
CGATGCCTCT 
ACCTTTTTTC 
ACCAACTACA 
TATTTCAATA 
CCCAGAAACT 
TGTTAAATGA 
TTTGCCGTTA 
CACATCATTG 
GTATGGTCGT 
CAATTTAACA 
AAACTTCATA 
TTGCATTTCC 
TAAACCAGGT 
AGTCTTA6CA 



TTACCACCGA 
CCTGATTCAA 
TCGATGTATT 
AATCGACAAA 
GATGCTGAAC 
TACAAACCAA 
ACTTCTCCTA 
ACAGAAAATG 
TCTCTTTCAT 
CGACATTTTG 
GTTTTTAAAC 
CCACATATAC 
TGAGCAATTT 
TCTTCAACTG 
CCATTACAAC 
TCTCCACCTT 
ATCATGTTAT 
TTACCGTCTT 
CCTTTAATTT 
TTTAATGACG 
TCAGCTAATA 
TGTTCTGCAC 
ACAACAATAC 
ATACCTACTG 
CCCTCAACAT 
CCTTGCTTTT 
ATCAACCATT 
GCACACTCTA 
ATGTTCATCA 



TATAGATTTG 
CACAACTTCT 
CAAATGTTTT 
ACTCTTTACC 
GCATACCTAG 
CACGTTGTGA 
GACGAATCAG 
TACCATCATT 
CTTCATGATC 
GACATCCACC 
CTTTCGCTCT 
CAGCAGGTTT 
GCGGTTTACA 
TTGTAAAGCC 
CACAAATTGT 
TAGTAAGCAA 
AAAAGCGTGA 
TTAAAAAGAT 
CTGCATTTTC 
TAAATGTTGT 
CTTTACCTTG 
ATTCACCAAC 
CACGATTAAC 
CCATAACAAC 
CATCTTCTCC 
CTAGATCTGC 
CAGCTAAATG 
AACCTAGTAA 
TTTGTTCAGT 



10440 
10500 
10560 
10620 
10680 
10740 
10800 
10860 
10920 
10980 
11040 
11100 
11160 
11220 
11280 
11340 
11400 
11460 
11520 
11580 
11640 
11700 
11760 
11820 
11880 
11940 
12000 
12060 
12120 
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GAATGCTTTA GAACCTGTCG CAAAAATCAA 
AGTAGTAACT GATTGATTTG CTCTATCTAC 
GATACCATGT TCCTCATACC ACTCATATGG

ATTTTGTAAA ATATTTGAAA GCATGATGCG 
TACCXJTAATA TCATATAAAT CGTTGGCGCG 
CGCCATACCG TTACCAATCA TTACTAGTTT 
CCATAATATT TATTTCAAAA AAAGGTATTA 
GGAATCATTA AGCTTTCTAA TCTATCGTTA 


ATTGAAGGTG TGAAGTGTAT ATCTGTATTA 
TTGTTAACAA GTCTTCCGTC ATATAAAAAT 
TTC6AGATGC TTTCTAAATC ATGTGTAAAA
GTCGGCTTGC TAATTTGCAA ATTTTGAGCG 
AACTTTCCAT TAATATTGCC GTGTGCAACA 
GCTAATGCGT CACAAATACG TTGTTCAATT 
GGCTCGCTTA CTTCTACCTT TATGTCTGGA 
GGTATATCCT TGAGATAATG CATTGCACTA 


TCAACCCCAC TTTGAATCaA CXSTCGTCaTT 
AAAAACGCAA TATCATAGTG ATGTATATCA 
AACGCTTGaT TCTGTCGTCC GTGCCTCATG 


TTTACCAACC CTTTCACACG TATTGTATAC 
CATTATAATG TAAAATCAGG GAATTCCCTG 
TTTTCCCTTT TTGTTAAATC AAAAAAAGCG 
TTTGAGCAAG CATTAATATA TCGGTCGCTT 
TTGGCCTAAT ATTGTTTCGT CAAAGCGCTC 
TAAATCGCCA TCATCATTTT CATGTTCGCT
TTTAAGTAAC CACX3GATGCA ATCTTGCAGA 
CGTATCTCGC AAAAATGCTT CTTCAACATA 
TTCATACTCA GGATTTGTCG CAAACCACCA 
ACTTCCCCAA GGATATCTAA CCGTAATCGT 




TTTATCGTAT GATACTTCAA TACCATTTGC 12240 

TTCAATTACA GGATCATTTG TAATTAACTC 12300 

ATTCATAATT GTTTCTTCAA CTGTCATTTT 12360 

GTTATAGTTT GGATAAGGTT CTTTACCTAT 12420 

CTCTAATATT TCTTCGATTG TTCGAATGCC 12480 

TTGCTTTGCC ATAAAATATG CCCCTTTACT 12540 

ATTTTTCGTT AGTGCTTTTA TATTTTCATT 12600 

ATGATTTGCT TTAAAATTGG GTCGAAGTTA 12660 

ATAACCATGT CATTCATTTG CTGCTTCACT 12720 

AATGGTACGA CAATCAATTT TTGATACCGT 12780 

CTAATCTCTC CATATAGCGT TCTCGCATAT 12840 

CATAXTTGTA ACTCTTCGTG TGCCTTAGTA 12900 

ACCATAACTC CAACTTGTTG TTCGTCACCT 12960 

AATCGTCTCA TTAAAGGATG TGTGCCAAGT 13020 

TACCGTCGTT TCATTTCAT6 AACGATATTC 13080 

AAGATTAGCA ATGGTACAAT TTTAAAATGG 13140 

ACCGTCTCTA AATCCtGATG CTCACTTTCt 13200 

TCTTTTACTA ATTCAGAAAT AAATGCTTCT 13260 

CCATGTGCAA CAATGATATT CCCATTCACA 13320 

CAAATCATTT TGTTTTTGTG AAAAGAATCA 13380 

ATGCCTGTAG TCATGCATAT TCCTTATACA 13440 

ACCGATATAT GAATCCCTAC TCAACATTTA 13500 

GTAGTGTATA TTATTATCTT AAAATGGTGG 13560 

QGGTATCAAT ACTTTGCXSCA TGATCACACC 13620 

GTATATTTCA TAACCTCTTT TTTCATAAAT 13680 

TGTACCTAAA GTAACTGCCG CTGACTTTAA 13740 

AGTAAGTAAT TGGCTACCAT AGCCTTTCCC 13800

GACAAAAGGA TAACCCGAAA TACTTTTCAC 13860 

AGATATAATT TCATCATCAA TTGTCATGAC 13920 



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CCAATCAATA CCTAGTTCTC TTAGAgGCXST AAATGCTTCA TGCATGAGTT CTTGCAATTT 14040 
TTCTGCATCT T 14051 
(2) INFORMATION FOR SEQ ID NO: 104:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1885 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 

TAATCCTCAA CTTnGATTAT ATGGCTTGGG CGCATATGAA CTGCTTAGTT TAGTGTATGA 60 

CATTCATACA GTTCQCATGA CTATCATACA ACCTOGAATA GATAACTTTT CTACTGAAGA 120 

GTTACCAATC TCAAGATTAC TTCAATGGGG AACCGATTTT GTTAAACCCT TAGCCAGACT 180

TGCTTATAAC GGTGAAGGTG AGTTTAAAGC A66TAGTCAT TGTAGATTCT GTAAGATAAA 240 

GCATTCATGT AGAACACGTG CA6AATACAT GCAAAATGTG CCTCAAAAGC CACCACATTT 300 

GTTGAGTGAT GAAGAGATT6 CAGAACTTTT ATATAAACTG CCTGATATCA AAAAATGGGC 360

TGATGAAGTA GAGAAATATQ CGTTAGAACA AGCGAAAGAG AATGATAAAA CGTATCCAGG 420 

TTGGAAGCTA GTCACGGGAC GTTCAAGGAG AGTGATAACT GATACAAAAG CAGTCCGAGA 480 

CAGGTTAGTT GAAGCGGGTT ATAAACCTGA AGATATTACA GAAACCAAGT TACTTAGCAT 540 

TACGAATTTA GAAAAATTAA TCGGCAAAAA AGCATTTTCT AAAATTGCAG AAGGCTTTAT 600 

AGAAAAGCCG CAAGGTAAAT TAACACTTGC TACC6AGTCT GATAAACGAC CAGCTATAAA 660 

GCAATCTQCT GAAGATGATT TTGACAAACT ATAAAAATTA AAAAGGACGG TATATAAACA 720 

TGAi^CAAA AGTATTAAAT AAAACTAAAG TGATTACAGG AAAAGTAAGA GCATCATATG 780 

CACaTATTTT TGaACCTCAC AGTATGCAAG AAGGGCAAGA AGCAAAGTAT TCAATCAGTT 840 

TAATCATTCC TaAATCAGAT ACAAGTACGA TAAAAGCCAT TGAACAAGCT ATAGAAGCTG 900 

CTAAAGAAGA AGGAAAAGTT AGTAAGTTTG GAGGCAAAGT TCCTGCAAAT CTGAAACTTC 960 

CATTACGTGA TGGAGATACT GAAAGAGAAG ATGATGTGAA TTATCAAGAC GCTTATTTTA 1020

TTAACGCATC AAGCAAACAA GCACCTGGTA TTATTGACCA AAACAAAATT AGATTAACGG 1080 

ATTCTGGAAC TATTGTAAGT GGTGACTATA TTAGAGCTTC AATCAATTTA TTTCCATTCA 1140 

ACACAAATGG TAATAAGGGT ATCGCAGTTG GATTGAACAA CATTCAACTT GTAGAAAAAG 1200 

GCGAACCTCT TGGCGGTGCA AGTGCAGCAG AAGATGATTT TGATGAATTA GACACTGATG 1260 



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TTGAGGTGTC AAGAATTTGA AATTTATGAA TATAGATATT GAAACATACA GCAGTAACGA 1380 

TATTTCGAAA TGTGGTGCCT ATAAATACAC AGAAGCTGAA GATTTCGAAA TTTTAATTAT 1440 

AGCTTATTCG ATAGATGGTG GAGCGATTAG TGCGATTGAC ATGACTAAAG TAGATAATGA 1500

GCCTTTCCAC GCTGATTATG AGACGTTTAA AATTGCTCTA TTTGACCCTG CTGTAAAAAA 1560 

GTATGCATTC AATGCTAATT TCGAAAGAAC TTGTCTTGCT AAACATTTTA ATAAACAGAT 1620


GCCACCTGAA GAATGGATTT GCACAATGGT TAATTCAATG CGTATTGGCT TACCTGCTTC 1680 

6CTTGATAAA GTTGGAGAAG TTTTAAGACT ACAAAGCCAA AAAGATAAAG CAGGTAAAAA 1740 

TTTAATTCGT TATTTCTCTA TACCTTGTAA ACCAACAAAA GTTAATGGAG GAAGAACrAG 1800 


AAACCTACCT GAACATGATC TTGAAAAAtG GCAACAATTT ATAGATTaCT GTATTCGAGA 1860 

TGTAGAAGTA GAAAT6GCGA TTGCT 1885 
(2) INFORMATION FOR SEQ ID NO: 105:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2656 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 






TAATCCTTAG TTCACTGnCA AATTTCAAAA CACCAGTTCC CTCTATCTGC ATCCATAGAA 60

ACTGnATGTT TGTGTCAATA ACCGGATTAT ATTGTGATGn TGTTTGTAAC TCGATTAAGT 120

TATCATCTTT CGAAAAATTA TCTACTACCA TTATTCAACC ACCTTTCCTT CGAATAAACT 180

CCAriTACCA ACkCCACCAG TACCAAAGTT TCTAACTAAA AATTGATGTG CAGACGGGAA 240

GTTATTACGT CTTAATACTT GTGTTGTATT ACCTGGTGTA TTCGATTTTA CTTCTAATAT 300

CCAACCTGCA ATACCTTTAA AGTCTTTAGG AAAATCAGTA AATCGGTTTG ATTCTTCAGT 360

AGTGATATAG AAATCTAAAC CAACGATTTT TAAATCTGAT AATTTTGTAA TACTCTTAGG 420

GATATGTTCC CAATAACCGG CGTTTTGCGG GCAGAAATTC CATGCTCCGT TGTTTTTCTT 480

ATTGAAAATG TCAATGACAC GTTCGAATTT AAGCATATTT CTACCTGTGC TGTTTCTGGt 540

AAGTACTTGT CTTAGAGCAC CATTATAGTG TCCAGGCAGT ACATCCAAGA ACCACCCTGC 600

ATCTCTAAAC GCTTTCGGTA ACGGGAAATC TAATGCATTT TGTGTGTCTT GaCGTATAGA 660

TATAGTAATG ACCAACTTCC GTAATATCAC TTAGATATGC TGGGTTCTGT ATTGGTAACG 720

GTTTAACACG TCCGCCTGAA TCAGTCATTG ATACTTGAGG TGCGATGTTT TTCAAGAATT 780
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TAGTTACCCC GATTAGAAGT GCTTTACGTC CTGTTTCTAG ATCGTAATAC ATATCTAGAC 900 

CCTCAGCCTC TTGGAAATCT CCTTTAAAGT TGTTATTCAC ACCGCCTATA TCGATGCGAC 960 

GTTTAAATAA CAATTCTTTC GTTTTGATAT CGAAGCCTTG TAAGTAGTTA GGGTTGGCTG 1020 

TATTCGAATC ACCTGTATAC CAATATAAGA TACCTGCATC ATAAGTGATA CCTTGCATA6 1080 

GTTGTGTATC TGAAGTGTAT TCCATAGGTA TATCCATTTG ATACAATACT TTGTCTATAC 1140 

CTTTATCAAT ATCGTCAGCA CTTCTAACCT CAACAAAGTT CAACGAATTC TTAAGTTGTC 1200 

TTTCAGTGGG TTTATATTCA CGTCTAAAAA TCATTAAATT TTCTACCGGA TTATAAATCG 1260 

CTGACGTATA TCTGTCGTTA AATATATTCG GCATGACATC TTGCATTTCA TTACCATAAG 1320 

TTATTTCTCC AGTTCTATAT TGGAAACGTA CAAACTTGTT GTTTTTGTTA CTGTCCAATA 1380 

CAGCTGAATA AATCCATAAT TCTCCATCAA TGTATCTATA CGCATTGTGT GTACCGT6AC 1440 

CGCCGTTTTT AACAAGCAAT CTATCAATAA ATTGTCCGTT GGGCTTCAAT CTAGATAACA 1500 

TGTAATGATT ACCTGGACGA GCTTGCGTCA TATAAATAAT TTTCGTTCTA GGGTCTACCC 1560 

AAAATGATTG CATTACTGCA TTTGTATATG GCGATAAATC AGTGATAAAT TCCGGTTCTT 1620 

GCTCITTTGG TTCGAATCGG TATTCTGTC6 CTCGATATTC TTTATAGTGT TCATCTACAG 1680

CTTTCTCAAC CTTTTTAGTG AAAACATCTA GTGTTGAATA ATCATGATAC AAACGATCTT 1740 

GCAATGTCTT ATGACCATAA CCTGTATTAT CAACGCGCGC GTCTTTTAcT TCGTTGATAC 1800 

C3GTCGCCX3TT ATGACCTAGT ACCATGTTGC TAAATCGACC GTTTAAATAT GTTAAAAAGT 1860

CAGAGACGTT ACTTGTAACA TTTAAATGTT CATACTTTAT TTGTTCTCCA TCATGTGCGA 1920 

ATACCTCTTT ATTTCTGTGG TATTCAAGAG AGAAATTAAA ATCCGTCAGC ATGTCTGAAA 1980

TAAGTTTAAA GTTATACTCA TTTTCATCTA CATATCTGTA GTCAAAGACT CTACTTAAAT 2040 

CTGTAATTAG TTTATTACTC ATGTTTTCCT CCTTTACTAT CCATAAAACT GATmATAATT 2100 

TTTAATAAGC TCATACATAA TAACTTCATG ACCTCTTTCA TTAGGATGTA ATCCATCAGG 2160 

CATGCTAGAT TTTCTAAATG CTGGATTATA TGGTTTGAAA TAATCTGTGT GATAAGCATC 2220 

ATATACTGGT ACATCCAATT CACTACAAGC CAATATCTGA GCATTGACAT AATCCTCTAA 2280 

AGTTAACCCT AGTTTGTTTT TGTCCGTATC TTTACGGCGT ATCGTTGTAC CACTCATAGG 2340 

GCATTGCCTA GTAGCTGTCA TTACAAGTAT TTTTGAAGCT GGATTATTTT TCCTGATAAC 2400 

TTCAATTGCA GAACAAAAGG CGCCGTAAAA CGTTTTAGTG TCGGTTTTAT CAGTGCCTAT 2460 

CGGTACGCCT GCCCAATAAC CATGTAACCA GTCATCATCT GTACCTTGTA ATATGATTAG 2520

GTCTCCTCTT ATTTGCTCTG CTTGTCTaTA AATOCTOTIT TCTaCCGCTT CTTTACCTAT 2580 



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CTTGCCTAAC ATTTCT 2656 
(2) INFORMATION FOR SEQ ID NO: 106: 


(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4854 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106:





AAAATGAGG6 


TTCTAGOGGA 


AATTACC!AAA 


AGCGTCGTTC 

nS9\»VJ X >>V9 X X W 


ATACTATGGG 


CAGCGTAATC 


60 




GTA'i'iTCAAA 


AGAAAAAACA 


CCTAAATGGT 


TAGa&AATAG 


AGATAAACCT 


AGTGAAGAAG 


120 




ATTCGGCTAA 


AGATAATAGC 


GTAGATGATr 


AAPAATTAGA 


GCAAGATCGA 


CAAGCATTTC 




on 


TAGATAAATT 


ATCTAAAAAA 






ATGAAGCAAT 


TTAAAAGTAT 






AATTAACAC6 


TCGCAGGACT 


TTGAAAAAAG 


AATAGAAAAG 


ATAAAnCAGA AGTAATCAAT 


300 




GACCCAGATG 


TTAAGCAA'n 


TTTGGAAGf^ 


CATCGAGCTG 

WA X WV«nA9W X w 


AATTmACQAA 


TGCTATGATT 


360 




GATGAAGACT 


^^^^^^^^ Sm A\m ^ A 


ACAAGAGTAT 


AAAGATCAAC 


AAAAACATTA 


TGACGGTCAT 


420 




AAATTTGCTG ATTGTCCAAA TTTCGTAAAG GGGCATGTGC CTGAGTTATA TGTTGATAAT 480

AACCGAATTA AAATACGCTA TTTACAATGC CCATGTAAAA TCAAGTACGA CGAAGAACGC 540

TTTGAAGCTG AGCTAATTAC ATCTCATCAT ATGCAACGAG ATACTTTAAA TGCCAAATTG 600

AAAGATATTT ATATGAATCA TCGAGACCGT CTTGATGTAG CTATGGCAGC AGATGATATT 660

TGTACAGCAA TAACTAATGG GGAACAAGTG AAAGGCCTTT ACCTTTATGG TCCATTTGGG 720

ACAGGTAAAT CTTTTATTCT AGGTGCAATT GCGAATCAGC TCAAATCTAA GAAGGTACGT 780

TCGACAATTA TTTATTTACC GGAATTTATT AGAACATTAA AAGGTGGCTT TAAAGATGGT 840

TCTTTTGAAA AGAAATTACA TCGCGTAAGA GAAGCAAACA TTTTAATGCT TGATGATATT 900

GGGGCTGAAG AAGT6ACTCC ATGGGTGAGA GATGAGGTAA TTGGACCTTT GCTACATTAT 960

CGAATGGTTC ATGAATTACC AACATTCTTT AGTTCTAATT TTGACTATAG TGAATTGGAA 1020

CATCATTTAG CGATGACTCG TGATGGTGAA GAGAAGACTA AAGCAGCACG TATTATTGAA 1080

CGTGTCAAAT CTTTGTCAAC ACCATACTTT TTATCAGGAG AAAATTTCAG AAAGAATTGA 1140

ATTTTAAAAT GATTGGTGTA TAATGAATAC AAATCTAAAT CGTTTAAATG ATTGAAGACA 1200

AGATGATCTA ATCAATATTA CACAGAAAGC CATTGTTTGA TGAGAATATG GTTAATAAAT 1260

TAGATGATTA CTACTTCATT TATGGTATTT GTAATGAATA CCCGGATCAA GACCGTTATC 1320
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CTCGTCCCTT GTATAGGGGC GGGATTTTTT GTTTTTTTCA GACATAAATG TTTGTTGGTG 1440 

TCATAAATTC CCTGTTTATT GTTAATAGGT TTAATGTTAA AACGATGATT GTTGTTCAAT 1500 

TTTTTAACGA GGTCAGATAA AAGTATTTAT AAAGCAAATA GGAGGGTTTA ACATGGAACA 1560

AATTAATATT CAATTTCCAG ATGGTAATAA AAAGGCGTTT GATAAAGGTA CTACTACTGA 1620 

AGATATAGCA CAATCAATTA GTCCTGGATT ACGTAAAAAA GCTGTTGCCG GCAAATTTAA 1680 


CGGGCAACTT GTAGATTTAA CTAAACCGCT TGAAACTGAT GGATCAATTG AAATTGTGAC 1740 

ACCAGGTAGT GAAGAagcGT TAGAGGTATT ACGTCATTCT ACTGCACATT TAATGGCACA 1800 

CGCGATTAAA AGGTTATATG GTAATGTTAA ATTTGGTGTA GGTCCTGTAA TAGAAGGTGG 1860

ATTCTACTAT GACTTCGACA TTGACCAAAA CATCTCATCT GATGACTTTG AACAAATTGA 1920 

AAAAACAATG AAACAAATCG TTAACGAAAA TATGAAAATC GAAC6AAAAG TGGTTTCAGG 1980 

AGATGAAGTG AAAGAGTTAT TCAGCAATGA TGAATACAAA TTAGAATTAA TCGACGCGAT 2040

TCCTGAAGAT GAAAATGTAA CATTATATAG TCAAGGTGAT TTTACTGATT TATGTCGTGG 2100 

AGTTCACGTT CCATCAACAG CTAAAATTAA AGAGTTTAAA CTATTATCTA CAGCAGGTGC 2160 

ATACTGGCGT GGAGATAGTA ACAACAAAAT GTTACAACGT ATATACX3GTA CTGCTTTCTT 2220 

TGATAAAAAA GAATTGAAAG CACATTTACA AATGTTAGAA GAGCGTAAAG AACGTGATCA 2280 

TCGTAAAATT GGTAAAQAGT TAGAACTATT CACAAATAGC CAATTAGTTG GTGCTGGTTT 2340 


GCCATTATGG TTACCTAACG GTGCAACAAT TAGACGTGAA ATTGAACXSTT ACATTGTTGA 2400 

TAAAGAAGTT AGCATGGGAT ATGACCACGT TTATACACCA GTACTTGCTA ATGTTGATTT 2460 

ATACAAAACA TCTGGTCACT GGGATCACTA TCAAGAAGAT ATGTTCCCAC CAATGCAGTT 2520 


AGATGAAACT GAATCTATGG TATTACGTCC AATGAACTGT CCACATCATA TGATGATTTA 2580 

TGCqAATAAA CCACATTCAT ATCGTGAATT ACCTATCCGT ATCGCTGAGC TAGGAACX3AT 2640 

GCATAGATAT GAAGCAAGTG GTGCTGTATC AGGATTACAA CGTGTTCGTG GTATGACTTT 2700 


AAATGATTCA CATATCTTTG TTCGACCTGA TCAAATTAAA GAAGAATTCA AACGCGTTGT 2760 

AAACATGATT ATTGATGTGT ATAAAGACTT TGGTTTCGAG GATTATAGCT TTAGATTAAG 2820 

TTATAGAGAC CCTGAAGATA AAGAAAAGTA CTTTGATGAT GATGATATGT GGAATAAAGC 2880

TGAAAATATG CTTAAAGAGG CAGCGGATGA GCTTGGCTTA TCGTACGAnG AAgCGATTGG 2940 

TGAAgCGGCA TTCTATGGTC CGAAACTAGA TGTTCAAGTT AAAACAGCX3A TGGGTAAAGA 3000 

AGAGACATTA TCAACAGCAC AACTTGATTT CTTATTACCA GAACGTTTTG ATTTAACTTA 3060

TATTGGTCAA GATGGTGAAC ATCATCOTCC AGTTOTTATT CATCGTGGTG TTGTATCAAC 3120 

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AGCGCCAAAA CAAGTTCAAA TCATTCCAGT TAACGTTGAT TTACATTATG ATTATGCGCG 324 0 

CCAATTACAA GATGAATTGA AATCTCAAGG CGTTCGTGTA AGTATTGATG ACCGTAATGA 3300 

AAAAATGGGT TATAAAATCA GAGAAGCTCA AATGCAAAAA ATACCTTATC AAATCGTAGT 3360 

TGGGGATAAG GAAGTTGAAA ATAATCAAGT GAATGTGCGT CAATATGGAT CGCAAGACCA 3420 

AGAAACAGTT GAAAAAGATG AATTTATCTG GAATCTAGTT GATGAAATTC GTTTGAAAAA 3480 

ACATAGATAG ACAGTTGTCG CAATAAAATG CTTTAAAACT TTTATTGCGT ATCAAGTTTT 3540 

ACAGGGTTGA TTATGCGTGA TGAATCCTGT ATATTACAAG TTAGTTAAAA TATTAAATTG 3600 

A6TTAGAGGT TGCATGTTTA ATTAGTAACT TGTCAGAA6T ATTTATGGTA CATAAGTTGA 3660 

ACAAGTGAAA GGTAAAGATG CCGAAATAGA TATAAACCAT AAATTATATC TATTGGGACA 3720 

GTTTTCGAAT AGGAACTGTA CTGTCACAGA ATGTGATGTG CTACCTTATA TAGATAATTG 3780 

CCAAAGTGGT TGCATATCTT AAAGGTATGT AGCCACTTTT TTACTTTTAA TATCACTATG 3840

TTCTGTAAAA AAGGGTATGA AAGTGAATAA AGGTTATTTA TTTCTTGGCC TCTAAAACAT 3900 

GGAAAGGGAG CTTATATGTC AAAAGTTCAA AATGAAAGTA ACAATGTTGT CAAAAGGGGA 3960 

CTTAAAGATC GTCATATTTC TATGATTGCG ATTGGGGGTT GTATTGGTAC AGGTTTATTT 4020

GTAACTTCTG 6TGGAGCAAT TCATGATGCA GGTGCTTTGG GTGCATTAAT AGGATACGCA 4080 

ATTATCGGAA TAATGGTATT TTTCTTAATG ACGTCACTTG GCX5AAATGGC TAC6TATTTG 4140 

CCAGTATCAG GTTCA1TTAG TACATATGCT ACAAGATTTG TTGATCCATC TTTAGGGTTT 4200 

GCGCTTGGTT GGAACTATTG GTTTAACTGG GTAGTGACTG TAGCAGCAGA TATTACGATT 4260 

GCAGCACAAG TCATTCAATA TTGGACACCA TTGCAAGGCA TACCCGCTTG GGCATGGAGT 4320 

GCGTTGTTCT TAGTTATAAT TTTTAGTCTG AATTCGTTAT CAGTTCGCGT CTATGGTGAA 4380 

AGTCJAATACT GGTTGGCATT GATAAAAGTG GTTACAGTTA TTGTTTTCAT TGCAATTGGT 4440

TTATTAACGA TTGTCGGAAT CATGGGTGGT CATGTTGTAG GATTCGAAAT ATTTAATAAA 4500 

GGTGAAGGTC CAATTCTTGG TGGCAACTTA GGAGGAAGTT TGTTATCAAT TCTAGGTGTA 4560 

TTCTTAATCG CTGGTTTCTC ATTCCAAGGT ACTGAGTTAA TTGGTATTAC GGCTGGTGAA 4620 

TCAGAAAATC CTGAACGTGC TGTGCCGAAA GCAATTAAAC AAGTATTCTG GAGAATTTTA 4680

TTATTTTACA TTTTAGCCAT TTTTGTTATC GGTATGTTAA TTCCTTAT6A TAGTAGTGCA 4740 

TTAATGGGGG GTAGTGATAA TGTAGCAACX3 TCTCCATTCA CATTAGTGTT TAAAAATGCT 4800 

GGATTTGCGT TTGCAGCATC ATTTATGAAT GCAGTCATTT TAACGTCTGT GTTA 4854
(2) INFORMATION FOR SEQ ID NO: 107: 
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(A) LEKGTH: 24 8 B base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 

ATCAAAAATT GATTGTTTTC nATTTTTTGT TTCAGCGCGG GATCTTTTAC GTCTTTTGTG 60

AAAACGaTTT TATTATTAAC TACTTTTACT GGATAACTTT TGTATGTCGA GTCAGTAGCA 120 

TTTTTTCTAT CGTTTGTAGT TGTGTCATAT TCACCAgTTA TTTTATGTGT GTTCTTATCT 180

ACCTTTAACA ACATACGGTC rrCTTTTAAA AGCTCATCTG ATCCAACAAC TGAATAAOAO 240

GATTCTATAT ACCATGTGTC TTGATCATTA TTTTCATAAT GGGGATTATC GTGACCATCA 300 

ATTTCATAAA GCGTTTCTAA GTTTTTAATA GGATACGTAC TTAGTACTTT TTTAAGACCA 360 


TCTTTCAAAT GAATTTGTTC CCACTTCATT GCCAAAAACA TATCGCCACT GACTACAATT 420 

GAAATAATAA TAATTGCTGC TAAGTTTAAC CAGAAAATTT TATGTGCTTT CATACATTCC 480 

CACCGTTTCT CAAAATACTT CATTAACACT ATAATAATAT ATTTTGAAAA ATATTTACAT 540 


CAGTATTAAA GTGAATATCA AATTTTAAAT TTATGAAAAT AATAGATATT TATAAAAAGC 600 

GGAAAAGAGA TACAATAAAA AACTGCATGA CGTTTGAGAC GTCACACAGT GTAACTAAAA 660 

ATTTAAAAAG TTGTTGCTAA TTTTTCAGCA TTATTAATAC TAGTTGCTTT AATTTCTTCA 720 


GTCTTATGAG GTTCAGCATT GTGTCCTTCA ATAATGATTG TTTCATATGA TGGCACACCT 780 

AAGAATGTCA TAATTGTTCT TAAATAACGG TCACCCATTT CAAAATCAGC AGCAGGTCCT 840 

TCAGTATAAT ATCCACCACG TGATTGAATG TGTAATACTT TTTrGTCAGT TAGTAAACCT 900

TGTGGTCCTT CAGCAGAATA TTTAAAAGTT TTACCTGCAA TTGAAATAGC ATCAATATAT 960 

GCTTTAACTA CAGGTGGGAA AGAAAGGTTC CACATAGGCG TTACAAATAC ATATTTATCT 1020 

GCACTTAAAA ATTCTTCTAA AATGTCACTC AATCTTGAAA CTTTCATTTG TTCATCATCA 1080

GTTAACGTTT CGCCATTACT CATTTTTCCC CAACCAGTTA ATACATCTTT GTCAATAACT 1140 

GGAATATAAG TTTCArATAA ATCAATATGT TTCACTTCAT CATCAGGATG TTGTTGTTGA 1200 

TATGTTTCGA TAAATGCTTT ACCAGCCGCC ATAGAATTTG ATACCAGTTC ATTAAAAGGG 1260

TGTGCTGTAA TATATAATAC TTTTGCCATT TGAAAATTCT CCTCTGkTTC TGTTATTTTC 1320 

TTAAGTATAA TTATTATACT CGATATAAAA TTTAATATCA ATCAAAATAT TCAAATTACC 1380 


ATCATTTTCT TCATCTATAT nTGGCAGTAC TACTAAA6TA TOAGTGCATT TAATTATGAa 1440 

ATAGTTGATT TaGAATAtAT ACTTAATACC CAAAATATAT GAAGGATGGA TGCCACTATG 1500 
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5 






ATTATTTATA TAGATGACAT TCAAAAATGG TTTAACCAAT ATACCGATAA ATTGACACAA 1620

AATCATAAAG GACAAGGACA CTCAAAATGG GAAGACTTTT TTAGAGGGAG TCGGATTACT 1680

GAGACTTTTG GTAAATATCA ACATTCACCA TTTGATGGTA AGCATTATGG CATTGATTTT 1740

GCATTGCCAA AAGGTACACC AATTAAAGCG CCX3ACGAATG GTAAAGTAAC ACGTATCTTT 1800

AATAATGAAT TGGGCGGCAA GGTATTACAG ATTGCCGAAG ACAATGGAGA ATATCACCAG 1860

TGGTATCTAC ACTTAGACAA ATATAATGTC AAAGTAGGTG ATCGAGTCAA AGCAGGTGAT 1920

ATTATTGCAT ATTCAGGCAA TACAGGTATA CAAACXSACAG GCGCACATTT ACATTTTCAA 1980

AGAATGAAGG GTGGCGTAGG TAATGCATAT GCAGAAGATC CAAAACCGTT TATCX3ATCAG 2040

TTACCTGATG GGGAACGTAG CCTATATGAT TTGTAGTTAT AGAAGGGTGC CCGCAGTCTA 2100

AAAAATTAAG CAATCATTGT GTGAGTATGA TACTTACATA ATGGTTGCTT TTTTCAATGA 2160

AAATCGTAAT GCTAAGTCAT ACTTGTTTCA TTTAGATATT ACTTAAAATG TAAGACAAGG 2220

TTGTTAGCAT TGGCAGTGAA ATATCGCACA TAAAAAACAT TATTGTCACA CTAGAAAATA 2280

GTTGTGCACT ATATCAATTT TCTGTATAAA AGTTTAATTC TGACAGTAAT GTAAACGTTT 2340

ACAATTTATG ATTGACATTA ATAATGACTG AATATATGAT TTATGTAAGT ATTTGTGCAA 2400

CGTTTTCACA AAGTGTATTG CACaAyCAAA CTGtAAACaA aGTATGGGGg GCCATAACAT 2460

GGCAGAACTA AGTTAGAGCn TATTAAAA 2488



(2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4093 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 






TTTTCTTTAT TTCAAmCTGT ATATTaATGA TGTCACTTCA TTTGATACGA TTCTTGATAA 60

CCTATTCAAA ATTCCGCCAA ATAACATAAA TATTATATAA ATGCCGATAC TTTTAATCAT 120

TTTCTACTTT TTCTTCGATA CGGAAACTTG TTTTCGAATT GAACACTTCA CCAGCTTTTA 180

AAATTGACGG TGCTTTTTCA CCATATAAAT TAATATCATT TGGTAAAAAT TGTGTTTCTA 240

ATGTAAAGCC AGAATGTGGT TTATAAATAT TAAATGGACT ATCCCACTCA TCAGGCTGGT 300

TAAAAGTAAA GAACACAACA TGAGGCATAT CTGTATCGAC CTCTAACATA AATTCATGAT 360

TTTCAACATA CATTTTATGT TCACCAACTG TAAATGGGTG ATCXSAGACCA CCAAAACGTG 420
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10 






TATCTTCAAA CACTTCATGT AAATCTAGAA TATCACCTGT AACAATATTT CGCTCATCTA 540 

ATACATACAT ATCTAATTGA TTACTTGAAA TGCGATGATT ATCAACX3ACA TTATTATCTC 600 

GATTCAAATT GAAGTACACA TGATTCGTAG GACTAAACAA TGTGTCTTCT GATGCAACTG 660 

CTTCGTATTC AATCGACCAT TGGTGATCCG CATCATAAAT ATGTGTAATC GTCACATCGA 720 

TATCACCCGG GAAATGATCA TCAGCTGATT TCAACACCGT CTTAAATATA ACTTTAATTT 780 

GAGCAATTTC ATTTCTAATT TCATAATCAA ATAACTTATT GTCCAAACCA TGACATCCAC 840 

CATGTAAATG ATGTTCACCG TTGTTTTTTT CTAACTGATA TTCTTTACCT TTCAACTTAA 900 

ATTTAGCATT ATCAATTCTA CCGCTATATC TTCCTATAGA AGCACCAAAT TTAAAAGGAT 960

TACTATGATa AAATTCATCC GCTTCAACAA CATTTCCAAG AACAATATTA TTATCATGAT 1020 

ATTTCCAAGA CACTACTCTT GCTCCATAAT TCGTAAAAAT AATTTTAGTT TCATCATTAT 1080 

CAATTTTGAT TAAATCTACA CCTTGTCTTT GGTGCTCAAC TTCAACTATC ATTTTTACTT 1140 

CTCCCTTCTA ACCACAAGTG TTCAAGCTCT GCTGGGTAGC AACATTACTA AAACACCTAC 1200 

AATACT^TG ATTGCACCGA TAACATCATA TTTATCTGGC ATTTGTTTAT CTACGACCAT 1260 

CGCAAAAATC AAACTCATGA TGATAAATAC GCCACCATAT GCTGCATATA CTCTTCCGAA 1320 

TGATGGAAAT GATTGAAATG TCGCAATGAC ACCATATAAC ATGAGTATCG CACCGCCTAT 1380 

TAGCCCAACA AGTGAAGACT GTCCTTCCCT AAGCCACAGC CAAATCAGGT ATCCCCCACC 1440 

TATTTCACAT AAGCCAGCTA ATATAAATAT AAAAATCXXSA TATAACATGA AATCACTCCA 1500 

TCACACATTT GCTATCAATA ATCTATCGGC TACATATCAT TTGTTTACAT TTCTTCTTAC 1560 

TTCACATTCC CATTTTAAAA AGTTCGTTTT CACATTCATA TTGTACACTT TTTTAGACAT 1620

TATTCTATAG CTAAATATAA AAAAATAAGA GTAACACXKTT TTCATCATCA TTTTATATGA 1680 

TAAATGTGTG TCACTCTCAT CAATTTTATT TTTTAAATAC ACGTTTCATT GAATTAAATA 1740 

AGCCACGTTC AAATGTAAGT ACTGAATCTT TATATGTTTr AATTQCAATC CATATCAAGA 1800 

CAGCTACCAT TACAATTGAG ATTAAAGAAC TTAAGATGAC CTCATATATT TGAAGCCCTG 1860 

AAGTTTGAGC GCGTACAACT AATTGAAATG GCGCTAAAAA CGGAATATAA CTTGTGATTA 1920 

AAGCAAGTTG TCCATCAGGA TTATTTATCG TGAATATCGC GATATAAAAT GCAATCATAC 1980 

CAAGTAATGT CAGTGGCATC AAAGATTGAT TTAAATCTTC TATTCTAGAT GTTAATGATC 2040 

CGAGGATGGC TGCAAGTAAT ACATACGCCG TAATTCCAAC AATACTACTT ATAATTCCGA 2100

CAATAATAAT TTGCCAAGAC AATTGATTCA TTTCCACGTT AAAACCTTGT AGCAAGTCTT 2160 

TTAAGTCAAA GGCAAAAATG CATATAACTG CCATCAATAC AATTAAAATA ATCTGAGTCA 2220 
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TAATAATCAT TTCAATGACA CGCGATGTTT 
ATGCATAATT TAAAACAATG AAGAACATTA 
TGAAAATCTT TTGTCCTTCT GATACTTTAT

CAACTTTACT TTGTGCTTGT AATTTTTGTA 
CTACCATATT TGTTTGAATA GCTGTAAGCA 


TTACTCX3CTT CTCACTAATG ATTGTCCCTT 
TATAAGCTTT ATCAAGTTTA TGTTTTTTTA 
TAGTAAACTT AGCATCACTA TGAAATGTAT 
GTTCATTCGG TGCTGCTACA CCAATTTTAT 
TATCAATGTT AGATAGGCCA ATCATTAAGG 
ATTTAGCTTT AATTTTTTTG ATATATGTCA

TTGCCACCAA CCTTCTCAAT GAATATATCT 
TTAACATAAC CTTGATGTGC CAGAACTTGA 


ATCGTCAACT GAAGACCTTG CTTCATGTTT 
AAATCTGGTA GTGTTGTTTC TGATTCAATG 
ACATGATTGA TATCACCAGA AACAACAAGT 


CATAATTCTT CAACATGCTC CATACGGTGA 
TTTAAGTCTT TAACTGCTTc TTTTAATAAC 
GGCTCATCTA ATATTAGTAA TTCTGGTTTA
TGTTGATTCC CTTTTGATAG ACTATCAATT 
CGCTCAAGCC AATACGATAT TTGCTGTTGT 
GCCAAATATT TCAATTCTTC TTCAACTGTC 
AAATAACCAA TACGATTGTA CATTGTTTTA 
CCTTCAGTTG GTTCACTTAA GCCTAAAATC 


TTTCTTCCTA GAAAACCTAA CATTTTACCT 
GCCGTCATCT TGCCAAAACG TTTCGTAACA 
CTAAAAAnAT ATGTATTTAT CTTAATATAA
TAAAATGAAT TTATTTTTAA AATTTCTGAA 
ATGTTAAGTA TCATTAGCAC TAGATATGTT 
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TCTCACTAGC AATTTCCATA GCTATTTGAG 2340 

GAAAGATAAT GCCATmaGcT AAAGCATAGT 2400 

CGACTTCATC ATTAGAAATC ACCTTATTAT 2460 

AGTCTTCTTT GTTGATATTT AATTCCCCGG 2520 

GTGCTTGTAC TTTTTGTGAA TCTTCATGAC 2580 

GTAACGTGCG ATTTTGATTC ACCTTGATAA 2640 

CTTCTTTTTC AGCATCTTCT ATAGAAACTT 2700 

TCGCCTGTTG CTTGAAAACC TTATAGATTT 2760 

CTGGACCATC ATCAAACATG TTAATAATCT 2820 

CAGCAATAAT AATCATAAAA ATTACAAATG 2880 

AAGTAAATGT CGCCCAAAAC TTATGCATCC 2940 

TGTAATGATG GTTCTACAAC TTGGAATCGT 3000 

TAAATATCTT TGGCTACGTC TTCATTCTCA 3060 

TCACTATGAA TGATGCCTCT AATGTTTGTT 3120 

ACAACTTTCT TGTTACCATT AGATGCACGT 3180

TGACCTTTAT CTAAAATACA AACATCATCA 3240 

GAACTATAAA CGATTGTACT GCCCCAATCA 3300 

TCAACATTAA CTGGGTCTAG ACCACTGAAA 3360 

TGTAACATAC TTGCTAACAG CTGAATTTTT 3420 

CGTTTTTTGC GGTTTTCAGT AATATCAAAA 3480 

ATTTCTGTTT TTQACATTCC CTTTAAAGTT 3540 

AATTTCCCAT GTAAACCGCG TTCTTCCGGT 3600 

TCTAGTTTTT TACCGTTATA CGTrrTGTGT 3660 

ATACGAAATG TCGTTGTTTT ACmTGCACCA 3720 

GATTCTAACT TTAATGAAAT ATCATTTACT 3780 

TGTTCAATTA CAAGTCCCAT ACTTTGCCTC 3840

CATTTCCATT CTCTATAAAT GCAATATTTT 3900 

ATTGAAAAAT TTAAATAGTG CCATTTTTGC 3960 

TTTTCCATGC CTTTATTGCC TTATTTGTAA 4020 






CTTnCCGGTG TTT 4093 

(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 17846 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 
TGCCAAACTA CCTTTTGACA GTCGTTGCTG TACTTCAGGA TGATCAATCA CATATnTTAC 60 
TTTATCAAAT AGGGCATCTT CATCATTTTT AGTAATTAAA TAACCATTGA AATCTGAAGT 120 
AATCAGTTCG TTAGGTCCAT ATTTAATATC ATAACTAATA ACTGGAACAC CATGTGCTAA 180 
AGATTCAAGT AGCGCTAAAG AGAAACCTTC CATGTTACTT GTTATTAAAC TCAAATAGGC 240
ATCGCTATAT TCTTGGTCTA GATTGCTTAA AAAGCCGCGT AAGTAAACAT GATTTTCCAA 300 

TCCATATTTT TGTATCAATT CATTTAATTT TTTACTTTCA GAaCCAAAAC CATACATATG 360 


AaGCTCTATT TTTGGGACAT ACGATACTAA GCGTTTAATT AATTCAATTT GTTGATGTAA 420 

TTGTTTTTCA GGTGAATAAC GAGCAACGGA AATTAATTTA ACACTGCGCT GATCTAATGT 480 

TTGGACTGGT GTATCAATTG TTTCACTATA GCCGACAGGA ATATTAACAA CTGGAATAGT 540 


ATGGTTAATA CGTTTTTCAA CATCTAATTT TTGCTGCTCA GTAGAAACGA TAATTGCACG 600 

ATATCGAGAT AAATTTTCAA ACATCGCTTT ATATACATTT TTAAATGGCG ATGAATCTAA 660 

TGCATCAATA TTTTTAATGT GTGTACTGTG AAGCACAGCT ACTACTGGGA TTGACTCAGG 720

CGTTAAGTTG AAAATAGGTG CTGTGTACAC ATTACGATCA CTGAAAAATA AATCCCCATG 780 

TT6ATATAGT TGTTTAATGA AAAATGCGCC TAATTCCGTT TCATTATTAA AGAAATATTG 840 

TTTGTTAGCA TAGTAAACAA TAATTTTTTG TACTTCTGGT TTGCCATCCT TGTAAGAAAA 900

ATACTTTTCT AATTTTGTGT CACCTTCTGG ATTATAGAAA AATTCACATA ATGTTTGTTG 960 

TTTATCAACA AGAATCCTAC TACAACTTAA AAAGCCACGC ACATCATAAA AATCACGTTT 1020 


TACTTtTCGT CTTTGACTAT CAAAATGATT TACATAATCT AATATACGAT ATTTAGGATC 1080 

TTGAAAATGG GCATACATTA AGAAACGCTC TTGATCATAT ATTCTAAAGT CATGACTATT 1140 

TTCAACATGT TTTAAAGTAT AATGACATTC ATCAGTCCAA TACGACAACC AGTCAAATGG 1200

TTCATTGCGT TCTAAATATG TTGCTTCTTG GAAGAAATCA TACATATTAA TATAGTCAGA 1260 

ACTAGTAATA TAATTTTGG6 CATTTCTATA TAAATATCTA TTCCATGACA GAAATACACA 1320 
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CCCAGTTAAA TTAACACCTA AACTATTACC TACAAAATAA TTCATTTACA ACACCACTTA 1440 

TATCTATTTT TTATAATTAT ATCACATAAT ATTTAATTAC TTCTTTTAAC TGGAAGATGT 1500 

GTTTATTTAT AAAACAACAA ATTTTGATAT TTATAATGAT AGTAGTTATT CAATCAcTAC 1560

GACCCAATAT ATCATkGTAG AGCTTAGGAT ATTGATTTAT GACTCAGGCA CATCAAATGa 1620 

GAgGATTTAT AAArGAGATA TACAACTCTA GAAGGTATAA TAAAAACGCG CAACTAATGT 1680 


TACGCGTTTG AATTAATCAT ATGATATTAT TTGCGATACT TTAATTTAGC GAAAgcATCA 1740 

TGTTGATGGA TAGACTCTTC ATTACGACAT TCGATATCGA AACCGTCTAA CCAATCAAAT 1800 

TCAACTAAGT CCGCGGCAAT TAAACGAATT AAGTCTTCGA CAAAACGTGG ATTTTCATAT 1860 

GCACGCTCTG TCACACGTTT TTCATCAGGA CGTTTTAAAA TAGGGTATAG AATTGAACTT 1920 

GCATTAGCTT CCATTGCATC TAAAATTTTA TTTTTATAGT CATCAACTAT GTCTTGATCT 1980 

TTATTAATAT ATGTTTTAAC AGTGACAACA CCACX3TTGGT TGTGCGCTGA ATACTCACTT 2040

ATTTCTTTTG AACAAGGGCA TAGCGTTGTG ACAGTTGCTT CAATAGTAAG TTCTTTACGT 2100 

GTAnCTTTAT CACCGTCAAT TGCTAATCCA TAAGTGACAT CGGCATTACC AACTGCTITA 2160 


ATATTTGTGG TTGGACTATA GCGATCAAAG AACCATTTCC CAGAAACATC AACGCCTGCC 2220 

GCATTTTGTT TCATATTCGT TTGTAAAGTG CGTAACACCT GATAAAGTGT ATTAAATTCA 2280 

AGTTCAATAC CATTATCATA GTGCTTTTCA ACACTTTCGA TTATACGGCT CATATTAATA 2340


CCTTTTTCGT CTTTTGTTAA ACTTGTTGAA AAACTAAATG TGCCAGCTGT TTGATACTGG 2400 

TCAACAAGTA CAGGGTACAC TAAGmTTA ATACCAACTT CTTCTATTTC AAATAAAAAA 2460 

TCTTTATGTG TACTTTGTAA ATCTGTCATT TCGTTCTTAG TAGTAGGTTT CGTGCCTTCA 2520

ATAGGATCTA CGGAACCAAA GTGTTTCCAA CGACCTTCTC GTGTCGATAA ATCAAATTCA 2580 

GTCMTTTTT TCCTCCGTTA AGATTTAAAG TGATATGTCC AATATGGTTC GACTGTTAAA 2640 

AAGCTGTGTT GTTTACCATC GATTTCAGGA CTTGCTAATT GTTTTAAAAA TGGACCTGTT 2700

TGAGAAGCAT GTGCTTCAAA TGCCTTAATT TTAAGTTCTT TAi\AATCTGT AATATCATTT 2760 

TGAATATCAG G1TCTCCAAG AGCTTCGGTT GCATCATTAC TGAACGCAAC TAAAGTTAAA 2820 


CGAGGGCGTT CTTCTTTAGG CATGCGTTCA ACCX3TTCGAA TTACAGCGTC TGCTGTTGCT 2880 

TCGTGATCAG GATGTACTGC ATATCCAGGA TAAAATGAAA TAATCAATGA TGGATTTGTA 2940 

TCATCGATTA AAGATTTAAT CATACCATCT ATATGTTCAT AGGGTTCAAA TTCGACAGTT 3000 


TTGTCACGTA AACCCATTTT TCTTAAATCA GTAATACCGA TAACTTTACA AGCTTCTTCT 3060 

AGTTCACGCT CACGAATACT TGGTAATGAT TCGCGTGTTG CAAATGGGGG ATTACCTAAA 3120 


TAATTTGCTA ATGTGCCTGC AGATGAGAAG GTTTCATCAT CAGGATGTGG AAATATTACT 3240 

AATACATGTC TTTCGTCAGT CATGTTGATG CCTCCTCTAT AAATTAAATG GTCGCTCACT 3300 

AATTTGAAGT GCTGCAGCGA GTTGACCTTC GTAATTAAAA CCTGCAATTA AAAATTCATC 3360 

ATGCTCATTG ACCTCAAAAT GCGTTAGACC TTGTACATAA ACCCAACCAC CATTTGATAG 3420 

TTTAAGACCA ATGCGATAAG GTTCTTTATT ACCACCTTTT AGTTGTGCAT GCGTATATGT 3480 

TATTTGTATG TTTCTTAAAA AAGTACCAGC ATTAAAAACA CGTTGATCGA AATGGTTCGC 3540 

ATAGGCCCCA TTTGTCGTTT CAACATGCAG ATACACAGGT TTATGTTCAA AAGAAGCAAG 3600 

TAAATCTATA ACTTCTTGTT CTTTAATTGG TTCCAACACG TTCACTCCTT ACACTATCAA 3660 

TGTGTTTATC TTTCTATTTT ACTAAAAACT ATTCGATAAT TGTATACGAT TGCTCAATTA 3720 

TTTATAAATT AATTTTCATG AAGGGTAATT ACTCAGGATT ACOTAATCAT ACAGCATTAG 3780 

TTTTTTACTT TTAAAAATCA AAAATTTGTT GGAATTTGAA AAGTGTTAAA CATTAAAAAT 3840

GATGCTATAT TAATGGTGTA TGAATGAATT CATAAGTTTT TAAAATGTAT TAAATTTGTG 3900 

GAGGCATGTA AACAATGAAA GTATTAAACT TAGGATCGAA AAAACAAGCA TCATTCTATG 3960 

TTGCATGTGA GTTATATAAA GAGATGGCAT TTAATCAGCA CTGTAAACTA GGTTTAGCAA 4020 

CTGGTGGTAC AATGACAGAT TTGTATGAAC AACTTGTTAA GTTGTTAAAT AAAAATCAGT 4080

TAAACGTAGA CAATGTATCC ACGTTTAATT TAGACGAATA TGTAGGTTTA ACCGCATCAC 4140 

ATCCGCAAAG TTATCACTAT TATATGGATG ACATGCTTTT CAAACAATAT CCTTATTTTA 4200 

ATAGAAAGAA CATTCATATT CCAAATGGAG ATGCCGATGA TATGAATGCG GAAGCGTgCA 4260 

AAATATAATG ACGTTTTAGA ACAACAAGGT CAACGTGATA TTCAAATTTT AGGTATTGGT 4320

GAAAATGGTC ATATTGGATT TAATGAACCT GGTACGCCGT TTGATAGCX3T TACTCATATC 4380 

GTTGffTTTGA CTGAAaGTAC TATTAAGGCT AATAGTCGAT ATTTTAAAAA CGAaGATGAT 4440 

GTTCCAAAGC AAGCCATTTC GATGGGACTT GCTAATATTC TTCAAGCCAA ACGTATCATT 4500

TTACTCGCAT TTGGTGAAAA GAAACGTGCT GCTATTACAC ATTTATTAAA TCAGGAAATT 4560 

TCTOTTGATG TTCCAGCCAC ATTACTTCAC AAACACCCGA ATGTTGAGAT ATATTTAGAC 4620 

GAOGAAGCTT GCCCGAAAAA TGTTGCGAAA ATTCATGTCG ATGAAATGGA TTGATTGCAA 4680 

TGTTTAATTA AGAAATGCCT CX3GGAAAGGT TCCAATAGAA AGATAAAAAG CATTGGAAGG 4740 

ATGATTTTTA GTGGAATTAC AATTAGCAAT TGATTTATTA AACAAAGAAG ACGCGGCTGA 4800 

GTTAGCAAAT AAAGTAAAAG ATTATGTAGA TATCGTAGAA ATCGGTACGC CAATCATTTA 4860 

CAACGaAGGT TTACCAGCAG TTAAACATAT QGCAGACAAC ATTAGTAATG TAAAAGTATT 4920 
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CGCGGATGTA ATTACAATAC TAGGTGTTGC 
AGAAGCTCAT AAAAATAATA AACAATTACT 
AAAACGTGCA AAAGAACTAG ATGAAATGGG

TGATTTACAA GCAGAAGGGC AATCACCATT 
TAAAAATTCT AAAGTTGCAG TAGCAGGTGG 


CGCTGAAAGT CCTGATCTTG TTATTGTTGG 
AGAAGCTGCG AAACAATGTC GCGCTGCAAT 
ACTATCAATT AATTCTAGAT GAATTAAAGA 
TTTCAACTTT TGCATCCAAA ATACTACATG 
GTTCAGGATT CX3TGGCGAAT AGTTTTGCAA 
ATGTTGTTGG AGAATCAACG ACACCTGCGA

CTGGTTCAGG TTCCACGGAA CATTTAAGAT 

CTGACATCGT ATTAATTACT ACAAATAAAG 


ACATCGTTTT GCCTGCAGGT ACAAAATATG 

GTTTGTTTGA ACAAGCATCT CAATTATTTT 

AAATGAATGT TACGGAACAA ACGATGCAAC 


ATAGTCGATA ATATGATGCC TAGGCAGAAA 
AAATTATAGT ATAATATCAA TAATAAACGA 
TATATTTTTG ATTTTGATGG TACGTTGGCA 
CAAAGTGCAT TTAAAGCATG TGGCTTAAOG 
ATGG6AATAC CTATTGAAGA ATCATTTTTA 
GCATTAGCAA AGTTAATCGA TACATTTAGA
ATTTATGAAT TTGCGGGTAT AACTGAAGCC 
CTTTTCGTGG TGTCTAGTAA GAAGAGTGAT 


TTAAATCACT TGATTACCGA AGCTGTTGGA 
CCTGAAGGCA TACACACAAT TGTGCAACGC 
ATTGGTGATT CAACGTTTGA TGTTGAGATG 


GTCACTTGGG GTGCACATGA TGCAAGGTCA 
AATGATCCAT CAGAAATTAA TACCGTATTA 
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AGAAGATGCA 


TCAATTAAAG CAGCTATTGA 5040

AGTTGATATG ATTGCTGTTC AAGATTTAGA 5100

TGCTGATTAT ATTGCAGTAC ACACTGGTTA 5160

AGAAAGTTTA AGAACCGTTA AATCTGTTAT 5220

AATTAAACCA GATACAATTA AAGATATTGT 5280

TGGCGGAATC GCAAATGCAG ATGATCCAGT 5340

CGAAGGTAAG TAATATGGCT AAATTTAGTG 5400

TGACTTT6TC ACATGTTGAA GCGGATGAGT 5460

CTGAACATAT ATTTGTAGCT GGCAAAGGAC 5520

TGCX3CTTAAA TCAGCTCGGC AAACAGGCAC 5580

TTAAGTCXyUV TGATGTATTT GTAATTATCT 5640

TATTAGCAGA CAAAGCAAAA TCAGTAGGTG 5700

ATTCTGCAAT AGGCAATCTA GCTGGGACGA 5760

ATGAACAAGG CTCGGCACAA CCATTAGGAA 5820

TAGATAGTGT TGTAATGGGA TTGATGACTG 5880

AAAATCATGC TAATTTAGAA TAAAATAAAG 5940

TATTATCGAT TATTTTTTTA TTTAAATAAT 6000

ATAGGGGTGT TAATATTGAA GTTTGACAAT 6060

GACACGAAAA AATGTGGTGA AGTAGCAACA 6120

GAACCATCAT CTAAAGAAAT AACGCATTAT 6180

AAATTAGCAG ACCGACCATT AGATGAAGCA 6240

CATACATATC AATCTATTGA AAAGGACTAT 6300

ATTACAAGTT TGTATAACCA AGGGAAAAAA 6360

GTATTAGAAA GAAATTTATC GGCTATTGGA 6420

TCCGATCAAG TAAGTGCATA TAAACCAAAT 6480

TACAATTTAA ATAGCCAACA AACGGTGTAT 6540

GCACAACGTG CTGGTATGCA ATCTGCAGCT 6600

TTACTTCATT CAAATCCGGA TTTTATTATT 6660

TAAAACTTGT TAAAACAGAG AATACCATGG 6720
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ATTTAAAATA AATATTTATT AAACATTATG AATTTTTAAA GAGTAATGTC TGACTCGTTG 6840 

ATAATTTATT TTTGTAAAAA TAAATTAAAG TAATGACAAA GTTATTGAAG TAAATTGAGT 6900 

ATAAACATTT AAATACGATG TCGAAAATGG CGATAGCATA TCACTTACAT GAAGTTGTGT 6960 

GctATCGCTA TmTAGTTA TAATTCCAAA AAGTTAATCG TTCGATGATT TAAGAATTAT 7020 

TATTGTTTAA TTCAAATGTA TGAGGGTATA AAATCATTGA ATTTAATTCG ATAAAGCGAA 7080 

ATTTTTGAAC AAACATACTT TTGTATTTAT ATAAAAGTIT AAATTCTTAT AAATTTGACA 7140 

AAACTAATTA ACTCCGTATA ATTATGAAAC ATACAAGAGG GAGTGTATGA ATTCATGGAT 7200 

TTTAATAAAG AGAATATTAA CATGGTGGAT GCAAAGAAAG CTAAAAAAAC CGTTGTTGCA 7260 

ACCGGTATCO GTAATGCAAT GGAATGGTTC GATTTTGGTG TCTATGCATA TAcAACTGCG 7320

TACATTGGAG CGAACTTCTT CTCTCCAGTA GAGAATGCAG ACATTCGACA AATGTTGACT 7380 

TTCX5CAGCAT TAGCCATTGC GTTTTTATTA AGACCAATTG GTGGTGTCGT ATTTGGTATT 7440

ATTGGTGACA AATATGGACG TAAAGTTGTA TTAACATCTA CAATTATTTT AATGGCATTT 7500 

TCAACATTAA CCATTGGATT ATTGCCAAGC TATGATCAAA TTGGACTTTG GGCACCAATA 7560 

CTATTATTGC TTGCAAGAGT ACTACAAGGG TTTTCAACAG GTGGAQAGTA TGCGGGGGCA 7620 

ATGACATATG TTGCCGAATC ATCTCCAGAT AAGCGTCGTA ACTCATTAGG TAGTGGACTA 7680 

GAAATTGGGA CATTATCAGG TTACATAGCT GCTTCAATTA TGATTGCTGT ATTAACATTC 7740 

TTTTTAACAG ATGAACAAAT GGCATCATTT GGTTGGAGAA TCCCATTCTT ACTCGGTTTA 7800 

TTCCTAGGAT TATTCGGCTT ATATTTACGT CGTAAGCTGG AAGAATCACC AGTTTTCGAA 7860 

AATGATGTTG CAACACAACC AGAAAGAGAT AACATTAACT TTTTACAAAT CATCAGATTT 7920 

TATTACAAAG ATATATTTGT ATGTTTTGTA GCTGTTGTAT TCTTCaATGT TACAAACTAT 7980 

ATGGTAACTG CATATTTACC AACCTATTTA GAACAAGTTA TTAAATTAGA TGCAACGACA 8040 

ACAAGTGTAT TAATTACTTG TGTCATGGCA ATAATGATTC CATTAGCATT AATGTTTGGT 8100 

AAGTTAGCGG ATAAAATAGG TGAAAAGAAA GTATTTCTAA TTGGTACTGG TGGGCTAACA 8160 

TTATTCAGTA TCATCGCATT TATGTTATTA CATTCACAAT CATTTGTTGT AATAGTAATC 8220 

GGTATATTTA TATTAGGATT TTTCTTATCA ACTTACGAAG CGACAATGCC AGGGTCGTTA 8280 

CCAACGATGT TTTACAGTCA TATAAGATAT CGAACTTTAT CAGTAACATT TAATATCTCT 8340 

GTTTCGATAT TTGGTGGTaC GaCGCCATTA GTkGCAmCaT GGTTaGTTAC GAAAACTGGA 8400

GATCCATTAG QnCCTGCGTA TTATTTAACA GCAATCAGTG TTATTGGCTT TTTAGTTATT 8460 

ACATTCTTAC ATTTAAGTAC AGCAGGAAAA TCTCTAAAAG GTTCGTATCC AAATGTAGAT 8520 
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CAATTTTTAA AAAAGAAAGT 


GAAaCgnTaT GAAATTCCAT CGATGATTCA TCATGTAGAA 10440

AAGATGTATT ACACTGCAAG tGGTaAAATT GCTAGAGAAA AAATGATGTC GATGTATTTG 10500

AGAGGTGAAT TATAATATGA ATCAAGCAGT CATAGTTGCA GCTAAACGAA CTGCATTTGG 10560

GAAATATGGT GGCACTTTAA AACATTTAGA GCCaGAACAA TTGCTTAAAC CTTTATTCCA 10620

ACATTTTAAA GAGAAGTATC CAGAGGTAAT ATCTAAAATA GATGATGTAG TTTTAGGTAA 10680

TGTTGTTGGG AATGGTGGCA ATATTGCAAG AAAAGCATTG CTTGAAGCGG GGCTTAAAGA 10740

TTCAATACCT GGCGTCACAA TCGATCGGCA ATGTGGGTCT GGACTTGAAA GTGTTCAATA 10800

TGCATGTCGC ATGATCCAAG CCGGAGCTGG CAAGGTATAT ATTGCAGGTG GTGTTGAAAG 10860

TACAAGTCGA GCACCTTGGA AAATCAAACG ACOGCATTCT GT6TACGAAA CAGCATTACC 10920

TGAGTTTTAT GAGCXSTGCAT CATTTGCACC TGAAATGAGC GACCCATCAA TGATTCAAGG 10980

TGCTGAAAAT GTGGCCAAGA TGTATGATGT TTCAAGAGAA TTACAAGATG AATTTGCTTA 11040

TCGAAGTCAT CAATTGACAG CGGAAAATGT AAAGAATGGA AATATTTCTC AGGAAATATT 11100

ACCTATAACC GTTAAAGGAG AAATATTCAA CACTGATGAA AGTCTAAAAT CACATATTCC 11160

GAAAGATAAC TTTGGCCGAT TTAAGCCCGT GATCAAAGGT GGGACCGTTA CCGCTGCGAA 11220

TAGTTGTATG AAAAATGATG GTGCAGTTTT ATTGCTTATT ATGGAAAAAG ATATGGCATA 11280

CGAATTAGGT TTCGAGCATG GTTTATTATT TAAAGATGGT GTTACGGTAG GTGTTGATTC 11340

TAATTTTCCT GGCATTGGTC CAGTACCAGC CATTTCCAAC TTACTAAAAA GAAATCAATT 11400

AACGATAGAA AATATTGAAG TCATTGAAAT TAACGAAGCG TTCAGTGCAC AGGTAGTTGC 11460

CTGCCAACAA GCTTTAAATA TTTCAAATAC GCT^TTAAAT ATATGGGGTG GTGCATTAGC 11520

ATCAGGTCAT CCATACGGTG CAAGCGGTGC CCAATTAGTG ACTCGATTAT TTTATATGTT 11580

TGACT^AAGAG ACTATQATTG CATCTATGGG GATAGGGGGA GGTCTAGGAA ATGCAGCATT 11640

ATTTACTCGA TTCTAACCAG CGATTAAATG TGTCATTTTC TAAGGATAGT GTGGCTGCAT 11700

ATTATCAGTG TTTTAACCAA CCTTATAGAA AAGAAGTACC ACCATTAATG TGTGCGTCAT 11760

TATGGCCAAA ATTTGATTTA TTTAAAAAAT ATGCAAATAG CGAACTGATT TTAACAAAAT 11820

CAGCAATTAA TCAAACTCAA AAGATAGAAG TAGACACAAT ATATGTAQGG CATTTAGAAG 11880

ATATTGAATG CCGACAGACT CGCAATATCA CACGTTATAC AATGGCTTTA ACATTAACTA 11940

AAAATGATCA ACATGTCATA ACGGTtACAC AAACTTTTAT TAAGGCGATG AAGTAGAGAT 12000

GGAGTTTAAT GAOATATGGA TAAATGAATA TTTGGCGCTC GTAAATGATG ATAATCCAAT 12060

ACATAATGAG ATTGTGCCAG GACAATTAGT GAGTCAAATG ATGCTGATGG CTATGTCATT 12120
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ATTCATTGAA CAACACGAAC ACGAAATTAT AGCAATTAAT GACGATGGAG AGATTAAAAT 1224 0 

AAAAATTTCT TTGAGCACAA AAAAATAACC GATATTAGCT GCATGAACGC ATATTAATTA 12300 

GGAGATGAAA GGACAGCTAA TATCAGTTAT GTATTGTTAT TATTATTGGG AACAGAGATG 12360 

AATATAGGTT ACGTTTCTTT CTTTGCACGG GGATGCATTA ATCTAAAATA ATAATAACAA 12420 

CTATATCAAT GTTTAATAAA TTCTGGATTA TTGGAACGAT TAGTCAATTT AACTAACTTT 12480

CATATGATCT ATATCGTCTT GTAATAAAGA GAGCAATTTG AATATTTCAG TATCACTAAA 12540 

TGAATCGTCA CATTTAATTG AAACATGCTG AAACX3TTTTG GTTATAATTT CATAAACTGG 12600

TGOGCCTTCA TGGTGATACT GTCGATAAAT AATCATAACC TATATTACCT CCTTTGCTAC 12660 

TCTATGGTTA TATTATAAAT AACATTTTTA TGTGTGACAT CAACCTTAAG TATCAACTTT 12720 

TTATCAGACA TAGAACGTAT GATTTACTAA GACTATTTAT GTATAAAAGT TCTAAATAAA 12780 

TATATATTTA TAGAGTCGCC TGGCAGTCAT TTGGGaAATA TAACATATAT GATTAGAGAG 12840 

GCATCTATOG CAAAAGAATG ATAATGATAG AG6TATTGAG CATATAGATG AGTTTAAGTT 12900 

CATCTTGAAA ATAAAGGGTT ATTTAGTCAT AGATGTAGAT GTATAGGAAA TATTTGTATG 12960 

TATTGTTCGA TATGTATGAA ATTTTCAATA AAAGCTAATA ACGCTTATAT GTAACTTTCA 13020 

AATTTAAATT ATATACAGAG CATGATGATT ATAAAAAAAT AACCACATCA CATAAATTGA 13080 

GTTCATACCC AATTTAAGTG GTGTGGCTAA TAATGTTGAT TTATAGATGA ACCGCCTAAT 13140 

CGTTAAACCT CTGTTACTTC AACATCGATA TGTTCAATAC GGTTGTATGC ACCGTGATCC 13200 

ACAGGACCAA CAAAATCATT CATTTTCCAA CCXSTTTTTAA TAGCAGAAGC GACGAAAGCT 13260 

TTCGCGCTAA TCACAGCTTC TTTCGGTGAC TTACCGTTAG CTAAATATGC AGTTGTTGCC 13320 

GCAGCAAATG TACAACCAGC ACCATGGTTA TAACTTTGTT GGAACATGTC TGTTGTTAGT 13380

TGATAAAATG TTTGACCATC ATAGTATAAG TCATACGATT TATCTTGATC TAAAGCTTTG 13440 

CCACCTTTAA TGATGACATG CTGTGCGCCT TTATCAAAGA TAATTGTTGC AGCCTTTTTC 13500 

ATATCTTCAA TTGAATTTAA TTTACCTAAT CCTGATAATT GACCCGCTTC AAATAAGTTT 13560 

GGTGTCACTA CCGTTGCTTT AGGTAGTAAA TATTTAATCA TCGCCTCAGT ATTTCCAGGA 13620 

TTAAGCACTT CATCTTCGCC TTTACAAACC ATGACAGGAT CTACTACAAA ATATTGTGCA 13680 

TTAGATGCCT CATATACTTC TCCAGCACX3T TTGATTATCT CCTCAGTACC TAACATACCT 13740 

GTTTTAATAG CATCAGGTCC GATPGATAAA QCCGTTTCAA GTTGTTTTTC AAATACATCC 13800 

ATTGGTAATG GTGTAACATC GTGTGACCAT GTATCTTTAT CCATAGTAAC GATGGCAGTT 13860 

AAAGCGACCA TGCCATACGT ATCTAATTCT TGGAACGTTT TCAAATCTGC TTGCATACcT 13920 
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CACTCCTACA TAATAATATT GTATTCATCA 
AGCATTCAAT ATTTGATGAT TGTTGAAATG 

TGTCATTCAC TTTAGATAAG TGTGATATGT

AATGGTCGCA AATTTTTCAT GACATAACAA 
TTTTAGAAAA AGAATATTCG ACTGCAATCG 


CGTTTGArrr aacaccgttt gaaaatatca 

ATGGTCCAAA CCAAGCACAT GGATTAGCAT 

CATCTTTACG taatatgtat aaagaattag 

CGCATTTACA AGATTGGGCA AGAGAAGGCG 
GACAGGGTGA AGCAAATTCT CATCGTGATA 
TTAAAGCAGT GTCTGATTAT AAAGAACATG

AGCAAAAAAT AAAGCTTATC GATACATCTA 
GTCCACTGTC TGCATATAGA GGATTCTTTG 
ATTTAGAGTC AGTAGGAAAA TCACCAATTA 
ATAGAGAAAC TTTAATAGCA CGAATTGAGC 
ATGACCATGA CTTTGAAAAA CATATGTATG 


CAACATCAAA TACACCACAT ATTGGTGAAC 
ATCAAATGCC ACAATCACAA ATAACGCAGC 
AAGCGATGGG TGGTAAAGTA AATACGCATT 


AACCTTCAAA CCAACAACAA AGATTAGCGA 
TATftGATTT TTAAAAAGCA ACAATGAAAC 

GGTTAATAAT CAAGACGCAT ATACTTTTAT

ACTGAATTAT ATAAGGAGAG GTAGCAATGA 
CGATGATGGC TGTCGGTACA GGTGCATTTG 

ATCACTATTT ATCAGTATGG GAAAAAGCAA

TATTAATTAT AGGTGTAATT AGTGGTACAA 
TAATATTTGC TGGTATTATT TTCTTTAGTG 


TTAAAGTTTT AGGTGCGATT ACGCCAATTG 
TGTTAATCAT TGCGACATTC AAATTTGCTG 
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TATCATTTTT AACCTAATTG AAAAATATTA 1404 0 

AATCATTCAT ACTATTGTAA CTTTTGAAAA 14100 

TAAAATATGT CCTGAGGTGA GATTGAATGG 14160 

CGAAACATGA CTTTAAAGCT ATGCATGATT 14220 

TATACCCTGA TAGGGAAAAT ATATATCAAG 14280

AAGTTGTTAT ATTAGGACAA GACCCGTATC 14340 

TTTCAGTGCA ACCTAACGCA AAATTCCCTC 14400 

CAGATGATAT TGGATGCGTT AGACAAACAC 14460 

TCTTGTTATT GAATACAGTT TTAACCGTAA 14520 

TTGGTTGGGA AACATTTACT GATGAAATTA 14580 

TTGTCTTTAT TTTGTGGGGG AAACCTGCAC 14640 

AACATTGTAT TATAAAATCA GTGCATCCTA 14700 

GATCAAAACC GTATTCCAAA GCGAATGCCT 14760 

ATTGGTGTGA AAGTGAGGCG TAGATGTIGA 14820 

AAGAATTAGT ACAAGCAGAG CAGGCACAGC 14880

CCATACATAT ATTAACATCT TTATATGCTT 14940 

AACAAATGAA TCGTCGTATT GCTAACCATA 15000 

CAACTCATCA AGTGACAGTT GCTGAAATTG 15060 

CAGCACATCA TCATAATAAG TCATATTCAC 15120 

CAGATGATGA CATTGGCAAT GGTGAATCCA 15180 

ATAATTACTT AATAGCTTGT TAAGTATGTA 15240 

TCGAGTGTTC GGATTTAAAC ATTTATTAAT 15300 

AATTATTTAT TATTTTAGGT GCATTAAACG 15360 

GTGCX3CATGG TTTACAAGGA AAAATAAGTG 15420 

CGACGTATCA AATGTACCAT GGCTTAGCAT 15480 

CTTCAATCAA TGTTAACTGG GCTGGCTGGT 15540 

GATCATTATA TATTTTAGTA TTAACTCAAA 15600 

GTGGCGTATT GTTCATCATT GGATGGATAA 15660 

GTTAAATTTT AAAACTTTAG ATTACCTATG 15720 
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TGGGTATAGA 
ATAGAGGCGA 
CATGTATCGG 
TTGCAGCATC 
ATGGCGCATT 
GTTTCGGCAG 
TCGTTGCTTT 
TAAATAATGG 
TTGCGACCXST 
GATCATTACA 
GTTCATTCTT 
AAGGATGGTT 
TTGATAATAT 
TTATCGTGTA 
GTTGGTTATC 
CACAAACAGC 
TATTTACTGG 
GTTCAGGTAT 
TCGCAATCAT 
TGACTTGGAT 

tgtcSgcagc 
aaacgtttgc 
caggttctcc 

TCX5GTTTAAT 
TAASTCGCTT 
AAAAAGAAAA 
ATCGTGCQAT 
TAATTGCTAA 
TTCACGAACC 



ATACCTTCGA 
TTTAAAACAA 
ATGGGGCGCA 
AATTGGTATA 
AGTAGAGAGA 
ATATGTGAGT 
AAAtGCGACC 
GAAACTATAC 
ATTACTACTT 
ATATTATTTC 
TGGTAATAAT 
AGTGTCTATT 
TCCACAAACA 
CAGTTTATTA 
AACAAGTCAT 
ATTTGGTTAT 
TTTAAATGGA 
TATGCCAACA 
ATTCCTAGTA 
TGTAGATATG 
GAAATTATTC 
TATTATCGOC 
TGCAGCACTG 
ATTCTTTGTG 
GATTTTAAAT 
AACTAAATAA 
TTTTTGTATT 
GAGTTAGGGC 
CAGAAACAAT 



GGTGAGTTTT 
AACCTATCTG 
TTCATCTTAC 
GTTATTGGTG 
TTTCCAGTAT 
TTCTTCTCAT 
GCATTCAGTT 
ACCATTGCGG 
GTATTCATGC 
TGTGTGGOGA 
TTTGCACTTG 
GTGGTTATTG 
GCAGAAGAGT 
GCAGCATCAT 
CAAAGTTTAA 
ATTGGATTAG 
TTCTTGATGA 
ATGTTTAGTA 
GGAGTGTCGT 
TCATCTACTG 
AGTTATAACA 
TCATTTGTAT 
ACTGCACCGT 
ATTCGATATC 
AGAASTGAAA 
TAAAAGAATC 
ATAAATTCAC 
TGAGCCATTT 
TAATTTGGAA 



TATTTATGGA 
AAAAGTTTGT 
CAGGAGACTG 
CATTATTAAT 
CAGGGGGCGC 
CATGGTTTTT 
TACTAGTTAA 
GCTGGGACGT 
TAGTAACGAT 
TGGTAATCGT 
AAAATTTACA 
TATCCGTOGC 
TTAACTTTGC 
TAACTTATGT 
ATGGGCAGTT 
GTGTATTAGC 
GTTCAAGTCG 
AATTACATAG 
TAATTGCACC 
GTGTATCCAT 
AACAAAGTAA 
CATTCATTTT 
CTTATATTGC 
CTAAATTGAA 
ATGAAGTTGA 
GCACAATAAA 
ATTTAAGACG 
CTAACAAATA 
ATTTGGTCGG 



AAAAAAGAAT AAGCAAATAG 15840 

ATGGGCGATT GCATATGOTT 15900 

GATTAAGCAG TCAGGTCCGA 15960 

GATATTAATT GCGGTTAGTT 16020 

GTTTGCCTTT AGTTTCTTAA 16080 

AACTTTTGGT TATGTCTGTG 16140 

ATTCTTATTG CCAGATGTCT 16200 

TTATATTACG GAAATCATTA 16260 

TCGTGGCX5CA AGTGTATCTG 16320 

CGTATTATTG ATGTTCTTTG 16380 

ACCGTTAGCT GAACCTAGCA 16440 

ACCATGGGCA TATGTTGGAT 16500 

ACCAAACAAG ACATTTAAGC 16560 

TGTCATGATT TTATACACTG 16620 

GTGGTTAACA GGTGCTGtTA 16680 

AATTGCAATT ATGATGGGTA 16740 

CTTGTTATTT TCTATGGGAC 16800 

TAAATACAAA ACACCATATG 16860 

TTGGCTAGGA AGAACTGCAT 16920 

TGCCTACTTT ATTACATGTT 16980 

TACGTATGCA CCGGTTTACA 17040 

CTTAGCGTTG TTATTAGTGC 17100 

ATTACTTGGA TGGTTAATCA 17160 

AAATATGGAT AATGATGAAT 17220 

TGATATGATT GAAGAACCTG 17280 

CCTTCTTCAT TCGGAGGCGT 17340 

AGGCAGCTGA ACCTTATATA 17400 

TTTATAATCG TTTAAAAGAT 17460 

CGAATAATAA ACCTAATGCG 17520 
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AAGACTAAAT TTTTTGTAGC ATCX3TATGCT AAGCCACCAG GTACTAATGG AATGATACCC 17640 

GTTACCATAA AAATGATGGC AGGTTCTTTT TGTTTACGAG CCATATAATG ACTTAACAAG 17700 

CCTAATGCTA AACTACCAAA GAAACTAGAG TATATAGTGT GCACATTAAA GCCGTTGAAG 17760

AATAAGGTGT AAACCATCCA TCCACACGTA CCAACGAAAC CACATGATAG ATATAATTTT 17820 

CTAGGTGCAT CAAAAATGAC GCAGAA 17846 

(2) INFORMATION FOR SEQ ID NO: 110:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5544 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110:





ATTGACACTT GGTGAAAGTA ATATCGCCGC GCTATTTTGG CAAAATGGAC ACTTAGAACC 60

TGAGTTACAA GATGAACAGC CAATTAATAT ATTAGGATCT GkTCAAATCA ACGAATGGAA 120

TGGTAATCAA TCACCGCAAA TAATTATTCA AGATATTGCG ATGAATGAAC AGCAAATATT 180

AGATTATAGA AGTAAGCGAA AAAGTTTACC TTTTACAGAA AATGATGAAA ATATTGTCGT 240

GCTTATTCAT CCTAAAAGTG ATAAAGTAAA TGCGAATGAA TATTATTATG GTGAAGAAAT 300

TAAACAACAA ACTGATAAAG TAGTATTAAG AGATTTACCA AC6TCAATGG AAGACTTGTC 360

TAATTCCTTG CAACAACTGC AATTTTCTCA ACTTTATATA GTTTTGCAAC ATAATCATTC 420

GATTTACTTC GATGGTATAC CTAATATGGA TATTTTTAAA AAGTGTTATA AAGCATTAAT 480

AACTAAACAA GAAACAAATA TCCAGAAAGA GGGTATGTTA TTGTGTCAAC ATTTAAGTGT 540

GAAAS:CAQAT ACACTTAAAT TCATGTTGAA AGTTTTCTTA GACTTAAAAT TTGTAACACA 600

AGAAX^TGGT TTAATTCGAA TCAATCAACA ACCTGATAAA AGATCGATTG ATTCCAGCAA 660

AGTATATCAA TTAAGACAAC AACGTATGGA TGTTQAAAAG CAATTATTAT ATCAAGATTT 720

TTCAGAAATA AAAAATTGGA TAAAGTCACA ATTGTCGT6A GCAATTTAGG AGGAAATATT 780

AATGGATTTA AAGCAATACG TATCAGAAGT TCAAGATTGG CCGAAACCAG GTGTTAGTTT 840

CAAGGATATT ACTACAATTA TGGATAATGG TGAAGCATAT GGCTATGCAA CAGATAAAAT 900

TGTAGAATAC GCAAAAGACA GAGATGTTGA TATCGTTGTA GGACCTGAAG CGCGTGGCTT 960

TATCATTGGC TGTCCTGTAG CTTATTCAAT GGGGATTGGC TTTGCACCTG TTAGAAAAGA 1020

AGGGAAATTA CCTCGTGmAG TCATTCGTTA TGAGTATGAC CTAGAATATG GTACAAATGT 1080
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ATTAGCTACT GGTGGTACGA TTGAAGCAGC 
CGTAGTAGGT ATTGCATTTA TAATTGAATT 
AGATTACGAT GTTATGAGTT TAATCTCATA

AATGAAATCC TTCATCAAAT GTATAAGAAC 
TTTCTTAACA TGAGATGTTA GGATTTTTTA 


ATACCTTAAT AACATCGTTT ATTTATTTCA 
AAAAATGAAA CAGTAGATTT AGGTCGAATT 
TACAAATTAA ACTCGCTCAA GTAAAATTAA 
TTATCGTCX3A CGGACGTATG ATTGGTGTGG 
TCATTGTTTA AGGCGAAGTA ATAAATATGA 
ATCCATATAG TGCAGACGAA tTCTTCACAA

TGAGTATOTT TTAAAAAGCT ATCATATTGC 
AAACXKSATTA CCATACATTA TGCATCCTAT 
ATTAGACXX5A CCGACX5ATTG TCGCAGGTTT 
TACATTTGAA GATGTAAAAG AAATXSTTCAA 
GACGAAGCTT AAAAAAGTAA AATACCGCTC 


CAAGTTATTT ATTGCGATTG CCAAAGATGT 
ATTACATAAT ATGCGTACCT TGAAAGCCAT 
AGAAACATTA GAAATTTATG CACCATTAGC 


GQAACTAGAA GATACGGCTC TTC6TTATAT 
TTTAATGAAG AAGAAAGGTA GTGaACGTGA 

ACGTACTGAA ATGGACCGAA TGAATATCGA

TTACAGTATT TATCGGAAAA TGATGAAGCA 
GTTGGCGATA CGTGTTATTG TCAATTCTAT 

GCATACGTTA TGGAAACCGA TGCCAGGACG

AAATTTGTAT CAGTCATTGC ATACTACAGT 
CCAAATACGA ACGTTTGATA TGCACGAAAT 


TTACAAAGAA GGTAAAAAAG TAAGTGAAAA 
GTTAAAAGAA TTAGCTGAAG CGGATCATAC 




AATAAAATTA GTTGAAAAAT TAGGCGGTAT 1200 

GAAATATTTA AATGGTATTG AAAAAATTAA 1260 

CGACGAATAA TAAATAATAT AATTTTATCA 1320 

CAATGACTTA ATTAAAAAAG TTGTTTAAGT 1380 

TTTACTGAAA ATGTTAGATG ATTGAGCATT 1440 

TAAATTGTAG TATCATAGAA CTAATATTTA 1500 

TTTGTAAAAG TTTTAAAAGT AGGAATAGTA 1560 

TATTACGATT AATGACGACA GGATAAATAT 1620 

GACAAATACT ATTCAACAAG A6TACCTAAA 1680

ATGGGGTGTA TCATATAATG AACAACGAAT 1740 

AGCAAAATCA TATTTGTCAG CAGATGAATA 1800 

TTATGAAGCA CATAAAGGTC AGTTCCGAAA 1860 

ACAAGTTGCA GGTATTTTAA CAGAAATGCG 1920 

TTTGCATGAT GTAATTGAAG ATACACCGTA 1980 

TGAAGAAGTT GCTCX3AATTG TTGATGGTGT 2040 

AAAAGAAGAA CAACAAGCTG AAAATCATCG 2100 

ACGCGTAATT TTGGTGAAAT TAGCAGACAG 2160 

GCCGCGCGAA AAACAAATTA GAATTTCTCG 2220 

ACATCGTCTT GGTATTAATA CAATCAAATG 2280 

TGATAATGTG CAATATTTTA GAATAGTCAA 2340 

AGCGTATATC GAAACGGCTA TTGATAGAAT 2400 

AGGCGATATA AATGGTAGAC CTAAACATAT 2460 

GAAAAAACAA TTTGATCAAA TTTTTGATTT 2520 

TAATGATTGT TATGCGATAC TTGGGTTGGT 2580 

TTTTAAAGAT TATATTGCAA TGCCTAAACA 2640 

AGTAGGCCCA AATGGAGACC CGCTCGAAAT 2700 

TGCTGAGCAT GGTGTTGCAG CACACTGGGC 2760 

AGATCAAACT TATCAAAATA AGTTAAATTG 2820 

ATCGTCTGAC GCTCAAGAAT TTATGGAAAC 2880 
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10 






TAGTGCTTGA TCCTGGTCAT GGAGGTAGTO ACCAGGGTGC TTCAAGCAAT ACTAAATATA 4800 

AAAGTTTAGA AAAAGATTAT ACGTTGAAAA CAGCAAAAGA ATTGCAGCGT ACTTTAGAAA 4860 

AAGAAGGCGC AACTGTTAAG ATGACAAGAA CAGACGATAC ATATGTTTCA CTAGAAAATC 4920 

GTGATATCAA AGGCGATGCC TATTTGAGTA TACATAATGA TGCGTTAGAA TCATCTAATG 4980 

CAAATGGAAT GACaGTTTAT TGGTATCATG ATAATCAAAG AGCTTTAGCA GATACGTTAG 5040

ACGCTACGAT TCAGAAGAAA GGTCTACTTT CTAATCGCGG TTCAAGACAA GAAAATTATC 5100 

AAGTGTTAAG ACAAACAAAA GTTCCTGCTG TTTTATTAGA ATTAGGTTAT ATTAGTAACC 5160 

CAACTGATGA AACGATGATT AAAGATCAAT TACATAGACA AATTTTAGAA CAAGCAATTG 5220 

TTGATGGCCT TAAAATTTAT TTTTCTGCGT AGGGCTTGCA AAAATATGTG AAAGTAGTTA 5280 

TCATTGATAT TGAATTTTAT AACTAAAACC GTTAGTATTC TTGAAATGGT AAATOAAATA 5340 

GGTAGCAATC TAACTAAGAT TGTGTAGGAA TATAATCCAT AGACTGAAAG ATTATGCTGA 5400

GTAGTTTATA TACATTGAAC ACAAGAAGAG GTGCTTTATG AAAAGTAAAG CCGTTAAACG 5460 

TACGTTaAAC GTTTTGAGTG GGTTTATTAA ATGCACGCTT ATAAAAAGTA ATGATGATTA 5520 

CAATTAGGCA TGTTTTTTAA ACCA 5544 

(2) INFORMATION FOR SEQ ID NO: 111:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1067 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111:

AAATOATTGC AAATATAAAT GGCATGTTTA ATATGTTAGA ACAACAAATC ATTCATAGCC 60 

AAGATATGGC TCATTTTAGA AGTGAATTTT TTTACGTCAA TCATGaGCAT CGAGAAAACT 120

ATGAAgCACT CCTAATTTAT TACAAAAATA GTATCGACAA TCCTATTGTA GATGGTGCAT 180 

GTTATATTTT AGCCCTACCT GAAATTTTCA ATAGTGTTGA TGTTTTOQAA TCAGAGTTAC 240 

CATTTTCATG GGTATATGAT GAAAATGGCA TTACCX3AAAC AATGAAATCA CITAGCATTC 300

CATTACAATA TTTAGTTGCA GCAGCTTTAG AAGTAACTGA TGTGAATATA TTTAAGCCTT 360 

CAGGATTTAC AATGGGAATG AATAATTGGA ATATTGCTCA AATGCGAATC TTTTGGCAAT 420 


ATACAGCAAT TATTAGAAAA GAAGCACTAT AACATTAATA ATTAATTAGC TATAAAGATG 480 

ATTCACAACA ATCATCTTTA TAGCTTTTTT ATGTCTAATT ATTTTTGAGG AAAATinACAA 540 

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AATTTTATGT TTTCAAAAGT AAACAATCAA AAGATGTTAG AAGATTGCTT CTATATAAGA 660 

AAGAAAGTGT TTGTAGAAGA ACAAGGCGTC CCTGAGGAAA GTGAAATTGA TGAATATGAA 720 

TCTGAATCTA TTCACCTCAT TGGATATGAT AATGGACAGC CAGTTGCCAC TGCTCGAATA 780

CGCCCTATTA ATGAAACAAC TGTCAAAATA GAACGAGTAG CTGTGATGAA ATCACATCGT 840 

GGACAAGGAA TGGGTAGAAT GCTTATGCAA GCTGTAGAAT CATTAGCTAA AGATGAAGGT 900 


TTTTACGTAG CTACTATGAA TGCCCAATGT CATGCTATCC CATTTTATGA AAGTTTAAAC 960 

TTTAAAATGA GAGGTAATAT ATTTCTTGAG GAAGGCATCG AGCATATTGA AATGACAAAA 1020 

AAGTTAACCT CGCTTAATTA AAAAAAGTTG TATCTATTTT AGAAACA 1067 

(2) INFORMATION FOR SEQ ID NO: 112:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18613 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 





AAGACGtAtG ATAACAACAA TACgTGTAGT GAAAGATTTT AATCTACATA TTACTGACAA 60

AGAATTCATT GTATTTGTTG GACCATCGGG ATGTGGTAAA TCAACAACAT TACGAATGGT 120

TGCTGGACTA GAGTCTATCA CATCTGGAGA TTTTTATATT GATGGGGAAC GCATGAACGA 180

TGTTGAACCA AAGAATAGAG ATATTGCGAT GGTATTTCAA AACTATGCAT TATATCCACA 240

TATGACTGTT TTTGAAAATA TGGCATTTGG GCTAAAGCTA CGTAAAGTAA ATAAAAAAGA 300

GATTGAACAA AAAGTTAATG AAGCAGCTGA AATATTAGGA TTAACTGAGT ATCTTGGTCG 360

TAAftCCAAAA GCGTTATCTG GCGGACAGCG TCAACGTGTT GCTTTGGGCA GAGCTATTGT 420

TAGGGATGCG AAAGTCTTTT TAATGGATGA ACCATTATCG AATCTTGATG CGAAyTtCGA 480

GTACAAATGC GCACAGAAAT ATTGAAATTA CATAAGCGAC TTAATACTAC GACAATTTAT 540

GTTACACATG ATCAAACTGA AGCATTGACG ATGGCTAGTC GAATTGTTGT TTTGAAAGAT 600

GGCGACATTA TGCAAGTCGG CACACCTAGA GAAATATATG ATGCCCCTAA TTGCATATTT 660

GTGGCGCAAT TTATCGGCTC ACCAGCAATG AATATGTTGA ATGCTACAGT T6AAATGGAC 720

GGATTGAAGG TAGGAACACA CCATTTTAAA TTACATAATA AAAAATTTGA AAAGTTAAAA 780

GCTGCTGGCT ACTTAGACAA GGAAATTATT TTAGGTATTC GAGCTGAAGA CATTCATGAA 840

GAACCAATAT TTATTCAAAC TTCTCCAGAG ACACAATTTG AATCTGAAGT AGTTGTATCC 900

AAATTAGATT CAAGAACTCA AGTGATGGCX; AACGACAAGA TTACACTAGC ATTTGATATG 1020

AATAAGTGTC ACTTTTTTGA TGAAAAAACA GGAAATCGTA TCGTCTAAGG GGGAGTATTC 1080

ATGTCTAAAA TTTTAAAATG TATCACGTTA GCCGTGGTAA TGTTATTAAT CGTAACTGCA 1140

TGTGGCCCTA ATCGTTCGAA AGAAGATATT GATAAAGCAT TGAATAAAGA TAATTCTAAA 1200

GACAAGCCTA ACCAACTTAC GATGTGGGTG GATGGCGACA AGCAAATGGC GTTTTATAAA 1260

AAAATTACGG ATCAATATAC TAAAAAAACT GGCATCAAAG TAAAGCTTGT AAATATTGGT 1320

CAAAATGATC AACTAGAAAA TATTTCGCTA GACGCTCCTG CAGGAAAAGG TCCAGATATC 1380

TTTTTCTTAG CACATGATAA TACTGGAAGT GCCTATCTAC AAGGCTTAGC TGCTGAAATC 1440

AAATTATCAA AAQATGAGTT GAAAGGTTTC AATArGCAAG CACTTAAAGC GATGAATTAT 1500

GACAATAAGC AACTAGCATT GCCAOCTATC GTTGAAACAA CCGCACTTTT TTATAATAAA 1560

AAATTAGTGA AAAATGCACC GCAAACGTTA GAAGAAGTTG AAGCTAATGC TGCCAAACTA 1620

ACTGATAGTA AAAAGAAACA ATACGGTATG TTATTTGATG CTAAAAATTT CTATTTTAAT 1680

TATCCGTTTT TATTCGGCAA TGATGATTAT ATTTTCAAGA AAAATGGCAG TGAATATGAT 1740

ATTCATCAGC TAGGACTAAA TTCAAAACAT GTCGTCAAGA ATGCTGAACG ATTACAAAAA 1800

TGGTACGACA AAGGGTATCT TCCTAAGGCA GCAACACATG ATGTCATGAT TGGTCTTTTT 1860

AAAGAAGGAA AAGTAGGACA ATTTGTCACT GGACCGTGGA ACATTAATGA ATATCAAGAA 1920

ACGTTTGGTA AAGATTTAGG AGTAACAACA TTACCTACAG ATGGTGGCAA ACCTATGAAA 1980

CCATTTCTAG GTGTACGTGG TTGGTATTTA TCTGAATATA GTAAACATAA GTATTGGGCT 2040

AAAGATTTAA TGCTGTATAT CACTAGTAAA GATACATTAC AAAAATATAC AGATGAAATG 2100

AGCGAAATTA CTGGACGTGT TGACGTGAAA TCATCTAATC CAAATTTAAA AGTGTTTGAA 2160

AAG»AGCAC GTCATGCTGA ACCGATGCCT AATATTCCTG AAATGCGACA AGTTTGGGAA 2220

CCGATGGGCA ATGCAAGCAT ATTTATTTCA AATGGTAAGA ATCCTAAACA AGCGTTAGAT 2280

GAGGCQACGA ATGATATAAC GCAAAATATT AAGATTCTTC ATCCATCACA AAATGATAAG 2340

AAAGGAGATT AGTTATGACG AAACGTAACC CTAAATTAGC GGCATTATTA TCTGTTATAC 2400

CTGGTTTGGG ACAGTTTTAT AATAAAAGAC CCATTAAAGG GACGATATTT TTTATCTTTT 2460

TCATCAGTTT TATTTCTGTT TTTTATAGCT TTTTAAATAT TGGTTTTTGG GGATTGTTCA 2520

CATTAGGGAC AGTACCTAAG TTAGACGATT CTCGTGTCTT ACTTGCACAA GGTATTATTT 2580

CTATCTTACT CGTTGCTTTC GCAATCATGC TATATATCAT TAATATTTTA GATGCATATC 2640

GTAATGCTGA ACX3ATTTAAT CGCAATGAGG AAATAAAGGA TCCGAAGcGC GTATGGTGGC 2700

TGTAGTTGTA TTTCCATTAA TAyyTATGTT TGGAGTAGCA TTTACAAATT ACAATTTATA 2820

CAACGCGCCT CCX3AGACACA CATTAGAATG GGTTGGTTTA GATAACTTTA AAACGTTATT 2880 

CACAATTGGC GTTTGGCGTA AAACATTTTT CAGTGTTATT ACTTGGACAT TAGTATGGAC 2940

GCTTGTTGCA ACGACACTTC AAATTGCATT AGGGCTGTTT TTGGCAATTA TTGTAAATCA 3000 

CCCTGTCGTC AAAGGTAAGA AATTTATCCG TACTGTGTTA ATCCTACCTT GGGCTGTACC 3060 

ATCATTTGTG ACAATTTTAA TATTTGTAGC GTTATTTAAT GATGAATTTG GTGCGATAAA 3120 

TAATGATATT TTGCAACCTT TATTAGGTGT AGCACCAGCA TGGTTAAGTG ATCCGTTTTG 3180 

GGCAAAAGTG GCATTAATCG GCATTCAAGT ATGGCTIGGA TTCCCATTTG TCTTTGCACT 3240 

GTTCACTGGA GTACTGCAAA GTATTTCATC AGATTGGTAC GAAGCAGCAG ATATGGATGG 3300 

TGCGTCTAGT TGGCAAAAGT TTAGAAACAT CACATTCCCG CAIGTCAITT ACGCCACAGC 3360 

GCCATTGTTA ATTATGCAAT ATGCAGGTAA TTTCAATAAT TTTAATCTTA TTTATCTATT 3420

TAATAAAGGC GGTCCACCAG TGTCAGGGCA GAATGCTGGT AGTACAGATA TCTTGATATC 3480 

TTGGGTGTAT AATCTGACAT TTGAGTTTAA CAACTTCAAC ATGGGTGCAG TTGTGTCATT 3540 

AATTATTGGA TTTATTGTTG CTATTGTCGC ATTTATTCAA TTCAGACGTA CAAGTACGTT 3600

TAAAGATQAG GGAGGTTTAT AAGATGACAA AGAAGAAAAA CATATTAAAA GCAATCGGTA 3660 

•nTACAGTTT TATAGCGATG ATGTTTGTCA TCATTTTATA TCCACTACTG TGGACATTTG 3720 

GCATTTCCCT TAATCCAGGT ACGAACTTGT ATGGTGCCAA AATGATACCA GACAATGCAA 3780 

CATTTAAAAA TTATGCATTC TTACTATTOG ATGACAGTAG TCAATACCTG ACTTGGTATA 3840 

AAAATACGCT TATCGTAGCA TCTGCAAATG CACTGTTTAG TGTGATATTT GTCACGTTAA 3900 

CAGCATATGC TTTTTCTAGA TATCGCTTTG TTGGTCGTAA ATACGGGCTG ATTACATTTT 3960 

TGA'mrTACA AATGTTCCCT GTATTAATGG CAATGGTCGC AATCTATATT TTGCTAAATA 4020 

CAATTGGATT ATTAGATTCT TTATTTGGAC TAACACTGGT ATATATTGGT GGATCAATAC 4080

CGATGAATGC CTTTTTAGTG AAAGGTTACT TCGATACGAT TCCAAAAGAA CTTGATGAAT 4140 

CTGCCAAAAT TGATGGTGCA GGGCATATGC GTATTTTCTT ACAAATTATG CTTCCATTAG 4200 

CTAAGCCGAT TTTAGCAGTT GTTGCTTTGT TCAATTTTAT GGGGCCATTT ATGGACTTTA 4260

TATTACCTAA AATACTATTA AGAAGTCCTG AAAAATTCAC ATTAGCAGTT GGATTGTTCA 4320 

ACTTTATTAA TGATAAGTAT GCAAATAATT TCACAGTGTT TGCAGCAGGG GCAATTATGA 4380 

TTGCAGTACC TATAGCAATC GTATTCTTGT TCTTGCAACG CTATTTAGTA TCAGGTTTAA 4440 

CAACAGGTGC GACAAAAGGT TAGTTTGAAA TTAGGAGTGG GGCAGAATTO ATAAAGAACC 4500 

GGGTGTGGT6 GTATTGCGAA TGGCAAGCAC ATGCCAA6TT TACAAAAAGT TGAAAATGTT 4620

GAAATGATCX3 CATTTTGTGA CGTAGACATT TCGAAAGCAG CGAGTGCGGC AGAAGCATAC 4680

GGAACTGACA ATGCAAAGGT TTATGATGAT TACAAAGCAT TGTTAAAAGA TGACACGATT 4740

GATGTTATCC ATGTTTGTAC GCCAAATGAC TCGCATTGTG AAATTACTGT AGCAGGGTTG 4800

CATGCTGGTA AACATGTGAT GTGTGAAAAA CCAATGGCTA AAACGACAGC AGAAGCTCAA 4860

AAAATGATAG ATACAGCTAA ATCAACAGGT AAAAAATTAA CAATAGGTTA TCAAAATCGT 4920

TTCCGAGCAG ATAGTCAATT TTTACATCAA GCAGCGCAAC GTGGCGACTT AGGAGACATT 4980

TACTTCGGAA AGGCACATGC CATTCGTOGT CGAGCAGTAC CAACATGGGG TGTCTTTCTA 5040

GACGAAGAAG CTCAAGGTGG AGGACCATTA ATCGATATCG GTACACACGC TTTAGATTTA 5100

ACGTTATGGA TGATGGATAA TTATGAACCA GAATCAGTGA TGGGTTCAAC ATTCCATAAA 5160

TTAAATAAAC AGCATCATGC GGCAAACGCT TGGGGTTCAT GGAATCCAGA TGAATTTACA 5220

GTTGAAGATT CTGCGTTTGG ATTTATTAAA ATGAAGAATG GAGCGACGAT CATTTTAGAA 5280

TCCQCTTGGG CGATTAATTC TTTAGAAGTG GATGAGGCAA AATGTTCATT ATCAGGAACT 5340

AAAGCAGGTG CTGATATGAA AGATGGTCTA CGTATTCATG GTGAAGACAT GGGTACACTT 5400

TATACCAAAC ACGTTGAATT GGAAAACAAA GGCGTCGACT TTTATGAAGG TAATGAAGTG 5460

GATGAAGCTG AAGAAGAAGC AAAAGCTTGG ATTGATGCAG TTGTAAATGA TACTGAACCA 5520

GTTGTGAAAC CGGAACAAGC AATGGTAGTT ACAAAAATTC TTGAAGCQAT TTATCAGTCT 5580

GCAAAATCAG GCAAAGCAAT TTACTTTGAA TAACATCATA CGGTAAGGAG GCACATCATG 5640

ACAAAATTAA AAGTTGGTGT GATAGGTGTT GGTGGTATTG CACAAGACCG TCATATTCCA 5700

GCATTGCTGA 7VACTCAAAGA CACAGTCTCA TTAGTTGCAG TACAAGATAT TAATACAGTG 5760

CAGMGATTG ATGTTGCGAA gCGCTTTAAT ATACCTCATG CAGTTGAGAC ACCTAGCGAG 5820

CTGTTTAAAC TTGTTGATGC GGTGGTCATT TGTACACCTA ATAAATTCCA TGCTGATCTT 5880

TCTATAGAAG CATTGAACCA TGGTGTCCAT GTATTGTGTG AAAAGCCAAT GGCGATGACG 5940

ACGGAAGAGT GTGATCGCAT GATTGAAGCG GCTAATAAAA ATCACAAATT ATTAACTGTC 6000

GCATATCATT ATCGTCACAC AGATGTGGCA ATTACTGCTA AAAAAGCAAT TGAATCAGGT 6060

GTGGTTGGTA AACCTTTAGT AGCACGTGTA CAAGC6ATGC GTAGGCGTAA AGTGCCTGGC 6120

TGGGGTGTTT TTACCAATAA AGCGTTGCAA GGTGGCGGTA GTTTAATCGA TTATGGTTGC 6180

CACTTGTTAG ACTTATCTTT GTGOCTACTA GGTAAAGATA TGGTGCCGCA TGAAGTGCTA 6240

GGAAAAACAT ATAATCAATT GAGCAAACAA CCGAATCAAA TTAATGATTG GGGAACATTT 6300

GCAAGCATGC AGTTTGAATG TTCGTGGTCT GCAAATATCA AAGAAGATAA GGTTCACGTT 6420

AGTTTATCAG GAGAAGATGG CGGTATCAAT TTATTTCCAT TTGAAATATA TGAGCCCCGC 6480 

TTTGGAACTA TTTTTGAAAG CAAAGCTAAT GTTGAGCATA ACGAAGACAT TGCTGGTGAG 6540 

AGACAGGCGC GTAACTTTGT CAATGCGTGT TTAGGGATAG AAGAGATTGT GGTGAAACCG 6600 

GAAGAAGCAC GCAATGTAAA TGCCCTTATA GAAGCGATTT ATCGTAGCGA TCTTGATAAC 6660 

AAGAGCATAC AACTTTAATG ATTATCATAT ATGATACAAA ATTCTCAATA TAAAAAGAAG 6720 

GAGTGCTTTT CAATGAAAAT AGGTGTATTT TCAGTATTAT TTTACGATAA AAATTTTQAA 6780 

GATATGTTAG ATTATGTCTC AGAATCTGGA TTOOATATGA TTGAAGTTGG AACAGGTGGT 6840 

AACCCAGGA6 ATAAATTTTG TAAGTTAGAT GAGTFGTTAG AAAATGAAGA CAAGCGCCAA 6900 

GCATTTATGA AGTCAATCAC AGACAGAGGC TTACAAATAA GTGGTTTCAG TTGTCATAAC 6960 

AATCCAATTT CTCCAGATCC GATAGAAGCG AAAGAAGCCG ATGAAACGTT ACGTAAAACA 7020

ATCCG1TTAG CAAATCTATT AGAOGTGCCA GTTGTTAATA CATTTTCTGG CATTGCAGGA 7080 

TCAGATGATA CCXKTAAAAA GCCTAATTGO CCTOTTACAC CTTGGCCAAC AGCCTACTCT 7140 

GAAATTTATG ATTATCAGTG GAATGAAAA6 TTGATACCAT ATTGGCAAGA TTTAGCTGAG 7200 

TTTGCAAAAG AGCAAGATGT AAAAATTGCC ATAGAGTTGC ATGCAGGATT TTTAGTGCAT 7260 

ACACCATATA CAATGTTGAA GTTACGTGAG GCTACAAATG AATATATCGG TGCTAACTTA 7320 

GATCCTAGTC ATCTATGGTG GCAAGGTATT GACCCAATTG CTGCGATTCG CATATTAGGC 7380 

CAAGCAAATG CAATTCATCA CTTCCATGCT AAAGATACGT ATATTAATCA AGAAAATGTA 7440

AATATGTATG GTCTAACTGA TATGCAACCA TATGGTAACG TTGCGACAAG AGCATGGACA 7500 

TTCCGTACAG TTGGTTATGG ACATAGTCCA TATGTATGGG CAGATATCAT AAGTCAACTT 7560 

ATTATTAATG GATATGATTA TGTATTAAGT ATTGAACATG AAGATCCTAT TATGTCAGTA 7620 

GAAGAAGGTT TCCAAAAAGC TTGTCAAACT TTGAAATCTG TTAATATTTA CGACAAGCCA 7680

GCAGACATGT GGTGGGCATA ATACGAACTC GAGGTTAGTC TGAAGTTTGT CTGAAGTAAG 7740 

ACTGGTGGCA GTGTTGAATA AATGCATATG TCGCCAAGCC ATTGCCAAAA ATTTCACACC 7800 

TTAAATCAA6 TCATTGTTTG TAAAGAAGGT GTACTTTATA TAAGTATATA GCGATGGTCA 7860 

TACCCATTCA CAGTAACAAT CCTCACCATT GAAAAGAOTA TATAACCTTT TCAATAGTGA 7920 

GGTATATGAT AATAAAAAAA GCCTGTTGTC ACAATGGTCA TAGACACGAC ATACTTTAAA 7980 

GGTTTCTGAA TATAATATTT CAGAATGCAC TTTAAAGATG GACGTCGATG TAGACTAAAG 8040 

TGATGACAGG CTTTCATCTT TTTAAATATT CATTAATTTC TCTTCTTOTT TAATACGTAC 8100 

TAATACACCG ATTAATTCAG GAATGATGTT TAAGAAGTAA TTTGGGTGTT TTGTAATTTT 8220

ATATAATCCA GATTTAATAA TAGGATGGTT AGGTAAAATG AATAATTTTA ATGTCCAAAT 8280

ACCACCTAAA GTTTTAATAA CCATAAATAA CATGATATAA GCAAAGATTA ATATAACTAA 8340

GCCAATACCA TTTGCAAAGC TAAATGTATC TTTATTAATA AATGCCTCTA CACCAGCCAA 8400

TACATAAATT AAAACGTGTG TTATTGCTAA AAACTTCGAA TTTTTAACGC CATATTCAAC 8460

TGCACCGTCT GCTTTTAATT GTTTTGAGTG ATTAATAGAT ATCTTTAAGC TGACAAGTCT 8520

GATACAGAAA AAGATAAGTA ATATAGATAG AATCATGATG TCCTCCGTCA TTATGTCATA 8580

TGTATAAGCG TTGATTTTGA CAACATAAAG TATTTTATAG ATAAAGCTTG TCAAATACTA 8640

TTAACTATTT ATTAATTTTA GTACATAAAT ATGTTTCTAA GTATGTGTTT ATGTTCAGTA 8700

TTTTGGATAA TTTAATAATT TTAAGGATAT TAAGCGCTTA CACCGACGTG ATATATTTGG 8760

CTTAACXvAAA ATGATTGAGG TGACAGAGAT GAACTTTTTT GATATCCATA AGATTCCGAA 8820

CAAAGGCATT CCATTATCGG TACAACGTAA ATTATGGCTT AGAAACTTCA TGCAAGCTTT 8880

CTTCGTAGTG TTCTTTGTTT ATATGGCTAT GTATTTAATT CGAAACAACT TTAAGGCGGC 8940

ACAACCGTTT TTAAAAGAGG AAATTGGATT ATCTACATTA GAACTTGGTT ATATCGGATT 9000

AGCATTTAGT ATCACGTACG GTTTAGGAAA AACATTACTT GGATATTTTG TCGATGGACG 9060

TAACACAAAA CGTATTATCT CGTTCTTACT TATCTTATCT GCGATTACAG TTTTAATTAT 9120

GGGATTTGTT TTAAGTTACT TTGGTTCTGT AATGGGATTA TTAATTGTAC TTTGGGGACT 9180

TAACGGGGTG TTCCAATCAG TTGGTGGACC TGCAAGTTAT TCAACGATTT CAAGATGGGC 9240

GCCAAGAACG AAACGTGGCC GATACTTAGG ATTCTGGAAT ACATCACATA ATATCGGTGG 9300

TGCCATAGCA GGTGGTGTTG CACTTTGGGG TGCTAATGTA TTCTTCCATG GAAATGTTAT 9360

AGGOATGTTC ATTTTCCCAT CGGTGATTGC ATTACTTATT GGTATCGCAA CATTATTTAT 9420

CGGAAAAGAT GATCCGGAAG AATTAGGATG GAATCGTGCT GAAGAAATTT GGGAAGAGCC 9480

GGTCGATAAA GAAAATATTG ATTCTCAAGG TATGAGGAAA TGGGAGATCT TTAAAAAATA 9540

TATCCTGGGA AATCCTGTTA TATGQATTCT ATGTGTTTCA AACGTCTTTG TATACATTGT 9600

ACX5AATCGGT ATTGATAACT GGGCACCGTT ATATGTGTCA GAGCATTTAC ACTTTAGTAA 9660

AGGCGATGCA GTTAATACGA TATTCTACTT TGAAATTGGT GCATTAGTTG CAAGTTTATT 9720

ATGGGGCTAC GTATCAGACT TATTAAAAGG TCGTCGTGCA ATTGTAGCTA TTGGCTGTAT 9780

GTTTATGATT ACATTTGTTG TCTTATTCTA CACAAATGCT ACAAGTGTCA TGATGGTTAA 9840

CATTTCATTG TTTGCATTAG GTGCGTTAAT CTTTGGTCCG CAATTATTAA TTGGTGTATC 9900

CGCGTATCTA TTCGGTGACT CAATGGCGAA AGTTGGTTTG GCGGCTATTG CTGATCCAAC 10020

ACGTAACGGT TTAAACATCT TTGGATATAC ATTAAGTGGA TGGACAGATG TTTTCATCGT 10080

CTTCTATGTT GCATTATTCC TAGGCATGAT TCTATTAGGA ATCGTTGCTT TCTATGAAGA 10140

AAAGAAAATT AGAAGTTTAA AAATTTAATA TAAATCGGAT TAAAAGTATC GCCAATCTAT 10200

TGCAATATAG TTGGCAATCC TGCCCCGACG GCATGTGCGT GAAGAGATGA AAGATACTGC 10260

TTCTACCCTT GCAAATATAT CATCTCTATG TCTCGGGGCA GATCATAATT CCCTGTTATG 10320

AAGTATCCTT ATTTGCCCGA CTTAGGGTGA CTCAATGAAT TTACTCCTTA CAATAAAGAC 10380

ATA1AGGGGT GTCAATATTG TAGGGA6TAT TGTTTTATAT TTAAACTCTC TAAAAAGCGG 10440

ACTGAAAGAA AAGTGAAAAC TTCTCTATCA GTCCGCTTTT TCATAGAACA AAATGGAGGC 10500

GCCATAATCA TTAGTTATGT GCTAATCTAT TTTGCTTGCT TACAATAATC ACTTGGCGAC 10560

ATTTGTAAAT ATTTTTTAAA ATGATAGCTA AACATTTTAT ACTCTGAAAA GCCTACTTTG 10620

TCTGCAATTT GATAGTGTTT GTAATGTCGA TCTAACAATT GCAGAGATTG TAAAATACGA 10680

TAGCX3ATTTA AATAATCGAC AATTGTAATA CCAACATGAT CTTTAAATGT TCGCATCGCA 10740

TACGATTCAC TAACATCGAT ATGTTGAATT AAATCTGAAA CAGtCACTTT CGTTTQATAA 10800

6ATTGCTTAA TTTGATCCAC AATCTGGTTT ACATAATAAT CATCGTATTC TACTTTTAAT 10860

AGTGGTTGGA AGGCATCATG ACAAGATGCT AAGCTACGGC CGTTCTGTGA TTGTTGCTCT 10920

AATAAGGTAC GGACAAGTCT TCCTAAAATA ACTTCTAATT GTGCATGGTC TACTGGTTTT 10980

AATAAATAAT CAAGAACATG ATGTTGAATG CCGGCTTTCA TATATTCAAA GTCATCGTAA 11040

CTCGATAATA TGATGACATT ACAATCTAGA TGCGCAATAT CATTGAGTAA ATCGACGCCA 11100

TTTTTACGTG GCATACGAAT ATCAGTAATT ACTAATTCTG GCTGATGTTG TTGAATTAGT 11160

GATi^TGCTT CAACACCATC TTTAGCAGTG TATATTGTAT TGAAATGATA GTCTCCCCAA 11220

GGAATGATTT GCTTTAATCC TTCTCGAATA ATTCGTTCAT CATCACAAAT AACTACCTTA 11280

AACATCTACA TTCCCCCTTG AAAGTGGTAT TTTATAACAA ATTAACGTAC CTTQATTACG 11340

CTTTGAAAAA ATATGGAGTC GTGCATGTGA ACCATATTGA ATCATTGCTT TATTGTGTAA 11400

ATGATTTAAT CCCAAATGCT TAGTATCAAA TACATCATTA TTAAGAGATT GGCGTACATA 11460

TTGCAGGCGA GATGACGACA TCCCGATACC ATTGTCGCAA ACTAAAACAT GTAAATTCTG 11520

ACGTGCCAAT GTCAGGCGTA TAGTAATGTC CAATGACTCA GTATCTCTAC CATGTTTAAT 11580

AGCATTTTCT ATGAGTGGCT GAAGCATCAT TTTACCAATT GTCTGGTGAC GCGCTTCTTC 11640

AGAACTTTCA ATATGGAGCT TAATCATGTC ATCAAAACGG aTGTTTTGTA TTGCAACATA 11700

GTAACGTAAC ATTTGCGATA ATTGTTGGAC CACAGTTtGT GCTAATTTCG GAGATAACGT 11820

AATTAAATAT TGTATTGTTT GCATCGTATT GAATAGGAAA TQAGGCTGGA ATTGGOGTTC 11880

TATTTCCTTT AACTGAATAT CACGCAAGCG ACGTTCTGTA TGCTCGATAG AATGGATCAG 11940

TTGCTCATTT GATTCAAATA AATCGTAAAT ATAATTATTA ATTTCTTCTA GTTCACTGTT 12000

GTTTTTTAAA GGCGTATATG TACCTAGATG ACGATTTTTG GCATAGTAAA TTTTTTGAAT 12060

AATCGTTTCG ATATCTTTTG TTTGTCGTTT AGCCATATTA TCTGCGCTAA TGAAACCAAA 12120

TATTACTAGT AAAACAAGAA CTACGGCCAT AACAATTAAC AACGTGATAC CATCTTCAAT 12180

GTTTTCATGT ATATCTTTAT AAATAATGAG ACGATGGTCA GCATGGTTTA ATTTTACAGA 12240

TTCATTCATA AATCCX5AATT GTTGTGGTcT ATACTTTTCA CCTATAGTAA AACX3GTCATC 12300

GTTGGCGTAT AAAATATTGT CATATTGATC AmCGATAAGT GCX5AATTGTC GGTTATCTTT 12360

CtTAATTTCA CTTAAACGTG GGGTGTtAGC CATATAAATt TTaAGCATAT ATGTACTATT 12420

TTTGAATTTA AGCTGATGCG TTGAAAATAA ATACATATTT TTAGTGTTTA AATGTTCATA 12480

ATTATTGGTT ATAAACTGAT TTGGTCCAGA TAATTCATAA TAAAGTGTTG CGGGCTGTTG 12540

GkGTATTAAT TTTAATAATT CACGTTTTGT AGCGGTCACA TCATGATGAT TTGyTAAATC 12600

QAGCTCTTGA AACGAATTAT TATGCTGTGT AATAAATGTC TGAATCTGCT TTTCAGTATG 12660

ATGTAAAGAT GACTGACTTT CATCAACATG TTGATGAATC GTACXSATGCT CAATCCAAAT 12720

ATAGATGGCA TAGAAGCTTA CTAGTCCAAT AATAATGACT AAAAATACTG GAAAAATAGT 12780

AGACnCAAAT AACXUVTCGTC TTAATTGATG TCTATAAGGT TTGTATGCCn TCATTGAATC 12840

ATCTCCAAAA ATTTATGATG TGGAATATCC GGTAATTTAG ATTTCGGTAT TAAAGGTATG 12900

TTCTTAAGAT TTTCX5ATAGA CTGATCGCTT TGTTCACTAA CATCCTTTCG AATTGACTTG 12960

GCATCGAACT CTGCAACTAA TCGTtGTTGT ACTGAGCGGC TTGTTAAATA TTGCACTAAC 13020

TTTTTACGCT TAGGATGAGG GTGTGCATTT TTAACTAAAG CAATrCCATC AACATTTAAC 13080

ATTGTTCCTT CAATTGGATA AACGATTGAT ACAGGATAAC CTTTGTTTTT CCATGTGCX3T 13140

GCATCTTGTT CGTAGCTTAG ACCTGCGTAA TATTTACCTT TTGCAACATC TTCAATGACT 13200

TTAGACGTCT TTGACAGTTG CATOSCATGG TTTTGGAATT GATGCACATC ACTTACTCGA 13260

TGATGCATGC TATAAATAGC ACGCATATGT TGATAGCCTG TCGTTGTTGT ATTTGGATTT 13320

GAGTACGCAA TTTTACCTTT AAGTATAGGT TGTAATAAAT CTTGATAACC TCGAATCTTA 13380

ATATCTCCTT GTAAATCTGA ATTCACTACT ATAACTGTTG GCATTAATAG AAAACTAGTA 13440

ACATATTTAT TGTTCGAGCG ATAATCCTCT AATTGCTGTG TTACAGATGT ATCTTGATAG 13500

CCACGCTCCG AAAAATCTTC GTTATGCAAG TTTGAAAGCA GTACTTGAGT AGATCCGTGT 13620

TTAATTTCAA TTTTGACATG CTCTTGTTTT TCAAATTCAT TTAAAATTGG ACGAATCAAG 13680

TTTGATTGAT ACX3GAGAATA AACTGTTAAT ACATTTTTAT CGGATTCAGA GTGACGCGTA 13740

TTAGCGCATG CTGaTAAAAA AATGAGAAAT AATAGCAAGA TATAAATTTT TGATTTCATG 13800

ATATCCCATC AATTCTATGT ATATTTTAAT ACAATAATTT TAGCAATAAA TGACGCATAA 13860

GTAATGTTAA ATATTTAGAA ATGTTTATAG ATGACTTGTT AAGACGTTGC AAATGTTGTG 13920

ATAGCACAAA ATTTTTGTTT GTCAAGACGA TTTACCGAGG CTGTAAAATC AAACTGTTAT 13980

ATTTTATTTG TAGCTGTTAT ATAAAAATCG GCAAGATATT GAACGGTTCA AAAGTGAATT 14040

TTTACGTCAA TAAAAGTATT TAATCCAGTC TCTTCATATA TAAAAGTAAA TCTTTCTAAG 14100

TGTTGATTTA ACGCTTATCA ACAATCATTT TTTATAAACA AATATATACT CCTAAATTAA 14160

CTTTTAAAGC AATGAAAATA GTGAACATTA TAACTGTTGT GTAACAGAAT GCAATTAGCA 14220

TATTACTGTT ACACAAATTA GTACAGTTTC TATGTTTTGA CATACATTTG ATGAAAATTG 14280

TACATAATTT ATGTGAAAAA AATCACAACA AACATGCTAC AATGACTATG AAAACGTTAA 14340

CATAGCATTT CAAATTCACA ACATTATACA GATGGAGGCG TTTAGTATGT TAGAAACAAA 14400

TaAAAATCAT GCAACAGCTT GGCAAGGATT TAAAAATGGA AGATGGAACA GACACGTAGA 14460

TGTAAGAGAG TTTATCCAAT TAAACTACAC TCTTTATGAA GGTAATGATT CATTTTTAGC 14520

AGGACCAACA GAAGCAACTT CTAAACTTTG GGAACAAGTA ATGCAGTTAT CGAAAGAAGA 14580

ACGTGAACGT GGCGGCATGT GGGATATGGA CACGAAAGTA GCTTCAACAA TCACATCTCA 14640

TGATGCTGGT TATTTAGACA AAGATTTAGA AACAATTGTA GGTGTACAAA CTGAAAAGCC 14700

ATTCAAACGT TCAATGCAAC CATTCGGTGG TATTCGTATG GCGAAAgcAG CTTGTGAAGC 14760

TTACGGTTAC GAATTAGACG AAGAAACTGA AAAAATCTTT ACAGATTATC GTAAAACACA 14820

TAACCAAGGT GTATTCGATG CATATTCTAG AGAAATGTTG AACTGCCGTA AAGCAGGTGT 14880

AATCACTGGT TTACCTGATG CATAOGGACG TGQACGTATT ATCGGTGACT ATCGTCGTGT 14940

AGCTTTATAT GGTGTAGATT TCTTAATGGA AGAAAAAATG CACGACTTCA ACACGATGTC 15000

TACAGAAATG TCAGAAGATG TAATTCGTTT ACGTGaAGAA TTATCAGAAC AATATCGTGC 15060

ATTAAAAGAA TTAAAAGAAC TTGGACAAAA ATATGGTTTC GATTTAAGCC GTCCAGCAGA 15120

AAACTTCAAA GAAGCAGTTC AATGGTTATA CTTAGCATAC CTTGCTGCAA TTAAAGAACA 15180

AAACGGTGCA GCAATGAGTT TAGGTCGTAC ATCAACATTC TTAGATATCT ATGCTGAACG 15240

TGACCTTAAA GCAGGCGTTA TTACTGAAAG CGAAGTTCAA GAAATTATTG ACCACTTCAT 15300

AGACCCAACT TGGGTAACTG AATCTATCGG TGGTGTAGGT ATTGACGGAC GTCCACTTGT 15420

TACGAAAAAC TCATTCCGTT TCTTACACTC ATTAGATAAC TTAGGTCCAG CTCCAGAACC 15480 

AAACTTAACA GTATTATGGT CAGTACGTTT ACCTGACAAC TTCAAAACAT ACTGTGCAAA 15540 

AATGAGTATT AAAACAAGTT CTATCCAATA TGAAAATGAT GACATTATGC GTGAAAGCTA 15600 

TGGCGATGAC TATGGTATCG CATGTTGTGT ATCAGCGATG ACAATTGGTA AACAAATGCA 15660 

ATTCTTCGGT GCACGTGCGA ACTTAGCTAA AACATTACTT TACGCTATCA ATGGTGGTAA 15720 

AGATGAAAAA TCTGGTGCAC AAGTTGGTCC AAACTTCGAA GGTATTAACA GCGAAGTATT 15780 

AGAATATGAC GAAgTATTCA AGAAATTTGA TCAAATGATG GATTGGCTAG CAGGTGTTTA 15840 

CATTAACTCA TTAAATGTTA TTCACTACAT GCACGATAAA TACAGCTATG AACGTATTGA 15900 

AATGGCATTA CATGATACAG AAATTGTACG TACAATGGCA ACAGGTATCG CTGGTTTATC 15960 

A6TAGCA6CT GACTCATTAT CTGCAATTAA ATATGCACAA GTTAAACCAA TTCGTAACGA 16020 

AGAAGGTCTT GTAGTAGACT TTGAAATCGA AGGCGACTTC CCTAAATACG GTAACAATGA 16080 

CGACCGTGTA GATGATATTG CAGTTGATTT AGTAGAACGC TTCATGACTA AATTACGTAG 16140 

TCATAAAACA TATCGTGATT CAGAACATAC AATGAGTGTA TTAACAATTA CTTCAAACXST 16200 

TGTATACGGT AAGAAAACTG GTAACACACC AGACGGACGT AAAGCTGGOG AACCATTTGC 16260 

TCCAGGTGCA AACCCAATGC ATGGCCX3TGA CCAAAAAGGT GCATTATCTT CATTAAGTTC 16320 

TGTAGCTAAG ATCCCTTACG ATTGCTGTAA AGATGGTATT TCAAATACAT TCAGTATCGT 16380 

ACCAAAATCA TTAGGTAAAG AACCAGAAGA TCAAAACCGT AACTTAACTA GTATGTTAGA 16440 

TGGTTACGCA ATGCAATGTG GTCACCACTT AAATATTAAC GTATTTAACC GTGAAACATT 16500 

AATAGATGCA ATGGAACATC CAGAAGAATA TCCACAGTTA ACAATCCGTG TATCTGGTTA 16560 

CGCTSTTAAC TTCATTAAAT TAACACGTGA ACAACAATTA GATGTAATTT CTCGTACATT 16620 

CCATGAAAGT ATGTAACAAA ATTTAAGGTG GGAGCACTAT GCTTAAGGGA CACTTACATT 16680 

CTGTCGAAAG TTTAGGTACT GTCGATGGAC CGGGATTAAG ATATATATTA TTTACACAAG 16740 

GATGCTTACT TAGATGCTTG TATTGCCACA ATCCAGATAC TTGGAAAATT AGTGAGCCAT 16800 

CAAGAGAAGT CACAGTTGAT GAAATGGTGA ATGAAATATT ACCATACAAA CCATACTTTG 16860 

ATGCATCGGG TGGCGGTGTA ACAGTCAGTG GTGGCGAACC ATTGTTACAA ATGCCATTCT 16920 

TAGAAAAATT ATTTGCAGAA TTAAAAGAAA ATGGTGTGCA CACTTGCTTA GACACATGGG 16980 

CTGGATGTGC TAATGATACA AAAGCATTTC AAAGGCATTT TGAAGAATTA CAAAAACATA 17040 

CAGACTTGAT ATTATTAGAT ATAAAACATA TTGATAATGA CAAACATATT AGATTGACAG 17100 
TATGGATTCG ACATGTCCTT GTGCCTGGTT ATTCTGATGA TAAAGACGAT TTAATTAAAC 17220

TAGGGGAATT TATTAATTCT CTTGATAACG TCGAAAAGTT TGAAATTCTG CCATATCATC 17280 

AGTTAGGTGT TCATAAGTGG AAAACATTGG GCATTGCATA TGAATTAGAA GATGTCGAAG 17340

CGCCCGATGA TGAAGCTGTT AAAGCAGCCT ACCGTTATGT TAACTTCAAA GGGAAAATTC 17400 

CCGTTGAATT ATAAATACAA TTCAGACCGA AAAGAAAGCA TATGCAACTT CAAGAGTGAA 17460 

GGGGCATATG CTTCTTTTTC AATTGAGTAT TGAGTATTAG CAAGACGTAG TAAGTATATG 17520 

AGACAACTTC TACAATGGTT GAAGGAAGAC GTTTTTGTAA GTAGCTATGC TGATAAAGAA 17580 

TGTGATGTCT TGTTAAAGGT GGGGTTCCAA TATCATCATT TAGCTGATGT TGAATGGGTT 17640 

ATTATTTGCT ACTTGCATAT GAATATGAGT CTTTTCAAAT TTTTATTGAC CCTGAGTAAT 17700 

GAAAAATATT AAGATGAAAC TTAATATTAA AgCAATGCGG AGCGTGATTA TGAAGAGAAT 17760 

TAGTAAAGAT ATATGGGCAG TATTTAAATT ACTGTATCaA AATAAAGGGC GTTTTAGCAT 17820

TAATGCCTTA CTATTGCAGT TAATCATGAT TTTTATTAGT AGTACATACT TAATTTTACT 17880 

ATTTAATATG ATGTTAAAAG TAGCTGGcAA AGCCAACTTA CGATTAACAA TTGGACGGAA 17940 

ATCGTTAGTC ATCCCGCCAG TGTGATACTT CTTATTATAT TCATATTAAG TGTTGCCTTT 18000 

CTGATTTATG TAGAGTTTTC ATTGTTAGTT TATATGGTTT ATGCCGGCTT TGATCGACAG 18060 

ATTATTACAT TTAAATCCAT TTTTAAAAAT GCCTTTGTAA ATGTGCGTAA ACTCATAGGT 18120 

GTACCAGTTA TTTTCTTTGT CATTTATTTA ATGTTAATGA TACCCATTGC CAACCTAGGA 18180 

CTAAGTTCAG TATTAACAAA AAATATTTAC ATACCTAAAT TTTTAACGGA AGAACTTATG 18240 

AAAACGACGA AAGGTATAAT CATTTACGGT ACCTTTATGA TTGCTGTATT TATATTAAAT 18300 

TTTAAATTAA TATTTACTCT ACCGTTAACG ATTTTAAACC GCCAGTCGTT ATTTAAAAAT 18360

ATGMACTAA GTTGGCAAAT TACGAAGCGA AATAAGTTTC GGCTTGTTAT AGAAATAGTT 18420 

ATATTAGAAC TCATCATTGG TGCGATTTTA ACATTAATTA TTTCAGGAGC AACATATCTT 18480 

GCTATTTGTG TAGATGAAGA AGGAGATAAG TTTTTAGTCT CATCAATTTT ATTTGTTGTA 18540 

TTGAAAAGCG CATTGTTCTT CTATTATkTA TTtACGAAAT TATCATTAAT CAGTGTGTTA 18600 

GTACTGCACT TAA 18613

(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1214 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113:

AAAGTTTTAA AAGGGGTGAG ATACTTGGCG AATAATCCAT TCCAGCTTTG CGTTTAAAAG 60 

GAATTATACT TGCCATTGTC GGTGCTTGTT TATGGGGATT AGGTGGTACT GTTTCTGATT 120

TCTTGTTCAA ATATAAGAAT ATTAATGTCG ATTGGTACGT CACTGCTCGA CTTGTAGTCA 180

GTGGTGTTTT CTTACTTATT ATGTACAAAA TGATGCAACC CAAACGTTCA ATATTTAGCG 240 

TATTCCAAGA TCGACGTATG TTAGGCAAAT TACTTATCTT CAGTATACTG GGCATGTTAG 300 

TAGTACAATA TGCTTATATG GCATCTATTA ATACAGGTAA TGCTGCGATT GCAACATTAC 360 

TACAATACAT TGCGCCAGTT TATATTATTA TTTGGTTTGT CATAAGAGGC GTTGCAAAAC 420 

TAACATTATT TGATGTGCTT GCTATTATCA TGACACTATT AGGAACATTT TTATTATTAA 480 

CAAATGGTTC ATTTTCTAAT TTAGTCGTCA ATCCT6CAAG TTTATTCTGG GGTATTTTAG 540 

CTGGTGTAGC ACTCGCTTTT TACACAATTT ATCCTTCAGA CCTACTTAAC CGCTTCGGTT 600 

CGATTCTAAT TGTCGGGTGG GCAATGCTTA TTTCTGGTGT TGCGATGAAT TTACGCCATC 660 

CAATTTGGCA CATTGATATC ACTAAATGGG ACATATCAAT TATATTATTT TTAATCTTTG 720 

GTATTATCGG TGGTACC6CA CTOGCATTTT ATTTCTTTAT OGACAGTTTA CAATACATAT 780

CAGCGAAAGA AACAACATTA TTCGGAACTG TTGAACCTGT CGTAGCCGTT A7CGCAAGCA 840 

GTCTATGGTT ACATGTGGCA TTCAAACCAT TTCAAATCGT AGGCATCATT CTTATTATGA 900 

TTTTAATTTT ATTACTATCA CTTAAAAGAC AACCTGAAAC ATTAGATGAA TAAGAAAACT 960

CTGATAATCA CTTTAGCAAG TAACTATTAT TTAACAACGT AGTTACCTTA TAGGTGATAT 1020 

CAGAGTTTTT TATTTTAGTT AATAATATTT TTCACTTGGT ATAAAAAaGC GTOGTCGCTC 1080 

TGGTAATCGG AAATACTGGA ATAAAATATG GAATTGGGTA ATAATCCCAG GTAxiTAAAAG 1140 

TCCATGTTCC GATAnCCTnT CCGCAnCTCC AACCAAATTT GCCGATAAGG TTCCAAAAGG 1200 

CATCCTGGGG GTAC 1214 

(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 9458 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114:

ATTTrGGTTT CATTCACGAT GGGGTnATAC AGCAAACACA nCTAAAATAA CTATCAATAG 60 
CTTAGACAAT AAAAAATATG CCACTACAAT CGCTAATATT ACGATTAAAA AAGAAGCGTT 180

AACGATTACT TTCATCGTTG TTCTATCTCT GAACATCATA TTAAAGACAA CTAGACTAAT 240 

TGATAATGAA ACAGCAAAAA AAGTAATAGC TAACACTAAT TTCATCATAA ATAGACAGAC 300 

TAAACCTATG ACTAATAATG TATTAGAAAT TACAGCTGAC GTTTTTAACA TTCTCGaATT 360 

AATATGCACT CACCCTTTTT ATTTAAATAA CTTACATAAT CATAATAATA CATGATGTTT 420 

CATAGGCCTG TCGATGATTG ATTCACAATA GCACGTGATT TTTTTGTTTT TCAATATTAT 480 

TCATTTATTC CATCAAAAAC ACCCTTTTTA ATTTTTACAA AAATTAAAAA AAGTGCTCCT 540 

ACACTGCTTG CATGTAGAAA CACTTTTTCA TTGTAATGTT ATTCTTCTCG AGACATACCT 600 

TTTAGCATAT TAAGCATGTA TGTTAAACTA CGGTTCATGT CGTCATCTTT CAATACGCCC 660

AATAGACTTC TTATAGTTGT CTTAGCATTT GGACTCXICTT GATTGGCAAC GTGTAATCCT 720

TTATTAACTT TATTTAGGAA GTCGCTTAAA TCTGATACAT TGAGTTCACC TAATAAAAAT 780 

ACCATTGAA6 CCATATTAGA TAATAGCCCT GTATAAATAT CTTTATTAAG TTCAACTGCA 840 

AATTTATTTA TGATGACTTG ACGTCCTCGA ATTGCACCAT TTAAAGCATC TAATAGTTTT 900 

GCATCATCTA ATGTTTTAAT AAGCTTGATT GCTTTTAATA TACTATCTTT ATTCX3CTGCA 960

ATTGCCTCTG TAACTTCATT TAAACTTTCT AACTTAATTT GTTCTTCTGA TTTTTCTAAG 1020 

CGTCTAATTT TAGAAGATAT TCTCTCAGCC ATTATTTATC CACCTGATTT CCCGGGAAAA 1080 

CATAATCTGA ACGTTCCCAT TTTTTCTGTA CTTGAACACT GTACTGCGGT TGACGTTTTT 1140 

TATTGACACG GAAATTATTA GGGTTCAACG GTGACTTACC ACX3TTTCGTA ATTACCTCCA 1200 

AACGACAGCT AGTACGTTTA TAAGATGGTG TATCCGTGTA TTGATCAACA TCACTaTTAG 1260 

TTAATAAGTT AATTGCACCT AGATCTCCAT TTTCCATCGC aTCaTTATTT AATGGAATAT 1320 

AGATTTCTTT ACCTTTAACA CGATCTGTCA CGTGAACTTG TAATACCGCT TCTCCTGTyT 1380 

CAGAAATCAG CTTAACTTCT GCACCTTCAT GAATGCCTCT ATCTTCAGCA AGCTCTGGAG 1440

AAATTTCAAC AAATGCACGT GGCACTTTGT ATTTAATCAT TGGTGTTTGA TAAGTCATAT 1500 

TACCTTCATG GAAGTGCTCT AACAATCGAC CATTGTTTAC ATGAATATCA TAAATTTCAT 1560 

CTTGCTTAAA GTAATTATCA AATGATAATG GGAATAATTT TGCTTTACCA TTATCAAAAT 1620

TGAATCCTTC TAAGTATAGA ATAGGCTCAT CAGTACCATC AGGTTGTACT GGCCATTGTA 1680 

AACTATTGAA TCCTTCTAAA CGATCATAAC TTACCCCAGC ATATAGAGGT GTTAAGCGTG 1740 

CTACTTCATC CATAATTTCA CTAGGATGCT TGTAATTCCA ATCAAATCCT AATCTATTAG 1800

CAATTGCTTG GAAAATTTTC CAGTCAGGTT TTkAATCACC AAGAGGTTCT AATGCTTGGT 1860 
TTGCTGGCAA TACAACATCT GCGTATGTTG CTGTGAATGT TAAAAATTCA TCTTGGACTA 1980

CCATGAAATC TAATTTTTCA AACGCAGCTT GTACAAAATT AATATTTGAA TCCACAATAC 2040 

CCGTATCTTC ACCATATAAG TACAATGAGT GTACTTCTCC GTCATGTATA CCTTCTACCA 2100 

TTTCATGATT ATCTTTACCA GCTTTTGGAT TCAATTTAAC GCCATATTCT TTTTCAAATT 2160 

TAGCGCGAAT ATCATCCGCT TCAATACTTT GATAACCAGT AATCTTATCA GGCATACTTC 2220 

CCATATCACT ACATCCTTGA ACATTATTAT GTCCACGTAA TGGATACGCA CCAGTACCAG 2280 

GACGACGATA ATTACCTGTT ACTAATAATA AGTTTGAAAT CX5CTGTACTT GAGTCACTAC 2340 

CAATGTCTTG TTGTGTAATA CCCATTGCCC AACAAATTAC AACAGATTCA GCTTTAGCAC 2400 

ATTCTTCAGC AAATTTAATC AATTCTGATT CAGGAATACC TGTTGCTTCT TCAGCAAAAG 2460 

CCATTGTAAA TGTTTCTAAT GATTTGTAAT ATTCATCAAA ATCATCTACC CACTCATCAA 2520 

TAAATGCTTT ATCGTGTAAA TCATGATCAA TAATATACTT AGTCACTGCA CTTAACCACG 2580

CTAAATCCGT ACCTGGTTTA GGTTGATAAA AACGATCOGC ACGTTCTGCC ATTTCATGTT 2640 

TTCTAATATC AAATACATGT ATTTTTTGAC CAAATAA1TT TTGTGCACXST TTCATGCGTG 2700 

ATGCGATAAC TGGATGAGCT TCGGCTGTAT TAGTACCTAT CAATACAGAC ATTGCCGCTT 2760 

TTTCTAAATC TTCAATACTA CCTGAGTCAC CGCCGTGTCC AACCXtTTCTA AATAAGCCTT 2820 

TTGTTGCAGG TGCTTGGCAA TATCTTGAAC AGTTATCAAC GTTATTTGTG CCAATAACTT 2880 

GTCTTGCTAA TTTTTGCATT AAATACGATT CTTCATTCGT CGCTTTAGAA GAAGAAATGA 2940 

ATGATAGTGC ATCTGGGCCA TGCTTTTCTT TAATAGCTGT AAAATTATCT GCAATGACGT 3000 

TTAAAGCTTC ATCCCATTCT ACTTCATGGA ACTCACCATT TTTCCTTACT AGTGGTTTAG 3060 

TTAATCGTTG ATCTGAATTA ATATGTCCCC ATGAAAACTT ACCTTTAACA CAAGTCGCAA 3120 

TTTTATTTGC TGGAGAATCA TGTGATGGTT GTACTTTTAA AATTTCTCTA TCTTTAGTCC 3180 

AAACTTCAAA TGAACAACCC ACACCACAAT AAGTACACAC TGTTTTAGTT TTCTTAATAC 3240

GCTCTTTACG CATTTCTGCT TCTGAATCTG AGATTGCAAA TAGTGGACCA TAACCAGGTT 3300 

CTGCTTTTTT AGTTAAATCA ATCATTGCTG CTAATGAACC AGGTTCCGTA TCAGTCATAT 3360 

AACCCGCATT ACCTTCCATA TTCACTTCCA TCATGGCATT ACATGGACAT ACCGTCGCAC 3420

ATTGACCACA AGATACACAT GAAGACTCAT TAATCGGTAC ATCATTATCC CAAATAACAC 3480 

GTGGATGTTC ACGATCCCAA TCAATTCTAA TAGTTTCATT CACTTCGATA TCTTGACATG 3540 

CTTCTACACA ACGCCCACAT AAGATACATT GATTTGGATC ATAACGATAA AATGGGCCGT 3600 

AATCTTTTTC GTATGGCTTC TCTTTATATT CATACGTTTG ATGCT6AAGC CCCCATGCAT 3660 
GGTAGGCTCA TTTTACTTTT AGACGAACGT TTCAATCCCA CCACTCCTTT ACTATTCCTT 5580

ACATACTTTG TCTGTTTTCT CTATTTATTA TATAGTAAAA TAATTTTTTT ACTATACTTC 5640 

TGTAGACGTA TAACTATTTT TTATCATTTT TTATCTCTAG AGAATATCTA TCTGTATTTT 5700

TGATAACCAC CATTTGCATT TAAAATTTTA AGTACCGTTT CATGACATGC TTTATTACTT 5760 

ATAATAAAAG GTGCACCCTT TAAATGATCA ATTGCCTTAC CATCTAAAGT CGTCATTTTT 5820 

AGATTCAATA GTTCTGCAAA TAAAAACTGT GCAGCAATGT CCCAAGGTTT AGGATTTGTA 5880 

TTAATATGTG CCCCAAATTG ACCTTTTGCC ACTCGCATAG AATCTAATCC GCAAGCACCA 5940 

ACTAAACGAT AACTAAATGA GGCGTCAAAT AAATCTTGCA CCGTATCTAG ATTCATCACT 6000 

TGTGCATTAA ACGATATAAT AGCGTCTTCC AATTTTAACG ATGGTGGTTC TTCCATCTTA 6060 

ATTCCATTAC AAAAA6CACC TTCTCCTCGT ATTGCTTTAT AAAGCTTTTT ATGCGGATAA 6120 

TCATATACGT ACGATAACAT TGGTTTACCT TCATAAAAAT ACGCCAATAT AATACAATAA 6180

TCTTCTTGCT GTTTTACTAA ATTGGCAGTT CCATCAATGG GATCCATAAT CCATAAATGA 6240 

TTAATTTCAT TCGTAATCAT TTCATTACTT TTTTCTTCCG CTAATAGTTG GTGTTCCGGA 6300 

AAATGTGTTG CTAAAAATTG TTGGAATTGT TGTTGAATCT GTTTATCTAC ATTTGTAACT 6360

AAATCAAATC GATGACGCTT AGTTTCTGTA GTCATTTCCA TAATTAATTG CGGAATAACA 6420 

TTGTCTATTT GTTTCAACCA CGAACATATT AACTTATCTA TTTGCTGTAA TGTTTTATCT 6480 

GTCATTTCGT CCACCACTTC TCATATCATT ATCATTTTAT TATTACCCTA TATTAAAAGA 6540 

ATCAACAATA CAACTGAAGA CTTCTTCATT TTATGCATAA AAAAATCGGC TAGTCACGTG 6600 

CTAGCCGACA AATAGAAAGG AAAGTAAGTA ATAAATATTG AAGATGTTGT GATGTAACTT 6660 

GAACGATTAA AAGCTATCTG TTATATAGCT CTACCCCTTT GTTTAATCX5C TCCCCCTGTT 6720 

ACAAGTAATA TCATAGCACA ATCTTTTTTA AAATGTAAGC GTTTTCCACA AAATTTTTAC 6780 

GATTTTTTTA AAAAGATATT GAAAATGTCC TCATTGTCAC TCTTATGTTA TACTTTGTGT 6840 

AATATATCAT CTTTTAGGAG GTGGCTGTCA TGAATAAAGC TGAAAGGCAA AATTTAATAA 6900 

TTACTGCAAT TCAACAAAAT AAAAAAATGA CCGCTTTAGA ATTAGCTAAA TATTGCAACG 6960 

TATCCAAACG CACAATTTTA AGAGATATTG ATGATTTAGA AAATCAAGGT GTTAAAATTT 7020

ATGCGCATTA TGGGAAAAAT GGTGGTTACC AAATACAACA AGCACAATCT AAAATTGCAT 7080 

TAAACTTATC TGAAACACAA TTATCAGCCT TATTTTTAGT GCTTAATGAA AGTCAGTCGT 7140 

ACTCGACATT ACCATATAAA AGCGAAATCA ACGCAATTAT AAAACAATGT TTAAGTCTTC 7200

CACAAACACG CTTAAGAAAA TTGCTTAAAC GCATGGACTT TTATATTAAA TTTGATGACA 7260 
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IS 



ATGTGATGTT AGTAGATCAT AGGGTTGATG ATAATATTAA AGCTGAAAAC GTTATATTTA 7380 

TTGGCCTTTT GTGTAAACAT GGACATTGGC ATGCAGTCAT TTATGACATT GCTCAAGACA 7440 

AAACTGCCGA ACTCGAAATT GAAAATATTA TAGATATTTC GTATTCATTC GGTAAGACGA 7500 

TTCAAACCAG AGACATATCC ATTGATAACT ATCATCAATT TTTAAACCCC ATCGATTCCT 7560 

AAAAAACAGC AGTAAGATGA TTTTCAATTA GAAAATATCT TGCTGCTGTT CTCTATTTAT 7620 

ACAATACTTC GTATTGAATG GnTTCGCTTT CCTAGGGTGC CGTCTCAGCC TTGGTCTTCG 7680 

ACTGGCACTG CTCCCTCAGG AGTCTCGCCA TTAATACTAC GTATTAACAT GTAATTTTAC 7740 

TTTGAAATAC TTAAAAAAAT AAAACACTTT GCCCAACTTA CACTACCAAT AGAAACTGCT 7800 

GTTAGAATTC CTCAAAATGA TATTTOGCGA TATGTTAATG AAATTGTTAA AAAGATAGCTT 7860 

GATAGCGAAT TCGATGAATT CAGACATCAT CGTGGCGCAA CATCCTATCA TCTAAAAATG 7920 

ATGTTAAAAA TCACCTCATA TTCATATACT CAATCTGAAT TTTCTGGCOG TAGAATA6AA 7980

AAATTACTTC ATAACAGTAT TCGAATGATG TGGTTAGCTC AAGATCAAAC ACCTTCTTAT 8040 

AAAACTATTA ATCTTTTTAG AGTGAATCCT AATACTGATG CGCTAATTGA ATCTTTATTT 8100 

ATTCAGTTTC ATAATAAAAT GCATATCAAA AAAGCTGATT TCTATCAAAT AATTAATAGA 8160

AATCAGCTTT TTTCaTTGCC TAAAAACTTA ATGTCCCX3AC CTCTTTATCT ACGCATAAAT 8220 

ACTTATTACT GATATAACGA AAGAAACAAA ATTATTTGCT ATATOTAATG CAATTGTTGA 8280 

ACCTAGGTTT CTTCCAGATT TTAAATAAGT GAAAACTAAT ATGATGGATA GTATGAGATA 8340 

TGGACCAAAC TCAAACGGCG ACTTTGCATC AGTCACATGA ATAAATGCAA ATAAGAACAC 8400 

CGAAACAATA CTCATAGCTA TAAAATTAAA CTTCTTACCT AATTCTCCAA TTAAAATATG 8460 

TCTAAATACG ATTTCTTCAA CTATTGGACC TACAATCACA ATTAATAAGA ATGCTACAGG 8520 

TAAAAATGCA GGCACTTCAA ACATTTTATT TAGCTCAAGT TCATTGGCTG TTtCACTATA 8580 

TTGCAAATGT TTAGGTAGAA ACTGTGTCAT ATATTCATAT GTATAAATTA AGATGAGAGC 8640

AATAATATAC GTTATTGACA ATCTAAGCCA ATATTTTTTG ATATACGCAA AACCAGCTCG 8700 

AAGCCTTGAT GGCATCACTT TTAAATGAAA TAAATAAAAT GCGCCAATCC CAATCX5TATA 8760 

TGCTAAAGCT TGTGTGATAG TCGCTACAAA TATCAGATTA CTATOGATTT CATAATAACC 8820

AAACAAAATT GGTCCTATGT AAGGTGCAAT TGTGAGTGCA TAAAATATAA CACCTATAAT 8880 

TGGAATTATA AGCAAATCTC TCCATGCTAT ATCTTTAAAC GTGTATTTCT TTTTTTCATT 8940 

TTCCaCTGTT ATATCCtTTC CTGTTTAATA ATTGATTTTT GGAGGTACTT CTACATGATA 9000

AACGAAACTA AGTATATGAG ACAACAAATT ACTAATTTGA TTCAAATCAT TGATACGATT 9060 



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ATAGTTACTA ATGAATTGAA TAAGTTCAAA GGCTTTGAAA CATCATATAT AATAAACGAA 9180 

AATCAAGTTT CCTATTATGA AATTATAACA CTACTTAATA AACX5TCCCCT CgACAAGTCG 9240 

ACTATGGTAA CAAAATTCAA TATCTTAATT TTTATCATAC AGAACTATCT AACGCATTAT 9300

TTGCAATTAA ATTTGCCCAT TAACCTATTT TTCATAAAAT GTCATTTAAA CAAGTTATTT 9360 

ATTAAAATTC ACTTTATTAC ATAAATTATA CAATTArAAA GTTTCTTCAA ATTGTAAAGA 9420 

TGCATTAATC GAGTTATAAT CATAATGATT AAGATGGT 9458 
(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 910 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 






AnGCGTATCA TGTCACGCAT TTTAACTACT TCTTTACCAC AAGATTATAC AGTCACATTA 60

GTTGATCGTA TGCCATTTCA TGGATTGAAA CCAGAATTTT ATGCTTTAGC TGCGGGCACG 120

AAATCAGATA AAGATGTTCG TATGAAATTC CCTAATCATC CACAAGTGAA TACAGTTTAT 180

GGTGAAATTA ACGACATAGA TTTAGATGCT CAAATTGTCT CAGTCGGTAA TTCTAAAATT 240

GATTATGATG AGCTAATCAT TGGTTTAGGA TGTGAAGATA AATATCATAA CGTTCCAGGA 300

GCCGAAGAAT ATACACATAG TATTCAAACA CTCTCAAAGG CTCGGGATAC TTTCCATAGT 360

ATTAGTGAAC TACCAGAAGG TGCTAAAGTC GGTATCGTT6 GTGCTGGATT AAGCGGCATA 420

GAACTTGCCA GCGAATTAAG AGAAAGTAGA TCAGACTTGG AAATATATCT TTATGACCGT 480

GGGCeGCGAA TTTTAAGAAA TTTTCCAGAA AAATTAAGTA AGTATGTTGC GAAATGGTTC 540

GCCAAAAATA ATGTTACCGT TGTTCCAAAT TCAAATATTA ATAAAGTTGA ACCTGGTAAA 600

ATATATAACT GTGATGAACC TAAAGATATT GATTTAGTTG TATGGACAGC AGGAATTCAA 660

CCTGTTGAAG TTGTTCGTAA CTTGCCGATT GATATAAATA GTAATGGACG CGTGATAGTT 720

AACCAGTATC ATCAAGTACC AACATATCGT AACGTCTATG TAGTTGGTGA TTGTGCTGAT 780

TTACCACATG CGCCAAGTGC TCAGTTAGCC GAAGTTCAAG GTGATCAAAT TGCCGATGTG 840

CTTAAAAAGC AATGGCTAAA TGAACCATTA CCTQACAAAA TGCCGGAACT AAAGGTACAA 900

GGTATCGTTG 910
(2) INFORMATION FOR SEQ ID NO: 116: 
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(A) LENGTH: 10182 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 

TTTTTGATTC AAAGTGGTGA TTTAACAAGC ATTTTAAATA GCAATGATTT GAAAGTCACA 60

CATGATCCTA CCACTGATTA TTATAATTTA TCTGGTAAGT TGTCGAACGA TAATCCAAAC 120 

GTTAAACAAT TAAAACGTAG ATATAATATT CCTAAAAACG CATCAACAAA GGTGGAATTA 180 

AAGGGAATGA GTGATTTAAA AGGCAATAAT CATCAAGATC AGAAACTTTA TTTTTATTTT 240 

TCAAGTCCTG GAAAAGACCA AATCATTTAT AAAGAAAGCC TTACTTATAA TAAAATAAGT 300 

GAACATTAAT ACTTATGCTG TAATTATAGA AACATCCAAA TCATCTATTA nAATCCTATA 360

TTATAAAAnC ACCTCACATA ACTCGTTCAA CTGTACCAAA CCACATTACA TTAGATTTTA 420 

GGCTAACTAT TGTGATGTAC ATCAAAAACG AATTTGTGAG GCGTTGTATA TTTTACAAAG 480 

GTGACTAGCG TTTCGTATAG CATTTCCAAC ATTACTACAC TCAAGCGTCA CGCTAAAGTT 540 

CGAAATCGAA TCCTTTCATT CAACAAAAGC TCATATCCAC TACAAACTTC ATATCAAGCG 600 

TATAAACTAT CTTGTGATAC TATCTCGATC ATATCTATAG TATGCATTTG TGTTCCGTTT 660 

CACTGAAGTA TATGTATCAT CAGTTAAGTA TAAACCGTCA TCCTTCAATG TTACTTGATA 720

AGCATATTTC CGTGCTAACC AGGCAATATC TATATAATTT TCTCCTGCGT TTTCATAACT 780 

TCTTAAATCT TCAATATGTG CACTAACTTC AGGGaAAATG ATTCTAACAA CACTTTCATC 840 

AACCCAATAT TTGTCATOCA TCCATCGCAC TTGATCTGCC AATAAAGGTA ACTGCACATC 900 

ATTGAAATAT AGACGAAAGC CGTCACTATC ATACATTTGC CGATATGGTA ATGGCTGTTT 960 

TCTAATCACT AACACCTCGC CACCCATTAC GGTGCCTTCT CTAGTATCAT CACTTCCACC 1020 

CGAAGCTTCA TACGTTGTTG GGTCAACCTG TAGTCCATGT ACATCTCCAA TATAAGCATC 1080 

TCX3TTTATGT TCCATTGCAT GTCCATGTGC AATCAATGCT AATATTGTAG ATTGTGAAAA 1140 

1TGAGGCTCC CATTCAATGC GATTAGGATG GCTACTATAA ATTCTAGGTT CATCTATAGC 1200

CTGCTGAATA TCCATGCCAA ACACTAATAC ATTGATTAAT GTTTGCGCAA CACTAGCAAT 1260 

GATACTTATG GCACCAGGTG CACCTACTGT TAATATTGGC TTCCCGTGAT ACATCACAAT 1320 

CGTTGGAGCC ATGTTACTTA GTGGTCGTTT ATATGGTGCA ATTTCGTTAA TACCACCATC 1380

TACTACATCA AAGCCATCCA TTGTCGTATT CAATAACACA CCGTAGCCTG GAATCGTGAT 1440 

ACCTGAACCA TAAATCATAC CAATTGATGT CGTAAATGAA GCAATATTAC CTTCCTTATC 1500 
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ATCAGACACA ACACCATGCT CTATATCAAT ATTTGCTTTA TTGCTATCAA TGAGCGTACT 1620

GCGTGCTTTT AAATAATCAT CATCAATTAA TGACTGTACA GGCACCTCAT GAAAATTATC 1680

ATCCGCCAAG TATTGCGCAC GATCACTATA TGCTAAATGC ATCGCTTGTA TCAAATGATG 1740

CAAGTAATCA ACAGATCTTG GACCCATAGA TGGTAAATCG ACATGTTCTA ATAACTTCAA 1800

TATTTGAATT ACCGTGATAC CGCCAGAACT AGATGGTCCC ATTGaATAAA TGTCATAGTC 1860

TTTAAATGTT GCACTGATTG GCGCTTTAAT CTGAATGTCA TATTTGGCTA GATCCTCTAA 1920

AGTGATTGTC CCACCACATG CTTTGACAAC ATTGACTAAT TGTTTCGCAA TGTCACCTTT 1980

ATAAAATGCA TTAAACCCTT GTTCTCTTAA TATTTGAAAT GTCTTACCTA ATTCGGGTTG 2040

TACAATCCAA TCACCTTCAC GCCAATATTG ATTTTCATGC GTAAATACTT GTGCCGTTTC 2100

ATGATACTTT GTCAATC6TG CGTGTTGCTG GCGCGAATAT TTTTCAGTAG CCCAATTGGC 2160

TGCATGACCT TCAATGGCTA GTTCAATTGC AGQATTAATT AAATCTTCCA ATGACAATTT 2220

AGCATAACGC TTGTGAATAT AATCAAACAG CTTTGGAATT GCTGGCACAG CGACAGTTTT 2280

ACCATGTGTA GTCATATCAA AAAATGATTT ATATTCGCCT GAATCATCTA GATAAAATTG 2340

TTTGTCTACA TGTTCAGGTG CTGTCTCACG lOCATCAAAC GCAGTTATAC TGCCAGTACT 2400

TTGCTCATAA TATAGCAAAT ACCCGCCACC ACCAATACCT GATGCAAATG GTTCTACCAC 2460

ATTCAATGCC AGTTGAATTG CAATCACTGC ATCCATGGCG TTGCCACCTT GATCTAATAC 2520

ATCCTTACCA ATTTTAGCCG CAAGAGGATG TGATACGGAA ATTAACCCTT CTTTAGATGT 2580

TTTTGTCTGT TTGTCATTTA AGTTAATGAC CATACTATAT CCTCCTACTT TCTGTTAAAT 2640

ATTTAAAACA TTATTGATTA ATGGCTTTTT CTACTTTTTC TAAATCTTGA CGTTGCTCX3T 2700

TACCAGTATC GACAAGTGGT GTAATCXK3TG ATGCAATTTT AAATTTATCG CCACGATAAA 2760

ACTTAATAAA TTGATCCTGA TCTATCGCAT TAACTACTGC TTGTCTCAAG TTTGGATGCG 2820

TCTTAAATAT ACCTTTTTTA ATATTTAGCA TTAAAAAGAC TGACTTGCGT CCATTTTTGC 2880

GAATAATGCT TAAATTTTTA TCC6ACTTAA TTAAATCAAA ATGTTTTTGA TTCACATCTG 2940

CCAACATATC AATTGAATGA TTTCTAAGTT CTGACAATGC ATTATTCGGG TCACCATTAA 3000

ACTTCAATGT AATATTTTTA ATTTTAGCTG GTCCATAACT ACCTTTTTCT GTTTCGTTGA 3060

ATCCTGGATT ACGTTGAAAC GTTGCTTGAT ATGCATTTTT CTGTGTCATA ATGTATGCGC 3120

CACTTGCATA CAGCGCATTT TTCCCATCTG AATTTGCAGG AATTGTACTG CTATCCCCAT 3180

ATCCTTTTGG ATATTCTTGA TTTACTTGAT TAACAAATTT TTTAGATAAA ATGCCTGCCG 3240

AAGAGTGTGT TAAGTAATTT ACCTCTCGAG GCATCGATTG ATCTGTCGTA ATTTTAACAA 3300






GTATCAAATG TAAATTGTGA CACAATTGAT 
CCGATAATCG CAATACGATG TCTAGTAGTT 
5 CTATGTATTT GCGATGTACG TTGGTAACGT 

ACGTTGCCTG TTAATGTCAT CAACAATAAC 
TGATGCTTCC ATGTTTGTTC CGTTGTTAAA 

10 

AGCAATGTTT CAGCAATATA GAACGTATCG 
CTCGTTAATA TAACTAAAAT ATAAGTTGCT 
GCTATCTTTG AATTAAGTTG TGCCATAT6C 

IS 

ATTTTG6ATC GATTAAAGCA TAAAATATAT 
TTGAIATATA TACGACCCCA CCCATGACTG 
20 CGATATAACG CCCGATACCA TTAATGTTAA 

GTAACTCTGC CACTAGAAGA CCAACTAACG 
TATGTTTAAT AACAACtTGT GTCGTCXSATA 
CGctGCGCAT TACTTCAAGT ACAGAAGACC 
TTGTTCCAAT GACAAGTACA GGTAAAATCA 
GTGGCAAAAT ATCCAATTTC AATGAGAACX3 

30 

TTGGAATAGA TAAACCAATT AATGCAATTA 
GCTTCATCGC ACTGATAATA CCAATTGGTA 
ATACTGCGAC AATTATTGTA ATTGGAATTC 

35 

TCCCTTTGTA AGTCGTACCT AAGTCAAAGG 
GAAXSAAATA AGGTTCGTTA AGATGATGTA 

40 TTGCA.TTTTG TCCCAGTATG CTATAAGCCG 
ACACACTGAC AATAACACCA ATGATGACAA 
ATCTAACTAA TGGCTGTAAA TAAAAAGTCA 

^ CTTTGATCAT GATGAACTTA TGAAATAATA 

GTTCAATATT CTTTGAACTC GTATTAGAAC 

TATGTATTTG TGTGGATGAC TTTTGCTGCG 

SO 

GTTCAATTTC TGAAATTTCA GAATTATTAG 

AAAACTTTAT CACTGCATAT AAAAATATTG 

55 
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AATGTCAGCA TGTAGACTAA AATAAGTAAC 5220 

TTTCGTATAA ACGATTCCCA CCCGTTATAA 5280 

CTAATACTTA CAAACATTAA TAATGTAAAT 5340 

AACACTTCGA CGATACGTCG CCATAGGTCA 5400 

ATAATAATTA AAATGATGGT TAAAACGATT 5460 

GCCACATAAC CTTTAAAAAG ATTTAATGCA 5520 

ATGGCGTAAC TTGCGAATAA TTTTAAGGAA 5580 

CTCACTTCCT TTC6TTGATT TCACTACGTA 5640 

CAATAATTAA GTITGCTAAA GATATTACAA 5700 

CTGGAATATC AGGTATTAGT TGTTTTTGGA 5760 

ATACTTGTTC CX5TCACTGCT GAACCGCCTA 5820 

TTACAATTGG AATAATGGCA TTTTTCAAAA 5880 

ATCCTTTTGC ATAAGCAGTT AAAACATAAT 5940 

TTGTCATACG CGTGATAGAA GCAGCAATAC 6000 

ACGATATIGG ATGTTCTGGC ATATAAGATG 6060 

CTAAAATGAA TAATAGCXCT TGCCAGAAAC 6120 

TCATTAACGT GATATCAAGC CAACTATTTC 6180

TTGCAATAAT TAATGCCACC ATTAGCGCTA 6240 

TTTCGCCAAC TGCTTTAGTC ACAACCTCAT 6300 

TAAAAACACC CTTGATGGTA TCCCACAATT 6360 

ATACATTGAA TTGATGTATC TGTGCCTTTG 6420 

CATCAAGCXX; TGAAAAATAC AGAATGGTAA 6480 

TCACAGCCAT GACAATTCGT TCAAAAATAT 6540 

ATAAGATGAA CATCGGCAAG GCCAATATCA 6600 

CATTTTCAAA GTATGTTGAA AAATGTGCTT 6660 

TTTGTGCCTT GAATATTTTT AATGCTTCTT 6720 

ATAAATATTT ATATTTTTGA TGTAACGCCT 6780 

CGTAAAAATT TTTCCTCTTA GCAGAAAAGA 6840 

GCAAGCTTAA TACCGATAAT ACAAACTTGT 6900 
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CTTGTAAAAT AATCTTGAGT AGATTACTAT GATATACAAA AGTATAGAAT AAATTTACAC 7020

ATTTGTGaAT AGGGAGGCAC AACATCATGT CAAATTTATT AGAAGTCAAC AGTCTGAATG 7080

TACAATTCAA TTATGATGAA ACTACAGTTC AAGCGGTAAA AAACGTCTCT TTCGAATTAC 7140

GAAAAAAACA TATCCTAGGT ATTGTTGGTG AATCAGGATC AGGAAAAAGT ATTACCGCTA 7200

AATCTATTTT AGGGCTACTA CCAGATTATC CAGATCACAC ATTAACAGGA GAAATTATTT 7260

TTAATGGGCA ATCGTTAAAT AATTTATCAA CTTCAGCGTT ACAACAAATT CGAGGTAAGG 7320

ATATTTCAAT GATTTTTCAA GATCCACTCT CTTCGTTGAA TCCAAGATTA ACGATTGGCA 7380

AAGAAATTAC AGAAGTAATA TTTCAACATA AACGT6TATC TAAATCTGAA GCAAAGTCGA 7440

TGACAATAGA CATTTTAGAA AAAGTAGGTA TAAAACATGC AACTCGACAA TTTGATGCTT 7500

ATCCACATGA ACTTTCTGGT GGTATGCGTC AACGTGTCAT GATAGCAATG GCATTGATTT 7560

TAAAGCCACA AATTTTAATC GCAGATGAaC CAACAACGGC ATTAGATGCC AGTACACAAA 7620

ATCAATTACT GCAGTTAATG AAGTCCCTTT ATGAGTACAC AGAAACATCT ATTATTTTTA 7680

TCACTCACGA TTTAGGCGCT GTGTATCAAT TTTGCGACGA TGTGATTGTA ATGAAAGATG 7740

GAAGTGTCGT TGAAAGTGGC ACGGTTGAAA GTATTTTTAA ATCGCCACAA CATACCTATA 7800

CAAAACGCTT AATAGATGCG ATTCCTGATA TTCATCAAAC GCGTCCGCCA AGACCX5TTAA 7860

ACAATGATAT TTTATTAAAA TTCGATCGCG TGAGyGgGAT TACACATCAC COAGTGOCAG 7920

CCTATACCGA GCAGTTAATG ATATTAACTT GGCTATTAGA AAAGGCX3AAA CATTAGGCAT 7980

TGTCGGTGAA TCAGGGTCAG GGAAATCGAC ATTAGCTAAG ACXX3TCGTCG GTCTAAAGGA 8040

AGTGTCAGAA GGCTTTATTT GGTATAACGA ATTACCATTA AGTTTATTTA AAGATGATGA 8100

ATTGAAATCT TTACGACAAG AGATACAAAT GATTTTTCAA GATCCATTCG CATCTATTAA 8160

TCCAAGATTT AAAGTCATTG ATGTGATTAA ACGACCACTA ATCATTCATG GGAAAGTCAA 8220

AGATAATGAT GACATTATTA AAACTGTCGT ATCGTTGTTA GAAAAGGTTG GCCTAGATCA 8280

AACTTTCTTA TATCGCTATC CACACGAATT ATCTGGTGGG CAACGTCAGC GTGTAAGTAT 8340

CGCGAGAGCA CTTGCTGTTG AACCTAAAGT GATTGTTTGC GACGAGGCAG TGTCCGCTTT 8400

AGACCTTTCA ATTCAAAAAG ATATCATCGA GTTATTAAAA CAATTACAGT TAGACTTCGG 8460

CATCACTTAT TTATTCATCA CACATGACAT GGGTGTTATC AATGAAATAT GTGATCGCGT 8520

TGCAGTTATG AAAAATGGCG AAATCGTTGA ACTGAATAAC ACAGAAGATA TTATCAAACA 8580

TCCGCAGTCA GACTATGCAA AGCAACTTAT TTCAGAAGTA GCAGTTATTG CTAAATAAAA 8640

GTCATGGGTT GTGCAACTTT ATCACTGTAT GGTCTGAAAT AAATTGCGCG ACTTCTGATG 8700
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IS 



TATCAAGTTT TAGGTGCTTT GCCATGATTT AAGAGTCACC CCCATACTTT GGGCATTTTA 8820 

ACGCCAGAAT AAATCCCCCG CCACTATGTG AAGTGTGGGG GATTATTTAT ATTTTATTAG 8880 

AATATTCAGA TTTTTGAGTG TGTCAACTTA GCTTAGTCAA TGTATATTTA ACGTCACTTA 8940

CTCTTTTTCT TTCATAATTA ACACATTCAA ATAAACTTTG ATCAAAAAAC ACAAAGTTAA 9000 

AAGTACCATC TTGTAATATG CTCTCATACA TTATCCCGTC ATATTTAAGG CTTCGAATAT 9060 

AATCAGCTAA ATATTGAAAT GGCAAATAAT CTATTCCTTG TTCATCGCTT GGATTTGTTA 9120 

TTCCTTTATG AATCTTTTTT AATGTTTGGT AATTTACAAA ATACTTTCTA AATCCATCAT 9180 

CGCCAGCTTT GATTGCATTA CTAGTTAAAT TAGTTAAATT CGCAATTTTC AATTTCTCTT 9240 

TTGTGACGTT TTTTTGTAAC TTAACCTTAC CTATATAAAT AATGTCATTA TGCTTAGGTT 9300 

TAACTTCTTC TATACTGACC TGTTCTTTTG TACTAAGGTA TAATACGCTT ATCCATTTAG 9360 

AATTCAATCT TCCTGCCGTT GCAAATCCCT TTGGTGGTGA CATTAGTTCA CTTTTCTCTG 9420 

TAATGAACTT AACTATTCTA GATCTATATA ATGGTTCAAA TCTTTCTCTA AATTCCTCAA 9480 

TACTATAGTA ATTAGTAGTG ATATCGAGAA AGAACGCTAA ATTCTCTAAA TTGATCATAT 9540 

TTTTATGAAA TCTATTTTTA TACTTCAAGC TCTCACAAAA TCCATCCCAG TCATTATTTG 9600 

CTACAATTAG ATTTTTATTT GTATATTTTT TATCGTTTAT GATTTTAGCG CCTACTAAAT 9660 

CTTCCAACAC TCGTCTATCT AAATTTTCAT CATCTTTAAA AAGTTCATTT AAAATACAAC 9720 

TTATTTGAGC TTCCTCAACA TTAAATATAC TCCAGTCGTC TTTTAATGCT ATTTCAATCT 9780 

TTTTACCTTC TTTTGGGCTA AAAGTATCTG GTAAATTTAT ACTAATATCA TATAATTCTA 9840 

ATGCTGGTCT TAAATAATCT CTAATAAGTT CTAATTTATC TATGTCCTTA GTCGTATCAA 9900

ATATTTTAAC ACCAAGATGA TTGTTATCAA TATCACAATT GTCAAATTTG CTATTTATCA 9960 

TTTGC3U^TGA TTTCTACGAT TTCAGTATTA TTAAAACATT TTTCACATAT TTTCATTTTG 10020

AGACtCCAAG TATCTATTCA TAATTTCTAG GTGATGCATG ATAGATAACC TTrTAATTAA 10080

ACCTAATCCT GGATaCTTAT TATTTTCATT TAATTCTTCA AATTGTCCCA AGOGCATAAG 10140 

ATCTATTTTT AATATCTAAG TTTTTTGACC ATGTTACTAA TT 10182 
(2) INFORMATION FOR SEQ ID NO: 117: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3491 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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AACTCAGGCA ATTGAAACAG CATTAGGTGC TTCATTACAA CATGTCATTG TAGATTCAGA 60

AAAAGATGGA CGCCAGGCTA TTCAATTTTT AAAAGAACX3T AATTTAGGTC GTGCGACGTT 120

TTTACCATTA AATGTTATAC AGAGTAGAGT GGTAGCGACT GATATTAAAT CTATTGCTAA 180

AGAGGCAAAC GGATTTATTA GTATCGCTTC GGAAGCAGTT AAAGTAGCAC CAGAATATCA 240

AAATATTATC GGGAATTTAT TAGGTAATAC GATTATCGTT GATCATTTAA AGCATGCAAA 300

TGAATTGGCA CGTGCGATTA AATATCGAAC TCXSTATTGTT ACTTTGGAAG GTGATATTGT 360

AAATCCTGGT GGtTCTATGA CTGGTGGTGG CGCTCGTAAG TCAAAAAGTA TTCTGTCTCA 420

AAAAGAOGAG TTGACAACAA TGAGACACCA ATTAGAAGAT TACTTGOGTC AAACAGAATC 480

AXTTGAACAA CAATTTAAAG AGTTGAAGAT AAAAAGTGAT CAATTAAGTG AACTGTATTT 540

TGAAAAAAGT CAAAAGCATA ATACACTTAA AGAGCAAGTG CATCATTTTG AAATGGAGCT 600

CXSATAGATTA ACTACACAAG AAACACAAAT AAAAAATGAT CATGaAGAAT TCGAATTTGA 660

AAAAAATGAT GGTTATACGA GTGACAAAAG TCGACAAACT TTGAGT6AAA AAGAAACTTA 720

TCTAGAAAGT ATTAAAGCAT CTTTAAAACG ACTAGAAGAT GAAATTGAAC GCTACACAAA 780

ACTTTCTAAA GAAGGTAAGG AAAGCGTTAC TAAAACACAA CAAACCTTAC ATCAGAAACA 840

ATCTGATCTT GCTGTGGTTA AAGAGCGTAT TAAAACACAA CAACAGACAA TAGATCX3ATT 900

AAATAATCAA AATCAACAAA CTAAACATCA ATTAAAAGAT GTTAAAGAAA AAATTGCATT 960

CTTTAATTCX5 GATGAAGTGA TGGGCGAACA AGCTTTTCAA AATATTAAAG ATCAAATTAA 1020

TGGTCAACAA GAAACGAGAA CACGCTTATC AGATGAATTA GATAAATTGA AACAACAACG 1080

TATTGAGTTG AATGAACAAA TCGATGCGCA AGAAGCTAAA CTACAAGTTT GTCACCAAGA 1140

TATTTTAGCT ATCGAAAATC ACTACCAAGA TATTAAAGCT GAACAATCAA AGCTAGATGT 1200

ATTAATTCAT CATGCGATAG ATCATTaAAT GATGrATATC AATTGACTGT TGAACGTGCG 1260

ArATCTGAAT ATACGaGTGA TGrATCGATg ACGCATTACG TAAAAAAGTT AAGTTAATGr 1320

AGaTGyCGAT TGATGrACTA GGTCCTGTAA ACTTAAATGC AATTGAACAA TTTGAAGAGT 1380

TAAATGAACG TTATACATTT TTAAGTGAAC AACGTACAGA TCTTCGTAAA GCTAAAGAAA 1440

CATTAGAGCA AATTATAAGT GAAATGGATC AAGAGGTTAC TGAAAGATTT AAAGAAACTT 1500

TCCATGCTAT TCAAGGACAT TTTACAGCTG TGTTCAAACA ATTGTTTGGT GGAGGCGATG 1560

CAGAATTGCA ATTAACTGAA GCCGATTATT TAACAGCTGG TATTGATATT GTGGtACAAC 1620

CACCGGGTAA AAAGTTGCAA CATTTATCGT TACTGAGTGG TGGTGAGCGT GCATTAACTG 1680

CTATTGCTTT ACTATTTGCA ATTTTAAAAG TAAGATCTGC ACCTTTTGTT ATATTAGrTG 1740
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10 





TATCAGACX3A AACACAATTC ATTGTTATTA CACACCGTAA AGGAACAATG GAATTTGCAG 1860 

ATAGGTTATA CGGTGTAACA ATGCAAGAAT CAGGTGTTAC TAAACTTGTG AGTGTGAATT 1920 

TAAATACAAT AGATGATGTG TTGAAGGAGG AGCAATAATG AGCTTTTTTA AACGCTTAAA 1980 

AGATAAGTTT GCAACAAATA AAGAAAATGA AGAAGTTAAA TCCTTAACAG AAGAACAAGG 2040 

TCAAGACAAA TTAGAAGATA CACATTCTGA AGGTTCAACG CAGGACGCAA ATGATTTAGC 2100 

AGAAAATGCT GAAGTGAAAA AGAAGCCACG CAAGTTGAGT GAAGCGGATT TTGATGACGA 2160 

TGGCTTAATA TCAATTGAAG ATTTTGAAGA AATTGAAGCT CAAAAAATGG GTGCTAAATT 2220 

TAAAGCAGGA CTCGAAAAAT CTCGTCAAAA TTTCCAAGAA CAATTAAATA ATTTGATAGC 2280 

GAGATATCGT AAAGTAOATG AAGACTTTTT TGAAGCTTTA GAAGAAATGT TAATCACTGC 2340 

AGACGTCQGT TTTAATACAG TGATGACGTT AACTGAAGAA TTACGTATGG AAGCACAACG 2400 

ACGTAATATT CAAGATACTG AAGATTTGCG TGAAGTCATT GTTGAAAAGA TCGTAGAGAT 2460

TTACCATCAA GAAGATkATA ATTCAGAAGC TATGAACTTA GAAGATGGTC GTTTAAATGT 2520 

CATTTTAATG GTTGGTGTGA ATGGTGTTGG TAAAACAACA ACAATTGGAA AATTAGCTTA 2580 

CCGATATAAA ATGGAAGGTA AAAAAGTAAT GTTAGCTGCO GGCGATACTT TTAGAGCGGG 2640 

TGCTATTGAT CAATTGAAAG TTTGGGGCGA ACGTGTTGGT GTAGACGTAA TTAGCCAAAG 2700 

TGAAGGTTCT GATCCAGCTG CTGTTATGTA TGATGCgATT AATGCCGCTA AAAACAAAGG 2760 

TGTTGATATT TTAATCTGTG ATACCGCTGG ACGTTTACAA AATAAmACAA ATCTAATGCm 2820 

AGAATTAGAA AAAGTTAAGC GTGTAATTAA TCGAGCAGTG CCAGATGCGC CTCATGAAGC 2880 

ATTACTATGT TTAGATGCTA CAACTGGTCA GAATGCGTTG TCACAAGCTA GAAACTTTAA 2940

AGAAGTAACA AATGTTACAG GTATTGTATT AACX3AAATTA GATGGTACAG CCAAAGGTGG 3000 

TATC33TATTA GCCATTCGTA ATGAATTGCA CATCCCAGTT AAATATGTAG GTTTAGGTGA 3060 

GCAATTAGAT GACTTACAAC CATTTAACCC TGAAAGTTAT GTCTACGGCT TATTCGCTGA 3120

TATGATTGAA CAAAATGAAG AAATAACAAC AGTTGAAAAT GATCAAATTG TAACAGAAGA 3180 

AAAGGACGAT AATCATGGGT CAAAATGATT TAGTtAAAAC GTTACGAATG AATTATTTGT 3240 

TTGATTTTaT CT^TCCTTAT TGACGAATAA ACAACGTaAT TATTTGGAAT TATTTTATCT 3300

TGAAGATTAT TCTTTAAGTG AAATCGCAGa TACTTTTAAT GTGAGTAGaC AAGCAGTTTA 3360 

TGATAATATA AGAAGAACTG GCGATTTAGT TGAAGATTAT GAAAAQAAAT TGGAATTATA 3420 

CCAGAAATTT GAGCAACGCC GAGAAATATA TGATGAAATG AAACCACATT TAAGTAATCC 3480 

AGAACAAATA C 3491

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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4253 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 







AGTACGTTTT ATAATTATAA GTACGTAATT AACATATTAA CATATCGCAA GTATGTATTT 60

AAATAAgATT GTTATAATTT CAAAGTTCAT CCAAGaTTAT GGCGTTTGCA TTTACCTATT 120

AAAAACGTTA TTATATCAAA GAT6CGAAAG ATAATACGGG TTTATTTTAT GAAAGTGAGA 180

AGGATAAAAT GGATAATGAG CAACGCTTAA AAAGAAGAGA GAATATAAGG AATTTCTCGA 240

TTATAGCACA TATTGACCAC GGAAAATCTA CATTGGCTGA TAGAATTTTA GAAAATACCA 300

AATCA6TIGA AACAAGAGAT ATGCAAGATC AGTTACTAGA TTCAATGGAT TTAGAAAGAG 360

AACGTGGTAT TACAATCAAA TTAAACGCgT ACGTTTAAAG TACGAAGCTA AAGATGGAAA 420

TACTTATACA TTCCATTTAA TCGATACGCC TGGACACGTC GATTTTACAT ATGAAGTGTC 480

ACGTTcTTTG GCAGCTTGTG AGGGCGCGAT TTTAGTAGTA GATGCGGCTC AAGGTATCGA 540

AGCACAAACA TTAGCAAATG TTTATTTAGC ATTAGATAAT GAGTTAGAGT TATTGCCTGT 600

TATTAACAAA ATTGATTTAC CTGCTGCAGA ACCTGAACGC GTGAAACAAG AAATTGAAGA 660

TATGATAGGT TTAGACCAAG ACGATGTTGT TTTAGCAAGT GCTAAATCTA ACATTGGAAT 720

TGAAGAGATA CTAGAGAAAA TAGTTGAAGT TGTGCCAGCT CCAGATGGTG ACCCAGAAGC 780

ACCACTAAAA GCGTTAATAT TTGATTCTGA GTATGATCCA TATAGAGGGG TAATTTCATC 840

GATAAGAATT GTGGACGGTG TTGTTAAAGC CGGAGATAAA ATTCGAATGA TGGCCACTGG 900

TAAACAGTTC GAAGTAACAG AAGTTGGAAT TAATACACCT AAGCAGCTTC CAGTTGATGA 960

ATTAACAGTT GGTGATGTTQ GTTATATTAT TGCAAGTATT AAAAATGTTG ATGATTCTAG 1020

GGTTGGTGAC ACCATCACAT TAGCTAGTAG ACCTGCATCA GAACCATTGC AAGGTTATAA 1080

GAAAATGAAT CCAATGGTAT ATTGCGGACT GTTCCCAATA GATAACAAAA ATTATAATGA 1140

TTTAAGAGAA GCATTAGAAA AATTACAATT GAATGATGCA TCATTAGAAT TTGAGCCTGA 1200

ATCGTCACAA GCATTAGGTT TTGGTTATAG AACTGGTTTC TTAGGTATOT TACACATGGA 1260

AATAATTCAA GAAAGAATTG AAAGAGAATT TGGTATTGAA TTAATTGCAA CTGCACCATC 1320

TGTAATTTAT CAATGTGTTT TAAGGGACGG TTCAGAAGTG ACGGTTGATA ACCCAGCACA 1380

AATGCCAGAT CGTGATAAAA TTGATAAAAT ATTTGAGCCA TATGTTCGTG CAaCTATGAT 1440
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TATAAATATG GACTATTTAG ATGATATTCG TGTAAATATT GTTTATGAAT TACCTTTAGC 1560 

TGAAGTTGTA TTTGATTTCT TCGATCAACT TAAATCTAAT ACTAAAGGAT ATGCATCATT 1620 

TGATTATGAA TTCATCGAAA ATAAAGAAAG TAATTTAGTC AAGATGGATA TTTTATTAAA 1680 

TGGTCATAAA GTGGATGCGC TAAGCTTCAT AGTTCATAGA GATTTTGCAT ATGAACGTGG 1740

TAAAGCATTA GTTGAAAAAC TTAAAACGTT AATTCCAAGA CAGCAATTTG AAGTACCTGT 1800 

ACAGGCTGCA ATAGGACAAA AAATTGTAGC GCGTACAAAT ATTAAATCAA TGGGTAAAAA 1860 

CGTTTTAGCT AAATGTTATG GCGGTGACAT AAGCCGTAAA CGTAAATTAC TTGAAAAACA 1920 

AAAAGCAGGT AAAGCTAAGA TGAAAGCAGT TGGTAATGTT QAAATTCCAC AAGATGCTTT 1980

CTTGGCTGTA TTGAAAATGG ATGATGAATA ATTTTAAAAA ATCAATTAAC AATTTACAAT 2040 

GAATAAAGTT TAATAACTAA AAAGAGGGAO CCTAGGATAA ATTAACGTGC TGGGCTTTAC 2100 

AATGTTATAT TGGCAGCCAT CGACAGAGTT AAAATCAGCT TATAACAATG GGGCCCCAAC 2160 

ACAGAAGCTG ACGAAAAGTC AGCTTACTAT AATGTGCAAG TTGGGGTGGG GCCCCAACAT 2220 

AGAGAATTTC GAAAAGAAAT TCTACAGGCA ATGCAAGTTG GGGTGGGACG ACGAAATAAA 2280 

TTTTGCGAAA ATATCATTTC TGTCCCACTC CCTTATGCAT GAGTTTTACT CATGTAATTT 2340 

TATTTTTAAG GACATATTAC ATCTGGCTAA TGTGTAAGAG CCACTACATA ATAAATCATT 2400 

AGTGGTTCTT TATTATTTCT ATCTCACTCC CTCTAAACAA GAATAAATAT TAAAATGAAT 2460 

CGATATATTA GACAATCATT GATTAAACXST TAAAGTTAAA AGTAAGAATA ATTGCAGATA 2520 

GTCCAACAGG ATATAGCCGA TTGGATAAAA AGTCTGAGAA GCGGGGCATT AAAATGACGG 2580

TACAAAGTGC ATATATACAT ATTCCATTTT GTGTAAGAAT ATGTACATAT TGTQATTTCA 2640

ATAAATATTT TATACAGAAT CAACCTGTAG AT6AGTACTT AGATGCACTA ATCACAGAAA 2700 

TGTCJACAGC AAAATATAGG ATCTTAAAGA CCATGTATGT AGGTGGCGGC ACACCAACGG 2760 

CCCTTTCTAT TAATCaGTTG GAAAGATTAC TTAAAGCAAT ACGTGATACG TTTACAATCA 2820

CAGGCGAGTA TACATTTGAA GCAAATCCTG ATGAGTTAAC TAAAGAGAAA GTCCAACTAT 2880 

TAGAGAAATA TGGAGTAAAA AGGATTTCAA TGGGCGTTCA AACATTCAAG CCGGAGTTAT 2940 

TGTCTGTTTT AGGTAGAACG CACAATACTQ AAGATATTTA CACTTCGGTG TTAAATGCTA 3000 

AAAACGCAGG TATTAAATCA ATCAGTTTAG ATTTAATGTA TCATTTACCO AAACAGACGA 3060 

TTGAAGATTT TGAACAAAGT TTAGATCTAG CTTTAGATAT GGATATTCAA CATATTTCGA 3120 

GTTACGGCTT AATACTTGAA CCTAAAACCC AATTTTATAA TATGTATAGA AAAGGCTTGC 3180 

TCAAACTTGC TAATGAGGAT TTAGGTGCTG ACATGTATCA GTTGCTGATG TCTAAGATAG 3240 

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AACATAATAA GGTTTACTGG TTTAATGAGG AATATTATGG ATTTGGAGCA GGTGCAAGTG 3360 

GTTATGTAGA TGGTGTGCGT TATACGAATA TCAATCCAGT GAATCATTAT ATCAAAGCTA 3420 

TAAATAAAGA AAGTAAAGCA ATTTTAGTAT CAAATAAACC TTCTTTGACT GAGAGAATGG 3480 

AAGAAGAAAT GTTTCTTGGG TTGCGTTTAA ATGAAGGTGT GAGTAGTAGT AGGTTCAAAA 3540 

AGAACTTTGA CCAATCTATT GAAAGTGTCT TTGGTCAAAC AATAAATAAT TTAAAAGAGA 3600 

AGGAATTAAT TGTAGAAAAG AACGATGTGA TTGCACTTAC AAATAGAGGG AAAGTCATAG 3660 

GTAATGAGGT TTTTGAAGCT TTCCTAATAA ATGATTAAAA AAAATTGAAA TTTCGAGTCT 3720 

TTAACATTGA CTTACTTTGA CCAATTTGAT AAATTATAAT TAGCACTTGA GATAAGTGAG 3780

TGCTAATGAG GTGAAAACAT GATTACAGAT AOGCAATTGA GTATATTAAA CGCAATTGTT 3840 

GAGGATTATG TTGArrTTGG ACAACCCX3TT GGTTCTAAAA CACTAATTGA GCGACATAAC 3900 

TTGAATGTTA GTCCTGCTAC AATTAGAAAT GAGATGAAAC AGCTTGAAGA TTTAAACTAT 3960

ATCGAGAAGA CACATAGTTC TTCAGGGCGT TOGCCATCAC AATTAGGTTT TAGGTATTAT 4020 

GTCAATCGTT TACTTGAACA AACATCTCAT CAAAAAACAA ATAAATTAAG ACGATTAAAT 4080 

CAATTGTTAG TTGAGAATCA ATATGATGTA TCATCAGCAT TGACATATTT TGCAQATGAA 4140 

TTATCAAATA TATCTCAATA TACAACTTTA GTTGTTCATC CTAATCATAA ACAAGATATT 4200 

ATCAATAATG TACACTTGAT TCGTGCTAAT CCTAATTTAG TTATAATGGT TAT 4253 

(2) INFORMATION FOR SEQ ID NO: 119: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3395 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119:

TCCCTAATCG AACAAAATTA TGCGCATAAA CAAAGTAGAT TGATATAAAA TTCTTAATTA 60 

TCAGAATATA TTTACAAATC TGAATTTTAT TAGTATATTG GrTAGTrTTC ATAGAGGCAT 120 

GACGGTaTTT 6AGCAGGATT TTAAATCGGg ATTTTATAAT CGATTTAAGA GAGGCCACtT 180 

TGCTTGcACA TTAATACTGT CAATGGGAGG GQAATGTATA TGAGTrAAGC ACATCAATTA 240 

ATTCAAGAGG ATGAACATTA TTTTGCGAAA TCAGGACGTA TTAAATATTA TCCGTTAGTG 300 

ATTGATCATG GATATGGAGC AACATTGGTT GATATTGAGG GGAAGACATA TATCGATTTG 360 

TTATCGAGTG CGAGTTCTCA AAACGTAGGT CATGCACCTA GAGAAGTAAC AGAAGCGATA 420 

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GTACGTTTAG CTAAGAAGCT TTGTGAGATT GCACCTGGAG ATTTTGAAAA AAGAGTGACC 540 

TTCGGATTAA CCGGATCAGA CGCAAATGAT GGCATCATTA AATTTGCCAG AGCATATACA 600 

GGGCGTCCrr ATATCATTAG TTTCACTAAT GCATATCATG GTTCAACTTT TGGCTCATTG 660 

TCTATGTCAG CTATTAGTTT AAATATGCGC AAACATTATG GTCCGTTATT GAATGGTTTT 720 

TATCATATTC CGTTTCCAGA TAAATATCGT GGTATGTACG AGCAGCCACA AGCTAATTCA 780 

GTAGAAGAAT ATTTAGCACC CTTAAAAGAA ATGTTTGCGA AGTATGTACC TGCTGACGAA 840

GTAGCATGTA TTGTTATTGA AACGATACAA GGCGATGGTG GACTTTTAGA ACCAGTTCCA 900

GGGTATTTTG AAGCGTTAGA AAAGATTTGT CGTGAACATG GTATTTTAAT CGCTQTCOAT 960

GATATTCAAC AAGGTTTTGG GAGAACAGGT ACATGGAGTT CAGTCTCGCA TTTTAATTTT 1020 

ACGCCTGATT TAATCACTTT CGGAAAATCC TTAGCAGGTG GTATGCCTAT GTCAGCAATT 1080 

GTTGGACGCA AAGAGATTAT GAATTGTTTA GAAGCACCAG CACATTTATT TACAACAGGT 1140 

GCTAATCCAG TTAGTTGTGA AGCTGCATTA GCCACAATTC AAATGATTGA AGATCAGTCG 1200 

CTTCTTCAGG CTAGTGCGGA AAAAGGGGAA TATGTTAGQA AACGAATGGA TCAATGGGTA 1260 

TCTAAATACA ATAGTGTAGG CGATGTTAGA GGTAAAGGTC TGAGCATTGG TATTGATATT 1320 

GTTTCCGACA AAAAACTCAA AACACGTGAT GCCAGTGCGG CACTTAAAAT TTGTAATTAC 1380 

TGCTTTGAGC ATGGCGTAGT TATTATAGCT GTAGCAGGAA ATGTGTTGCG ATTCCAACCG 1440

CCATTGGTAA TAACATATGA GCAATTAGAC ACGGCGTTAA ACACTATAGA AGATGCACTG 1500

ACTGCTTTGG AAGCAGGTAA CTTAGATCAA TATGACATAT CTGQACAAGG TTGGTAATAG 1560

CGATTATCTT AATATAAAAT AAAAAATCAT TTCCACATCT GGATGTTAAT CAGATGGGAA 1620

ATGATTTTTT TTATTTTTTA TTTTGGTGGG TGGTATTCAG CTACGTCATT TTTCTTAGAA 1680 

TGTCTAAGTC CATAACTTAA ATATAGGATG ATACCAACAA TAAACCAAAT TAAAGTGTAT 1740 

AATTTCGCTT CGAATCCTAA TCCCCAGAAT ACTAGCAATA CTAAAACAAA TGTAATTGCT 1800

GGTAACACAG GATATAAAGG TAATTTAAAT GCAGGAATTG GTAGATCTTT ACCTTcACGC 1860 

TTTCTCAAAC GATACATTGC TAATGAAACG AACATAAATG CAACAAGTGT ACCTGCTGAA 1920 

ATTAATTGTG CTAAAAATGC GAATGGGAAC ATAGAACCAA TTAAAACACC AATAATAGTA 1980 

AGTATAACTA GTGCGCGATT AGGTAAATGT TTGTCGTTTA AGTGGCTTAA CCATGAAGGT 2040 

AATAAGCCGT CACGTCCAAA TGAATAAAGT AAACGTGAGC CTGCTAACAT CATACCAATT 2100 

AATGCTGTAA ACATACCGAT AACAGAGATA GCTTGAACAA TAGCTGCTAC AACACCATGA 2160 

CCACTTTGAC GTAAAGCCCA ACCAACAGGT TCAGCATTGT TTGCGTATTG TGAGTAATGG 2220 

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CCAAGAATAC CTCTAGGCAT TGTCTTTTGA GGATCAAGTG CTTCTGCTGA GTTTGCTGCG 234 0 

ATAGAATCGA AACCGATATA CGCTAAGAAA ATCATTGAAA CACCAGCATA TATGCCTTGC 2400 

CATCCACCAA AGTCACCTGT AGCAGTTACT TTGTGTTCTG GAATAAATGG CACATAGTTA 2460 

CTAACATTTA TTGCTGTTAA ACCTACGATG ACAAATAAAA TAATAGCTAA TACTTTTAAA 2520 

ATAACTAAAA TATTTTCCAT ACGAGCTGCT TCCGACATAC CACGTGATAG TAATAATGCA 2580 

GTTAATAAAA TAACGATAGC AGCAATAATA TCGATAAAAC CGCCATTTGT ACCAAATGGA 2640 

TTTOATAATG CTGCAGGTAA TTCGATGCCA ATTGGTTTCA CAAGTCCGCG TAAATTCGCT 2700 

GAGAATCCTG ATGCAACAAA GGCTACGGCG ATAAAATATT CAGCTAATAG AGCCCAACCG 2760 

GCAACCCATC CAAAAAATTC ACCAAATAAT ACATTGACCC AAGAATAGGC TGAACCTGCA 2820 

AATGGCATAG CGGCA6CCAT TTCTGCATAA GTAAATGCAA CTAAACCAGC AACAATAGCA 2880 

GCGAGTAA6A ATGATAACGC AACGGCCGGT CCTGCATGTT CTGCAGCAAC AATGCCAGGT 2940

AGCGTAAAGA TAGATGTCGA TACAATTGTT CCTACACCTA AAGCTAAGAA ATCACGCACC 3000 

CGAAGTGTAC GCTTTAAATG ACCATCTTTA TTTTGATAGA TAGCCGGATC CTCTTTTCGT 3060 

GCTAXTTTAT TGAAAAAACT TCCCATAAAC TTTCCTCCCA AACATTCATA AACAATTCTA 3120

TACGGTGTTT TTTAATATGT TATATCATAG CACAAATAAT CAATATTTTG TCTAAAAATT 3180 

CTGAAAAATC ACAACTTTAT GTTACGTATT AATGACTTGT CTTGATAACA TCCATAGATT 3240 

TTTTAAATGA TAAAACTGAT TATAACAGAT ATTAAATGAA TAAGTACTAT TTTTTGCnAA 3300 
TTTTCTAACA ATTTTGCACA TTATATGTTT AAAATCAATT TCATGTTTAT GGTCTGATTG 3360 
GCTAGTGTGT ATGAAATGTA AnTCTTTGAC TnnGA 3395 

(2) INFORMATION FOR SEQ ID NO: 120: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 13508 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 

ATCAGGTAAT GCCATGCGTT TAGCTGAAAA TTTTTTCAGA ACGTTTAAGT GATATCGGAC 60 

ATCAAGTTGT TTTGATGTCA ATGGATGAAT ATGATACGAC AAACATCGCG CAGTTAGAAG 120 

ATTTATTTAT TATTACGTCT ACTCATGGTG AAGGAGAACC GCCTGATAAT GCATGGGATT 180 

TCTTTGAATT TTTAGAAGAC GATAACGCAC CTAATTTAAA TCATGTGAGA TATTCAGTAC 240 






TACTAGAAAA TCTAGGCGCT GAGCGTATAT GTAAGCGTGT AGATTGTGAT ATTGATTATG 360

AAGAAGACGC AGAAAAGTGG ATGGCAGACA TCATTAATAT TATTGATACC ACATCAGAAG 420

GTATTCAAAG TGAATCGGTG ATAAGTGAAT CAATTAAGTC TGCCAAAGAA AAGAAATATT 480

CTAAATCAAA TCCATACCAA GCAGAAGTAT TAGCGAATAT CAATTTAAAT GGTACCGATT 540

CAAATAAAGA AACACGACAT ATAGAATTTT TACTTGATGA TTTTAGTGAA TCATATGAAC 600

CAGGAGATTG TATAGTAGCA TTACCGCAAA ACGACCCTGA ATTGGTTGAA AAACTAATAT 660

CCATGTTAGG TTGGGATCCG CAATCTCCX3G TGCCAATTAA TGATCATGGT 6ATACAGTTC 720

CTATTGTTGA AGCACTAACA TCACATTTTG AATTTACTAA ATTAACATTG CCATTATTGA 780

AAAATGCAGA TATCTATTTT GACAATGAAG AATTATCTGA ACGTATTCAA GATGAGTCAT 840

GGGCGGGTGA ATATGTTATA AATCGGGACT TTATAGATTT AATAACAGAT TTTCCAACTA 900

TAGAATTACA ACCTGAGAAT ATGTATCAAA TCCTTAGAAA ATTACCACCA AGAGAGTATT 960

CGATTTCTAG TAGTTTTATG GCAACGCcAG ATGAAGTGCA TATTACCGTT GGTACGGTTC 1020

GTTATCAAGC ACATGGACGT GAGAGAAAAG GTGTATGCTC GGTTCATTTT GCTGAGCGAA 1080

TTAAACCAGG CX3ATATAGTA CCAATTTATT TGAAGAAAAA TCCGAACTTC AAATTTCCGA 1140

TGAAGCAAGA TATACCGGTT ATTATGATTG GACCAGGTAC TGrAATTGCT CCTTTTAGAG 1200

CATATTTACA AGAACGTGAA GAACTTGGTA TGACTGGAAA AACATGGTTG TTCTTTGGTG 1260

ATCAACACCG TAGTTCTGAC TTTTTATATG AAGAAGAAAT AGAAGAATGG CTTGAAAATG 1320

GAAACTTAAC ACGCGTAGAT TTAGCATTTT CAAGAGACCA AGAACACAAA GAATATGTAC 1380

AGCATCGTAT AATGGAAGAA AGTAAACGTT TCAATGAATG GATTGAGCAA GGCGCACAAT 1440

CTATATTTGT GGCGATGAAA AATGTATGGC GAAAGATGTC CATCAAGCCA TTAAAGATGT 1500

ATTGGTAAAA GAACGTCATA TTTCTCAAGA AGAAGCAGAG TTATTATTGC GACAAATGAA 1560

ACAACAACAA cgctatcaac gtgatgttta TTAGCGATTG GTGTTAAATA TTTTAAGGTG 1620

taatgatgta aaaagatata aaggatgttg CTCAACATGA ATATGCCATT AATGATAGAT 1680

TTAACAAATA AAAATGTCGT CATAGTTGGT GGAGGCGTCG TTGCAAGTCG TCGGGCACAA 1740

ACATTAAATC aatacgttga acatatgaog GTCATCAGTC CGACAATCAC TGAAAAACTT 1800

CAAAATATGG TAGATAACGG TGTCGTCATA TGGAAAGAAA AAGAATTTGA ACCAAGCGAT 1860

ATTGTAGACG CGTATCTAGT TATTGCAGCA ACCAATGAGC CACGTGTCAA TGAAGCGGTA 1920

AAAAAAGCCT TACCTGAGCA TGCCCTTTTT AATAATGTTG GAGATGCATC AAATGGCAAT 1980

GTTGTATTTC CAAGTGCACT ACACCGCGAC AAGCTAACTA TCAGTGTATC AACTGATGGT 2040
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TACAGTTCGT ATATCGACTT TTTATATACT 
ACATATAACG AAAAGCAACA GTTACTGTCA 
GACAAACAAG CTCAATTTTT AGCGTGGTTG

CCGTCTAAGG TAAGTCTTCT TATTTTAACT 
GGCGGCTTTG TGAATAGTCT AATAATGAAG 


TATGAATATT ACAATAGAGA AAAAGATACG 
ATTGAAATCT GGTTGAAGTC GTTACTATCA 
TAGAAAGCAA TAATAAAAAT GATAGATCAA 


TAGTCATGCC ATGTTATCAA GTAGGAAAAT 
AGATTTTTTT GATAAAATGA GATAACTTAA 
AAGTTAAAGA GGGGGATTAT GTAAATTGTA
AGTOATAATA TGTGGTTAAC AGTAATGGGG 
CTCATTGCCA AAAAGATAAA TCCAGTTGTA 
ATGATTTTAG GATATAGTGT GACAGATTTG 
GTCATCAACG TTGTTATTAT GTTTATCTTT 
AGTGGTTTAT TCAAGCCGCT TGTCAAACGC 


ATTGTCTGTG CAATGACAGC TTTAATTGGC 
GTAACATTTT TGCTTTCTAT TCCTGCATTA 
AAATATTTAT TGATTTTACT ATTAGCATTA 


GGAGGTCCAA TGGCTCGTGT AGCTGCAGTG 
TATC25ATTAA TACCTATTCA AATAATAGGT 
CTTdiGAnTA AAGAACAGAA ACGTATCAAA 
ACACAAGATA TAGATGTACA TAAATTAGTT 
TTTCCTGTAA AAGGAC6TGC AAGAACAAAA 
ACTTTAGCTG TTATTCTATC GATGTTAATA
ATAGGTGTTy CGTTGGCACT TGTTATTAAT 

TTAAGAGCgC ATGCGCCGAA TGCATTAATG 


TTTTTAGGTG TACTAAATXxA AACCGGTATG 

GTGATTCCTG CAGAAGTAGO ACCATACTTG 




TGCCGACAGA AAATAAAAGT ACTTGATATA 2160 

CAAATTGTGT CACAAGAATA TTTAAATCAT 2220 

GATGTAAGAT AATAATAGCG GACCGTCTAA 2280 

TTAACGCTTA ATCATTGAAA TTAAGACATG 2340 

GATTTAAGCG ATAATGATAT GCX3TTTTAAA 2400 

TAGAACAAAC TTAATAAAAT AGGTGGATAA 2460 

TAGOGACCTT TAGCCAGATT TTTTGTGCAA 2520 

AATGAAATAC AGGACAGGAT ATACAAGGAT 2580 

CAAACTTCAC TATTGATAGT TACXSCAAAAA 2640 

ATATAAAAAA TTATATTAAT TATAATATTT 2700 

TTAAAAGTGG AGGGAGAAAA TAATATGAAT 2760 

CTCATTATTA TTATTTCAAT TGTAGGTTTA 2820 

GGTATGACAA TCATACCTTG CTTAGGGQCA 2880 

GTTGGATTTT TTGCTAAAGG GTTAGATCAA 2940 

GCCATTATTT TCTTTGGCAT CATGAACGAT 3000 

TTAATATTAA TGACACGAGG CAATGTCGTC 3060 

ACAATAGCCC AATTAGATGG GGCCGGTGCG 3120 

TTACdTTAT ATAAAGCX3TT AAATATGAAT 3180 

AGCGCGGCGA TTATGAACAT GGTACCTTGG 3240 

TTAAAAGCCA AAAGTGTCAA TGAATTATGG 3300 

TTCATTCTTG TTATGTTGTT TGCGGTATAT 3360 

AAAGCAATAG AGAGAAATGA ATTACCGCAA 3420 

GAAGTATATG AACGAGATCA AGATGTAAGG 3480 

TCATGGATAA AATGGGTGAA TACAGCTTTA 3540 

AATATTGCGC CACCTGAATT TGCATTCATG 3600 

TTTAAATCAG TGGATGAACA AATGGAACGA 3660 

ATGGCTGCAG TQATTATTGC AGCAGGTATG 3720 

CTTAAAGCGA TTGCGACCAA TTTAATCAAA 3780 

CATATTATTG TAGGTTTACT TGGCGTACCA 3840 
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ACAGCAGGGC 


AATTTGGTGT 


ACCXSTCTGTA 




ATTATAGGTA 


CATTTGTCAG 


CCCATTTTCA 




GAGGCAAACA 


TGGGCACGTA 


TATTAAGTAT 




GTTATGTTAG 


TAATTGCAAT 


GTTGATGGGC 




AACTATGGTC 


ACGTTGCAAA 


ATGAAATAAT 


AATCTTTAAA 


CAATTTTAAA 


AATTAATGAA 




ATGTTAATAA 


CTTTGTGACG 


TTTTAGTTAA 




ATAGTGGTAT 


AACGTAATGA 


GTAGACACAA 




ATCGAAGAAG 


CATTAACGGG 


TTTGATTTCT 




AACAAAGATA 


GTGATACATT 


TTCAACAATG 




TCTTACGCAT 


TAAATCATTT 


ATTACCAAAG 




ATACATTTTC 


ACGACTTAGA 


TTATCATCCA 




GATGCTAAAA 


ATATGCTACA 


TAATGGATTT 




AAATCAATAC 


AAACTGCATC 


AGCX3CAGCTT 




CAATATGGTG 


GCTGTAcGGT 


TGACCgCGTT 




TAATGAAGAA 


CAACATAGGA 


ATATsCGCAA 




GTTATGTTGA 


TCAACAAGTC 


ACTAAA6ACA 




AAATTAATAC 


CTTATATACA 


TCTAATGGAC 




TAGGTACAGA 


TCATTTAAGT 


CGCAAAATTC 


GCTTAGGAAA 


AGACCGCACG 


ACAGCGATTT 




GAACC3VACTT 


TAGTCCGCAA 


GATCCGAACT 




CAACGAAACG 


TATGTATCCA 


GATATTTTAA 




ATTTCAAAGC 


GCCAATGGGT 


TGTCGTTCAT 




ATTTTGAAAA 


TAATGGTCGT 


TGTAATCTTG 




CATTAGAATC 


TGCCGGTAAT 


ATGACGAAAT 




TGTTACATGA 


TGCATTACTT 


TATCGTATAA 




CACCGATTTT 


ATATAAAAGT 


GGCGCATTTA 




CTGAGTTATT 


TAAAAATAAA 


CGTGCAACGA 




CAGCTACTGT 


TTTCTATGGT 


CCAGACTGGG 






TCAACAGCTT ATTCAATGGT CATAGGGAAT 3960 

CCAGCCTTAT GGTTGGCAAT TGGTTTAGCA 4020 

GCATTCTTTT GGATTTGGGG ATTCGCTATC 4080 

ATTGTGACGA TTTAAGTATG AAAAAATAGA 4140 

AGTTGCATAA ACATGTCGAA ATGACGGACG 4200 

ATAATTGTGT AGAAATATGA ATTTCACTAA 4260 

CAGACTAATA AAAATTTGAA AATACTATAT 4320 

TATATAGGAA GAAGGGGTAA AATGAATCAA 4380 

AAAGATCCTG CTATTGTTAA CGAAAATGCT 4440 

AGAGATTTAA CAGCAGGTAT CGTTTCTAAA 4500 

CACGTTGCAG ATGCACATCA AAGAGGGGAC 4560 

TTCCAACCGT TAACTAACTG TTGTTTAATA 4620 

GAAATAGGCA ACGCGAATGT AACTTCACCA 4680 

GTACAAATTA TAGCCAATGT TTCTAGCAGT 4740 

GACGAATTAC tTAGTACATA TGCACGACcA 4800 

AGCAATTTGT CAAAGAATCT GAAATTGATC 4860 

TCAATGATGC GATTGAAAGT TTAGAATATG 4920 

AGACACCTTT TGTAACATTA GGATTCGGCT 4980 

AACAAGCTAT CTTAAATACT CGTATCAAAG 5040 

TCCCGAAACT TGTATTTTCA ATTAAAAAAG 5100 

ATGACATTAA ACAACTAGCA TTAAAGTGTT 5160 

ATTATGACAA ACTCGTAGAA ATATTAGGTG 5220 

TTTTACCAAG TTGGAAAGAT GCGGAAGGTC 5280 

GTGTTGTTAC ACTTAATTTA CCTAGAATGG 5340 

TCTGGGAAAT CTTTTATGAA CGTATCGATG 5400 

ATCGTTTGAA AGATGCTOTA CCGAATAACG 5460 

ACTATAAATT AAAAGAAACA GATGATGTTG 5520 

TTTCAATGGG CTATATAGGG TTGTATGAAA 5580 

AAACATCTCA AGAAGCAAAA GCATTTACGC 5640 
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GGTTCAGTAT TTmCAGTACG CCGAGTGAAT CGCTAcGGAT CGTTTTTGTC GTTTAGACCA 5760 

AGAGAGATTT GGAGATATTA AAGACATTAC AGATAAAGGA TATTATCAAA ACTCTTTCCA 5820 

TTATGATGTA CGTAAAGATG TTACACCTTT TGAAAAGTTA GATTTTGAAA AAGATTATCC 5880 

TTATTATGCG AGTGGTGGTT TCATTCACTA TTGTGAGTAT CCGAAATTGC AACACAATTT 5940 

GAAAGCACTA GAAGCGGTAT GGGACTACTC TTATGACAAA GTTGGTTACT TAGGTACAAA 6000 

TATTCCGATT GATCATTGTT ATGAATGTGA TTACGATGGA GATTTTGAAG CAACTGAAAA 6060 

AGGATTTAAA TGCCCGAACT GTGGCAATGA TAATCCTAAA ACAGTTGATG TCGTTAAACG 6120 

AACATGTGGT TACCTAGGCA ATCCAGTTCA ACGTCCAGTA ATTAAAGGCC GTCATAAAGA 6180 

AATTTGCGCA CGAGTAAAAC ATATGAAAGC GCCTAAAGAA TGATACTTTT AGACATTAAA 6240 

CAAGGACAAG GTTATATTGC TAAAATAGAA TCAAATAGCT TTGTTGACG6 TGAAGGAGTA 6300 

AGATGCAGTG TTTATGTATC AGGATGTCCA TTTAATTGTG TTGGATGTTA TAACAAAGCC 6360

TCACAAAAGT TCAGATATGG CGAGAAATAC ACTGATGAAA TATTAGCAGA AATATTAGAT 6420 

GATTGCGATC ATGATTATAT ATCTGGGCTA AGTCTATTAG GTGGCGAACC ATTTTGTAAT 6480 

TTGGATATTA CATTAAATCT TGTCAAAGCA TTTCGAGCAC GTTTTGGAAA TACAAAGACA 6540 

ATTTGGGTAT GGACTGGATT TTTATATGAA TATTTAGCAA ATGATTGTAC AGAACX5TCQA 6600 

GAGrrATTAT CATACATTGA CGTTTTAGTA GATGGTCTAT TTATACAACA CTTATTCAAA 6660 

CCTGATTTAC CATATAAAGG TTCTTTAAAT CAACGCATTA TAGATGTACA ACAATCACTC 6720 

TCGCATGCGC GTATGATTGA ATATATAGTT AGTTGAATAT GTATTAGAAG TCAAGGTAAC 6780 

ATTCGTTGCC TTGGCTTCTT TTTAGGTTAG GTACATAATT GAAAGTTAAT AAAAGCAATT 6840

CTTTATAAAA ATATATTGAT AGAATATGAC CTAACAATCA TTTTGATACC AATACTAAAA 6900 

GTTGCATATC CGTTTTTTAA AAAAGTTGAA AGAGAAAAGT GGTATTTTAG TGGGAAGGAA 6960 

GTCTAACTTT TTGGTAGCGT TTTACAATAA ATAAATATTC GTTAATAACG TATAAATATT 7020

CTTAAATGCC ATTCTAGTAA AATTTGTTAA ATTCGTTAAA TOGTAACTTA ACACTGTTAT 7080 

TTTAGCGCTA TTAAGGTTTT GTTTATTAC6 GGAAAAATTA TATAAATATT CAATAATTGC 7140 

CAAGTTTCAA ATTGTATGAA ATTTGCATTA TTATTAAATG TTAGTTATTG TCAATTTTGT 7200

GAATCAATAT AATTATTACA TTTTGAGATA AATCXy^CA GGATTCATAA AATTAATAAT 7260 

TAGGGGGAGC ACAATTGAAA AAAGAGAAAG TTATGGACTG GACGACCTTT ATAGGGACAG 7320 


TAGCTGTACT TCTTTTTGCA GTTATACCTA TGATGGCTTT TCXJUUUIGCA AGTGAAGATA 7380 

TCATCACTGG TATTAATAGT GCCATTTCTG ATTCAATTGG TTCGATATAT TTATTTATGG 7440 

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TTGGTAAAGC AAGTGATAAA CCAGAATTTA ATACATTTAC ATGGGCGGCA ATGCTGTTTT 7560 

GTGCAGGCAT AGGCTCTGAT ATTTTATACT GGGGCGTTAT TGAATGGGCT TTTTACTATC 7620 

AAGTTCCACC AAATGGCX3CG AAAAGTATGA GTGATGAAGC ACTCCAATAT GCGACGCAAT 7680 

ATGGTATGTT CCACTGGGGG CCAATTGCTT GGGCTATTTA TGTTCTACCA GCATTACCAA 7740 

TTGGTTATTT AGTATTTGTT AAAAAACAAC CGGTGTATAA AATTAGTCAA GCTTGTCGTC 7800 

CGATTTTAAA AGGTCAAACA GATAAATTTG TAGGTAAAGT TGTAGATATC TTATTTATCT 7860 

TTGGATTGCT AGGTGGTGCG GCAACATCAC TAGCGTTAGG TGTGCCATTA ATTTCTGCAG 7920 

GCATAGAAAG ATTAACTGGT TTAGATGGTA AAAATATGAT TTTACGTTCG GCCATTTTAT 7980

TAACAATCAC GGTTATATTT GCCATTAGTT CATATACAGG ATTGAAAAAA GGTATTCAAA 8040 

AGTTAAGTGA TATCAACGTT TGGCTATCCT TTGTACTTTT AGCCTTTATA TTTATTATTG 8100 

GACCGACTGT TTTTATTATG GAAACGACAG TGACAGGGTT CX3GAAATATG TTGAGAGATT 8160

TCTTTCATAT GGCAACATGG TTAGAACCAT TCGGTGGTAT TAAAGGTCGA AAAGAAACGA 8220 

ATTTCCCACA AGACTGGACA ATATTCTACT GGTCATGGTG GTTAGTATAT GCGCCATTTA 8280 

TCGGTTTATT TATCGCTMA ATTTCAAAAG GTCGACGCCT TAAAGAAGTC GTGCTAGGAA 8340 

CAATTATTTA TGGAACGCTT GGATGCGTAT TATTCTTTGG TATTTTTGGT AACTATGCTG 8400 

TGTAnTACA AATTTCTGGA CAGTTTAATG TAACACAATA TTTAAATACA CATGGTACAG 8460 

AGGCAACCAT TATTGAAGTG GTGCATCATT TACCATTCCC ATCATTGATG ATTGTACTAT 8520 

TCTTAGTATC TGCTTTCTTA TTCTTAGCAA CAACATTTGA TTCGGGTTCA TATATTTTAG 8580 

CGGCAGCATC TCAG7VAAAAA GTGGTAGGCG AACCATTACG TGCCAATCGT TTATTCTGGG 8640

CATTTGCATT GTGCTTATTG CCATTTTCAT TGATGCTAGT TGGTGGTGAA CGTGCATTAG 8700 

AAGTATTGAA AACTGCTTCA ATACTGGCAA GTGTGCCATT AATTGTTATT TTTATTTTCA 8760 

TGATGATATC ATTTTTAATC ATTTTAGGGC GCGATAGAAT TAAACTTGAA ACGCGTGCTG 8820

AAAAATTAAA AGAAGTTGAA CGTCGTTCAT TGCGAATCX3T TCAAGTATCa GAAGAAGAAC 8880 

AAGACGATAA TTTATAATTC AAAGCGGGTC TGGGACGACG AAATGaATTT TGTGAAAATA 8940 

TCATTTCTGT TCCaTTCCCC TTTTTTTAGT AGCATTGTAG GATGAACTTT TAGGTTTTCA 9000 

TTAATGTTGT ACTAAAAGAT TTAATTTTTT AGTGCTCCAA GTACTTATTT ATTGTATGAA 9060 

GCATATTCTA AATCGAAGTT TGAAAGACTC TCATTGATTA TTAAATTAAA TAAAGGGTAT 9120 

GCGTATGTAC AATTCAAATT AATCGAAGGA TGAAATAAAA TGACTAATCA ATTTAAAAAT 9180 

AAACAGTCCA AATTACATGA CAGTTTAGAA TCXATCACAA AAAACTTATA TGCGACACCT 9240 
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25 



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ACAGAATAIT GTTATCTATC ATTCCGGACA CTTAGGTGAC TCCCAACAAG ACATTGCATC 9360 

ATTAGGTGGT GTTTCAAAAG TATTGATGAA TCATGATCAT GAATCTATAG GAGGTTCTAA 9420 


TCAAGTTGAA GCCCCTTACT TTATACATGA AAATGATGTG GCTGCACTGA AACATAAGAT 9480

TTCTGTTCAA AAACAATTTA GTAATCGTGT AATGTTGGAT AAGGATTTAG AAGTTATTCC 9540 

CGCGCCTGGA CATACACCAG GGACGACACT ATTTTTATGG GATGATGGTC ATCACCGTTA 9600 


CTTATTTACT GGAGATTTTA TATGTTTTGA AGGGAAGAGA TGGCGTACAG TTATATTAGG 9660 

TTCAAGTGAT AGAGAAAAAT CTATTCAAAG TTTAGAGATG GTTAAAGAAT TAGATTTTGA 9720 

TGTACTTGTA CCTTGGGTTA CTATCAAAGA TGAACCGTTA GTTTATTTTG TAGAAAATGA 9780

ATATGAAAAA CGTGAACAAA TACAAAATAT TATTGATAGA GTACGTGAGG GCGAGAATAG 9840 

CTAATTGAAA TATATTGGCG AAgCAATGTA ACGAATCTAA GAAAGCCCTA GAAAATACCT 9900 

CCATAATTGA TTGTCATATA AAACAAAAAC GGTAATTTCT ATTTATTGAG ATAGAAATTA 9960

CCGTTTATTT CGTGGACCTA TTGCATTGTT TTTATCATGC ATAATCATCA TTGTCGTTGT 10020 

TTGAGTCAAT TTTAATTTTC AGAATCAGAA GGCTGTTCTG GAATTGGGAA ATATTTGAAA 10080 


ATTTCACCGC TTTCAATCGC TTCGGTTAAC TGTTCTAACC ATTCGTAATA AACATGTGTA 10140 

TGATCAAGCT GAGCTTTAAT TTTTTGTGCC TCTTGTGTTT CAGCTTCAGT TAAATCACTG 10200 

CTTTCAAGTA ATGGATTGAT AATAGCTTGA GCATCTTTTA CTGCTTCGAC ATTGATGTCA 10260 


ATTTCACGCT GGAATTTTTT AGTGAAAAAG TTTCGGAAAA AGATGAAAAA GTCTTTCTCG 10320 

GCGATAAAAT GTTGTTTGCG GCTTCCTCTC GTAAATTGTT GTTTAACAAT ATCAAATTCC 10380 

TGCAATTTCT TAACGCCAGC ACTCATACTT GGTTTGCTCA TTTGCAATTG ATGACGCATT 10440

TCATCAAGCG TCATACTGCC TTCAAACACC ATTGTGCCAT ATAAGTTTCC TACACTTCTA 10500 

TTAGTGCCAT ACAAATCCAT TGTCTGTCCA ATTGAATTAA TTACAATATC TTTTGCTTGT 10560 

TCTAATTGTT GCTGTTTGTT CTGAGAACGA GTCATCATTG CACCTCCGTA CATCATTTTG 10620

GTCACGTTAA AATAAATACT AATACATTAT AAAACXnTTT CTAAAAAAAG ACATTAAAAA 10680 

TATTTAAAGC ATTAAAGTTA AATGTTTCGT TAAATAAAAA TCTAACGAAC TTACAAAACT 10740 

TAATTCTTGA GTTGTTTTGT AAATTGACAC ATTTTTCATT TCTATGCTAA CATAAGTnTG 10800

TAAAATTcGT TAAATAAAAA TTTAACAAAC TTAACGGrGG TTGTTGAAkG GrACTTTTAA 10860 

aACATTTATC TCAGCGTCAA TATATTGATQ GTGAGTGGGT TGAAAGCGCX5 AATAAAAATA 10920 


CAAGAGATAT TATCAATCCT TACAATCAAG AAGTGATATT TACGGTTTCT GAAGGGACAA 10980 

AAGAGGATGC AGAACGTGCA ATCTTAOCTG CAAGACGTGC GTTTGAGTCT GGTGAATGGT 11040 
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AACATCgCGA AgCgTTAGCA CGATTAGAAA CATTAGATAC TGGAAAAACG TTAGAAGAAT 11160 

CATATGCAGA TATGGATGAT ATTCATAATG TGTTTATGTA TTTTGCTGGA TTAGCAGATA 11220 


AAGACGGTGG CGAAATGATT GATTCACCAA TTCCAGATAC AGAAAGCAAA ATTGTTAAAG 11280 

AACCAGTAGG TGTAGTTACA CAAATTACAC CTTGGAATTA TCCGTTATTA CAAGCATCAT 11340 

GGAAAATTGC GCCAGCGCTT GCTACGGGTT GTTCACTAGT TATGAAACCA AGTGAAATTA 11400 


CACCATTAAC AACAATACGT GTTTTT6AAT TAATGGAAGA AGTTGGTTTC CCTAAAGGAA 11460 

CAATTAATCT TATTCTAGGT GCAGGTTCTG AAGTTGGTGA CGTAATGTCA GGTCATAAAG 11520 

AGGTTGACCT TGTATCATTT ACAGGTGGCA TTGAGACTGG TAAGCATATT ATGAAAAATG 11580

CTGCTAATAA TGTTACGAAT ATTGCCTTGG AACTTGGCGG TAAAAATCCA AACATTATCT 11640 

TTGATGATGC TGATTTTGAA TTGGCAGTAG ACCAAGCGTT AAATGGTGGA TATTTCCATO 11700 

CAGGTCAAGT TTGTTCAGCA GGATCAAGAA TATTAGTACA AAACAOTATT AAA6ACAAAT 11760

TTGAGCAAGC ACTTATTGAT CGCGTGAAAA AAATCAAATT AGGTAATGGT TTTGATGCTG 11820 

ATACTGAAAT GGGACCAGTG ATTTCAACAG AACATCGTAA TAAGATCGAA TCTTATATGG 11880 

ATGTAGcTAA AGCAGAAGGC GCAACAATTG CTGTTGGTGG TAAAOGTCCA GATAGAGATG 11940 

ATTTAAAAGA TGGTCTATTC TTCGAGCCAA CAGTCATTAC AAATTGTGAT ACGTCAATGC 12000 

GTATTGTACA AGAAGAGGTT TTCGGACCTG TCGTTACTGT AGAAGGCTTT GAAACTGAAC 12060 


AAGAAGCGAT TCAATTAGCG AATGATTCTA TATATGGTTT AGCAGGTGCT GTATTTTCTA 12120 

AAGATATTGG AAAAGCACAA CGCGTTGCTA ACAAGTTGAA ACTTGGAACG GTGTGGATTA 12180 

ATGATTTCCA TCCATATTTT GCACAAGCGC CATGGGGTGG ATACAAACAA TCAGGTATCG 12240 


GTAGAGAATT AGGCAAAGAA GGCTTAGAAG AGTACCTTGT TTCAAAACAC ATTTTAACAA 12300 

ATAciAATCC ACAATTAGTG AATTGGTTTA GCAAATAAAA ATTAGATAAO GTGAGTGCCA 12360 

TTGTAAGAAC ACAAGACACT CACTTTGTTT TGTATAAGTG GCGAAATGTT GATTGATAAT 12420

TTGGACTAAA CGCAAAATGA ATCATAGATT ATTTCATTAC TGTTAGTAAC AATC6TAAAA 12480 

GGAAAAGCGA GTGTTTTGGT TAGCTAAGTT TAGCAATTCA ACGATAACCA ATCAGCCACT 12540 

AACAAATATT TCATGCAATA CTCACTTTGA AATACAACAA ACTTTGGAGG TCATAACGAT 12600 

GAGTAACAAA AACAAATCAT ATGATTATGT CATCATTGGA GGAGGCAGTG CAGGTTCTGT 12660 

ACTAGGTAAT CGTCTGAGTG AAGATAAAGA TAAAGAAGTC TTAGTATTAG AAGCGGGTCG 12720 


CAGTGATTAT TTTTGGGATT TATTTATCCA AATGCCTGCT GCGTTAATGT TCCCTTCAGG 12780 

CAATAAATTT TACGATTGGA TTTATTCAAC AGATGAAGAA CCACATATGG GCGGTCGTAA 12840 
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TCAACGTGGT AATCCAATGG ACTATGAAGG CTGGGCAGAA CCAGAAGGTA TGGAAACTTG 12960

GGATTTTGCG CACTGTTTAC CGTATTTTAA AAAATTAGAA AAAACATACG GTGCAGCGCC 13020

TTATGATAAA TTTAGAGGCC ATGATGGACC AATTAAGTTA AAACGAGGGC CAGCAACGAA 13080

TCCTTTATTC CAGTCATTCT TTGATGCAGG TGTTGAAGCA GGCTATCATA AAACACCTGA 13140

TGTGAATGGA TTTAGACAAG AAGGTTTTGG ACCGTTCGAT AGTCAAGTAC ATCGTGGTCG 13200

CCGAATGTCA GCTTCAAGAG CATATTTACA TCCAGCGATG AAGCGTAAAA ACTTAACCGT 13260

TGAAACACGT GCCTTTGTAA CTGAAATTCA TTATGAAGGT AGAAGAGCAA CTGGTGTTAC 13320

GTATAAGAAA AATGGCAAAC TACATACCAT CGATGCTAAT GAAGTCATTT TGTCTCOTGG 13380

GGCATTCAAT ACGCCACAAT TACTACAATT ATCTGGTATC GGTGATTCAG AGTTCCTAAA 13440

ATCAAAAGGC ATTGAGCCAC GTGTTCATTT ACCTGGTGTG GGTGAAAACT TTOAAGATCA 13500

CTTAGAGG 13508



(2) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7646 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 






GTAAGTATTG TCTTGATTTC CTAATAAAGT TATATCTTGT AATTCATCTT GTTGACGGCC 60

ATGTGCCATA TAAAGCGCTC CTTTAAATTT ATTTTTTTAT TATTTTGGCG TCTCGGCGTG 120

CTTTTTCAAA CATGTAATAA CTTGCACCGA TAATAACGAC GTAACCTAAT GTTGCATAGA 180

AATCS'GGAGA TTCTCCGAAT AGAATAAATC CAAGTATTGC TGTGAAAATT ATAGATGCAT 240

ACGTAAAAAT AGAAATATCT TTTGCTGCTG CAAAACTATA TGCTAAAGTA ACACCAATTT 300

GACCCACAGC GGCAgCTAAG CCAGCCCCTA ATAGATAAAG TATTTGCATC TGACTCATTG 360

GTTCATAAGT ATATGCAGTG AAAGGTATTA AAACGATGAC AGAAAATAAG GAGAAGTAAA 420

ATACTATAGT ATATGGTGCT TyTCTTGTAC TAAGTGCTCG AACACATGTA TATGCTGATG 480

CTGCAAAAAT ACCTGAGAAT AAGCCAGCTA ATGATCGAAT CATAGATGAT GAAAATTCAG 540

GTTTCACTAT TAAnAGCAaC CTAAAATAGC AATTATCATT GCTGTAATTT GaTACTTCCT 600

TACCTTTTCA TGtAAGAaaA CAATGCTTaA TAAAATCGTC CAGAAAGGAT TGAGTTTCAT 660

TAAT6AATCG GCATCACTAA GTACCATATG ATCAATGGCA TAAATATTTA ACAATACACC 720
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TGGCTGATGG TATTTATATA TAAAAAATAA TGGAATAAAC ATTGCTACTA AGTTTCGTGC 840 

TAATGATTTT TGAAAAACAG GAAGGTCACC TGCAAGTCTG AAAAACACTG ACATAAAACT 900 

GAAACCAATA GCCGAAATTA AAATGGCAAT GATACCTTTT ACTTTAGGAT TCAATTTTAT 960 

CGCCTCTTTT ATATAAAATT AACGTATTTA TATTAGCATA AAACAACATG TTGTGCATAA 1020 

ATAGTTGAAA TTTACTATAA AAAGACTATA ATAGACTGTA GCX5AACAAAC GTTCTGTGTT 1080 

TATTTGTCGG AATAATAGGG CATTACACTT TTATGAATGT TTGTGTTATT ACATAAAACA 1140 

AATATCAATT CAGTATCAAG CTAATAAGCT TTTTCTTGAT TTCTGTTCAT ACAATTGAGA 1200 

TTGACACAGA TTTAAAAAAA TCAAGTGATA TCTACTAAAA AATTTTTTTA AATTTGTTCA 1260 

AGTTTTTCTA ATTTAGTATT GGTGCCTAGT TGGAACaTTT TACGAACATT CGATTAGAAA 1320 

ATGGCACTTT AAATCATAOT GTGTCTTATG TATAATGAAA CAGATAATAT AGTGTTGGTG 1380 

AAACGAAAAA gACACAATAT CTTGTGTTTT GTATGCAAAT GCTTTATTTA TGAAGAAATT 1440 

ACAT7TAAAA GTAATTTAAC ACAGAAATTT AATAGTTATT ATCAATTAAT AGTCATATTT 1500 

TTAGAAAATG TACTGAGCAA ATGGAAGATA TCCAATGAT6 TAAACACTAC ATATAGTGAT 1560 

TTTTATACAT TCAACCCATA TAAGCTACTA TTTTCTCAAA TATAAATCTA TGCAATTGGT 1620 

TTACATTTGA GAAAATAAGT AGCTTCATTA TAGTTAATAC AATGCTGAGA TAACCATAGT 1680 

AACCATGTTG TTAAAGCATT TTTTAATTGG AATGACTACT TTATTTAAAA GGGTTGAAGA 1740 

AAGAAGGTGA TCCAATGAAA ATAATATATT TTTCATTTAC TGGAAATGTC CGTCGTTTTA 1800 

TTAAGAGAAC AGAACTTGAA AATACGCTTG AGATTACAGC AGAAAATTGT ATGGAACCAG 1860 

TTCATGAACC GTTTATTATC GTTACTGGCA CTATTGGATT TGGAGAAGTA CCAGAACCCG 1920 

TTCAATCTTT TTTAGAAGTT AATCATCAAT ACATCAGAGG TGTGGCAGCT AGCGGTAATC 1980 

GAAi^^TGGGG ACTAAATTTC GCAAAAGCGG GTCGCACGAT ATCAGAAGAG TATAATGTCC 2040 

CTTTATTAAT GAAGTTTGAG TTACATGGAA AAAACAAAGA CGTTATTGAA TTTAAGAACA 2100

AGGTGGGTAA TTTTAATGAA AACCATGGAA GAGAAAAAGT ACAATCATAT TGAATTAAAT 2160 

AATGAGGTCA CTAAACGAaG AGAAGATGGA TTCTTTAGTT TAGAAAAAGA CCAAGAAGCT 2220 

TTAGTAGCTT ATTTAGAAGA AGTAAAAGAC AAAACAATCT TCTTCGACAC TGAAATCGAG 2280 

CX3TTTACGTT ATTTAGTAGA CAACGATTTT TATTTCAATG TGTTTGATAT TTATAGTGAA 2340 

GCGGATCTAA TTGAAATCAC TGATTATGCA AAATCAATCC CGTTTAATTT TGCAAOTTAT 2400 

ATGTCAGCTA GTAAATTTTT CAAAGATTAC GCTTTGA/yU^ CAAATGATAA AAGTCAATAC 2460 

TTAGAAGACT ATAATCAACA CGTTGCCATT GTTGCTTTAT ACCTAGCAAA TGGTAATAAA 2520 
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ACATTTTTAA ACXKIAGGCCG TGCGCGTCGT GGTGAGCTAG TGTCATGTTT CTTATTAGAA 2640 

GTGGATGACA GCTTAAATTC AATTAACTTT ATTGATTCAA CTGCAAAACA ATTAAGTAAA 2700 

ATTGGGGGCG GCGTTGCAAT TAACTTATCT AAATTGCGTG CACGTGGTGA AGCAATTAAA 2760 

GGAATTAAAG GCGTAgCGAA AGGCGTTTTA CCTATTGCTA AGTCACTTGA AGGTGGCTTT 2820 

AGCTATGCAG ATCAACTTGG TCAACGCCCT GGTGCTGGTG CTGTGTACTT AAATATCTTC 2880

CATTATGATG TAGAAGAATT TTTAGATACT AAAAAAGTAA ATGCGGATGA AGATTTACGT 2940 

TTATCTACAA TATCAACTGG TTTAATTGTT CCATCTAAAT TCTTCGATTT AGCTAAAGAA 3000 

GGTAAGGACT TTTATATGTT TGCACCTCAT ACAQTTAAAG AAGAATATGG TGTGACATTA 3060 

GACGATATCG ATTTAGAAAA ATATTATGAT GACATGGTTG CAAACCCAAA TGTT6AGAAA 3120 

AAGAAAAAGA ATGCGCGTGA AATGTTGAAT TTAATTGCGC AAACACAATT ACAATCAGGT 3180 

TATCCATATT TAATGTTTAA AGATAATGCT AACAGAGTGC ATCCGAATTC AAACATTGGA 3240

CAAATTAAAA TGAGTAACTT ATGTACGGAA ATTTTCCAAC TACAAGAAAC TTCAATTATT 3300 

AATGACTATG GTATTGAAGA CGAAATTAAA CGTGATATTT CTTGTAACTT GGGCTCATTA 3360 

AATATTGTTA ATGTAATGGA AAGCGGAAAA TTCAGAGATT CAGTTCACTC TGGTATGGAC 3420 

GCATTAACTG TTGTGAGTGA TGTAGCAAAT ATTCAAAATG CACCAGGAGT TAGAAAAGCT 3480 

AACAGTGAAT TACATTCAGT TGGTCTTGGT GTGATGAATT TACACGGTTA CCTAGCAAAA 3540 

AATAAAATTG GTTATGAGTC AGAAGAAGCA AAAGATTTTG CAAATATCTT CTTTATGATG 3600 

ATGAATTTCT ACTCAATCGA ACGTTCAATG GAAATCGCTA AAGAGCGTGG TATCAAATAT 3660 

CAAGACTTTG AAAAGTCTGA TTATGCTAAT GGCAAATATT TCGAGTTCTA TACAACTCAA 3720 

GAATTTGAAC CTCAATTCGA AAAAGTACGT GAATTATTCG ATGGTATGGC TATTCCTACT 3780 

TCTGAGGATT GGAAGAAACT ACAACAAGAT GTTGAACAAT ATGGTTTATA TCATGCATAT 3840 

AGAtTAGCAA TTGCTCCAAC ACAAAGTATT TCTTATGTTC AAAATGCAAC AAGTTCTGTA 3900

ATGCCAATCG TTGACCAAAT TGAACGTCGT ACTTATGGTA ATGCGGAAAC ATTTTACCCT 3960 

ATGCCATTCT TATCACCACA AACAATGTGO TACTACAAAT CAGCATTCAA TACTGATCAG 4020 

ATQAAATTAA TCGATTTAAT TGCGACAATT CAAACGCATA TTGACCAAGG TATCTCAACG 4080

ATCCTTTATG TTAATTCTGA AATTTCTACA CGTGAGTTAG CAAGATTATA TGTATATGCX5 4140 

CACTATAAAG GATTAAAATC ACTTTACTAT ACTAGAAATA AATTATTAAG TGTAGAAGAA 4200 


TGTACAAGTT GTTCTATCTA ACAATTAAAT GTTGAAAATG ACAAACAGCT AATCATCTGG 4260 

TCTGAATTAG CAGATGATTA GACTGCTATG TCTGTATTTG TCAATTATTG AGTAACATTA 4320 
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ATGTTTTGGA GACAAAATAT ATCTCAAATG TGGGTTGAAA CAGAATTTAA AGTATCAAAA 4440 

GACATTGCAA GTTGGAAGAC TTTATCTGAA GCTGAACAAG ACACATTTAA AAAAGCATTA 4500


GCTGGTTTAA CAGGCTTAGA TACACATCAA GCAGATGATG GCATGCCTTT AGTTATGCTA 4560 

CATACGACTG ACTTAAGGAA AAAAGCAGTT TATTCATTTA TGGCGATGAT GGAGCAAATA 4620 

CACGCGAAAA GCTATTCACA TATTTTCACA ACACTATTAC CATCTAGTGA AaCAAACTAC 4680 


CTATTAGATG AATGGGTTTT AGAGGAACCC CATTTAAAAT ATAAATCTGA TAAAATTGTT 4740 

GCTAATTATC ACAAACTTTG GG6TAAAGAA GCTTCGATAT ACGACCAATA TATGGCCAGA 4800 

GTTACGAGTG TATTTTTAGA AACATTCTTA TTCTTCTCAG GTTTCTATTA TCCACTATAT 4860 

CTTGCTGGTC AAGGGAAAAT GACGACATCA GGTGAAATCA TTCGTAAAAT TCTTTTAGAT 4920 

GAATCTATTC ATGGTGTATT TACCGGTTTA GATGCACAGC ATTTACOAAA TOAACTATCT 4980 

GAAAGTGAGA AACAAAAAGC AGATCAAGAA ATGTATAAAT TGCTAAATGA CTTGTATTTA 5040

AATGAAGAGT CATACACAAA AATGTTATAC GATGATCTTG GAATCACTGA AGAT6TGCTA 5100 

AACTATGTTA AATATAATGG AAACAAAGCA CTTTCAAACT TAGGCTTTGa ACCTTATTTT 5160 

GAGGAACGTG AATTTAACCC AATCATT6AG AATGCCTTAG ATACAACAAC TAAAAACCAT 5220 

GACTTCTTCT CAGTAAAAGG TGATGGTTAT GTATTAGCAT TAAACGTAGA AGCATTACAA 5280 

GATGATGACT TTGTATTTGA CAACAAATAA CAATTAAATT AAAAGACCTT CACATGTAAA 5340 


GGGAAATAGC GATTCGTTTC GTCTTGTCTC CTACATGTTG AAGGTCTTTT TTTATGTGTA 5400 

TCTAACTCAT TATGAGTCTG AGTAAGAAAT CAATGCTCTA AGATGTACAA TGCTATTTAT 5460 

ATTGGCAGTA GTTGGCGGGG CCCCAACACA GAAGCAGGCG GAAAGTCAGC TAACAATATT 5520 


GTGCAAGTTG GCGGGGCCCC AACATAGAAG CAGGCGGAAA GTCAGCTAAC AATAATGTGC 5580 

AAGTTGGCGG GGCCCCAACA TAAAAGCAGG CGGAAAGTCA GCTAACAATA TTGTGCAAGT 5640

TCGGgCGGGG CCCCAACATA AAGAAAAACT TTTTCCTTTA GAAATTATCA CTTCCaCaTG 5700

AGTTTTACTC ATGTATTCCT ATTTTTAAGT ACACATTAGC TOAGGCTAAT GTTAAGAACC 5760 

ACTACTTAAT CAATCATTAG TAGTTTTTAT CATTTCCACT ATTCCCaGAC ATCaAAATCT 5820 

TAAGTGTTCT ATTTTACTTT AAGTAAACAA AATACACATT CCGAAAAATT AAATTTCAGT 5880

TTAATTGCAA ATATCAATAA AATTGACACT AAATTATTTG AAAGGCTATT GAAATTATGG 5940 

TCAAAAAACG CTACTATTAA TGAGAAATAT TATCAATGAT AATGATTATC ATTAATTTAA 6000 


AGGGAGAAAA ATTTGTAATG AAGTATTTAT TAAAGGGAAA TATTTTGCTT CTATTACTAA 6060 

TATTGTTGAC AATTATTTCG TTGTTCATAG GTGTGAGTGA ACTATCAATT AAAGATTTAC 6120 

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GTATTTTAAT TGCTGGAAGT TCGTTGGCTT TAGCAGGCTT GATAATGCAA CAAATGATGC 6240 

AAAATAAGTT TGTTAGTCCG ACTACAGCTG GAACGATGGA ATGGGCTAAA CTAGGTATTT 6300 

TAATTGCTTT ATTGTTCTTT CCAACCGGTC ATATTTTATT AAAACTAGTA TTTGCTGTTA 6360 

TTTGCAGTAT TTGCGGTACG TTTTTATTTG TTAAAATCAT TGATTTTATA AAAGTGAAAG 6420 

ATGTCATTTT TGTACCGCTT TTAGGAATTA TGATGGGTGG GATTGTTGCA AGTTcACAAC 6480

CTTCATCTCA TTGCGCACGA ATGCTGTTCA AAOCATTGGT AACTGGCTTA ACGGGAACTT 6540 

TGCCATTATC ACAAGTGGAC GCTATGAAAT TTTATATTTA AGTATTCCTC TTTTAGCATT 6600 

GACATATCTT TTTGCTAATC ATTTCACGAT TGTAGGAATG GGTAAAGACT TTACTAATAA 6660 

TTTAGGTTTG AGTTACGAAA AATTAATTAA CATCGCATTG TTTATTACTG CAACTATTAC 6720 

AGCATTCGTA GTGGTGACTG TTGGAACATT ACCGTTCTTA GGACTAGXAA TACCAAATAT 6780 

TATTTCAATT TATCGAGGTG ATCATTTGAA AAATGCTATC CCTCATACGA TGATGTTAGG 6840

TGCCATCTTT GTATTATTTT CTGATATAGT TGGCAGAATT GTTGTTTATC CATATGAAAT 6900 

AAATATTGGT TTAACAATAG GTGTATTTGG AACAATCATT TTCCTTATCT TGCTTATGAA 6960 

AGGTAGGAAA AATTATGCGC aACAATAATA AAAAAATAAT GCTTTTAATT GCAGTAACGT 7020 

TATTAATTAG TATGCTGTAC TTATTTGTAG 6TATTGATTT TGAAATATTT GAATATCAAT 7080 

TTTCAAGTCG TTTAAGAAAG TTCATATTAA TTATTTTAGT AGGT6CTGCC ATTGCAACTT 7140 

CAGTGGTGAT TTTTCAAGCX; ATTACAAATA ACCGTCTATT GACACCATCA ATAATGGGGT 7200 

TAGATGCAGT TTATTTATTT ATCAAAGTAT TGCCAGTCTT TTTATTTGGA ATTCAATCGG 7260 

TATGGGTTAC TAATGTATAT TTGAACTTTA TATTAACACT TATAACGATG GTGTTATTCG 7320 

CACTAATCCT ATTCCAAGGT ATCTTTAAAA TCGGACATTT TTCAATTTAT TTTATCTTAC 7380 

TTAJTGGTGT CCTTTTAGGA ACATTTTTTA GAAGCATAAC AGGTTTTATT CAACTGATTA 7440 

TGCEATCCTGA GTCATTTTTA GCAATACAAA GTAGTATGTT TGCTAATTTT AATGCTTCTA 7500

ATTCGAATTT AGTTACTTTC TCAGCAGTGC TATTAGTAAT CTTATTAGTC ATTACAATTT 7560 

TACTATTGCC TTATTTAGAT GTATTGCTTT TAGGTCGTGC TGAAGCAATT AATCTTGGGA 7620 

TATCGTATGA AAAATTAACG CGAATT 7646
(2) INFORMATION FOR SEQ ID NO: 122: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1194 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 

ATGAATATAT TTnnAAATAA ATTATTATGG ATTGCACCAA TnGCCACTAT GATTATCTTG 60

GTAATCTTTT CTTTAGCTTT TTATCCTGCA TATAATCCTA AACCAAAAGA TTTACCAATT 120 

GGTATATTAA ACGAGGATAA AGGTACAACG ATTCAAGATA AAAATGTTAA CATTGGTAAA 180

AAATTAGAGG ATAAATTATT AGATAGTGAT TCTAATAAAA TTAAATGGGT TAAGGTTGAT 240 

AGTGAAAAAG ACCTTGAAAA AGATTTGAAA 6ATCAAAAAA TCTTTGGAGT AGCTATTATT 300 

GATAAAGACT TTTCAAAAGA TGCTATGAGT AAAACACAAA AAGTAGTTAT GGATAGTAAA 360 

AAAGAAGAAA TGCAACAAAA AGTTGCTTCA GGTGAAATTC CGCCACAAGT GGTTCAACAA 420 

ATGAAACAAA AAATGGGGAA TCAACAAGTA GAGGTTAAGC AGGCTAAATT TAAAACGATT 480 

GTAAGTGAAG GATCAAGCTT ACAAGGTTCA CAAATT6CAT CAGCTGTGTT AACTGGTATG 540 

GGTGATAATA TTAATGCTCA AATTACGAAG CAAAGTTTGG AAACATTAAC GAGTCAAAAT 600

GTTAAAGTCA ATGCCGOGGA CATCAATGGT TTGACGAATC CAGTAAAAGT GGATAATGAA 660 

AAACTTAATA AAGTTAAAGA TCACCAAGCA GGTGGTAATG CACCATTCCT AATGTTTATG 720 

CCAATTTGGA TAGGTTCAAT CGTAACGTCT ATCTTATTGT TCnTGCATT TAGAACTAGT 780 

AACAATATCG TCGTGCAACA TCGTATCaTT GCtTCAATTG GACAGATGAT ATTTGCAGTT 840 

GTTGCAGCAT TTGCAGGTAG CTTTGTTTAT ATTTATTTCA TGCAAGGCGT TCAAAGATTT 900 

GATTTTGACC ATCCAAATCG TATCGCAATT TTTGTAGCAT TTGCGATTCT TGGTTTCGTG 960 

GGCCTTATTT TAGGTGTTAT GGTATGGCTA GGTATGAAGT CAGTTCCAAT TTTCTTCATT 1020 

TTAATGTTCT TTAGTATGCA ACTTGTAACG TTACCTAAAC AAATGTTGCC TGAAAGTTAT 1080

CAAAAATATG TATATGATTG GAATCCATTC ACACACTATG CAACAAGTGT AAGAGAcTAT 1140 

TATACTTGAA TCATCATATT GAATTAAATA GTACAATGTG GATGTTTATA GGGT 1194 
(2) INFORMATION FOR SEQ ID NO: 123:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 558 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123: 
GACCGACCTA TACATCCGTA TAAGTATTTC TTGATATAAG TCTTCTAAAT CATAATGATT 60 
AAATCCAAAT GTTTTGATGC GTCGAATAAT TAATGGTTGT AGATCCATTA CTAACITTTC 120 
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GTATTTCAAA TATTAAACTA ACCCCTTCTA TCTAAAATTT AAGGTTAGTT TAATATTGTT 240 

ACATTCAAAA TTTCAAGATG ACGGAAATGT CATTTCTTAT GATGTCCTCT TCGTATTTTT 300 

TCAAATTCTG CAAGGATTTC AGAAGATAAC GGAATTCGAG TTCTTGGCTT GTTTTCACTT 360 

ATATCATCTA ATGATTTACT CACATCAATT TCATTTTCTT TTAAATCTCT CCACATTTCG 420 

CGAGATGATA TTCTATATGC ACCTGATCCA AAGATAGCAT GTTGcTCACT CaTATCACTT 480 

GTTACAACTG TAATATGcTT AGtATGCTTG tCaTAAAGtT CaTAAACCAT AACGGTTCTA 540 

ATGGAAACCA ATCAGCTG 558 



(2) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7762 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 

GCTTCAGACA TnTGATGATA TAATCTCTCA TCATCGATTA ATTCTTTTGC AGCTTGATAC 60

ACATxiTTGCT TATTTGTTCC AATGACTTTT AATGTGCCA6 CTTCAACACC TTCAGGACGT 120 

TCTGTAACAC TTCGCCAAAA CTAAAACTGG CTTATTAAAT GATGGCGCTT CTTCCTGAAT 180 

TCCACCTGAA TCTGTCAAAA TAAAATAAGA TTTTnTAGCA AAATTATGGA AATCTATAOG 240

TCCAAAGGTT CAATCAATTC AATTCTGTCA TGACTACCTA AAATCTTTTG AGCCACCTCT 300 

CGAACTTTCG GGTTTTTATG CATTGGATAT ACCAGTGCTA AATCAGTATA CTCATCTATT 360 


AAGCGTCTAA CCGCTTTAAA TATATTTTCC aTGGGTTTCC CQATATTTTC TCGTCGQTGT 420 

GCTGTCATrA GAATGAATTT IcTtGTCATGG TATTTATCCA TGATGTTAGA TTTATAATTG 480 

TCATCAACTG TATATTTCAT AGCATCAATC GCAGTATTAC CAGTGACAAC AACACTTTCT 540 


GAATATTTCC CTTCACTTAA CAAATGCGAT GCAGCATTTT TAGTAGGTGC AAAATGTAAG 600 


TCAGCTAATA CACCAACTAA TTGTCTATTC ACCTCTTCTG GAAAAGGTGA ATATTTATCA 660 

TAACTTCTAA GCCCTGCTTC AACGTGTCCA ATCGGCACTT GOTTATAAAA TGCCGCTAAA 720

CCACCTGCAA ATGTCGTCAT CGTATCACCA TGTACAAGTA CCATGTCTGG TTTTTCTAAT 780 

TGAATCACTT GTTCTAATTG AGTQATTGAT TTAGAAGTTA TCTCAGAAAG TGTCTGTCCT 840 

GATTTCATAA TATTCAAATC GTATTTTGGT TTGATTTCAA AGGTACTTAA TACTGAATCA 900

AGCATTTCTC TATGCTGTGC TGTAACAACA ACAATTGGCT CGAGCATTTT TTCTTGTTCC 960 
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ATCTTTTTCA TCAAACTACT TATCTCCGAT 
CAGCGTCGCC TAACCCTGGT GTGATATATG 
AATATAAATA TCTACATCTG GATGTGCTTC

AATTAAACAC ATGAAGCGAA TATTTTTAGC 
AATTGCTGAT GCGCCTGTTG CTAACATAGG 
ATCTTGAGGT AACTTAGCAA AATACTCTAC 
ACCGATATGT CCAACTCTGG CTGCAGGTAC 
ACCAGCTCTT AAAATTGGAA CGATAGCTAA 


TTTAGTTACA GGCGTTTCAA TATCAACATC 
TGCCATCAAC ATACCAACTT CGTCTACAAG 
ATCTCTAATA TAGCTTAGTT TGTGTTGAAT 


CATAAAAATT ACTCCTATCT TTGTGTATGT 
TTTCGCAACG CGTTCTTTAG CTTGTTGTAA 
TAAACTGATG ATTTTTGCAA CTTCCTCAAA

AGCAGGTGTA CCTAAACGTA TACCACTCGT 
GGTATTTTTG TTACATGTGA TACCAACTGA 
AAGTCCTATA GACCCTTTTA CATCAACAGC 
AATTCTAAAT CCTTCATTAA TTAATGCTTC 
TTGTTGATAC GTTTTGAAAT TATTTTCTAA 


AATAACATGC TCAAGAGGTC CACCTTGAAT 
TTTMATTCT TCCTTACATA AAATCATACC 
TGTAGTTGTT ACAAAATCAG CATATTCTAC 


TCCTGCAATA TQTGCCATGT CTACCATTAA 
AAACTTTTTG AAGTCAATTG TTCTTGAATA 
CTTATGCTCT AACGCTAATT TACGAACTTC 
TACTCCATAT TCAACGAAAT TGTAGAATTT 
CAAGTGACCA CCATGACTCA AATTCATACC 
TAAGTAAACA GCCATGTTCG CTTGTGAACC
TCCAAACAAT GCTTTAGCAC GATCAATTGC 




TCTTCTATTT AGTACCAAAC AATCTATCTC 1080 

CTTTGTCATT aGCTTTTCAT CAAGTGCAGC 1140 

ATGCATCTTT TCTACGCCTT CTGGTGCTGC 1200 

GCCACGTTTC TTCAATGAAG TAATAGCTTC 1260 

ATCAACAACA ATGATTTGTC TTTCAGTAAT 1320 

AGCCTTTAAT GTTTCGGGAT CTCGATATAA 1380 

TAAACTTAAA ATACCATCAG TCATACCTAA 1440 

TTTTTTACCA GCTAATCGTT TAGCCGTCAT 1500 

CTGAAGCTCT AAGTCTCTAG TTACTTCATA 1560 

TTCTCTAAAT TCTTTAGTAC CTGTATTTAC 1620 

TAATGGATGA TCGAAAACGT GTACTTTACT 1680 

TTATTGATAT AGAGGATATT CAGCTGTTAA 1740 

TTTTTCTTCA TCTTTACTAT TTTTCAATGC 1800 

AGCTTTTTCA TCAAATCCAC GCGTTGTTGC 1860 

TACAAAAGGT TTTTCTTGAT CGAACGGAAT 1920 

ATCTAAAGTC TCTTCAGCTT CTTTACCAGT 1980 

TACTAAGTGA TTATCTGTAC CGCCAGAAAC 2040 

TGCAAGAACT TTTGCGTTTT TAACCACTTG 2100 

CGCTTCTCCA AAAGCAACTG CTTTtGCTgC 2160 

ACCAGGGAAA ATTGTTTTAT CTATGTCTTT 2220 

ACCACGtGGT CCGcGTAATG TTTTGTGTGT 2280 

TGGATTTGGA TGTAAACCTG CCXSCTACTAA 2340 

CTTAGCGTTT ACTTCATCTG CGATTTCTTT 2400 

TGCTGATGCT CCTGCCACAA TAAGCTTAGG 2460 

ATCATAATTG ATTCGTTCTG TGTCTTTATC 2520 

ACCACTAAAA TTAACAGGCG CTCCATGTGT 2580

TAAAACTGTG TCGCCCATTT CTAATGCAAC 2640 

TGAATGTGGT TGAACATTGA CATGTTCAGC 2700 

GATGCTTTCA GTAACATCTA CAAACTCACA 2760 
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GCACTTGGAG CAGCTAGAGG TTCATTTATG ATTTGTAATA ATTGTCTACC TACAGAATGG 4680 

CTTGGCATTC TAACAGCAAC TGATGATAAA CCTCCAGAAA CTTTTCGACA TAGATAGCCT 4740 

AGCTTTAACG GCAATATAAA CGAAATAGGG CCCGGCCAGA ATGCCTGCAT TAACTTTTCT 4800

ACGCGTGGAT CCAAAGTATA TGTAAAATCT TTTAATTGAC CTTTACTGTG TATATGAACA 4860 

ATAAGCGGAT TGTCAGATGG ACGGCCTTTA GCTTCATATA TTTTAGCTAC AGCTTCTTCA 4920 

TCTGTCGCAT TTGCTGCAAG TCCATAAACT GTTTCAGTTG GTAAACCTAT TAAACCACCG 4980

TTTAAAACAA TGTCTTTTAT TTCATTAATT TTAGGATATT GCTGTAAATC TTCATTATAT 5040 

TCTCTAACAT CCCAAATTTT AGTATCCAAC TTAATCACGC CTTTCTTATT TATCATAATA 5100 


TAAAGCAAAA AGCTATGCAC TTAACTAATC ATAGCAAAGG CATAACTTCT AATTACCATT 5160 

TAAATGAGAC GATTCGATCG TGGCCATTTA TATCTTTAAT AATGTCGATT TTTTTGTCAG 5220 

GAAATTTATT TAAAATTATT GATTTAAGTG CCTCACCTTG ATTGTAACCA ATTTCAAAAA 5280 


CAACTGGGCT GCCTTTTTCC ATAACX3TGAG GTAAATCTTC AATGATTGAT TCATAAATAG 5340 

CATATCCATG GTTATCTGCA AACAATGCCT GATGTGGTTC GAATCTCGTA ACCGTTGGAG 5400 

ACATCGTAAC CATATCTTTT TCATCTATAT ATGGTGGATT AGATATCAAG CCGTTCAACT 5460

TGATACCTTC ATTAATTAAG GGCTTTAATG CATCCCCTGT TAAAAATTGT ATTTGTGATT 5520 

GATGCTTCTC AGCATTATTA CGAGCCATAT TCATTGCTTC AAGTGAAATA TCAGTAGCAA 5580 

TAACATTTAA ATCCGGCTTT TCACATTTCA AAGTAATTGC AAGTACACCA CTACCCGTTC 5640

CGATATCTAC GATTGTTGCA TCATCTTCTA ACTGTTGTAA 6AAATGCAAC ATTACTTCTT 5700 

CAGTTTCAGG TCTTGGTATC AAACAATTTG AGTTTACATC AAACGTTCTA CCATAAAATG 5760 

AGGCAAAGCC AACTATATAC TGTATAGGCT CTCCTAATAA CATACGTTGT AATGCTAAGT 5820

CCAACTTCAT AATCATCGCT TTCX3GCATAT CATCATGCAT GTGGACTACA AAGTCCGTAC 5880 

GCGTCCATTG AAATACATCT AACATTAACC ATTCAGCTCG TGTTTGTTCA AACCCTTTTT 5940 


GTTGTGTTAA ATGAATTGCT TCATCTAACT TTTCTTTATA ATTCACCATT ATTAAGTTCT 6000 

TTCAATTTAT CTGTCTGCTC TGATAAAGTC AGTGCATCTA TAATTTCTTC TAAATGGCCT 6060 

TCCATAATTT GCCCTAATTT TTGAAGCGTT AGACCTATAC GATGGTCTGT TACACGGCTT 6120 


TGTGGATAAT TATAAGTTCG AATACGTTCT GAACGATCAC CAGTACCGAC TGCTGATTTA 6180 

CGTTGTGACG CATACTTTTG TTGTTCTTCT TGAACTTTCA TATCGTATAA ACGTGCTTTT 6240 

AACACTTTCA TTGCTTTTTC ACGGTTTTGA ATTTGAGACT TCTcAGAAGA TGTTGCAATG 6300

ACACCAGTTG GTAAATGGGT AATACGTACT GCAGAGTCAG TTGTGTTTAC GTGCTGACCA 6360 

ACATCTTCAA CTTCTGGTAA AACTGCCACT GTAGCTGTTG AAGTATGAAT ACGTCCACCT 6480

GATTCTGTTT CAGGCACACG TTGAACGCGG TGOGCACCAT TTTCAAATTT CAATTTACTA 6540

TACGCGCCAT TACCAGAAAC TGAGAAACTA ATTTCTTTGT AACCACCATG GTCACTTTCA 6600

GACGCTTCTA CTATTTCAGT TTTGAATCCT TGTGATTCAG CATACTTTGA ATACATACGC 6660

ATTAAATCAC CAGCAAAAAT CGCAGCCTCA TCACCACCTG CTGCTGCTCT TATTTCTACA 6720

ATAACGTCTT TGTCATCATT AGGATCTTTA GGAATCAATA ATATTTTAAG CTCTTCTTCA 6780

AGATTTGGAA GTTCAGCTTT AATACCATTA CTCTCCTCTT TTAACATTTC TACTTCTTCT 6840

TTATCATCAG TCrCACTTAA CATTTCTTCA ATATCAGCTA ATTCTTCTTT TTTAGCTTTA 6900

TAGTTACGAT AAACATCTAC AGTTTTTTGT AAATCAGCTT GCTCTTTAGA ATATTTACGT 6960

AATTTATCTG AATCATTTAC AACATCTGGG TCACTTAACA GTTCATTTAA CTGTTCOTAT 7020

CTTTCTTCTA CAATATCTAA TTGATCAAAC ACTTATAATT CCTCCTTATT ATTATCACTA 7080

GGTGCTACX3A TATGGTGCGC GCGACAAC6T GGCTCATAAC TTTCATTGGC ACCTACTAAG 7140

ATAATCGGAT CATCGATTTT AGCTGGTTTA CCATTTATTA ATCGTTGCGT TCTACTAGAT 7200

GAAGAACCAC AAACAGCACA AACTGCTTGA AGTTTCGTTA CTTGTTCACT GACAGCCATC 7260

AATTTAGGCA TTGGTTCGAA CGGTTCGCCC CTAAAATCCA TATCTAATCC AGCAACAATA 7320

ACACGGTGTC CATCTGCTGA TAGTTTTTCT ACTATACTTA CAATTTCATC GTCAAAAAAT 7380

TGCAcTTCGT CTATTCCTAT AACATCAACA TTAGTTAAGT CGTGCGTCAT AATTTCACTT 7440

GCTTTAGAAA TATTAATCGC TTCAATGGCA TTACCATTAT GAGAGACCAC TTTTTCTTTA 7500

TGATATCGAT CATCAATCGC CGGTTTAAAT ACAACGACTT TTTGTTTAGC GTATATACCC 7560

CTTCTTAGAC GTCTTATTAG TTCTTCGGAT TTACCGCTAA ACATACTACC TGTAATACAT 7620

TCTATCCAAC CGGAATGGTA AGTTTCATAC ATTGAGAGTn CCACCTTTTT CAAAACATAA 7680

TCGCTTTATT ATATCATATT TCAAATATTC ATAAATGTCT TTnTCATAAT TATATCGATA 7740

TTGTACATGA ACAATTATTT TA 7762



(2) INFORMATION FOR SEQ ID NO: 125: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2583 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125:
TAAAAAAATT ATTATCAATG ATGAACTAGA ATTGACTGAA TTCCACCAAG AACTTACTTA 120

TATTTTAGAC AACATAnAAG GGAATAATAA TTATGGTAAG GAATTTGTTG CAACCGTTGA 180 

AGAAACATTC GACATTGAAT AaAGCX;GGGT GgaAGCACTA TGAATCAATG GGATCAGTTC 240

TTAACACCTT ATAAGCAAGC GGTTGATGAG TTGAAAGkGA AcTTaAAGGC ATGCGCAAAC 300 

AATATGAAGT TGGTGAACAA GCGTCGCCAA TAGAATTTGT TACTGGTCGT GTTAAACCAA 360 

TCGCTAGTAT TATAGATAAG GCAAACAAAC GACAAATACC ATTTGATAGG TTAAGAGAAG 420 

AAATGTACGA TATCGCTGGT TTAAGAATGA TGTGCCAATT TGTTGAAGAT ATTGATGTTG 480 

TCGTCAATAT TTTAAGACAA AGAmAAGATT TTAAAGTAAT TGAAGAACGA GATTATATTC 540 


GTAACACTAA AGAAAGTGGT TACCGCTCGT ATCATGTCAT TATTGAATAT CCAATTGAAA 600 

CATTACAAGG CCAAAAATTT ATATTGGCTG AGATTCAGAT TCGTACATTA GCAATGAATT 660 

TCTGGGCAAC GATTGAACAT ACTTTACGAT ATAAATATGA TGGTGCTTAT CCGGATGAAA 720 


TTCAACATCG TTTGGAAAGA GCGGCAGAAG CAGCGTATTT ACTTCATGAA GAGATGTCTG 780 

AAATTAAAGA TGAAATTCAG GAAGCTCAAA AATATTACAC GCAAAAACGT TCTAAAAAAC 840 

ATGAAAATGA TTAACGAGGT GTTATAAATC ATGCGTTATA CAATTTTAAC TAAAGGTGAC 900

TCCAAGTCTA ATGCCTTAAA GCATAAAATG ATGAACTATA TGAAAGrTTT TcGCATGaTT 960 

GaGGATrGTG AAAaTCCTGA AATTGTTATT yCAGTTGGTG GTGACGGTAC ATTACTACAA 1020 

GCATTCCATC AGTATAGCCA CATGTTATCA AAAGTGGCAT TTGTTGGAGT TCATACAGGT 1080

CATTTAGGAT TTTATGCGGA TTGGTTACCT CATGAAGTTG AAAAATTAAT CATCGAAATT 1140 

AATAATTCAG AGTTTCAGGT CATTGAATAT CCATTGCTTG AAATTATTAT GAGATACAAC 1200 

GACAACGGCT ATGAAACAAG GTATTTAGCA TTAAATGAAG CAACGATGAA AACTGAAAAT 1260 

GGCfCAACAC TTGTTGTGGA TGTTAACTTA AGAGGGAAAC ACTTTGAGCG ATTTAGAGGC 1320 

6ATGGATTAT GTGTATCAAC ACCTTCGGGT TCAACGGCTT ATAACAAAGC GCTAGGTGGC 1380 


GCACTGATAC ATCCTTCACT TGAAGCAATG CAAATTACAG AAATTGCCTC GATAAATAAT 1440 

CGTGTGTTTA GAACGGTAGG ATCACCACTT GTATTACCAA AGCATCATAC ATGTTTAATA 1500

TCACCAGTTA ATCATGATAC CATTAGAATG ACGATAGATC ATGTTAGTAT CAAACATAAA 1560 


AATGTTAATT CAATACAATA CCGTGTAGCA AATGAAAAAG TGAGGTTTGC ACGTTTTAGA 1620 

CCATTCCCAT TCTGGAAACG TGTGCACGAT TCTTTCATAT CAAGTGATGA AGAACGATGA 1680 

AATTTAAGTA TCATATATCA CAACAAGAAA CTGTTAAAAC TTTTTTAGCA CGACATGATT 1740

TTTCTAAGAA GACAGTGAGC GCCATTAAAA ATAATGGCGC TTTAATTGTT AATGATGAAC 1800 
AAATACCGAG TGTTAATTTA ATACCTTATG CTCGTAAGCT AGAAGTATTG TATGAAC3ATG 1920

CTTTTATCAT CATAGTTACT AAACCAAACA ATCAAAATTG TACGCCTTCXJ AGAGAACATC 1980 

CTCATGAAAG TTTAATCGAA CAAGTACTAT ATCATTGTCA GGAACATGGT GAAAATATTA 2040 

ACCCACATAT TGTTACGCGT CTAGATCGTA ATACAACTGG TATTGTGATA TTCGCTAAAT 2100 

ATGGACATAT CCATCATTTA TTTTCTAAAG TAAACTTGAA AAAAATATAT ACTTGCCTTG 2160 

TATATGGTAA AACCCATACA TCTGGTATTA TTGAAGCTAA TATTAGACGG TCAAAGGATA 2220 

GGATTATAAC TAGAGAAGTT GCCTCGGATG GTAAATACX3C TAAAACATCT TATGAAGTAA 2280 

TAAATCAGAA TGATAAATAC AGTTTATGCA AAGTTCATTT GCATACGGQA CGTACACATC 2340 

AAATTCGTGT ACATTTTCAA CATATTGGGC ATCCAATTGT GGGAGATTCT TTGTATGATG 2400 

GTTTTCATGA CAAAATTCAT GGTCAAGTAC TGCAATGTAC GCAAATATAT TTTGTTCATC 2460 

CAATCAATAA GAACAATATT TATATTACAA TTGATTATAA GCAATTACTT AAATTATnCA 2520 

ATCAACTCTA ATnCACACAG GGGGTGTAAG TATGTCAATG AnCACAGATG AAAAAGAGCG 2580 

TGT 2583 
(2) INFORMATION FOR SEQ ID NO: 126:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1818 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126:

ATCAAGTGAT ACATTTAACT GGTAAAGGAT TAAnAGATGC TCAAGTTAAA AAATCnGGAT 60 

ATATACAATA TGAATTTGTT AAAGAGGATT TnACAGATTT ATTnGCAATT ACGGATACAG 120 

TAATAAGTAG AGCTGGATCA AATGCGATTT ATGAGTTCTT AACATTACGT ATACCAATGT 180 

TATTAGTACC ATTAGGTTTA GATCAATCCC GAGGCGACCA AATTGACAAT GCAAATCATT 240

TTGCTGATAA AGGATATGCT AAAGCGATTG ATGAAGAACA ATTAACAGCA CAAATTTTAT 300 

TACAAGAACT AAATGAAATG GAACAGGAAA GAACTCGAAT TATCAATAAT ATGAAATCGT 360 

ATGAACAAAG TTATACGAAA GAAGCTTTAT TTGATAAGAT GATTAAAGAC GCATTGAATT 420

AATGGGGGGT AATGCTTTAT GAGTCAATGG AAACGTATCT CTTTGCTCAT CGTTTTTACA 480 

TTGGTTTTTG GAATTATCGC GTTTTTCCAC GAATCAAGAC TTGGGAAATG GATTGATAAT 540

GAAGTTTATG AGTTTGTATA TTCATCAGAG AGCTTTATTA CGACATCTAT CATGCTTGGO 600 



CTCATGTTAA AGCGCCACAA AATTGAAGCA TTATTTTTTG CATTAACAAT GGCATTATCT 720 

GGAATTTTGA ATCCAGCATT AAAAAATATA TTCGATAGAG AAAGACCTAC ATTGCTGCGT 780 

TTAATTGATA TAACAGGATT TAGTTTTCCT AGCGGTCATG CTATGGGATC AACTGCATAT 840 

TTTGGAAGTG GTATCTATCT ATTAAATCGA TTAAATCAAG GTAATTCAAA AGGTATTCTT 900

ATAGGGTTAT GTGCAGCTAT GATTTTATTG ATTTCCATAT CACGTGTATA TCTAGGTGTA 960 

CATTATCCAA CAGATATTAT TGCCGGCATT ATTGGTGGAT TATTTTGcAT TATTTTATCA 1020 

ACGTTATTAC TTAGAAATAA ATTAATAAAT TAAATAGTAA AAAAACAAAA GCAGTAAACC 1080 

TAAAGTGTCG TAAGGGTTTA CTGCTTTTAT AAAACX3TTGT TATAACGTAT ATTGTCTTTT 1140

ACGGGCATAT AAnAGGGGAA TATTTGAnAA TGACCAATCC AACAAGAACG AAACGTTGTG 1200 

GGGGGGATGT TCTATGTGGT ATTGATAATC ATTTTCAACT ACTATTATAC ATTAGTGAGA 1260 

ATCATTGTCA ATTAGAAACT AAAACTTTTT TTGAATATTT TTTAAGAATA GTAAATAAAA 1320 

CXJCATGATTA CGCTATTTTA GAAAATAAAA AAATTTGTAT TTCTCATTAG AATTAGAATA 1380 

TTTAAAAGTG ATGAGGTTTA AACATTATAT TGTTTACATA CTCCTTTTGA ATTCATACAT 1440 

TATGAAATGT tACTTCCAAG TTCAAAATCG CACATTGAAA TGATGTGTGA AATGTTTAAA 1500

CTACGGTCAT tTTGTGmAAA TAAAGrTAAT AACTATTCAT TTTACAATAG TGAAAAGTCA 1560 

GTATATGACA ACAATTAATA TTGOGGTAAG GCCTTGTGTT ACAGTATTCT ATATTTAAGT 1620 

ACTGCAATCA GAATTAACAG AATGCCATTA ACTGATTATT AAATATTTGA GTTAATAAAT 1680 

AATTAATGAT TGTAGCTTGA AAAATTTAAA ACATGGTTAT TGATTTGTGA TAAAATTTAA 1740 

ACGTAAACAA ACTAATTTAA AAAGCAACTA TTGTATAGAA AAATACAAAA TTTAAAATAT 1800 


ATTACCTTAT TAGAAAAA 1818 

(2) INFORMATION FOR SEQ ID NO: 127:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 12658 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127:

TGTTTAAACA ATAGGGGGAA TCTTATGATT GAAAAATTAG TAACCTTTTT AAATGAGGTT 60 

GTTTGGAGTA AGCCATTAGT TTATGGTTTG CTAATTACTG GTGTGCTATT TACATTGCGT 120

ATgCGATTTT TTCAAGTTAG ACATTTTAAA GAAATGATTC GATTAATGTT TCAAGGAGAG 180 
GGTACAGGTA ATATTGTCGG TGTATCTACT GCAATATTTA TAGGAGGACC TGGTGCAGTA 300

TTTTGGATGT GGATTACTGC GTTTTTAGGT GCAAGTAGTG CTTTTATTGA ATCTACACTT 360

GGTCAAATAT TCAAGAGAGT TGAAAATAAT GAATACCGTG GTGGACCAGC GTATTATATT 420

GAATATGGTA TTGGTGGTAA ATTTGGTAAA ATTTACGGAA TTATCTTTGC TATTGTTACG 480

ATTATCTCAG TAGGTCTATT GCTTCCTGGT GTGCAATCTA ACGCTATAGC AAGTTCTATG 540

CATAATGCGA TTCATGTTCC ACAATGGTTA ATGGGTGGTA TTGTTGTAGT TATTTTGGGA 600

TTAATTATTT TTGGTGGTGT ACGTATTATT GCCAATGTTG CAACAGCCGT TGTACCATTT 660

ATGGCAATTA TTTACATACT GATGGCTGTC ATTATCATTT GTATCAATAT ACAAGAAGTG 720

CCAGCGTTAT TTGCATTAAT TTTCAAATCA GCATTTGGAT TACAATCT6C TTTTGGTGGT 780

ATOGTTGGCG CAATGATAGA GATTGGTGTT AAACGTGGAT TATATTCAAA TGAGGCTGGT 840

CAAGGTACAG GTCCACACX3C AGGAGCGGCa gcAGaAGTAT CACATCCAAG TAAACAAGGT 900

CTAGTACAAG CATTTTCAGT TTATATTGAT ACATTATTTG TATGTACTGC AACTGCTCTG 960

ATTATACTTA TTTCTGGTAC ATATAATGTG ACTGATGGTA CGGTTAATGC GAATGGCACA 1020

CCGCATTTAA TTAAAGATGG CGGTATTTAT GTTgAAAATG CAACAGQTAA AGATTATTCA 1080

GGTACTGCGA TGTATGCACA AGCCGGCATt GATAAAGCGT TCCATGGCAG TGGTTATCAA 1140

TTTGATCCTA CTTTCTCTGG CGTAGgTTCG TACTTTATTG cATTTGCTTT ATTCTTCTTT 1200

GCATTTACTA CAATTTTGTC GTACTACTAC ATTACAGAAA CAAATGTTGC TTATTTAACG 1260

CGTAATCAAA ATAATCAAGT TTCATCGATA TTTATTAATA TTGCTCGTGT GATTATTTTG 1320

TTCGCTACAT TTTACGGTGC AGTTAAAACA GCTGATGTAG CATGGGCATT CX3GTGATTTA 1380

QGTGTAGGTC TAATGGCTTG GTTAAATATC ATTGCGATTT GGATTTTACA TAAGCCTGCC 1440

GTAAATGCTT TAAAAGATTA TGAAATTCAA AAGAAACGTT TAGGCAACGG TTATAATGCA 1500

GTTTATCAAC CTGATCCGAA TAAATTACCT AATGCTGTCT TTTGGTTGAA GACGTATCCA 1560

GAACGTTTAA AACAAGCACG TGCCAAAAAG TAATCTACTT TTGTTTATAG TATATGTAGT 1620

GATCATTTGA TAAAAAAGAA AAGTATTGAG AATTTTAGGt GCTCAGAAAT TTGAATTTTA 1680

AAAATATAGT GTCTCTTGGT ACAATAACAA TACAACTACT AGGGGCACTT TTTTATGTCA 1740

GAATTTAAAA CTGGTAAGAT TAATAAACAT GTTTTATATA GTAATATTTT AAATAGAGAT 1800

GTCACGTTAA GTATTTATTT ACCAGAATCT TATAATCAAC TTGTTAAATA TAATGTCATT 1860

CTTTGCTTTG ACX3GATTAGA TTTTTTACGT TTCGGGAGAA TACAACGTAC ATATGAATCG 1920

TTAATCAAAG AAGCGCGTAT TGATGATGCG ATCATrGTTG GATTCCATTA TGAAGACGTT 1980
GTCGGTAAAG AAATATTGCC ATTTATTGAC TCGACGTTTT CTACACTGAA AGTAGGTAAT 2100

GCAAGGTTAT TAGTAGGGGA TAGTTTAGCO GGTAGTATTG CCTTATTAAC GGCGTTGACC 2160 

TATCCAACGA TTTTTAGTCG TGTAGCAATG TTAAGTCCAC ATTCAGATGA AAAAGTATTA 2220

GATAAGCTAA ATCAATGTGC AAATAAAGAA CAATTGACAA TTTGGCATGT CATTGGTCTA 2280 

GATGAAAAAG ATTTTACTTT ACCAACAAAT GGTAAGCGTG CCGATTTCTT AACACCGAAT 2340 


AGAGAATTAG CTGAACAAAT TAAGAAATAT AATATAACTT ATTATTACGA TGAATTTGAT 2400 

GGTGGTCACC AATGGAAAGA TTGGAAACCA TTGCTGTCAG ATATATTATT GTATTTTTTA 2460 

AGTAAAAACA CAGATGATCA ACTTTATGAA TAATTTACAT TAGTAGATTT AGTATGAATT 2520 


GTCTTCATAT AGTCTGGTCT ATAATATAAT TTATAAAAGA TTTTACTGTT TAATTTAATT 2580 

TAAATTTGAC GAAATTGCAA AAGATGTATA ATGAATTATT TTTAATGTAA CGGTTTTCAA 2640 

AGAAATTTGA TATAATAGCA ATAGGTTAAA CAAAGGAGGA ATTCAGATGA TTTTAGGATT 2700 


AGCATTAATT CCATCAAAGT CATTTCAAGA AGCGGTGGAT TCTTACCGTA AAAGATATGA 2760 

TAAACAGTAT TCACGAATTA AACCACATGT GACAATTAAA GCGCCATTTG AAATTAAAGA 2820 

TGGTGATTTA GATTCTCTCA TTGAACAGGT TAGAGCTCGT ATTAATGGTA TACCAGCAGT 2880

AGAAGTTCAT GCTACAAAAG CTTCTAGCTT CAAACCAACG AACAAT6TGA TTTACTTTAA 2940 

AGTTGCGAAG ACGGACGACT TAGAAGAATT 6TTTAATCGC TTTAATGGAG AAGATTTCTA 3000 

TGGAGAAGCT GAACATGTTT TTGTGCCACA CTTTACAATA GCACAAGGAC TATCTAGCCA 3060

AGAATTCGAA GATATTTTTG GTCaAGTAGC ATTAGCTGGG GTAGACCaTA AAGAAATTAT 3120 

CGATGAATTA ACTTTG1TAC GTTTTGACGA TGACGAAGAT AAATGGAAAG TTATTGAAAC 3180 

GTTTAAATTA GCTTAAGTAA CATAATAQTA TTGTTAATCO TAGTATGTTT GAATTAATAA 3240

GAAAATGGTC ATTTTTATTG AATGTAATAA AAATGACCAT TTTCTTTATT TTAAAATACG 3300 

TTTTAACCTT ACTTAGCTTT TTCTCTATTT ACTATAAAGT rGCTTCCATA AAATACAGCT 3360 


AAGACTAAAA AGATTAATGC CX3AGAAATAA AATGTATTGT TTAAATTGTT GGTAAATTGT 3420 

GTAATTAATC CGCCAAATAA TGGCCCTATC ATTGAGCCGA ATCCTTGGAT ACTATTAAAA 3480 

ACACCCCAAG TTTCTTCTTG TTCATCTGAT TTGATAAATC GTGCCATAAA GGTATTCCAT 3540 


GCTGGTAATA AGATGCCATA CATTAGACCX5 ATAGCTAAAG CGATAATCCA CAAGATGTGA 3600 

ATATTAACAA TCATAGATAG AGTAAAAATT AATATCATGT ATAAAATAAA TCCGCTTAGA 3660 

ATAACACCAT ACATAAAGTT TCTGCTGCGG TTATCTATTA GTTTCGATAA AAATAGCATC 3720


GAAACTGCAC AGCCGATACC ACCAATAATG ATTGCAACAG TATAITCAAT TGTGCTTACG 3780 
TGTAAAAGAA TACCAGGGAA CaACAATAAA TGGcGCTTTG TCACATCAAC AATTTGTCTC 3900

AATTGAGCTT TAACTGGACG AGTATTATAA TTTGTTAACT TTACATCGAC AAAATAATAT 3960

AATATCCATG CAATTAAAAC GACTAAAGAC ATCATGAAGG CAAAGCGTGT TGGGTGCACT 4020

TTGATAAGTA GATTCATAAA AACCATACCT ACCAATAGGC CTAACAACCA TGAAAAATAA 4080

ACATAGCCCA TTTGTTTGCC ACGTTTATCT TCTTCAACAC TGGATAACAT AATGACCCAA 4140

ATAGGACTAA CTGCAATACC GAGCATCATA GCACTAAATA TGATTACAAA AGGTGATGCT 4200

GGAAACCAAA TAACTAAAAA TAAACTTGTA AATGCTAAAA TAAATCCAGT CGTTAAAACG 4260

ATTTTTGTGC CGAATTTTTT CAGTAAAAAT CCTATAACAA AGTTTGTAGA TGCATCAGCA 4320

ATAAAATGTA TTGAAAATGC TAGAGACGTT ATTGCTACAG CAATGGATGT AACTGTTGGC 4380

AAGAAATTAA TATAGCTTAG GATATACATG CCTCTCGCAA ATTCCATTAA AAATAAGATA 4440

ATAAGCaTTA AAATGAAATT TTTATGATTA GCGTAATTAT TTAACGAAGA ATCTTGCATA 4500

TAAAGGAACC TTTCCATAAA TCTCTTGTGG TTGTGATGAA TGACCGATTA AATCAAGTAA 4560

GTCTCGACAT ATTGTCTGTG TAGCATACTT AATTTTATCT TGTTCCATTG TACTAATCAT 4620

GTTAGTTAAT TGCTCATTAC CX5TTAGTTAA ACTTGCTACA ATTTTTATTG CTTCTTCTGG 4680

AGTATCAGCG ATTTTACCAA AACCTTTTTC TTCAAAGTAA AGGGCATTTT CAAGCTCTTG 4740

ACCAGGTGCA GGATTTAGGA AAATCATTGG AATACAACGG GCGAAACCTT CAGTTATTGT 4800

GATACCACCA GGTTTCGTAA TCATAAGTTG ACTTGATGCC ATCCATTCAT TCATGTGTTT 4860

GGTATAACCT AGAATCAATA CATTCTCGTT AGATTTAAAC TTAGCTGTTA AAGAACGCTT 4920

TAGCTCTTTG CTCTTACCAC AAATCATAAC TACTTGTGCA TTTGCaCTTT tCXJCTAATAT 4980

ATCAGTAATC ATCGTGTCAA AACCTTTAGA TACACCAAAT 6CACCAGCTG aCATTAAAAT 5040

AGTTTGCTTA TCTGGATCTA AGTTGTTGTC TATTAACCAC TGCTTTTGAT TAATAGGCGT 5100

TTCAAATTTG TTATCAATAG GAATACCTGT CaCTTTAACT GTTGAAGGAT CAATACCTAC 5160

GTCTATGAAG TCTTGTTTCG TTTCTTTTGT TGCCACATAA TATCTTGTTG AATACGGCGT 5220

AATCCAGTTT TTATGTAAGC GATAGTCTGT CATCACTGTA GCAACTGGAA TATTAATGTT 5280

AAATTGCTCA GTTAGTACCG ACATAACTGG TGTAGGAAAC GTTAATAATA TTAAATCTGG 5340

CTTTTCTTTT ATCAATAAAT TAATTAACTT ATTAAGTCCA TAGTATTTGT AAAAACATTT 5400

GTCTAGTTTA TCTGGGCGGC TGTAATAAAA CCCTTTGTAC ATATTTCTAA AATATTTAAA 5460

GCTATTGATA TACCATTTTT TACAAATAGA AGTCAAAATT GGATGAGCTT CCATAAATAA 5520

ATCXfTGCTCA ATGACGCTTA AATGGTCTAG ATTCATATCA TTAAGTTGAT TAACGATACT 5580
TTGAGTAACC ATTAATAGCC ACCCTCCGTT AGTTTGAAAA TTTTATTTAA GTGTAACTTA 5700

TTTTACGGCA TTATAAAAGA AATAAAGACG CAAAGTCGTT ACATTTATAG CAATTTTAAT 5760

CTATAGATGA ATTGATACAA AATAAAACGT TATTTTATAA AGCAATTTAT TGTTCTATGT 5820

TTTATTTGTA TATTTAAAAT TATCCAGTAT ACAATTATAG CATATTTTTG GAAACAATTA 5880

TGATATTATA CCATGTTACA AGATGGTTTT AATAATTTAA GATGAGCCAT AATTGTAAAA 5940

CTAATTCATA ATACCGTATG TTTTATTTTT AATAGTAGAA ATTAGAAAAT GCTGATTAGT 6000

AGGATATAAC AGTGAAATTA TAAATTTATT AACATCAACA AAACGTGTAT AATAAACATA 6060

TTGTAGAAAA AGGAGCGGTT CAGTTTGGAT GCAAGTACGT TGTTTAAGAA AGTAAAAGTA 6120

AAGCGTGTAT TGGGTTCTTT AGAACAACAA ATAGATGATA TCACTACTGA TTCACGTACA 6180

GCGAGAGAAG GTAGCATTTT TGTCGCTTCA GTTGGATATA CTGTAGACAG TCATAAGTTC 6240

TGTCAAAATG TAGCTGATCA AGGGTGTAAG TTGGTAGTGG TCAATAAAGA ACAATCATTA 6300

CCAGCTAACG TAACACAAGT GGTTGTGCCG GACACATTAA GAGTA6CTAG TATTCTAGCA 6360

CACACATTAT ATGATTATCC GAGTCATCAG TTAGTGACAT TTGGTGTAaC GGGTACAAAT 6420

GGTAAAACTT CTATTGCGAC GATGATTCAT TTAATTCAAA GAAAGTTACA AAAAAATAGT 6480

GCATATTTAG GAACTAATGG TTTCCAAATT AATGAAACAA AGACAAAAGG TGCAAATACG 6540

ACACCAGAAA CAGTTTCTTT AACTAAGAAA ATTAAAGAAG CAGTTGATGC AGGCGCTGAA 6600

TCTATGACAT TAGAAGTATC AAGCCATGGC TTAGTATTAG GACGACTGCG AGGCGTTGAA 6660

TTTGACGTTG CAATATTTTC AAATTTAACA CAAGACCATT TAGATTTTCA TGGCACAATG 6720

GAAGCATACG GACACGCGAA GTCTTTATTG TTTAGTCAAT TAGGTGAAGA TTTGTCGAAA 6780

GAAAAGTATG TCGTGTTAAA CAATGACGAT TCATTTTCTG AGTATTTAAG AACAGTGACG 6840

CCTTATGAAG TATTTAGTTA TGGAATTGAT GAGGAAGCCC AATTTATGGC TAAAAATATT 6900

CAAGAATCTT TACAAGGTGT CAGCTTTGAT TTTGTAACGC CTTTTGGAAC TTACCCAGTA 6960

AAATCGCCTT ATGTTGGTAA GTTTAATATT TCTAATATTA TGGCGGCAAT GATTGCGGTG 7020

TGGAGTAAAG GTACATCTTT AGAAACGATT ATTAAAGCTG TTGAAAATTT AGAACCTGTT 7080

GAAGGGCGAT TAGAAGTTTT AGATCCTTCG TTACCTATTG ATTTAATTAT CGATTATGCA 7140

CATACAGCTG ATGGTATGAA CAAATTAATC GATGCAGTAC AGCCTTTTGT AAAGCAAAAG 7200

TTGATATTTT TAGTTGGTAT GGCAGGCGAA CX3TGATTTAA CTAAAACGCC TGAAATGGGG 7260

CGAGTTGCCT GTCGTGCAGA TTATGTCATT TTCACACCGG ATAATCCGGC AAATGATGAC 7320

CCGAAAATGT TAACGGCAGA ATTAGCCAAA GGTGCAACAC ATCAAAACTA TATTGAATTT 7380
GTTTTAGCAT CAAAAGGAAG AGAACCATAT CAAATCATGC CAGGGCATAT TAAGGTGCCA 7500

CATCGAGATG ATTTAATTGG CCTTGAAGCA GCTTACAAAA AGTTCGGTGG TGGCCCTGTT 7560 

GATTAATAAA AGATTTATTG ATGAAGGTAA AACTATTGAT GTTTATTTAT TCGAAGCATT 7620

AAATAACCAG ATAATCATTG CTATACCAGA TTGGTTTTGG TCATATCAGA TGGCAATGAC 7680

ATTAGATGAA GAAACTTGTT TTGAAGCAAT ACTCATGCAA TTGTTTGTTT TTAAAGAAGA 7740 


GGAAGAGGCA GAATCGATTG CATCACAACT AACAGATTGG ATAGAAACAT ATAAAAAGGA 7800 

GAAAQACTAA TGAACTTAAA GCAAGAAGTT GAGTCTAGAA AGACTTTTGC GATTATTTCA 7860 

CATCCCGATG CAGGGAAAAC AACGTTAACT GAAAAACTAT TGTACTTCAG TGGTGCTATT 7920 


CGTGAAGCGG GTACAGTTAA AGGGAAGAAG ACTGGTAAAT TTGCGACAAG TGACTGGATG 7980 

AAAGTTGAAC AAGAGCGTGG TATTTCTGTA ACTAGTTCAG TAATGCAATT TGATTACGAT 8040 

GATTATAAAA TCAATATCTT AGATACACCA GGACATGAAG ACTTTTCAGA AGATACGTAT 8100 


AGAACATTAA TGGCAGTTGA CAGTGCTGTC ATGGTCATAG ACTGTGCAAA AGGTATTGAA 8160 

CCACAAACAT TGAAGTTATT TAAAGTTTGT AAAATGCGTG GTATTCCAAT CTTTACATTC 8220 

ATTAATAAAT TAGACCGAGT AGGTAAAGAA CCATTTGAAT TATTAGATGA AATCX3AAGAG 8280

ACATTAAATA TTGAAACATA CCCTATGAAT TGGCCAATTG GTATGGGACA AAGTTTCTTT 8340 

GGCATCATTG ATAGAAAGTC TAAAACAATT GAACCATTTA GAGATGAAGA AAATATATTA 8400 

CATTT6AATG ATGATTTTGA GTTGGAAGAA GATCATGCAA TTACAAATGA TAGTGATTTT 8460

6AACAAGCGA TTGAAGAATT AATGTTGGTT GAAGAAGOGG GTGAAGCCTT TGATAAT6AC 8520 

GCGCTGTTGA GTGGAGACTT AACACCTGTA TTTTTCGGTT CAGCTTTAGC TAACTTTGGT 8580 

6TACAAAATT TCTTAAATGC ATATGTTGAT TTTGCGCCAA TGCCAAATGC GAGACAAACA 8640

AAAG&AGACX3 TTGAAGTAAG CCCGTTTGAT GATTCATTTT CAGGATTTAT CTTTAAAATT 8700 

CAAGCCAACA TGGACCCTAA ACACCGT6AT AGAATTGCCT TTATGCGTGT CGTTAGTGGT 8760 


GCA7TTGAAC GTGGTATGGA TGTTACTTTG CAACGTACTA ATAAAAAGGA AAAGATCACA 8820 

CGTTCAACGT CATTTATGGC AGACGATAAA GAAACTGTGA ATGATGCTGT AGCAGGCGAT 8880 

ATCATTGGAC TATATGATAC TGGTAATTAT CAAATTGGAG ATACTTTAGT TGGTGGAAAA 8940 


CAAACCTACA GTTTCCAAGA TTTACCACAA TTTACGCCAG AAATTTTTAT GAAAGTTTCT 9000 

GCTAAAAACG TCATGAAACA GAAGCATTTC CATAAAGGTA TTGAAGAATT AGTACAAGAA 9060 

GGTGCGATTC AATACTATAA AACATTACAC ACAAACCAAA TTATTTTAGG TGCTGTTGGT 9120 


CAGTTACAAT TTGAAGTTTT CGAACATAGA ATGAAAAACX5 AATATAATGT TGATGTTGTT 9180 




EP 0 786 519 A2 



AAGATGAACA CATCAAGATC GATTTTAGTG AAAGATAGAT ATGACGATTT AGTATTCTTA 9300

TTTGAAAATG AATTTGCAAC AAGATGGTTT GAAGAGAAAT TCCCTGAAAT TAAATTGTAT 9360

AGTTTACTTT AACAGCTCAA TTGTATAATC GAATTTGTTA CATTAAAAAT AATTGTTTCG 9420

TTGAAGAAAA ATAAATTGTA TATTTTAAAA GAAAAAGGTA TACTATGATG TATCAAATGA 9480

ATAACCTATG GCATTTTGTC AGAGGGGAGT AACTTAAGAA TCATGACCGT ATAAATGaTT 9540

CGACACTTTA TCGTCATTAC GArGATATCT TCCGGTAAAG TGGGCAATTT AAATTGCTTA 9600

GTGAGACCTT TGCTATTTAT TTAGCATAGG TCTTTTTGTT TGTACTTAAC TTATTTATTT 9660

AAAGGAGTTG TACATGTTAA TGGATCCAAG TTTGATCTTA CCTTATTTAT GGGTACTTGT 9720

CX3TTTTAGTA TTTTTAGAAG GCTTATTAGC AGCAGATAAC GCGATTGTTA TGGCTGTAAT 9780

GGTTAAGCAC TTACCACCCX} AACAACGTAA AAAAGCTTTG TTTTACGGTT TGTTAGGTGC 9840

ATTTGTATTT AGATTTTTAG CATTATTCTT AATTAGTATT ATCGCGAACT TTTGQTTTAT 9900

TCAAGCTGCA GGAGCGGTTT ACTTAATTTA TATGTCAATC AAAAATCTGT GGCAGTTCTT 9960

TAAACACCCA GAAATTGAAA GTCCTGAAGC TGGAGATGAT CATCATTATG ATGAATCTGG 10020

TGAAQAGATT AAAGCAAGTA ACAAATCATT CTGGGGAACT GTGTTGAAAA TAGAATTTGC 10080

AGATATCGCA TTTGCCATTG ATTCTATGCT TGCTGCTTTA gCTATTGCTG TAACACTTCC 10140

TAAAGTTGGT ATTCACTTTG GTGGTATGGA CTTAGGTCAG TTCGTAGTCA TGTTCCTAGG 10200

TGGAATGATT GGTGTTATTC TAATGCGTTA TGCAGCAACA TGGTTTGTAG AGCTATTAAA 10260

CAAATATCCA GGACTTGAAG GTGCAGCCTt CGCGATCGTT GGTTGGGTAG GTGTTAAATT 10320

AGTTGTCATG GTATTAGCGC ACCCAGACAT CGCTGTATTG CCTGAGCACT TCCCACATGG 10380

CGTATTATGG CAATCTATTT TCTGGACAGT ACTAATTGGA TTAGTAATTA TCGGTTGGTT 10440

AGGT7CAGTT GTTAAAAATA AAAAATCGCA TAAATAATTG ATGTGAAGCX3 GACAATCTTA 10500

ATTTAGTTTA AGGTTGTCCT TTTTCATTTA ATTGAGTGAT TTATGAAAAA TGGATTTTGA 10560

AGAATGTGAA TCAAAAGATG CGATATAGTA TTAAGAAAAT GTGCCTTTTA TATTTAGCAT 10620

TTTTTCAATA GAAATTATAT AGATTTTAAA GCAAATTAGG TGTTAATGTG TCATAATGAT 10680

AAGTGATTTT ATTGAATGGA GTGGACATTA GTGGATATTG GTAAAAAACA TGTAATTCCT 10740

AAAAGTCAGT nACCsaCGTA AGCGTCGTGA ATTCTTCCAC AACGAAGACA GAGAAGAAAA 10800

TTTAAATCAA CATCAAGATA AACAAAATAT AGATAATACA ACATCAAAAA AAGCAGATAA 10860

GCAAATACAT AAAGATTCAA TTGATAAGCA CGAACGTTTT AAAAATAGTT TATCATCGCA 10920

TTTAGAACAG AGAAACCGTG ATGTTAATGA GAATAAAGCT GAAGAAAGTA AAAGTAATCA 10980



AAATTCATTA GATTCAGTGG ACCAAGATAC AGAGAAATCA AAATATTATG AGCAAAATTC 11100 

TGAAGCGACT TTATCAACTA AATCAACCGA TAAAGTAGAA TCAACTGAAA TGAGAAAGCT 11160

AAGTTCAGAT AAAAACAAAG TTGGTCATGA AGAGCAACAT GTACTTTCTA AACCTTCAGA 11220 

ACATGATAAA GAGACTAGAA TTGATTCTGA GTCTTCAAGA ACTGATTCAG ACAGCTCGAT 11280 

GCAGACAGAG AAAATAAAAA AAGACAGTTC AGATGGAAAT AAAAGTAGTA ATCTGAAATC 11340 

TGAAGTAATA TCAGACAAAT CAAATACAGT ACCAAAATTG TCGGAATCTG ATGATGAAGT 11400 

AAATAATCAG AAGCCATTAA CTTTACCGGA AGAACAGAAA TTGAAAAGAC AGCAAAGTCA 11460 

AAATGAGCAA ACAAAAACCT ATACATATGG TGATAGCGAA CAAAATGACA AGTCTAATCA 11520 

TGAAAATGAT TTAAGTCATC ATATACCATC GATAAGTGAT GATAAAGATA ACGTCATGAG 11580 

A6AAAATCAT ATTGTTGACX5 ATAATCCTGA TAATGATATC AATACACCAT CATTATCAAA 11640 

AACAGATGAC GATCGAAAAC TTGATGAAAA AATTCATGTT GAAGATAAAC ATAAACAAAA 11700 

TGCAGACTCG TCTGAAACGG TGGGATATCA AAGTCAGTCA ACTGCATCTC ATCGTAGCAC 11760 

TGAAAAAAGA AATATTTCTA TTAATGACCA TGATAAATTA AACGGTCAAA AAACAAATAC 11820 

AAAGACATCG GCAAATAATA ATCAAAAAAA GGCTACATCA AAATTGAACA AAGGGCGCGC 11880

TACGAATAAT AATTATAGTG ACATTTTGAA AAAGTTTTGG ATGATGTATT GGCCTAAATT 11940 

AGTTATTCTA ATGGGTATTA TTATTCTAAT TGTTATTTTG AATGCCATTT TTAATAATGT 12000 

GAACAAAAAT GATCGCATGA ATGATAATAA TGATGCAGAT GCTCaAAAAT ATACGACAAC 12060 

GATGAAAAAT GCCAATAACA CAGTTAAATC GGTCGTTACA GTTGAAAATG AAACATCAAA 12120 

AGATTCmTCA TTACCTAAAG ATAAAGCATC TCaAGACGAA GTGGGATCAG GTGTTGTATA 12180 

TAAAAAATCT GGAGATACGT TATATATTGT TACXWU^TGCA CACGTTGTCO GTGATAAAGA 12240 

AAATCaAAAA ATAACTTTCT CGAATAATAA AAGTGTTGTT GGGAAAGT6C TTGGTAAAGA 12300 

TAAATGGTCA GATTTAGCTG TTGTTAAAGC AACTTCTTCA GACAGTTCAQ TGAAAGAGAT 12360 

AGCTATTGGA GATTCAAATA ATTTAGTGTT AGGAGAGCCA ATATTAGTCG TAGGTAATCC 12420 

ACTTGGTGTA GACTTTAAAG GCACTGTGAC AGAAGGTATT ATTTCAGGTC TGAACAGAAA 12480 

TGTTCCTATT GATTTCGATA AAGATAATAA ATATGATATG TTGATGAAAG CTTTCCAAAT 12540 

TGATGCATCA GTAAATCCAG GTAACTCGGG TGGTGCTGTC GTCAATAGAG AAGGAAAATT 12600 

AATAGGTGTA GTTGCAGCTA AAATTAGTAT GCCAAACGTT GAAAnTATGT CATTTGCA 12658 

(2) INFORMATION FOR SEQ ID NO: 128: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6048 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128:

TGAAATnGAA TAGTACTATT GCAAGTGTAA AGAGGTTAAT TTTTGCCnCA CGCGGGACTT 60 

AAAAAGGCAA CCACTGGTTG TGACATATCC TTATTTACAT TTATAAATAT AAGGAGGAGG 120 

TAGTAGTGAA AGACTTATTG CAAGCACAGC AAAAGCTTAT ACCGGATCTC ATAGATAAAA 180 

TGTATAAACG TTTTTCTATT CTTACTACTA TCTCAAAAAA TCAGCCTGTC GGACGTCGAA 240 

GTTTAAGCGA ACATATGGAT ATGACTGAAC GTGTACTGCG TTCTGAAACA GATATGCTTA 300

AGAAACAAGA TTTGATAAAA GTTAAGCCTA CCGGAATGGA AATTACAGCT GAAGGTGAGC 360 

AACTGATTTC GCAATTGAAA GGTTACTTTG ATATCTATGC AGATGATAAT CGTCTGTCAG 420 

AAGGTATTAA GAATAAATTT CAAATTAAGG AAGTTCATGT TGTTCCTGGT GATGCT6ATA 480 

ATAGTCAATC TGTTAAAACA GAATTAGGTA GACAAGCAGG TCAATTACTT GAAGGCATAT 540 

TACAAGAAGA CGCGATAGTT GCTGTAACTG GCGGATCCAC GATGGCATGT GTTAGTGAAG 600 


CAATTCATTT ATTACCATAT AATGTATTCT TCGTACCAGC CAGAGGTGGA CTAGGCGAAA 660 

ATGTTGTCTT TCAGGCAAAC ACAATTGCAG CCAGTATGGc aCAACAAGCT GGCGGTTATT 720 

ATACGACGAT GTATGTACCT GATAATGTCA GTGAAaCAAC ATATAATACA TTGTTGTTAG 780 

AGCCATCAGT CATAAACACT TTAGACAAAA TTAAACAAGC AAACGTTATA TTACACGGCA 840 

TTGGTGATGC GCTGAAGATG GCGCATOGAC GTCAATCACC TGAAAAGGTC ATTGAACAAC 900 

TTCAACATCA TCAAGCTGTC GGAGAGGCAT TTGGTTATTA TTTTGATACA CAAGGTCAAA 960

TTGTCCATAA GGTTAAAACA ATTGGACTTC AATTAGAAGA CCTTGAATCA AAAGACTTTA 1020 

TTTTTGCAGT TGCAGGAGGC AAATCGAAAG GTGAAGCAAT TAAAGCATAC TTGACGATTG 1080 

CACCCAAGAA TACAGTGTTA ATCACTGATG AAGCCGCAGC AAAGATAATA CTTGAATAAG 1140 

AGATAAAAAG TTTAATACTT TTTAAATATC ATTTTAAAGG AGGCCATTAT AATGGCAGTA 1200 

AAAGTAGCAA TTAATGGTTT TGGTAGAATT GGTC6TTTAG CATTCAGAAG AATTCAAGAA 1260 


GTAGAAGGTC TTGAAGTTGT AGCAGTAAAC GACTTAACAG ATGACGACAT GTTAGCGCAT 1320 

TTATTAAAAT ATGACACTAT GCAAGGTCGT TTCACAGGTG AAGTAGAGGT AGTTGATGGT 1380 

GGTTTCCGCG TAAATGGTAA AGAAGTTAAA TCATTCAGTG AACCAGATGC AAGCAAATTA 1440

CCTTGGAAAG ACTTAAATAT CGATGTAGTA TTAGAATGTA CTGGTTTCTA CACTGATAAA 1500 

GATAAAGCAC AAGCTCATAT TGAAGCAGGC GCTAAAAAAG TATTAATCTC AGCACCAGCT 1560 

ACAGTTGTTT CAGGTGCTTC ATGTACTACA AACTCATTAG CACCAGTTGC TAAAGTTTTA 1680

AACGATGACT TTGGTTTAGT TGAAGGTTTA ATGACTACAA TTCACGCTTA CACAGGTGAT 1740 


CAAAATACAC AAGACGCACC TCACAGAAAA GGTGACAAAC GTCGTGCTCG TGCAGCGGCA 1800 

GAAAACATCA TCCCTAACTC AACAGGTGCT GCTAAAGCTA TCGGTAAAGT TATTCCTGAA 1860 

ATCGATGGTA AATTAGATGG TGGTGCACAA CXyTGTTCCTG TAGCTACAGG TTCATTAACT 1920 


GAATTAACAG TAGTATTAGA AAAACAAGAC GTAACAGTTG AACAAGTTAA CGAAGCTATG 1980 

AAAAATGCTT CAAACGAATC ATTCGGTtAC ACTGAAGACG AAATCGTTTC TTCAGACGTT 2040 

GTAGGTATGA CTTACGGTTC ATTATTCGAC GCTACACAAA CTCGTGTAAT GTCAGTTGGC 2100

GACCGTCAAT TAGTTAAAGT TGCAGCTTGG TATGATAACG AAATGTCATA TACTGCACAA 2160 

TTAGTTCGTA CATTAGCATA CTTAGCTGAA CTTTCTAAAT AATTTTAGTA TAGTTTTTAT 2220 

TCAAATACGC TAGTGCTCAG AACTATTTAG CATTAATTAA AGCTTATGAG TAAGCGGGGA 2280

GCACAAACGC TTCTCCGCTT ATTTTTATAT AAAATTTCCT AATTACAAGG AGGAAACACC 2340 

ATGGCTAAAA AAATTGTTTC TGATTTAGAT CTTAAAGGTA AAACAGTCCT AGTACGTGCT 2400 

GATTTTAACG TACCTTTAAA AGACGGTGAA ATTACTAATG ACAACCGTAT CGTTCAAGCT 2460 

TTACCTACAA TTCAATACAT CATCGAACAA GGTGGTAAAA TCGTACTATT TTCACATTTA 2520 

GGTAAAGTGA AAGAAGAAAG TGATAAAGCA AAATTAACTT TACGTCCAGT TGCTGAAGAC 2580 


TTATCTAAGA AATTAGATAA AGAAGTTGTT TTCGTACCAG AAACACGCGG CGAAAAACTT 2640 

GAAGCTGCTA TTAAAGACCT TAAAGAAGGC GACGTATTAT TAGTTGAAAA TACACGTTAT 2700 

GAAGATTTAG ACGGTAAAAA AGAATCTAAA AATGATCCAG AATTAGGTAA ATACTGGGCA 2760

TCTTTAGGTG ATGTGTTTGT AAATGATGCT TTTGGTACTG CGCATCGTGA GCATGCATCT 2820 

AATQTTGGTA TTTCTACACA TTTAGAAACT GCAGCTGGAT TCTTAATGGA TAAAGAAATT 2880 

AAGTTTATTG GCGGCGTAGT TAACGATCCA CATAAACCAG TTGTTGCTAT TTTAGGTGGA 2940

GCAAAAGTAT CTGACAAAAT TAATGTCATC AAAAACTTAG TTAACATAGC TGATAAAATT 3000 

ATCATCGGCG GAGGTATGGC TTATACTTTC TTAAAAGCGC AAGGTAAAGA AATTGGTATT 3060 


TCATTATTAG AAGAAGATAA AATCGACTTC GCAAAAGATT TATTAQAAAA ACATGGTGAT 3120 

AAAATTGTAT TACCAGTAGA CACTAAAGTT GCTAAAGAAT TTTCTAATGA TGCCAAAATC 3180 

ACTGTAGTAC CATCTGATTC AATTCCAGCA GACCAAGAAG GTATGGATAT TGGACCAAAC 3240 


ACTGTAAAAT TATTTGCAGA TGAATTAGAA GGTGCGCACA CTGTTGTATG GAATGGACCT 3300 

ATGGGTGTAT TCGAGTTCAG TAACTTTGCA CAAGGTACAA TTGGTGTATG TAAAGCAATT 3360 
TCTTTAGGTT TTOAAAATGA CTTCACTCAT ATTTCAACTG GTGGCGGCGC GTCATTAGAG 3480

TACCTAGAAG GTAAAGAATT GCCTGGTATC AAAGCAATCA ATAATAAATA ATAAAGTGAT 3540

AGTTTAAAGT GATGTGGCAT GTTTGTTTAA CATTGTTACG GGAAAACAGT CACAAGATGA 3600

CATCGTGTTT CATCACTTTT CAAAAATATT TACAAAACAA GGAGTGTCTT TAATGAGAAC 3660

ACCAATTATA GCTGGTAACT GGAAAATGAA CAAAACAGTA CAAGAAGCAA AAGatTCGTC 3720

AATACATTAC CAACACTACC AGATTCAAAA GAAGTAGAAT CAGTAATTTG TGCACCAGCA 3780

ATTCAATTAG ATGCATTAAC TACTGCACTT AAAGAAGGAA AAGCACAAGG TTTAGAAATC 3840

GGTGCTCAAA ATACGTATTT CGAAGATAAT GGTGCGTTCA CAGGTGAAAC GTCTCCAGTT 3900

GCATTAGCAG ATTTAGGCGT TAAATACGTT GTTATCGGTC ATTCTGAACG TCGTGAATTA 3960

TTCCACGAAA CAGATGAAGA AATTAACAAA AAAGCGCACG CTATTTTCAA ACATGGAATG 4020

ACTCCAATTA TATGTGTTGG TGAAACAGAC GAAGAGCGTG AAAGTGGTAA AGCTAACGAT 4080

GTTGTAGGTG AGCAAGTTAA GAAAGCTGTT GCAGGTTTAT CTGAAGATCA ACTTAAATCA 4140

GTTGTAATTG CTTATGAACC AATCTGGGCA ATCGGAACTG GTAAATCATC AACATCTGAA 4200

GATGCAAATG AAATGTGTGC ATTTGTACGT CAAACTATTG CTGACTTATC AAGCAAAGAA 4260

GTATCAGAAG CAACTCGTAT TCAATATGGT GGTAGTGTTA AACCTAACAA CATTAAAGAA 4320

TACATGGCAC AAACTGATAT TGATGGGGCA TTAGTAGGTG GCGCATCACT TAAAGTTGAA 4380

GATTTCGTAC AATTGTTAGA AGGTGCAAAA TAATCATGGC TAAGAAACCa ACTGCGTTAA 4440

TTATTTTAGA TGGTTTTGCG AACCGCGAAA GCGAACATGG TAATGCGGTA AAATTAGCAA 4500

ACAAGCCTAA TTTTGATCGT TATTACAACA AATATCCAAC GACTCAAATC GAAGCGAGTG 4560

GCTTAGATGT TGGACTACCT GAAGgACAAA TGGGTAACTC AGAAGTTGGT CATATGAATA 4620

TCGGTGCAGG ACGTATCGTT TATCAAAGTT TAACTCGAAT CAATAAATCA ATTGAAGACG 4680

GTGATTTCTT TGAAAATGAT GTTTTAAATA ATGCAATTGC ACACGTGAAT TCACATGATT 4740

CAGCGTTACA CATCTTTGGT TTATTGTCTG ACGGTGGTGT ACACAGTCAT TACAAACATT 4800

TATTTGCTTT GTTAGAACTT GCTAAAAAAC AAGGTGTTGA AAAAGTTTAC GTACACGCAT 4860

OTTTAGATGG CCGTGACGTA GATCAAAAAT CCGCTTTGAA ATACATCGAA GAGACTGAAG 4920

CTAAATTCAA TGAATTAGGC ATTGGTCAAT TTGCATCTGT GTCTGGTCGT TATTATGCAA 4980

TGGATCGTGA CAAACGTTGG GAACGTGAAG AAAAAGCTTA CAATGCTATT CGTAATTTTG 5040

ATGCCCCAAC TTATGCAACT GCCAAAGAAG GTGTAGAAGC AAGCTATAAT GAGGGCTTAA 5100

CTGAC6AATT CGTAGTACCA TTCATCGTTG AGAATCAAAA TGACGGTGTT AATGATGGAG 5160
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10 



15 



CGAACAGAGC ATTCGAAGGC TTTAAAGTTG AACAAGTTAA AGACTTATTC TATGCAACAT 5280 

TCACTAAGTA TAATGACAAT ATCGATGCGG CTATCGTCTT CGAAAAAGTT GATTTAAATA 5340

ATACAATTGG TGAAATTGCA CAAAATAACA ATTTAACTCA ATTACGTATT GCAGAAACTG 5400 

AAAAATACCC TCACGTTACT TACTTTATGA GTGGTGGACG TAACGAGGAA TTTAAAGGTG 5460 

AACGCCGTCG TTTAATTGAT TCACCTAAAG TTGCAACXSTA TGACTTGAAA CCAGAAATGA 5520 

GTGCTTATGA AGTTAAAGAT GCATTATTAG AAGAGTTAAA TAAAGGTGAC TTGGACTTAA 5580 

TTATTTTAAA CTTTGCTAAC CCTGATATGG TTGGACATAG TGGTATGCTT GAGCCGACAA 5640 

TCAAAGCAAT CGAAGCGGTT GATGAATGTT TAGGAGAAGT GGTTGATAAG ATTTTAGACA 5700 

TGGACGGTTA TGCAATTATT ACTGCTGACC ATGGTAACTC TGATCAAGTA TTGACGGaTG 5760 

ATGATCAACC AATGACTACG CAwACAACGA ACCCAGTACC AGTGATTGTA ACAAAAGAAG 5820 

20 GCGTTACACT TAGAGAAACT GGTCGCTTAG GTGACTTAGC ACCTACATTA TTAGATTTAT 5880 

TAAATGTA6A ACAACCTGAA GATATGACAG GTGAaTCTTT AATTAAACAC TAATATTGTA 5940 

AAAGATGTTA AGTAAACGCT TAATGACACT TATTTTTTGA AAATAATAGT AATATCllTTT 6000 

^5 TGTTAAATGA AAGAATAAAG CTATAATAAT TATAGAATAA CTATTTAn 6048 

(2) INFORMATION FOR SEQ ID NO: 129: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 5602 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129: 

AAASvAGTGC AAGATATCAT CGCATTAATT AAGTCGTTAC AAAgTGTAAT TGTAGACaTC 60 

40 GCTTCCAATA ATGTTGATAC AATTATGCCT GGTTATACTC ATTTACAGCG TGCACAGCCA 120 

ATTTCATTTG CACATCATAT TATGACTTAT TTTTGGATGT TACAACGAGA CCAACAACGA 180 

TTTGAAGATA GTTTAAAACG AATCGATATT AATCCTTTAG GTGCAGCAGC CTTAAGTGGT 240 

ACCACATACC CTATCGATAG ACACGAGACA ACAGCATTGT TGAACTTTGG CAGTCTCTAT 300 

6AGAATAGCC TAGATGCTGT TAGTGACAGA GACTATATTA TTGAAACATT 6CATAATATT 360 

TCTTTAACGA TGGTTCACTT ATCACGCTTT GCAGAGGAAA TTATTTTCTG GTCCACAGAC 420 

GAAGCTAAAT TCATTACATT ATCAGATGCA TTTTCAACTG GCTCATCTAT TATGCCACAA 480 

AAGAAAAATC CTGATATGGC AGAATTAATT AGAGGTAAAG TIGGTCGAAC GACTGGTCAT 540 

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GAAGATAAAG AAGGTTTATr CGATGCTGTC CATACAATTA AAGGTTCTTT ACGTATTTTC 660 

GAAGGTATGA TTCAAACGAT GACAATTAAT AAAGAACGAC TCAATCAAAC TGTTAAAGAA 720 

5 

GATTTTTCAA ATGCAACGGA ACTAGCAGAT TATTTAGTAA CTAAAAATAT TCCATTTAGA 780 

ACTGCACATG AAATTGTAGG AAAAATCGTC TTAGAATGTA TACAACAAGG TCATTATTTA 840 

TTAGATGTTC CTTTAGCAAC ATATCAACAA CATCATTCTA GTATTGATGC CGATATTTAC 900 

10 

GATTATTTGC AGCCTGAAAA TTGTTTAAAA CGACGTCAAA GTTACGGTTC AACAGGTCAA 960 

TCATCX3GTCA AACAACAACT TGATGTTGCT AAACAATTAC TATCACAATA AATACGTTAA 1020 

^5 TCTACCTACC CACAATGTCT ATTAAAATTA CATTGTGGGT ATTTTAATGC TCTCTTCGTC 1080 

TTGTTGAACA TCACATTTTT AAGATTCCTA AAATGTTTGA TAATTCTTTT AAATTTATAT 1140 

TACAAAAATG TTATAAATTG TAAAAGAAAT GTGTAAAGCG TTTTCACAAG CAGGTTTTTG 1200 

^ TAGTATTTTA AAATTGTTAG ACTACAAATA AAGAGATGAA AGGATAAAGA CTATGACTAA 1260 

CTCTTCGAAA AGCTTCACTA AATTTATGGC TGCTTCTGCT GTTTTTACTA TGGGATTTTT 1320 

ATCAGTACCT ACTGCTGGCG CTGAACAAAC AAATCAAATT GCAAATAAAC CTCAGGCTAT 1380 

25 

TCAATGGCAT ACAAATTTAA CX3AATGAGCG ATTCACTACT ATCGCACATC GTGGCGCAAG 1440 

TGGCTATGCA CCCGAGCATA CGTTTCAAGC ATATGATAAG AGTCATAATG AGTTAAAAGC 1500 

ATCTTATATC GAAATTGATT TACAACGTAC CAAAGATGGC CATTTAGTTG CTATGCATGA 1560 

30 

TGAAACTGTT AACCGTACAA CAAATGGACA CGGTAAAGTT GAGGATTATA CCCTTGATGA 1620 

ATTAAAACAG TTAGATGCAG GAAGTTGGTT TAATAAAAAA TATCCAAAAT ACGCAAGAGC 1680 

^ AAGTTATAAA AATGCTAAAG TACCCACTTT AGATGAAATT TTAGAACGTT ATGGCCCGAA 1740 

TGCAAACTAT TATATTGAAA CAAAGTCACC TGATGTATAC CCAGGAATGG AAGAACAATT 1800 

ATTA6CTTCA TTGAAAAAGC ATCACCTTTT AAATAACAAT AAATTAAAAA ATGGACATGT 1860

40 AATGATTCAA TCATTTTCTG ACGAAAGTTT AAAGAAAATT CATCGTCAAA ATAAGCATGT 1920 

GCCATTAGTA AAATTAGTTG ATAAAGGTGA ACTACAACAA TTTAACGACC AACGCTTAAA 1980 

AGAGATACGC TCTTATGCGA TTGGATTAGG TCCTGATTAT ACAGATTTAA CTGAACAAAA 2040 

45 

TACCCATCAT TTAAAAGACT TAGGATTTAT A6TACATCCT TATACAGTGA ATGAAAAAGC 2100 

TGATATGTTA CGATTAAATA AATATGGCGT TGATGGTGTC TTTACAAATT TCGCTGATAA 2160 

ATATAAAGAA GTCATTAAGT AGTAATGTTA AACTAGAAAA CATAAATACA AAAATATAGC 2220 

50 

TATTACTATA AAAAACAGCA GTAAGATATT TCCAAATT6A AATTATCCTA CTGCTGTCTT 2280 

TTTGGGAGTG GGACAGAAAT GATATTTTCX5 CAAAATTTAT TTCX3TCGTCC CACCCCAACT 2340 

55 
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TTGTCTGTAG AAATTGAGGA GCTAATTTCT CTGTGTCGGG GCTCCACCCC AACTTGCACA 2460 

CTATTGTAAG CTGACTTTCC GCCAGCCTCT GTGTTGGGGC CCCGCCAACT TGCACACTAT 2520 

TGTAAGCTGA CTTTCCACCA GCCTCTGTGT TGGGGCCCCG ACTATTTTTG AAAAGAGCGT 2580 

GTTACACGGG CATTGTTTTA CAGTCAACTA CTGCTAAAAT AAAATTAACG AGCTTAGGGC 2640

TTTGTTTTCT GTCCCAAGCT CGTTAAATCA CATATGATAA TTAATTATGC CCAACCACGA 2700 

TATCTAGCTG CTTCTGCTGT ACGTTTAATA CCTATGATAT ATGCTGCAAG TCTCATATCT 2760 

ATTTTTCGGT TTTGAGACAA TTGGTAAATC GTATCAAAT6 CCGCTTCTAA TTTTTCACGT 2820 

,5 AGCTTTTCAT TAACTTCTTC TTCAGACCAA TAATAACCTT GATTATTTTG TACCCATTCG 2880 

AAGTAAGAAA CCGTtACACC ACCAGCACTT GCTAATACGT CTGGAACTAA TAATATACCA 2940 

CXfTTCAGTTA AAATACGTGT TGCTTCTGGT GTTGTAGGTC CATTAGCAGC TTCAACAACG 3000 

ATACTAGCTT TAATATCATG TGCATTGTCT TCTGTAATTT GG1TTGAAAT AGCCGCTG6T 3060 

ACTAAAATGT CACAATCTAA TTCAAACAAT TCTTTATTTG AGATTGTTTC TTCAAATAAA 3120 

TTTGTTACCG TACCAAAACT ATCACX3ACGG TCTAATAAAT AATCTATATC TAAGCCATTT 3180 

GGATCGTGTA ATGCACCGTA AGCATCAGAG ATACCTACAA TTTTTGCACC TAAATCATAT 3240 

AAGAATTTAG CTAAGAAACT TCCGGCATTA CCGAAACCTT GAATAACAAC CTTGGCACCT 3300 

TCAATTTGCA TATTACGACG TTTTGCAGCT TGTTCAATTG CAATAACTAC ACCTAGTGCA 3360 

GTTGATCTGT CGCGTCCATG AGAACCACCC AATACAATTG GTTTACCTGT GATGAAACCT 3420 

GGTGAATTAA ATTTATCTAA TGCACTATAT TCATCCATCA TCCAAGCCAT AATTTGTGAG 3480 

TTTGTAAATA CATCTGGTGC TGGAATATCT TTGTTCGGAC CTACGAATTG TGAAATTGCT 3540 

CTTACATATC CGCGTGATAA ACGTTCAACT TCATGAATGC TCATTTGACG TGGATCACAA 3600 

ACGMACCAC CCTTACCACC ACCGTATGGT AAGTTTACAA TGCCACATTT CAAAGTCATC 3660 

40 CACATTGATA ATG C TTTT A C TTCTTCTTCA TCAACATCTG GGTGGAAACG CACX5CCCCCT 3720 

TTTGTTGGTC CAACAGCATC ATTATGTTGC GCACGQTAAC CTOTGAATGT TTTTACTGTG 3780 

CCATCATCCA TTCGTACAGG GATACGCACT TGTAACATTC TTAAAGGTTC TTTAATTAAA 3840 

TCGTACATTC CTtCGTCAAA TCCCAATTTA TGCAATGCTT CTTTAATAAT TCCTTGAGTA 3900 

GAAGTTACTA AATTATTGTT CTCAGTCATG ATCCTTTTCG CCTCTTCTTT ACCTAATGAT 3960 

TTCGCTTTCA AACATATTGT AACATAACGT ATTCCTTTTT AAAGCCCTTA CAAACTGATT 4020 

GTTACAACTT TTTGACATTA TTGAAATACA TGTCTTATTT TTTCAAGTGC AAGGTCCAAT 4080 

TCTTCTTTAG TAATAATTAA TGGTGGTGCA AAACGAATGA CAGTATCATG CGTTTCTTTA 4140 
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35 



40 



ACACCTATAA 


ACAAACCACG 


TCCACGGACT 


TCTTTAATTG ATGGATGATC 


AATTTGCTTT 


4260 


AATTGTTCTr 


TAAAATAATC 


TCCTAATTCT 


AAAGAGCGGC 


CTGGTAAATC 


CTCATCAACG 


4320 


ATAACATCTA 


ATGCAGCAAT 


TGATGCAGCA 


CAAGCAAGTG 


GATTACCACC 


AAATGTTGAA 


4380 


CCATGTGAGC 


CAGGTGTAAA 


GACATCTAAT 


ACTTCTTTAT 


CTGCTAATAC 


AACAGAAATT 


4440 


GGGAAGACTC 


CACCACCTAG 


TGCTTTACCT 


AAAATATAGA 


CATCAGGTTT 


TACATTATCC 


4500 


CAATCCGTA6 


CAAATAATTT 


ACCCGAACGA 


CCTAATCCTG 


CTTGGATTTC 


GTCAGCAATA 


4560 


AATAAGACAT 


TATGTTCATC 


ACATAATTCT 


CTAATTGCTT 


TCAAATATCC 


TTCTGGCGGT 


4620 


ATATTTATAC 


CCGCTTCACC 


TTGAATTGGT 


TCTACTAAAA 


CTGCTGCAGT 


ATTTTCATTA 


4680 


ATTGCAGCTT 


TCAATGCATC 


TACATCTCCA 


AAATCAACTT 


TTCTAAATCC 


ATCTAATAAC 


4740 


GGACCATAAC 


CACGTTGGTA 


TTCTGCTTCT 


GAAGATAATG AAACTGGCGC 


CATTGTTCGA 


4800 


CCATGGAAGT 


TACCATTAAA 


TGCAATGATT 


TCTGCTTTAT 


TTGGCTCAAT 


TCCTTTAACA 


4860 


TCGTATGCCC 


AGCGTCGTGC 


TGCTTTCAAA 


GCTGTTTCTA 


CTGCTTCAGC 


ACCTGTATTC 


4920 


ATTGGTAAAG 


CTTTATCTTT 


ACCTGCCAGT 


TTACAAATTT 


TTTCGTACCA 


TTCACCTAAG 


4980 


TTATCACTAT 


GAAAAGCACG 


TGAAACTAAA 


GTCACTTTAT 


CAGCTTGATC 


TTTTAATGCT 


5040 


TGAATAATTT 


TCGGATGTCT 


ATGACCTTGG 


TTAACAGCGG 


AATATGCAGA 


TAACATATCC 


5100 


ATATATTTAT 


TGCCTTCAGG 


ATCTTTAACC 


CATACCCCTT 


CAGCTTcTGa 


AATGaCAATT 


5160 


GGcAATGGTA 


AATAATTATG 


TGCTCCGTAA 


TGATTTGTTA ACTCAATAAT 


TTTTTCAGAT 


5220 


TTAGTCATCA 


TATCTCCCCT 


TTTCATCATT 


TATAACTATT 


ATACATGAAA 


CATTATCCAA 


5280 


ATAATTACAT 


TAGTTTTCAA 


AGCAGATACT 


TTTCCACCAA AAAAGATGAA 


ATAATCACTA 


5340 


AGTTTCATTA 


AATTTGTCTA 


TTTTGAAAAC 


CCTTACATTT 


ATAATGACAT 


AATTACTTAA 


5400 


ATGaTTACAA 


GCAAAAGAAT 


TGATAATTTT 


ACACTTAATC AAAAGTATAT 


TTTACTAAGA 


5460 


ATA'TTTTTAT 


TTATAAATAT 


TGAAAACCAC 


TAACAAATTG 


CATACACAAT 


ATCATTAGTG 


5520 


GTAACAGTTA 


AACACTTATT 


TATCTTTACG 


GGGTAATGGG 


TTAAAACCCT 


TnCATTAAAA 


5580 


TTGGATGnCC 


ATAAAATTAG 


GG 








5602 



(2) INFORMATION FOR SEQ ID NO: 130: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5924 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
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TAACCCCATT TTACCTGGAA AAATCgTTTG 
TTTACGTATa GAATTATAAA AgGTTTGATT 
^ TTTTATOTCA AATTTAAAAC AGTAACACTT 

AAAGGAAGTG TCGCGTATTT TAACTTTTTC 
TTTAATGAAT ACTTATTGCA ATCAATGTCC 

10 

GAAAACGAAG GCGCATCATT TTTGTATCAA 
ATTAGGAAAT GAACTTCAAT AGGAGGAAGT 
CCGAACCACA ACACAATGAA 6ATTACACTT 
ACGTATACTA AAGTTGATGA TTCACAGCCA 
GGCGTTAAAT CAATTTTCCA TGTTATGGAC 
20 AATTGGGAAA CAGTATTGCC AAAAGTAGAG 

GTATTCGGGG GGAATAAAGT ATATGGAAAT 
TACAATGAAA GTTGTTTTGT CATATACAAG 
AGTAGAAGAA ACACAACCAA GATTTATAAA 
CATTTTTCAT GTCATGAACT TCTTAGCTGT 
CATATTACCT GATATTAAAG CTGCTTTTTC 

30 

TGAACCTCAA ATTGACAATC ATTTTGGTGA 
TATACCGTAT CAAATTAAGC TAACTTCTGC 
AACATATGTT GACCATATGA CTCAAGCGCA 

35 

TAAATGGCTA GATTTAGGAA ATCGCTATGG 
AGAAGAAGTG CTAGCTACCT ATCCAGAATC 

40 AGAAGAAAAT CACGCAACTA ATAATTATCA 
TCATGCAACT GATAATTGGA AGACTCGATT 
TTTTGAAGAT ATACC6CTGC TTGATTTAGC 

^ TCAAGCGATT GTATTATTAG GTATGATTGA 

GGGGCTTC6T GATAAAAGTC CTGCTGTAAG 

AGGGTATCCA GAGGCACTAC CAGAAATGGT 

SO 

TAGGTGGCGT GCTGCTATGT TTATCTTTGA 

AAAAGCCCAT ATTAATGACA ATGCGTTTGA 

55 



CGATGCaATm GCaTTtGaAT ATAaATACAT 60 

CaAATCTTAG GGTCAAAAAT GTTATAATAT 120 

ATTTACAAGG TTGCAATATT TTGAAGTAAT 180

AGAGCAAAAT GCACTCGCGA AAATAGATGA 240 

AATCAAAACT CGTCTGCGTA AATTAGAGGG 300 

TGAGTGTTCA ATAGGGAAAG AAATAAAACA 360 

CAAATGAAAA TTATATCTAT ATCAGAAACA 420 

AGTGAAA6CA GAGAAGGTAT GACATCAGAT 480 

GCATTTATTA ATGACATCTT AAAGGTTGAA 540 

TTTATTTCA6 TAGATAAAGA AAATGACGCA 600 

GCTGTATTCG AATAAATTTT TCATCAACTA 660 

TTTACGTATA GAGCCAACAC CAAGTCCAAA 720 

AGAAGACAAG TTATCTAATA CTTATAAAAA 780 

TCAGTTGTTA TCTATAGATG GTATCACTTC 840 

TGATAAGGCA CCAAAAGCTG ATTGGGAAGT 900 

TGATGCGAAT AAGGTTTTAG AATCTGTAAA 960 

AATTAAAGCT GAATTATTAA CTTTTAAGGG 1020 

TGACCAAGAA TTAAGAGAAC AATTACCACA 1080 

AACAGCACAT GACAATATTG TTTTTATGCG 1140 

AAATATTCAA GAAGTAATGG ATGGTGTCCT 1200 

ACAGTTACCC GTATTGGTAA AACATGCTTT 1260 

TTTCTATCGA CATGTCTCTT TGGATGAATA 1320 

ACGAATGTTA AACCATTTTC CAAAGCCGAC 1380 

TTTATCTGAT GAAAAAGTAC CX3GTTAGACG 1440 

AAGTAAAGAA ATTTTACCGT ATTTATATAA 1500 

AAGAACAGCA GGGGATTGCA TAAGCGATTT 1560 

GCTACTATTA GATGATCCAC AGAAAATCGT 1620 

TGAAGGTAAT GCAGAGCAGC TTCCCGCACT 1680 

AGTTAAATTA CAAATTGAAA TGGCCATATC 1740 






EP 0 786 519 A2 






AATTTAATTG GAGGAATTAA ATATGAATGC ATATGATGCT TATATGAAAG AAATTGCGCA 1860

ACAAATGCGT GGCXSAATTAA CTCAAAATGG TTTTACAAGT TTAGAAACGA GCGAACAGct 1920 

ATCGGAGTAT ATGAACCAAG TAAATGCTGA TGACACTACT TTTGTAGTTA TTAACTCTAC 1980 

ATGCGGCTGT GCAGCTGGAT TAGCAAGACC AGCTGCAGTA GCAGTTGCAA CACAAAATGA 2040 

ACATAGACCT ACAAATACAG TTACAGTTTT TGCTGGGCAA GATAAAGAAG CAACTGCTAC 2100 

AATGCGAGAA TTCATTCAGC AAGCACCATC TAGTCCTTCXj TATGCTTTAT TCAAAGGTCA 2160 

AGATTTAGTT TATTTTATX3C CTAGAGAATT TATCGAAGGT AGAGATATTA ATGACATTGC 2220 

AATGGACTTA AAGGAT6CCT TTGACGAAAA TTGTAAATAG TACACATAAA TAAATATAAA 2280 

GGTTAACACA TTTTATAATA TTAAAAATGG TGTCTGTCAT TGAAAATAGA GAATATAGTT 2340 

GTATTCTATT TGTTAAATAA AGTCCGTTTT TACCaACTAT ATTTTCTAGA AATTTAACTG 2400 

20 TTTTAATAGG ACATCAAACA TAATATTCaA ATCaTGTGTT AACCTCTTTT TTAAAATTTT 2460 

TTAGCATTAA AGTTATAGAT TTGGGTAAAC AATTACCAAT TGGAAACATA TATCACGTTA 2520 

CGATGGGGTA GGTACTTAAT CAGCATTTTA TAAATAAAGT AACGGAATTC ATGATATTAA 2580 

TATCATATTC CTAAAATGAG TGATAACAAA ATGCTACATA AAGTTAAGTT ATATCAAACT 2640 

AAATATACAT ACTATAAATA ATGAAAATGA GGTGTTATCG CATATGTTGA ATTCATTTGA 2700 

TGCAGCATAT CACAGTCTTT GTGAAGAAGT TTTAGAAATA GGAAATACAC GAAATGATCG 2760 

CACAAATACA GGTACGATTT CGAAATTTGG TCATCAACTT CGCTTTGACT TATCTAAAGG 2820

ATTTCCACTA TTAACGACAA AGAAAGTTTC TTTTAAATTA GTAGCAACCG AATTATTATG 2880 

GTTCATTAAA GGAGATACAA ACATCCAATA CTTATTAAAA TATAATAATA ATATATGGAA 2940 

CGAATGGGCT TTTGAAAATT ATATCAAATC AGACGAGTAT AAAGGTCCAG ATATGACAGA 3000 

TTTCDGGCAT CGTGCATTGA GTGATCCTGA ATTTAACGAA CAATATAAAG AACAAATGAA 3060 

40 ACAATTTAAG CAACGTATTC TTGAAGATGA TACATTTGCG AAGCAATTCG GGGATTTAGG 3120 

AAATGTTTAT GGTAAACAAT GGCGAGATTG GGTTGATAAA GATGGTAATC ATTTTGATCA 3180 

ACTTAAAACA GTAATTGAAC AAATTAAGCA TAATCCAGAT TCAAGGCGAC ACATCGTATC 3240 

^ TGCATGGAAT CCAACAGAAA TTGATACAAT GGCACTTCCG CCTTGTCATA CCATGTTCCA 3300 

GTTTTATGTC CAAGATGGTA AGTTAAGTTG CCAGTTATAC CAACGTAGCG CAGATATCTT 3360 

TTTAGGTGTG CCATTTAATA TCcGCagctA OGCTTTATTG ACACACCTTA TTGCCAAAGA 3420 

ATGTGGACTT GAAGTGGGTG AATTTGTGCA TACATTTGGA GATGCACATA TTTATTCAAA 3480 

TCATATTGAT GCGATTCAAA CACAATTAGC ACGTGAAAGC TTCAATCCTC CAACATTAAA 3540 
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TGAATCACAT CCAGCAATAA AAGCTCCAAT AGCAGTGTAG TCATTGCATA GTTAGCTAAC 3660 

CATATAGACA TCAAAATGAC ATCATAGTAT TTTCAAGTGC AAAAAAGTAC TTTTTTGTGT 3720 

TAAACGTTTT CATAAATTAT GCAAAATCAT TATTTCTATC ACACTTTATG ATAAAAATTG 3780 

TGTTAAATTA AAGATAACTT AGTAATAAAA AATGAAATGA TAGAAGAAGG AGGATAATTA 3840

TGACTTTATC CATTCTAGTt GCACATGACT TGCAACGAGT AATTGGTTTt GAAAATCAAT 3900 

TACCTTGGcA CCTACCAAAT GATTTGAAGC ATGTTAAAAA ATTATCAACA GGTCATACTT 3960 

TAGTAATGGG TCGTAAGACA TTTGAATCX3A TTGGTAAACC ACTACCGAAT CGTCGAAATG 4020 

TTGTACTTAC TTCAGATACA AGTTTCAACG TAGAnGGCGT TGATGTAATT CACTCTATTG 4080 

AAGATATTTA CCAACTACCG GGCCATGTTT TCATATTTGG AGGGCAAACA TTATTTGAAG 4140 

AAATGATTGA TAAAGTGGAC GACATGTATA TTACTGTTAT TGAAGGTAAA TTCCGTGGTG 4200 

20 ATACGTTCTT TCCACCTTAT mCATDcGAgr CTGGQAAGTT GGCTCTTCAG TTGAAGGTAA 4260 

ACTAGATGAG AAAAATACAA TTCCACATAC CTTTCTACAT TTAATTCGTA AAAAATAAGG 4320 

GGGAAAACGA CCATGACAAA ACAGATTATA GTAACAGACT CAACATCOGA TTTATCTAAA 4380 

QAATACTTAG AAGCAAACAA CATTCATGTA ATTCCTTTAA GTTTAACTAT TGAAGGAGCT 4440 

TCATACGTTG ACCAAGTAGA TATTACATCA GAAGAATTTA TTAATCATAT TGAAAATGAT 4500 

GAAGATGTAA AGACAAGTCA GCCAGCCATA GGTGAATTTA TATCTGCTTA TGAAGAACTA 4560

GGAAAAGATG GCTCTGAAAT CATAAGTATT CATCTTTCTT CAGGATTAAG TGGTACATAT 4620 

AACACTGCTT ACCAAGCAAG TCAAATGGTA GATGCTAATG TAACTGTTAT TGATTCAAAA 4680 

TCTATTTCTT TTGGTTTAGG GTATCAAATA CAACACCTAG TAGAGCTTGT AAAAgAaGGT 4740

GtCTCAACTT CTGAAATAGT TAAAAAGTTA AATCATTTAA GAGAAAACAT TAAATTATTT 4800 

GTAGTTATAG GGCAATTGAA TCAATTAATT AAAGGTGGCA GAATTAGTAA AACAAAAGGT 4860 

TTGATTGGTA ATCTTATGAA AATTAAACCA ATTGGTACAC TAGATGATGG TCGCTTAGAG 4920 

CTTGTGCmCA ATGOGAGAAC TCaAAATTCk AGTATCCAAT ACTTGAAAAA GGAAATTGCT 4980 

GAATTTATAG GAGATCATGA AATCAAATCC ATTGGTGTCG CACATGCTAA CGTCATTGAA 5040 

45 TATGTTGATA AATTGAAGAA AGTTTTTAAT GAAGCTTTTC ATGTGAATAA TTACGATATA 5100 

AATGTAACTA CACCAGTTAT TTCTGCACAT ACTGGTCAAG GTGCGATTGG CCTCGTAGTC 5160 

CTTAAGAAGT AAATTTAATC TTTTCAGTGT TAATTACTTC CATTTCAATC CTTTATAGAC 5220 

50 

TAAATTTATA ATTAGATAGA TAGAGGAGGT AATTCATATG ACAAAAGAAT ATGCAACATT 5280 

AGCAGGAGGA TGTTTCTGGT GCATGGTTAA ACCATTTACA TCATATCCAG GCATCAAGTC 5340 
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GAATCAAACC GGCCATGTCG AAGCAGTACA AATTACGTTT GATCCAGAGG TTACTTCCTT 5460 

TGAAAATATA TTAGACATAT ATTTCAAAAC ATTTGACCCA ACTGATGATC AAGGGCAATT 5520 


TTTCGATAGA GGCGAAAGCT ATCAACCAGT CATTTTCTAT CATGATQAAC ATCAGAAAAA 5580 

GGCTGCTGAG TTTAAAAAGC AACAATTAAA TGAACAAGGT ATTTTCAAGA AACCAGTGAT 5640

TACACCTATT AAACCATATA AAAATTTCTA TCCAGCTGAA GACTACCATC AAGATTATTA 5700 

10 

CAAAAAGAAC CCGGTACATT ATTACCAATA TCAACGTGGT TCAGOTAGAA AAGCGTTTAT 5760 

AGAATCACAT TGGGGGAATC AAAATGCTTA AAAAAGATAA AAGTGAACTA ACAGATATAG 5820 

AATATATTGT TACACAAGAn AACGGCACTG AACCACCATT TATGAATGAA TATTGGAATC 5880 

ATTTTGCTAA AGGATTTATG TAGATAAAnT TCnGGTAAAC CTTG 5924 
(2) INFORMATION FOR SEQ ID NO: 131: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 9280 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131:






GGCCGTTrsAA 


AATCTCCAAA 


ATAnAAAAAC 


CCATCTTGTT 


CCAATGTTTT 


AAAATCGCCa 


60 


TCCaACACTT 


GaTCaATAGC 


TTGCAACAAC 


GTTGAACGTG 


TTTTaCCAAA 


AGCATCaAAC 


120 


GCTCCCACTA 


AAATCAGTGC 


TTCAAGTAAC 


TTTCTCGTTT 


TGACTCTCTT 


CGGTATACGT 


180 


CTAGCAAAAT 


CAAAGAAATC 


TTTAAATTTG 


CCGTTCTGAT 


AACGTTCATC 


AACAATCACT 


240 


TTCACACTTT 


GATAACCAAC 


ACCTTTAATT 


GTACCAATTG 


ATAAATAAAT 


GCCTTCTTGG 


300 


GAAGGTTTAT 


AAAACCAATG 


ACTTTCGTTA 


ATGTTOGGTG 


GCAATATAGT 


GATACCTTGT 


360 


TTTTTTGCTT 


CTTCTATCAT 


TTGAGCAGTT 


TTCTTCTCAC 


TTCCAATAAC 


ATTACTTAAA 


420 




AAAAATAATT 


TGGATAAT6G 


ACTTTTAAAA 


AGCTCATAAT 


GTATGCAATT 


480 


TTAGAATAGC 


TGACAGCATG 


TGCTCTAGGA 


AAACCATAAT 


CAGCAAATTT 


CAGAATCAAA 


540 


TCAAATATTT 


GCTTACTAAT 


GTCTTCGTGA 


TAACCATTTT 


GCTTTGCACC 


TTCTATAAAA 


600 


TGTTGACGCT 


CACTTTCAAG 


AACA6CTCTA 


rrrrmTAC 


TCATTGCTCT 


TCTTAAAATA 


660 


TCCGCTTCAC 


CATAACTGAA 


GTTTGCAAAT 


GTGCTCGCTA 


TTTGCATAAT 


TTGCTCTTGA 


720 


TAAATAATAA 


CACCGTAAGT 


ATmTTAAT 


ATAGGTTCTA 


AATGCGGATG 


TAAATATTGA 


780 


ACTTTGCTTG 


GATCATGTCT 


TCTTGTAATG 


TAAGTTGGAA 


TTTCTTCCAT 


TGGACCTGGT 


840 
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ACACTTCTTA CACCGTCAGA CTCTAATTGG AATATGCCAG TCGTATCTCC TTGCGACAAC 960 

AATTCAAACA CTTTTTGATC ATCAAACGGA ATCTTTTCGA TATCAATATT AATACCTAAA 1020 

TCTTTTTTGA CTTGTGTTAA GATTTGATGA ATAATCGATA AGTTTCTCAA CCCTAGAAAA 1080 

TCTATTTTTA ATAACCCAAT ACGTTCGGCT TCAGTCATTG TCCATTGCGT TAATAATCCT 1140 

GTATCCCCTT TCGTTAAAGG GGCATATTCA TATAATGGAT GGTCATTAAT AATAATTCCT 1200 

GCCGCATGTG TAGATGTATQ TCTTGGTAAA CCTTCTAACT TTTTACAAAT ACTGAACCAG 1260 

CGTTCATGTC GATGGTTTCG ATGTACAAAC TCTTTAAAAT CGTCAATTTG ATATGCTTCA 1320 

TCAAGTGTAA TTCCTAATTT ATGTGGGATT AAACTTGAAA TTTCATTTAA TGTAACTTCA 1380 

TCAAACCCCA TAATTCTTCC AACATCTCTA GCAACTGCTC TTGCAAGCAG ATGACCGAAA 1440 

GTCACAATTC CAGATACATG TAGCTCGCCA TATTTTTCTT GGACGTACTG AATGACCCTT 1500 

TCTCGGCGTG TATCTTCAAA GTCAATATCA ATATCAGGCA TTGTTACACG TTCTGGGTTT 1560 

AAAAAACGTT CAAATAATAG ATTGAATTTA ATAGGATCAA TCGTTGTAAT TCCCAATAAA 1620 

TAACTGACCA GTGAGCCAGC TQAAGAACCA CGACXAGGAC CTACCATCAC ATCATTCGTT 1680 

25 TTCGCATAAT GGATTAAATC ACTTACTATT AAGAAATAAT CTTCAAAACC CATATTAGTA 1740 

ATAACTTTAT ACTCATATTT CAATCGCTCT AAATAGACGT CATAATTAAG TTCTAATTTT 1800 

rrCAATTGTG TAACTAAGAC ACGCCACAAA TATTTTTTAG CTGATTCATC ATTAGGTGTC 1860 

30 TCATATTGAG GAAGTAGAGA TTGATGATAT TTTAATTCTG CATCACACTT TTGAGCTATA 1920 

ACATCAACCT GCGTTAAATA TTCTTGGTTA ATATCTAATT GATTAATTTC CTTTTCAGTT 1980 

AAAAAATGTG CACCAAAATC TTCTTGATCA TGAATTAAGT CTAATTTTGT ATTGTCTCTA 2040 

ATAGCTGCTA ATGCAGAAAT CGTATCGGCA TCTTGACGTG TTTGGTAACA AACATtTTGA 2100 

ATC»AACAT GTTTTCTACC TTGAATCGAA ATACTAAGGT GGTCCATATA TGTGTCATTA 2160 

TGGGTTTCAA ACACTTGTAC AATATCACGA TGTTGATCAC CGACTTTTTT AAAAATGATA 2220 

ATCATATTGT TAGAAAATCG TTTTAATAAT TCAAACGACA CATGTTCTAA TGCATTCATT 2280 

TTTATTTCCG ATGATAGTTG ATACAAATCT TTTAATCCAT CATTATTTTT AGCTAGAACA 2340 

ACTGTTTCGA CTGTATTTAA TCCATTTGTC ACATATATTG TCATACCAAA AATCGGTTTA 2400 

ATGTTATTTG CTATACATGC ATCATAAAAT TTAGGAAAAC CATACAATAC ATTGGTGTCA 2460 

GTTATGGCAA GTGCATCAAC ATTTTCAGAC ACAGCAAGTC TTACgGCATC TTCTATTTTT 2520 

SO AAGCTTGAAT TTAACAAATC ATAAGCCGTA TGAATATTTA AATATGCCAG CATGATTGAA 2580 

TGGCCCCTTT CTATTAGTTA AGTTTTGTGC GTAAAGCTGT AGCAAGTTGC TCAAATTCAT 2640 
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CAATATCATT AATAATCAAT TGCCCTTTAG AACGTAATCG ACATCTGATT TCATTACCTT 2760 

CATCGACTGC AAATACCCAT ATTTTCAAGC CTTTGATGTC AGCAATTGTA TTAACAAACT 2820 

GAGATGCTTC ATTTGGCTGA ATACCGAATT GCTCCAATAC ATCTTCAGTT ATTTTAACTT 2880 

GGCAGAATCC ATCATCCATA AGTTCGAAAT GTTGTAAAAC ATAACCTTGA AACGGCAACA 2940 

TTTTTGGGTC CTTCTCCATC ATTTTATTTA AAAGCGCATT ATGATCAATA TCATGCCCAA 3000 

TTAACTTTCC AGCAATTTCC ATAGTATGTT CTGAGGTATT GTTAAAAAGG AATCGCCCAG 3060 

TATCACCGAC GATACCAAGA TATAAAACGC TCGCGATATC TTTATTAACA ATTGCTTCAT 3120

CATTAAAATG TGAGATTAAA TCGTAAATGA TTTCACTTGT AGATGACX3CG TTCGTATTAA 3180 

CTAAATTAAT ATCACCATAC TGATCAACTG CAGGATGAT6 ATCTATTTTA ATAAGTTTAC 3240 

GACCTGTACT ATAACGTTCA TCGTCAATTC GTGGAGCATT GGCAGTATCA CATACAATTA 3300 

CAAGCX5CATC TTGATATGTT TTATCATCAA TGTTATCTAA CTCTCCAATA AAACTTAATG 3360 

ATGATTCCGC TTCACCCACT GCAAATACTT GCTTTTGCGG AAATTTCTGC TGAATATAGT 3420 

ATTTTAAACC AAGTTGTGAA CCATATGCAT CAGGATCTGG TCTAACATGT CTGTGTATAA 3480 

25 TAATTGTATC GTTGTCTTCG ATACATTTCA TAATTTCATT CAAAGTACTA ATCATTTTCA 3540 

TACTCCCTTT TTTAGAAAAG TTGCTTAATT TAAGCATTAG TCTATATCAA AATATCTAAA 3600 

TTATAAAAAT TGTTACTACC ATATTAAACT ATTTGCCCGT TTTAATTATT TAGATATATA 3660 

TATTTTCATA CTATTTAGTT CAGGGGCCCC AACACAGAGA AATTGGACCC CTAATTTCTA 3720 

CAAACAATGC aAGTTGGGGT GGGGCCCCAA CGTTTGTGCG AAATCTATCT TATGCCTATT 3780 

TTCTCTGCTA AGTTCCTATA CTTCGTCAAA CATTTGGCAT ATCACGAGAG CGCTCGCTAC 3840

TTTGTCGTTT TGACTATGCA TGTTCACTTC TATTTTGGCG AAGTTTCTTC CGACGTCTAG 3900 

TATGCCAAAG CGCACTGTTA TATGTGATTC AATAGGTACT GTTTTAATAT ACACGATATT 3960 

TAAGTTCTCT ATCATGACAT TACCTTTTTT AAATTTACGC ATTTCATATT GTATTGTTTC 4020 

TTCTATAATA CTTACAAATG CCGCTTTACT TACTGTTCCG TAATGATTGA TTAAAAGTGG 4080

TGAAACTTCT ACTGTAATTC CATCTTGATT CATTGTTATA TATTTGGCGA TTTGATCGTT 4140

45 AATTGTTTCA CCCATCTGAG GCTGTCTTCC TAAAAGTTGC ATAGACTTTA AAACATCTTG 4200 

TCTATTAATC ACACCCACTG TCTTTTTATT ACTCGAAACG ACAGGAATCA ATTCAATACC 4260 

TTCCCAAATC ATCATATGCG CACAACTTGC TACTGTACTC ATAGCATTTA CATAAATAGG 4320 

50 ATTTCGCGTC ATCACTTTAT CTATTTCGTC GTCGTCCTTT GTATTAATCA TCTCTCGACT 4380 

TGTTACAATA CCTACTAATT TATACGACTC ATTGACTACC GGAAATCTTG TATGGCCAGT 4440 
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TTCAACTTCA AACACGATAC CCATTGGCAT 
GTGTAAAACT TGAACTATGA AATGCTTGAA 
5 CTTTACCACC AaTATTCATA CCATX3AACAT 

CAGCCATTTC TGCACTTCCA ATTACTGTTG 
CACCAAAATG ATCAAAATGA CCGTGCGTTA 

10 

TATTCAAATC ACACTTAGGG TTATTTGAAA 
TCCCTTCTAA ATAAATCGTT GATTGACCAT 
ATCAATTACT ATATAAATTT AGTACCCTTT 

15 

TTAAAAATTG AAAATCTAGT TAATGTATTA 
ATAAAATGGA AGTAATATTA TGTTGAGGAG 
AATAGACGAA TTGAACAATC AACAAGCTGA 

20 

ATATTATTTT ACT6GATACC GTAGCGAACC 
GAAAGATGGT AAACAAGTAC TATTTTGTCC 

25 ACCTTTCACA GGTGAAATCG TTGGATATTT 

TCAAACAATC AATAAATTAC TAATTGAAAG 
ATTAATCTCT GGTTTCAATG TCAATTCATT 

^ GAGAAATATT AAATCCGAAG ATGAAATTAG 

TAAGTGTATC GAAATAGGTG TTTCTTATTT 
CAACCATATT GAGCAAACTA TCAAACAATA 

^ GGTTTTATTT GGAGATCATG CCGCATCACC 

AAGCAATGAA TATGTACTAT TTGATTTAGG 
GACACGTACT ATTAAATTTG GTGAACCTAG 

40 

ATTAGAAGCA GAAACATCTG CAATCCAAGC 
CGATCATATC GCTAGAAATA TTATTTCAGA 
CTTAGGTCAT GGCCTAGGAT TACAAGAACA 
TAATTTGTTA GAAGCTGGCA TGGTTATTAC 
TGCAGGTGTA AGAATTGAAG ATGACATACT 
50 ACATTACGAA AAATAAGGAG TGGGATAAAA 

TCAGTCAAAC ACTGCCAATA TAACATTGTA 

55 



ACCTAAATAA ACTGGgAATA CCATTTTCAT 6360 

CAAATTTAAC GCTTCC6AAA TCAAaGTTTG 6420 

TTTCAACACC GTGATATGAA GAAAGATAGT 6480 

CTCCTGTTTT CTTTGCTAGT TCCACAACAT 6540 

AAACGATATA GTCTACCTGC ACTGTTTCAA 6600 

TAAACGGATC TACGATAACC TTTTTGTTGT 6660 

GAAATGATAA CTTCATTTGA GCATCCTCCT 6720 

TGCCACTTAA TTATAACAAA TTCTCAAATT 6780

GCTCGATTTT GAAATCTAAT AATAATTGGC 6840

TGTTTATAAA ATGACAAAAA TATCAAAAAT 6900 

TGCAGCATGG ATTACAACAC CGTTGAATG7 6960 

CCATGAAAGA TTATTTGCAT TATTGATTAA 7020 

AAAAATGGAA GTCGAAGAAG TCAAAGCATC 7080 

AGACACTGAA AACCCTTTTT CACTTTATCC 7140 

CGAGCACTTA ACAGTA6CAC GCCAAAAACA 7200 

CGGAGATGTT GATTTAACAA TCAAACAATT 7260 

CAAAATACGT AAAGCTGCTG AGTTAGCAGA 7320 

AAAAGAAGGT GTGACTGAAT GTGAAGTAGT 7380 

TGGCGTCAAT GAAATGAGTT TTGATACGAT 7440 

TCATGGCACA CCAGGAGATC GCAGATTAAA 7500 

TGTAATTTAT GAGCATTATT GTAGCGATAT 7560 

CAAAGAAGCA CAAGAAATTT ATAATATTGT 7620 

AATTAAACCT GGAATACCAT TAAAAGATAT 7680 

AAAAGGTTAT GGTGAATATT TCCCTCATCG 7740 

TGAATATCAA GATGTTTCAA GTACTAATTC 7800 

AATCGAACCA GGTATTTATG TACCTGGTGT 7860 

TGTCACTAAT GAAGGATATG AAGTATTAAC 7920 

ATGAAAAGCT TGTTACAAGC GCATTCTCAT 7980 

GCGCCTAAGA CATAAATTTT TATCCAAGTC 8040 
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T6TAATGAAT 


CAAATCAATA 


TCATTCATGT 


TCGATGATTT 


CTTCGCATTG TTTCTAGCTT 


8160 




TAATTTATCA 


TTATTTAATT 


TTAATAACCA 


AGGAGATGAT 


AACGTCATTC 


TTTAOTACGC 


8220 


5 


TGTAATCCAT 


TCCCTTTTCA 


TCAAATTCAA 


ATTATAATTG 


TAATGCTTCT 


TCTACAGATT 


8280 




TATATTCCAT 


TTCAAATGCC 


TCTGCAACGC 


CTTTATTGGT 


TACGTGACCT 


TTGTAAGTAT 


8340 




TTAAACCTAA 


TGATAATGGT 


TGATTTGATT 


TAAATGCTTC 


TCTATACCCT 


TTATTAGCTA 


8400 


10 


6CATGAGCGC 


ATAAGGTAGC 


GTAgCATTAT 


TTAAAGCTAA 


CGTCGAAGTA 


CGCGGTACTG 


8460 




CACCTGGCAT 


ATTTGCAACT 


GCATAATGAA 


CCACACCATG 


CTTAATATAT 


GTAGGATCAT 


8520 


75 


CATGTGTCGT 


AATTTTATCA 


GTTGtTTCAA 


AAATACCGCC 


TTGATCAATA 


GCAATGTCAA 


8580 


TAATAACTGA 


CCCATTTTTC 


ATTTGTTTAA 


TCATGTCTTC 


TGTTACAAGT 


CTTGGCGCTT 


8640 




TAGCACCTGG 


AATTAAAACT 


GCACCTATTA 


CTAAATCACT 


TTGTTTAACA 


TACAACTCAA 


8700 


20 


TATTCAACGG 


ATTTGACATA 


ATTGTATGTA 


CACGTCCACC 


GAATAAATCA 


TCTAATTGTT 


8760 




GTAAACGCTT 


TGGATTAACA 


TCTAAAATCG 


TAACATCTGC 


ACCTAGTCCT 


AGTGCAATTT 


8820 




TAGCTGCATT 


TGTTCCTGCT 


TGACCACCAC 


CGATAATAGT 


TACTTTACCC 


TTAGGTACTC 


8880 


25 


CTGGC5ACACC 


ACCTAGTAGA 


ATTCCCATAC 


CACCATTAAG 


TTTTTGTAGG 


AACTCTGCGC 


8940 




CAACTTGAGC 


TGACATTCTT 


CCTGCTACCT 


CACTCATTGG 


TGATAACAAT 


GGTAAAGATC 


9000 




GGTCTGGTAA 


CTGCACAGTC 


TCATATGCAA 


TACTAATTAC 


TTTTCTATCT 


ATCAAAGCTT 


9060 


30 


GTGTTAATTT 


TTCTTCATTT 


GCTAAATGAa 


gatAaGTGAA 


TAATACAAGC 


CCTTCTTTAA 


9120 




AATATGGATA 


TTCAGATTCA 


AGTGGTTCTT 


TAACTTTAAT 


AACCATATCC 


ACATCCCAAA 


9180 




CTTTTGCTTG 


TTCAGCAACA 


ATCTCAGCAC 


CTGCTTCTTT 


GTAATCTACA 


TCTTCAAAGA 


9240 


35 


ATGATCCTGA 


ACCCGcATTT 


GTTTCCACTA 


AAACAGTATG 






9280 



(2) INFORMATION FOR SEQ ID NO: 132:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4669 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132:
CTGATTAATC TCTTGTTGTC GTGTATTTAC TAATTGAATC GTTGGTGTCT GAACACGTCC 60 
CAGGGATAGC TGTGCATCAT ACTTTGTTGT TAGTGCACGC GTTGCATTAA TCCCAACAAT 120
CCAATCTGCC TCACTTCTCG CTAACGCTGC ATAATACAAA TCGTTATATT GACGACCGTC 180 
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TTCAATAATC CTACTGTAAT TATTGTCGGT CTTATTTCTG GTGCATTATG GGCGTTTGGA 2100 

CAAGCGAATC AGCTTAAATC TATTAGTTTA ATCGGTGTAT CAAATACTAT 6CCAGTTTCT 2160 

^ ACAGGTATGC AATTAGTTGG TACAACATTA TrCAGCGTTA TCTTTTTAGG TGAATGGTCT 2220 

TCAATGACTC AAATTATCTT TGGTTTAATC GCCATGATAT TATTAGTTAC TGGTGTAGCA 2280 

CTTACTTCAC TTAAAGCTAA AAATGAACGT CAATCAGATA ATCCTGAATT TAAAAAAGCA 2340 

10 

ATGGGTATTT TAATTGTATC TACAGTTGGA TATGTAGGTT TCGTTGTACT TGGTGACATC 2400 

TTTGGTGTTG GTGGAACTGA TGCATTGTTC TTCCAATCTG TCGGTATGGC AATTGGTGGC 2460 

TTTATCCTAT CCATGAATCA TAAAACATCA CTTAAATCAA CAGCACTTAA TCTATTGcCA 2520 

IS 

GGT6TGATTT GGGGAATTGG TAACTTGTTC ATGTTCTATT CTCAACCAAA AGTTGGTGTA 2580 

GCTACAAGTT TCTCATTATC ACAGTTACTT GTTATCGTTT CAACCTTAGG CGGTATTTTC 2640 

20 ATTTTAGGAG AAAGAAAAGA TCGTCGTCAG ATGACGGGTA TTTGGGCAGG TATTATTATT 2700 

ATCGTGATAG CTGCTATAAT TCTAGOTAAT TTGAAATAGA AAGTTAAATA CTCATGTAAC 2760 

GTAAAAATGT AATCACTTCT GAAAATAACC ATTCACTTAT AGAATGATTA AAATTAATTT 2820 

25 TCGGGAATTT TACGTTGAAT GTTCCTCTAT ATGTCCTAGG AAATACGTGG CTCTAAAAAC 2880 

AAAACX5CAAT AACACATCAT GACATTAATC ATGCGTTTTA AGACTTTAAA ATTAGCGATA 2940 

CTTTTAAAAT CTTGATGATA TTCATATATC AAGTATOCGC CATACATATG AAGTGGATAG 3000 

^ CTGCATAACG CACTGCATTA TCAACTTGAA TGTATGAGTT GAACAACTAT GTCATAAATA 3060 

AAAGCCCCCT TTTCACAATA TACATTTACA TATTGTGGTA AAGGGGGCTC TCATTTTCTA 3120 

CGAATACTAA AATGGATTTT ATTTTCAAAT GTGTAAACTA GACAAACACT GCCTGATACA 3180 

35 

CX3TACAAAAT AATGATACTA ATAATGATTG TCAAATTGGT CGTCATACCT ATAAATGGCA 3240 

GTGTTCGATA TTTAAACTGA ATACCATAAG AAATAATTGC AACACcTACC GGGAACATCC 3300 

AAGTGACCAA CAATGTCGTC TTAATCATAT CATCTGATAC TGGTAACAAO ACATATACTA 3360 

40 

ACAATCCCGC AACTAATGCT AATCCATAAT GCAAACATAA ATATTTAATA GTAGCAGGTA 3420 

TATACTTTCT TTCCAGAGTA AAATTCAACA TGACACCTAG CAAAATCATT GATAACGGCA 3480 

45 TATTTGCATG GGAAAGTATG CTAAAGAAAT CXSATTGCCAC ATGTGGTAAA TGGATGTGAC 3540 

TTATATTCAA TATAAACATT ACAATGTATG TAACGAGTGG CACTGATTGT AATAATTTCT 3600 

TACCTAAATA TTTAAAATCG AATTGATCAC TACCTTCACT AAAGTAGCTA CCTACAAAGT 3660 

^ AAGTAATTCC AAACATCACA AAGGCACCAC CTATATCAGC CATAACAAAA TAAATAAGTC 3720 

CCGTTTTAGG CCATATCACT TCAATTAGTG GATATGCAAA CAATCCAATA TTCATAGCAC 3780 

55 






CAATCATTTT 


CGCCACAATA 


CCATATATAA 


TCATTAAAAT 


TGGTAAAATG GAGAATGACA 


3900 


ATTTTAATTC 


TGCACTGTTT 


AAATTCACAA 


TAACTAAAGA 


TGGGAGTGTG ACATTAAGAA 


3960 


CTAATGTAGC 


AATGACTTGA 


CTATCTGTTG 


CTTTTATAAA 


ATTAATGCGC 


TTCAAAAAGT 


4020 


AACCAAGCGC 


AATTAATAAA 


ATAATCATAG 


TAAATTGTTC 


TGTCACTGTT ATCCCTTCTT 


4080 


TCAATAATCT 


TCATAATTTA 


TAACTTTAAC 


ATACTCCACA 


GATATTTTAG AAGTCTACTG 


4140 


TTTCATGCTA 


TAATCTACAT 


TAAATGCACT 


TAATTATATT 


TCAAAGGAGT 


GTTATAGTAT 


4200 


6TCTTTAGAA 


AACCAACTAG 


CCGAACTTAA 


ATATGATTAT 


GTTCGTCTTC 


AAGGTGACAT 


4260 


AGAAAAACGG 


GAATCTTTGA 


ATTTAGATAC 


TTCCGCACTT 


GTTCGTCAAC 


TTAAAGATAT 


4320 


TGAAAATGAA 


ATTAGAAACG 


TTCGTGCTCA 


AATGCAAGAT 


TAATAATCTA 


TCATTCAAGC 


4380 


AATAAATGCT 


TTTTGTTACA 


TAAATTTGAC 


TAGCATTGCT 


CTGAATACGT 


TATATTGATG 


4440 


AATTGCTTCA 


TTTTTCGCTC 


AATTACATCT 


AGAATCACAA 


GATGTTGTCG 


TGTTATGATT 


4500 


TAGTGTTTCA 


TTAACAACAT 


ACAOGCATAT 


CTATCCCAAC 


ACTGCTATTT 


ATGTTTTCTA 


4560 


CGCTGnTGTA 


CTACATGAAC 


CCTTTGAAAC 


GGAGAGGAAG 


TTATCATATG 


CAATTTTAnC 


4620 


TGATTTTACT 


AGCAATACTT 


TAACnAATTG 


nTAGTTTAAT 


AGAATTTTA 




4669 



(2) INFORMATION FOR SEQ ID NO: 133:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2785 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133:

TTTQCACCCA TCTGaTACAA TGCACCATGC GGTTTAACAT GATTAATTTT AACTTGATGA 60 

ATGCGACAAA ACCCTTGTAA TGCACCTAAT TGATAAATCA TCAAATTATA AATCTCGTCG 120 

40 

TTAGAGATAT CTATATTTCG TCTGCCAAAG CCTTTCAAAT CAGGTAAACC AGGATGTGCA 180 

CCTACTGCAA CATTATGTGC TTTGGCAAGT TTTACCGTTT CATTCATTAC ATTTTCATCA 240 

CCAGCGTGAA AACCACAAGC AACATTCGCA CTTGTAATTA ACGGAATAAT TTGATGATCA 300 

45 

CCACCAAAGG AATAATTTCC AAATGCTTCG CCTAAATCAC AATTCAAATC AACTCGCATT 360 

ATAATTCCAC CCCTTTAACA ATTTGATGTT TTTCTAAAAA TTTAATATCA ACATCTTTTG 420 

50 CATCTCCATC ACGATATAGT GGATAATTTA AAACTGCATA TAAAAAATCG GCAGTTGTAG 480 

AAAATCCATC TATCACCATT TCATCTAAGG TGACTTTCAA CTTATCAATT GCTGAAGCTC 540 

SS 
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AACCGTGATA TAGTAAAGAA TCGACTCGCA CATTAAAGCC TTGAGGTAAA TGTAACGCTG 660 

TCACTTTACC TGGTGTTGGT TGAAATTTCT TTTCaGGATT TTCGGCATTT ATTCTCX3CTT 720 

CTATCACATG ACCATTAAAT TGAATATCGC TTTGTGAAAA AGGTAAATGA TTATGTTCCA 780 

ATAAATACAG TTGTGCTGCA ACCAAATCAC GTTCTGCTCG CATCTCTGTA ACAGTATGTT 840 

CAACTTGTAT TCGAGCATTC ATTTCAATAA AGTAATGTGC GGTATCAGTT ACTAAAAATT 900 

CAATCGTACC TGCACTTCTA TAATTTGCTG CACGTGCAAC TTTAACAGCA TCGTTACATA 960 

TTTGTTGTCG TCTTTCTTCA GTTAATGCTG CACAAGGAGA TTCTTCGATT AATTTTTGAT 1020 

TTTTACGTTG TACAGAACAA TCACGTTCCC CTAAATGTAC ATAATTATCC TGCCCATCTC 1080 

CCaTAACTTG AACTTCAACA TGTTTTGcAA CAGGTATAAA AGCCTCAACA TAAACACGAT 1140 

CATCATCAAA GTATTTTTTT CCTTCACTTT TAGCTTCTTT AAATGCCTTT TCTAAATCTT 1200 

CAGCTTTCTT TACAATACGT ATACCTTTAC CACCACCGCC ACTCGCAGCT TTGATAACAA 1260 

CTGGATAACC GATGTCTTTG GCAAGATTCT CAATTTCAGA CACATGATTC ACAGCACCAT 1320 

TTGATCCTGG AATCACAGGA ACACCTGCAT GATGAACTGT TTGTCTTGCT GTTATTTTAT 1380 

CCCCCATCAT TTCCATCGTT TTTTTAGTAG GCCCTATAAA CGCTATGCCT TGTTCCTCAA 1440

CGGTTPGAGC AAATTTTGTT GATTCTGATA AAAAGCCATA TCCTGGGTGA ATTGCATTAG 1500 

CACCAGTGAT TTCTGCAGCA GATATGATGC GGTCAATATT TAAATAACTA TCTAAAgCAT 1560 

TArcwTCCCC AATACATATA GCTTGATCTG CTAAATGTAC ATGCAAGCTT TGCTCGTCCC 1620

CTTTT6CATA AACTGCTACA GTTTCAATCC CATATTCTCT GCAAGCTCTT ATAATCCTTA 1680 

CAGCAATTTC ACCTCTGTTC GCAATTAAAC AACGAAGCAT TTACTTACCC CCTTTACTTA 1740 

ATACGTACCA AAACTTGGTC GTATTCAACA TTTGTGCCAT GATCAGCTAC TATTTCAGTA 1800 

ATTtCTCCAG CAACATCTGT TGTTACCTCG TTTAATACTT TCATCGCTTC AACATATCCT 1860 

ATAATATCTC CCTTGTTAAC TTTGTCACCG ACATTCACAA TTGGTTCAGT TAATTCTTTA 1920 

CTATCTTGTA AAAAGAATGT ACCTATCATT GGTGATTTAA TGTCATGATA ATCATTTGTC 1980 

GAAACATCGG AGTTATCATT CGCTTTTGAA GCTGTCAAAT CATTATTGTT CATACTTTGA 2040

TTTGATTGAT TACTGTGTGC AGCCAAATGA TTCGAGTCAG TGAAGTCAAT TTCTATTTCA 2100 

TCTTCAAAAT TTTTATATTT AAATTTCTTA ACATCATTTT CCTTCACTAA TTTGATTATT 2160 

TGTTCGATTT nlTCAATATT CATTTTACAA ATCCCCTTTT AAAATTGTTG CTAATTTTTT 2220 

CGAAGTATGT CGCAAGCTAG ATGTATCAAA AATTGGAGTC TTTTGATGAC TCTTAAGAAT 2280

TTCATTAAAC AGAGACATTT GTTCCCGATT CTTATCTACA GCTTCTTGGA ATGATATCCA 2340 
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TACAGTTGCA ATTTTGGTAT AACCACCTAT CGTTTGTTTA TCATTAAGCA QAATAATAGG 2460 

TTGACCATCA TTTGGTACCT GAACACTACC AAGAGCAACC GGTTCAGAAA TGATATCTGC 2520 

TTGATTAAtT GGTGCAACGC TGTCACCTTC CAAACGATAG CCCATACGGT CTGATTGTTC 2580

AGTAATTAAA TATGGATGAT TTACAATTTT CGCTCTAGCC TCTTCAGAAA ATGCCTCGAA 2640 

TTGAGGTCCT TGAAGAATGT GTATAATATT ATTTTCTGGC AATAAATCGT CCTGTAAATG 2700 

AATCGTCTTT CCAATGTTTT CTTTAAAGTC ATTATTTATT TTCACTGTTA TTACATCATC 2760 

AGCTAATAAC TTTCTACCTT TGAAT 2785 
(2) INFORMATION FOR SEQ ID NO: 134: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1010 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 

AATGGAAACG GTTGAAACAG CAATTATTAC TATTTCTATG GGTGAAGGTA TTTCAGAGAT 60

ATTTAAATCA ATGGGTGCCA CACATATCAT TAGTGGTGGA CAAACGATGA ATCCTTCTAC 120 

AGAAGATATC GTTAAAGTCA TTGAACAATC AAAATGTAAA CGTGCAATTA TTTTACCGAA 180

TAATAAAAAT ATCTTAAT6G CAAGTGAACA AGCAGCGAGT ATrOTTGATG CAGAAGCTGT 240

TGTTATTCCA ACGAAATCTA TTCCTCAAGG TATAAGCGCA CTATTCCAAT ATGATGTGGA 300 

CGCAACACTT GAAGaAAATA AAGCGCAAAT GGCTGATTCA GTAAATAACG TTAAATCTGG 360 

TTCATTAACG TACGCTGTTC GTGATACGAA AATTGATGGC GTTGAGATTA AAAAAGACGC 420 

GTTTJLTGGGC TTGATTGAAG ATAAGATTGT TUVGCAGCCAA AGTGATCAAT TAACAACGGT 480 

TACTGAGTTG TTAAATGAGA TGTTAGCAGA AGATAGTGAA ATATTGACTG TGATTATTGG 540 

TCAAGATGCA GAGCAAGCAG TTACAGATAA CATGATAAAC TGGATCGAAG AGCAATATCC 600 

AGATGTAGAA GTGGAAGTTC ATGAAGGTGG ACAACCAATT TATCAATATT TCTTTTCAGT 660 

AGAATAAAAA TTTAAAATAA AAAACTACCA ATGATAAATC ATCAGTTGGT AGTTTTTTAT 720 

TTTGCTATTT TAGTGATATT GCGGGTTAAA AGTATCGTTC TCGAGTTGCT AACAATGTCA 780

TGTTCAACTT AGTCATGATA AAATAAATAA CATACTAAAT GATACGTAAA ATCAAATAAA 840 

ACATAGGTGA TTTATTTTGG CTAAAGTAAA CTTAATAGAA AGTCCATATT CTCTTTTACA 900

ATTAAAAGGT ATAGGTCCTA AGAAAATAGA AGTATTGCAA CAACTAAATA TTCATACAGT 960 
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(2) INFORMATION FOR SEQ ID NO: 135: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1540 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135: 

TGTAGTTGAA CATGAACAAC AAAAGAAAGA AAAGACAAAA AAGCAATACA AGCCATTTTG 60 

GATTGTCATG AGTTTTATAA TACTTATAGT TGTACTATTA CTCCCGGCAC CTTCAAGTCT 120 

GCCGATAATG GCTAAGGCAG TACTAGCTAT TTwAGCTTTT GCAGTTATTA TGTGGGTAAC 180 

GGAAGCTGTA TCATATCCGG TGTCAGCAAC TTTAATTATT GGCTTAATGA TATTACTTIT 240 

AGGATTTAGC CCTGTTCAAA ATTTAGGGGA GAAGCTAGGT AATCCGAAAA GTGGCAGTGC 300 

TATTTTAGCT GGAAGTGACC TTCTAGGAAC TAATCATGCA TTATCATTAG OGTTTAGTGG 360 

ATTTGCAACT TCAGCTGTAG CTCTCGTTGC AGCTGCATTA TTTTTGGCTG CTGCTATGCA 420 

AGAAACGAAT TTGCATAAAA GACTAGCTCT TTTAGTGTTA TCAATTGTTG GTAATAAAAC 480

TAGAAATATA GTTATTGGAG CAATTATCGT TTCAATTGTA CTTGCATTTT TCGTTCCTTC 540 

TGCAACAGCT AGAGCAGGGG CAGTTGTACC AATCTTGCTG GGTATGATTG CGGCATTTAA 600 

AGTTTCCAAA GATAGCAAGT TAGCGTCTTT ATTAATAATT ACTTCAGTAC AAGCTGTGTC 660

AATTTGGAAT ATTGGTATCA AAACGGCGGC AGCACAAAAT ATCGTAGCGA TTAATTTTAT 720 

AAACCATCAA TTAGGATTTG ATGTTTCATG GGGCQAGTGG TTCTTATATG CAGCGCCTTG 780 

GTCCATAGTT ATGTCCGTAG CTTTATATTT CATCATGATT AAAGTGATGC CTCCAGAAAT 840 

TAAT&CAATA GAAGGTGGTA AAGATTTAAT AAAAGAAGAA TTGCATAAAC TTGGCCCCGT 900 

TAGCCCACGT GAATGGCGTT TAATTGTTAT ATCGATGTTA TTATTACTGT TTTGGTCAAC 960 

TGAAAAAGTA TTACATCCGA TTGACTCTGC ATCCATTACT ATTATTGCTT TAGGTGTTAT 1020 

GTTAATGCCG AAAATTGGTG TCATGACATG GAAACATGTT GAAAATAAAA TACCATGGGG 1080 

AACAATTATC GTGTTTGGTG TAGGTATTTC ACTAGGTAAC GTTCTTTTGA AAACAGGTGC 1140 

AGCTCAATGG TTAAGTGATC AAACTTTTGG TGTTTTAGGT TTAAAACATT TACCTATTAT 1200 

CGCGACAATT GCACTTATCA CGCTTTTTAA TATATTGATT CATTTGGGCT TTGCGAGTGC 1260 

AACAAGTTTA TCATCAGCGT TAATACCTGT TTTTATTTCG CTAACCTCTA CGTTACACTT 1320

AGGAGACCAG TCTATAGGAT TTGTTTTAAT TCAACAATTT GTTATTAGTT TTGGTTTCTT 1380 
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AGATTTCTTG AAGGCAGGTA TACCATTGAC AATTGTAGGG aATAtCtAgT GaTAGTTTTT 1500 
AGCATGACTT ATTGGAAATG GGTAAGGTTG CnTTAATTAA 1540 
(2) INFORMATION FOR SEQ ID NO: 136:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 11823 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136: 
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25 






ACTTCTCACA ATAAGAAATA TGAAATTGTT ATGTGTTAGT TGAGATTCAG TGATGAATTA 60

CTTTTATCAT TTAAAATGTT GTTATCATTG TCATGCGTTA CCAAATCGCT TACGTATACA 120

CGATTCCCAA TCTTAACATA GAOGATTTGT ATATCAGAAT TTTCTGATTA CTAACAGTTT 180

ACCTAAGTTT AAATATCTGT TCAATGATTT TCAGTTATTT TTAAAAGAAA AATCGTAATG 240

CTGCCATGAT AACAATCCCA CTAATAATT6 TAATAGTTAA AtACGCGTGA TTATAGATAA 300

AATAACCGTC GGAATGAGCG CGATAATGTA AGGGATGTTT AATGTATACC CCTCACCATG 360

AGGCGTCTGT TGAATAATGC TGTCAATGAC AAGTGCCGTA AATAGTGTGA TTGGGATAAA 420

TGATAGCCAT CGAACCACGA CATCAGGCAA TTGCACTTTT GAAATCATGA TAAAAGGTAT 480

AATTCGAATT AATAGCGTTA CGATACCACA CAATAAAATA AGTATTAACA TGTTCATATG 540

AGTTATCATT GTTCCATCAT CACTCCTAAC GCTGCTGAAA TTGTGGCTGC AATTAATATT 600

GCTAGATATG AAGGCATAAA CATACTTAGC GATAACATCA TTACTATGAC GGCAATAATG 660

AGTACTATGT AAATTCTTAA TCGCGATTTA GTAATTGATT CAAATTGCGC AATGGCCAAA 720

AAGMAAACA TAGCCGTGAT AGCAAAATCT AACCCTAGCG TTTGCGGATT TGAGATATAT 780

TCGCCAAATA AAGCCCCAGC TACACATGAA ATTGCCCAAA ATAAATATGC TOTGATGTTA 840

AGACCATGCA TCCAACGATC ATTGATAGCT TCTCCTTTTA AATAAGGTGT AATGGCGACG 900

CCAAACGTTT CGTCAGTTAC TAATGAACCT AATCCAACAC GGTTCCAAAA CCCATATGTC 960

TTGAAGTTTG GTGCAAGCGA CATACTTAAA AGGAACATTC TTGAATTTAC GATAAATACA 1020

GTTAGTACAA TCGCTGATAT AGGTGTACCT GCTATAAACA ACGCGCACAT AATAAATTGC 1080

GCAgcaCCGG CATATATAAC AAGACATAAC AAGACAATTT CTAAAATACT AAAGTTTTGA 1140

GACGAAGCCA CAATACCAAA TGAAATACCA ACACCGGCAT AACCCAATAA TGTTGGGATA 1200

CACTCTTGCA CGCCTTGTCT AAAACTTAAA TGTGTTGTCA TCTCAATTAC CTCCTTTGCC 1260
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75 
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2S 






TCAATTTTGC TCTTAACTGT GTCAGGCTCA TTTCTGAATA ATCTAATGTC TAACATTAAC 3180 

CTTCATCCTT TCCCAAATAA TTATCATTTA TTATGGAATG ACGTACGTCT TTATTTTTTA 3240 

GAAAATAAAA AAAGACCACA TCCCTACAAG GGACGTGGTC TACGCGTTGC CACCCTATTT 3300 

AACAATTTAA GTTATAAAGA TACACTAAAC CTAAATTGCA CTTCACTAAA ATAACGGTTA 3360 

TCACCGATTG TTCTTTTAAA TTAAGTAGGT AGATTCATAT ATATGTTGAT TCTTGTrCAC 3420 

ACTAACCACA AGCTCTCTGA TATCGAACAC TATATATTAC TTGTCCTACG AACAATGTCT 3480 

TATTAAGTTA TTTTTAATAT AGCAAACTAT ATTTGCTTTT TCAAGTAACG ATTTCAAACA 3540 

TCACTCATCT CGATTTAGTG ACATGCAGTC GTTTGATAAA TTGATTGCTT TAAATACTGT 3600 

GCAACCGCTT CAATATCTTT ATGAAATTGA CGATCATGTG TAATGGATGG CACGATACTT 3660 

CGAAACTCAT CATACTTGCG ACGTGTTTTT GGTGATAATC CTTCAACACC TTTTAACTCT 3720 

GCTGCTTGTA ATGCAATAAC ACATTCGATT GCCAGCACAC GTCTTGCATT TTCAATAATT 3780 

TGATAACCAT GTCTAGCAGC TGTAGTTCCC ATAGATACGT GATCTTCTTG GTTCGCAGAT 3840

GAAOTGATAG AATCAACACT CGCTGGATGC GCTAAAGTTT TATTTTCAGA AACGAGACTT 3900 

GCAGCAGCAT ATTGCATAAT CATCGCGCCA CTTTGCAATC CTGGCTCTGG ACTAAGAAAT 3960 

GCTGGTAAAT CACCATTTAA TTGAGGATIT ACTAGTCGCT CTAGACGACG TTCCGATACG 4020 

TTTGCTAATT CACTTACACC TAATTTAAGA TGATCTAATG CAAAAGCAAT AGGTTGTCCA 4080 

TGGAAGTTAC CACCTGAAAT AACAAACGTT TCATTTGCTT CCTCAAATAT AAGTGGATTA 4140 

TCATTAGCCG CATTCATTTC AAATTCTAAT TGCTGTTTAA CATAATTGAA TACTTGAAAA 4200 

CTCGCGCCAT GGATTTGTGG TATACAACGC AACGTATATG CATCTTGTAC ACGTATTTCT 4260 

GATTGTCGCG TCGTTAATGT TGATCCTTCT AACCAATCAC GCATACGCGC TGCCACATTA 4320 

ATCrerTCTT GAAAATTACG AACTGCGTGC ACATCATGTC GATATGCATC TATAATGCCA 4380 

TTAAGAGACT GATGCX3TTAA TGCAGCAATC CATTCAGATT GGTAACCTAA ATCTTCTGCT 4440 

TCTATATAAC TAATGACACC TTGAGCTGTC ATAGCTTGCG TACCATTAAT CAATGCTAAA 4500 

CCTTCTTTAG CCTGAAGGTT CAAAGGTTGT CTATTTAATT CTCTTAATAC ATCGTCACTA 4560 

TCCTTTTCTT CCCCTCTGTA CAATACTTTC CCTTCACCAA TTAATGCTAA TGCTAAATGT 4620 

GATAATGGCG CTAAATCTCC TGATGCACCG AGAGAGCCTT GCTGTGGGAT TATCGGTATA 4680 

ATACGTTCAT TTATAAAAAA TTGTAATTGT CTCACTAATT CTAAAGTGGC ACCTGAATGA 4740 

CCTTTTAATA ATGTATTCAA TCGTAAAATC ATCATGACTA ATGCTACTTC TTTTGAAAAT 4800

GGCTCACCTA GTCCACAGGC ATGTGAGCGT ATCAGATTCA CTTGTAATTC ATTATATTGC 4860
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TCCTCATTTT 


CAATAATACG 


TTCAACTACC 


GCTCTACTTT 


TTTTGACACG 


TTCTAACGCA 


4980 




TCATCAATAA TTTCAATCTT TGATTGTTGT TGTAAAAATG ATTTAATATC CTCAATTGTT 5040

AGTGTTTCAC CATCTAAATA TAAAGTCATA TATGTTACCC CCTTGTTTAT ATTAAGTAAC 5100

CCATCCTTCT TGAAGTATAC GTTTTCATTT TTATTGAAAC AATGGTTTTA CGTACATTTA 5160

TAACCTATTA TCAGAGCACT ATTGTAGTGC GTTAAAGGAT ATTAAGATTG TTGTAAGCAT 5220

ATTTAATAAT TTATCTATTG ACGAATTGCA TATACAGGTA TAGTATTTTC TATTGTATTT 5280

AACGACAAAT AATAATGAAT TCAGAAATTT ATAATACATT TTGTTAAAAG TTACTATATA 5340

TTTTTAAAAT TGAATAAATT CX3GAAAAGGC TTTTACATGG GAGGTTATAT CACTATGGAA 5400

ACGTTAAATT CTATTAACAT TCCTAAGCGT AAAGAAGATT CACATAAAGG TGATTATGGC 5460

AAAATTTTAT TAATTGGTGG ATCTGCTAAC TTAGGTGGTG CCATTATGTT AGCGGCTCGT 5520

GCATGTGTAT TTAGCGGTAG TGGTTTAATC ACTGTAGCTA CACATCCAAC AAATCATTCA 5580

GCATTACATT CTCGTTGCCC AGAAGCGATG GTTATTGATA TTAATGATAC GAAAATGTTG 5640

ACGAAAATGA TTGAAATGAC TGACAGTATA CTAATTGGTC CAGGTCTTGG CGTTGATTTC 5700

AAAGGAAATA ATGCCATTAC ATTCCTACTA CAAAATATAC AACCGCATCA AAATTTAATC 5760

GTAGACGGCG ATGCGATTAC AATCTTTAGT AAACTGAAAC CGCAATTACC TACATGTCGT 5820

GTGATCTTTA CACCACACCT CAAAGAATGG GAACGATTAA GTGGTATTCC TATTGAGGAA 5880

CAGACATATG AGCGTAATCG TGAAGCAGTT GATCGTTTAG GTGCAACTGT TGTACTTAAA 5940

AAACATGGTA CTGAAATTTT CTTTAAAGAT GAAGACTTTA AATTGACAAT CGGTAGCCCA 6000

GCAATGGCGA CTGGTGGTAT GGGCGATACA CTTGCTGGTA TGATTACAAG CTTTGTOGGT 6060

CAATTTGATA ACTTAAAAGA AGCGGTTATG AGTGCCACAT ATACACATAG TTTTATTGGC 6120

GAAA^CCTTG CAAAAGATA7 GTATGTGGTG CCACCATCAA GACTTATCAA TGAAATACCT 6180

TACGCAATGA AACAATTAGA AAGTTAGTCA TTACTAATCA TTGAATATAG TAAAGCATTA 6240

CTTTCTAGCA TAAAAATAAG ACTCCCCTAC ATATAGGGAA GTCTTATTTT TTATTATTCT 6300

TCATCTGATG ATTGTTGTAT ATCTTCTTCA ACACGATCCA TGAAATCTTG TCTTACTTCA 6360


45 


ATACGTCCAT 










TGCAi'i'iCCT 


6420 


GGTGTTTCAT CATTTaCAAC CGCTTCACGT TGTTGTTCAG TACCATCTTC AGATACAGTT 6480

GAAGTAGATT GCTCATCTTC ATTCGTTTCA TCTTCTGCAT CTTCTTTTAC TTTAGCAACC 6540

GTTGAAACAA ATPGATCATC ACCTAAGCGA ATTAAGCGAA CACCTTGTGC TGCACGACCA 6600

TTTTGAGAAA TATCTGCAAC ATCTAGTCGA ATAATGACAC CTGCATTAGT AACAATCATT 6660
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GTAGCTGTTT 


TAATAC cm 


ACCACCACGA 


TTTGATAAGC 


GATAGT CATT 


AACTC3GCGTA 


o /o u 




CGTTTACCAT 


AACCATTTTC 


AGTAACTACT 


AATACTTCAT 


CAACACTGTT 


TGCATGAGCT 




5 


ACATCAAGCC 


CTACAACTTC 


GTCACCTTCA 


CGAAGTGTAA 


TACCTTTCAC 


^^wwwwx X w w X 






GTACGGCCTA 


AAGGACGTAA 


TGTTGATTCA 


GGGAATCGAA 


TTAATGATG C 


ATGTQATGTA 


w 7 D U 


TO 


CCAATCAAGA 


TATCTTCTTG 


ACCACTTGTT 


AAGCGAACTG 


CAATTAACTC 


ATCATCTTCT 


7020 


CTGAACGAAA 


TCGCAATCTT 


ACCATTTCTA 


TTTATTCTTG 


AGAAGTTACT 


TAATGCTGAA 
xnnx WW X umn 






CGTTTAACGA 


CACCACGTTT 


AGTTGCAAAC 


ACTAAGAAGT 




X X V»Art W X X 


7140 


15 


TTAACAGCAA TCATTGTACT 


AATGACTTCA TCATTTTCAA 


GTTCAATACir 


AX X wAW X A^A 






GGAATACCTT 


TAGACTGTCT 


TGATAACTCA GGCACTTCGT 




'1*1* WJTI'ATAOft 
X X XwXAXAV>A 


7260 




CGACCTTTGT 


TAGTAAAGAA 


CAATACATGG 


TCATGTGTAC 




^■MaX XUnwXU 


/ J^U 


20 


ACAAAATCTT 


CTTCCAATGT 


ATTCATACCT 


TGAACACCAC 




\»x X 1 Xianul..A 






CQATATGTAG 


ATACCGGCAA 


ACGTTTAATG 


TAGTTATTAT 


\JU\> 1 X AV7 XU X 


aaX XaCxaI X 






TGTTCTTCTG 


GAATTAAGTC 


TTCGTCCTCT AAGTCTTCAA 


AX v.i«i#iww X An 


i xXxaaX X X ^ X 




Ho 


GTACCACGAT 


CATCACCGAA 


ACGATCTCTA ATTTCAGTCA 


AX l^>nX^XwX 


AA^ X AA^ X\a X 


/!>OU 




AATAACACTT 


CTTCATCAGC 


TAAGATTGCT 


TCTAATTCAC 


TJUk.TATAATT 
x^nnxnxnnx x 


X AAXAa^XVmA 






TTATATTCAG 


CTTCAATTTT 


GTCTCTCTCT 


AAACCTGTTA 


V3n^u) X V* X X nn 


A\«.V9k«A X U> X X. X 


/Do U 


30 


AAAATAGCTT 


GAGCTTGTTT 


TTCAGAAA6T 


TTGAAGCGTT 


wX XVVwWlww X 


1 XV.V.AX XVtViA 






ACTTTATCTG 


TATCTGACTC 


ACGAATCGTT 


GAAATAATTT 


CATCGATATG 
X k>unx AX w 


w X VAnV XOWU 


7Ann 

/ DU U 




ATACGTAATC 


CTTCTAAAAT 


GTGGGCACGA 


TCTTTAGCTT 


TACGTAAoTT 


GTATTGCGTA 

W X r% X X >7WV7 X lA 


7B60 


35 


CGTCTTCTAA 


CAACTGTCTT 


TTGATGCTCT 


AAATAATGTA 




XXX XAAAX XA 


7920 




ATAAGCTTCG 


GTCTACCATT 


TACAAGTGCA 


ATCATATTCA 


CACCAAATGA 


TGTTTGAAGA 

X W A X X w^lAUl^m 


7980 


40 


GGTGTTTGTT TGTATAAGTT 


ATTTAAAATG 


ACACTAGCAT 


TTGCATCCTT 


ACGCACATCA 

^%wwwi^Um^%x Wi^i 


B040 


ATAACGACAC 


GCACACCAGT 


ACGTAAACTT GTTTCi^TCAC 


GTAAATCAGT 


GATACCGT(!!!A 

V3AX A\.<WV9 X wA 


8100 




ATTTTCTTGT 


CACGAACGAG 


CTCTGCAATT 


TTTTCAATCA 


TACGAGCCTT 


A X X wA w X X %9w 


8160 


AAAGGAATTT CAGTGACAAC AATACGTTGA C6TCCGCCTC CACGTTCTTC AATAACTGCA 8220

CGAGAACGCA TTTGAATTGA ACCACGACCT GTTTCATATG CACGTCTAAT ACCACTCTTA 8280

CCTAAAATAA GTCCAGCAGT TGGGAAATCA GGACCTTCAA TATCCTCCAT TAACTCAGCA 8340

ATTGAAATAT CAGGGTTCTT ACTTAAGCTA AGTACACCAT TGATTAATTC TGTTAAGTTA 8400

TGTGGTGGAA TATTCGTTGC CATACCTACC GCGATACCTG ATGCACCATT GGCTAATAAG 8460
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AAATCTATTG TATCTTTATT AATATCACGT AACAGTTCAA 
GTGTGATTTT AGTCATACGC 8580

GCTTCAGTAT AACGCATTGC TGCTGCGCCA tCTCCATCCA TTGAACCAAA GTTACCTTGG 8640

CCATCAACAA GCGGATAACG ATAACTGAAA TCTTGAGCCA TACGTACCAT TGCTTCATAA 8700

ATAGATGAGT CACCATGAGG GTGATATTTA CCCATTACGT CACCAACGAT ACGTGCTGAT 8760

TTTTTATATG ATTTATCCGG TGTCATACCT TGTTCATTTA ATCCATATAG TATACGACGA 8820

TGTACTGGTT TTAAACCGTC ACGAACATCT GGCAATGCAC GAGCAACGAT AACACTCATC 8880

GCATAATCTA AAAATGATTC ACGCATTTCA CTGGTAATAT TTCGTTCATT TATTCTTGAT 8940

TGAGGTAATT CAGCCATCAA GAGTTCCTCC TTCAAAAGTT CAGTTCACAG CGCTTAGAAG 9000

TCTAAOTTTQ CATAAACTGC ATTATCTTCT ATAAATTGTC TACGGTTTTC TACAACGTCA 9060

CCCATTAACA TTTCAAATGT TTGGTCCGCT TCAATCGCAT CTTCAAGTTT TACTTGTAAA 9120

AGAGCGCGGT GCTCAGGGTT CATTGTTGTT TCCCAtAATT GATCTGCATT CATTTCTCCA 9180

AGACCTTTGT ATCGTGCAAT AGACCATTTT GGTGTTOGAT TCAATTCAGA TTTAAGTTTA 9240

TCAAGTTCCC TATCATTGTA TACATAATAC TTTTGTTTAC CTTGTGTCAG TTTATACAAC 9300

GGTGGCTGTG CAATATACAC ATAGCCTGCT TCAATTAACG GTCTCATAAA TCGATAGAAG 9360

AATGTTAATA ACAATGTTCT AATATGCGCT CCATCCACAT CGGCATCAGT CATAATGA06 9420

ATTTTGTGAT ATCTTGCTTT CGCTAGATCA AAGTCGCCAC CGATTCCTGT ACCAAATGCT 9480

GTGATCATTT GACX3AATTTC ATTGTTATTC AAAATTCTAT CTAATCX3TGC TTTTTCAACA 9540

TTTAATATCT TACCTCGTAA TGOTAAAATC GCCTGCGTTC TAGAGTCACG ACCAGATTTT 9600

GTAGACCCCC CGGCAGAGTC CCCTTCGACT AAGAAAATCT CACATTCTTC AGGACTTTTA 9660

CTAGAGCAAT CGGCTAATTT ACCTGGAAGG CTTGCTACAT CTAACGCTGA TTTACGACX3T 9720

GTTACTTCAC G06CTTTTTT CGCAGCAACA CGTGCACGTG CCGCCATAAT ACCTTTTTCA 9780

ACCACTGTAC GTGCGACTTC TGGATTTTCA TATAAAAATC GTTCAAAGTG CTCTGAGAAT 9840

AATTTATCTA CAACTTGACG CACTTCAGAA TTACCTAATT TTGTCTTCGT TTGACCTTOG 9900

AATTQAGGAT CACCATGTTT GATAGATATA ATTGCTGTCA TACCTTCACG TGTATCTTCA 9960

CCAGAAAGTC TATCTTTTTC TTCTTTCATA ATCTTGCTAC TTAAACCATA ACTATTTAAG 10020

ACACGCGTTA ATGCACGTTT GAATCCGTCT TCATGCX5TAC CACCTTCATA CGTATGAATG 10080

TTATTTGCGT AAGTTAAAAG ATTTGTGGCA TATCCTGAGT TATATTGAAT CGCAATTTCT 10140

ACTTCAATAT CATCTTTAGA TTGATGAATA TAAATTGGCT CATCATGAAT AGGTTCTTTA 10200

TnTCGTTCA ATAACTCAAC GTACQATTTA ATACCGCCCT CATAGTGATA GGAGTCTTCT 10260
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GCAAGCTCTC 


TAATACGCTG 


CTGTAATGTT 


TCATAGTTGT ATACAGTTGT CTCTGTGAAG 


10380 




ATTTCTCCAT CTGCTTTAAA ACGAAtGaCA GTACCTGTCT TAtCAGTnGT GCCAACTTCT 10440

TTTAAGTCAA ATTGAGGTAC ACCTTTTTTA TATGCTTGAT GATATATAGT CTCATTTCTG 10500

TGTACATATA CTTCTAAGTC TTGTGACAAT GCGTTTACAA CTGATGAACC AACACCATGT 10560

AAACCACCAG ATACTTTGTA TCCGCCACCG CCAAATTTAC CACCAGCATG TAAAACAGTT 10620

AAAATAACTT CGACAGCTGG ACGTCCCATT TTTTCTTGAA TATCAACTGG GATACCACGT 10680

CCGTTATCCG TTACTTTAAT CCAGTTATCT TTTTCAATAA CAACTTCAAT TTGATTTGCA 10740

TAACCaGCTA ATGCTTCATC GATACTATTA TCGACAATTT CCCACACTAA ATGGTGCAAA 10800

CCTCTCTCTG AAGTCGATCC TATATACATA CCTGGTCTTT TACGTACTGC TTCTAAACCT 10860

TCTAATACTT GTATTTGCCC AGCACCATAA TTATCOGTGT TGTTTACATC TGACAATX3CA 10920

GTCACCATC6 CTTTCTGTTA CTTTATAATT TCACCTTGAT TAATACGATA CAATTTAGCG 10980

TTATTCATGA TTTCATGATC AATACCATCT ACAGATGTCG TAGTGACAAA TGTTTGTACT 11040

TTATGCTGAA TCGTACTTAA TAAATGGGTT TGACGC6AAT CATCTAATTC ACTGAGTACA 11100

TCGTCTAATA ATAAQATGGG ATATTCCCCA ACTTCX3ATAT TCATTAACTC AATTTCAGCT 11160

AATTTAATGG ACAAAGCCGT TGTACGTTGC TGTCCTTGAG AACCATATGT TTGAGCATCC 11220

ATGCCATTCA CATCAAAACT TATATCATCT CX5ATGTGGTC CGAATAAGCr AATGCCTCGT 11280

TCTTTTTCTC TTTGCATATT ATCGCTAAGA ATAGACATAA TTTCTTCAAG TCGTGCCX^CT 11340

TCATTTTGAG CATAATCAAA TTTAAGACTA GGTAAATAAT TCAGCGACAA CGCTTCTTTA 11400

TCATTTGTGA TACCAGCATG AATCGGTTTA GCTAACGACT CTAGCTCTTG AATAAAATGT 11460

GCACGTTTAT QfVGTrACTTT CATTGCATAT TCAGCAAACT GCTGATTTAA TACTTCCAAC 11520

ATTGTTAAGT CCTTTTTTTG GCCTAATTGT AACTGCTTTA AGTAATTATT CTTTTGCTTT 11580

AAAATACGTT GGTATTGAGC TAAATCATTT AAGTAAACAG CAGAAATTTG GCCCAACTCC 11640

ATATCTATAA AGCGTCGTCT TATTtGrGGr GAGCCTTTTA CAATATTCAA ATCTTCTGGC 11700

GCAAATAGAA CCACATTGAG GTGTCCAATA TATTGAGTTA GACGACTTTG CTCTAAGTGn 11760

ATTCACTTTG GACTTGTTTA CCTTTnTTAG TTATAAACAT TGTTAATGGG CATCGTGCCG 11820

TGT 11823

(2) INFORMATION FOR SEQ ID NO: 137: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 692 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137: 

ATAATTATTA ACATGGTGTG TTTAGAAGTT ATCCACGGCT GTTATTTTTG TGTATAACTT 60 

AAAAATTTAA GAAAGATGGA GTAAATTTAT GTCGGAAAAA GAAATTTGGG AAAAAGTGCT 120 

TGAAATTGCT CAAGAAAAAT TATCAGCTGT AAGTTACTCA ACTTTCCTAA AAGATACTGA 180 

GCTTTACACG ATTAAAGATG GTGAAGCTAT CGTATTATCG AGTATTCCTT TTAATGCAAA 240 

TTGGTTAAAT CAACAATATG CTGAAATTAT CCAAGCAATC TTATTTGATG TTGTAGGCTA 300

TGAAGTTAAA CCTCACITTA TTACTACTGA AGAATTAGCA AATTATAGTA ATAATGAAAC 360 

TGCTACTCCA AAAGAAACAA CAAAACCTTC TACTGAAACA ACTGAGGATA ATCATGTGCT 420 

TX3GTAGAGAG CAATTCAATG CCCATAACAC ATTTGACACT TTTGTAATCG GACCCGGTAA 480

CCGCTTTCCA CATGCAGCGA GTTTAGCTGT GGCCGAAGCA CCAGCCAAAG CGTACAATCC 540 

mrrATTTATC TATGGAGGTG TTGGtTTAGG aAAAACCCAT TTAATGCATG CCATTGGTCA 600

TCATGTTTTA GATAATAATC CAGATGCCAA AGTGATTTAC ACATCAAGTG AAAAATTCAC 660 

AAATGAATTT ATTAAATCAA TTCGTGATAA nA 692 
(2) INFORMATION FOR SEQ ID NO: 138: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7900 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138: 

ATACTGTAGC GCAAATTTCA CAATGGCATG TTATAGAAGA TTTAGTTACG AATGAATTAG 60 

GTATTAGTAT TTTACCAACA TCAATTTCAG AgCAACTAAA TGGAGATGTG AAGCTGtACG 120 

CATTGAAGAT GCTCATGTAC ATTGGGAATT AGGTGTTGTT TGGAAGAAGG ATAAACAATT 180 

AAGTCATGCC ACAACGAAAT GGATAGAATT TTTGAAAGAC COTTTAGGCT AACATATTAA 240 

TAAAGCACTC ATTATTTAAG GCGCATCATT ACGTGGGTCA TTGAAATAAT GAGTGTTTTT 300 

TTGTGAAAAT GAAGTGAAAT TTAGAGAGCG TTTCCATAGA AAATAGTAAT ACAAACTATA 360 

AAAAAAGAGT ATTTTTATAT TGTGTACGCC ATCTTTATAA TAGTTATTGT AACAATTTAG 420 

ACATATTTAG AAAGGGATGG CGCCATGCAC AAAGTCCAAT TAATAATCAA ACTACTACTA 480 

CAACTAGGAA TCATCATTGT GATTACTTAT ATTGGCACAG AAATTCAAAA GATTTTTCAT 540



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4S 






ATTGTACCGC TAACTTGGGT AGAAGACGGT GCAAACTTTT TATTAAAGAC GATGGTCTTT 660

TTCTTCATAC CGTCAGTTGT AGGtATTATG GaTGtgCTTC CGAAATTACG CTAAATTATA 720

TACTCTTTTT CGCAGTCATT ATCATAGGAA CATGTATCGT TGCATTATCT TCAGGTTATA 780

TTGCTGAAAA AATGTCyGtT AAACwTAAAC ATCGTAAAGG TGTAGACGCt TATGAATGAT 840

TACGTGCAAG CCTTATTAAT GATTTTGTTG ACTGTCGTTT TATATTATTT CGCTAAAAGG 900

TTACAACAAA AATATCCGAA CCCATTTTTG AATCCAGCAT TAATTGCATC TTTAGGAATT 960

ATTTTTGTCT TACTTATCTT TGGAATTAGT TATAACGGGT ATATGAAAGG TGGCAGTTGG 1020

ATCAACCATA TTTTAAACGC AACGGTCGTA TGTTTAGCGT ACCCACTTTA TAAAAATA6A 1080

GAGAAAATTA AAGACAATGT CTCTATCATT TTTGCAAGTG TATTAAcTGG CGTCATGCTG 1140

AATTTCATGT TAGTGTTCnT AACACTTAAA GCATTTGGCT ATTCTAAAGA CGTCATTGTA 1200

ACGTTATTGC CCCGATCTAT AACAGCCGCA GTAGGTATCX; AAGTGTCACA TGAACTAGGT 1260

GGTACAGATA CGATGACCGT ACTTTTTATT ATCACAACGG GTTTAATCGG TAGTATTTTA 1320

GGTTCGATGT TATTAAGATT TGGAAGATTT GAATCTTCTA TCGCCAAAGG ATTAACGTAT 1380

GGGAATGCGT CACATGCATT TGGCACAGCT AAAGCACTAG AAATGGATAT TGAATCCGGT 1440

GCATTTAGTT CAATTGGGAT GATTTTAACT GCAGTTATTA GTTCAGTGTT AATACCTGTT 1500

CTAATTTTAT TATTCTATTA ATTTAGATAT TTAAAATGAT AGACAGAAAG GGAGGCTATT 1560

AGTAATAATG GCAAAAATAA AAGCAAATGA AGCATTAGTT AAAGCATTAC AAGCaTGGGA 1620

TATAGATCAC TTGTATGGTA TTCCAGGAGA CTCAATCGAC GCATAGTCGA TAgTTTACGT 1680

ACAGTGAGAG ATCAATTTAA ATTTTATCAT GTACGTCATG AAGAAGTAGC AAGCTTAGCG 1740

GCTGCTGGTT ACACAAAATT AACTGGTAAA ATCGGTGTGG CATTAAGTAT CGGTGGCCCT 1800

GGTTTAATTC ATTTATTAAA TGGTATGTAT GATGCCAAAA TGGATAATGT ACCGCAATTA 1860

ATATTATCTG GACAAACGAA TAGTACAGCA CTTGGAACGA AAGCATTCCA AGAAACAAAT 1920

TTACAAAAAT TATGTGAAGA TGTAGCCGTT TATAATCACC AAATTGAAAA AGGTGACAAT 1980

GTGTTTGAAA TCGTTAACGA AGCAATTCGT ACOGCATATG AACAAAAAGG TGTAGCTGTT 2040

GTTATTTGTC CTAACGACTT ATTAACTGAA AAAATTAAAG ATACAACGAA TAAACCAGTA 2100

GATACATCAA GACCAACAGT AGTATCACCA AAATATAAAG ACATCAAAAA AGCGGTTAAA 2160

CTAATTAATA AAAGTAAAAA GCCTGTCATG TTAATTGGTG TAGGTGCX^AA ACATGCGAAA 2220

GATGAGCTAC GTGAATTTAT TGAAATGGCT AAAATTCCTG TCATTCATTC ATTACCAGCT 2280

AAAACAATCT TGCCGGATGA TCATCCATAT AGTATCGGtA ACTTAGGTAA AATCGGTACC 2340
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CCATATGTGG ATTACTTACC TAAGAAAAAT ATTAAAGCCA TTCAAATTGA CACAAATCCT 2460 

AAAAATATCG GACATCGTTT CAATATTAAT GTAC3GAATTG TTGGAGATAG TAAAATTGCG 2520 

TTGCATCAGT TAACTGAAAA TATTAAACAT GTTGCTGAAA GACCATTCTT AAACAAAACG 2580

TTAGAACGTA AAGCGGTTTG GGATAAATGO ATGGAACAAG ATAAAAATAA TAATAGTAAA 2640 

CCATTAOGTC CAGAACGATT AATGGCATCA ATCAATAAAT TTATTAAAGA TGATGCAGTG 2700 

ATTTCAGCAG ATOTAGQTAC AGCAACAGTT TGGTCAACTC GATACTTAAA CCTTGGTGTA 2760

AATAACAAGT TCATCATTTC AAGTTGGTTA GGTACAATGG GTTGCGGTCT TCCAGGTGCA 2820 

ATTGCATCAA AAATTGCATA TCCAAATAGA CAAGCCATCG CAATTGCTG6 TGACGGTGCA 2880 

TTCCAAATGG TAATCCAAGA CTTCX5CTACA GCAGTACAAT ATGATTTACC TTTAACPGTA 2940 

TTTGTACTTA ATAACAAACA GTTAGCATTT ATTAAATATG AACAACAAGC AGCTGGTGAA 3000 

TTAGAATATO CAGTTGATTT TTCTGATATG GATCATGCAA AATTTGCTGA GGCAGCAGGT 3060 

GGTAAAGGTT ATACAATTAA GAGTGCTAGC GAAGTAGATG CTATAGTCGA AGAGGCATTA 3120 

GCACAAGATG TACCAACGAT TGTAGATGTA TATGTTGATC CTAATGCTGC GCCATTACCA 3180 

GGTAAAATTG TAAATGAAGA AGCGCTTGGT TATGGTAAGT GGGCATTTAG ATCAATTACT 3240 

GAAGATAAAC ATTTAGATTT AGATCAAATT CCACCAATTT CAGTGGCAGC AAAACGTTTC 3300 

TTATAACTGA TTTAAAGGTT ATCACAATTG AATTGAACTA TAAAAACGGT AATTTCTATT 3360 

TCAACAAAAT GGGAATTGCC GTTTTGTTTA TTTATCACAA ATGATCGTAC TGAATTGATG 3420 

ATAAAATTGT GAAAAAGTTG TTGAAAACGC TTTTACAAAT ATGTATAATA GCTATGAATT 3480 

AGATATCACT TGCGTGTTAC TGGTAATGCA GGCATGAGCA AACAACCGCA CTATGAGAAT 3540 

AGTCTTGTTT GTTCATGCCT GCTTTTTTTG TACATGGAAG CGGAAATTGA GATAGGGGAT 3600 

GTTTfiTATGT TTAAGAAATT GTTTGGACAA TTGCAACGTA TCGGTAAAGC ATTAATGTTA 3660 

CCTGtTGCGA TTTTACCAGC AGCTGGTATT TTATTAGCGT TTGGTAACGC AATGCACAAC 3720 

GAACAATTAG TAGAAATTGC ACCATGGTTA AAAAACGATA TCATTGTAAT GATTTCGTCG 3780 

GTCATGGAAG CAGCAGGACA AGTTGTATTT GATAACTTGC CATTATTATT TGCAGTTGGT 3840 

ACAGCACTTG GATTAGCAGG AGGAGACGGT GTTGCAGCAT TAGCAGCGCT AGTAGGTTAC 3900 

TTAATTATGA ATGCAACAAT GGGGAAAGTG TTGCACATTA CAATTGATGA CATTTTCTCA 3960

TATGCCAAAG GGGCAAAAGA ATTAAGTCAA GCAGCGAAAG AACCAGCACA TGCTTTAGTA 4020 

TTAGGTATTC CAACGTTACA AACGGGTGTG TTTGGTGGTA TTATCATGGG TGCTTTAGCC 4080 

GCATGGTGTT ACAACAAATT TTATAATATT ACACTACCAC CATTTTTAGG ATTCTTTGCA 4140






AGCTTTGCGT GGCCACCAAT TCAAGATGGA TTAAATAGTT TATCGAATTT CTTATTAAAT 4260

AAAAATTTAA CATTAACAAC GTTTATATTC GGTATTATTG AACGCTCATT AATTCCATTT 4320

GGTTTACATC ATATTTTCTA TTCACCGTTC TGGTTTGAAT TCGGAAGTTA TACAAATCAC 4380

GCAGGTGAAT TGGTTCGTGG TGACCAACGT ATTTGGATGG CACAATTGAA AGATGGCGTA 4440

CCATTTACTG CTGGTGCATT TACTACTGGT AAATATCCAT TTATGATGTT TGGTTTACCA 4500

GCGGCGGCAT TTGCTATTTA TAAAAATGCA CX^ICCAGAAC GT/UUU^GT CGTGGGTGGT 4560

TTAATGTTAT CAGCAQGATT AACTGCATTT TTAACTGOTA TCACTGAGCC ATTAGAATTT 4620

TCATTCTTAT TTGTAGCACC AGTACTTTAT GGAATTCACG TATTATTAGC TGGTACATCA 4680

TTCTTAGTAA TGCATTTATT AGGCGTTAAA ATTGGTATGA CATTCTCAGG TGGTTTCATA 4740

GATTATATTT TATATGGTTT ATTAAACT6G GATCGTTCAC ACGCATTATT AGTTATTCCA 4800

GTCGGTATTG TATATGCTAT CGTGTATTAC TTCTTATTCG ACTTTGCAAT TCGTAAGTTT 4860

AAATTGAAAA CACCAGGTCG TGAAGATGAA GAAACTGAAA TTCGTAACTC TAGTGTCGCA 4920

AAATTACCAT TTGATGTCTT AGATGCAATG GGTGGAAAAG AAAACATTAA ACATTTAGAT 4980

GCATGTATTA CACGTCTACG CGTAGAAGTG GTTGATAAAT CAAAAGTAGA TGTAGCAGGT 5040

ATTAAAGCTT TAGGCGCATC AGGTGTATTA GAAGTTGGAA ACAATATGCA AGCTATCTTT 5100

GGTCCAAAAT CAGATCAAAT TAAACATGAT ATGGCCAAGA TTATGAGTGG TGAAATTACG 5160

AAACCAAGTG AAACGACAGT GACTGAAGAA ATGTCAGATG AACCAGTTCA CGTAGAAGCA 5220

CTTGGAACAA CAGACATCTA TGCACCAGGT ATCGGTCAAA TCATTCCATT ATCAGAAGTA 5280

CCTGATCAAG TATTCGCTGG TAAAATGATG GGTGATGGTG TTGGCTTTAT CCCTGAAAAA 5340

GGTGAAATTG TAGCACCGTT TGATGGTACA GTGAAAACAA TCTTCCCTAC QAAACATGCG 5400

ATACSSATTAG AATCTGAAAG TGGCGTCGAA GTACTTATTC ATATTGGTAT CGATACAGTG 5460

AAACTGAATG GTGAAGGATT CGAAAGTCTG ATTAACGTTG ATGAAAAAGT AACACAAGGT 5520

CAACCATTAA TGAAAGTGAA TTTAGCATAC TTGAAAGCAC ACGCACCAAG CATCGTTACA 5580

CCAATGATTA TTACAAATCT TGAAAATAAA GAACTTGTCA TTGAAGATGT ACAAGATGCT 5640

GATCCAGGTA AGCTAATTAT GACAGTCAAA TAATGATTAA AAATGAAACA GCATATCAAA 5700

TGAATGAACT TTTAGTCATT C6TAGTGCGT ATGCGAAGTA GCGAGTTGAA AGAGAATACG 5760

TTACAAAAGG CAGTAGCTTA AAATGAAGCT ACTGCCTTTT TA6TGCGCAA TGATGTATAG 5820

CAGGTGTGTT GATGrTAATA AGTTAAATAT TAGTGTTAGA TATAGAAAAC ATTGCTTATG 5880

TTTTTGTCAC ATTTTAGAAA AATGCATCTT CGCGACTAGC CAAATTAATA GTCTCATTGA 5940
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CCAGATGCTT TGAACTTATC GCAATTGTTG AGTGCGAAAG GTATCGAACA TGACTTTATA 7860 
CCTGGATATT ACCAATTCCA TATTTATCCA GTATTTCCGA 7900 
(2) INFORMATION FOR SEQ ID NO: 139: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1984 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139: 



GTCTAAATAA ACAAAATTAT CATTGATTaC TGAACTGGCA TTTCGAAGTA ATGCTTCAAT 60

ATCATTCGAA TATTTCTTCA ATTTATGATT GTGAAATAAT TCTTGCATCA AAAATGGTCT 120

TTGGTCACAT GAATGTGCAT CTGAAGCTAC AAAATGAGCC AAATTACATT CTATAAATTG 180

TAATGATAAC TTTTGAATGT TTTTACCAAA TCCACCAACT AAAGAACTCG ATGTTAATTG 240

ACTCAGTGCC CCATTTGCAA CCAATTCATA TAATATTTCC GGATTTTTGG CGATACTTCT 300

ATTTCTTTCA GGATGTGCAA TGATTGGTAT GTAACCTCTC GATTGTATTT CAAAAAACAA 360

TTGTTTTGTA TAATGTGGTA CTTCGCCCGT TGGAAATTCA ATTAATAAAT ATTTCGAACG 420

ATTAATACCT TGAATACTAC CATTATCTAA GCCTTTCAGA ATCGAATCTG TAATTCTAAT 480


30 






X AA i A i I- t-Aft. 


TGCTTGAACT 


TCTGGATGCG 


TTCTTAACTC 


DH U 




CGCCAATTTC ACAAGCACTT GTTGAAATGT ATTATCATAT CTCGGATGCA AATGATGAGG 600

TGTCGCTACA ATACTTGTTA CACCTTCATC CTTAGCTTGC TTTAATAGTG CAATACTCTT 660

TTCAATTGTT TTAGGACCAT CATCTATATC AACTAATATA TGGTTATGAA TATCAATCAT 720

GATTCATCAG TCCCATAATA TGCATA6TAA CTAGCACTTT TATCTTTAGG CATTCTATTT 780

AAGACTACAC CTAATAATTT AGCACCTGTT GCTTCAATAA GTTCTTTTCC TTTTTTAACT 840

TCATCTCTAT TATTATTTTC CGAATTAACT ACGTAGACAA CATTGCCGGT AAACTTTGAA 900

AATAATTGCG CATCTGTAAC TGTGTTCACT GGTGGCGTAT CGATAATTAC AAAGTTATAA 960

TTCATCAATA ATGTGTCATA CAAATTTGCA AATGCCCTTG ATGTAATTAA CTCTGACGGA 1020

TTCGGTGGGA TTGGCCCAGA CGTCAAGACG TCTAAATCTT GAATTTCAGT TGAGATAATA 1080

CTGTCTTGAT AAGTTGACCA ATTTAGCAAT AAACTTGATA GGCCTTCATT GTTTGGCAAA 1140

TTAAAAATAT AATGCTGCGT AGGTTTACGC ATATCCCCGT CTACGATTAG TGTTTTATAA 1200

CCTGCTTGCG CATATGCAAC TGCTAAATTT GCTGCAATTG TAGACTTACC TGCOCCTOOT 1260
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GATCTTATGC CTCGAAATTT CTCGCTAATA GGTGACTTTG GTTGTTCATG GACAATTAAA 1380 

CTTGATGTAC TTCyTCGTGT ATTCGTCATO GTAATTCCTC GTAAATTAAA ATTTTTGTAT 1440 

TGAACCTAAA ATAGGTAATC CTAGTTGCGA TTCAACATCT TCTTCTGTCT TAATACXXHT 1500

ATCTAATAAT TCTTTTAAGA AAATAATCAA TATTGCTAAA ACAATACCAA CAATAATGCT 1560 

GATAACTAAG TTQACAGATA CTATTGGAGA TACTTTTACA GCATTATCAT GTGCTGAGGA 1620 


AAGTATCGTA ACATTATCAA CACTCATAAT TTTAGGCATG TCATGAGCAA AAACTTTAGA 1680 

TATTTTATTA ACAATTTTGT CAGATTCAGA TTTATTCCCA GTGGTAACTG ATACAGTAAT 1740 

AATTTGAGAG TTTGTTTGAT TGGTTACTTT TAAAAATGAA TTCAACTCAG CTGTTGAATA 1800 

CTGACCATCA AnTTCTCTAG ATACTTTATC TAGAATTCTA GGACTTTTGA TAATTTCCGT 1860

ATATGTATTA ACAGACTGCA AACTACTTTG AACATTTTGG AAAGCTAAAT CACTTGAGGA 1920 

CTTTTTCATG TTCACTAATA TTTGAGTAGA AGCAGTATAT TTGTCAGGCA TAACAAAAAA 1980 

GGTT 1984 
(2) INFORMATION FOR SEQ ID NO: 140: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6272 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140: 





CAAATCCCTT GGTGATGAtA AAtGtATTGC TGTGTAGCCA AATAATCTTC GTATATATGA 60

CTGACGTTCA ACAACAGCTT GCAATCGTTT OGTTGGTACA GTTACTTTCT TCTTGTTAAA 120

GAi^CCATAT TCAATTTTAA GTTGCTCATT TTCAAGCATC ACC6AAAAGC CATAAAATCT 180

TATCATTGTT ATAATCGTTC CAATAATATA TGCCACTATT AATACTAGTA AAATGATGAT 240

TAATACTGAA ATACTTACAA TTTGAACCCA TTGACTAATT TCATGATTTA GCTTCGACCA 300

TGGGATCAAC TCTCTTACAO CCCCGTAAAT CGGTACTAAA GCTGCTAACG TTACACCAAT 360

GGCGCCACTG GTCATTGCCA TAAATAGTGA TTCTTTAAAA TTCATCTGAT ATATAGGAAT 420

GCGTTTATTT TTCTGATTAA GCATACTATC AGTGTTCTGC ACTTCATCTA AGCGACCTTC 480

TGCGATGTCT TCCACATTAC CTTCAATGTC ATGATTACAG TTGTCATTCT TCTCAGCACT 540

AGACTTTTGC GCCACTTCTG TCTTCAACTC TGTTTGCAAT TGATCAATAT ATCGTTCAAG 600

ATATTCACCT TGTTTTTTOG AAATAACACT TAAGACAATA CCATCACTTG GTGTTTTGAT 660
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AATACGTTTT ATATTTAATT CTTTACGCTT TTTATTAAAA ATACCTGTTG TTAAAATGAA 780 

ATAATTATCC tCAATCCAAT ATCGCGTGTT CATAATTCCG ACAATTTGAG AAATGTATGA 840

TATTAAAAAG AATACAAATA CAATACCTAT CCATAAATAT GATTCGGGAT TCGTATAATC 900

AAAATCTTTC AATTGAAAGA TAATGAAAAT AAAAAAGACG ACTATGTTTT GTTTGATAGC 960

ATTGATTATG CCATTAAAAT ATGAAATCGG ATGTAATTTT TGAGGTTCAG ACATCACTTT 1020 

CAACCCCTCT CAAATTCQAC ATAGTTCTCT CTTCGATTAT TTTAACATCG TCATGAGACA 1080 

TCATCGGTAA ATAAATAGTA TGACCTGCAG TCATAAATCC AACTTTATAC AAATTAAGCA 1140 

CTTTACTAAT TGGATTAGAT TTAATCX3ACA AGTATTGTAA ACGTTCAATT CGACTCGTTT 1200 

CTTCTTTATA TATAAAAAAT GATGTACGAT ATTGTACACT TAOTTGATCA ACTTTATAAA 1260 

AGCGACAATG ATATTGCCAT AAAGGCTTAA TAAATAATTT TAATGTACTC AGAGCACCTA 1320 

AAACCAACAA AATATAAAGT AAGTAATGTG GCCATTCAAA TCTTAACCAT ATAAAATAAA 1380

AAATGACATA CACAGCTACA CTCAATATAA ATTCTAAGCC ATTCGTAATG TAGTAATACA 1440 

ACAATGCTGA CTTAGGACTC TTAGTCAACT TAGTATAATC TGACATATAC CCCTCTCCCC 1500 

AAATAAAAAA TTATACXSGAT TTATAATCTA TTTCATTTTA TTTTTATATG ATGATAATTA 1560 

TAGCATATGQ AATATTTCAT GCTAATTTAT TCTTCCTAAA GGTACATCTA AAAATTTAAT 1620 

TAAGCAGAAA GTGCTTGAAT TGCTAAAAAG ACACCATGTT ATAATTTTAT CAACATGATG 1680 

CCTTTCATCT ATAATCAATC TTTCATCTTA TCAAGAGCGA TATTTAGTTC AAGCACATTC 1740 

ACATAATCAT TTGTTAACAC ACCACGCTGC TTACGATGTT GAATCAAGTC GGCCACTCTT 1800 

GAAGTAGATA CATGACGAGC ATCAGCAATA CGAGGTGCTT GCTTCAATGC ATTTTCGACC 1860

GTAATATGCG GATCTAAGCC CGACCCAGAA CTTGTTGCAG CATCTATTGT TACATTTGAA 1920 

rrCCCAAATT TAACATGATG TTTCATGCGT GCTATTAATT CGGTGTTTCC ATTCGATTCA 1980 

TTACTTCCAC CTGAAGATAC GCCGTTTTTA TATAATTTTT CAGGATTCAT ATTATAATCA 2040

ACTGCACTCG GTCTCCCGTG AAAATATCGT GTCTCTGTCC AGTGCTGTCC AATCAATTTT 2100 

GATCCAACTA TACGATTGTC ATACGTAATT AAACTGCCAT TTGCTTGTTG ATAAAAAAAT 2160 

ATTTGACCAA TTAACGTGAT AGCTAACGGG AATAAAAATC CACATAATAC CATAGTTATT 2220 

ATCGTTAAAC AAATACTATT TCTTATCGTA TTCATGGTAC AGGCTCCTTC CTCTTTACAC 2280 

AAAAAATTGT ACAATCATAT CTATTAATTT AATGCCTAAA AACGGGACGA TTAATCCACC 2340 

TAATCCATAA ATCAACATAT TATTTATAAA GATTCTATCA ATGCTGTAAC CCTTTACTTT 2400

TACACCTTTC ATGGCAATTG GAATTAAGGC AACAATGATT AATGCATTGA ATATCAAAGC 2460 
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AATTGITGAC ATCATTAGTG CAGGTAAAAT TGCAAAGTAT TTTGCTACGT CATTAGCCAA 2580 

ACTAAATGTC GTTAATGCAC CTCTCGTCAT TAATAATTGT TTGCCTATTT TTACAACCTC 2640 

TATTAACTTT GTAGGATTCG AATCTAAATC AATTAGATTA GCTGCCTCTT TAGCACTAAT 2700 

TGTCCCTGAG TTCATAGCTA ATCCTATATT CGCTTtGTGc tAGCGCAGGT GCATCATTTG 2760 

TACCATCTCC TGTCATCGCA ACAATATGGC CTTTCGCTTG TTCATCTTTG ATGACTTTAA 2820 

TTTTATCTTC GGGTTTACAC TCTGCAACAA ATCTATCAAC CCCGGCTTCT TTTGCAATTG 2880 

TAGCTGCTGT TAAAGCATTA TCACCTGTAC ACATAACTGT TTCAATCCCC ATTTTTCTCA 2940 

ATTCA6TAAA TCGTTCTACA AGACCATCTT TAATCACATC TTTTAAATAA ATCACGCCAA 3000

GCATGACATT GTTTTCAATG ACTATTAAtG GnGTGCCACC TTTACTCGAT ACATCCATAC 3060 

AGAGAGACTC AATATTAAGA GGAATATTGC CTTGTTGTTG TTTGACAAGA TTTATCATAC 3120 

TATTAGGTGC ACCTTTGAAT ACCXjATATTT CATTTGTAAT GATTCCGCTC ATTCTAGTTT 3180

CAGCTGTAAA AGGCTTATAT GTGCCATCAA TGTCTTTAGG CAGCTCATTT ATATACATcT 3240 

GcttCGCTAA TCGTACAATA CTTTTTCCTT CTGGCGTATC ATCGTAGATT GATGACATAT 3300 

AAGCAGCGAC TATCAATTTT TCAAGCATTT GTTGATTCAC TGGTAAAAAT TCACTAGCGA 3360 

TTCGATTGCC ATAAGTGATT GTGCCTGTCT TGTCTAAAAT CATTACATCG ACATCTCCAC 3420 

ATACTTCTAC AGCACGCCCA CTTTTCGCTA ATACATTGAA TTGAGTAACA CGATCCATGC 3480 

CTGCAATACC AATCGCCGAT AACAAACCAC CGATTGTCGT TGGTATTAAA CATACTGTTA 3540 

ACGCAATGAG CATCGCAATA GGTAAAATTA AATGCAGGTA AGATGCTATT GGATATAACG 3600 

TTACAATAAC GACTAAAAAT ATAATTGTTA ACGTTGTTAA TAATGTAAAA AGTGCAATTT 3660

CATTTGGTGT TTTATTTCTT TCCGCCCCTT CAACTAAGGC AATCATTTTA TCTAAAAAAG 3720 

ATGTACnCGC TTCACTCTCA ACACGTATTT CTAACCAATC AGATGTTACA AGTGTACCGC 3780 

CAATGACTCC ATCAAAATCG CCACCTGATT CTTTTATCAC AGGTGCAGAC TCACCAGTAA 3840 

TTGCAGATTC ATCAACGGTr GCTAATCCAT TTATTACAAC GCCATCAGCA GGGATTGTTT 3900 
CTCCATTTTC TACCCGAATA TTTTGTCCGG CTTTTAACTC TGTGGCGTTC ACTATCCGAT 3960

ACGCACCATT TTCTTCTATC AATCGAGCAG TTAAATTTGA TTGTGCTTGT CTTAAACTAT 4020 

CAGCTTGCGC TTTTCCACGA CCTTCAGCAA AGGCTTCTGA AAAATTAGCA AACAATATAG 4080 

TTATTAATAA TATGATAAAA ATTGTAATCA AATAACCTCG CGATAGATAG CTAGTTCCAA 4140 

ATATGTCAGG AAAACATATT AATATCAACG TTAAAATCAT TCCAACCTCA ACGACAAACA 4200 

TTATCGGATT TTTTATTAAT TGTTTAAGAT TCAGCTTATA AAAACTCATT TTCAAAGCTT 4260 
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TTTATTTTAA AGTTAAAAAT TCACCAATAG GACCAAGTAA TAGTACTGGA ATAAATGTCA 4380 

AACCACTTAG TAAAACGATA AATACGATTA GTGATACGCC AAAATAAGGT TTATCAATCG 4440

CTATTGTATA TTTATCTTGA TGGTATGATT TTTTATTCAC TAAACTTGAT GCAATCATTA 4500

ATTGCAAAAT AATTGGTATA TAACGAGAAA GCAACATAAT GATTCCTGTA GAGATATTCC 4560 

AGAATGTTGT ATCATCTTTC AGTCCTTCAA ACCCTGATCC ATTGTTCGCA GCAGCTGATG 4620 

TCATTTCATA CATAACTTGT GAAATACCAT GAAAAGACGG ATTCGTtATa CTTtCACTTG 4680 

CTCCAGGAAT CATAAAAGCA AGTGCTGAAA ATACTAAAAT TAAAATTGGG TGTATGAGAA 4740 

AGACTAAGAC AATACATTTC ATTTCACGGG CGCCAATTGG CATATTTAAA TATTCTGGTG 4800 

TTTTACCAAC CATCAAACTG CATATAAACA CCGTCAGTAA GACAAATATC AATAAATTCA 4860 

TGAGTCCTAC GCCTTCGCCA CCAAATACAA CATTTAGCAT CATTAATACC ATTGGTCCTA 4920 

ATCCACCTAT AGGCGTTAAG CTATCATGCA TGTTATTAAC AGAACCCGTT GTAAATGCCX3 4980

TCGTAATAAC TGTAAATAGT GCTGACAAAC CTGCTCCAAA CCGTACCTCT TTACCTTCCA 5040 

TATTCGGTCC ATAAATGCCT AAATTCGCTA GTATTGGATT ACCACGATAC TCACTCCACA 5100 

TAGTTAATGT AAGAATTGCT ATAAAAATGA AAAACATTGC GACAAATAAT ATCAACGCAT 5160 

GACGATGTAC TCGTTTACCA TGTCTACTTA ACATGCGACC AAATAAGAAC AACATTGACA 5220 

TAGGAAGTAA CATCATACTG CCCATTTCTA TAAAATTGCT CCAAATATTT GGATTTTCAA 5280

AAGGTGTTGC AGAATTTCCT GCTAAAAATC CTCCACCATT CGTACCAAGA TGTTTTATTG 5340 

ATTCAAGTGA TGCAATAGGT CCAAATGCAA TATGTTGAAT ATGTCCGCTT AAAGTCCGAA 5400 

TCATTAAATT AGCATGCAAC GTTTGTGGTA CaCCTTGAGT CATCAATAAA ATACTAATTA 5460

AACATGATAA TGGTAAAAGT ACTCGGACAA TAAACCGAAC AATATCTTGA TAAAAATTAC 5520 

CAATCATATT AGTTAATCCA GTTAAACGTC TCAACATCGC TATACAAACG GCGTAACCTG 5580 

ATGCACTAGA TGTAAACATT AAATATGTCA TTACAATCAT TTGCGTTAAA TATGTCACAT 5640

CTGaTTCACC GTTATAGTGT TGtAAATTAC TATTTGTTAA AAAAGATATT GCTGTATTAA 5700 

ACGCTAAATC TATCGATTGG TTTAAATTAT GATTTGGATT TAAAAAAAGC CATTGCTGAA 5760 

CTATTAGCAA TACAAATGTT ATAAACCCCA TAAATCCATT AAATGCCAGA AAATGTTTGA 5820 

CATATGTTTT AGCTGACATG TGTTCTAAAT CTGTGCCGAT AATTTTAAAA CACATATTTT 5880 

CAAATCTAGT AAATATTAAA TCTACTCTTG ACGATTGCAC CAATGCTACG CGATATAGAT 5940 

ATCCACTAAA AACATACGTA ATCATAACCA TCATTGTTAG AAACAAAATT ATTTCCATGA 6000 

TAACCCTCAC TTAATATATT TCTAAAATTT TTCACTACGA ATTAAGGCAT AAAATAAATA 6060 
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ACACAACAAC ATCGTAACAA CTTGTTTATG AGAGAAATnT TAATTTTCAA ACTTAGTTAT 
TAAGAAAnCA TTAAGATGTG TATGCAGAAA TAAATTTTAT AGCATTTAAT TGTGAAGAAT 
ATTATGATAT TGCTATCGAG GTGAAGGTTA TG 
(2) INFORMATION FOR SEQ ID NO: 141: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1978 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



6180 
6240 
6272 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141:

AAATGATGTT TTACAATAAA TATAnAAACG TATCAACATA TATCATCATA TTTTTAGTTT 60 

CAAGTGCAGC CTTTGCAATA TTCTTGTTAA GTGCGnACAT TAGTGCTCAC TCGGAACAAG 120 

TGTACGAAAT GACTGACCAT CAAATTAAGA ACAATACGAT AAATAAAGCA TACGAACATA 180 

AAGACCCTAC AAACAATAGC GAACAAAGAG ATGGGAAAGT GTTCGCTTTA ATAAATTGAT 240 

ACATTGTCAC AACGTTATTT TGCCTATTTT TGCGmAATAG CGTTTTTTAT TACwTTTTTG 300 

CTGATSTTAA ATTTGTTATA TTTTGTTAAA GTATTATAAT GATTGAATAA ACAAATTGAA 360 

GGTAGGTTTT TTAATTGAGT AATTCTGATT TGAATATCGA AAGAATTAAC GAGTTAGCTA 420 

AAAAGAAAAA AGAAGTAGGA TTAACTCAAG AAGAAGCAAA GGAGCAAACA GCCTTAAGaA 480 

AAGCTTATCT TGAGAGTTTT AGAAAAGGGT TTAAACAACA AATTGaAAAT ACTAAAGTAA 540 

TTGATCCAGr AGGTAATGAT GTAACACCTG AAAAAATTAA AGAGATACAA CAAAAAAGAG 600 

ATAATAAAAA TTAAATCACA AATCTGTAAA GAATTTTCTG ACATTATAAC TTGAAATAAG 660 

TATtTTACTT ATCTTTTTAT TTTAAAATAA GTTATAATGT ATTTGATAAA ATTQAAGAAG 720 

GGAAGATACA CAAGATGTTT AATGAAAAAG ATCAATTAGC TGTTGATACG CTACGTGCAC 780

TAAGTATCGA CACAATCGAA AAAGCGAATT CTCGTCATCC AGGATTACCT ATGGGAGCTG 840 

CCCCAATGGC TTACACTTTG TGGACACGTC ATCTGAATTT TAATCCACAA TCTAAAGATT 900 

ACTTCAATAG AGACCGTTTC GTATTATCTG CAGGGCATGG TTCAGCATTA TTGTATAGCT 960 

TGTTACATGT TTCTGGTAGT TTAGAATTAG AAGAATTAAA GCAATTTAGA CAATGGGGTT 1020 

CTAAAACACC AGGTCATCCT GAATACAGAC ATACAGATGG TGTAGAAGTT ACTACCGGAC 1080 

CACTTGGACA AGGTTTT6CT ATGTCAGTAG GATTAGCTTT ACAGAAGATC ACCTAGCAGG 1140 

gAAATTTAAT AAAGAAGGAT ATAATGTTGT AGATCATTAC ACATATOTAT TAGCTtCTQA 1200 
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AAGTAAATTA GTTGTTTTAT ACGATTCAAA TGATATTTCA TTAGATGGCG AATTAAACAA 1320 

AGCTTTTTCT GAAAACACAA AAGCTCGTTT TGAAGCATAT GGTTGGAATT ACTTACTAGT 1380 

TAAAGATGGT AATGATTTAG AAGAAATTGA TAAAGCGATT ACTACAGCTA AATCTCAAGA 1440

AGGACCAACG ATTATTGAAG TTAAAACAAC AATCGGATTT GGTTCACCGA ATAAAOCAGG 1500 

AACTAATGGT GTTCATGGGG CACCTTTAGG TGAAGTTGAA AGAAAATTAA CATTCGAAAA 1560 

TTACGGTTTA GATCCTGAAA AACGTTTTAA TGTTTCAGAA GAGGTATACG AAATTTTCCA 1620 

AAATACTATG TTAAAACGTG CTAATGAAGA TGAATCTCAA TGGAATTCAT TATTAGAAAA 1680 

ATATGCAGAA ACATATCCTG AATTAGCAQA AGAATTTAAA TTAGCGATTA GTGGTAAATT 1740 

GCCTAAAAAT TATAAGGATG AATTACCACG TTTTGAACTG GGTCATAATG GTGCATCTCG 1800 

TGCTGATTCT GGTACTGTTA TTCAAGCAAT CAGTAAAACT GTCCCTTCAT TCTTTGGTGG 1860 

ATCAGCAGAC CTTGCTGGTT CAAACAAATC CAATGTAAAT GATGCAACTG ATTATAGTTC 1920

TGAAACACCT GAAGGtAAAA ATGTGTGGTT TGGTGTACGT GAATTTGCTA TGGGTGCT 1978 
(2) INFORMATION FOR SEQ ID NO: 142: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 7588 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142: 



TAGTAGTATT TATTAAATTA TACGAAGGGA CCcAACACAG AAAATTCATT TTATTGAATT 60

TTACATTTAT GTGCCAAGTT GGGAAAAATG TCTTATTTTT TCaAAGTATT TAAAAGTAAA 120

att/k:atgtt AATACGTAGT ATTAATGGCG AGACTCCTGA GGGAGCAGTG CCAGTCGAAG 180

ACCGAGGCTG AGACGGCACC CTAGGAAAGC GAAGCCATTC AATACGAAGT ATTGTATAAA 240

TAGAGAACAG CAGTAAGATA TTTTCTAATT gaaaattatc ttactgctgt TTTTTAGGGA 300

TTTATGTCCC AACCTTTTTA GAATATTAAA TTTCTACAAT TTCGTCATCT TCAACAATAA 360

AGCCCATTGT ATTGACGCTG TTATTTAAGA aagtcagaat ataacgcatt acttcatcac 420

GTTCTGGCTC ATTGTGAACC TCGTGGTAAA AACCTTGCCA AGCTTTAAAA tataattcag 480

GTGTTTGATA TTTTTCTTTA AACTCATCAA TTGCCCTAGT ATCAACAATT AAATCCTTCG 540

TTCCATACAT TAATAGCGTT GGCATTGGTT GAATGTCATG AATATGAGCC ATCGTATCTT 600

TCATCGTCTC ATTAATTGTA TTATACCAAT GATACGTTGC TTTTTTTAAC attaaaccat 660
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742 



CTCGTTTCTQ CSAATAAGATG AATGTCAAAA 
CTAACACCAT CCACAGTAAT AGACCCTTGC 
CATTGAATCG TAATAATTTT TGCATTGGCT
TCATCTACAT GACCGAGGAC AAAATGTCCA
TCTAAATTTA CTTCTGATTG TCGCTTAACA
TTAATTACTT GAACAGTAAA AGATGTCTGA
TTAACACTGA TGGAATCACC AATATGCATA
ATCGTCCTGA CTGATTGACG AATTTGAACA
CCAGTAAACA TGCATCATCA CTTCTTTCGT
TCGGAATGAA CAATTTCAAA TTGGTTCGCA
TAAAATTGAT AATTTCCAGA TCCGCCAATT
TCTATATAAT TAGATTGGAG AAATTCTGAA
GTTCCAACTC CTCTTTTATA TAAATTGTGA
TAAATAATTT CAATATGTGT TTGATTGGTT
ATTGGTGTTG ATTCATCTTG ATAAATTTGC
AATATTACTT TTATAOGGTT TTTTCCATCT
AATTCAACTG TACGTCTTCC AGTTAACACT
TCTTGTTTAA CCTCTTTGTT AGTAATCCAT
CCATCTAAAC TTGCAGATAC TTTCACTGTA
AAAAAGTCTT GGTATAATTG TGATGCCCGT
TGAGCCCGTA ACGTCTCATC ACCATGTGTG
ACTTTTGCTA TCTTACAATC AATTATTTTG
CTACATGGCT CTAACGTAAT ATAAATCGTC
AGTGCTTGAA CCTCCGCATG CTTGTCACCT
CTACCTTCTT TAACTACAAC AGCGCCAACG
ATATTTGCAA GTTGAATCGC ATAATCCATA
AAAATCCTCA CATCATGAAT TAAGATGCAA
TTGTACACAT TTTTACAAAT ACGCTACATT
CATCCAGACT TTAACTGTCG GCTCTAGAAT
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CTGTTATCAT GCTTATCAAA TACCGTTAGA 4380 

TTAACTAACT GATTATTAAT ATGTTGGCTA 4440 

GTTTCATTTA TTTTTGAAAC TGTTCCTAGT 4500 

CCAAACCTAC CGTTACCACT CATGGCACGC 4560 

TCTGCTAAAT AGGTTTTATT TTCAGTGCCT 4620 

TTAAAATCAA TCACTGTTAA ACATGCACCA 4680 

TCTGCCGTAA TCTTATGTGC TTCAATTTCA 4740 

CTTTTAACGA CACCTATTTC TTCAACGATG 4800 

AAAGTTAATT TAACATTTTG ATTTAATAAC 4860 

TCTGGTATCT CAATCACATC ATTTGTTTGA 4920 

AATTTCGGOG CATAATAGAG AATAAATTCA 4980 

GTAGTGGTTG GACCTGCCTC GACTAGCAAA 5040 

AGAATTGTTG TTAAATCGCA AGACTTCAA6 5100 

GTTAAATTTG GATTTTCAGT ATATATCCAA 5160 

TGATTAAAAT GAATATTCCC AGACTTAGAC 5220 

TGAATACGTG TAGTATATTG TGGATCATCT 5280 

GCGTCGTGTC GATOTCTTAA CTTATAGACA 5340 

TGACTTTGTC CATTATCATT CGCTTGTTTA 5400 

ATTTGTGGCA GTTGCTTTGC TTTTGCTTTA 5460 

TCATCATCAA CGCATTCAAC CTCAATACCX5 5520 

TCTAACGAAT TGTCTTTTGT TGCGTATACT 5580 

TTAACACAGG GTGGTGTTGA ACCAAAATGA 5640 

GCACCTTCAG CATTTTGTTG TGCCATATCA 5700 

TTTCTCAAGT GTGCACCAAT ACCAACAATC 5760 

GGTGGATTAA CACCTGTTTG ACCTTGTACC 5820 

AATTGACTCA AATGATCACC TCTATAAACA 5880 

GGAGaAAAAT TTATCGTTAA ATAAGCCTAT 5940 

ATCTTTGTCG ATAATTAACA TTCTTTCTCC 6000 

CTCACTAGAT CAGCCACTAA TATGAAACAT 6060 
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10 



TTaTATATGA AATTGTTATA GATTATTTGA GTACGTAOTA TGTCAACTAC ATTTAAAATG 6180

ATACTATATG TTTTCTGAAA AAACAATTAA TGACGGTTTT AATTTAATAT AATCTGAGTA 6240

CTATAGGCAT CTCATTGATA TGATTCTTAC TAACAGACAT TAAAATCAAA CCTTCAATTC 6300

GTCTCTATAG AGCGTTCTCT TTATTATCTT CTAGTTACAA ATTATTGATT GtCACtGCGC 6360

TGTTGTTGCT CATTCGATTC TAAAGCATCA TATAATTGAG ATACTGTATG CGCAACTTGT 6420

TCTACAATCA TTTTCACACC GTTTCGTAGT TTATTAACAC CGTTTGTCAT TTGACCTATC 6480

GCAATCATAT TTGTTAATGT TCCAAACCTT GGACTAATAA CTTGATTGGT TTCCGGAATG 6540

ATTTGTATGC CTCCCATTGG GTGTGCTTGT ACAATTTGTC TATTTTCAAG ATTTCTAATT 6600

AATTGATCAT CTTGATCCAA TTCATTTAAA TGACTTTTTG CACCTGTCGC GTTAATGACA 6660

ACATTATATA TGTCTACTGA TTCTTGGTTT TTGTATGAAA AATAATACAA CTTGCCATaC 6720

ATGTTCACAT CTTCTAAATC TTTTTTCAAA ATTAAAGACT TATTTTCTAT TAATTCAATA 6780

ATTAGTTCAG CAGTTCTTGG AGGCATTGGA TTTGAATTTA ATTGAATCAT CTTTGAGTAT 6840

TTTTGATTAA ATTGATGTTG GTCTTCAATA CTTAAGCTAT TCCATATCCA ATTTAAATTC 6900

TCTTTCAAAT GTTCAATCAT ACTTTGGAAA ATGCCCaTTT CTGTTGGACG CGCTAAATCA 6960

TACTTCAAAT CTGCAATATG ATTTCCTGTA CGTCTATGTA CTAATTTTTT AAAATCAATG 7020

TCATATTCAG CACATTCTTT TAAAAATAAA GAAACTAAAG TATCAAGCGG TGCATTGCCG 7080

AAATGATGTT TTTTAATGTC ATTTAATTTG TCTTTAGTTA AGTACTTGAA TGTCACGTCT 7140

ATCATTGTAC CTCTTACACT TGGTAAATGA GCAGAACGAC TCGTCATAGT AATTGGTAAT 7200

TTTGGATGAT GAGCAGCAAC ATAACGGACA ACATCTAAAC TGGCAAGGCC TGTACCAATA 7260

ATCGCAATAT GGTCCAGTTC ATTTACTTCX5 TCTAACGTAT TATATGTTGG ATAAGGCGTA 7320

gcG/ttATATC CTTTTTTACC CTTTAAGTTA TATGGATCAT GGTAGGCAAA TGTACCACAT 7380

GTTAAAAATA CATAATCGTA CGCTTGCCAT GATTGTCCTG AATTTGTAGT ACATATGTAA 7440

TAAGTTAAAT TCGTTTCATC GATATTAGAA TTTGTATAAA TCTCTTGAAC TTTATTATAA 7500

TTAGTTGATA TATTTGGATA TTTTTTCGTG AACATAGATA AATAAGATTT CATATAATGT 7560

CCGAATACAA ATCTCX3GTAA ATATGCAG
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(2) INFORMATION FOR SEQ ID NO: 143: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10320 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double

(D) TOPOLOGY: linear
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AGTTGCCCTT CAGAAACTTT ACCTGTTCCA 
GCAACACCAA TCGCATATGC CAATTGTACT 
ATATTTTTAG CCACATAACG TGCAGCGTAT
TTACCACTGA AGCATCCGCC ACCATGACGT
TTACGTCCTG TTAATCCTGC ATCACCTTGA
TTGATGTAGA ATTTAGTTTG TTCATTAATC
TGTGCTTTAA TGTCTTCTTG AATTTGTTCA
ACGACAATCG TATCAATACG TACTGGGTTA
ACTTTACCGT CTGGTCGTAA ATAATTTAAC
TTT6CCAATT GATGTGATAA ATAAATTGCT
GCX3TAACCAA ACATTAAACC TTGGTCACCT
TCTTTATCAC GATACTCTAA TGCTTTATCC
TCAATCGCAG TTAAAATTGC CATTGTTTCA
CCAATTTCTT TAATTGTTTC TCTAACAACT
ATTTCGCCGG CGATCAATGC CATACCTGTT
TTTGGATCGT CTTTTAAAAT AGCATCTAAT
TCTGGGTGTC CTTCTGTAAC AGACTCTGAA
TGCTCCTTTA AATTTATATT ACGAAAATTC
CTAACTATTA ATATAGAGAG AAGGCCTAAT
TTTGTCTGCA AAcGGTTTGG CACCTTTCTT
GGTCCATGTC CCTCCACCAC TCAGGATAAG
AATGAATTAA TGTCAATTTT TCACAAATAA
GTTAATGTGT TATACTAATT AAATGTAAAG
TTTAGGATGT CAGTAGACAC ATACACTGAA
CCAACGTCAC ATTTTCAACT TTCGACGACA
GAAGGGGTAT TAACAGAACT TGGTGCTGTT
TCGCCTAAAG ACAAATTTTT TGTCTCTGAA
GAAATTAATC AACCTATCGA TGAAGAAACT
TATTTAGATA AAAAAGATGA ACTATACGTA
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AATGTATCAA TTGCAATTGA CACTGGTTCT 1800 

TCACATTGAT CTGCTAAACC TGCTGCAACA 1860 

GCAGCTGAAC GGTCTACTTT TGTAGGATCC 1920 

GCATAGCCAC CGTACGTATC AACAATGATT 1980 

GGTCCACCGA TTACAAAGCG TCCTGTAGGA 2040 

AAGTTTTCTG GAACAGTTGG ATAAATGACA 2100 

AGTGTCACAT CCTCAGCATG TTGTGTTGAT 2160 

TCATTTTCAT CATATTCAAC AGTGACCTGA 2220 

GTACCATCTT TACGCACATC TGATAAACGT 2280 

AGAGGCATAT ACGTCTCTGT TTCATTCGTT 2340 

GCACCTGTTG CTTCAATTTC TTCTTCX3CTA 2400 

ACGCCTTGTG CAATGTCAGG TGATTGTTCA 2460 

TAATCATAAC CATATTTTGC TCTTGTGTAT 2520 

TTCGGAATAT CAACATATGT TGTTGTAGAA 2580 

GTAACAGTTG TTtCACAAGC TACAC6TGCA 2640 

ATTGCATCTG ACACTTGGTC AGCGATTTTA 2700 

GTAAATAATC GTTTGTTATT TAACATAGTT 2760 

TCTCTCTGTG AGCTAAATAA AAAAGACCTT 2820 

ACGTCCATTC GCTCTTATCG TTCAGACCTA 2880 

TTATAAAAAA GAGGTTGCTG GGTTTCATTG 2940 

AGAATCCGTT AAAAATAATA GTACCTAATT 3000 

ATTTACAGTA AAATATTGTA GATTAATTAT 3060 

GCTTACATTT AAATTATCGC TTTGGAGGGA 3120 

ACAACTAAAA TTGACAAATT ACTGAAAAAA 3180 

CAACTTTATA ATAAAATCTT AGACAATAAC 3240 

AATGCAAGTA CTGGAAAATA TACTGGTCGT 3300 

CCTTCATATA GAGATAACAT TGATTGGGGA 3360 

TTCTTGAAGT TATACCATAA AGTACTAGAC 3420 

TTTAAAgGcT ACGCTGGTAG CGATAAAGAT 3480 
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ATGTTTATTA GACCTGAATC AAAAGAAGAA GCTACAAAGA TTAAACCTAA CTTCACTATC 3600 

GTTTCTGCAC CACATTTTAA AGCAGATCCA GAAGTTGATG GTACTAAATC TGAAACCTTT 3660 

GTCATTATTT CATTTAAACA CAAAGTCATT TTAATCGGCG GTACTGAATA CGCTGGTGAA 3720 

ATGAAAAAAG GTATCTTCTC TGTAATGAAT TATCTCTTAC CGATGCAAGA TATTATGAGC 3780 

ATGCATTGCT CAGCAAACGT TGGTGAAAAA GGCGATGTTG CATTATTCTT TGGTCTATCT 3840 

GGCACTGGTA AAACAACCTT ATCGGCTGAC CCACACCGTA AACTAATCGG TGATGATGAA 3900 

CACGGCTGGA ATAAAAACGG GGTCTTTAAT ATCGAAGGTG GCTGCTATGC AAAAGCAATT 3960 

AATCTTTCCA AAGAAAAAGA ACCACAGATT TTTGACGCAA TCAAATATGG TGCAATTTTA 4020 

GAGAACACTG TAGTTGCAGA AGATGGTTCA GTGGACTTTG AAGACAATCG TTATACAGAA 4080 

AACACGCGTG COGCTTATCC AATTAATCAC ATTGACAATA TTGTAGTACC ATCTAAAGCA 4140 

GCACATCCAA ATACAATTAT TTTCTTAACT GCGGATGCAT TTGGTGTTAT TCCACCGATT 4200 

TCAAAGTTAA ATAAAGACCA AGCAATGTAT CATTTCTTGA GTGGTTTCAC TTCTAAATTA 4260 

GCTGGTACAa GCGTGGTGTG ACAGAACCTG AACCATCATT CTCAACATGT TTCGGAGCAC 4320 

CGTTCTTCCC GTTACACCCT ACTGTTTACG CTGATCTATT AGGTGAACTT ATCGATTTAC 4380

ATGATGTTGA TGTTTATCTT GTTAATACTG GATGGACTGG CGGAAAATAT GGTGTAGGAC 4440 

GTAGAATCAG CTTACATTAC ACACGTCAAA TGGTAAACCA AGCGATTTCT GGCAAATTGA 4500 

AAAATGCAGA ATATACAAAA GATAGTACGT TTGGTTTAAG CATTCCTGTA GAAATTGAAG 4560 

ATGTACCGAA AACAATTTTA AATCCAATTA ATGCTTGGAG CGACAAAGAG AAATATAAAG 4620 

CACAAGCAGA AGATTTAATT CAACGTTTTG AAAAGAACTT CGAAAAATTT GGTGAAAAAG 4680 

TTGAACATAT TGCTGAAAAA GGTAGCTTCA ACAAATAAAT TTGAATACTA AATCaAAACC 4740 

ACCCSGTGTGA ACGGGTGGTT TGTTCTGCGG CTATAAGCCT TCCTTACTGG CCAGCCCTAA 4800 

AAGGGCACTG ACAAGTCAGC CAACTGCACT ACTATTCCAG CAACCCTAAA GGGTTACTCT 4860 

TTTTTCTTTC TTTTTTTATT TTTCTCTCCA GTGAAAGGAT CTAAATATTC TTCCATTGAG 4920

ATTTGGTCTG CAACGATATC CTCTTGTAAT TGATTACGAA TATAATTTTC AATCACTTTT 4980

TTATTTCTAC CTACTGTATC CACATAAAAT CCTTTACACC AAAACTTTCT ATTTCCATAT 5040

CTATACTTTA AGTTAGCATG TCTATCAAAT ATCATTAAAC TACTTTTTCC TTTTAAATAG 5100 

CCAACAAATG ATGATACCCC AAGTTTGGGT GGTATACTAA CTAACATATG GATATGATCT 5160 

TTACATGCCT CTGCTTCAAT TATCTCTACA CCTTTTCTTT CACATAATTG ACGCAATATA 5220

ATCCCTATAT CTTTTTTTAT TTTTCCATAT ATCACTTGTC TTCTGTATTT AGGTGCAAAG 5280 
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AAATAGCATC TCCTCGTGTT GATTATTTTG 
AGAGATGCAT TTTTTGTGAC AATGGTAGAA 
TTTTCTTTTT ACAATTAAAG AGCCAACCGT
TATTTTATGT GCTAAAAATT TATAGGCAAT
CTTCATGCCA AAATCGCATC ACTCATTTAA
GATCCCTAAT TCAACGCATG TAGTACCACA
ACCACGTTTC ATACTAGCTA TGCGACTCAA
ATGTTTCAAC CATCGCTGGT GGTCTTGGCA
CATGCGTGGC ACCTTTTAAC TCTAGTTGGT
CTTGCTGGTC TTTCCCTCCA TGTACAATTA
GAATCGCTTG GCGTGCCTCA TATGCCGCTC
GTAGCATGCC TCTTAAATCG ACACGTTCTT
CCCAGATTGT ATAACTTGTT ACTGGTAAGT
CTCGCGAAAA ACCAACCATG TGGATAAATG
ATAATTGCGT CACATCATTT AAATCGCCAC
TGTTACCTCG GTAGTATGGC CCAATCACTA
TACCTGCGCG CACACGTCCT ACTTGACCTT
TTACTTCATG ATGTGGTGTC ATCATTAAAG
TAATTTCTTC GAATTGATGC GTAAAATATT
CCAAGTGATT GCACCCTCTC TACXjCATTTT
CGTTGTGCGA GTTCAATATC ATTGATACAG
TAATCGTTCT TGCTTACCAA TGATTCAACT
GTTTGATCAT GTGTTTCAAT GGTATATTGT
CCTGTTTCTT CATATAATTC aCGTGTAACT
TTACCACCAG GAAATTCAAT CCCCCGTAAA
AACGTTGGAA TAGCTAGCAC ATGATTGCCA
AAATTAACTT GACGATTATC TTTATCCCTA
ATGTTAATAT AATAGTTAAT TACTATCGTT
GGCGATGATT CATTTTTATC AACGTTTCAT



GTTGGCTGAC CAATATTTAT TCTAGCACGT 5400 

CCTTTTCtGa ACCATACGCA TAGCGTATGG 5460 

TGTTATAGTC TAACAATGGT TGGCTCCTCT 5520 

TTTATTACAA CAATGTACAT TTAAGGTGAC 5580 

TGGAAGCAGC ACGTCTTCAT ATAAAGTACC 5640 

TCTTCAAAGC TTGATAGTTC CCATGCGCAC 5700 

CTTGGTTCAT AAACTCTTTA ATATAAGTCA 5760 

CATGTCCTTC TGCCATTTGA TAAAATGTTT 5820 

CCGCTAAATA ATACGCATGA TGAATACCAA 5880 

ATATTGGOOG ACTGTTTTCA TTAATGTTTG 5940 

GATCTTTTTT CGGATGACCA ATCATTCTTC 6000 

CATACATTAA ATCAATATCT GAGACACCAC 6060 

CTT6AAATGT CAACAATCCT TGTAAACCAC 6120 

CATGTGGATA TTTATCATGT AGCAACCTTA 6180 

GGTAAAATTC GTCTTTGCCT TCACTCCCAT 6240 

AAGTTTGACT ATCTQAAAAT TGCATTAATC 6300 

TGCCACCTCG CAAATAAACT ACAATGCGAT 6360 

CTTTTACTTG TAAGTCATCT GACAAATATG 6420 

CAATTGGCAT TCGTTTACGT TTGATAAAAC 6480 

AAAATGGTAC TATCTTGCAG TAAGAAACTC 6540 

TTAAACAACA CTGGCCCTGC TGTTTCTAAA 6600 

TCGATAAAAT ATACATCTTT TACAAAATCA 6660 

GCTATGTAAT AAATATTTTT AACTTTGGCG 6720 

GCTTCAGCAC TACTTTCCCC GCGTTCCCTT 6780 

TTATGTTTGG TAAAAAGCAA TTGATTTTTA 6840 

TCTGCTATCT CATTATCCTT TTTAAATGTC 6900 

AACTTCACGC GCATCACATC CCTACATTGT 6960 

GGAGGCATTA ATTATGAAAA AGATATTCTT 7020 

TTCGCCACTC ACTCCACCAA CTTGTCGTTT 7080 
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AACCTTGTTG CATAAGCCAA TTTAAAGTTT GTGGCACAAG 
CATTTTTAAT CATCGTTAAA ATTTGTTCGG CATTGAATTT 
AACCTTCAAT AACAGCTCTT AAAAGTACAC TGAGACCCGA 
CAGATAGCCA ATTAGTGTCA CGATCAAATC CCAAGCTCTC 
CATAATGATT ACGAAACGTT TGTGGCACCG CTTTTTGAGG 
ACATAATCGA TGCAATGTCA TCTAAATTAA ATGATGTATT 
CTTTCGGCAC CACAGTTTCA TTCGATGTTT CATATTGGAT 
AACTGTTCX5T TGTAATATCC CTTCCAGOGA ATTCAATATC 
ACCCTCGTAA TTCCAGTGGC AAGGTACAAA AAATCAATTG 
GATTCGTCAT CTCATTAGGT GTCAACCTTG TATTAATCAT 
ACCAACATGC ATGTATTAAA ATGATCGATT GAATCGAATT 
GAGATTGTTG ATAAGCCTT6 AGTCTTTTAG CCAATAGACT 
GATAAGTATA AGATTCTTGA CCGTCTGTTA TCGCAATATG 
GTTTATATAA CCAAAAGTCC ATGCX3TTATT CCTCCAAAAT 
CGATTTTATG ACATTCTAGC AGTGGTTATO TTTAAAAATA 
TGCATTGATA TGATTGTTAT AATGCTCAAT ACATATCGTT 
TCAGTTATTT TTATTTAATT TTAGTGTCAT TCTGTCATTT 
TGTTGCCACA TCATCTGCAA TGTCAATTGG TATACGGTTC 
ATGGAATACT TCATCATCTA AATTTTCAAT GAGATATACA 
TTTATATTTT AACGTTTTCC AAAAGTCCGG CTTGCAATTC 
TTCAATAAAT AAGTAACGTT TGCTGCCTAC TTTGTCTATG 
TTCTATACCT CTTATATGTG CATAGTCTGC TGAAAAGTAA 
ATGTTGTTGT ATTTCAAATC GTrGGCCTAC TATTTTATTA 
(2) INFORMATION FOR SEQ ID NO: 144:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1477 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144:



CGAAATGTGC 

ATCAACAATG 

AATATGATAA 

TTTACATCCG 

GCCCGTTGTC 

TAATATGTTG 

ACCCATTGTG 

ATCCAGCGAT 

TACATCGATT 

CGCAATTTCA 

ATCTATGTAT 

CGCTTCACAG 

ATGTCCATTT 

CATTTACATT 

TAAAAAAGTA 

ATATCATTCG 

TGATGTGGTG 

ATGTCTTGTA 

TAATATGTTA 

AATACATTAT 

AAATATTTTG 

ATACTACCTA 

TTTGTGCTAC 



GTGATTCGTT 

CGCACAGTAA 

ATCGGCAAGA 

ATTGCACTGG 

CCTGATGTAA 

GACGGCGACT 

TTGTCCAACA 

ACAATTTGAA 

GACTTCATCT 

ATATTTGCCA 

AGCCCAACAC 

TATAAATTTT 

TGTTGTGCTT 

ATAATTATAA 

GACGAATTGA 

TCTACTATTA 

ATTTACCCAT 

ATGCACTTAA 

CCTTGTCCTT 

CCGGAATATA 

CAGTGCCTTT 

TTGTTTCATT 

nGGGGACTTA 



9000 
9060 
9120 
9180 
9240 
9300 
9360 
9420 
9480
9540 
9600 
9660 
9720 
9780 
9840 
9900 
9960 
10020 
10080 
10140 
10200 
10260 
10320 
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10 



GTGTGGATTG GATTTTAAAA TCACCCTCAT AAATACTGTC ATCAATATGA TAAGTTACAA 120 

TTTCACCTAT TATTAAATCA GCCCCATCTA ATACATCTCC AAGCAATATC ATTTGCGmTA 180

GTTTACATTC GAATCTCATT TTCGCATCTT TAATTCCTGG CGTCTTAATC GTTGTAGATG 240

TTAAAAGTGA TAATTCTGTA CGACTCAACT CACTGTCACC ATATGCTAAC GGCGCTGCAG 300 

TCTCATTAAT ATCTTGAACA TTATCTTCGT CTGTAATATG CACAACAAAG TCTCCAGTCC 360 

GTTCTATATT TAATGCAGTA TCTTTTCTCT TACCTCCTGC ACX5TTGAACT GCAATAGCAA 420 

TCATTGGCGG ATGATTATTA ACAATATTAA AAAAGCTAAA TGGTGCTGCA TTTACTGATG 480 

CATCTTGATT TAATQTTGTA ACAAAAGCTA TAGGTOGTGG AATAATTGAA CCAATTAATA 540 

ATTTATAGTT TTCTCTAGCA GTTAATGATT GTGCATCAAA CGTATACATA ATACCTACCT 600 

CTTTTCTAAG TATATCTAGG TATTTCTCCG ATTTTGGTTA ATTTAAACAT CTATTCTCCT 660 

CTGAAAATCA CTTGTATTTA TTTAGCAAAT CTTTTGAAAT ATGACACATA TGCATATCTT 720

CTGGATATTT TTCTAAATGT TGCTGATGTT CTTCAGCACT TTTAATGTAG TTAGACAGCG 780 

GTAAGACTTC CACTGCAATT TGATCTCTGT CTTTACGTCG TTCAATGAAC TGACGOGCTT 840 

CAATTAAGTG GTCATCTACA CAACTATATA AACCCX3TTCG ATACTTTTGT CCAATATCAT 900 

TTCCTTGTTG ATTCACACTG TAAGGATCAA TGATTTCAAA TAAATAATTC ATAATGTCTG 960 

TAATTGTTAA CATACGATCA TCGAAATGAA GTTTGACACA TTCAGCATAA CCATCATACG 1020 

GACCGTCTAA TTTAGAGCTT CTTCCATTTG CTCTTCCTGC TTCTGTATGT ATAATTCCAG 1080 

GTATTGTTGC AAAAAATGCT TCAACACCCC ATAAACATCC TCCTGCTACA TAAACAACTG 1140 

CCATATTTAC ACCTCATCAT CCTTTTTTAT ATTTTTAACA AGGTTATACC ATTTAATACC 1200 

GCCATGACAT GATTCTGATA CACCTTCATT ACGATACCCA TATTTTTCAT AAAATGAAAT 1260 

TAATGATTCT CX3ACATGTTA ACGTTACACC ATGTCGATGA TGATTCTTAG CAAGAGTTTC 1320 

AAAATAGTTT AGTAAGCGAC CTGCAATACC CTGACCTTGA TAATTTGGTG CTACAACAAG 1380 

ACCTAACACA CTAATATAGC CACCTTCACT ATTATTTGTG GAGACATTTT TAAATAAATC 1440 

ATCGCTAATG TAACGCTCTT TTATGACTGG ACCGTTG 1477 
(2) INFORMATION FOR SEQ ID NO: 145:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3976 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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AGGTGATTAT CCTAAAAATG CTCATGAGGT CXICTATTAAT GATAAGTTAG CTGCAGACAA 60 

CATTAGAGTC GGGGATAGAT TACATTTTAA AAATAATTCA ACTAGTTATA GAGTTTCTGG 120 

TATTTTAAAC GACACAATGT ATGCGCATAG TTCCATTGTG CTATTGAACG ATAACGGATT 180 

TAATGCATTG AATAAGGTTA ATACGGCATT TTATCCAGTG AAAAATTTAA CACAACAACA 240 

ACGTGATGAG CTTAATAAAA TAAATGACGT TCAAGTTGTG AGTGAAAAAG ATTTAACAGG 300 

TAATATTGCG AGTTATCAAG CAGAGCAAGC ACCGTTAAAT ATGATGATTG TTAGTTTGTT 360 

TGCTATTACA GCAATCGTTC TAAGTGCATT TTTCTATGTT ATGACGATTC AAAAAATATC 420 

ACAAATTGGC ATTTTGAAAG CAATTGGTAT TAAGACAAGA CATTTATTGA GTGCGTTAGT 480 

TTTACAAATT TTAACACTAA CAATAATTGG GGTAGGTATT GCTGTGATCA TCATAGTAGG 540 

ACTATCATTT ATGATGCCGG TAACGATGCC TTTTTACTTA ACAACGCAAA ATATTTTATT 600 

AATGGTGGGG ATATTTATAT TAGTAGCGAT TTTAGGTGCC TCACTATCAT TTATCAAATT 660

ATTTAAAGTG GATCCTATCG AAGCAATTGG AGGTGCAGAA TAATGGCATT AGTCGTTGAA 720 

GATATCGTCA AAAATTTCGG AGAAGGTTTG TCTGAAACAA AAGTTTTAAA AGGTATTAAT 780 

TTTGAAGTGG AACAAGGGGA ATTTGTCATT TTAAATGGTG CCTCTGGTTC TGGGAAAACA 840

ACATTGCTAA CGATATTAGG CGGATTGTTA AGTCAAACGA GTGGTACAGT GCTTTACAAT 900 

GATGCGCCAT TGTTTGATAA ACAGCATCGT CCTAGTGATT TACGATTGGA AGATATTGGT 960 

TTTATTTTTC AATCTTCACA TTTAGTTCCT TATTTAAAAG TGATAGAGCA ATTGACACTC 1020 

GTAGGTCAAG AAGCGGGAAT GACCAAACAA CAAA6TTCAA CAAGAGCAAT AOU^CrTTTG 1080 

AAAAATATTG GTTTAGAAGA TCGCTTGAAT GTATATCCGC ATCAGTTATC TGGCGGTGAA 1140 

AAGCAACGTG TTGCGATTAT GAGAGCATTT ATGAATAATC CGAAAATCAT TTTAGCAGAT 1200 

GAGeCCACAG CAAGTTTAGA TGCCGATAGA GCAACAAAAG TTGTTGAGAT GATACGTCAA 1260 

CAAATTAAAG AACAACAAAT GATTGGTATT ATGATTACAC ACGATCGAAG ATTATTTGAA 1320 

TATGCAGATC GAGTGATTGA ATTAGAAGAT GGCAAAATAA CTGATTAGTG GCTTGTAAAG 1380 

ACGCTAAATG TTAATGATTT AAGACATAGT AGTATAAAAG TTAGATAACA GPJKTACGATT 1440 

TGGGTTTACA AAAAACAGGC TGQGACATTA AGTTCTTAGG CAATGTAAAA AAGCTGATTT 1500

CTATTAATTA TTTGATAGAA ATCAGCTTTT TTGATATGTA TTTTATAATG TACAGCTCGT 1560 

TGCATTCATA TAGCTTGAAG TCACGTTTAA AACCATATCT ATCATTATGG TATGCATATC 1620 

TTTTAAAACC TATTCTTTTG TTATTAGGAC ATATAAATTC ATCATTAAGT TCGTCATATT 1680 

TCCAATTTTG AGTGTTAAAA ATGTCACTTT TAAACTTTCT AGTITTATCT TTAATAAACA 1740 
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TATGGTAAAA CATTTACAAG ACCATATTCA ATTTTTAGAG CAGTTTATAA ATAACGTTAA 3660 
CGCATTAACT GCAAAAATGT TGAAAGATTT ACAAAATGAA TATGAAATTT CATTAGAGCA 3720 
GTCTAACGTA TTAGGTATGT TAAATAAAGA ACCTTTGACA ATTAGTGAAA TCACGCAAAG 3780

ACAAGGTGTA AATAAGGCCG CAGTAAGCCG ACGAATTAAA AAGTTAATCG ATGCTTAATT 3840 
AGTTAAGTTA GATAAACCAA ATTTAAATAT TGATCAACGT TTGAAATTCA TAACCTTAAC 3900 

TGACAAAGGT AgAGCATATT TGAAAGAACG TAATGCGATT ATGACAGATA TTGCGCAAGA 3960 
TATTACTAAT GATTTA 3976 
(2) INFORMATION FOR SEQ ID NO: 146: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3346 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146:



GCTACCTAGG CATTTAAGAG ATCAAAAAAT GTATGAATAT GAACGTTATT TTTATGAGCA 60

AGAACTTAAT GGCGTTGATG aAGGGGAAAT TTTAAAGAAG TTAAAAGACC CACAAGATGT 120

TGCAGCTGAA ACAAAAGCTA GAAGTGTTAT TGATTATGCT GAATCTAAAC CAACATTTGA 180

AAATATTTCA AGAGCTGTTG CTGCTTCATT AAGTTTAGGC ATTCTATCTA TTTTTGTCAT 240

CCTTATACCA GTATCTATAG TTGGATTATT TGTATTAGCA TTATTTTTAA TATCACTTTT 300

GCTGCTGTTT TGTCCAATTA TTTTATTAGC ATCAGCAATA TCCAGAGGAA TTGTGGACTC 360

AATTAGTAAT GTATTTTTTG CCATATCATA TTCAGGATTA GGATTAGTAT TTATCATTGT 420

CATATTTAAG ATTTTAGAAT ACATTTATCG TTTAATCTTA AAATATTTAC TTTGGTATAT 480

TAAAACTGTC AAAGGAAGCG TTAGAAAATG AAGAAATTCT TTTTTATTGG GCTTTTAGTG 540

TTTGTTGTCT TTTTTACAGC AGCAACCATT ATTTGGTTCA GCTATGATAA AAACAAATAT 600

GGTACTAAAC AATATGATAA AACATTCAAA gACGATGCTT TTGACAATGT ATCTATAAAT 660

TTGGATAGTA CAGAACTTCG TATAAAACGG GGGAATCAAT TTAGAGTTAA ATATGATGGT 720

GACAATGATA TATTAATTAA TATAGTAGAT AAGACGTTGA AGATTAGTGA TAAAAGGTCT 780

AAGACAAGAG GATATGCAAT TGATATGAAT CCTTTTCATG AGAATAAGAA AACGTTAACG 840

ATTGAAATGC CTGATAAAAT GATTAAACGT TTAAATCTAT CATCTGGAGC AGGAAGTGTT 900

AGAATCAGTG ATGTTGATTT AGAGAACACA AGTATTCAAA GCATTAACGG TGAAGTAGTT 960
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AGTAAAAGTA ACATTAAAAA TAGCAATATT 
AAAGTTGTTA TTGGTACGCT ACAAATCGAC 1080 
AAGAGTCAAA TTAAACAATC CATATTTTTA AACGATCATG GTGACATTGA ATTTAAAAAC 1140 
ATGCCATCAA AAGTAGATGC AAAAGCTTCT ACTAAACAAG GAGATATTCG TTTTAAGTAT 1200 
GATAGTAAAC CTGAAGACAC TATACTAAAG CTAAATCCGG GAACGGGTGA TAGCGTAGTT 1260 
AAAAATAAAA CATTTACTAA TGGtAAAGTT GGGAAAAGCG ACAATGTTTT AGAATTTTAT 1320 
ACGATTGATG GTAATATCAA AGTTGAATAA ATAAAGGATG TAAGCACCGA TATTAGGAAG 1380 
CATAATTTCT CTAATATCGG TGTTATTTAT TTGTTGGCAA AAGTTAAGTC GGTATCTATA 1440 
TTGCCAGTAA AGTGAGTGAT ATTAAGGTCT TGACCATCTA ACCATGATTT GAAATCTATT 1500 
ATTTCTGGTG GCGCATTTTC TCCCAATGTA AAATATGCAG TTAATGTTTC AGGTTGATAC 1560 
ATTGATGTAT GGATGGTGCC AGACCAGCTT TTGAATAGTt TACTGTAAAT TTCATACTGA 1620 
GGATTATTGA ATAACTTAAA TGCTGTAGTC ATATCTAAAT TATCATTAGT TTGTGAAATG 1680 
6TACGCGCCA GTCTTTCTTT AGATTCTTTT GTATAATTAC GATTTTCATG TGTTAATATT 1740 
TCAAAATGAT TTGTACATAT ATTATCATAA CGAACATCTA TTGATCTCGG TGTCACTTCA 1800 
ACAATTGCAT GGTTCAATGA TTTGTCCATC AGTATGTAGC TAAATGAGCT TCTGTGTGGT 1860 
ATTTCTTTCA ATAATTGGAT TGCTTCTGTT ACATTTCGGC AATTTTCAAG AATTAGACGA 1920 
CCAATCATAT AACATACAAA ACCATTT6CT GGTTTCTTCC GGTGCATAAA GTTATAGCCC 1980 
ATAGTTAATC CTGACTCATT CATACCATCC ATTCTTCCAG TTACCCTTGA TACAGGACCA 2040 
ATTTGAGCTA AACCGCTATC TGTAGGTTGA TAAAGTAAGT AGCGACCATC ATAAGTTGCA 2100 
GGGTGGTAAT CATAATTTCT AACCATGAAG TCTTTGCCTT GAAAGACCGT GCAaCCACTT 2160 
TCTTTTAAAT CGGTAAAACG ATAATGTCCA AAGTTTAAAA TAATTTGGCG TGTTGGCATT 2220 
TTGAGTATAC TTTGTAGTCC CATTAATTCT TCCCATATTT GAGGTGCGTA TGTTTGGAAT 2280 
ATTTGATAAG TTTCATTTAC ATCTATATCG AAACGTGGGA CaCnTTTTTT CCATTCTTTT 2340 
TCTCGATTTT TTAGAAGAGG TGTTTGTTGA AGCCATTTAC CAGTTTTAAC ACCTAACTCG 2400 
AAATGTGAAC CTCTAAAAGT CATGATATCT GATGTCACTT GTTGCATATC ATCGGCCCCT 2460 
TTCTTTTTAG TTGTAATATA TTGTAAATAA ATAGTAATCG TATGTATATT GAATGTCATG 2520 
TTAAATAAAG TTATATTTTA CTAAATGAAA TATAAAATTG TTTGAGGTGA TTTCTCGGTG 2580 
TATAAGACTT ATCAATCAGT TAAAACATAT TTTTATAGAT GGT6GGGATA TTGAGTTAAA 2640 
AACTTAAAAT CATCTTATCA TAAATATCAA TCTTAAGTTA GCATTCACGA TAATAGTCAT 2700 
TGTTAACATT AGCATATAAG GTCATGTCAC GTTGAAACAG AGGTTCCTCG GCATTTTTGA 2760 
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TTATTTAATG ATTATTCTAT ATATGATAGT ATAATGJAAAT GTAGATAGGT ATTTAATTTA 2B80 

ACAGAGGTGA AATTGAGATG TGGAATTTTA TTAAATGtGT GkTTAAATTC GTATTTAGCT 2940 

TAGTTGCTAT TACAACATTA GTTGCTGGTG TTGGTGTAGT AGCATTTGCT TATATCTTTA 3000 

AAAAAGATTT TGAAGATATT GAAAGAAAAA CTAAAGAAAT TATTTCTGAT ATTGAAAGTA 3060 

AAAATAACTA ATAACATTTA GAGGCTGGGA CATAAATCCC TAAAAAACAG CAGTAAGATA 3120 

ATTTTCAATT AGAAAATATC TTACTGCTGT TCTCTATTTn ATcAmTACTt CGTATTGAAT 3180 

GGCTTCGCTT TCCTAGGGTG CCGTCTCAGC CTTGGTCTTC GACTGGCACT GCTCCCTCAG 3240 

GAGTCTCGCC ATTAATACTA CXSTATTAACA TGTAATTTTA CTTTGGAAAT ACTTTTAAAA 3300 

AATAAGACAC TTTGGCCCAA CTTGGCACAT AAATGTAAAA TTCAAT 3346 
(2) INFORMATION FOR SEQ ID NO: 147: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2375 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147: 





6TTGAAGAAA GAAATATAAC AGTCAATTAT AATTATAACC TTGTTGAAAT CGACGGTGAC 60 
AAAAAAGTGG CTACATTCGA ACATATCAAA GCATACGATA GAAAAACAAT AAGTTATGAT 120 
ATGTTACATG TAACACCACC TATGGGTCCC TTAGATGTAG TAAAAGAAAG TACACTTTCA 180 
GATAGTGAGG GTTGGGTAGA TGTTAACCCA ACCACATTAC AGCATAAAAG CTACTCTAAT 240 
GTATTTGCAC TTGGTGATGC TTCAAATGTA CCTACTTCAA AAACAGGCGC ACTATTcGTA 300 
AGC^AGCACC TATCGTCGCT AATAATTTAT TGCAAGTGAT GAATAATCAA ATGTTAAOGC 360 
ATCATTATGA TGGTTATACT TCATGCCCTA TTGTTACTGG ATATAATAGG TTAATACTTG 420 
CAGAGTTTGA TTATAATAAA AATACTAAAG AAACAATGCC GTTTAATCAG GCCAAAGAAC 480 
GTaGAAGTAT GTATATATTT AAGAAAGATT TATTACCTAA AATGTATTGG TACGGCATGC 540 
TAAAAGGATT AATATAATAA AGTACAGAAA ACAATAAATT TTTAATGAAA AATCTTTTAC 600 
TATAAAAGAT TAAGTATTTA AATGACGTGT CAGTGTTGTG TTTATATGTC GTGAATTTTT 660 
AGCTCTAAAT AGTATAAGAT TGAAAAAGTT GTTACTGTTT TAAATGATCA CGATGAAGTC 720 
ATTCAATAAG AATGATTATG AAAATAGAAA CAGCAGTAAG ATATTTTCTA ATTGAAAATC 780 
ATCTCACTGC TGTTTTTTAA AGGTTTATAC CTCATCCTCT AAATTATTTA AAAATAATTA 840 
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AGATATTCAA ACCACGTGTA CTCAAAATGA TAGCTTGGTA TGTACCTCCA ATAGTAATTT 960 

CAATAACTTT GTCTGTTGAA CACTAAGAGC AATTTTAATT TCATAATGTG TTGTAAACAT 1020 

TTTTTTTGAT TGGAGTTTTT TTCTGAGTTA AACGATATCC TGATGTATTT TTAATTTTGC 1080 

ACCATTTCCA AAAGGATAAG TGACATAAGT AAAAAGGCAT CATCGGGAGT TATCCTATCA 1140 

GGAAAACCAA GATAATACCT AAGTAGAAAG TGTTCAATCC GTGTTAAATT GGGAAATATC 1200 

ATCCATAAAC TTTATTACTC ATACTATAAT TCAATTTTAA CGTCTTCGTC CATTTGGGCT 1260 

TCAAATTCAT CGAGTAGTGC TCGTGCTTCT GCAATTGATT GTGTGTTCAT CAATTGATGT 1320 

CGAAGTTCGC TAGCGCCTCT TATGCCACGC ACATAGATTT TAAAGAATCT ACGCAArCTC 1380 

TTGAATTGTC GTATTTCATC TTTyTCATAT TTGTTAAACA ATGATArATG CAATCTCAAy 1440 

ArATCTAATA GTTCyTTGCT TGTGTGTTC6 CGTGGTTCTT TTTCAAAAGT GAATGGATTG 1500 

TGGAAAATGC CTCTACCAAT CATGATGCCA TCAATACCAT ATTTTTCTGC AAGTTCAAGT 1560 

CCTGTTTTTC TATCGGGAAT ATCATCGTTA ATTGTTAACA ATGTGTTTGG TGCAATITCG 1620 

TCACGTAAAT TTTTAATAGC TTCGATTAAT TCCCAATGTG CATCTACTTT ACTCATGCGT 1680 

TTGATAAAAA CTTAAATAAT ATTAATTCGG TCATCAGTGG CGTTAAATCT TTTATCATTT 1740 

TTAGTTATAG TTGATAAATT TATATTTATA AGCATATATG GATATTTCAT CAAAAATTTT 1800 

TATTTATATA AATCCGAACT GCATACATAT TTGTTTAAAT AAGAGGTATT ATTTTTCGGG 1860 

AAATTGCTGT CTGAGTTAAA AGGATTAGTT TTATAAAATG AGTTGAACTA TAGCCAAAAA 1920 

CGATTAAAAT ACTGATAATC CATTTTTGtA TTATGTTAGG GACTTTTTTA CTTAATTTTA 1980 

ACCCTATTGG aGCmAATATA ATACTCCCTA TTATAAGGAA TAAGGCGTCA TATAAaGGGA 2040 

TATAACCTTG AATAAGTTTG ATGACAAAAG CACCAATTGA AGATATAAAA GCAATTACTA 2100 

TACtATTAGC GACTACAGTA TTCATTGGTA ATTTGAATAA AACCAATAAT ATAGGAATAA 2160 

TAATGAAGGC ACCACCTGCA CCTACTATAC CTGAAATAAT ACCAATGAAA AGGCCAATGA 2220 

TAACTAATAA ATATTTATTA AATGAAGACT TTTCGGAACT AGGTTtCACT TTAATAAACA 2280 

TTAATGTTAA TGCAAGTAAA GCAATAATGA TATATACCGT ATTTACAAAT GTAGCATCAA 2340 

ATAAATTTGC TAGAAATGCA CCTAACATAC TCCCT 2375 
(2) INFORMATION FOR SEQ ID NO: 148: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6115 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148: 

GAGGTTTCTA GACAAGCTTT TAATAACTTA CCAAACTCAT TAAgrTGGTT gTGtTGGACT 60 

GCCtATTATC mAAGtATTAT GaGTTGTTTA ATATTAGtGC TAArACATAC GAAGAGTGGT 120 

TTAAACAATT TAGTAGTAAG AAAGCACAAT TCAGTATTAA TCTCACGGAT AAATGGATAA 180 

TTCAAATCGC ATATGGTAAA TTAATAATAA TGGCTAAAAA TAATGGCGAT ACATATTTTA 240 

GAGTTCAAAC AATTAAAAAG CCAGGTAATT ATATTTTTAA CAAATATCGA TTAGAGATAC 300 

ATTCTAATTT ACCAAAATGT TTATTTCCGC TTACAGTGAG AACACGACAA AGTGGCGATA 360 

CATTTAAACT GAATGGGCGC GATGGTTATA AGAAAGTGAA TCGCCTGTTT ATAGATTGTA 420 

AAGTGCCACA GTGGGTTCGG GATCAAATGC CAATCGTATT GGATAAACAA CAGCGCATTA 480 

TTGCGGTAGG AGATTTATAT CAACAACAAA CAATAAAAAA ATGGATTATA ATTAGTAAAA 540 

ATGGAGATGA ATAGCGTTAT GCATAATGAT TTGAAAGAAG TATTGTTAAC TGAAGAAGAT 600 

ATTCAAAATA TCTGTAAGGA ATTGGGAGCA CAATTAACAA AGGATTATCA AGGTAAACCA 660 

TTAGTATGCG TGGGTATCTT AAAAGGCTCA GCAAT6TTTA TGTCAGATTT AATTAAACGA 720 

ATTGATACCC ATTTATCAAT TGATTTCATG GATGTTTCTA GTTATCACGG AGGCACT6AG 780 

TCAACTGGTG AAGTTCAAAT CATTAAAGAT TTAGGTTCTT CTATTGAAAA TAAAGACGTA 840 

TTAATTATTG AAGATATCTT AGAGACTGGT ACTACACTTA AGTCAATTAC TGAATTATTA 900 

CAATCTAGAA AAGTTAATTC ATTAGAAATA GTTACTTTAT TAGATAAACC AAACCGTCGT 960 

AAAGCGGACA TTGAAGCTAA GTATGTAGGT AAAAAAATAC CAGATGaATT TGTTGTTGGt 1020 

TACGGTTTAG ATTATCGTGA ATTATACCGA AACTTACCAT ATATCGGTAC GTTAAAACCT 1080 

GAAGTGTATT CAAATTAATT TTTTAATCAA TTTCAGTTAT TATTACTATG CGTTTGAGAA 1140 

ATAATAGTGT AGACTCAAAA ATATGAAAAA TGTATTTCAT ATATATTTAA TTTTAGACAA 1200 

GACATATGTC TTGAAAAGTT GAAAAATATA GAGATTGATA AAACTAATAC GGGTGTGAAT 1260 

GACATTGATG TTAAGCTCAA TTACTAGCTT ATAAAACATG TCATATGTTA CAATTTTTGT 1320 

TAGTTTTATT ATGGGAAGTA GGAGGAAATG ACGCATGCAG AAAGCTTTTC GCAATGTGCT 1380 

AGTTATCGTA ATAATAGGCG TTATTATTTT TGGTCTATTT TCATATTTAA ACGGTAATGG 1440 

AAATATGCCG AAACAGCTTA CATATAATCA ATTTACTQAG AAGTTGGAAA AAGGTGACCT 1500 

TAAAACTTTA GAAATCCAAC CACAACAAAA TGTCTATATG GTAAGTGGTA AAACGAAAAA 1560 

TGATGAAGAC TATTCATCAA CTATTTTATA TAACAACGAA AAAGAATTAC AAAAAATTAC 1620 

TGATGCTGCT AAAAAGCAAA ACGGTGTAAA ATTAACGATT AAAGAAGAAG AAAAACAAAG 1680 
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TTTCTTCCTA 
ATCTAAAGCA 
^ GGCAGATGA/V 
ATTCAAAGAA 
TGGTAAAACA 

TO 

TAGTGGTTCA 
ATTCGATAAT 
TGGTCGTCAA 

IS 

CCAATTATTA 
TACAAACCGT 

20 AATTCAAGTT 
AAACAAACCA 
CTCAGGTGCT 

25 TAAAAAGAAA 
TGCTAAGAAA 
TGGTCATACA 

30 

TGTTCCACGT 
AATGACTGAA 
AGATATTAAC 

3S 

AATCGCACGC 
CGGTCATAGC 
AAGCCAAATC 

40 

ACGTTGTAAA 
ATTAACAGAA 

45 ACCTGAAATT 
TGGTAAATTC 
TGACGAAAGT 

^ TAAATCTGAT 
CGATCCAAAT 



AGCCAAGCAC AAGGTGGCGG 
AAAATGTACG ATAATAATAA 
GAAAAACAAG AATTAATTGA 
ATGGGATCTA GGATTCCTAA 
TTACTTGCTA GAGCGGTTGC 
GACTTTGTAG AGATGTTTGT 
GCTAAGAAAA ACGCGCCTTG 
CGTGGTGCAG GTGTTGGTGG 
GTTGAAATGG ATGGTTTCGG 
CCTGATATCC TTGACCCAGC 
GGTCGTCCAG ATGTGAAAGG 
CTTGATGAAA CGGTTGATTT 
GATTTAGAGA ACTTATTAAA 
ATTGACATGA GAGATATCGA 
TCTCGAGTTA TTTCTAAGAA 
ATTATCGGTA TGGTACTTGA 
GGACAAGCAG GTGGTTATGC 
CAAGAGTTAT TAGATAAAAT 
TTTAACGAAG TATCAACAGG 
TCAATGGTTA CGCAATATGG 
AATGGTCAAG TATTCTTAGG 
GCATATGAAA TTGATAAAGA 
CAAATTTTAT TAGAGCACAA 
GAAACATTAG TTGCTGAACA 
GATTATGATG CAGCTAAAGT 
GGTAAATCTT ATGAAGAGAT 
GAAGATCGTA AAGAAGAAAA 
GAAAAAGATG AACCAGCACA 
CACCCAGACA ATAAATAATC 
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TAGTGGCGGT CGTATGATGA 
ACGTCGTGTT CGTTTCTCTG 
AATTGTTGAT TTCTTGAAAG 
AGGTGTCTTA CTTGTTGGAC 
AGGTGAAGCT GGCGCACCAT 
TGGTGTTGGT GCGAGCCGTG 
TATCATCTTT ATCGATGAGA 
CGGTCATGAT GAACGTGAAC 
TGAAAATGAA GGTATCATTA 
CTTATTACGT CCAGGTCGTT 
CCGTGAAGCA ATTCTTCATG 
AAAAGCAATT TCACAACGTA 
TGAAGCATCT TTAATTGCTG 
AGAGGCAACG GATAGAGTTA 
AGAACGTAAT ATTGTTGCTC 
TGAGGCAGAA GTAGTGCATA 
AATGATGCTA CCTAAACAAG 
CTGTGGTTTA CTTGGTGGAC 
TGCTTCAAAT GACTTCGAAC 
TATGAGTAAA AAATTAGGAC 
TAAAGATATG CAAGGTGAGC 
AGTTCAACGA ATCGTTAAAG 
AGAACAATTA ATTTTAATTG 
AATTCAATCA TTATTCTACG 
TGTTAAAGAT GAAGATTCTG 
TCGTAAAGAG CAATTAGAAG 
AGATATTGCT GAGGATAAAA 
TCGACAAGCC CCAAATATCG 
GATTATATTC AGTACCTCTT 



ACTTTGGTAA 1800 

ATGTAGCAGG 1860 

ATAATAAAAA 1920 

CTCCAGGTAC 1980 

TCTTCTCTAT 2040 

TTCGTGACTT 2100 

TTGATGCTGT 2160 

AAACCCTAAA 2220 

TGATAGCTGC 2280 

TTGATAGACA 2340 

TTCATGCTAA 2400 

CACCTGGTTT 2460 

TAGGTGAAGG 2520 

TAGCCGGACC 2580 

ATCACGAAGC 2640 

AAGTTACTAT 2700 

ATCGTTTCTT 2760 

GTGTATCAGA 2820 

GTGCAACACA 2880 

CATTACAGTT 2940 

CTAATTATTC 3000 

AACAATACGA 3060 

CTGAAACATT 3120 

AAGGTAAATT 3180 

AATTTAATGA 3240 

ATGGACAACG 3300 

AAGAAGCTGA 3360 

AAAAACCTTA 3420 

TCTATGATAA 3480 
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AATTGTTATA GCAGAAAATA ATTGTAAAAC AAGTTACTTC ATTATTTAGA ATGATGGGTG 3600 

TAGAATAAGT ACAATTGTTG CAITTTATGA AGTAAAGTAA TTTTTTAAAT ATAGAGTAAT 3660 

AGAGGAGATT GAAATAATGA CACACGATTA TATTGTTAAA GCATTAGCAT TTGATGGAGA 3720 

GATTAGGGCT TATGCT6CTT TGACAACTGA AACTGTTCAA GAAGCACAAA CGAGACATTA 3780 

TACATGGCCG ACAGCATCTG CTGCAATGGG AAGAACAATG caCAGCAACA GCTATGATGG 3840 

GCGCAATGTT GAAAGGTGAT CAAAAATTAA CTGTCACTGT AGATGGCCAA GGACCTATTG 3900 

GACGAATTAT TGCCGATGCA AATGCTAAAG GCGAGGTGCG TGCTTATGTA GACCATCCAC 3960 

AAACTCATTX TCCATTAAAT GAGCAAGGTA AACTTGATGT AAGACGAGCG GTAGGGACAA 4020 

ATGGATCTAT TATGGTTGTT AAAGACGTTG GAATGAAAGA CTATTTCtCT GGAGCAAGTC 4080 

CaATTGTTTC AGGAGAACTT GGTGAAGATT TTACTTATTA TTATGCTACA AGTGAACAAA 4140 

CACCTTCATC GGTAGGTCTT GGTGTATTGG TAAATCCTGA TAATACGATT AAAGCAGCAG 4200 

GAGGATTTAT CATTCAAGTT ATGCCAGGTG CCAAAGATGA AACAATTTCA AAATTAGAAA 4260 

AAGCAATTAG TGAAATGACA CCAGTTTCTA AATTAATTGA ACAAGGATTA ACXX:CAGAAG 4320 

GATTACTAAA CGAAATCTTA GGTGAAGACC ATGTGCAAAT TTTAGAGAAA ATGCCTGTTC 4380 

AATTTGAATG TAATTGTAGT CATGAGAAAT TTTTAAATGC TATTAAAGGA TTGGGCGAGG 4440 

CTGAGATTCA AAATATGATT AAAGAAGATC ATGGTGCTGA AGCAGTATGT CATTTCTGTG 4500 

GAAATAAATA TAAATATACT GAAGAAGAAT TAAACGTGTT GCTAGAAAGT TTAGCGTAAT 4560 

TTAATTTAAA TCAATACGCT AAAATGTTTA TTTTTAGCGG TTTAGTGAAA TGTAGAACTA 4620 

AATAGTTGTA TAATCCTTAG TGATTTTGTT TGCTTTCTAG AATTTATTTG ATAAAATAAT 4680 

TCTATATCCG ATAAATAAAC TAAGATTTCA ACAACTAACT AAAAAGGAGT GTTCTTAATG 4740 

GCAQ^AAAAC CAGTAGATAA TATTACTCAA ATTATTGGCG GTACACCGGT AGTCAAATTG 4800 

AGAAATGTAG TAGATGACAA TGCAGCAGAT GTTTATGTAA AATTGGAATA TCAAAATCCA 4860 

GGTGGTTCTG TAAAGGATAG AATTGCTTTA GCAATGATTG AAAAAGCAGA GCGAGAAGGC 4920 

AAAATTAAAC CTGGCGATAC AATTGTAGAA CCAACAAGTG GTAATACAGG TATCGGTTTA 4980 

GCATTTGTAT GTGCTGCTAA AGGATATAAA GCAGTATTTA CTATGCCCGA AACAATGAGC 5040 

CAAGAGCGTC GTAATTTATT AAAAGCATAC GGTGCGGAAT TAGTTTTAAC GCCTGGATCA 5100 

GAAGCGATGA AAGGTGCAAT TAAAAAAGCT AAAGAATTGA AAGAAGAACA TGGTTACTTC 5160 

GAGCCACAAC AATTTGAAAA CCCTGCGAAC CCTGAAGTTC ATGAGTTAAC TACAGGTCCT 5220 

GAGTTATTAC AACAATTTGA AGGGAAAACT ATCGATGCGT TCCTAGCTGG TGTTGGTACT 5280 

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5 



GTTGCTATAG AGCCTGAGGC TTCTCCAGTA TTGAGCGGTG GTGAGCCAGG TCCACATAAA 5400 
TTACAAGGTT TAGGTGCTGG ATTTATTCCA GGCACTTTGA ATACAGAAAT CTATGACAGT 5460 
ATTATTAAAG TAGGAAATGA TACAGCGATG GAAATGTCTC GTCGAGTTGC TAAAGAGGAA 5520 
GGTATTTTAG CAGGTATTTC ATCAGGTGCT GCGATTTATG CTGCCATTCA AAAAGCAAAA 5580 
GAATTAGGAA AAGGTAAAAC AGTAGTAACA GTATTGCCGA GTAATGGTGA ACGCTACTTA 5640 
TCAACACCTT TATATTCATT CGATGACTAA TTAATGTCAT TTAAAAGAGT GAGTTATCTT 5700 
TTTGAGATAA CTTGCTCTTT TTTTCTACCA TGTATATTTT TAAAAATATG AGCGTTAAAT 5760 
TAAACATTTT TCTGATAAAA ATATCCAGTG AATGATAAGA TAATAAACGT ACATACTAAT 5820 
AACTAGTAAA TAGCAGGAGT AAATTTTATT AGAGTTAAAC AATACATAAT TAAAGGGTGG 5880 
TTAACATGAC TAAAACAAAA ATTATGGGcA TATTAAACX5T CACACCTGAT TcATTCTcAG 5940 
ATGGTGGAAA ATTTAATAAT GTTGAATCAG CTATAAATAG aGTGAAAGCC ATGATAGATG 6000 
AAGGTGCTGA CATTATAGAT GTTGGAGGTG TTTCAACGAG ACCCGGTCAT GAAATGGTTT 6060 
CATTAGAAGA TGAGATGAAC AGAGTATTAC CTGTTGTTGA AGCTATTGTC GGTTT 6115 



(2) INFORMATION FOR SEQ ID NO: 149: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10401 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149: 

TAGATACTGG GnTAAAcaTc AAAAATAtyT GCtTaTTCaC GTGTTTAcGc TCCCtCAAAC 60 
GCAACGTTAA TTGCGTGTAA TCATTTAGTG TGAATTcAGA CGCTTCTTCC ATGACTATGT 120 
CTGATATGCC TTTTATCGAC TTTATTTTCT CTGGGTTATC TAATCCTTTA AACAAAAAAA 180 
CTGCGCCGTT TGGCAATTCA ACTTTGTTAT CAGTCTTATT CCAAAGGCAC ATGTCCCAAA 240 
TACCAAAGTT TATCAAACAA TCTTTAACAT CTTCGAACAA ACTATCTTTA AITGTTGATT 300 
GTACTTTTCT AAGCCACAGT ATACGCCTAG GATATTTCCA ATCTTGCAAT GCTTTGAGTA 360 
CAACTTTTTG TATAACGCCG TGAGACTTAC CGCTCGAACC TCCACCGTAA TGkACTTCAG 420 
TGAAGTtATC GTAATTGGTT AGTATTTCGA ATATGTTTCT ATTGAAAACA TTAGACGGTT 480 
TGTTAAAGTT TAATTTAACT TTCGTCATCG TACTCACCAA TATTAATCTC AATATTCTTC 540 
TOAGTAATTT CTTTTTTATC GATATACGCA CCATGTACTT TTA6TATGTG GTCAATAGAT 600 




TTTAAATGGT CATATTTCTT ACTGTAAGCC 
GATAACGCTA AAGCTTCTGT AATACTCATT 
5 TCTTTAATAT ATTCCXSAAAC ATTAACATTT 

GTTTTCTTAC TATAACCTGC TGTAATTGCT 
TATTCATCTG CGAATCTCTT TTGTTTTTCX5 
ATACGCTTTT TAAAATTAAA AAAGGATTGG 
ATTCCTAATT GCGATAAGGG AAACGCAGTA 
TCTCATTTAA GGTATCAAAA ACTGCCACTT 

IS 

TCCGCCAATC TAGATATGAT TTTTCTTTTG 
ATGTCAACAC AAACTTTCAC TAATTCCTTT 
CGTTCTTTCC TGCTTGATGT GTTGATTATA 

20 

TTACCTCTTC GTATACTACA AAGATAATTA 
GACGGTACAA ACTCCCCGCC TATATTTGTA 
25 CTTCTTAAAT CTTCAAGTrG TTTATGATAA 

TTTOGAACTG TTGATAATTT TAATCCGTAT 
AATATGTTTA ATCTTCAAAG TGTCTCAATC 
TAACTTTTAC ATCACCTTTT AACTGTTCCG 
CTAGTAATAT AATTGCTATG ATTAACCACA 
AATCAGACTG ATCACTCAAC TTTGCGAAGT 

35 

TCATCATAAT ATATACTTGC TCAGTTACAT 
ATAAiAGTCT CAAAATTTCT TTAACCACCA 
TTGTATAGCT TGTTTTAATG CGTCTCTTTT 

40 

ATCAGCTGAC ATTAACTCAC TGTCATATTC 
TACTACTAAT AAGTTTTCAT CTACATCAAT 

^ TCCTTATCTT AACTTGTGCC TCGTATTTGC 
CAACTGTAAA CGTCTGATTA TCTCTAGCAG 
AATCTTTGAA TGTTGTGACT AAGTATTGCG 

SO AAGTTTTTCT ACGAATAAAA GTATTAGTAC 
CTTTGCAAAA TCTACTATAA CGATTAAGAC 

55 
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TCTTGAGGTT CTCCTCTAGC AATAGAAGCA 720 

AAACGCTCTT CTTGTATCTG TTCTAATCGT 780 

CTTAACAATC GACTTGCTAA AGACTCTGCT 840 

GCTTTTTTAC CATTACATCC ATTCATTATA 900 

TTCATTTCAT TTACCACCAA CTCTCXJCGCT 960 

CTATAATCAG CCAACCCACA TAGATCCTTT 1020 

CGATAGTCAA TATCCTACAC TATCATAATA 1080 

TACTGCCAAT TTCAGTCTTC CCCTAACTCT 1140 

ATTCTATGAG CAGTTCTATC AGAAATGTGT 1200 

TTATTAAAAT AATACTCTTG AATGAATTCG 1260 

CGTTCAATAG CGCTCTTAAA CTCAAGGATT 1320 

GTTACTGCCA TTTCTGTTTT CGATGTATTA 1380 

TCTGTTGGAA TCCACX5GTGT CATTATTTCA 1440 

TTAGGATAAT CACACAACTC ATCTTCTAAC 1500 

TTCTTTTTAG TCATGAATAC CCTCCGTACA 1560 

TACTTCTTAA TATCTCTATC TCTCGCTCTT 1620 

CTTGTAACAT CACACCAAAC AATAAGATGA 1680 

TCATCTACTC CGACACCTCC GCCCTCATCA 1740 

CACTTGGCXSC CTCTACATCA TCATTAGCCG 1800 

ACTTACX:TAA CTCATACATC GCTAGTAAGA 1860 

CTAAACACCC CATGTTAATT TATCGATAAT 1920 

TTCTTTGATA TCTCTATTAT CGCCATCTTC 1980 

ATATAATAGT TCTGATATTT CATTACTAGC 2040 

CGTTACCGTT TTCTTTGGCA TCTCCATCTC 2100 

GCTCAGCTTC TTCTTTACTC TCTGCCTCAA 2160 

TAGTAAAATG TTCATGTGGT TGTCCTGTTG 2220 

TCACTTCTTA TCACTCCTTT GAATGATTCT 2280 

AACACTCAAT GTAGCCAACA TATTTTTTTG 2340 

TAATAACATT CCAATTCTGC ATGTAAATAA 2400 
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ACATTTAAGT TAACCATCTC AGCTTTTCCG TTTTTATATC CACTAATAGT TGATCTTGAT 4320 

ACGCCAGTTT CATTGTGCAA ATCTTGGACA CTTACGTTAT CTCTAGCCAT GATTACCCTT 4380 

AAATTAGTTG CGAATACTtC GTTCAACTTC ATTTATTCCA CCTCTATATA TGCATGTCTT 4440 

ATTGTTATGT TGTCATACTT TAGTAATTCXS TCCGGATTGT CATCTAAGCG CTTTGCCAGC 4500 

GTATCTTTTT CTTTATCCAC ATCATCGTAA TGCTGATATT CAACTTCTGT AGGTATTCTT 4560 

ATATCAATCG TTGCGTTTAT ATATGCTTGT TGTTGCATTA GATCACTTCA TTTCTCTTTT 4620 

TCTTTTACGT CTGACTTTCA CTAAGTCCTC ATATACCATC CATTCTTGAC CTGTGTATTT 4680 

AGGCGCTTTA CATATCCACG TTAAATTCAC ATCTCTATAC TGATATCTGA ATATCTTCGC 4740 

TTTGATGTTG GCAACTTCAG TCGCCTTACC TTTAACGTCT ATAACTTCAA CCAGTTTCCC 4800 

rrCCTTCCAC AAAGAGAAAT CGGCTATATA C6TAATCGGT CTTTGTTTCC CGAATTTAGG 4860 

TTGTAATTCA AATTTCGGTT GTATTTCGAT ACGATCATAG TTAGTGCCAT TCATATTACT 4920 

TTCTAAATAT TGGTAATATT CGCACTCTAC TTTGCTATCA AATACAATTC CTTTGTACTC 4980 

AACTTTCTTA GCATTGTATT TACTCATTGT GCCACCTCTA AATATCAAAT ATCGTTGCTT 5040 

GCAATCCTAG CTCTTGCTCA TATAGAAGCC CGTGAGCGCC TTTGAATCGT TTTAGGTCAC 5100 

TATCAGTCAT AATTTTCTTT TCGTCGCTQA AATGGGCTCC TGTGAGCGAA TAAACTTCAT 5160 

TTACGTTGTC TTTATACTTG ATGACCTTAA TATCTTCCGT GCCATCTTCT CGGTATAAGT 5220 

AATATTTTTC TTTCGGCATT TTTTAACACT CCTTAATGTG TGTTTTCTTC CAGTTGATTT 5280 

CATTCATGAT TTTCTTTTCA ACTCTGTCGT AATCATCGAA AGGCGATAAC TCGTTATTGT 5340 

CCAACAATCT ATTGACCGCC CAACCAGTCT CGATATATAC ATTTGCTACA ATCGGGTCGC 5400 

TTTOCTTTGT CTCTTCATAC ATCGATTTCA ATAAGCTTTT GAATTGCATT ATATTCATGT 5460 

GAAA3LACCTC TGAGTCTTCT TGTAATACTC AAATTCAATT ATTCCGGTTT CGCCGTCTTT 5520 

GTTTrTGGCT ATGTTACATT CAACAATAGA TTTGCCAGTG ATACTGTCAT CTTCGTCACG 5580 

GTTATAATAA TCATCACGGT AAAGTAGCAT CGCTAAACTC GCATCTGCTT CTATTCCGCC 5640 

TGATTCTTTC ATGTCCGATA GCATTGGTCT TTTATCCTGT CTAGACTCGA CACCACGATT 5700 

CAGTTGTGAA AGTAGTACGA TGATTGCGCC TGTCTCGTTA GCGATTATCT TTAAGTCACG 5760 

TGATATCTTT TCTACTGCTA CACGTCTATC AACTTTCGCA TCAGTATCCA TCAGTTGAAG 5820 

ATAATCTATA AAAATAACTT GTTGCCTGTC TGAATGCCTC ATTGtTGCGC TCGCACATCT 5880 

TGCGGTGTGA TATTACTTTT ATCAGAAATA TCGATGCCTA ATTTCATGAT TTTATCCATC 5940 

GCATTCGTTA ACTTTGTTAA GTCATCCGGC GTTAAGTTCC TGATTTCTTT TATCTTTGTT 6000 

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AGACTAAAGA AAGATGTTTT GTATCCATTT TGTGCTATGT TCAGCATCAT GTTTAATGCA 6120 

AAACCTGTCT TACCCACTGA GGGACGCGCT GCGATGACGA TTAATTGTGA TGGTTCTAAT 6180 

CCCCCTATTT TGTAATCCAT TAGCTTGTAA CCCGTCTTAA TTTGCTTCTT AGGGCTATCG 6240 

CTGTATAACT CTTCGACAAA CTCCTCAACA AACTTCTTGG TTCCATCTTC TTnTTGTTA 6300 

GTAATTGTTT TTAAATCCTT GAGTTCATCA ATCAAGTTGT TAAAGTTTTG GTTCGTAGGT 6360 

TGTTGTTTGA ACTCAGTTAC CAATTCGTTA GCTTTGTTGA GCTGATAACT TTCCAATAAT 6420 

TCTTGTTGAT AACGTTCAAA GAAGCCATAT CCAATGAAAT CXX3AGTTGTA AAGTTTAGTT 6480 

ATAGTATCTG CATCTAAAAA TTCTTTATCT TTAGTTGCTT TTAAATAGAT TTCTTGATGA 6540 

TCTATCTTTC CQACGTCCAT TACATAATTG AAAAAGGTTT TAAACTTTTC GTTCGTAAAC 6600 

ATGTAATCTT TAACTCTTAT CTTTTCTAAT ACGTCCGGTT GTTTAAGTAG CGTAGCGATT 6660 

ATTGTACTTT CAATTTCGAA TTGTCCGTAA TTCATTCXSTT TTCGCCCCCA AATTCTGCCA 6720 

ACTTATTCAT GAACTTATCT AGCGCTATTT TTCTTTGTCT GACATATTOG GGGTCATTCT 6780 

GCATTTTCCA TTGGTGTGTA GCGGTTTCGT TATCTACTGG CTCGATAGAT ACTTTTTTAO 6840 

GTTCqTTACG CATGATTGCT GGTAAGTTAG GCGGGTACGG GTTGTTACTG TTGATATAAA 6900 

CATCTACCGC TTTTACAGTT GGTTGATAAT CTCCATTTTQ ACTTAATACA TCAATCCACA 6960 

TTTCTAACTT OGGTTTATCA AAATCAATGT TGTATACGTA CCTAACTTTT TTAATAATTT 7020 

CTAATGCTTG TGTTTTGCTC ATCGGCAITA GTCATCACTC AATTCTTTTT CCATTTGTGC 7080 

AATGACATCA TCAGTAGTAT TTTTTCTAGG TGCTATTTTA TTTTCTGCAT CTTCnTTGT 7140 

TTTGACATTC TCTTTAGCCC AGTTGTTTAA AACTTTAATT AAATAGCCAC CATGCGCACT 7200 

TTTGCTTTTA GTGTACTCAA CACCTACTTT TACAACTTCA AAAGOGTTTG TACCTATATC 7260 

ATCAATAGCA AACCCTAATT GTTCCATTTG ATTAGGTGTT AACTTATCAT CCAAATTTGC 7320 

AATTATATAT TTTATTGAAG ATGAGAAGAC GGCTTCTCTT TCTTCTTCTT TATTCTTATA 7380 

TTCTTCTTCT TTTTCTTCTT CTCTTTCTTC TTCTTCTTCT GTATCGTTAC GTAACGTTAC 7440 

GGTAACGTTA CGTTTTGCTT CTAGTAACTT TTTCTGTTTC TCACGATAGC GTTGTTGTCX3 7500 

CAATTTATTT TTTTCTTTAT GCTTAGCTTT GCTATCTAAG CTTTGATGCT TCTCCCAGTT 7560 

TGTCACTTTT ATGACACCAT TAACTTTTTC AATCATGCCC AATGTCTCAA AAGTTTGAAT 7620 

TGCTAACCTT ATTGAGTTAA TAGGTCTATT AAATTCATTT GCTAACATTT CTTCGTTGTA 7680 

CGGCAAGTTT TCGGATAGCA TAATATAACC TTGTTCATTG TACTTTCCTG ATAAAGTTAG 7740 

TAACTTAACC CAAATAGTTA TGATCGTATC TCTTTCGGGT AAAGCTTCGA TATATTTGAT 7800 

EP 0 786 519 A2 

CTCCTTTCAG CATTTTGTTG AGCCTCTCAT CAACTTTTAT CCACGAGTCA TGCAAGTGAT 7920 

ATTTATCATC AAACGACTTA ACGCCAATTG CGTGCTGTTC ATTATGATGT TGTCTACACA 7980 

GTGCTAACAC ATGTTTGTCG TAGTGATTCA TTTTGTTTCT GTTCATGCCT CTGCCGACTG 8040 

CTTCATAATG TGCCAGGTCT GCGTGAGGCT TTCCGCATAT TACACAGTTG CGGTTGATTG 8100 

TAGCCCAATA TAATAACGCT TTATCTTCGC TTAACAACTT ACTCGTTTCT ACACTCATAG 8160 

GTATTTGATG ATGAAACATA AACX3CTATAA TCAGTTCTAT TAACTCCCTT GCAACTTTCA 8220 

TAGAACAGTC GCGCAGACTG ATTTCTTCAT AACCTTTCAT AATTTCCAAT TCTGTTTGTA 8280 

ATAATTTTCT AGTTGATTCT ACTGGTTCGC CCCAGTGAAG TTCTATATCT CTACACATTG 8340 

CGAATATTTT TTTGCX5TTGT TCTATAGATA GTTTTTTATT GTCCGGAACC TCTACTTCTG 8400 

CTTTTAGTGG ATATCCGTTT TCTAGTAAGT CAATGTGACT TTGTTCAAGT TCAACACCAG 8460 

TAGCAACGAC GGAATAAGTA CCGTCATTGT CTTTCTGGTA TCTTGTAATG TATTGCATTT 8520 

AAACCACGTC CTAGAACGGT AAATCATCAT CATT6ATTTC TATTGGACCA TTAGCATTAG 8580 

CGAATGGGTT TGATTGTTGA CTCATTGGCG TCTGTTTCCC ATTTGCTTGC TGTTCTTTTT 8640 

GTTTCATCTC ATCAGTTTTA GGTTCTGGTT TATTAACTAC TTCATCGTCT TTATTCCAAA 8700 

CTTTTACATA TGAGAGTCTT ACAAAATACT TGCCTTGTTC CTCGTTAAAT TTATTTTTAA 8760 

GTACAATAGT TCCGATTTTG TTAATTAATT GATCTGTGTC AAAAGTTAAA TCTGGTAAGT 8820 

TCAATTTAAT TCCTAATCTA CTAAGTAACT CGATATATTG TTTTTCTTGA TAATCTTGTT 8880 

GGAATGGTGG GACGAATTGG TTGTGTTTGT ATTGTTTACC TTCGTTGTTT TCAAAAACAA 8940 

TCGTGAAGTA TCTGTTTTCT CTGTCGTTAA ACTCGACATT TGCAACTTTT ACTGTAAATT 9000 

CTCCAGCTCC TAAAAAGTCC CCACCTTTCA TGAATGCCTC TTGATTAGTT TCTTGAATGT 9060 

ATTGTGTTCT ACCAGTGATT TTCATAATTT TTATACCGTC CTTTTAATTA ATTTTTAATT 9120 

ACCATTTCTA ATTGCTTGTA CAACATCGTT AATACTTGGA TTAATGAAAC GTTTGTTGTT 9180 

AATTTTGATG TTGCTTGAGT GTCTTATCTT TGTCTCGAAT AAATTTGATG GTTCAGCGTT 9240 

AAGTACATAT TGATAAGTTT TTTCGCCGTC TTGCTCATGT TCTTCTATTG TCATTCTTGC 9300 

TAACACGTCA GATTGACTGA TGACTGCTTT TTTTATTTGG TCTTGTGCCT CTATCGTGAT 9360 

TGTTGGATTG ATAGTACTTC CCTCATCATC TTTGTCTTTG TTAATGCCCT CGTGTCCGCT 9420 

TATAGCAAGA TGAAATTGAT AATGTTCTTG TAATTTAGAA ATATAACGAT AAATACTTAC 9480 

AATGCGTGTA GCACACTCGC CCCAATCATT AAATGTCGGT TTCTTTGATT TACCGTCCAT 9540 

GATGTCGTCC ATAGTGATAT CACCTAACTT TTGGATTGTT TCAATCACTA CAACATCAAT 9600 



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AAAATGCTTA TAATTCTTAA TCTGCACAAC TGCCCCATCT TCTGTTACCG TTGTTCCGTC 9720 

CTCATTTATA TCTAGTACTA AGGCATTGTT ATCTTTTGTT AAAAACGTAG TTTTACCAGT 9780 

ACCGAACTTG CCGTATATCG CAAATTTATA AAACTTGTTT GCATTTTGTT TGCTGATGTC 9840 

TTTTACACCT AGTTGCGTTA AAATATCGAC ATCTTGATTA GTTTTTTCAG TCATCTATTC 9900 

TCCCACCTTT ACCGTGTATG ACGTTGGTTT CTCCACAATG CTAGCACCCT CTAAAACTTC 9960 

GCCX3TTTGCG TCAATCAATG TGCCGTTTTC AGTTACATTG AAATCTTTCT TAATGTCTGA 10020 

TTGGCTAAGT TTTTTAGTTA CTTTTACATA GTTGTCAAAA CCTCGTTGCT CAAGTTGTnT 10080 

AATGACTTCT TGCTCATTGC TAACTTGAAT GACTTTTGAA CCTTTTCTGG CTGTCACTTT 10140 

TCCGTAAGtG TATTCAACTT GAATTTGCTA TCTTGTTCTT TTTGTATTCT GTAATATTCA 10200 

ATTACAAGGC TTTGTAAATA TTCTTTGCCA CTCTGTAATT TTTCTACTTC TTTATCTTTC 10260 

CATTCGTTTA TGCGTTCAAT TTCTTTATTT GCTAAATCGT TGATTTCATT CTCTTTAGTT 10320 

GTGATTGCAT CCAGTTTCTn AAAAACCCAG TTAGCACTGT CTAGATCAGT nACTTTGAAT 10380 

CGGTCGTCTT GTTCGAATGT n 10401 
(2) INFORMATION FOR SEQ ID NO: 150: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2989 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150: 

TTTCTCTCTA TTATTCTCGA TGCGTAGATA ATTGTTTAAA TTTAAGTTTA TAGTAATGTT 60 

GAGTHTATAA TTTCATATAT CTAAAAACAG GTGTTGTATA TATAATCATT CATCTAGTTA 120 

TACTTACTTT AAAAATAATA TAATTTCATG CGATGCAATT CATTGATGGA TGTTTTTAAT 180 

CTTAATCAAA TCCAaATAAA GCATATATTT TTAAATTCAC TTTCTTTCGA ATCGATTTTT 240 

ATCTCTTGnA TTAAACTTTT CCATTGTTTC ATTAAAGCTC TCTGTCATAT CTATTCCCAT 300 

TGAATTCGCT AAACATAACA ACACAAATAA ATTATCACCT AATTCTGCTT TAATCGTATT 360 

TGCTTCCTCT GAATCTTTCT TCTTTTTTTC ACCATAGGTA TGATTTATTT CACGTGCAAG 420 

TTCGCCCACT TCTTCAGTCA ATCTAGCTAA GTTAGCTAAT GGTGAAAAAT ATCCTGTTTT 480 

AAATTGTCCA ATATATTCAT CAACTTCACG TTGCATTTCT ACCATTGATT TCATTTCTAC 540 

GTTCTCCTTA TATTGCATTT CTAATATAGT ATATATCAAT TT6AAGTCTC ATGCATGTTT 600 
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AATTCAGTTT ATATAAATGT AATGCATTCC 


TAACTAAATT AAATCAATTG AAATTGGGAT 720 
TATAACTTTA TGATACGTAC CACTACAATA AAATAATATA GTGAATAATC TACCATTAGA 780 
AAAATAAGCA CAAAAAAACT AGCAACCACA CAAAAATGTG ATTAGCTAGT TAATAAGTGT 840 
CTAATTTAAG TTAATTGTTA ATCTATAAGA TTAATCACTT GAACGCGCAA TCAAAATAAT 900 
ACGTACAAGC TCTGCTACAG CGACTGCAGT TGCTGCAACA TAAGTCATTG CTGCTGCAGA 960 
TAATACTTTA CGCGCATGCT TGTATTCTTT TTCATTTACA ATGTTCAATG CCGTAATTTG 1020 
TTTCATCGCT CTTGAACTCG CATCAAACTC AACTGGTAAC GTAACAATTG AGAATAATAC 1080 
CGCTAATGAC ATTAAACCAG CACCAATCCA TAAAGCAGTT GAACCaAATG CACTACCTAT 1140 
CGCTGTTAAG ATAATACCTA ACATGATGAT CATATAACTT AATGAACTCC CTAGGTTTGC 1200 
AACAGGTACT AATGCTGCTC TGAATCTTAA GAACCAATAT CCTTGGTGAT CTTGAATGGC 1260 
ATGACCAACT TCGTGGGCTG CAATTGCAGT TCCAGCAACT OATGGTCTGT CATAGTT7GC 1320 
AGGAGATAGT GAAACAACTT TCTTTTTAGG ATCGTAATGA TCTGTTAAGA ATCCTTCACC 1380 
TTTAACAACT TCGACATCAT AAATACCGTT TGCATGTAAA ATTTCTAATG CAACTTCACG 1440 
ACCCGTTTTA CCACTAGTTG ATCTAACTTG TGAATATTTC TCATAGTTAG ATTTAACTTT 1500 
GTGTTGTGCC CATAAAGGAA GCACCATTAA TATTACGAAA TAAATTATCA TAGTAAAAAT 1560 
TGAAGACAAT AAACTCACTC TCCTTTATAA ATATTTTACT GTCATTTGCC GTTTTTATCA 1620 
AATCATTTAC ACTTTAATAA TTTGTTTAAT TCAATATAAA GCAAAAGTCC AAAAACACTT 1680 
AGACAACATG ATAATACACC AATTTGCCAC ACATGTGTAG TTATAAAATC ATAATATGGA 1740 
AATTGAAGGT GAAAATAGTC AATATAATCA TTCAAAAACA CCCAAATCAT yGCTACACTG 1800 
ATTCCAATCA TAGAACGTTT AAACCTAGGA TAGAAGTAAA TTGCCTGAAC AGCCATTATA 1860 
CTGTCGGAAA ACATTAATAC CAAACCATTT ACTGTAATAT CACCTTGTTC AATAATAAAT 1920 
AATATATTCA TTATAACTGC CCAAATCCCA TATTTGAATA ATGTTACAAA TGCCAGTGCA 1980 
TCGATAATAC TATTTTGTTT TTGAATTAAT ATCAATGAGA TAGAAATAAC TAAGTATAAT 2040 
ATTGCAGTTG GGCTATCTGG AACAAAAATC TTAAAATGCC AGGGCGTATG ACTTAATTGT 2100 
TCACCATACC ATATATAACC ATAAATCATC CCTAATATAT TACAAATGAG TAGCATCATT 2160 
AACCAAGAAC GTTGATAAAG TGTATATTGC CAAAATGCTT TAATTGTCAT CTGCTAAGTC 2220 
CTCAAATTGA TTATGTTTAT TTACTAGCTT GAGTGTATTT AAAATTTGCG TTAGTTGATA 2280 
AAAACGTTGC TTTTCATTCA TCTGTAAACT TAAATCAATA TTGTGTAACA AGTAATCTAT 2340 
TAATAACGCA TGTTTATGCC GATCTATAGC CATACTATTT AAGTCATGAA GATAAGTTTG 2400 
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TGACACGTTT GCGAAGTGAA TTTGAATATC AAAAGCACAG TTATGATTAG CGATATAATC 2520 

AAATATTTCA TTTGTATTCA TTAACTTTAT ATTACGCTTA GTAAATTGAA TTGCAGAAGC 2580 

GTGACTTCCC ACTTCTGCAA TTTCTAATGT TTCATGATGA TTAATTTTTG TATCTACAAA 2640 

ATGAATGTTT GCCAATTTCG CCTCATTCAC TTTTATATAG TTAAGCACCC AAACTGCAAT 2700 

ACGCGACTTA AATCGATATT GAAAAAGTAA ATATTCAATA AAACTTTCTT TAATTTGATT 2760 

GAGTGTCTCT GACATCAAAT ACCCCATTTT AAGATTGCAA TCTTGaTAAT TCGTCATGCC 2820 

AATTTTCGTT ACTTGGcTCT AGTTCCAACA ATTGATTTAA AATAGTAATT GCTTGTTCCT 2880 

TTTGACCAAT TTCAATTAAA TAGAAATAAT AATCACTCAT AAAATCAATA TTTGTTTTCA 2940 

TCGTTGGATA TGCTAATTCA AAGAAATGTT QAGCTTCTTT ATCTCGCTC 2989 
(2) INFORMATION FOR SEQ ID NO: 151: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1143 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151: 



CATCAACTCC TTAATTACAC TGTAAATGAT ATGOGTCTTT TTGACAACTA TATTTGTCAA 60 
ATCTACACCA AAAAATATGA TTATCCACCT ATGTATGACA TTTTGAAACA AACACCTCAA 120 
CGCCTACAAG TCATAATTGT TTACTTTCGT TACACCTTCC TGCATAATTA ACAGCATTCT 180 
AATTTTAGTA TGATGCACGC ATTTTCACTA AATCAAACCA TTCAAAGGAG ACTATTATGG 240 
CATTTACATT ATCTGCAATT CAACAAGCAC ATCAACAATT TACTGGTGTT GACTTTCCAA 300 
AACTATTCAA AGCTTTTAAA GATATGGGGA TGACTTACAA TATCGTCAAC ATTCAAGATG 360 
GCACTGCAAC ATACGTACAT CAATCAGAAG ATGATATCGT TACGTCATCT GTAAAAAGTA 420 
ATCATCCTGT TGCTCAAAAA TCAAACAAAA CAATAGTTCA AGACGTCTTA ACTAGACATC 480 
AACAAGGGCA AACAGATTTT GAAACATTTT GTGATGAAAT GGCTGAAGCT GGCATTTATA 540 
AATGGCATAT CGATATTCmA GCGGGCACTT GTACTTATAT CGACTTGCAA GACCAAGCTG 600 
TTATTTCAGA ATTAATCCCT CAATAAACTA TATTTATAGC AACATTTTAA TTATTTCATA 660 
AAATTTTATT GATAATCATT ATCGTTCGGT ATAAAGTAAA TACTATATAC TACTTATGAG 720 
TGAGGTTGAT TATCATGATA ACTAACACTT TTATTTTAGG CATCACAGGC CCAACAAGTC 780 
TTGTCGTCAT TAGCATTATC GCTTTAATTA TTTTTGGTCC GAAAAAATTA eCACAATTTG 840 
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AGTCTCACX^ TACACCCAGT AA6GAATCGA AACAACAG06 AGAGCAATAG CACTGACCAC 


960 




ACCTTACTGG TTCACTTTAG CGAACTACGC CATCGGTTAG TAAAAATTTT ATT6TCGTTC 1020 
GTCATTACGG TCATCGTCGT ATATGTyTCA TCATTTTGGT GGATGACACC ATTCATAACG 1080 
TATATyACCC GgCACATGTG TcCTTACATG CATTTcATTC ACAGAAATGA TACAAATAAC 1140 
GTG 1143 


(2) INFORMATION FOR SEQ ID NO: 152: 




(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7953 base pairs 

(B) TYPE: nucleic acid 

(C) STRAMDEDNESS: dovible 

(D) TOPOLOGY: linear 




20 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152: 






CAACGCCTGA ACGTAAACCA TATCGTTTCG CGATTTCCTC ATCTTGACTA TTTACTAAAA 60

ACTCTCTCAT GGCGATTAAT GTTTCTTTTT CTTCTTTAGT TAATGGTAAT TCTAACTCAG 120

CTGCTTTTTG ACGCAAAGTT GGATGACCAT CTCTAATGAT GTCTTTCATT GTTAACATAT 180

ATTGCACCTT CCTTATTTTA ATTTGTTTTA GTTGAATGAC AGTAAAAAGG TTGTTAAGAT 240

ACTCATACAT TTTTATGTGT AAATATCTAC AAAGTTAACC AACTACTGCC AATGTTTATT 300

TTAGATAGTA TATGTAAATT TTCAaGAtAT GCgTAATTGC gTTAAAAAAT GaTTAAAGTG 360

TTGGTTTCAA GCAATGaTAC TTTAGAAATT TATTTATCAT CTTGACTTTA AAAATTATAT 420

TATAAATGAC GTAACTGTCA ACAGATATAC TTAGTArTGA AGATGTGTAA TGTAATTGTT 480

TAAAATTGAT TTCCAAGCAG ATTTTATTTA TCATTTAATT TAAATAGCAA GTGGAGGTAC 540

AAC^ATGAA ATTTGGAAAA ACAATCGCAG TAGTATTAGC ATCTAGTGTC TTQCTTGCAG 600

GATGTACTAC GGATAAAAAA GAAATTAAGG CATATTTAAA GCAAGTGGAT AAAATTAAAG 660

ATGATGAAGA ACCAATTAAA ACTGTTGGTA AGAAAATTGC TGAATTAGAT GAGAAAAAGA 720

AAAAATTAAC TGAAGATGTC AATAGTAAAG ATACAGCAGT TCGCGGTAAA GCAGTAAAGG 780

ATTTAATTAA AAATGCCGAT GATCGTCTAA AGGAATTTGA AAAAGAAGAA GACGCAATTA 840

AGAAGTCTGA ACAAGACTTT AAGAAAGCAA AAAGTCACGT TGATAACATT GATAATGATG 900

TTAAACGTAA AGAAGTAAAA CAATTAGATG ATGTATTAAA AGAAAAATAT AAGTTACACA 960

GTQATTACGC GAAAGCATaT AAAAAGGCTG TAAACTCAGA GAAAACATTA TTTAAATATT 1020

TAAATCAAAA TGACGCGACA CAACAAGGTO TTAACGAAAA ATCAwAAGCA ATAGAACAGA 1080
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AAGAAAAGCA AGACGTTGAT CAATTTAAAT AATTAATATA ATACAGATGG TAGGAAACAA 1200 

CTAATACAGT TCCTATTATC TGTATCTTTT TTTATTAAAA CAGAACTTTT TCAAATX3GTT 1260 

TAACAGTCCC ATTTATTTGT GGTACAATTA GTAAGGATAA AATGAATTTC TATACAATTA 1320

TGGGAAAGGT ATTGTGAATT GAATGGCTCC TAAGTTACAA GCCCAATTCG ATGCAGTAAA 1380

AGTTTTAAAT GATACTCAAT CGAAATTTGA AATGGTTCAA ATTTTGGATG AGAATGGTAA 1440 

CGTCGTAAAT GAAGACTTAG TACCTGATCT TACGGATGAA CAATTAGTGG AATTAATGGA 1500 

AAGAATGGTA TGGACTCGTA TCCTTGATCA ACGTTCTATC TCATTAAACA GACAAGGACG 1560 

TTTAGGTTTC TATGCACCAA CTGCTGGTCA AGAAGCATCA CAA7TAGCGX CACAATACX3C 1620 

TTTAGAAAAA GAAGATTACA TTTTACCGGG ATACAGAGAT GTTCCTCAAA TTATTTGGCA 1680 

TGGTTTACCA TTAACTGAAG CTTTCTTATT CTCAAGAGGT CACTTCAAAG GAAATCAATT 1740 

CCCTGAAGGC GTTAATGCAT TAAGCCCACA AATTATTATC GGTGCACAAT ACATTCAAGC 1800

TGCTGGTGTT GCATTTGCAC TTAAAAAACG TGGTAAAAAT GCAGTTGCAA TCACTTACAC 1860 

TGGTGACX3GT GGTTCTTCAC AAGGTGATTT CTACGAaGGT ATTAACTTTG CAGCAGCTTA 1920 

TAAAGCACCT GCAATTTTC6 TTATTCAAAA CAATAACTAT GCAATTTCAA CACCAAGAAG 1980

CAAGCAAACT GCTGCTGAAA CATTAGCTCA AAAAGCAATT GCTGTAGGTA TTCCTGGTAT 2040 

CCAAGTTGAT GGTATGGATG CGTTAgcTGT nATATCAAGC AACTAAAGAA GCACGTGACC 2100 

GCGCAgTTGC AGGTGAAGGT CCAACATTAA TTGAAACTAT GACATATCX5T TATGGTCCTC 2160 

ATACAATGGC TGGTGACGAT CCAACTCGTT ACAGAACTTC AGACGAAGAT GCTGAATGGG 2220 

AGAAAAAAGA CCCATTAGTA CGTTTCCGTA AATTCCTTGA AAACAAAGGT TTATGGAATG 2280 

AAGACAAAGA AAATGAAGTT ATTGAACGTG CAAAAGCTGA TATTAAAGCA GCAATTAAAG 2340 

AGGCTGATAA CACTGAAAAA CAAACTGTTA CTTCTCTAAT GGAAATTATG TATGAAGATA 2400 

TGCCTCAAAA CTTAGCAGAA CAATATGAAA TTTACAAAGA GAAGGAGTCG AAGTAAGCCA 2460 

TGGCACAAAT GACAATGGTT CAAGCGATTA ATGATGCGCT TAAAACTGAA CTTAAAAATG 2520 

ACCAAGATGT TTTAATTTTT GGTGAAGACG TTGGTGTTAA CGGCGGTGTT TTCCGTGTTA 2580 

CTGAAGGACT ACAAAAAGAA TTTGGTGAAG ATAGAGTATT CGATACACCT TTAGCTGAAT 2640

CAGGTATTGG TGGTTTAGCG ATGGGTCTTG CAGTTGAAGG ATTCCGTCCG GTTATGGAAG 2700 

TACAATTCTT AGGTTTCGTA TTCGAAGTAT TTGATGCGAT TGCTGGACAA ATTGCACGTA 2760 

CTCX5TTTCCG TTCAGGCGGT ACTAAAACTG CACCTGTAAC AATTCGTAGC CCATTTGGTG 2820 

GTGGCGTACA CACACCAGAA TTACACGCAG ATAACTTAGA AGGTATTTTA GCTCAATCTC 2880 
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10 



CTATTAGAAG TAATGACCCA GTCGTATACT TAGAGCATAT GAAATTGTAT CGTTCATTCC 3000 

GTGAAGAAGT ACCTGAAGAA GAATATACAA TTGACATTGG TAAGGCTAAT GTGAAAAAAG 3060 

AAGGTAATGA CATTTCAATC ATCACATACX3 GTGCAATGGT TCAAGAATCA ATGAAAGCTG 3120 

CAGAAGAACT TGAAAAAGAT GGTTATTCTG TTGAAGTAAT TGACTTACX3T ACTGTTCAAC 3180 

CAATCGATGT TGACACAATT GTAGCTTCAG TTGAAAAAAC TGGTCGTGCA GTTGTAGTTC 3240 

AAGAAGCACA ACGTCAAGCT GGTGTTGGTG CAGCAGTTGT AGCTGAATTA AGTGAACGTG 3300 

CAATCCTTTC ATTAGAAGCA CCTATTGGAA GAGTTGCAGC AGCAGATACA ATTTATCCAT 3360 

TCACTCAAGC TGAAAATGTT TGGTTACCAA ACAAAAATGA CATCATCGAA AAAGCAAAAG 3420 

AAACTTTAGA ATTTTAATAC ATTTTAAAAG TTAACGAAGT TAGCGTATTT TAGTCTCATT 3480

GATTAAAATG AAATGTTTAA TTTACGAAAT CTTAGGAGGG CAAAAACGTG GCATTTGAAT 3540 

TTAGATTACC CGATATCGGG GAAGGTATCC ACGAAGGTGA AAITGTAAAA TGGTTTGTTA 3600 

AAQCTGGAGA TACTATTGAA GAAGACGATG TTTTAGCTGA GGTACAAAAC GATAAATCAG 3660 

TAGTAGAAAT CCCATCACCA GCATCTGGTA CTGTAGAAGA AGTTATGGTA GAAGAAGGTA 3720 

CAGTAGCTGT AGTTGGTGAC GTTATTGTTA AAATCGATGC ACCTGATGCA GAAGATATGC 3780

AATTTAAAGG TCATGATGAT GATTCATCAT CTAAAGAAGA ACCTGCGAAA GAGGAAGCGC 3840 

CAgcAGaGCA AGCACCTGTA GCTACTCAAA GTGAAGAAGT AGATGAAAAC AGAACTGTTA 3900 

AAGCAATGCC TTCAGTACGT AAATACGCAC GTGAAAAAGG TGTTAACATT AAAGCAGTTT 3960

CTGGATCTGG TAAAAATGGT CGTATTACAA AAGAAGATGT AGATGCATAC TTAAATGGTG 4020 

GTGCACCAAC AGCTTCAAAT GAATCAGCTG CTTCAGCTAC AAGTGAAGAA GTTGCTGAAA 4080 

CTCCTGCAGC ACCTGCAGCA GTAACATTAG AAGGCGACTT CCCAGAAACA ACTGAAAAAA 4140 

TCCCTGCTAT GCGTAGAGCA ATTGCGAAAG CAATGGTTAA CTCTAAGCAT ACTGCACCTC 4200 

ATGTAACATT AATGGATGAA ATTGATGTTC AAGCATTATG GGATCACCX5T AAGAAATTTA 4260 

AAGAAATCGC AGCTGAACAA GGTACTAAGT TAACATTCTT ACCTTATGTT GTTAAAGCAC 4320 

TTGTTTCTGC ATTGAAAAAA TACCCAGCAC TTAACACTTC ATTCAATGAA GAAGCTGGTG 4380 

AAATCGTTCA TAAACATTAC TGGAATATCG GTATTGCAGC AGACACTGAT AGAGGATTAT 4440

TAGTACCTGT TGTTAAACAT GCTGATCGTA AGTCTATTTT CCAAATTTCA GATGAAATTA 4500 

ATGAATTAGC TGTTAAAGCA CGTGATGGTA AATTAACAGC CGATGAAATG AAAGGTGCTA 4560 

CATGCACAAT CAGTAATATC GGTTCAGCTG GTGGACAATG GTTCACTCCA GTTATCAATC 4620

ACCCAGAAGT AGCAATCTTA GGAATTGGCC GTATTGCTCA AAAACCTATC GTTAAAGATG 4680 



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ATGGTGCAAC TGGCCAAAAT GCAATGAATC ACATTAAACG TTTATTAAAT AATCCAGAAT 4800 

TATTATTAAT GGAGGGGTAA AACATGGTAG TTGGAGATTT CCCAATTGAA ACAGATACTA 4860 

TAGTAATCC5G AGCAGGTCCT GGTGGATACG TTGCAGCAAT TCGTGCAGCT CAATTAGGAC 4920

AAAAAGTAAC AATCGTTGAG AAAGGTAATC TTGGTGGTGT TTGCTTAAAC GTAGGATGTA 4980 

TTCCTTCAAA AGCATTACTA CATGCTTCTC ACCGTTTTGT TGAAGCACAA CATTCTGAAA 5040 

ACTTAGGTGT TATTGCTGAA AGTGTTTCTT TAAACTTCCA AAAAGTTCAA GAATTCAAAT 5100 

CATCAGTTGT TAATAAATTA ACTGGTGGTG TTGAAA6CTT ACTTAAAGGT AACAAAGTTA 5160 

ACATOGTTAA AGGTGAAGCA TATTTCX5TAG ATAACAATAG CTTACGTGTT ATGGACX^AA 5220 

AGAGCGCACA AACATACAAC TTTAAAAATG CAATCATTGC AACAGGTTCA AGACCAATTG 5280 

AAATTCCTAA TTTCAAATTC GGTAAACGTG TTATCGACTC AACAG6TGCT TTAAACTTAC 5340 

AAGAAGTACC aGGTAAATTA GTTGTAGTTG GTGGAGGATA CATTGGATCA GAATTAGGTA 5400

CAGCATTTGC TAACTTTQGT TCAGAAGTAA CCATCCTTGA AGGTGCTAAA GATATCTTAG 5460 

GTGGCTTCGA AAAACAAATG ACACAACCTG TTAAAAAAGG TATGAAAGAA AAAGGTGTTG 5520 

AAATCGTTAC TGAAGCTATG GCTAAATCAG CTGAAGAAAC AGATAACGQA GTTAAAGTTA 5580

CTTATGAAGC TAAAGGCGAA GAGAAAACAA TCGAAGCTGA TTATGTATTA GTAACTGTAG 5640 

GTCX3TCGTCC AAACACAGAC GAATTAGGCC TAOAAGAATT AGGTGTTAAA TTCGCTGACC 5700 

GTGGATTATT AGAAGTTGAT AAACAAAGCC GTACGTCTAT CAGCAATATC TATGCAATTG 5760 

GTGATATCGT TCCAGGTTTA CCACTTGCTC ACAAAGCTAG CTATGAAGCT AAAGTTGCTG 5820 

CTGAAGCAAT TGATGGTCAA GCTGCTGAAG TTGATTACAT TGGTA7GCCA GCAGTATGCT 5880 

TTACTGAACC AGAATTAGCT ACAGTTGGTT ATTCAGAAGC GCAAGCTAAA GAAGAAGGTT 5940 

TAGCAATTAA AGCTTCTAAA TTCCCATATG CAGCAAATGG TCGTGCATTA TCATTAGATG 6000 

ATACTAACGG ATTTGTTAAA CTTATTACAC TTAAAGAAGA TGATACTTTA ATCGGTGCTC 6060 

AAGTAGTTGG TACTGGTGCA TCAGATATTA TCTCTGAATT AGGTTTAGCA ATTGAAGCTG 6120 

GTATGAATGC TGAAGATATC GCATTAACAA TCCATGCACA TCCAACATTA GGTGAGATGA 6180 

CTATGGAAGC AGCAGAAAAA GCTATCGGAT ACCCAATCCA TACAATGTAA TAACTGATTA 6240

TCTATAAAGA TTCAGTCATT AAAAGCTGTA GCATATGCTA CGGCTTTTTT GTTTTAGGTA 6300 

AAGTAATGTA AGGAAATTGA TTTGAGATAT CGTTAACATG TGACATGCAT GTTATACTAG 6360 

CGATGCTAAT AAAAGAATTG AAATGGAGGG TTCAACAATG GAATATGAGT ATCCAATTGA 6420 

TTTAGACTGG AGTAATGAAG AGATGATTTC AGTGATAAAT TTCTTTAATC ATGTAGAGAA 6480 
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5 



AATTGTGCCT GCTAAAGCAG AGGAAAAACA AATTTTTAAT ACTTTCGAAA AAAGTAGTGG 6600

CTATAATAGT TACAAAGCAG TTCAAGATGT AAAAACTCAC TCTGAAGAAC AAAGAGTAAC 6660

AGCTAAAnAA TAATTCGTTC GAAATTAACA CAATTTAATA GGAATTTTTC TTTAAAACTA 6720

TTGCTAATAA AGCTATATTT TGATACCTTT ATCAAGTGTT AAACAAAATG TTTGATAAAA 6780

GTAAACTTAA TATAGCTTTT TTAGGTGGAA AAATAAATGA ACATAGGTAA TAAAATTAAA 6840

AATCTTAGAA GAATTAAAAA TTTAACGCAA GAAGAACTTG CTGAACGTAC AGACTTATCG 6900

AAAGGCTACA TTTCACAAAT AGAAAGTGAA CATGCCTCAC CAAGTATGGA AACTTTCTTA 6960

AATATTATAG AGGTGTTAGG AACGACGCCA AGTGAATTTT TTAAAGACAG TGAAAATGAA 7020

AAAGTATTAT ACAAGAAGGA AGAACAAGTT ATTTATGATG AGTATGATGA AGGTTATATA 7080

TTAAATTGGT TAGTTTCAAA GTCAAATGAA TATGATATGG AGCCATTAAT ATTAACTTTA 7140

AAGCCTGGAG CATCATATAA AAATTTTAAT CCATCAGAGT CTGATACGTT TATTTATTGT 7200

ATGTCAGGTC A6ATAACACT TAATTTAGGC AAA6AGATAT ATCAAGCACA AGAAGAAGAC 7260

GTTTTGTATT TTAAAGCACG AGATAATCAT CGTTT6TCAA ACGAATCAAA CAATGAAACA 7320

CGAATACTTA TTGTAGCGAC AGCTTCATAT TTATAGGGGG GATCTTATTT GGAACCGTTA 7380

TTATCATTAA AATCAGTTAG TAAAAGCTAT GATGATCTTA ATATCTTAGA TGACATAGAT 7440

ATTGATATTG AATCAGGATA CTTTTATACA TTATTAGGTC CTTCAGGTTG TGGTAAAACA 7500

ACAATTTTAA AATTAATTGC AGGGTTTGAA TATCCTGACA GTGGTGAAGT GATTTATCAA 7560

AACAAACCAA TTGGTAATTT ACCACCAAAT AAACGTAAAG TGAATACAGT CTTTCAAGAT 7620

TATGCATTAT TTCCACACTT AAACGTCTAT GATAATATCG CTTTTGGTTT GAAATTAAAA 7680

AAATTATCAA AAACCGAAAT TGATCAAAAA GTAACTGAGG CATTAAAATT AGTAAAACTT 7740

TCAGGTTATG AAAAAAGAAA TATTAATGAA ATGAGT6GCG GACAAAAGCA ACGTGTTGCA 7800

ATTGCACGTG CTATCGTAAA TGAACCAGAA ATATTATTGT TAGATGAATC TTTATCCGCA 7860

TTAGATTTGA AATTGCGTAC TGAAATGCAA TATGAATTAC GAQAATTGCa ATCTAGATTA 7920

GGtATTACAT TTATATTTGT aACACATGAT CCA 7953



(2) INFORMATION FOR SEQ ID NO: 153:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2347 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear
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GGCGTGATCA 


TACGACCGTC ATTCATGCTC ATGAAAAAAT ATCTAAAGAT TTAAAAGAAG 60

ATCCTAi rii' TAAACAAGAA GTAGAGAATC TTGAAAAAGA AATAAGAAAT GTATAAGTAG 120

GAAACTTTGG GAAATGTAAT CTGTTATATA ACAGCACTAA TGATnACAAT CATTTTTTAC 180

ATTTCTATAT GCTAATGTGG CAAGATGAGC AAAACTCATT TTGTGGATaA TGTTTaAAAG 240

TCATACACAC CATACACAAG TTATCAACAT GTGTATAAyT cGcCAAATCT ATGTTTTTAA 300

GACTTATCCA CCAATCCACA GCACCTACTA CTATTACTAA GAACTTAAAA CCTATATAAT 360

TATATATAAA CGACTGGAAG GAGTTTTAAT TAATGATGGA ATTcACTATT AAAAGAGATT 420

ATTTTATTAC ACAATTaAAT GACACATTAA AAGCTATTTC ACCAAGaACA ACATTACCTA 480

TATTAACTGG TATCAAAATC GATGC6AAA6 AACATGAAGT TATATTaACT GGTTCAGACT 540

CTGAAATTTC AATAGAAATC ACTATTCCTA AAACTGTAGA TGGCGAAGAT ATTGTCAATA 600

TTTCAGAAAC AGGCTCAGTA GTACTTCCTG GACGATTCTT TGTTGATATT ATAAAAAAAT 660

TACCTGGTAA AGATGTTAAA TTATCTACAA ATGAACAATT CCAGACATTA ATTACATCAG 720

GTCATTCTGA ATTTAATTTA AGTGGCTTAG ATCCAGATCA ATATCCTTTA TTACCTCAAG 780

TTTCTAGA6A TGACGCAATT CAATTGTCGG TAAAAGTGCT TAAAAACGTG ATTGCACAAA 840

CAAATTTTGC AGTGTCCAcC TCAGAAACAC GCCCAGTACT AACTGGTGTG AACTGGCTTA 900

TACAAGAAAA TGAATTAATA TGCACAGCGA CTGACTCACA CCGCTTGGCT GTAAGAAAGT 960

TGCAGTTAGA AGATGTTTCT GAAAACAAAA ATGTCATCAT TCCAGGTAAG GCTTTAGCTG 1020

AATTAAATAA AATTATGTCT GACAATGAAG AAGACATTGA TATCTTCTTT GCTTCAAACC 1080

AAGTTTTATT TAAAGTTGGA AATGTGAACT TTATTTCTCG ATTATTAGAA GGACATTATC 1140

CTGATACAAC ACGTTTATTC CCTGAAAACT ATGAAATTAA ATTAAGTATA GACAATGGGG 1200

AGTTTTATCA TGCGATTGAT CGTGCCTCTT TATTAGCGCG TGAAGQTGGT AATAAGGTTA 1260

TTAAATTAAG TACAGGTGAT GACGTTGTTG AATTGTCTTC TACATCACCA GAAATTGGTA 1320

CTGTAAAAGA AGAAGTTGAT GCAAACGATG TTGAAGGTGG TAGCCTGAAA ATTTCATTCA 1380

ACTCTAAATA TATGATGGAT GCTTTAAAAG CAATCGATAA TGATGAGGTT GAAGTTGAAT 1440

TCTTCGGTAC AATGAAACCA TTTATTCTAA AACCAAAAGG TGACGACTCG GTAACGCAAT 1500

TAATTTTACC AATCAGAACT TACTAAAAAT AAATATAAAT AAAGGATGAC GTGATTAATT 1560

AAAACGTCAT CCTTTATTTT TTGGCAAAAA TAATTCTAGG TGCGTATGTA AAATAAATTT 1620

GGCAGCATTT TAAACAGCAA ATAAAAGACXj CCAATTAAAT TTATGACAAA TGTATCCAAA 1680

ATTTAATAAG TGTGCTTATA TGCCCTTTAA ATTTAAAATT TTAATAGTCA ATAACAAGTT 1740
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AAAAATAAGA ATTAATTATT TATATGTAAA CGGTTTCTAC CTCTATTTTA AATGAAATTT 1860 

GTGACAAAAA AAGGTATAAT ATATTAATGA CATACAAAGA AATGGAGTGA TTATTTTGGT 1920 

TCAAGAAGTT GTAGTAGAAG GAGACATTAA TTTAGGTCAA TTTCTAAAAA CAGAAGGGAT 1980 

TATTGAATCT GGTGGTCAAG CAAAATGGTT CTTGCAAGAC GTTGAAGTAT TAATTAATGG 2040

AGTGCGTGAA ACACGTCGCG GTAAAAAGTT AGAACATCAA GATCGTATAG ATATCCCAGA 2100 

ATTACCTGAA GATGCTGGTT CTTTCTTAAT CATTCATCAA GGTGAACAAT GAAGTTAAAT 2160 

ACACTCCAAT TAGAAAATTA TCGTAACTAT GATGAGGTTA CX3TTQAAATG TCATCCTGAC 2220 

GTGAATATCC TCATTGGAGA AAATGCACAA GGGAAAOACA AATTTACTTG GAATCAATTT 2280 

ATACCTTAGC TTTAGCAAAA AGTCATAGAA CGAGTAATGG ATAAGGGACT CCATACC6TT 2340 

TTAATGC 2347 
(2) INFORMATION FOR SEQ ID NO: 154: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 13542 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154: 

ACAAGACGTn TCTATAACTT ATCTGAAATC GCTCGTCAAG ATAAAGATTA TGCAACTATC 60 

TCATTCTTAA ACTGGTTCTT AGATGAACAA GTCGAACy^G AATCAATGTT TGAAACTCAC 120 

ATCAATTATT TAACTCGTAT CGGCGATGAC AGCAATGCAT TATATCTTTA CGAAAAAGAA 180 

CTTGGCGCTC GTACATTCGA CGAAGAATAA TTAAACATCA CTACAATAGA CAGATAAATA 240 

TCATACGACA TGATAGGCAT TTGGGTCACT TACAATAACC CAATGTCTAT ATTATTTTGC 300 

TTTACGGAGA TCACTAGATT CATTTTCTGA ATCATTGATC TGCGTTTTTT CATTTTCAAG 360 

GCTAATTATT GTATTTTTAG TCATTTATTT TTTAAACTAC TAATGTTAAT AACTCTAAAT 420 

TTGATGTTGA ATTAATTTGA CGATTTTAAA GCATATCATC ATTTACTTTT TAATCAGAGT 480 

TACATCCAAA TGATAGATTT CACGTTATAC CTTCACGTAT AATATTATGT ATCGTTTGTA 540 

AGCAAATGAC TAAAAGTCTA TTAATATATA CATTTAATTA ATTGAAAGGA TTGACTACAT 600 

GATACAAGAT GCGTTTGTTG CACTTGATTT TGAAACAGCA AATGGTAAAC GTACAAGTAT 660 

TTGTTCTGTC GGAATGGTTA AAGTCATTGA TAGTCAAATA ACAGAAACAT TTCATACTCT 720 

TGTGAATCCG CAAGACTATT TTTCACAACA AAATATTAAA ATTCATGGCA TACAACCAGA 780 
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aGATTTACCT GTTGTCGCAC ATAACGCGGC 
ATTTGATATG AACGTCTTAC ATCAAAGCAT 900

TCAAAATATT GGTTTACCAA CTCCAAATTT AACTTACTTT TGTAGTTATC AACTTGCTAA 960

AAGAACCGTT GATTCGTATC GATACGGTTT AAAACATATG ATGGAGTTTT ATCAATTAGA 1020

TTTTCATGGT CATCATGATG CATTGAATGA TGCCAAAGCA TGCGCAATGA TTACTTTTAG 1080

GCTACTGAAA AATTATGAAA ATTTAACATA TGTAACTAAT ATTTATGGTA AAAATCTAAA 1140

AGATAAAGGC TAGGACTAAA TAAAATACTC CCTTCAAAAG TAAGCATTGT AAAAATGTAA 1200

ACTTTGCAGG GAGCTTTATT TTATATAAAG TCATATATCG TCATATTTTT ATAAGTTGAT 1260

TGTTCTAAAT TACCTACAGT GACACCAATA AGTCGAATTG GTACATCAGG GTCTTTTAAA 1320

TCXSTTATAAA GTAAATATGC AATATTATAA ATATCTTCTT CAGAACTAAC CGAATCTCTT 1380

AAACTCATCT GTTTAGATAG CX3TTTCAAAT TGATAA6TTT TAATTTTAAC CX3TTACAGTT 1440

TTAGCTGACT TCTGTAATTT ATTTAGACGT TCAGCTGTTT TACCTGnACA ATTCCCATAC 1500

TTTTCTTAAA ATCTCTTCAT CATCATTCAC GTCTGTTGCA AATGTGCGTT CAGTCCCTAC 1560

TGATTTTCTT ACTCTTGATG ATTTCACTTC ACTATGGTCA ATACCGCGTG CCTTCTTATA 1620

TAAACCCCGA CCTCTTTTTC CAAACAAACG TATTAATTCA AATTCC3GTTT TCTCATATAA 1680

ATCTCTACCG TTAAAAATAC CATTATCATG CATTACTTTT TTGGAAGCTT TACCTACGCC 1740

TGGaAAATCT CCAATATCCA ATGTCATCAA AATATCATGG aCATTTTGAT AATCAATCAC 1800

AGTCATACCA TCAGGTTTAT TCATACCACT CGCTAATTTA GCTAAAAATT TGTTATAAGA 1860

AACACCTGCA GATGCTGTTA AATGTGTCTG CTCTAGAATA TCTTTTCTAA TATACTGAGC 1920

AATTTTCGAA GCAGGAAGGT CTGGTCTCAC TAATTCTGTA ATATCTAAAT ACGCTTCATC 1980

CAATGACATC GGTTCTACCT TATCTGTATA ACTTCGGAAA ATAGACATAA TCTGCGCAGA 2040

TGTTTCTCGG TAAGCACCAA AATTACTTGT GACAAAGTAT CCATTTGGAC ATAATTTATG 2100

CGCTTGTGAC ATAGGCATTG CTGAATGGAC GCCGTATTTT CGTGCTTCAT AGGATGCCGT 2160

AGAGACAACA CCCCTACTGC TTGCTTTACC ACCAACAATG ACTGGTTTCC CTTTCAATTT 2220

GGGGTTATCT CTCATTTCGA CTTGTGCAAA AAAATAGTCC ATATCTATAT GAATAATTCG 2280

TCTCTCAGTC AAGTGCTCAC CTCCCTACTA ATTTTTACrr TTATAACGCA CAAAAATATC 2340

TCAACATAAT TATACGCTGT GTACGATTTT TTTACATAAA TCTTGCACTT AGCGATAACT 2400

ATATTGaGAT AAC7ACAAGT TGTTATaAAA TCAATTGCTA TTTAAGCATG ATGATGAAGA 2460

CGATTGAGTA AGAAAACATA GGTAATCTGA AATAATTCAA GCAAATTCAT TTTGTTGGTA 2520

TCATCATATT AAAATTTATT ATTGAGTCGG CTTTTGATGA TACAAATAAA TACTATCTTC 2580
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AAAGCAATAA GCGGTATGCA TACTAAACAT AAAAATAAGT GATGAATAAC CAAATACCTT 2700 

AATTAAAATA AGCAAGCCAG TACTTAATAG GATTAGTGGT GACAGCATAA TAATTGAGAA 2760 

TTGCCATTTG TTGAAGCAAG CATCTGCTGT TTGGAATAAG ATTCTGTCTT TTTTTATATT 2820 

AAACATAGGT TTGCTATCTT TTTTAAATAA AAGAAATAAT GCTCTATGGA TAAGTTCATG 2880 

TAAAATCAAT AAAATAATGA ATCCAGCAAA CCCATATACA AGATTGATGA TGATATTTTG 2940

ATCGACAACC GCTGTGACAC CTAACGCCCA CTTATACGTA AATAAAATCA CGAATAACGC 3000 

AATAACAAGT TGCAAGATAA TAAACCTTCX5 CATTTGAAAA TTATTTGTC!G TTAAATCAAT 3060 

TTTATGCATT ACCAACCCTC CCGATCATGA CATTCTTATT CTTCTTTAAA TATAGTATAC 3120 

AATGTCACAT TTAATTTAAA AAGTTCATAT CAAGAAAGTA AATTGGCTGT AATAAAATTT 3180 

TAATATACGA Cri ' CiTrcri ' CACTTATTAA OQCGAAATTT TATCtCAAAT CATGTGCGCT 3240 

ATTTCAAATT GAATAATGCC ACTGTCTCAA CAT6TGTTGT TTGTGGAAAC ATATCTACCG 3300 

GTGTTACCTC TTCAAGTTGA TATTTTTCAG CTAATAATAA TGCATCACGT TGCTGTGTTG 3360 

CGGGATTACA TGAAATATAG ACAATACGCT TAGGTTCTAA TGTAAGCAAA GTCTGAATAA 3420 

ACGTTTOOTC ACA6CCCTTT CTTGGCGGAT CAACCATTAC AACATCTGGT TTAATCCCTT 3480 

GTGCTTTCCA TTGTAAAATA ACTTCTTCAa CTTTCCCACA GACAAAAGTT GTATTATTGC 3540 

ATTGGTTTAT AGTCGCATTT TGTTGTGCGT CTTCAATTGC AGAAGGTACT ACTTCAACAC 3600 

CGTATACATG TTTTGCAAGT GGTGCCATAT ATAGCCCTAT TGTTCCAATA CCACAATAGG 3660 

TATCTAATAC AACTTCATTA CCTGTCAATT GCGCATACTC AATTGCTTTA TTATATAATT 3720 

TCTCTGTTTG TTCAGAATTA ATTTGGTAGA ATGACTGATC ACTTATTTTA AATGTACTAT 3780 

CTGTTAATTG ATCAATAATT GTATCTTTAC CATATAGCGT TATAGATTGA CGTCCCATAA 3840 

TAACiATTAGA GTGGCTATCA TTAATGTTTT GTTTAATGCT TGTCACATTA GGAAATGCAT 3900 

CTAATATCTT CTCAACAACA GCATTTTTTT GTGGCCACTT TTTACCATTA GTTACAAAAA 3960 

TAATCATCAT TTCGTCTGTA TGATATCCTG TTCTTACAAC CAAATGTCTC ATTAAACCTT 4020 

TTTTCAATTG TTCTTGATAA ATACTTACAT TTAAATCTTT TAAAATAGAT TTAACTTCAT 4080 

TCATCACTTC TTGATGTTGT GAATCTTGTA TTAAACAACT TTCCATGTCA ATAATGTCAT 4140 
GGCTTCTTTG ACGATAAAAG CCCATAATAA CTTCATTCTG TTCATTCTTA CCAACTGGAA 4200 
TCTC5GGACTT GTTTCGATAT CTCCAAGGAT CTGTCATGCC AACTGTATCG TTAATCTTAG 4260 
AATTATCAAA ATGCGCTTTT CGCTGAAACA AATTAATCAC TTGTTCCTTT TTCATTTCAA 4320 
GTTGTGCTTC GTATGATAAG TGTTGAAGTT GGCACCCACC ACAAOGTTCA TAATATATAC 4380 




EP 0 786 519 A2 

AGTTCTTTTT TACTTTGATA ATTTTATATT CAATTTGTTC ATTAATTAAA GCTTGTGGTA 4500 

TGAAAATAGG AAAGOSATCT ATTTTTACGA CACCATGGCC TTCATGCGTT AAATCAACAA 4560 

CTGTTCCCGT TTTTATGTCA TTTTTAGCTA TTGCTTGCAA AATTTTACCT CCAAAATGAA 4620 

CAGGTTAGGA ACAAAATTAT GCGCTTCCTA ACCTGCCATT ATATATTTCA CTATTTCTGT 4680 

TTATTCTTCG ATTAAATTGT CATCAACATG ATCATTATTT ATTAACTCTT CATTTACAAT 4740 

ATCATTAGGT GCAAAGACAT CTATATGACG TTCTAGATTT AAGAAATTCG CTGGTAATTT 4800 

ACCACCATAT TCTCCATCTA CATTTAGTTG TAAGTCTGTG AATGATGAAA TATTAATTGC 4860 

CTTTGCTTTT TCATAAATAA CTTTAGGATG CTTAGTATGT TCTCCTCTTG AAGCTAAAGT 4920 

CATAATATGA CCAAGTTCTG CAAGGTTTGA TTTTTCAACT ATAATTAACG TAAAATAGCC 4980 

GTCATCTAAC TTAGCGTCCG GCACTAATTT TTCAAATCCT GCCATTGAAT TTGTTAAACC 5040 

TAAAAAGAAT AATAATGCTT CTCCTTGGAA AACATTACCA TCATATTCAA TTCTTAAATC 5100 

TACAGCTTTC ATTTGAGGTA ACATTTCGAA ACCTTTGaTG TAATAAGCAA ATGGACCAAC 5160 

AATAGATTTC AATTTACTCG 6TGTTTCATA AGAGACTTGC GTCAATTGTC CGCCTGCAGC 5220 

TAAATTAATA AAGTATCGAT TATTCATTTT ACCAATATCT ACTTTAGTAG AATGACCTTC 5280

AATGATGACA TCAAGTGCCC CCATGATGTC ATTAGGTATA TGCAATGCAC GTCCAAAGTC 5340 

ATTAACAGTA CCCATAGGAA TGACACCTAG CTTAGGACGA TTAGGCTTTT CTGCGATACC 5400 

ATTAACTACT TCATTTAATG TTCCATCACC ACCTGCAGCG ATTAATACAT CATAATTTTC 5460 

ATGCATAGCT CTTTCTGCTT CAAGTGTGGC ATCACCTATT TTCTCGGTTG CATATGCACT 5520 

CGTTTCATAT CCCGCTTTTT CTAATTTTAT TAAGGCATCA GGTAATTCTC TTTTAAATAG 5580 

CTCTTTACCT GATGTCX«3GT TATAAATGAT TCTAQCACGT TTCCTCATAT CTTATCCCTC 5640 

TACWAAAAT TCATATATTT TAACTTCATC TTTGTTTCX5T CTAATAGGGA GTGGGACAGA 5700 

AATAATATTT AACAAAATTT ATTTCGTTCT ACCCCAACTT GCATTGTCTG TAGAATTTCC 5760 

TTTCGAAATT CTCTATGTTG GGGCCCCACC CCAACTTGCA CATTATTGtA AGcTGACAGA 5820 

AAGTCAGCTT CTTTGTTTGG GGGCCCCGCC AACTTGCACA TTATTGTAAG CTGACAGAAA 5880 

ATCAGCTTCT ATGTTGGGGC CCCACTAGAA TTGAAAAAAG CTTGTTACAA GCGTATTTTC 5940 

TTTCAGTCAA CTACAGCCAA TATAACATTG TAGTGCCTAG GACATTGAAT TTATGACCCA 6000 

GGCTCAGTCT TATTTCATCA TTCTTAATAT CGTTAAAGAC CAACTTGTAT CTTAAACAAA 6060

TACTATCTCA ATATGTACAA AGCTTGTTAT TTATTCAGCA TTTTTTGCCG TTCTTCATTA 6120

TAtAGcTTCG TCAGTTATGC TATnTACCT TTAAAATGAT GTTGTAAATA TAATGTTGTC 6180 
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AACGCATTAA TAAAATTAAT ATTTTTACCA TTAACATCTA CAATGAATAA AGTTAAAAGT 6300 

AATTTGACTT CTATAGATAT AAATAAACCC TCGATTGCAT CTAAGTCAGC AATCAAGGGT 6360 

TTATTTTTTA AATCTTCATA GTTTGATGAT TTAAATTATC TTTTATCTAA TTCTTGTTTT 6420

AATAGTTGAT TTACTAATTG TGGATTAGCT TGACCTTTAG ACGCTTTCAT AATTTGACCA 6480

ACTAAGAAGC CCATAGCTTT GCCTTTACCA TTTTTGTAAT CTTCAACTGA TTGTTCXjTTA 6540 

TTGTCTAATG CTTCATTTAC AAATTTTAGA AGTGTTGCTT CATCAGAAAT TTGAACTAAG 6600 

CCATTATCTT CCATAATCTG TTTAGCATTA CCACCTTTAG CTGCTAACTC TGGGAAGACT 6660 

TTCTTCGCAA TTTTACTGCT CATTGTTCCG TCTTCGATAA GTTTAATCAT ACCTOCTAAA 6720 

rr rrCT G GTG TTAATTTAGT ATCTAATAAT TCTACTTGAT TXTTAnTAA ATATTCGTTT 6780 

ACGCCACCCA TTAACCAGTT AGATGTTAAT TTAACATCTG CACCGTGTTC AATTGTTGAT 6840 

TCAAAGAAAT CTGACATTTC TTTAGTCAAT GTTAATACGT GTGCATCGTA TGCAGGTAAA 6900

CCTAATTCAT TTACATACTT AGCTTTACX3T TCATCTGGTA ATTCAGGAAT TGTCTGACGA 6960 

ACACGCTCTT TCCAA6CATC ATCAATATAT AAAGGTACAA TGTCAGGCTC TGGGAAGTAA 7020 

CGGTAATCAT CAGAACCTTC TTTAACACGC ATTAAAATTG TTTTACCTGT AGATTCATCA 7080

AATCGACGTG TTTCTTGTCC GATTTCTCCA CCATTTAACA ATTCTTCTTC TTGGCGTTTT 7140 

TCTTCATATT CTAAACCTTT ACGTACATAG TTAAATGAGT TTAAGTTTTT CAATTCGGCT 7200 

TTAGTACCAA ATTTTTCTTG ACCATATGGA CGTAAAGAGA TGTTAGCATC ACAACGTAAA 7260 

GATCCCTCTT CCATCTTAAC GTCTGATACA CCAGTGTATT GAATAATTGA ACGCAATTTT 7320 

TCTAAATATG CATATGCTTC TTTAGGTGAA CGAATATCTG GTTCAGATAC GATTTCAATT 7380 

AGCTGGTGTAC CTTGACGGTT CAAGTCAACT AATGAATACT CACCTTTATO TGTTGACTTA 7440 

CCAGCATCTT CTTCCATGTG AAGACGAGTA ATACCGATTC GTTTTGTTTC ACC6TCGACT 7500 

TCGATATCGA TATATCCATT TTCACCAATT GGTTGATCAA ATTGAGAAAT TTGATATGCT 7560 

TTTGGATTAT CTGGATAGAA ATAGTTCTTA CGGTCAAACT TAGATTCTGT TGCGATTTCC 7620 

ATATTTAGTG CCATTGCAGC ACGCATTGCC CAGTCTACTG CACGCTTATT AACAACTGGT 7680 

AAGACACCTG GATATGCTAA GTCGATAACA TTTGTATTTG AGTTAGGTTC TGCTCCAAAA 7740

TGCGCTGGTG ATGGAGAAAA CATTTTTGAG TCCGTTTTTA ACTCTACGTG AACTTCAAGT 7800 

CCTATAACTO TTTCAAAATG CATGATTTCC ACTCCTTATA ATTTTTCATA AACGTCATGT 7860 

AAATTGTATT GTGTTTCATA TTGATAAGCG ACACGATATA ACGTTTTTTC ATCGAATGGT 7920

TTACCAATGA ACTGTAAACC GATTGGTCGG CCATTTGATT GTCCACAAGG AACAGAAATA 7980 
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GGATCATCAA TTTCTTCACC TAAATTAAAC 
GCaGTgTnAG GCGCTGTTGG ACCAACTACT 8100

ACATCATAAT TTTCGAATAC TTTATCAAAG TCATTTTTAA TCAATGTTCT AACTTTTTGA 8160

GATTTTTTAT AGTAAGCATC ATAGTAACCT GAACTTAATG CAAATGTACC TAAGAAAATA 8220

CGACGTTTTA CTTCTTTACC GAAACCTTCA GATCTTGACA TTTTATATAA TTCTTCTAAT 8280

GAATGAGCTT CTTTAGAATG ATAACCATAA CGAATTCCX5T CAAAACGAGA AAGGTTTGAC 8340

GAAGCTTCTG ATGATGCAAT CACGTAATAT GATGGAATAC CAAATTTAGT ATTTGGCAAT 8400

GATACTTCCT CAACGACAGC ACCTAAAGAT TTTAAAGTTT CTACAGCGTT TTGAACTGCT 8460

TCTTTTACGT CATCAGCTAC ACCTTCACCT AAGTATTCTT TAGGTAATGC AACTTTTAAT 8520

CCTTTTAATAT CTTTACCAAT TTCAGATCTA AAGTCTACAT CATCAACTGG TGCACTTGTA 8580

GAGTCATTAA CATCTGCACC AGAAATAGCT TCTAATACGA TTGCATTATC TTTTACATTT 8640

CGAGTCAATG GACCAAITTG GTCTAATGAA GATGCAAAAG CAACTAATCC AAATCGAGAT 8700

ACACGACCGT ATGTTGGTTT CATACCGACA ACGCCACAAT ATGCAGCCGG TTGTCTAATT 8760

GAACCACCT6 TGTCTGAACC TAAGCTAAAT GGTACTAAGC CAGCTGCAAc TGCT6CTGCA 8820

GATCCACCTG ATGAACCACC TGGCACTGCT TTATGGTCAA ATGGGTTAAC TGTTTTTTTG 8880

AAATAAGATG TTTCTGTTGA ACCACCCATT GCAAACTCAT CCATATTTAA TTTACCGATT 8940

AAAACGGCAT TTTCATTATG TAGTTTTTCC ATTACAGTAG ATTCGTAAAT TGGCACAAAA 9000

CCTTCTAACA TTTTACTTGC ACATGTTGTT TCTAATCCGT TTGTAATAAT GTTATCTTTT 9060

ATACCCATTG GAATACCAAA TAATTTGCCA TCCATTTGAT CTTTTGCTTG TAATTCATCC 9120

AATTCTTGCG CTTTTTTGAT TGCATTTTCT TTATCCAGCXJ CTAGAAAAGA CTTAATTGTT 9180

GGATCA6TCT CTTCAATTGC ATCATATATA TCTTTAACAA CATCAGATGG TTTGATTTTT 9240

TTGTCTTTTA TTAAAGTTAA TAAATTCTCA ACCGATTCGT AGCGAATGCT CATCTTACGC 9300

GTCCTCCTCA TTCATGATTG TAGGCACTTT AAATTGTCCA TCTTCTOTTT CTTTGGCATT 9360

TTTCAAAGCT AATTCTTGTG GAATACCTTT AATTGCTTTA TCTTCACGTA AAACGTTTTG 9420

TAAATCTAAA ACGTGATATG TAGGTTCAAC GCCTTCTGTA TCAGCGCTAT CATTTTGTTT 9480

TGCAAAATCT AAAATGCTTT CTAATGTGTT GGCCATTTCT TCCGTTTCTT CAGGAGAAAT 9540

TTGAAGTCTT GCAAGATTCG CGATATGCTC AACTTCTTCA CGTGTTACTT TTGTCATTAA 9600

TAAAAGCCTC CTTTAAGTCA TTCATCACTA AATTGTATCA AATTTCCAAT TAAAAATCTA 9660

AGTATTTATG AGGTGCTACT TTAATTTCAT ATAAACTGTA TAAACATTAT CATTCGTTTA 9720

TCAAATCATT TTTTATGAAA ACAACACTCT TTTAATATTA GACAACCCAA TTCAATATTA 9780
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TATATTGGTA TGCAAGTATT TCAAAAAGAA TAAATTTAAT TTTCCTACTT TTCTAAACAT 9900 

TTATCTTTAT GTATAATGTT TTCAAGTAAC TAAATTATAA ATTAAATAAA GGGAOTGTTT 9960 

ATCATGCTTA CAATGGGGAC AGCATTAAGT CAACAAGTAG ATGCCAATTG GCAAACTTAT 10020 

ATTATGATTG CCGTCTACTT CTTGATACTA ATCGTTATTG GCTTTTACGG TTACAAGCAA 10080 

GCAACTGGTA ACCTAAGCGA GTACATGTTA GGTGGACGTA tATTGGACCG TATATTACTG 10140 

CATTATCAGC TGGAGCTTCA GATATGAGTG GATGGATGAT TATGGGGCTA CCTGGTTCTG 10200 

TCTATAGCAC TGGTCTATCA GCTATGTGGA TTACAATCGG TTTAACATTA GGTGCTTATA 10260 

TAAATTACTT TGTTGTTGCT CCTAGACTTC GTGTTTATAC CGAATTAGCT GGAGATGCAA 10320 

TTACATTACC AGATTTCTTT AAAAATCGTT TAAACGATAA AAATAATGTG TTAAAGATTA 10380 

TTTCTGGATT GATTATCX5TA GTATTCTTTA CATTATATAC ACATTCTG6T TTCGTATCTG 10440 

GTGGTAAACT ATTTGAAAGT GCTTTTGGAT TAGATTATCA TTTOGGTTTA ATATTAGTTG 10500 

CTTTCATTGT CATTTTCTAT ACTTTCTTTG GTGGATATTT AGCTGTATCA ATTACAGATT 10560 

TCTTCCAAGG TGTCATTATG TTAATTGCGA TGGTTATGGT CCCTATTGTT GCTATGATGA 10620 

ATTTAAACGG CTGGGGAACG TTTCATGATG TAGCAGCTAT GAAACCTACA AATTTAAATT 10680 

TATTTAAAGG GTTATCATTT ATAGGAATTA TCTCTCTATT TTCATGGGGA TTAGGTTATT 10740 

TCGGTCAACC TCATATCATT GTAAGGTTTA TGTCTATTAA ATCACACAAG ATGCTACCTA 10800 

AAGCTAGACG TTTAGGTATT AGCTGGATGG CTGTTGGTTT ATTAGGCGCT GTGGCTGTTG 10860 

GTTTAACAGG TATTGCATTC GTACCTGCTT ATCATATTAA ACTAGAAGAT CCTGAGACAT 10920 

TATTCATCGT GATGAGTCAA GTACTCTTCC ATCCTCTTGT AGGTGGTTTC TTACTTGCTG 10980

CGATTCTAGC TGCAATTATG AGCACGATTT CTTCACAATT ACTTGTAACA TCTAGTTCAC 11040 

TAACGGAAGA CTTTTATAAA TTAATTCGTG GTGAAGAAAA AGCTAAAACG CACCAAAAAG 11100 

AATTTGTTAT GATTGGAAGA TTATCTGTAT TAGTTGTAGC AATTGTTGCC ATCGCGATTG 11160 

CATGGAATCC AAACGACACA ATTCTAAACT TAGTAGGTAA CGCTTGGGCC GGATTTGGTG 11220 

CATCGTTCAG TCCACTTGTG CTATTTGCAC TTTACTGGAA AGGTTTGACA CGTGCCGGTG 11280 

CTGTAAGTGG AATGGTTTCA GGTGCCTTAG TCGTTATCGT TTGGATTGCA TGGATTAAAC 11340 

CATTGGCACA TATCAACGAA ATATTCGGCT TATATGAAAT TATTCCTGGA TTTATTGTAA 11400 

GTGTAATCGT TACATATGTT GTAAGTAAAC TTACTAAAAA ACCTGGTGCA TTTGTTGAAA 11460 

CTGACTTAAA CAAAGTTCGT GACATCGTTA GAGAAAAATA ATTCATAAGT CTTAACAAAT 11520 

TAAAAAGGTA CTAATGTTAA TCAAAATTAT GACTAACATT GGTACCTTTT TATTATCTTT 11580 
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AATTAAA6CA 
CGTGGTTGGT TACCATCTTT AATACGAATT TCATAGTTAT OGATTTTATC 11700

GAAATATTTA TTCGCTTGTT CAGTAACGTA CTGTGTAATA CCAATTGTTT CAGCTTGTCC 11760

ATAGTAATCG ATTGGTAAAT CTACTACTAA TCGTTGTGGC TTTTTATCAA CAAATTTAAC 11820

TTTCCCTACT GCTTGTGTGA AATTAGAAAA ATATGATTGC AAATTATCAT TAAATTGCTT 11880

GAAATTATTA TTTAAATTTT CATCATAATC TGCTGCTGTT GAAGAAGGTA ATAAAGCTGA 11940

TTTTTCATTG ATATTATGCC ATTCATTAAG CTTTGTTTGA CTCTTTTCTG CAGTCGCTTG 12000

AGTGATAAAT TCACCTGGTG TGATTGAATC TTCACTTGAT TGCTTATAAA TTGCAAAATG 12060

AATTGGTATA TCTTTTAAAT CATCATTTTC ACX^TAACCTT GATAATATCT CACTAGCCAT 12120

TTGTTTACCT TGCTTTTTAA CTcGCTATCA TCTAGTTTTT TACTAAAAGT CGATCCATCT 12180

TrrrcTTTTT TATAGTAATA AACACTATTC ATAGCTAAAC CAATCGTCAT ACCTTTAATA 12240

TTCTTACXTTT TTGTATCTCC ACCACCATAA AAATCTTGCT CTAAAATGTT AGATAAATAG 12300

GCTGGTGATT TTTCTGCAAT CTTTTCAGGA TCTGTTTCAC CTtOGTGTGA TGGATTAAGT 12360

CCTAAATTTT CATTCGCTTT CTTGTCTTTT TTATCTTTTT CAGACATTTT ATCGATTTCA 12420

CGTTTTGTAT ACTTAGGATT TAAATAGGCA TTAATTGTTT TCTTGTCCAA AAATTGACCA 12480

TCTTGATACA AATATTTATC TGTTGGAAAT ACTTCTTTAC TTAAGTTCAA TAAACCATCT 12540

TCAAAGTCGC CGCCATTATA ACTATTTGCC ATGTTATCTT GTAAAAGTCC TCTTGCCTGG 12600

CTTTCTTTAA ATGGTAACAA TGTACGATAG TTATCACCTT GTACATTTTT ATCCGTTGCA 12660

ATTTCTTTTA CTTGATTTGA ACTATTGTTA TGTTTTTGAT TATCTTTTCC AGCCTGGTCA 12720

TCCTTATGGT TACCACAAGC AGCGAGTATA AAGATAGCTG TAATCAATAA TACTAATGTA 12780

CGCTTCATCG ACATACCCCT CTAACTATTT AATTCATTTT GCTTATCTAC AAATTGTTGC 12840

TCTGTCCAAA TTTCAATACC TAAACTTTGT GCTTTTGTTA ATTTTGAACC TGCATCTTCA 12900

CCAGCAATAA CGACATCTGT ATTTTTAGTA ACGCTACTTG TAACTTTAGC ACCTTGTGAT 12960

GCAAGCCATT TAGATGCTTC ATTGCGTGTC ATTTGATGTA GCTTACCAGT CAGTACTATC 13020

GTTTTACCAC TAAATTCAGG ATGTCCTTCA ATATCTGATG TTTTGATACC TTTATAAATC 13080

ATATTAACAT GTTTATCTTT TAATTTTTGA ATTAAAGCAC GAATATCTTC ATTTTCTAAA 13140

TAAGTAACTA CAGATTGTGC TACTTTATCA CCTATATCAT GAATTTCTAC TAATTCCGCT 13200

TCAGTTACCG TTAGTAATCG ATCTATCGTT TCATATTTTT CTGCTAACAC TTGGCTCGCT 13260

TTAACACCTA AATGCCTAAT ACCTAGACCA AATAATAAAT TTTCTAAAGA GTTGTCCTTA 13320

GCTTGTTGAA TGGCAGCTAA TAAATTATCA ACTTTTTTCT GCCCCATTCT GTCTAAAGGT 13380
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TAAAGCTGTT GAATAATTTT AGTGCCTAAA CCATCAATAT TcATGGCTTG TCTTGaTACA 13500 
AAGTGnATCa ATCCtTcAAC AAGTTGTGCT TGGTCATTTT GG 13542 
(2) INFORMATION FOR SEQ ID NO: 155: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1893 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155: 

CAGTAAACAC CTCTGATTAC GAATATTTAT ACATTTATTT TAACACATGC ACTGATTTAC 60 

GACTACTAAA CACCTTTACG TAAAAAGGGT AAACATGGTT TATCTATCTT GGTTATCTAT 120 

TTATAAATAT TTnTCATATT ACGCATAACA ATTGCTTAAA ATATGTATAA AAATGAATAT 180 

ATGTGTAATA AACTTGCTAA TTATTAGATT TAATAAGCGT CAATTGTTTG AACATATTtA 240 

ATTAAAATCA CATTGATATC ACAGATACGA ATATTGTCGT ATAGAAATTG AAAATTCTAT 300 

TTTTTAAATG AAAGTCTTCA ACATAATTTT AAGTTTCAAC ATGAGAAAAA TCGATTAACA 360 

AACAACGTCA GTTGAATATG CCTTTTGAGA CATTTCAAAC TTTACAATTG TTGCTAATCG 420 

ATATATTTGC TTTTAGTGAT CCCTGCTATA AAATAAATCA AOGATTTCTA ATAAGTGTTT 480 

TGTATTGAAT TGTTCATCAA TTTGCGTTAG TTCATCCACT GCTGCGTCTC TATGATAAGT 540 

CAATTTATCT TCTGCGCCAT CTTTCCCTAA TAAACTCACG TACGTACTTT TATTATTTTC 600 

AAGATCGCTG CCCACTTTTT TACCTAACTT TGCTTCATCA CCATAGCAGT CTAATAAATC 660 

ATCTTTAATC TGGAACATCA TACCTAAATG ATAACTATAA CTTTCTAAAT GTTCTTTAGT 720 

TGTSTCATCG ACATTAGCGA TATCTGCTGC ACTCATAACC GCAAAAGTTA ATAATGCTCC 780 

TGTTTTTGTT TTGTGTATCA TTTCCAAAGT TTCAAGATCA ATTGGTTGGC CTTCGCTTTG 840 

CATATCTAAC ATTTGACCGC CGACCATTCC AACATGACCA CTTGCTATTG ACAGCCGTTG 900 

TAGAACTTTT ATTTTTACTT CATCAGTTAA TCTATCATCA CTTGAAATAA GTTCAAATGC 960 

TTTAGTTAAT AAAGCATCAC CTGCTAATAT CGCAGTCCAC TCACCATATA CTTTATGATT 1020 

TGTTAATTTT CCTCGTCGAT AATCATCATT ATCCATCGCT GGTAGGTCAT CATGAATAAG 1080 

TGAATATGTA TGAATCATTT CTAGTGCAAT TGCGCTCTTC ATACCTAACT CATACTCGGT 1140 

ATTTAGTGAA TCTAAAGTGA GTAATAACAG AACTGGTCGG ATGCGTTTAC CTCCAGCATT 1200 

TAATGAATAC AACATACTTT CTTCTAGCTG AGTATCCATT ACTGATTTAT TTATCGCAAC 1260 
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CATCCTCAGC TTCTTCTTTT ATTAAGTCAT TCACCTTTTT TTCGGCATTT TTTAAAGTTG 1380 

TGTCACAAGC TGCTGATAGT TTCATACCAC GTTGATATAA ATCTAATGAT TCCTCTAAAG 1440 

ATACTGTTTC ATTATCTAAT TTTTGAACAA TTTGCTCTAA TTCTTGCATC ATTTCTTCAA 1500

AACTTTGCX5T TTCTTTAGTC ATTATTACAC CTTACTTTCG TAACTTTTGC ATCTACTAAG 1560 

CCATCTTTCA TTGTTAACGT CAATTGATCA TTTTCTGITA AATCTTTAGT ACTCGTAATG 1620 

ACTTCGTCTT TTTTATTAAC AATTGCATAT CCACXSCAACA TTGTATTAGT TGGACTTAAA 1680 

TTGTTTAAGT TTTCTACTTT ATTTTTCAAA TCATTTTTAT AACTTAATAT CTTAGAATTC 1740

AATAATTTAA CAAGTTGGTT TGTCAATTGA AGATTATnTT GTTGTTCTTG ATTAACACTA 1800 

CTTAGTAATG CTTTTAAATn ATAACGTTGG TGCAACAGCA TTAAATCGAG GCCCCGGTGG 1860 

TCCAAAGTTG CCCGAATTnG TGGTTTCAGG CCC 1893 
(2) INFORMATION FOR SEQ ID NO: 156:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 621 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156: 
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AAAATATATT CCTTCACTTA ATATTCAATT AGAGAAAAAC ATGGTGATTG TAATATGTTG 60

TGCAATATTT CTGGGTGTTT TAATACTTTT TTTATTTCTG AATCGTAAGC TAAGGTTGGA 120

AATTTATAAT AATAACTCTA GTAAAGGGAA AATAATTTTA TTTCCTTCAT TAAAAAACTT 180

TTGTTTCACA ATATTTTATT ATTTTTTATT TGGCGGTCTT TCAATAATGG CTCTAAGTAT 240

GTTATTAACT TTAAATCCTC AAAATATAAT AGGCTTTATT GGTTGGTTGG TAATGACTGC 300

AGGTTTCTTT CTGTTAAACA TGTCATCGAT TATTGACAAA AAAATTTATG TATTATCTAA 360

AACTAACACG GTGGAAAAAT GATGGTTTAG CTGGATTTAC TGCAGGTTCT ATTTCGGCAA 420

TACTTGTATA TTGGACCAAT CAAAAAAATG AATTTGGAAT AAAAGATAAA AACGATTGGA 480

TAGGACATAA ACTAGACGTT GGTATAGATG CTGTAGAAAA ATCTGCAGAA AAAACAGTAG 540

ATGGTGTTGA AAATGTCATG GTGAAGCTTC AAAAAGTATT TCTAATCATA TAAGCCCTAA 600

GAAATGGAGC TGGTAAATGT TGCTATGCGA ATGTAAAATC ATCAATAAAA ACCCAAAATA 660

TAGAATTATT AAATATAATG ATGAATACTT AATGGTCGAT ATAATAAGCA CTTGGATTAG 720

TTTATTTTTT CCTTTTATTA ATTGGTTCAT CCCaAAAGaA TACGTCAAAA TTAGTAGAGA 780
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IS 



(2) INFORMATION FOR SEQ ID NO: 157: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2343 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157: 

AGTAAGATAA TTTTCAATTA GAAAATATCT TACTGCTGTT CTCTATTTAT ACAATACTTC 60 

GTATTGAATG GcTTCGCTTT CCTAGGGTGC CGTCTCAGCC TTGGTCTTCG ACTGGCACTG 120 

CTCCCTCAGG AGTCTCGCCA TTAATACTAC GTATTAACAT GTAATTTTAC TTTGAAATAC 180 

TTTTAAAAAA TAAGACACTT TCCCCAACTT GCACATAAAT GTAAAATTCA ATAAAATGAA 240 

TTTTCTGTGT TGGGTCCCTT CTTATAATTT AATAAATACC ACTAAACTAA ATTAACGAGG 300 

TGCCTTATGT ATAAAAATTA TAACATGCCC CAACTACACT ACCAATAGAA ACTTCTGTTA 360 

GAATCCCTCA AAATGATATT TCACGATATG TTAATGAAAT TGTTGAAACr ATACCTGATA 420 

GCGAATTCGA TGAATTCAGA CATCATCGTG GCGCAACATC CTATCATCCA AAAATGATGT 480

TAAAAATCAT CTTATATGCA TATACTCAAT CTGTTTAATT ATGTTCAAAG CATTAAGGTA 540 

ACAAGACAAT ATCTAAGATA TCAAAGATAG AAATTTTTTG ACGTTGTTGC TGATTGTAAA 600 

CATAACCATC AATTTCATAA TTAATAGCAT CAATACGATA AATGGTTAAG CGTACTGAAT 660 

CTACAAAGCC ATTATTATAA AATTTAACTT CTACAGGTTG GGCATATTGT AGCGCCTCGT 720 

GTAGCCGAAT GTTTAGCTCA GCCAATTGAT CATCTGATAA TACAGGACGT GTAATTTTGT 780 

TTTGGTCGAT AATGTATTGT TGAATCGTTT CGAATTGTTC GGGTAATGTT GCAAAAGGAG 840 

CCOStTTAAT CATGCCTCTT CCCATAGGTA TATTGTTATC TAGTAATTCT CTTGGAACGT 900 

TACGATAATC AGTTTCTTCT TCATAACTTG TCATCCTTAA TTCACCCCAA TCTGATAATT 960 

ACATTATACG AACATGTGTT CTATTTTGCA ACAAAAATTT TGTGGaAGCA TAAACGCGTT 1020 

AATAATTAAT GCTCGTGtAA GTAAAAAAGA GGGATTAATT AAAATCGAAT AATGaCATAT 1080 

CACaGCAAAT AGTTCTTTTA AAGTAGTTAA ATAGTTTTAG CTTTAAGGAA aTGATAAaTG 1140 

ATTGTWAATT CTAGCTAAAA TTTAATAAAA TGAAAATAAG ACTAACATGO AGGGGTAAAA 1200 

GTAATGACAA ATGGATATAT TGGTTCTTAC ACTAAAAAGA ATGGTAAAGG GATTTATCGT 1260 

TTTGAATTAA ACGAAAATCA GTCACGTATT GATTTATTAG AAACAGGATT TGAATTAGAA 1320

GCGTCTACAT ATTTGGTGCG TAATAATGAA GTTTTATATG GAATCAACAA AGAAGGAGAA 1380 
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TGTTTGTCTT CAAAAGCTGG TACAGGTTGT TATGTATCGA TTTCAGAAGA TAAAOGATAT 1500 

TTATTTGAAG CGGTATATGG TGCTGGCATC ATACGTATGT ATGAATTAAA TACGCACACA 1560 

GGTGAAATTA TACGTCTAAT TCAAGAACTT GCACATGATT TTCCAACAGG TACACATGAA 1620 

AGACAAGATC ATCCACACGC ACATTATATT AATCAAACTC CAGATGGTAA GTACGTTGCA 1680 

GTAACAGATT TAGGTGCTGA TCGTATCGTT ACTTATAAAT TTGATGACAA CGGGTTTGAA 1740

TTTTATAAAG AATCTTTATT TAAAGATAGT GATGGGACAA GACATATTGA ATTTCATGAT 1800 

AATGGAAAAT TTGCTTATGT CGTACACGAA TTATCAAATA CTGTGAQTGT TGCAGAATAT 1860 

AATGACGGTA AATTTGAAGA GCTCGAGCGT CATTTAACAA TTCCTGAAAA CTTTGATGGA 1920 

GATACTAAAC TTGcAGCAGT GCGTTTATCT CATGaTCAAC AATTCTTATA TGTATCTAAT 1980 

AGAGGGCATC ATAGCATTGC AATTTTTAAA GTTCTTGATA ATGGTCAACA CTTAGAACTA 2040 

GTAACAaTTA CTGAAaGTGG TGGTCAATTC CCAAGAGATT TTAATATTGC CTCATCAGAT 2100 

GACCyTTTAG TTTgTGCTCA kGaGCaAGGA GATTCAGTTG TAACTGTTTT CGAAAGAAAT 2160 

AAAGAAACAG GTAAAATTAC GCTATGTGAT AACACTCGTG TAGCATCTGA AGGTGTATGT 2220 

GTCATATTTT AATCTTTAAT TAATCATGAT AAAAAGAAAA CCATGTTTCC AAAAAATTTG 2280

TGTATACCTT GAAATTTATT GnTTTCCAGn ACATCAATTA TGGGAAGCAT GGnTTATTTT 2340 

TGT 2343 
(2) INFORMATION FOR SEQ ID NO: 158:
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4837 base pairs

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158: 

AAATTGCCAG TTGGTATCGC TTCTGGTGCA GTAGTCGAAG GTTTCTTCCA AGGTATCATT 60 

CCGATTGGCT ATATCGTTAT GATGGCAGTA TTGTTATACA AAATTACTGT TGAATCTGGA 120 

CAATTTTTAA CAATTCAAGA TAGTATTACA AATATTTCAC AAGACCAACG TATTCAAGTT 180 

TTACTTATTG GATTTGCATT CAACGCATTT TTAGAAGGTG CAGCAGGATT TGGTGTACCA 240 

ATTGCAATTT GTGCACTTTT ATTAACACAA TTAGGATTTA ATCCATTAAA AGCTGCGATG 300 

TTATGTTTAG TCGCAAATGC AGCGTCTGGT GCTTTTGGTG CGATTGGTAT CCCTGTAGGT 360

GTTGTAGAAA CGTTGAAATT ACCTGGAGAT GTTTCAGTAT TAGGTGTTTC TCAATCAGCA 420 
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GGlTiTAGAG GTGTTAAAGA AACATTACCA GCAATTTTAG TAGTTTCAAT CACTTATACA 540 

CTTACTCAAG GATTATTAAC TGTATTCAGT GGACCTGAAT TAGCAGATAT TATTCCACCG 600 

TTATTAACAA TGTTAGCATT AGCAGTATTT TCTAAAAAAT TCCAACCAAA ACACATTTAT 660 

CGTGTTAATA AAGATGAAGA AATTGAACCT GCAAAAGCAC ATTCTGCAAA AGCAGTATTA 720 

CATGCATGGA GCCCATTCAT TGTATTAACA GTCATTGTAA TGATTTGGAG TGCGCCATTC 780 

TTTAAAAACT TATTCTTACC AAATGGTGCT TTATCATCAT TAGTATTTAA ATTCAACTTA 840 

CCTGGaACAA TCAGCGAAGT TACGCATAAA CCATTAGTAT TGACTTTAAA TATTATTGGA 900 

CAAACAGGTA CAGCTATTTT ATTAACTATT ATTATTACAA TTTTAATGTC TAAAAAGGTT 960 

AACTTTAAAG ATGCAGGTAG ATTATTCGGC GTTACATTTA AAGAGTTGTG GTTACCAGTT 1020 

CTTACAATTT GTTTCATCTT AGCAATTTCT AAAATCACAA CTTATGGTGG TTTAAGTGCA 1080 

GCAATGGGTC AAGGTATTGC TAAAGCAGGT AATGTCTTCC CAGTTCTATC ACCAATTTTA 1140 

GGTTGGATAG GTGTGTTTAT GACAGGATCA GTTGTAAATA ACAACTCATT ATTTGCACCA 1200 

ATTCAA6CTT CTGTTGCACA ACAAATTGGA ACAAGTGGTT CACTTCTTGT ATCTGCTAAT 1260 

ACAGTTGGTG GTGTAGCGGC AAAATTGATT TCACCACAAT CAATTGCAAT TGCAACTGCA 1320

GCAGTAAAAC AAGTTGGTAA GGAATCAGAA TTATTAAAAA TGACATTGAA ATACAGTGTA 1380 

TGTTTACTAA TATTCATCTG TATTTGGACT TTCATCTTGT CATTATTATA AAAAAACGTA 1440 

TTTCAAAATA TAAATATACA GAAGGTGAGA TGTTTTCTAA CATCTCATCT TTTTTTTATG 1500

GATCATTAAT GAAAGAAGTT TGACATTATA ATAATGGTAG CGCTTTATGT TAAAATGAAT 1560 

A6TGAGTAAT CAGCAATCAA ATTAAATTGG TTGATAGCTG TTAAGGTTTG TGGTTTTGTC 1620 

TTTGTGCTAT CGCnCATAAA GTATATAATT AAAGTAGTTT CGTTATTATA AAATATTAAT 1680 

ATAC^TAGTA GATAGTAATA GAGCATCACC ATGGGAACCT ATTGAGACAC TTATTGATTT 1740 

AAAGTGGTAT TAATATGTCG TATTTCTCGA ACGTTCCATT ATTCATTTTA AAAAGGGGGA 1800 

CTGTATTTGT TATGACAACA CAACATAGCA AAACAGATGT CATCTTAATT GGTGGCGGTA 1860 

TTATGAGTGC aCATTAGGAA CATTACTTAA AGAATTATCA CCTGAGAAAA ATATTAAAGT 1920 

GTTTGAAAAA TTAGCACAAC CTGGCGAAGA GAGTTCAAAT GTATGGAATA ATGCCGGTAC 1980 

AGGGCATTCA GCACTTTGCG AGTTGAACTA TACAAAAGAA GGTAAGGATG GCACAGTTGA 2040 

TTGTAGTAAA GCAATTAAGA TAAATGAGCA GTACCAAATT TCAAAACAGT TTTGGGCATA 2100 

TTTAGTTAAA ACAGGACAAT TAGATAACCC AGATCGCTTT ATTCAAGCGG TGCCACACAT 2160

GAGTTTTGTC ATTGGCGAAG ATAATGTAGC TTTTATAAAA AGTCX5TGTTG CAACGTTAAA 2220 
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GGTACCGTTA ATGATTGAAG GTCXSTAAGTC TGATGAACCA ATTGCTTTAA CTTATGATGA 2340 

AACTGGTACa gATGTTAACT TTGGTGCGTT AACTGCAAAG TTATTTGATA ATTTAGAGCA 2400 

ACGTGGTGTG GGAATTCAAT ATAAGCAGAA TGTATTAGAC ATCAAGAAAC AGAAATCTGG 2460 

GGTATGGCTA GTTAAAGTTA AAGATTTAGA AACTAATGAA ACGACAACAT ATGAATCTGA 2520 

TTTTGTATTT ATTGGTGCTG GCGGTGCGAG TTTACCATTA CTCCAAAAGA CTGGGATTAA 2580 

ACAATCAAAA CATATTGGTG GTTTCCCGGT AAGTGGATTA TTCCTGCGCT GTACAAATCA 2640 

AGAAGTGATT GATCGTCATC ATGCTAAAGT GTACGGAAAA GCAGCAGTGG GTGCGCCACC 2700 

AATGTCAGTG CCGCACTTAG ATACACGTTT TGTAGACGGC AAGCGTTCAT TGTTATTTGG 2760 

TCCATTTGCA GGTTTCTCAC CTAAATTTTT AAAAACAGGT TCACATATGG ATTTAATTAA 2820 

ATCGGTTAAA CCAAATAATA TCGTGACGAT GTTATCTGCA GGTATCAAAG AAATGAGTCT 2880 

TACGAAGTAT TTAGTGTCAC AATTGATGTT ATCTAATGAT GAGCX5TATGG ATGATTTAAG 2940 

AGTCTTTTTC CCAAATGCTA AAAATGAAGA TTGGQAAGTG ATTACAGCAG GGCAACGTGT 3000 

CCAAGTAATC AAGGATACTG AGGATTCTAA AGGTAACTTA CAATTTGGTA CTGAAGTTAT 3060 

TACGTCAGAT GATGGCACAT TAGCTGCATT ACTTGOTGCA TCACCTGGTG CGTCAACAGC 3120

TGTAGATATT ATGTTTGATG TTTTACAGAG ATGCTATCGT GATGAATTCA AAGGATGGQA 3180 

ACCAAAGATT AAAGAAATGG TGCCGTCATT TGGTTATCGC tTAACAGATC ATGAGGATTT 3240 

ATATCATAAA ATTAATGAAG AAGTAACTAA GTATTTACAA GTTAAATAAT AAACGAAACG 3300 

GTAATGTCTT TTTTAATGTG ATAGACATTA CCGTTTTTTA GTGGTTAATA AAAATCATTT 3360 

TAATTGTTTC AGTTGCTTGT TAATAGTGTC TACGTAGTTC TTGTTTTTAA AGAATTGAAT 3420

TATCCAAATT AATACATAAA CCACAATGAA GATAATTGTG AATATGATTA GATAATGCAC 3480 

TGTTAGTGGA AACCAACCGG CAAGCATTGC TAAAGGCAAG AATCCGACAT ACGTTGTTAT 3540 

GAAATGCATT ATAGTTGCTT TAGTAATGCT CCAATCTGTG TATTTAAAGA TAAAATCTCC 3600 

AAGGAAAAAG ACGACGCCTA TGAGTAACCA TAAAATGATA GAAATCAACA TTACGGTAGT 3660 

TTCTGTGAAA TGCGTATAAT ACAATATGCC AATAGTTGAT TGTGGGTTCA GTGGATAATA 3720 

TTTGCCGTCT GCAAATAACA TACTAAAGAA CAGTGAAAGG GACAAACCAA TGATTAAGCT 3780 

AATAAATAAT GAGTTTTTCA AATTTTTCAT ATTGATAAGC GCTCCTTTAT AGATTTTAAA 3840 

TAACGTCTAG AAGAATAGGT GTAGTGTGCA TCTTTAAGAT ACATACGTAT AAGTCCATTT 3900 

GGCTCTAATA ATAATTTTTC AATGTAATAC TTGTTGACGA TTTCTGATTT GGAAATGCGA 3960

ATGAAATGTT GTGGTAACTG TTTTTCTAGT TCATAAAGTC GTAATTTTAG TTTGAATTTT 4020 
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ACATTAATGA TATGGATTTC TTTGTCTATG TATCCGACTA ATGTATGTGA TTTGTCTAAA 4140 

TCATTGACTG CATTAATAAT ACTTTGAACG TTATCATTCA TTTTAGGTGC ATGTATATCA 4200 

ATATAAGATT CCGTCTCATT TGCATTGATA AATAAATTGA GTTTCATCAT AGOTTAATGC 4260

CTCCTTCAAA ATTATTAAAC CATAAATGAC CATCGATATA TTTAAATTTT GTTGAATGGT 4320 

AGAAATTAAA TGTTAAGTGG CTAGAAAGCG CTAATCAATA TAAAAGATAC CTCCTGAAAT 4380 

AAAAACAGAA ATGTTTTTTC AGGAGGTAGA GATTAAAGTG AATTATTTGG CAGTGTAATA 4440 

GTAAAGGTGG TTACATACTC GTTACTTTGT GTGAATTGGA TTGTACCATG ATGCAATTCA 4500 

ATGATGGATT TTGTAATTGC AAGACCTAAA CCATTGCTAT TATCATGTTT GCTCACTTTA 4560 

TAAAAACGTT CAAATAAACG T6CTTCAGCT TGTGGACTAA TTGGTGAACC ATCATTACTT 4620 

ATTGTGAAAA TGATATTGTT GTGACTATGT TGCAAAGCGA TGTCAATGGC ACCACCAACA 4680 

TCTGTATACT TAATAGCATT TATTAATAAA TTACTCAATG CTTGATGTAA CAAACGTTGA 4740

TTTCCTAGGA AATTGATGAT TCTAGGTCAG CTAAnATGAT TAACGACTTT TCATCAGCAG 4800 

CAnATTGTTC ATGTCXSAATG ATATCnTTAA TGAGCTG 4837 

(2) INFORMATION FOR SEQ ID NO: 159:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1600 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159:

ACAATTATTG GATTATTATC AAGCAACGTT AATGGATGAC TTCCACTTAC AACAGAAATG 60 

CCCATAGATT CTAAATCTtT TGCATGAGCA TCTTGTGATA AGTCTTTTCC ATCATTGACA 120 

GTTACATTCG CACCTAATTT ACTTAATAAT TTAGCTGCTT CATAACCACT TTTTGCCAAA 180 

CCGACAACTA ATACATTTTT ATTTTCTAAC CCTGTATAAT TAAGCATCTT AATGCACTCC 240 

AATCCATAAA CCGATTAAAC CTGAAATCAG ACCAACAGCC CAAAATACTG TAACTACTTT 300 

CCATTCGCTC CATCCTATCA ATTCAAAATG ATGATGAATC GGACTCATTT TAAATATACG 360

CTTTCCAGTC AATTTAAAGC TAGCGACTTG TAACATAACA GATAATGTTT CAATTACGAA 420 

TACTAAACCT ATAAAAATTA ATGATAATTC CTGATTAAGC ATGATTGAAA TGGTAGCAAA 480 

TATACCACCT AAAGCTAAGC TACCTGTATC TCCCATAAAC ACTTTAGCAG GGTTAATGTT 540

ATATGGTAAA AATCCTAAAA GTGCAAACAA CATAATGATA CAGAAAATAC CAATTGCCGT 600 
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TGCTAATCCA TCTAAACCAT CTGTTtiMiTT TACTGCATTA GAAAAACCTA CTTGCCAAAA 720

AACAATGAM ATAACATATG CAAATGATAG TGGGATTGCT ACATTCGTAA ATGGAATATG 780

TATGCTCGTA GAAAAATTCA CCAAATGAAA CACATTACTT AAAACAAAGA ATATAATCGC 840

AATACCAATT TGCGCCAAAA ACTTCTGTTT ACTTGTTAAA CCTTGGTTAT TCTTTTTAAC 900

AACAATAATA TAATCATCTA TAAAACCAAT TAACCCAAAA CCAATCGTCA CAAATAATAA 960

CAGTATGATT GGATTAGCTT GATCTACAAA TATAATAGCC ACCAAAGACG TTATCACAAT 1020

ACTTAATAGA AATGTTAGTC CACCCATCGT TGGTGTACCA GTCTTCTTCA TATGGCTTTG 1080

TGGACCTTCT TCTCGAATAC TTTGACCAAA TTTCATCCTT TTTAATGTAG GTATTAAAAC 1140

AGGTACCAAA ACAAATGTAA TCACTAGCGC TAATAACGCA TATACAAAAA TCATAACTAT 1200

CTCCTCTTCT TAATCCAGAC TTTTTTAACC ACTAATATAT TATCAATTTT TCAATTAAAT 1260

AAACAAAGTT GTAATCAAAA TTTATAATTT TTCTTTTTTA CGGCATAAGA GGCCAGTATA 1320


xmn^ivj <L X A \xw 


CTATAACAAA 


PAAGTTAATC 


A vl^VwV* A VmW a V» 


TACCTTAAAA TTCTCTATCA 


1380 


ACACTTATTT ATAAAGATTA AATGAAGATG TTGTTTTCTA TCACAGCATT ACTTTAGTAA 1440

AAACAAATA6 TGACAATACA TCCTAATTTA ATGTAGCCAT TCTTGTTAGT CCGACTTATC 1500

CTTGTCAGTT TTACTGTCAG ATTTCnTCTT ATCATCTGAA TTTGAATCAG AATTATTCGT 1560

CGAATTGCTG TCTACATTCT CTGGATGGAA AATTCTACGT 1600



(2) INFORMATION FOR SEQ ID NO: 160: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1186 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160: 

ATTGCCTTTG TTTTAATTTT AAATCAAAAT mGCCTATGAA AGATTTAAAT CAATTAATTT 60

CTATAATATT ATCATTTTTA AAGCATATCA TTGTTTAGTT TTTTTATAAT TGGATAAATA 120

CTAATAGTTA CTTTATAAAA CATTACATAG AGAAAGGTTA AGGAGTGCAC ATGTCGAAAA 180

AGGATCACTC TTCTTCAAAA TACCTTAATT CTGTTAAGGA AGCGCAAGAG GAGTCAAAAA 240

AGAAAAATAA AAGTAATCCC AAAATTGATG TTGATCGTAC ATATATTGAA CCTCAACAAT 300

TCCAATCTAA GAAACCTAAA AAAGATGATC AGGTTTTCTT CTTATCAAGA TTAAATAAAC 360

CTGCAAAATA TAAGAAAGAC TCTAATTTCT TATCATATCT CATCTATCGC ATAGGAAAAG 420
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TGTTGCTTTT CCTATTAACA TTATTACCAT TTTTCAATAT TAAGCAGAGT CAAATTACTA 540 

ATATGTTAAG CAATGCACCC GCTGAAACAT CTACTCTAAT TAAGAGTGTA ATTGGTGATA 600 

TAACTCAAAA CTCCAGTGGT GGCTTATTAT CTATCGGTTT GATTTTAGCA ATTTGGTCAG 660

CTTCAAATGG AATGACTGCA ATTATGAATT CTTTCAATGT TGCTTACGAT GTAGAAGATA 720 

GCCGTAATGG AATCGTATTA AAACTACTAA GTGTTGTCTT CACTGTAGTT ATGGGCGTTG 780 

TGTTTGTAGT TGCTCTAGCA TTACCAACGC TTGGTTCTGT AATTAGTCAT TTCCTATTCG 840 

GTCCACTTGG aTTTGACGAA CAAGTGAAAT GGATTTTTAA CCTTATTAGA ATTGTGTTAC 900 

CAATCATTAT TATATTTATC ATATTTATCG TGTTATATTC GGTTGCACCT AACGTTAAAA 960 

CGAAGCTTAA GTCAGTATTA CCAGGTGCAG TATTTACTTC AATTATTTGG TTAGCTGGTT 1020 

CATTTQGTTT TGGTTG6TAT ATTTCAAATT TTGGTAACTA TTCTAAAACA TATGQCAGTA 1080 

TCGCX3GGTAT CATCATTTTG TTACTATGGT TATATATCAC AAGTTTTATT ATAATTGTCG 1140

GnGCTGAAAT CAATGCAATC ATTCATCAGC GTAGTGTAAT TAAAGG 1186
(2) INFORMATION FOR SEQ ID NO: 161: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 7872 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161: 



TCTTGAGCCA TCTTTTGAGC TAACTGACTA GATTGATACC CAAAAATCAT AGTTACCAAC 60

ATAAACTTTA ATTTTACCGA AGTCTAAATC AGCGATATGA GTACATACAT TATTTAAGAA 120

ATGACGGTCA TGCGATACTA CGATAACAGT ATTATCAAAG TTAATTAAGA AATCTTCTAA 180

CCAACTGATT GCTGGAATAT CGAGACCGTT AGTAGGCTCA TCCAGTAATA GTACGTCTGG 240

TTCACCGAAT AAACTTTGCG CTAATAATAC TTTAATTTTT TGGTTGTTTT CTAATTCAGC 300

CAITTTITPA TCGTGTAAAG TTGGATCGAT ACCTAAACCA GATAAAAGGT TAGCAGCATC 360

AGCTTCAGCA TTCCAACCAT TCATTTCTGC AAATTCACCT TCAAGTTCAG CAGCACGGAT 420

ACCATCTTCA TCACTGAAAT CTGGCTTCAT ATAGATTTCA TCTTTTTCTT TCATAACCTC 480

ATAAAGACGT TCGTGACCTT TAATTACAAC ATCAAGCACG CGTTCATCTT CATAAGCATA 540

GTGGTCCTGT TTTAAAACAG CTAGACGTTC ATTTTTCCCT AATGAAACAT GTCCTGTTTG 600

AGAATCTAAT TCACCAGATA ATATTTTTAA GAATGTTGAT TTACCTGCAC CATTCGCACC 660
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ATCTCCAAAA CGTAAACTCA CATCAGTTAC TTGTAACATG CATTTTCTCC TTTTTTTCAT 780 

TCGATATTCT AACGGAAGAA TTATATCATA TTATCGTCAC AGTTTCGACC TCATATAAGT 840 

TGTAATGATA GAATGACTCA CACATGTTAT AATAATAAAG AATACAAGAA TCGAAGGAGA 900 

ATAACATGGC ATTAGACAAA GATATAGTAG GTTCTATAGA ATTCCTTGAA GTAGTAGGGT 960 

TACAAGGTTC AACTTACCTT TTAAAAGGAC CAAACGGTGA AAACGTAAAG TTAAACCAAT 1020 

CAGAAATGAA CGATGATGAT GAATTAGAAG TAGGTGAAGA ATATAGTTTC TTCATTTATC 1080 

CAAACCGTTC AGGTGAATTA TTTGCAACTC AAAATATGCC TGATATTACG AAAGATAAAT 1140 

ATGACTTTGC TAAAGTACTT AAAACGGATC GCGATGGGGC ACGTATAGAT GTTGGATTAC 1200 

CCCGTGAAGT GTTAGTACCA TGGGAAGATT TACCAAAAGT GAAATCACTA TGGCCACAAC 1260 

CTGGTGATTA TTTGCTAGTT ACATTACXSAA TTGACCGTGA GAATCATATG TATGGACGTT 1320 

TAGCGAGTGA ATCTGTTGTA GAAAATATGT TTACACCTGT ACACGACGAT AATTTAAAAA 1380 

ACGAaGTCAT TGAAGCCAAA CCTTACCGCG TATTACGAAT TGGTAGCTTT TTATTAAGCX3 1440 

AATCAGGTTA CAAAATTTTC GTACATGAAT CAGAACGTAA AGCTGAACCA AGATTAGGTG 1500 

AATCTGTTCA AGTTAGAATT ATCGGQCATA ATGATAAAGG TGAGTTAAAT GGTTCATTTT 1560

TACCACTTGC ACATGAACGT TTAGACGATG ACGGCCAAGT CATCTTTGAT TTACTAGTTG 1620 

AATATGATGG TGAATTACCA TTCTGGGACA AATCAAGCCC TGAAGCXSATT AAAGAAGTAT 1680 

TCAATATGAG TAAAGGTTCA TTCAAACGTG CAATCGGTCA CTTATATAAA CAGAAGATTA 1740

TTAATATAGA AACAGGTAAA ATCGCTTTAA CTAAAAAAGG TTGGAGTCGA ATGGACTCAA 1800 

AAGAATAATC ATTTTTACAC GTGTCGTAGG ATGCGTGTTT TTTTTATTCA ATATTAAATC 1860 

GGACAGATGA AGTAGTTTTT TAAACATTCC TTTCAAAGTA AAAAATTAAA TAATTCAAAC 1920 

GAATAGGCTG GGaCATTAAG TTCTTAGGCA ATGTAAAAAA GCTGATTTCT ATTAATTATT 1980 

TGATGGAAAT CAGCTTTTTT GATATGTATT TTATAATGTA CAGCTCX5TTG AGCTGCTATT 2040 

TTCCTTATAT TAAGTGCCAT TAATACAAAA CCTAGCTCTC GTTTAACTTT ATTTATTCCT 2100 

CGAACTGACA TTCGAGTGAA aCCCAAAATA GCCTTCATAA ATCCAAAAAC AGGCTCTACA 2160 

TAAATTTTTC TATGACTATA 6ATTTTTTTC GTTTCTGGTT CAGAAAGCTT TTGaTTAATT 2220 

TGGGCTTTAA TGTATTTCAA AGTAAAATTA CATGTTAATA CGTAGTATTA ATGGCGAGAC 2280 

TCCTGAGGGA GCAGTGCCAG TCGAAGACAG GGGCCCCAAC ACAGAAGcTG ACATATAGTC 2340 

AGCTTACAAC AATGTGCCGG TTGGGGTGGC TGAGACGGCA CXTCTAGGAAG GGACCCGTCA 2400

TCAAAAATTC TATTTATAGA ATTTTACAGT AATGTGACAG ACGGGCAAAG CGAAgCCAIT 2460 



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40 




EP 0 786 519 A2 



CTTACTGCTG TTTTTTTAGG GATTTATCTC CCAGCCATTT TTGTATTCAT ATTTAAATTT 2580 

CGATAATTTT TCAGGAAGCA TTTTAATTTT ACTAATC5AAG CAATATTTTT TAGATTAACA 2640 

AAAATTAATA TTTACATTTT CTTAACAATT TTTTATGTAA CATTTACAGT TTCTAAAAAT 2700

GAGGTTAATA ATTCAAGGTT AAGATAAAGA TGTAATCAAT ACAAATACTA TTTGTTGTTC 2760 

ATACAGGGAG GATATTTCAA TGAAAAAATG GCAATTTGTT GGTACTACAG CTTTAGGTGC 2820

AACACTATTA TTAGGTGCTT GTGGTGGCGG TAATGGTGGC AGTGGTAATA GTGATTTAAA 2880 

AGGGGAAGCT AAAGGTGATG GCTCATCAAC AGTAGCACCA ATTGTGGAGA AATTAAATGA 2940 

AAAATGGGCT CAAGATCACT CGGATGCTAA AATCTCAGCA GGACAAGCTG GTACAGGTGC 3000 

TGGTTTCCAA AAATTCATTG CAGGAGATAT CGACTTCGCT GATGCTTCTA QACCAATTAA 3060 

AGATGAAGAG AAGCAAAAAT TACAAGATAA GAATATCAAA TACAAAGAAT TCAAAATTGC 3120 

GCAAGATGGT GTAACGGTTG CTGTAAATAA AGAAAATGAT TTTGTAGATG AATTAGACAA 3180 

ACAGCAATTA AAAGCAATTT ATTCTGGAAA AGCTAAAACA TGGAAAGATG TTAATAGTAA 3240 

ATGGCCAGAT AAAAAAATAA ATGCTGTATC ACCAAACTCA AGTCATGGTA CTTATGACTT 3300 

CT7TGAAAAT GAASTAATGA ATAAAGAAGA TATTAAAGCA GAAAAAAATG CTGATACAAA 3360

TGCTATCGTT TCTTCTGTAA CGAAAAACAA AGAGGGAATC GGATACTITG GATATAACTT 3420 

CTACGTACAA AATAAAGATA AATTAAAAGA AGTTAAAATC AAAGATGAAA ATGGTAAAGC 3480 

AACAGAGCCT ACGAAAAAAA CAATTcAAGA TAACTCTTAT GCATTAAGTA GACCATTATT 3540 

CATTTATGTA AATGAAAAAG CATTGAAAGA TAATAAAGTA ATGTCAGAAT TTATCAAATT 3600 

CGTCTTAGAA GATAAAGG7A AAGCAGCTGA AGAAGCTGGA TATGTAGCAG CACCAGAGAA 3660 

AACATACAAA TCACAATTAG ATGATTTAAA AGCATTTATT GATAAAAATC AAAAATCAGA 3720 

CGACAAGAAA TCTGATGATA AAAAGTCTGA AGACAAAAAA TAATAAGACG CAATTTCAAA 3780 

TGTGTCTTGA AACATGATTT TGATGGTGAA TCATTATTTA GAGTACAAAG CTTGATTTAT 3840

CGAGACGCTG ATTTTGACAT TCAGTTAGTC TAcAAGCTTA TCAACTTAAA ATAGTGGTTC 3900

ATCATTATTT TACAAATCTA ATTATTTTGG GAGTAATAGA AAGAGGTTTG ATTATGACTT 3960 

CATCTACTAA TGTTAAAGCT TTAATCGAAA AAAATAATAA TAAAAAAGGA AAGCATAATG 4020

ACAAAATTAT ACCAGTTATT TTAGCCGCAA TTTCAGCGAT TTCCATTTTA ACAACACTAG 4080 

GTATATTAAT CACATTGCTT TTAGAAACCA TCACTTTTTT CACCAGAATT CCAATAACTG 4140

AATTTCTATT TTCTACTACT TGGAATCCTA CCGGTTCAGA CCCTAAGTTT GGTATCTGGG 4200

CATTGATAAT AGGGACTTTA AAAATCACAG TTATTGCGAC TATATTTGCA GTTCCAGTCG 4260 

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AACCGATATT AGAAATTTTA GCAGGAATAC CAACAATTGT GTTTGGTTTC TTTGCATTAA 4380 

CCTTTGTTAC ACCAGTATTA AGATCTTTCA TACCAGGTCT TGGAGAGTTT AATGCTATAA 4440 

GTCCCGGCrr AGTTGTCGGT ATTATGATTG TCCCTCTCAT CACAAGTTTG AGTGAGGaTG 4500

CAATGGCATC TGTACCAAAT AAAATTCGAG AAGGTGCCTA TGGACTTGGA GCAACTAAAT 4560 

TAGAAGTAGC AACTAAAGTC GTACTTCCCG CAGCAACATC AGGTATTGTA GCTTCAATCG 4620 

TTCTCGCGAT TTCAAGAGCA ATTGGAGAAA CGATGATTGT ATCATTAGCG GCAGGTAGTT 4680

CGCCAACAGC TTCATTAAGT TTAACAAGTT CGATTCAAAC AATGACTGGA TATATTGTTG 4740

AGATAGCGAC AGGTGATGCA ACATTTGGAT CAAATATTTA TTACAGTATT TATGCTGTAG 4800 

GGTTCACACT ATTTATCTTT ACCTTAATCA TQAATTTACT TTCTCAGTGG ATTTCTAAGC 4860 

GTTTTAGGGA GGAGTATTAA TATGGAAACG ACAGATAATA ATAGACAATC ACTCGTCGAT 4920 

CAACAACTTG TCCAAAAACA TTTATCATCC AGAACGGTTA AAAATAAAGT GTTCAAACTC 4980

ATATTTTTAG CATGTACATT ATTAGQACTT GTCGTACTTA TTGCGTTGTT AACTCAAACA 5040 

TTGATTAAAG GGGTAAGTCA TTTAAATTTA CAGTTTTTCA CTAATTTTTC TTCTTCAACA 5100 

CCATCTATGG CTGGCGTTAA AGGCGCGTTA ATCGGTTCAC TTTGGTTAAT GTTAAOTATC 5160

ATTCCATTAT CAATCATCCT AGGAATAGGT ACAGCTATAT ACTTAGAAGA ATATGCXSAAA 5220 

AACAACAAAT TTACTCAGTT TGTTAAAATC AGTATTTCCA ATTTAGCTGG TGTACCATCA 5280 

GTTGTATTTG GGTTATTAGG TTATACTTTG TTCGTTGGTG GTGCAGGGAT TGAAGCCTTG 5340

AAAATGGGTA ACAGTATATT GGCAGCAGCG CTAACAATGA CCTTACTGAT ATTACCAATT 5400 

ATTATTGTTT CAAGTCAGGA AGCAATTAGA GCTGTACCTA ACTCAGTACX5 CGAACTTcTT 5460 

ACGGCTTAGG TGCTAATAAA TGGCAAACGA TAAGACGTGT TGTCTTACCA GCAGCGTTAC 5520

CTGGTATTTT AACTGGATTC ATTTTGTCTC TTTCAAGAGC ACTGGGAGAA ACAGCGCCAC 5580 

TTGTGCTAAT CGGTATACCG ACTATATTAT TGGCAACACC TAGAAGTATA TTGGATCAAT 5640 

TTTCAGCATT ACCTATCCAA ATATTTACTT GGGCGAAAAT GCCTCAAGAA GAATTCCAGA 5700 

ATGTTGCATC GGCAGGCATT ATCGTTTTAC TAGTTATCTT AATCTTAATG AATGGCGTTG 5760 

CGATTATTTT ACGTAACAAA TTTAGTAAAA AATTCTAATT TAAACAATCA ATCTCATTTA 5820 

TCTATTAAAA AGGGAGTTTT AAATATGGCG CAAACACTTG CACAAACTAA ACAAATATCT 5880 

CAAAGTCATA CGTTTGATGT CTCACAAAGT CATCATAAAA CACCAGATGA TACAAACTCA 5940 

CATTCTGTTA TATATTCAAC ACAAAATTTA GACTTATGGT ATGGCGAAAA TCATGCATTA 6000

CAAAATATTA ATTTAGATAT TTATGAAAAC CAAATTACTG CCATTATAGG TCCATCTGGT 6060 

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AAAACAGCTG GTAAAATATT ATATCGAGAT CAAGACATTT TTGATCAAAA ATATTCTAAA 6180 

GAACAATTAC GTACAAATGT GGGCATGGTC TTTCAACAAC CTAATCCATT TCCAAAATCA 6240 

ATATACXy^TA ATATTACTTA CGGTCCAAAG ATTCACGGTA TTAAAAATAA AAAAGTTCTT 6300 

GATGAAATCG TTGAGAAATC ATTACGTGGC GCTGCAATTT GGGATGAATT AAAGGATAGG 6360 

TtGCACACAA ATGCATATAG TTTATCCGGT GGGCAACAAC AACGTGTTTG TATCGCGCGT 6420 

TGTTTAGCAA TTGAACCTGA AGTCATTTTA ATGGATGAAC CGACATCAGC ATTAGATCCA 6480 

ATCTCAACAT TAAGAGTAGA AGAGTTGGTT CAAGAACTAA AAGAAAAGTA TACAATTATT 6540 

ATGGTtACAC ATAATATGCA ACAAGCAGCT CGTGTATCAG ATAAAACTGC ATTTTTCTTA 6600 

AATGGTTATG TCAATGAATA TGATGATACT GATAAAATTT TCTCTAACCC ATCAAACAAG 6660 

AAAACAGAAG ATTATATTTC AGGAAGGTTT GGTTGATATA TAATGGCAAT AATTAGACAA 6720 

CGATATCAGG AGCAACTTGA TGATTTAATA AAAGAATTAC GTCGGTTAGG TGCaAATGTC 6780 

TATGTGAGTA TTGaAAATGG TATAAAAtCA TTAAGTATTG aCGATAGAGG cTTTGCACGA 6840 

CAAACAGTTA AAAACGATAA ACATATCAAT CAATTAAATT ATGATATTAA TGAGCGAGTT 6900 

ATCATGTTAA TTACAAAGCA ACAGCCCATT GCGAGTGATT T6CGTATGAT GATTTCTTCA 6960

TTAAAAATCG CCTCCGATTT AGAAAGAATA GGAGATAATG CCTCGAGTAT TGCCAATATT 7020 

CXyVTtGCXSTA CAAAGATTAC AGATGATTAT GTGTTAACCC GTTTAAAGAC AATGGGTAAA 7080 

TTAGCTATGT TAATGTTAAA GGACTTAGAT CAAGCATTTA AAAAGAAAGA TACCGTATTA 7140

ATAAGAGAAA TAATTGAGCG TGATGAAGAT ATCGATGACT TATATAGTCA TATTATTAAC 7200 

GCAACGTATC TTATTGATAA CGtCCATTTG TCGCTGCACA AGCTCATTTA GCAGCAAGAC 7260 

ATTTAGAACG TATTGGTGAT CATATTATTA ACATCGCTGA AAGTGTTTAT TTTTATTTAA 7320 

CAGOTACACA TTACGAACAA TAACTTAAAG TTATTACTAT AAAATCCCTT ACGATAAATA 7380 

TATATTTCTA TTATTGATAA ACCCTCAAAA AAACCAAGAT TCTCACAATT AGTAATOTGA 7440 

AAATCTTGGT TTATATTGTT CTACTATAAA TTGTCTCGCA TCTTAGTTAT TTGCTTGCTC 7500 

AATTTCATCT GTTAATTTTT CAACTTCATC GACTAAATCA GAAATATATT GAATTGTAGA 7560 

TTTAAGTGGC TGTTCTGTAG TAATGTCTAC ACCTGCAATG TTTGCAAGTT CGACAGGTGA 7620 

TACACTACCA CCTTTTTTCA ATGTTTCTAA CCAAGCATCA ACAGCTGGTT GGCCTTCATT 7680 

TTTAATCTTT TGAGAAACGA CAGTTCCGAT TGTTAAGCCA GCAGAATACG TATACGAATA 7740 

TAATCCCATA TAGTAATGAG GTTGACGCAT CCATGTTAAT TCAGCACCCT CAGTCATGTC 7800

TACTGCATCT CCAAAAAATT GTTTATAAAC ATTTAGCATT ATTTCATTTA ATGTnCGGCG 7860 
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(2) INFORMATION FOR SEQ ID NO: 162: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 798 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162:





TTTTTTCTTT TCTTCATTTG AAAATTGATC ATTCAGCAAT ATAAGCGTAT TTGTTAATGA 60

TTTAGGTGTT CCAATTTCAT AATCCCACCA ATTTAAGTTG GTATTCTTGC CAGTTGTTTT 120

AGTAAAATTC TCACTTAATT CTTTTACTTT TTTATCTGGT TCTTTTCCAT ATGCATTTTT 180

CTTCTTATTT TCGTCAGTAT TTAAAGTGGT 240

TTTAGGATTC CTCATCGCTT CTGC6ATTTT CTCAATATTA CGATAGGTAC GAGTCATATG 300

AGAAGAATTA GTTTCAAGG6 TTTCCGCTCC TGACCACAAG TATTTCCTAC CACTTTCA6T 360

TTTCATTTCC TTGAGTAAAT TCGTCGCCTC TTTCTCTGTA GCATCAAACT TCTTCTTCAT 420

ATCTGGATTA TTCTCATCAT ACTTATCATA ACCATAGTTA ACGTCCAGCC ATGTGTTCCT 480

CAATTTTTCA TAATCTGGCG TTTGAACATT CGTATCAGCC ACAGCGATTT GATGTTTATC 540

AACACTTCTG AATTCACCAC CATTCAAAGT AATCACACCA GCCATTAATA ACGTAATGGT 600

GGATAATTTT TGCCATTTCT TTATTCTATA TGTCATTGaC ATGTCTCCTT TTT6TGTTGC 660

6CGTGCGCAA TGAATATTAT GATTAAATAA TGATTCAATT TTTCAAAATT CGTTAACGTA 720

TACAAATGAC TGTCTACTGT CAAACAATCC ACAAAGAATG TTGATGtCAT ATaAACAATC 780

GATCACCCAA ATTTTCCG 798



(2) INFORMATION FOR SEQ ID NO: 163:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 5132 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163: 
TACAGGTTTT ACTATAATGG ATGGTATTTT GGCTAAACGA CATTGGTTTA GTCTTCTTTT 60 
TTTnACTTCC TAnATTTACA ATGGTATAAA TAATAATGCT ATATTTAGAA TGATGAGTAT 120
ACTTACTGAA ACTAAATTAA AAGTGTCTGG TTCTTTACTA AAGATAGCTG CTATCCTTGC 180 
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AATACAAGTT CCAATGAGCG CAATTAAAAG TACTAACCCA ACXJATGAAAC TCTGTTTGTC 300 

ACTTAACTCA AAGAAACTAT AGATAGGATA TTTTTTAATA ATCAAGCCAC CTAAAATCAT 360 

CCATAAAAAT ACGATAATTC CATAAGTCAC ATTTATAACA TACGTTATTT TTTGGTCACC 420

AAATCGGACT AATGTATTTC GTAGAATCAG CATACCAATG ACAACACCTA AAATAACGAT 480 

ACTAGCTATA TAAAGTAAAA ATGCAATTGT CACATCAAAT GTACCCAAAT CTAAAAACCT 540 

AGGAATTAyA AyGACTGCTA AAATAAAAGC GAAGyACAAA GTAATATAkT TATACAAACC 600

GGTAGTAAGA CTTATCTCAG GTGATAATTG ATCAGCCATT GACTTAATCG GTGTATTAAT 660 

AATTGAACTT GTATCTTCGT TATTTTTTTC AGCCATAGTT AAATGATCTT CGAGCTCTTC 720 

CAATAACTCT TCTACTTCTG CTTCAGTCTT ACCTCTAAAT AACAATTCAA CACGTAATTT 780

TTCTAAAAAA TCTTGAGATT GTTTACTTAA CATCOnTTC CCCTCCAAAC AAGTTAATCA 840 

TCCCTTTATT CAAAACTTGC CATTTCGATT TAAATACTTT TAGTTCCTTT AAACCTGAAT 900 

CQGTAATCGT ATAGTATTTC CGCCTCGGGC CGCCATTACT AGATTTTTTT ATTGTCGTAT 960 

C2VACGTATCC TTTTTTGTTT AAACGCATTA AAACTGGATA AATACTACCC TCACTTATCT 1020 

CTCGAAACTC TTGATTCTTA AGTTTCGTCA TAATTTCATA TCCATACGTT TCGCCTTGGG 1080

CAATGAGACC TAATATCGCC CCATCTAAGA GACCTTTCAT AATCTGATCT GACACTGACA 1140 

TTTTAATCAC CTACTATCTT ACATAATAAG ATAGTACATT GAGAACTTTT CGTCAACTAT 1200 

CTTTTATTGT AAGGTAGTTG TTGTACACAT TCCTTAAATG ACTAACAACT TTGTTAATAG 1260

GGTAATACTT ACGGAAGTAT ATTTTATTTA TGGGGGAGGA ATTAATAATG ACTACAAAAA 1320 

CAGTATTTGA TGTCATTGAT ATGGGGTTAG GATATTTAGT AAATGTGTAT GATGCTTGGA 1380 

AAGTTGAAAA GGTACTTGAT GATTATCATA AGCCTTTTTC TAATACCATT CATTGGCAAT 1440

TTGGitCATGT ATTAACAATT TTTGAATCGG CCTTAOCTGT TGCTGGTAAA GAGAATATTG 1500 

ATTTAAATAT CTATAGACCT TTATTCGGAA ATGGTTCGTC TCCAGATGAA TGGAAGGATG 1560 

AAGTACCGAG TATTGAAAGG ATTTTAGAAG GTCTCCAAAC TTTACCTGAA CGTGCAOGAA 1620 

ATCTAACTGA AGATGATTTA GCAATTGAAT TGAAACAGCC AATTGTCGGT TGTAATAACT 1680 

TAGAAGAGTT ATTAGTATTA AATGCCATTC ACATCCCACT TCATGCTGGT AAAATTGAAG 1740 

AGATGTCTCG TATATTAAAA AATTTAAAAT AAATATGTGC TTATTAACCG TTAACAACAC 1800 

GTTAACGGgT TTTTTATTTG TTTAAAAGGT CACTTTTTTG AATTTAATAA ACACCATCTA 1860 

TACCAGTTCT TCACCGATTC TCGAAAAATA ATTATATTAA TGATTTCGTT AATTTAATTT 1920

TATATTTAAT TATTACTGTA CATCTTTTGT AGTTAGCTTT ATTCTTAAAT TGAAATATGT 1980 
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TACTCCCTAT CGTTGTAGGT CTCCTTATTT GGGCACTTAC ACCTTTTAAA CCGGATGCTG 2100

TGGATCCAAC AGCATGGTAT AT6TTCGCAA TATTCGTCGC GACAATCATT GCTTGTATTA 2160

CACAACCGAT GCCAATTGGG GCCGTCTCTA TAATTGGATT TACAATCATG GTACTCGTTG 2220

GCATTGTTGA CATGAAAACG GCTGTCGCTG GTTTTGGTAA TAATAGCATT TGGTTAATTG 2280

CTATGGCATT TTTCATTTCG AGAGGATTTG TGAAAACAGG TCTTGGTAGA CGTATCGCAC 2340

TTCATTTCGT CAAATTATTT GGTAAAAAAA CATTAGGATT AGCATATTCT ATCGTCGGTG 2400

TAGATTTAAT TCTAGCGCCT GCTACACCAA GTAATACCGC GCGTGCTGGT GGAATCATGT 2460

TCCCAATTAT CAAATCACTT TCTGAATCAT TTGGTTCGAA ACCGAAAGAC GGATCAGCAC 2520

GCAAAATGGG TGCATTTCTT GTTTTCACAG AATTCCAAGG TAATTTAATT ACTGCGGCTA 2580

TGTTTTTAAC TGCAATGGCC GGTAACCCCC TTGCACAAAA TTTAGCATCT AGCACATCTA 2640

ATGTTCACAT TACATGGATG AATTGGTTTC TAGCTGCTTT AGTTCCTGGA CTTGTTTCCT 2700

TAATTGTTGT ACCTTTTATT ATTTATAAAA TTTATCCACC AACTGTTAAA 6AAACACCAA 2760

ATGCTAAGAG TTGGGCTGAA AATGAATTAG CGACTATGGG TAAAATCXKH* TTAGCTGAAA 2820

AATTTATGAT TGGTATTTTT GTCGTTGCGT TAACACTATG GATTGTCGGA AGTTTCATTC 2880

ATATTGATGC AACTTTAACG GCCTTTATTG CGCTAgcATT gTTATTATTG ACAGGCGTCT 2940

TAACATGGCA AGACATTTTA AACGAAACAG GTGCTTGGAA CACATTAGTA TGGTTCTCAG 3000

TATTAGTGTT AATGGCCGAC CAATTAAACA AGCTTGGATT TATTCCTTGG TTAAGTAAAT 3060

CCATTGCTAC AAGTCTTGGT GGCTTAAGCT GGCCTATAGT CCTGGTCATT TTAATATTGT 3120

TCTACTTCTA TTCACATTAC TTATTTGCAA GTTCTACAGC ACATATCAGT GCGATGTATG 3180

CAGCATTACT AGgCGTTGCC ATCGCAGCCG GTQCACCACC ATTATTCAGT GCATTAATGT 3240

TAGGTTTCTT CGGTAACCTA TTAGCTTCAA CAACACACTA TAGTAGTGGT CCAGCGCCGA 3300

TTCTATTCTC TTCAGGTTAC GTGACTCAAA AACGTTGGTG GACAATGAAC TTAATATTAG 3360

GTTTCGTCTA CTTTATTATC TGGATTGGTT TAGGATCACT TTGGATGAAA GTAATTGGTA 3420

TATTTTAAAA TATTTAAATT AGCGCTCGAA TCTCATTGAT TTGGGCGCTT TTTAATTTGT 3480

ATTTAAAATC AACCTTTGCT AAATCAAGAC TCCCTTTTTA AAATACGTTT ATCCTTTAAA 3540

TCATTGCGTG CTTCACTGAA AATTTGTATA AAGATTTAAG TCATTACGTA ACATCACATA 3600

AAATACATTT CTATACTATT CCGCTTCATT GATTAACATT ACGTATGCCC TCATAAATCA 3660

TCATACAAAA AACACCTTCG TTTAAATTCA TTTTAATTGC GAATTCAACG AAAGTGCCTT 3720

ATTTCATATT TAATGTTTCA AATTTATACG TCTOTCACTG TTACTGCACA CATACCTCAG 3780
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TTKr^GGGTT 


TTTGCGACCG GATGTTTCTT CAATTTAATG TATTGAGAAA GACTATATAA 3900

CACAATACCT GTCCAAATAA ATATAAACGT AATTAATTGA TCTATACTAA AAGGCTCTTT 3960

GAAAACAAAT ATGCCGAGTA CAAACATTAT TGTTGGTCCA ACGTATTGAA TAAATCCTAT 4020

TAGCGAAAGT GGAATACGTT TTGCCCCGGC TGAGAATAGG ATTAGTGGTA TTGCCGTAAT 4080

AGCACCAGAA AATAACAACC AAAATGATGA CATGTTCAAT CCAAATGACA TCTGATGTTG 4140

CTGCCATAAA TAAATAACGT ATATTAGTCC AGCAGGTGCG GTAACAATAC ATTCAATCGT 4200

AATACTGCTG ATGGCATCAA TATGTACTAC TTTTTTCAAT AATCCGTATG TACCAAAGGA 4260

TAACGCTAAT ATAATAGAGA CGATTGGGAA TTCTCCAATC TTGAGCGTCA TATATAATAC 4320

ACCGATGAAT GCGAATAAAA TGGCTAGCCA TTCAAATTTA TTGAATCTTT CTTTTAAAAA 4380

GATAAGTGCG AGCAAAATGC TAACAAGTGG ATTTATATAA TAACCTAAAC TTGTTTGTAG 4440

GACGTGACC6 TTCGTTACAG CCCAAATAAA TGTACCCCAA TTTAATGTAA TGACATAGCC 4500

TGCTACGACA ATCGCTAATA GCTGAATGGG CTTGCCTAAC AATTGATTCA TATCTCGTTG 4560

AAATGCATTG CGTTGTTTTT GTCCAACCX3C GAGTATGAAA ATCATGAATA TTGCTGAAAA 4620

TATAATACGA AAGGCTAAAA TTTCAAATGC GCCTATTGCA TCAAC6AACT GCCAATATAT 4680

AGGTAGTATT CCCCACAGAA TGTATGCACT GAGTGCTAAA AATATGCCTT TTTTATACTC 4740

TGAATTCACC TTCAAACCTC CTTACTTTCC TAATTTTTAA TTTACTGCAT ACGCTCACTT 4800

GGTTATGCTA ATATAACGAT TTTACTAATA ATATTTCGAT AAAGATATCA TTTTGTTTAT 4860

ATTTCCCACA TTTATTCACC AACCACTAAA CAATATTAAT TTTATAAATA ATTCTGTACA 4920

AATCAGGGTA TATTGCCAGA AAGACTACCA TACAACATAA AGGATGGATA CAAATGACTT 4980

TACCTAAAAT TGGAAAGCCT GCAACACGCG CGCTAAATTC ACAAGGTATA TACACATTAG 5040

AAGGAGTATC ACAATATACG AAGTCATCTC TAATGGAGAT 6CATGGCGTT GGTCCTAAAG 5100

CTATATCAAT ATTGGAACAA GCTTTATTTC AG 5132




(2) INFORMATION FOR SEQ ID NO: 164: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22243 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164:

AAGTAAATTA TATTATGAAT TTGCCTGTCA ATTTCTTAAA GACATTCTTA CCGGAACTAA 60 



TAGAAGCAAT TAATAATGCy mAAGAAAAGA CAGCTAATAA TACCGGCTTA AAATTAATAT 180 
TTGCAATTAA TTATGGTGGC AGAGCAGAAC TTGTTCATAG TATTAAAAAT ATGTTTGACX5 240 
AGCTTCATCA ACAAGGTTTA AATAGTGATA TCATAGATGA AACATATATA AACAATCATT 300 
TAATGACAAA AGACTATCCT GATCCAGAGT TGTTAATTCG TACTTCAGGA GAACAAAGAA 360

TAAGTAATTT CTTGATTTGG CAAGTTTCGT ATAGTGAATT TATCTTTAAT CAAAAATTAT 420 
GGCCTGACTT TGACGAAGAT GAATTAATTA AATGTATAAA AATTTATCAG TCACGTCAAA 480

GACGCTTTGG CGGATTGAGT GAGGAGTAGT ATAGTATGAA AGTTAGAACG CTGACAGCTA 540 
TTATTGCCTT AATCGTATTC TTGCCTATCT TGTTAAAAGG CGGCCTTGTG TTAATGATAT 600 
TTGCTAATAT ATTAGCATTG ATTGCATTAA AAGAATTGTT GAATATGAAT AT6ATTAAAT 660 
TTGTTTCAGT TCCTGGTTTA ATTAGTGCAG TTGGTCTTAT CATCATTATG TTGCCACAAC 720 
ATGCAGGGCC ATGGGTACAA GTAATTCAAT TAAAAAGTTT AATTGCAATG AGCTTTATTG 780
TATTAAGTTA TACTGTCTTA TCTAAAAACA GATTTAGTTT TATGGATGCT GCATTTTGCT 840 
TAATGTCTGT GGCTTATGTA GGCATTGGTT TTATGTTCTT TTATGAAACG AGATCAGAAG 900 
GATTACATTA CATATTATAT GCCTTTTTAA TTGTTTGGCT TACAGATACA GGGGCTTACT 960

TGTTTGGTAA AATGATGGGT AAACATAAGC TTTGGCCAGT AATAAGTCCG AATAAAACAA 1020 

TCGAAGGATT CATAGGTGGC TTGTTCTGTA GTTTGATAGT ACCACTTGCA ATGTTATATT 1080 

TTGTAGATTT GAATATGAAT GTATGGATAT TACTTGGAGT GACATTGATT TTAAGTTTAT 1140 

TTGGTCAATT AGGTGATTTA GTGGAATCAG GATTTAAGCG TCATTTCGGC GTTAAAGACT 1200 

CAGGTCGAAT ACTACCTGGA CACGGTGGTA TTTTAGACCG ATTTGACAGC TTTATGTTTG 1260 

TGTTACCATT ATTAAATATT TTATTAATAC AATCTTAATG CTQAGAACAA ATCAATAAAC 1320 

GTA»GAGGA GTTGCTGAGA TAATTTAATQ AATCTCAGAA CTCCTTTTGA AAATTATACG 1380 

CAATATTAAC TTTGAAAATT ATACGCAATA TTAACTTTGA AAATTAGACG TTATATTTTG 1440 

TGATTTGTCA GTATCATATT ATAATGACTT ATGTTACGTA TACAGCAATC ATTTTTAAAA 1500 

TAAAAGAAAT TTATAAACAA TCGAGGTGTA GCGAGTGAGC TATTTAGTTA CAATAATTGC 1560 

ATTTATTATT GTTTTTGGTG TACTAGTAAC TGTTCATGAA TATGGCCATA TCTTTTTTGC 1620

GAAAAGAGCA GGCATTATGT GTCCAGAATT TGCGATCGGT ATGGGGCCAA AAATTTTTAG 1680 

TTTTAGAAAA AATGAAACAC TTTACACTAT TAGGTTATTG CCTGTTGGTG GATATGTTCG 1740 

TATGGCAGGA GATGGCTTAG AAGAGCCACC AGTCGAGCCC GGTATGAACG TTAAAATTAA 1800

ACTTAATGAA GAAAATGAAA TAACACATAT CATATTAGAT GATCATCATA AGTTTCAACA 1860 
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CACTGCTTAT GATAATGAAA GACATCATTT TAAAATTGCT AGAAAGTCTT TCTTTGTTGA 1980 

AAATGGTAGC TTAGTTCAAA TTGCTCCGAG AGACAGACAA TTTGCACATA AAAAGCCATG 2040 

GCCGAAATTT TTAACATTAT TTGCGGGACC GTTATTTAAC TTTATATTAG CTTTAGTCCT 2100

ATTTATTGGT CTTGCATATT ATCaAGGcAC GCcTACGTCT ACTGTAGAAC AAGTCGCAGA 2160 

TAAGTATCCA GCTCAACAAG CAGGATTACA AAAAGGTGAT AAGATCGTCC AAATTGGCAA 2220 

ATATAAAATA TCTGAATTTG ATGATGTTGA TAAGGCGTTA GATAAAGTTA AAGATAATAA 2280 

GACGACTGTT AAATTTGAAC GTGATGGTAA AACAAAGTCA GTTGAATTAA CACCTAAAAA 2340 

GACTGAAAAA AAACTGACTA AAGTAAGTTC AGAGACGAAG TATGTTCTCG GATTCCAACC 2400 

AQCGAGTGAA CATACACTTT TTAAACCAAT TGTATTCGGA TTTAAAAGCT TTTTAATCGG 2460 

TAGTACTTAT ATTTTTACAG CTGTAGTAGG TATGTTGGCT AGTATATTTA CGGGCGGATT 2520 

CTCATTTGAT ATGTTAAATG GTCCGGTTGG TATTTATCAT AACGTCGACT CAGTTGTTAA 2580

AGCGGGTATC ATTAGCTTAA TTGGTtnCAC TGCX5TTATTA AGTGTAAACT TAGGTATTAT 2640 

GAATTTAATT CCTATTCCTG CACTAGAOGG TGGTCGTATT TTATTTGTTA TATATGAAGC 2700 

GATTTTCAGX AAACCAGTTA ATAAAAAAGC GGAAACAACG ATTATTGCTA TTGGTGCCAT 2760

TTTCATGGTC GTTATAATGA TATTAGTAAC GTGGAATGAT ATTCGACGAT ATTTCTTATA 2820 

ATTTAGGAGG ATAAATAATT ATGAAGCAAT CCAAAGTTTT TATACCAACG ATGCGTGACG 2880 

TGCCATCAGA AGCAGAAGCA CAAAGTCATC GTTTATTATT GAAATCX3GGT TTGATAAAAC 2940 

AAAGTACAAG TGGGATTTAT AGTTATTTAC CGCTAGCAAC ACGTGTGTTA AATAATATTA 3000 

CTGCAATTGT GCGACAAGAA ATGGAACGTA TCGATTCTGT TGAAATTTTA ATOCCAGCGT 3060 

TACAACAAGC TGAATTATGG GAAGAATCAG GACGTTGGGO TGCATATGGC CCAGAATTAA 3120 

TGCGTTTACA AGATAGaCAT GGAAgACAAT TTgCATTAGG TCCaACACAT GAAGAATTAG 3180 

TTACATCAAT AGTAAGAAAT GAATTGAAAT CATACAAACA ATTACCGATG ACATTATTCC 3240 

aAATTCAATC TAAATTCCGT GATGAAAAGA GACCACGTTT TGGTTTAyTC GTGGGCGTGA 3300 

ATTTATTATG AAAGATGCAT ATTCATTCCA TGCTGACGAG GCATCATTAG ATCAAACGTA 3360

TCAAGATATG TATCAAGCGT ATAGCCGTAT TTTTGAGAGA GTTGGCATTA ACGCAAGACC 3420

AGTAGTTGCA GATTCAGGTG CTATAGGCGG TAGCCATaCA CATGAATTTA TGGCATTAAG 3480 

TGCTATCGGT GAGGATACAA TCGTTTACAG TAAAGAAAGT GATTATGCTG CTAACATCGA 3540 

AAAAGCAGAA GTCGTTTACG ArcCAaATcA TaAGCATACT ACTGTGCAAC CTTTAGAAAA 3600

AATTGAAACA CCAAATGTTA AGACTGCGCA AGAATTGGCA GACTTCTTAG GTAGACCAGT 3660 
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GCGTGGCCAT CATGAAATTA ATGACATTAA ATTAAAATCT TATTTCGGCA CAGATAATAT 3780 

TGAATTAGCA ACACAAGACG AAATTGTTAA TTTAGTTGGT GCAAATCCTG GTTCACTAGG 3840 

TCCTGTAATT GATAAAGAAA TCAAAATTTA TGCAGATAAT TTTGTGCAAG ATTTAAATAA 3900

TTTAGTTGTC GGTGCTAACG AAGATGGTTA TCACTTAATT AATGTAAATG TAGGTAGAGA 3960 

CTTCAACGTT GATGAATATG GCXSATTTCCG TTTTATTTTA GAAGGCGAAA AGTTAAGTGA 4020 

TGGTTCAGGC GTTGCACATT TTGCTGAAGG TATTGAAGTT GGTCAAGTAT TCAAATTGGG 4080 

TACTAAGTAT TCAGAATCAA TGAATGCTAC ATTCTTAGAT AACCAAGGAA AAGCTCAATC 4140 

TTTAATTATG GGTTGTTACG GAATTGGAAT TTCTAGAACG CTAAGTGCGA TTGTTGAACA 4200 

AAATCACGAT GATAATGGAA TTGTTTGGCC TAAATCAGTT ACTCCGTTTG ATTTACATTT 4260 

AATTTCTATT AATCCTAAGA AAGATGATCA ACGAQAACTA GCAGATGCAC TATATGCTGA 4320 

ATTTAATACT AAATTTGATG TGTTGTACXyv TGATCGTCAG GAAC6TGCAG GTGTTAAATT 4380 

TAATGATGCC GATTTAATTG GTTTACCACT GCGAATTGTT GTTGGTAAAC GTGCATCGGA 4440 

AGGTATTGTA GAAGTTAAAG AACX5TTTAAC AGGTGATAGC GAAGAAGTTC ACATTGATGA 4500 

CTTAATGACT GTCATTACAA ATAAATATGA TAACTTAAAA TAATTAAGAT CGAATGAATT 4560

ATAAGAGTA6 GAAAAAGCTG AAAGAAATCT GATGCTTATG TCCTGCTCTT ATTATTTTTG 4620 

ATATAATGAT TATTCGATGA AAAATGACTG AAQACATAGT ATAATTAAAG ATAAATTTGT 4680 

TTTAACAATA TAATGATTAG CCAAATATAA AGCATTTAAT TTTCTATCAT TACTATGCTC 4740

ACATAATCTA AATATTGTTC GAACACGTAA AAGTAATTTC TATTTAAGGT GGTAATTGTC 4800 

TTGGCAATGA CAGAGCAACA AAAATTTAAA GTGCTTGCTG ATCAAATTAA AATTTCAAAT 4860 

CAATTAGATG CTGAAATTTT AAATTCAGGT QAACTGACAC GTATAGATGT TTCTAACAAA 4920

AACAGAACAT GGGAATTTCA TATTACATTA CCACAATTCT TAGCTCATGA AGATTATTTA 4980 

TTATTTATAA ATGCAATAGA GCAAGAGTTT AAAGATATCG CCAACGTTAC ATGTOGTTTT 5040 

ACGGTAACAA ATGGCACGAA TCAAGATGAA CATGCAATTA AATACTTTGG GCACTGTATT 5100 

GACCAAACAG CTTTATCTCC AAAAGTTAAA GGTCAATTGA AACAGAAAAA GCTTATTATG 5160 

TCTGGAAAAG TATTAAAAGT AATGGTATCA AATGACATTG AACGTAATCA TTTTGATAAG 5220 

GCATGTAATG 6AAGTCTTAT CAAAGCGTTT AGAAATTGTG GTTTTGATAT CGATAAAATC 5280 

ATATTCGAAA CAAATGATAA TGATCAAGAA CAAAACTTAG CTTCTTTAGA AgCACaTATT 5340 

CAAGAAGAAG ACGAACAAAG TGCACGATTG GCAACAGAGA AACTTGAAAA AATXSAAAGCT 5400 

GAAAAAGCGA AACAACAAGA TAACAACGAA AGTGCTGTCG ATAAGTGTCA AATTGGTAAG 5460 
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GCAATAGAGG GTGTCATTTT TGATATAAAC TTAAAAGAAC TTAAAAGTGG TCGCCATATC 5580 

GTAGAAATTA AAGTGACTGA CTATACGGAC TCTTTAGTTT TAAAAATGTT TACTCGTAAA 5640 

AACAAAGATG ATTTAGAACA TTTTAAAGCG CTAAGTGTTG GTAAATGGGT TAGGGCTCAA 5700 

GGTCGTATTG AAGAAGATAC ATTTATTAGA GATTTAGTTA TGATGATGTC TGATATTGAA 5760 

GAGATTAAAA AAGCGACAAA AAAAGATAAG GCTGAAGAAA AGCGTGTAGA ATTCCACTTG 5820 

CATACTGCAA TGAGCCAAAT GGATGGTATA CCCAATATTG GTGCGTATGT TAAACAGGCA 5880 

GCAGACTGGG GACATCCAGC CATTGCGGTT ACAGACCATA ATGTTGTGCA AGCATTTCCA 5940 

GATGCTCACG CAGCAGCX^yi AAAACATGGC ATTAAAATGA TATACGGTAT GGAAGGTATG 6000 

TTAGTTGATG ATGGTGTTCC GATTGCATAC AAACCACAAG ATGTCGTATT AAAAGATGCT 6060 

ACTTATGTTG TGTTCGACGT TGAGACAACT GGTTTATCAA ATCAGTATGA TAAAATCATC 6120 

GAGCTTGCAG CTGTGAAAGT TCATAACGGT GAAATCATCG ATAAGTTTGA AAGGTTTAGT 6180 

AATCCGCATG AACGATTATC GGAAACGATT ATCAATTTGA CGCATATTAC TGATGATATG 6240 

TTAGTAGATG CCCCTGAGAT TGAAGAAGTA CTTACAGAGT TTAAAGAATG GGTTGGCGAT 6300 

GCGATATTCG TAGCGCATAA TGCTTCGTTT GATATGGGCT TCATCGATAC GGGATATGAA 6360

C6TCTTGGGT TTGGACCATC AACGAATGGT GTTATCGATA CTTTAGAATT ATCTCGTACG 6420 

ATTAATACTG AATATGGTAA ACATGGTTT6 AATTTCTTGG CTAAAAAATA TGGCGTAGAA 6480 

TTAACGCAAC ATCACCGTGC CATTTATGAT ACAGAAGCAA CAGCTTACAT TTTCATAAAA 6540

ATGGTTCAAC AAATGAAAGA ATTAGGCGTA TTAAATCATA ACGAAATCAA CAAAAAACTC 6600 

AGTAATGAAG ATGCATATAA ACGTGCAAGA CCTAGTCATG TCACATTAAT TGTACAAAAC 6660 

CAACAAGGTC TTAAAAATCT ATTTAAAATT GTAAGTGCAT CATTGGTGAA GTATTTCTAC 6720 

CGTA€ACCTC GAATTCCACG TTCATTGTTA GATGAATATC GTGAGGGATT ATTGGTAGGT 6780 

ACAGCGTGTG ATGAAGGTGA ATTATTTACX; GCAGTTATGC AGAAGGACCA GAGTCAAGTT 6840 

GAAAAAATTG CCAAATATTA TGATTTTATT GAAATTCAAC CACCGGCACT TTATCAAGAT 6900 

TTAATTGATA GAGAGCTTAT TAGAGATACT GAAACATTAC ATGAAATTTA TCAACGTTTA 6960 

ATACATGCAG GTGACACAGC GGGTATACCT GTTATTGCGA CAGGAAATGC ACACTATTTG 7020 

TTTGAACATG ATGGTATCGC ACGTAAAATT TTAATAGCAT CACAACCCX^ CAATCCACTT 7080 

AATCGCTCAA CTTTACCGGA AGCACATTTT AGAACTACAG ATGAAATGTT AAACGAGTTT 7140

CATTTTTTAG GTGAAGAAAA AGCGCATGAA ATTGTTGTGA AAAATACAAA CGAATTAGCA 7200

GATCGAATTG AACGTGTTGT TCCTATTAAA GATGAATTAT ACACACOGCG TATGGAAGGT 7260 
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CTGCCTCAAA TCGTAATTGA TCGATTAGAA AAAGAATTAA AAAGTATTAT CGGTAATGGA 73 BO 

TTTGCGGTAA TTTACTTAAT TTCGCAACGT TTAGTTAAAA AATCATTAGA TGATGGATAC 7440 

TTAGTTGGTT CCCGTGGTTC AGTAGGTTCT AGTTTTGTAG CXSACAATGAC TGAGATTACT 7500 

GAAGTAAACC CGTTACCGCC ACACTATATT TGTCCGAACT GTAAAACGAG TGAATTTTTC 7560 

AATGATGGTT CAGTAGGATC AGGATTTGAT TTACCTGATA AGACGTGTGA AACTTGTGGA 7620 

GCGCCACTTA TTAAAGAAGG ACAAGATATT CCGTTTGAAA CATTTTTAGG ATTTAAGGGA 7680 

GATAAAGTTC CTGATATCGA CTTAAACTTT AGTGGTGAAT ATCAACCGAA TGCCCATAAC 7740

TACACAAAAG TATTATTTGG TGAGGATAAA GTATTCC6TG CAGGTACAAT TGGTACTGTT 7800 

GCTGAAAAGA CTGCTTTTGG TTATGTTAAA GGTTATTTGA ATGATCAAGG TATCCACAAA 7860 

AGAGGTGCTG AAATAGATCG ACTCGTTAAA GGATGTACAG GTGTTAAACG TACAACTGGA 7920 

CAGCATCCAG GGGGTATTAT TGTAGTACCT GATTACATGG ATATTTATGA TTTTACGCCX5 7980

ATACAATATC CTGCCGATGA TCAAAATTCA GCATGGATGA CGACACATTT TGATTTCCAT 8040

TCTATTCATG ATAATGTATT AAAACTTGAT ATACTTGGAC ACGATGATCC AACAATGATT 8100 

CGTATGCTTC AAGATTTATC AGGAATTGAT CCAAAAACAA TACCTGTAGA TGATAAAGAA 8160

GTTATGCAGA TATTTAGTAC ACCTGAAAGT TTGGGTGTTA CTGAAGATGA AATTTTATGT 8220 

AAAACAGGTA CATTTGGGGT ACCAGAATTC GGTACAGGAT TCGTGCGTCA AATGTTAGAA 8280 

GATACAAAGC CAACAACATT TTCTGAATTA GTTCAAATCT CAGGATTATC TCATGGTACA 8340 

GATGTGTGGT TAGGCAATGC TCAAGAATTA ATTAAAACCG GTATATGTGA TTTATCAAGT 8400 

GTAATTGGTT GTCGTGATGA TATCATGGTT TATTTAAT6T ATGCTGGTTT AGAACCATCA 8460 

ATGGCTTTTA AAATAATGGA GTCAGTACGT AAAGGTAAAG GTTTAACTGA AGAAATGATT 8520

GAAAeGATGA AAGAAAATGA AGTGCCAGAT TGGTATTTAG ATTCATGTCT TAAAATTAAG 8580 

TACATGTTCC CTAAAGCCCA TGCAGCAGCA TACGTTTTAA TGGCAGTACG TATCGCATAT 8640

TTCAAAGTAC ATCATCCACT TTATTACTAT GCATCTTACT TTACAATTCG TGCGTCAGAC 8700 

nTGATTTAA TCACGATGAT TAAAGATAAA ACAAGCATTC GAAATACTGT AAAAGACATG 8760 

TATTCTCGCT ATATGGATCT AGGTAAAAAA GAAAAAGACG TATTAACAGT CTTGGAAATT 8820 

ATGAATGAAA TGGCGCATCG AGGTTATCGA ATGCAACCGA TTAGTrTAGA AAAGAGTCAG 8880 

GCGTTCGAAT TTATCATTGA AGGCGATACA CTTATTCCGC CGTTCATATC AGTGCCTGGG 8940 

CTTGGCGAAA ACGTTGCGAA ACGAATTGTT GAAGCTCGTG ACGATGGCCC ATTTTTATCA 9000

AAAGAAGATT TAAACAAAAA AGCTGGATTA TCTCAGAAAA TTAITGAGTA TTTAGATGAG 9060 
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GAAATAATCA AGGTATTTAT TTAATGCGTA 
TGGGGTAGTC AAAGAAATAC AAAATTGTTG 9180

CTGGACACAA AATTATGCCC GTATTTCTrT TCAATGTCTT ACGAGTCTAT TCAAATGTAA 9240

TGGTGAAATA AAGGAACAAA CTTTTACAAG AATCTCTGAT TAATAGTGAA GTCATTTGTT 9300

TCAAGCATAA ACTTATGCTA TAATTAAGTT GCTTAAAAAT TAGTGAACTC AGGCAGAAGA 9360

GTGGGAGATT CCCGCTCTTT TCTATTTGCC AAAAAGGGAG GCCTGTATGA GTAAAATTAC 9420

AGAACAAGTA GAAGTGATTG TTAAACCAAT TATGGAAGAC TTGAATTTTG AACTTGTAGA 9480

CGTTGAATAT GTCAAAGAGG GTAGAGATCA TTTTCTTAGA ATCTCTATTG ATAAAGAAGG 9540

TGGCGTAGAT TTAAATGATT GTACGCTAGC TTCTGAAAAA ATAAGTGAAG CTATGGATGC 9600

AAATGATCCT ATTCCTGAAA TGTATTATTT AGACGTAGCG TCACCTGGTG CAGAACGTCC 9660

AATTAAAAAA GAACAAGATT TCCAAAATGC AATAACTAAA CCTGTATTTG TTTCTTTATA 9720

TGTACCAATT GAAGGTGAAA AGGAATGGTT AGGCATTTTA CAAGAAGTCA ATAATGAAAC 9780

AATTGTAGTA CAAGTTAAAA TCAAAGCAAG AACGAAAGAT ATAGAGATAC CGAGAGACAA 9840

AATAGCAAAA GCACGTCACG CAGTTATGAT TTAACGTGAT GAGGAGGAAA AAACGTGTCA 9900

AGTAATGAAT TATTATTAGC TACTGAGTAT TTAGAAAAAG AAAAGAAGAT TCCTAGAGCA 9960

GTATTAATTG ATGCTATTGA AGCAGCTTTA ATTACTGCAT ACAAAAAGAA TTATGATAGT 10020

GCAAGAAATG TCCGTGTGGA ATTAAATATG GATCAAGGTA CTTTCAAAGT TATCGCTCGT 10080

AAAGATGTTG TTGAAGAAGT ATTTGACGAC AGAGATGAAG TGGATTTAAG TACAGCGCTT 10140

GTTAAAAACC CTGCATATGA AATTGGTGAT ATATACGAAG AAGATGTAAC ACCTAAAGAT 10200

TTTGGTCGTG TAGGTGCTCA AGCAGCGAAA CAAGCAGTAA TGCAACGTCT TCGTGATGCT 10260

GAAGGTGAAA TTTTATTTGA AGAATTTATA GACAAAGAAG AAGACATACT TACTGGAATT 10320

ATTGACC3GTG TTGACCATCG TTATGTATAT GTGAATTTAG GTCGTATCGA AGCTGTTTTA 10380

TCTGAAGCAG AAAGAAGTCC TAACGAAAAA TATATTCCTA ACGAACGTAT CAAAGTATAT 10440

GTTAACAAAG TGGAACAAAC GACAAAAGGT CCTCAAATCT ATGTTTCTCG TAGCCATCCA 10500

GGTTTATTAA AACGTTTATT TGAACAAGAA GTTCCAGAAA TTTACGATGG TACTGTAATT 10560

GTTAAATCAG TAGCACGTGA AGCTGGCGAT CGCTCTAAAA TTAGTGTCTT CTCTGAAAAC 10620

AATGATATAG ATGCTGTTGG TGCATGTGTT GGTGCTAAAG GCGCACGTGT TGAAGCTGTT 10680

GTTGAAGAGC TAGGTGGTGA AAAAATCGAC ATCGTTCAAT GGAATGAAGA TCCAAAAGTA 10740

TTTGTAAAAA ATGCTTTAAG CCCTTCTCAA GTTTTAGAAG TTATTGTTGA TGAAACAAAT 10800

CAATCTACA5 TAGTTGTTGT TCCTGATTAT CAATTGTCAT TAGCGATTGG TAAAAGAGGA 10860
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10 




GATGCGCGTG AA6CGGGTAT CTATCCAGTA GTTGAAGCTG AAAAAGTAAC TGAAGAAGAT 10980

GTTGCTTTAG AAGATGCTGA CACAACAGAA TCAACCGAAG AGGTAAATGA TGTTTC7W3TT 11040 

GAAACAAATG TAGAGAAAGA ATCTGAATAA TAGGTTGGAG TGAAGTATCT ATGAAAAAGA 11100 

AAAAAATTCC GATGCGAAAA T6TATTCTTT CAAATGAAAT GCATCCCAAA AAAGATATGA 11160 

TTCGTGTTGT TGTTAATAAA GAAGGCGAAA TCTTTGCXX5A TGTTACTGGA AAGAAACAAG 11220 

GCCGTGGCGC ATATGTTTCT AAAGATGTTG CTATGGTTGA AAAAGCACAA CAAAAAGAAA 11280 

TTTTAGAAAA ATATTTTAAA GCATCTAAAG AGCAATTGGA TCCTGTTTAC AAAGAAATTA 11340 

TTAGATTAAT TTATAGAGAA GAGATCCCAA AATGAGTATA GATCAAATAT TAAACTTTTT 11400 

AGGATTAGCA ATGAGAGCTG GTAAAGTAAA AACAGGTGAA TCAGTCATTG TTAATGAGAT 11460 

TAAAAAAGGA AATTTQAAGC TCGTTATTGT TGCAAATGAT GCGTCTGATA ATACAGCTAA 11520 

ATTAATTACA GATAAATGTA AGAGTTACAA AGTTCCATTC AGAAAGTTTG GAAATCGAAA 11580 

TGAATTGGGA ATAGCACTTG GAAAAGGTGA GCGTGTTAAT GTAGGGATTA CTGACCCAGG 11640 

CTTTGCTAAA AAGTTGCTAT CAATGATAGA TGAATATCAT AAGGAGTGAT TATATGAGTA 11700 

AACAAAGAAT TTACGAATAT GCGAAAGAAT TAAATCTAAA GAGTAAAGAG ATTATAGATG 11760 

AGTTAAAAAO CATGAATATT GAGGTTTCAA ATCATATGCA AGCTTTGGAA GATGACCAAA 11820

TTAAAGCATT AGATAAAAAG ITCAAAAAAG AACAAAAGAA CGACAATAAA CAAAGCACTC 11880 

AAAATAATCA CCAAAAATCA AACAATCAAA ACCAAAATAA AGGGCrvACAA AAAGATAACA 11940 

AAAAGAATCm ACAACAAAAT AATAAAGGCA ACAAAGGCAA TAAAAAGAAT AATAGAAATa 12000 

ATAAGAAAAA TAACAAGAAT AATAAACCAC AAAATCAACC AGCTGCTCCA AAAGAAATAC 12060 

CATCAAAAGT GACATATCAA GAAGGTATTA CAGTAGGCGA ATTTGCGGAT AAATTAAATG 12120 

TTGSATCATC AGAAATTATC AAAAAATTAT TCTTACTTGG TATTGTTGCT AATATCAATC 12180 

AATCATTAAA TCAAGAAACA ATCX3AATTAA TTGCCGATGA TTATGGCGTT GAGGTTGAAG 12240 

AAGAAGTTGT GATTAATGAA GAAGACTTAT CAATCTATTT CGAAGACX3AA AAAGATGATC 12300 

CAGAGGCAAT TGAGAGACCA GCAGTTGTAA CAATTATGGG ACATGTTGAC CATGGTAAAA 12360 

CGACTTTATT AGATTCAATT CGTCATACAA AAGTTACAGC AGGTGAAGCA GGCGGAATCA 12420 

CTCAACATAT TGGTGCATAT CAAATTGAAA ACGATGGCAA AAAAATCACT TTCTTAGATA 12480 

CACCGGGACA TGCTGCATTT ACAACGATGC GTGCGCGTGG TGCaCAAGTA ACAGATATTA 12540 

CTATTTTAGT AGTAGCAGCT GACGATGGTG TTATGCCACA AACAATTGAA GCAATTAACC 12600 

ATGCTAAAGA AGCAgAAGTA CCAATTATTG TTGCAGTAAA TAAAAITGAT AAACCAACTT 12660 
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GCGGCC5AAAC AATTtTCGTc CACTTTCTGC 
ATTAAGTGGT GATGGTATCG ACGATTTATT 12780

AGAAATGATA GGATTAGTTG CAGAAGTTCA AGAACTTAAA GCAAATCCTA AAAACCGTGC 12840

TGTTGGTACA GTTATCGAAG CTGAATTAGA TAAATCACGT GGTCCTTCTG CATCATTATT 12900

AGTACAAAAC GGTACATTAA ATGTTGGTGA TGCGATTGTA GTTGGTAATA CTTACGGCCG 12960

TATCCGTGCA ATGGTTAATG ACTTAGGTCA AAGAATCAAA ACGGCTGGTC CATCAACGCC 13020

TGTTGAAATT ACAGGTATTA ATGATGTGCC ACAAGCTGGG GATCGCTTTG TTGTATTTAG 13080

TGATGAAAAA CAAGCTCGTC GTATTGGTGA ATCAAGACAC GAAGCTAGCA TTATACAACA 13140

ACGTCAAGAA AGTAAAAATG TTTCATTAGA TAACCTGTTT GAACAAATGA AACAAGGTGA 13200

AATGAAAGAT TTAAACGTTA TTATTAAAGG TGATGTTCAA GGTTCTGTTG AAGCTTTAGC 13260

TGCATCATTA ATGAAAATTG ATGTTGAAGG CX3TAAATGTT CGTATCATTC ATACAGCGGT 13320

TGGTGCAATT AATGAGTCAG ACGTGACACT TGCTAATGCC TCAAATGGTA TTATCATTGG 13380

TTTCAATGTT CGTCCAGACA GTGGTGCAAA ACGTGCTGCA GAAGCTGAAA ATGTTGATAT 13440

GCGTTTACAC AGAGTTATTT ATAATGTTAT C6AAGAAATT GAATCAGCGA T6AAAGGTTT 13500

ACTTGATCCA GAATTTGAAG AACAAGTTAT CGGACAAGCT GAAGTTCGTC AAACATTCAA 13560

AGTTTCTAAA 6TTGGTACTA TTGCTGGATG TTATGTTACT GAAGGTAAAA TTACGCGAAA 13620

TGCTGGTGTA CGTATTATTC GTGATGGTAT TGTTCAATAT GAAGGCGAAT TAGATACACT 13680

TAAACGTTTC AAAGATGATG CTAAGGAAGT TGCAAAAGGT TATGAATGTG GTATTACAAy 13740

TGAAAACTAC AATGACCTTA AAGAAGGCGA TGTTATCGAA GCATTTGAAA TGGTTGAAAT 13800

TAAGCGTTAA TTAAATAAAT TACAAGCTAA AAGTATAGTT AAGATTGATA TGCTCCCTAT 13860

AAATATTGCA CTTTTTAAGT GTCTACTTTA TAGGGAGCAT ATTTGATACT AGCTTTTGGT 13920

TTTTTATTAG AATAGATTAC CTATTAAAAG TTACXSTTATA TGGACATGAT TTTGTATAAA 13980

ATTTTGTGGT GGCCTAGAAT GATTTTTAAT GACAAAATAT AATGTCGACT ATTATTGGAA 14040

AATTTTCTGT TGaAATGCCT ATCTTACGGC AAACTTTATT TGATTTTATA GGCTTAATTT 14100

ATTAAAATAA CGTGTGAGCT AAAATAATTG TTTAAGCATT GTTACACTAA AAAATGCAAA 14160

TAACAATTGA ACTTAAAGAT AAAGAGGTGA CAAGAATGAG CAGTATGAGA GCAGAGCGTG 14220

TTGGTGAACA AATGAAGAAG GAATTAATGG ATATCATCAA CAATAAAGTC AAAGATCCTC 14280

GAGTTGGTTT TATTACAATT ACAGATGTTG TTTTAACAAA TGATTTATCG CAGGCTAAAG 14340

TATTTTTAAC TGTATTAGGT AACGATAAAG AAGTAGAAAA TACATTTAAA GCACTTGATA 14400

AAGCAAAAGG CTTCATTAAG TCTGAATTAG GTTCTAGAAT GCGATTACGT ATTATGCCGG 14460
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AAGATTTACA CAAACAAGAT AGATAATTTA 
GTGTTAGGTA TCTGGAAAAT GTTTGATAAT 14580

TTCTTAATAT CGGTATATTA ACATTAAACA GTTAATACAT AGATGTGTAG AAATAGTTAA 14640

CATTTTCCAG TTTTTTTATG AATAAATTTA GTTGATACGC TATTAAAATA TATTTTAAAA 14700

AAGAAGGTGA CTATATGTAT AATGGGATAT TACCAGTATA TAAAGAGCGC GGTTTAACAA 14760

GTCATGACGT TGTATTCAAA TTGCGTAAAA TATTAAAAAC TAAAAAAATA GGTCACACGG 14820

GTACGCTTGA TCCCGAAGTT GCAGGCGTGT TACCGGTATG TATAGGTAAT GCAACGAGAG 14880

TTAGTGATTA TGTTATGGAT ATGGGCAAAG CTTATGAAGC AACTGTATCG ATAGGAAGAA 14940

GTACAACGAC TGAAGATCAA ACGGGTGATA CATTGGAAAC AAAAGGTGTA CACTCAGCAG 15000

ATTTTAATAA GGACGATATT GACCGATTGT TAGAAAGTTT TAAAGGTATC ATTGAACAAA 15060

TTCCGCCGAT GTACTCATCC GTCAAAGTAA ATGGTAAAAA ATTATATGAA TATGCGCGTA 15120

ATAATGAAAC AGTTGAAAGA CCAAAGCGTA AAGTtAATAT TAAAGACATT GGGCGTATAT 15180

CTGAATTA6A TTTTAAAGAA AATQAOTGTC ATTTTAAAAT ACX^CGTCATC TGTGGTAAAG 15240

GTACATATAT TAGAACGCTA GCAACTGATA TTGGTGTGAA ATTAGGCTTT CCGGCACATA 15300

TGTCGAAATT AACACGAATC GAGTCTGGTG GATTTGTGTT GAAAGATAGC CTTACATTAG 15360

AACAAATAAA AGAACTTCAT GAGCAGGATT CATTOCAAAA TAAATTGTTT CCTTTAGAAT 15420

ATGGATTAAA GGGTTTGCCA AGCATTAAAA TTAAAGATTC GCACATAAAA AAACGTATTT 15480

TAAATGGGCA GAAATTTAAT AAAAATGAAT TTGATAACAA AATTAAAGAC CAAATTGTAT 15540

TTATTGATGA TGATTCAGAA AAAGTATTAG CAATTTATAT GGTACACCCT ACAAAAGAAT 15600

CAGAAATTAA ACCTAAAAAA GTCTTTAATT AAAGGAGATA GAATTTATGA AAGTCATAGA 15660

AGtGACACAT CCTATACAAT CTAAACA6TA TATTACAGAG GATGTTGCAA TGGCATTCGG 15720

ATTttTCGAT GGCATGCATA AAGGTCATGA CAAAGTCrrr GATATATTAA ACGAAATAGC 15780

TGAGGCACGC AGTTTAAAAA AAGCGGTGAT GACATTTGAT CCGCATCOGT CTGTCGTGTT 15840

GAATCCTAAA AGAAAACGAA CAACGTATTT AACGCCACTT TCAGATAAAA TCGAAAAAAT 15900

TAGCCAACAT GATATTGATT ATTGTATAGT GGTTAATTTT TCATCTAGGT TTGCTAATGT 15960

GAGCGTAGAA GATTTTGTTG AAAATTATAT AATTAAAAAT AATGTAAAAG AAGTCATTGC 16020

TGGTTTTGAT TTTACTTTTG GTAAATTTGG AAAAGGTAAT ATGACTGTAC TTCAAGAATA 16080

TGATGCGTTT AATACGACAA TTGTGAGTAA ACAAGAAATT GAAAATGAAA AAATTTCTAC 16140

AACTTCTATT CGTCAAGATT TAATCAATGG TGAGTTGCAA AAAGCGAATG ATGCTTTAGG 16200

CTATATATAT TCTATTAAAG kCACTGTAGT GCAAGGTGAA AAAAGGGGAA GAACTATTGG 16260
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TGCTGTTAGT ATTGAAATCG GCACTGAAAA TAAATTATAT CGAGGGGTAG CTAACATAGQ 16380 

TGTAAAGCCA ACATTTCATG ATCCTAACAA AGCAGAAGTr GTCATCGAAG TGAATATCTT 16440

TGACTTTGAG GATAATATTT ATGGTGAACG AGTGACCGTG AATTGGCATC ATTTCTTACG 16500 

TCCTGASATT AAATTTGATG GTATCGACCC ATTAGTTAAA CAAATGAACG ATGATAAATC 16560 

GCGTGCTAAA TATTTATTAG CAGTTGATTT TGGTGATGAA GTAGCTTATA ATATCTAGAG 16620 

TTGCGTATAG tTATATAAAC AATCTATACC ACACCTTTTT CTTAGTAGGT CGAATCTCCA 16680 

ACGCCTAACT CGGATTAAGG AGTATTCAAA CATTTTAAGG AGGAAATTGA TTATGGCAAT 16740 

TTCACAAGAA CGTAAAAACO AAATCATTAA AGAATACCGT GTACACGAAA CTGATACTGG 16800 

TTCACCAGAA GTACAAATCG CTGTACTTAC TGCAGAAATC AACGCaGTAA AC6AACACTT 16860 

AOGTACACAC AAAAAAGACC ACCATTCACG TCGTGGATTA TTAAAAATGG TAGGTCGTCG 16920 

TAGcATTTaT TAAACTACTT ACGTaGTAAA GATATTCAAC GTTACCGTGA ATTAATTAAA 16980

TCACTTGGTA TCCGTCGTTA ATCTTAATAT AACGTCTTTG AGGTTGGGGC ATATTTATGT 17040 

TCCAACCTTA ATTTATATTA AAAAAGCTTT TTACAAATAT TAACATTTAT TATATGTTAA 17100 

GCTAATATTG AGTGAATAAT AAGGTTACAA TGAGATAAAG ATGATATAAG TACACCTAGA 17160

GTAATAATCA AGATATTAAA AATAAAGTAT GTTTTTTTAA AAAATATAAC TTATATTTAT 17220 

ACTGATAAGG GTGGGACGAT AAGTCTATTT TGTAAATAAT AGATGGATAT CCCGCTCTCT 17280 

TTTTTTCCAA TTCAATATTT TATAACTAAT ATTAAAATAC GATAATAAAT GATATGATAT 17340

AACTATTAGA TTCAAGAGAG GAGATTTATA ATGTCTCAAG AAAAGAAAGT TTTTAAAACT 17400 

GAATGGGCAG GAAGATCTTT AACGATTGAA ACAGGGCAAT TAGCTAAACA AGCAAATGGC 17460 

GCT6TATTGG TTCGTTATGG AGATACAGTC GTGTTATCGA CGGCAACTGC ATCAAAAGAA 17520 

CCTOTTGATG GAGATTTCTT CCCATTAACA GTGAACTATG AAGAAAAAAT GTACGCTGCG 17580 

GGTAAAATTC CTGGTGGATT TAAAAAGAGA GAAGGACGTC CTGGTGACGA TGCAACATTA 17640 

ACTGCGCGAT TAATTGATAG ACCAATTAGA CCTTTATTCC CTAAAGGATA TAAGCATGAT 17700 

GTTCAAATTA TGAACATGGT ATTAAGTGCA GATCCTGATT GTTCACCACA AATGGCTGCA 17760 

ATGATTGGTT CATCTATGGC GCTTAGTGTG TCGGATATTC CATTCCAAGG GCCAATCGCC 17820

GGTGTAAATG TGGGTTATAT TGACGGTAAA TATATCATTA ACCCAACAGT AGAAGAAAAA 17880 

GAAGTTTCTC GTTTAGACCT TGAAGTAGCT GGTCATAAAG ATGCX3GTAAA CATGGTAGAG 17940 

GCAGGCGCTA GTGAGATTAC TGAACAAGAA ATGTTAGAGG CGATTTTCTT TGGTGATGAA 18000

GAGATTCAAC GTTTAGTTGA TTTCCAACAA CAAATCX3TCG ACCACATTCA ACCTGTTAAA 18060 
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GAAGAAAAAG GACTTAAAGA AACA6TTTTA 

ACATTTGATA AACAACAACG AGATGaAAAT 18180

CTTGATAACT TAAAAGAAGA AATCGTCAAT GAATTTATCG ATGAAGAAGA TCCAGAGAAT 18240

GAaTTACTTA TTAAAGAAGT TTATGCAATT TTAAATGAAT TAGTGAAAGA AGAAGTTCGA 18300

CGTTTAATTG CAGATGAAAA AATTAGACCA GACGGCCGTA AACCTGATGA AATCCGTCCA 18360

TTAGATTCTG AAGTTGGTAT TTTACCTAGA ACXICATGGTT CAGGTCTATT TACACGTGGT 18420

CAGACTCAAG CACTTTCAGT TTTAACATTA GGTGCTTTAG GCGATTATCA ATTAATTGAT 18480

GGTTTAGGAC CTGAAGAAGA AAAAAGATTC ATGCATCATT ACAACTTCCC GAATTTTTCA 18540

GTAGGTGAAA CTGGTCCAGT ACGTGCGCCA GGTCGTCGTG AAATTGGACA TGGTGCX3TTA 18600

GGTGAAAGAG CATTAAAATA TATTATTCCT GATACTGCTG ATTTCCCATA TACAATTCGT 18660

ATTGTAAGTG AGGTACTTGA ATCAAATGGT TCATCATCTC AAGCGTCAAT TTGTGGATCA 18720

ACATTAGCAT TAATGGATGC GGGCGTACC!G ATTAAAGCAC CAGTTGCTGG TATTGCTATG 18780

GGCCTTGTTA CACGTGAAGA TAGCTATACG ATTTTAACTG ATATCCAAGG TATGGAAGAT 18840

GCATTAGGTG ATATGGACTT TAAAGTCGCT GGTACTAAAG AAGGTATTAC AGCAATCCAA 18900

ATGGATATTA AAATTGACGG TTTAACGCGT GAAATTATCG AAGAGGCTCT AGAACAAGCG 18960

AGACGTGGTC GTTTAGAAAT AATGAATCAT ATGTTACAAA CAATTGATCA ACCACGTACT 19020

GAATTAAGTG CTTACGCGCC AAAAGTTGTA ACTATGACAA TTAAACCAGA TAAGATTAGA 19080

GATGTTATCG GACCTGGTGG TAAAAAAATT AACGAAATTA TTGATGAAAC AGGTGTTAAA 19140

TTAGATATTG AACAAGATGG TACTATCTTT ATTGGTGCTG TTGATCAAGC TATGATAAAT 19200

CGTGCTCGTG AAATCATTGA GGAAATTACA CGTGAAGCGG AAGTAGGTCA AACTTATCAA 19260

GCCACTGTTA AACGTATTGA AAAATACGGT GCX3TTTGTAG GCCTATTCCC AGGTAAAGAT 19320

GCGTTGCTTC ACATTTCACA AATTTCAAAA AATAGAATTG AAAAAGTGGA AGATGTATTA 19380

AAAATCGGTG ACACAATTGA AGTTAAGATT ACTGAAATTG ATAAACAAGG TCGAGTAAAT 19440

GCTTCACATA GAGCATTAGA AGAATAATAT TTAAAGTCAT ATGACGACAA TGTATCX3TCA 19500

TGTGATTTTT TTATGCCACT TTTTACGAAG TGACCCGTTT TGAATTTGTT GTATTGAACA 19560

TTTTAAAACG CTTTATTATT TTGTGTGCAA CTGTTAATTA TCCTGTATGT ATAGTGATTA 19620

ATAGTGTACA TCAAGTGTTT TTTAACTTAT AATGAATAGT GAGTTTATAT ATGGACGGGT 19680

AACAAATTTA GGAGGTAAGA TTTTGAGTTT AATAAAGAAA AAGAATAAAG ATATTCGCAT 19740

TATACCATTA GGCGGTGTTG GCGAAATTGC TAAAAATATG TATATCGTTG AAGTAGACGA 19800

TGAAATGTTT ATGTTAQATG CTGGACTTAT GTTTCCAGAA GACGAAATGC TAGGTATTGA 19860



CCTTACACAC GGACATGAGC ACGCGATTGG TGCAGTGAGT TATGTTTTAG AACAATTAGA 19980

TGCACCAGTA TATGGATCTA AATTGACAAT AGCGTTAATT AAAGAAAATA TGAAAGCCCG 20040

TAATATTGAT AAAAAAGTTC GCTACTATAC AGTTAATAAT GATTCAATTA TGAGATTCAA 20100

AAACGTGAAT ATTAGTTTCT TTAATACGAC ACACAGTATT CCTGATAGTT TAGGTGTTTG 20160

TATTCACACT TCATATGGTG CCATTGTGTA TACAGGTGAA TTTAAGTTTG ACCAAAGTTT 20220

ACATGGACAT TATGCACCAG ATATTAAACG TATGGCAGAG ATTGGTGAAG AAGGCGTATT 20280

TGTCTTAATC AGTGATTCTA CTGAGGCAGA GAAACCTGGA TATAATACTC CGGAAAATGT 20340

GATTGAACAT CATATGTATG ATGCmTGC AAAAGTGCGA GGTCGCTTGA TAGTTTCATG 20400

TTATGCTTCG AACTTTATAC OTATTCAGCA AGTTTTAAAT ATTGCTAGCA AGCTAAATCG 20460

TAAAGTGTCA TTTTTAGGAA GATCACTTGA AAGTTCATTT AATATTGCTC GTAAAATGGG 20520

GTATTTCXy^C ATTCCTAAAG ATTTGCTAAT TCCTATAACA GAAGTTGATA ATTATCCTAA 20580

AAATGAAGTG ATAATTATAG CTACTGGTAT GCAAGGAGAA CCTGTAGAAG CCTTAAGTCA 20640

AATGGCGCAA CATAAGCATA AAATTATGAA TATCGAAGAA GGCGATTCTG TATTTTTAGC 20700

AATTACGGCT TCTGCTAATA TGGAAGTTAT CATTGCGAAT AcATTAAATG AGCtTgTtAC 20760

GnCTGGCGCA CATATTATTC CAAATAACAA AAAGATTCAT GCTTCAAGTC ATGGTTGCAT 20820

GGAAGAATTA AAAATGATGA TTAATATTAT GAAACCTGAA TACTTTATTC CTGTACAAGG 20880

TGAATTTAAA ATGCAGATAG CACATGCGAA GCTAGCAGCT GAAGCAGGTG TTOCACCAGA 20940

AAAGATTTTC CTTGTGGAAA AAGGAGATGT CATTAATTAC AACGGTAAAG ATATGATATT 21000

AAATGAAAAG GTAAATTCAG GAAATATTTT AATAGATGGC ATTGGTATTG GGGATGTAGG 21060

AAATATCGTG TTGAGAGACC GTCATCTTTT AGCAGAAGAT GGTATCTTTA TTGCTGTTGT 21120

AACOTTAGAT CCTAAAAATA GACGTATAGC TGCGGGACCT GAAATTCAAT CTCGTGGGTT 21180

TGTATATGTA CGTGAAAGTG AAGACTTATT ACGTGAAGCA GAAGAGAAAG TACGTGAAAT 21240

AGTAGAGGCT GGTTTACAAG AAAAACGCAT AGAATGGTCT GAAATTAAAC AAAATATGCG 21300

TGATCAAATT AGTAAACTAT TATTCGAAAG TACAAAACGT CGTCCTATGA TTATTCCAGT 21360

AATTTCTGAA ATTTAATCAA AAAGTCATTA ACATAAAAGA GGTCAGAACA AGTCACTGAA 21420

ATATAATGGT TGTCATGGAC AATTTACTTA TATTTTATGA TAGTCAATTG AAGGGGTAAC 21480

GATTAATCTG TTATCTTAAG TAAATTGATA CATAGATGAT ATTGTTCTAA CCTCTTTCAT 21540

CGTCTGTTTG GACTACATAT TCTAAACATC AAATAGGAAA TTATATATAA TAACGTCGTT 21600

TTAACTAAGG CAACATAAGG AGGTGCGTCA ATTGGCACAA GCAAAAAAGA AATCGACAGC 21660
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813 



ATAAATAATT GTTTTAGGGA GAATAATCGT GACTGCAAGT TATTCCAATT ATTTAAAGTC 960

TTTTCACCTT TTTGGTTACT TAAAGAGATT TAAGTCGGAA AGACAATCCG TTATCAATAT 1020

TAAACAAGTG TATGCTTAGG CATAAATTTG GGTGGTACCA CGGAAATGAC TTTCGTCCCT 1080

TATTTTTTAA GAGGATGAAA GTCTTTTTTT AGTTAAACAA CAAATATGAT AAATAGAAAA 1140

TGAATAGTTC GAATAGGGAG GTCAGTGACA TATGTCTGAA CAACAAACAA TGTCAGAGTT 1200

AAAACAACAA GCGCTTGTAG ATATTAATGA AGCAAATGAT GAACGTGCAC TGCAAGAAGT 1260

TAAAGTGAAA TACTTAGGTA AAAAAGGGTC AGTTAGCGGA CTAATGAAAT TGATGAAGGA 1320

TTTGCCGAAT GAAGATAAAC CTGCX5TTTGG TCAAAAAGTG AATGAATTGC GTCAAACAAT 1380

TCAAAATGAA TTAGATGAAA GACAACAGAT GTTAGTTAAA GAAAAATTAA ATAAGCCAAT 1440

TGGcTGAAGA AACAATTGAT GTATCATTAC CAGGTCGTCA TATTGAAATC GGTTCAAAGC 1500

ATCCATTAAC ACXSTACAATA GAAGAAATTG AAGACTTATT CTTAGGTTTA GGTTATGAAA 1560

TXXSTGAATGG ATATGAAGTT GAACAAGATC ATTATAACTT CGAAATGCTG AATTTACCTA 1620

AATCACACCC TGCACGTGAT ATGCAAGATA GTTTCTATAT TACGGATGAA ATTTTATTAC 1680

GTACGCATAC ATCACCAGTG CAGGCACGTa CGATGGAATC ACGTCATGGT CAAGGTCCAG 1740

TTAAAATTAT TTGCCCTGGT AAAGTGTATC GTCGTGACTC TGATGATGCG ACACATAGTC 1800

ATCAATTTAC ACAAATCGAA GGATTAGTTG TTGATAAAAA CX5TTAAAATG AGTGATTTGA 1860

AAGGTACTTT AGAATTGTTA GCTAAGAAAT TATTTGGTGC TGATCGTGAA ATTCGTTTAC 1920

GTCCAAGTTA CTTCCCATTC ACTGAACCTT CTGTAGAAGT TGATGTGTCA TGTTTTAAAT 1980

GTAAAGGAAA AGGTTGTAAT GTGTGTAAAC ACACAGGATG GATTGAAATT TTAGGTGCTG 2040

GAATGGTACA TCCTAATGTA TTAGAAATGG CTGGTTTTGA TTCTTCAGAG TACTCTGGAT 2100

TTGO^TTTGG TATGGGACCA GACCGTATTG CAATGTTGAA ATATGGTATA GAAGATATTC 2160

GTCATTTCTA TACTAATGAT GTGAGATTTT TAGATCAATT TAAAGCGGTA GAAGATAGAG 2220

GTGACATGTA ATGTTGATAT CAAATGAATG GTTGAAAGAA TATGTAACAA TCGATGATTC 2280

TGTAAGTAAT TTGGCAGAAC GTATTACGCX! CACAGGTATT GAAGTGGATG ATTTAATTGA 2340

CTACACAAAA GATATCAAAA ATTTAGTTGT CGGCTTCGTT AAGTCAAAAG AGAAACATCC 2400

TGATGCTGAT AAATTAAATG TTTGCCAAGT TGATATCGGA GAAGACGAAC CTGTACAAAT 2460

CGTTTGTGGT GCACCGAACG TTGaTGCAGG ACAATATGTC ATTGTTGCTA AAGTAGGTGG 2520

CAGATTGCCT GGTGGTATTA AAATTAAGCG TGCCAAATTA CGCGGTGAAC GTTCAGAAGG 2580

TATGATTTGT TCGTTACAAG AAATTGGTAT TTCAAGTAAC TATATACCGA AAAGTTTTGA 2640
ATATTTAGAT GATCAAGTAA TGGAATTTGA TTTAACGCCX5 AATCGTGCAG ATGCTTTAAG 2760

TATGATAGGT ACTGCTTATG AAGTTGCAGC ATTATATAAT ACAAAAATGA CTAAGCCAGA 2820 

GACAACATCA AATGAGCTTG ATTTATCTGC AAATGATGAA CTGACTGTGA CAATTGAAAA 2880 

TGAAGATAAA GTACCATATT ATAGTGCACG TGTTGTTCAC GACGTGACAA TTGAACCCTC 2940 

GCCAATTTGG ATGCAAGCAC GCTTAATAAA AGCGGGTATA CGTCCTATTA ATAATGTTGT 3000 

TGACATTTCA AATTATGTGT TATTAGAATA CGGTCAACCA TTGCACATGT TTGATCAAGA 3060 

TGCGATTGGT TCACAACAAA TTGTTGTTCX3 TCAAGCTAAT GAAGGCGAAA AAATGACAAC 3120 

ATTAGATGAT ACAGAACGTG AATTATTAAC GAGOGATATT GTCATTACTA ATGGACAAAC 3180 

TCCAATTGCA TTAGCTG6TG TTATGGGTOG CGATTTTTCA GAAGTTAAAG AACAAACATC 3240 

AAATATAGTG ATTGAAGGTG CTATTTTTGA TCCAGTTTCA ATTCCTCATA CATCAAGACG 3300 

TTTAAATTTA CGCAGTGAAT CATCTAGTCG TTTTGAAAAA G6AATAGCTA CTGAATTTGT 3360

AGATGAAGCA GTCGACCGTG CATGTTATTT ATTACAAACT TATGCAAACG GAAAAGTGCT 3420 

AAAAGATAGA GTGTCTTCAG GAGAACTTGG TGCATTTATT ACACCAATCG ACATCACTGC 3480 

TGATAAAATT AATCGCACTA TTQGATTTGA TTTGTCACAA AATGATATTG TTACTATTTT 3540 

TAATCAACTA GGGTTTGATA CAGAAATAAA TGATGATGTT ATTACAGTGC TAGTACCATC 3600 

ACGTCGTAAA GATATTACAA TTAAAGAAGA TTTAATTGAA GAAGTTGCAC GTATATATGG 3660 

ATACGACGAT ATTCCATCAA CGTTACCTGT CTTCGATAAA GTTACTAGTG GTCAGCTAAC 3720 

TGATCGCCAA TATAAAACTA GAATGGTTAA AGAAGTGTTA GAAGGTGCTG GATTAGACCa 3780 

AGCTATTACG TATTCGTTAG TTTCTAAAGA AGATGCTACT GCaTTTTCGA TGCAACAGCG 3840

TCAAACAATT GATTTATTGA TGCCAATGAG TGAAGCGCAT GCGTCATTAC GTCAAAGTTT 3900 

ATTlfcCACAT TTAATCGAAG CGGCATCATA TAATGTGGCA CGCAAAAATA AAGATGTAAA 3960 

ATTATTTGAA ATCGGCAATG TCTTCTTTGC TAATGGAGAA GGTGAACTAC CAGATCAAGT 4020

TQAATATTTA AGTGGTATTT TAACTGGAGA TTATGTAGTC AATCAATGGC AAGGTAAGAA 4080 

AGAAACGGTT GATTTCTATT TAGCAAAAGG TGTCGTGGAT OGAGTATCTG AAAAGTTAAA 4140 

TCTTGAATTT AGTTATCGCC GTGCTGATAT TGaTGGATTA CATCCAGGTC GTACTGCTGA 4200 

AATCTTATTA GAGAATAAAG TTGTTGGTTT TATTGGTGAA TTACATCCAA TATTAGCAGC 4260 

TGATAATGAT TTAAAAOGTA CGTATGTTTT TGAGTTGAAT TTTGATGCAT TAATGGCTGT 4320 

GTCGGTAGGT TACATTAATT ACCAGCCAAT TCCGAGATTC CCAGGCATGT CTCGTGACAT 4380 

TGCATTAGAA GTAGATCAAA ATATTCCAGC AGCTGATTTA TTATCAACGA TTCATGCACA 4440 
AAAAGGTAAA AAATCAATTG CAATACGTTT AAATTATTTA GACACAQAAG AAACATTGAC 4560

AGATGAGCGC GTTTCAAAAG TACAAGCGGA AATTGAAGCA GCATTAATTG AACAAGGTGC 4620 

TGTTATTAGA TAATGATTTA AACCCCATGT ATAAGGATAT CTGAAGTAGA TTGATATCCC 4680 

TAACATGGGG TTTTATTTTT GGGTTCACCA ATTTGGTTCC AATGCATTTA AAAAGTCAAA 4740 

GAGGAACAGC GGAATACAGA TGATGcTTCG CACAACTGCA TAAAAGCCTC TAATGATTAA 4800 

AAATCAAAGA GGCTTTAAAA TTTTTTGGGC TTTTTCACGA TTTTTAAAAT GCTTTTTTGA 4860 

AATGGTATCT AAACGTGAAA GACCGTATTT TTTTATAATT TTGGCGGCGA TTACATCGAC 4920 

TTTAGCACCG GCACCTTTAG GAATCGTCAT ATTAATATTT TTTGATATTT GATCCATATA 4980 

TGTAACAAAT GCGTATCGAG AAATTATGCT TGCCACTGCA ATGGCTAATG ACTTCGATTC 5040 

TCCTTTTGTT TCAAATTTTG TTTTCTTTGG AAGTGGTATA TCTGATAATG CGTAATGGCT 5100 

ATACACTTCG CGTTTTGCGA ACTGATCAAT GACGATATAG TCTAATTGAG ACGAATCAAT 5160

TTTTTCAAGT ACATTTTTGA TGGCTTCATT ATGAAGGGCA GCTTTCATTT TTACTTGAGT 5220 

CCAGCCTTTT GCTTGCTGAA TATTATATTT TTCATTGTGT AGTGTTAATA ATGAATGTGG 5280 

TATGAAAGTA ACCAATTGCT CAGCAAGTTC TACAATTTTG GTATCGGTTA ATTTTTTTGA 5340 

ATCATCTACA CCCAAAGTTT TTAAAATAGG GACATGCTCT TTGGTAACGA AAGCAGCACA 5400 

CACAGTCAAC GGACCAAAGT AATCGCCACT TCCAGCCTCA TCACTACCAA TACAGTTAAA 5460 

TTGrTCATAC ATTAaAGTTg TcCAgAAAAG AATTAGCCAT ATTTnCCTTT 5510 
(2) INFORMATION FOR SEQ ID NO: 166: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 9623 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166:

GnTTATACTT ATAAATTTTA CGGGGGTAAT ATAATACTtA TTTACCTGTA ATATATGATA 60 

ATTCTTCAGC GGCAGCTGCG TTGATAOTTC TATGAGAAAT GATACCTAAT CCTTTAACAT 120 

TGGATTCTGA AATAACGATA GAACCATCAC TGTTAACTTT TTCAACAAAT GCTACATGAC 180 

CGTAATGTTG ATCTGCACCA AATTGTCCAG CCTCAAATAC AACAGCAGCA TGACCTTTTG 240 

GTGTATGACT TACTTGATAA TCACGGTATT GAGCTCGATT ATTCCAATTA TGTGCATCAC 300 

CTAAATCACC TGAGATAGAT GTACCAAATT GTTTCATACG GTTATATACG TACCAAGTAC 360 
ATGAATCATC ATAATCCTTG ATAGAACGTT CATATTTATC TAAATCTGGC ATGCGTTCAT 480

CGTCAAACTG AGTTAATTGA TAGTG1TTAA TAATACTGTT TAATTTCTTA GCATAGTTTG 540 

GATCTGTAGC ATATGTTTTA GATAAGTGTG ATGTTGCATC TTTATAAGAA TCGGCTTCCX3 600 

ATTTCCATGT TGGTTTATAA ATTGTTCGAT TGCCATCAAT ACCATTTTTA ATAAGGTCAG 660 

AGTAATCTTT TAGTGATTCT TTCGTGCTTG GATATTTTCG GAATCCAGCA TTAATACTAT 720 

ACAATTGATT ACCATCAGCT TCTAATGTGT TAAAAGGAAC AGAATTCCCT TCaAAAGCAC 780

CTTTGATACC GAATAAATTA TGGTTTGGTG ACwTAGCTAA AGCACTACGA CCTGAGTCAG 840 

ATTCTAAGAT TGCTTGGGCA ATCATGACAG ACGCATAAAT ATCGTTATCT TGACCAATGC 900 

GATGTGCATC TTTAGCAATT GATTTGACAA ATTGACGTGT ATCTTTTGAG TCAACAACGT 960 

TAAATTGTCC GCTATCATCA TTGTTAGATA TACTAGGATC TGTTTCXSAAT AATGATGTTG 1020 

CACX3TGTATC CTTTTGATTA ACATCGTTAT TGAATGATTG AGCAGGTTTA GATTTATGTT 1080

TCAATTCATC TTGTGTTGGT AACTGTGGAT TCTTTGTATT AGATTTTTCA TTTTTGTCTT 1140 

TTTTAGATTG AGATGCATAA TCTTTTTGTG TTTTCTTTGC ATCTTCACTG TATTGATCCA 1200 

AAATAGAGTC TAAAGCCGAA TCTGACATTG ATTGATTATC TTTCGATGAA GATTTTTGAT 1260 

TTGCTTTATC GTCACTTGCT GGTTGACTAT TTGATTGATT AGGTTGTGTT GGCTTTGGCG 1320 

AATTTGGTTG CTTATTAGAT GTACTTGGTT TTGTATTGTT TGATTTAGGT GCTTTTTGAT 1380 

TGTCTGCTTT ATCTTGTTTA GATGATTGCG TATCAGTGTC ATTTTTGATG CTATTGTCAC 1440 

TGTTTTTATT CGAATCATTT GTTGACTTTT CGCCATTACG AGGTTGTTCG TAATCAGAAA 1500 

TATCCGAATT TAAATTGAAT AAGTTTTGGA TTAAAGTTGT TAATGAGTAA TTATCATCGT 1560 

ATTTATTTTT GGTTAGCAAT TGGTTTATAT TGGTTTGTGG TAAATTCTTA TAAATAAAAT 1620 

CAAtBATATT GTTAGAGTCT GAAGTGCTGT CXTTCTATAGT TTTAAATTTT TTGTCGTTAT 1680 

TGTCTTGGTT ACTTGTATTA TTTTTGTCTG CTTTATCAAT ATCTTTACTT GTAGTATCCT 1740

TAGAAGTTTC ATCGTCATTA GATTTTTTTG AATCATGAGA TGTTGTCTTA GCTGTAGTAT 1800 

CTTTTTGAGG TGTATCAGCA TAAGCGgTAG GTGAAaCTAA AGTAGGTAAT ACGAGCGTAG 1860 

TTGATAGCAA ATAAATTAAA ATTTTATTTT TAGGCATATT TCGTATTCTC CCTTGAAAAA 1920 

TATAATAATT AAGTGTGATA ATAAACTATG ATTTGTTATA ATTTATCGTA TGCTGAAAAT 1980 

AGTTGATAGG TATCAATCGA CTAAATATCT TCCAGTAAAT TGATTATACT AATTCACAAC 2040 

GCAAAAATAA ATTAATTTAC AAAAAATATA TAAAAAATAT GAATAATTCC TACATAGGAG 2100 

TGTGACAATG AAGAACGCAT TTAAATTATT TAAAATGGAT CTGAAGAAAG TAGCTAAGAC 2160 
TAACTTATGG GCAATGTGGG ATCCATATGG CAACACGGGA CACATCAAGG TCGCAGTCGT 2280

TAATGAAGAT AAAGGCGACA CAATCAGAGG GAAAAAAGTT AATGTCGGTA ATACGATGGT 2340

TAATACACTC AAGAAAAATA AAAGTTTTGA TTGCCAGTTT GTAAGTAGAG AGAAAGCTGA 2400

TCATGAGATA AAAATGGGTA AATATTTTGC AGGTATTTAC ATCCCATCTA AGTTTACACA 2460

TGAAATTACA GGGACACTAC GTAAGCAGCC TCAAAAAGCA GATGTAGAAT TTAAGGTGAA 2520

TCAGAAGATT AACGCTGTTG CGTCTAAGCT AACAGATACT GGTTCGTCAG TTGTCGTTGA 2580

AAAAGCGAAT GAACAATTTA ATAAAACAGT AACTCGAGCA TTATTAGAAG AAGCTAACAA 2640

AGCAGGTTTA ACTATTGAAG AAAATGTGCC GACAATTAAC AAGATAAAAA ATGCGGTATA 2700

TTCAGCAGAT AAAGCTTTAC CTAAGATTAA TGACTTTGCG AATAAAATTG TATATTTGAA 2760

TAACCACCAA GCGGATTTAG ATAAATATGC CAATGATTTT AGAAAACTAG GAAATTATAA 2820

AGGTGATATT TTAGATGCTC AGAAAAAATT AAACGAaGTC AATGGTGCTA TTCCGCAACT 2880

TAATGAAAAG GCTAAGTTGA TATTAGCTTT AAATAATTAT ATGCCGAAAA TTGAAAAAGC 2940

GTTAAATTTT GCAGCTGATG ACGTGCCAGC GCAGTTCCCT AAAATTAATC AAGGACTTAA 3000

CATTCCGAGT CAAGGTATTG ATCAAGCTAA TGGACAGTTA AATGATGCCA AAGGCTTCGT 3060

CACACAAGTT AGAAGTAGAG TCGGTGATTA TCAAGATGCA ATTCGACGCG CGCAAGATTT 3120

AAATCGAAGA AACCAGCAAC AGATTCCTCA AAATAGCGCG GCGAACAACG AAACATCAAA 3180

TAGTGCACCT GCAGCTGGTA ATGGTGTAGC ATCAACGCCA CCAAGTGCAC CAAGTGGCGA 3240

TACTGCACCA AATAATAATG TTACGCAAAA TACCGCACCA AATAGTAATA ATGCGCCTGT 3300

ATCGACTACA CCACAAAGTA CAAGCGGGAA AAAAGATGGT CAAAGTTTTQ TAGATATAAC 3360

AACAACACAA GTCAGCACAG CTAACGAGAA CACACAAAAC ATTACAGATA AAGATGTTAA 3420

ATCAflX3GAA GCGGCATTAA CGGGCTCTTT ATTATCATTA TCAAATAATT TAGATACCCA 3480

AGCX^AAAGCC GCACAAAAAG ATAGTCAGGC ATTACGTAAT ATTTCGTATG GGATTTTAGC 3540

ATCXXy^CAAG CCTTCTGATT TTAGAGAGTC TTTAGATAAT GTTAAGTCCG GTTTAGAATA 3600

CACAAOGCAA TATAATCAAC AATTTATCGA TACATTAAAA GAGATTGAGA AGAATGAAAA 3660

TGTTGATTTA TCAAAAGAAA TTGATAAGGT AAAAGCAGCT AATAATCGAA TTAATGAATC 3720

ATTAAGGTTA GTTAATCAAT TAAGCAATGC ATTAAAGAAT GGTAGTTCAG GAACTGCTGA 3780

AGCTACTAAA TTACTAGATC AACTTTCAAA ACTAGATTCA TCATTATCAT CATTTAGAGA 3840

TTATGTTAAA AAAGATCTTA ACAGCTCTTT AGTATCAATA TCACAACGTA TTATGGATGA 3900

ATTGAACAAA GGGCAAACTG CATTATCCAA TGTTCAGTCT AAATTAAATA CAATTGATCA 3960
AACAGTATTA CCAAGTATTG AACAACAATA CATTAGTGCT GTTAAAAATG CTCAAGCAAA 4080

CTTCTCGAAA GTGAAAAGTG ATGTAGcTAA AGCTGCTAAC TTTGTGCGCA ATGACTTACC 4140

ACAGTTAGAA CAGCGATTAA CTAATGCGAC AGCAAGTGTG AATAAAAATT TACCAACGTT 4200 

ATTAAATGGT TATGATCAAG CGGTAGGATT ACTAAATAAA AATCAGCCAC AAGCGAAAAA 4260 

GGCTTTATCA GATTTAGCTG ATTTTTCTCA AAATAAATTG CCTGATGTTG AAAAAGATTT 4320 

GAAAAAA6CG AATAAAATTT TCAAGAAATT AGACAAAGAT GATGCAGTCG ACAAATTAAT 4380 

CGACACACTT AAGAATGATT TGAAAAAGCA AGCGGGTATT ATTGCAAATC CTATTAATAA 4440 

GAAGACTGTT GATGTTTTCC CAGTTAAGGA TTATGGTTCA GGTATGACAC CATTCTATAC 4500

TGCACTGTCA G7ATGGGTAQ GTGCACTCTT GATGGTAAGT TTATTAACGG TTGATAATAA 4560 

ACATAAGAGT CTAGAGTCAG TCTTAACGAC AAGACAAGTG TTCTTAGGTA AGGCAGGATT 4620 

CTTTATAATG CTTGGTATGT TGCAAGCACT CATTGTATCX5 GTTGGAGATT TGTTAATCCT 4680

AAAAGCAGGA GTTGAGTCAC CTGTATTATT TGTACTTATA ACGATTTTCT GTTCGATTAT 4740 

TTTCAACTCA ATCGTATATA CGTGCGTATC ATTACTTGGT AACCCAGGTA AAGCCATTGC 4800 

AATCGTATTG CTTGTATTAC AAATTGCAGG TGGTGGGGGA ACATTCCCAA TTCAAACTAC 4860 

GCCACAATTT TTCCAAAACA TTTCGCCATA CTTACCATTT ACGTATGCAA TTGATTCATT 4920 

ACGTGAAACA GTAGGCGGTA TTGTTCCGGA AATCCTAATT ACAAAATTAA TTATATTAAC 4980 

GTTATTTGGT ATAGGATTCT TCGTTGTAGG TTTAATTTTA AAACCTGTAA CAGATCCATT 5040 

GATGAAGCGC GTATCTGAAA AAGTTGACCA AAGTAACGTT ACAGAATAAA AATTAAATCC 5100 

ACACATTAGG GTTATAGCTC CTTAATGTGT GGATTTTTAT GTTTTTAGAC AGAAGAGATA 5160

GTAATTTCTG TCTTTTATGG GACGGTTGTT ATCATTGCTA TTATCCAGGA TGACTTACTA 5220 

TAG»CTAAT ATTACCGACA AAGTGAATAT CCTCGTCTTC CGTAGTTAAA ATAAAGCTAG 5280 

AACCTTTTTG GATGTCATAG TGCTTATCGT TTACTGTTAA AGTACCAGTA CCATCGATAA 5340

TTGTAACTAA GCAATAAGCA TGTGGTTTAT TGAATTTTAA ATCTCCATGA ATATCCCATT 5400 

TATATACTGC AAAATATTGA TTATCTACAA ATTGAGTTAC AGTGTGTGTG TCGATGTGAG 5460 

TTGTTATAGG AGTAGTATTT GGTTCATGAT TGCCTAATTC AATCACATCT TTACTTTGCT 5520 

CTAAGTGCAA AT«CGCAAT TGACCATTTT GATCTCGTCT ATCATAGTCA TAAATACGGT 5580 

ATGTCGTATC GGAGGATTGT TGTGTCTCTA AAATTAAAAT ACCCGAACCA ATGGCATGGA 5640 

CAGTGCCAGC AGGAACATAA TAAAAGTCAC CGGGCTTAAC AGGTATACGT TTGAAAAGAC 5700 

TGTCAAATTC ATGATTATCA ATCATGTCTA TTAACGTCTG TTTATTATGT GCATGTACGC 5760 
GTTCGCCTTC GTGTTTTAAA GCGTAGTCAT CATCTGGGTG AACTTGAACA GATAATTTAT 5880

CATTGGCATC TAATACTTTA GTTAGCAGAG GGAAACTATC TCGTGAATCA TTATCGAATA 5940 

ATTCACGATG TTGTGACCAA AGTTGATCTA GGGTCATATC CTTGTATGGA CCATTGATAA 6000 

TTGTATTAGG ACCATTTGGA TGTGCAGAAA TTGCCCAGCA TTCACCAGTT GTTTCATTAG 6060

GGATATCATA GTTAAATGCT TTTAATGCAT GACCGCCCCA AATTCTGTCT TTAAAAACGG 6120 

GTTGTAAAAA TAATGCCATA GTTAAAACTC CTCTATATTT TCATTAATAA GTTATAAATT 6180 

TCTGTAGTAC TGTTTGCATT AATTAGTGAT TGGCGTGTCT CATCATTCAT TAACGCTTTA 6240 

GATAAGC6CT GAAGTATTTT TAAAXGTGTA TCCTGACTGT TGTTTGGTAC GGCAATTAAG 6300 

AATATCAATT GAGGTAGACT ACCATCTAGA CTGTCCCATT TAACACCATG ATTATTTTTC 6360 

ATAACAGCTA CAATCGGTTG TTTTACAACA TCAGACTTTG CATGTGGAAT GGCCACGTTC 6420 

ATGCCAATAG CTGTCGTAGm tCcATTTCAC GTTCTAGTAT TGCATTTTTT AAATGCGATG 6480

TGTGCTCTAC ATAACGGCAA ATTTTAAGTT TATGAATCAA CATATCAATT GCTTCGTTTC 6540 

GAGACATGTC GTGATCAGTA ATTATCATAG TTTGTTGATC AAAAACATGA GAAGGTTTAT 6600 

TGAGATGTGA ATGTTTCGCG GTGTTATCTA CATTOTCAAC CTCTGTATCA TGTTGTGTAA 6660 

TATCTGTATC ATGAAGTTGC GTGTGTTGCG CTGGTGCATC TACTGCTATA ACTGGTGTAT 6720 

TGCGTTTTAA TAATAGTACA GTAGTCATTG TGACAAGACT ACCTACTATC ACTGCAAAGA 6780 

TAAACCATAA TACATGATCA ATACCACCTA ATACAGCCAC GATTGGACCT CCATGTGCGA 6840

CTCTATCGCC GACACCACCA ATGGCTGCAA TGACTGATGC AATCATTGCA CCAATGATGT 6900

TTGCAGGTAT AATGCGCAAT GGATCTTGGG CTGCGAAAGG AATAGCACCT TCAGTAATAC 6960 

CAAATAGTCC CATAGTGAAG GAAGCCTTAC CCATTTCTCT TTCGGAATGA TTGAATTTAT 7020 

ACTTTTGAAC AAACGTTGCT AAACCTAAAC CX3ATTGGTGG TGTACATACA GCAACTGCGA 7080 

CCATACCCAT AACGGCGTAA TTACCTTCAG CAATAAGTOC TGAGCCAAAT AAAAATGCTA 7140

CCTTGTTTAC TGGACCGCCC ATATCGAAGG CAATCATCGC ACCTATAATC ATCGCAAGTA 7200 

TAATAATATT AGCACCTTGC ATACTTTTTA ACCAGGTTGT TAATGCCTCA AAAATATTAG 7260 

AAATTGGTGC ACCGATTAAA AATATAAATA TCAATCCTAC AACGACCGAT GAAATAATGG 7320 

GAATAATAAT GATAGGCATA ATTGGTGCCA TTGCTTTTGG AACTTTAATA TCTTTAATCC 7380 

ACTTTGCGAT ATAACCTGCT AAGAAACCAG CAACT^TACC ACCTAAAAAT CCTGCGCCTG 7440 

CATCACTGCC ATAAAAACTA CCGTCAGCAG CGATAGCGCC GCCAATCATA CCAGGAACAA 7500 

GACCGGGCTT GTCAGCGATA CTAACAGCXSA TATATCCAGC TAGTATTGGA ACCATAAATT 7560 
ATCCTTTTGA TGTCGTTtCA CCGCCTAQAG TCAGCGCGAT GGCGATAAGG AGTCCACCAA 7680

CTACGATAAA AGGAACCATA AACGATACAC CGTTCATTAA ATGTTGATAC ACCATTTGAA 7740

TACCATTTTT AGACTTACCG CGATCTTTCG AATGATAATT TGTTTCAGAT TGATAAATAG 7800 

GCGCATCTTG ATTAATGATA CGTTGAATTA GACCTCTCGG ATTATGAATC CCTTCGCGAA 7860 

CATTTTCATT AATCAACOGT TTACCAACAA ATCGGGACAG ATCAACTTGT TTATCAGCTG 7920 

CAATTATGAC ACCGTCAGCT TCTTCGATGT CTTGCGTAGT TAAAACATTT TCAGCACCAA 7980 

CACCGCCCTG TGTCTCTACT TTAATATCCA CACCCATTTC TTTTGCTACC TGCTCAAGCT 8040 

TTTCTTGAGC CATATATGTA TGTGCAATGC CATTTGGGCA TGAGGTAATA GCTACAATTT 8100

TCATAAAATC ATCTCCTTTT CTATATTGTA AGCGTATTCT CGATACTAAA AAAAAGAATA 8160 

ATTACCGTTA CTAGTGGCAA TTATTCTTGT AAGTATTCAA ATAACTGTTO CTTTAAACTA 8220 

TGATCATCTA AACTACATAA ATGGTTCACT GAATCATCAT CCAAGTTAGC AATTAATTGC 8280

ATCATTTGTT TTGTAAAAGC TTTGTCTTTA TGCGAAATOG CTAAGAAAAA GACAAGTTTG 8340 

ACATCGTGTT GTCGCCAAGG AAAAACATCT TTTGTGCGAA AAATAAGCAC ATGTGATTGT 8400 

AAAACTTTTT CAGGATCTCC ATGAGGAATC GCCATAAAAT TACCTATGTA TGTAGAAGAT 8460 

GATTTCTCAC GCTCTAAAGC TGATTCGATA TATCCTTCTA CAATCGCATG ATGTGCTTGT 8520 

AATATTTTTT GAGCTTCTTC AAAAATTTGC ACAGTATGCC GTGATTTTTG TTCAGTATTT 8580 

ACGACAAGGA AATTGACAGT GTCCATATGA TGATGTGCTT GAACCGGATT TTGCTTTTGC 8640 

TTCACAACGT GTCTGATTTT GTGACGATCA TCTTCAGAAA ATAATGGTGC AACCTTGATA 8700 

GTCGTCAGGT GCTTAGGAAG TATGTTTAGC GTTTGTTTAG GAATATCATG GGTCGTTATT 8760 

AATAAATCTA CATTGTCAAA GTGATAGTGT GTTATATTTT CTAGTTTAAT CGTATTTATC 8820 

ACTGACAACT CTTCGGATAA GTTATTTATT TTAGTTTCTA AAAAATTCGA CACACCTAGA 8880 

CCATAATAAC AAGCAATGAC TACATTTAAT TGTGTTTTGG TACGACGCTC GATGGCAGCT 8940

TGAAAATGAA TTGTTAAAAA TGCAATTTCA TCTTCGCTCA TCTCTATATC AGTATCAATT 9000 

GCTAATTTAT CAATCGCTTC AAAAAGTGTG TTAAACACAA AGGGATAGAG TTTTTTAATC 9060 

TCTATAACTA AAGGATTGTT TAAATAAATG TTTTGAGTGA TACGTAAATA TGCTTTACTA 9120 

AAATGATTAT ATAAATTTTG TTGTAAAATC GAATCTTCAT TGAAAGGTAC ATGAATACGT 9180 

TGCTGCATCA ATTCGATTAA GCGATCAATA TAACTTTGTA TAAATATACG TTCTATGCCA 9240 

ATATCQAGTT TATTAAAATG ATAAGCAATA AAGAATGAAA ACATATTGAT TACTTTTTCX5 9300 

TTCAAGTCAT AACCTAATCT TTCGTIGATT TGCTTAATGC AAGATTGAGA TATCAATTTT 9360 
AGATGAATTA AAAGCTGTTG TATTTGAATA TCAGTTGTTT CAATACTATG TTGTTGAAGT 9480

GTCTCTTGTA TAATATGCGA AATCATCCTT TGGTGTGAAT CAGGTAATTC aTTTAAAATT 9540 

AGGTCTTCAA CATGTACATG CCCTGATGAT AATTGATTTA AATGGATGAT GGCATTAGTG 9600 

ATATCATTAT CTGTTCCATC GAC 9623 
(2) INFORMATION FOR SEQ ID NO: 167: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1021 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167: 

ACC6TGGAAA CAC6TCTAGT CAATCAGAAA GCGATAAAAA T6TGACTAAA TCATCTCAAG 60

AGGAAAATCA AGCAAAAGAA GAATTACAAA GCGTTTTAAA CAAAATTAAC AAACAATCAA 120 

GTAAGAATAA TTAAAAAATT TTGATATTGT CTATOTTTAT A6TTCACAAG CCATTCAACG 180 

TATTGTAAAC TAAGGATAGT GTATTTTTTT AATAGTAATT TGTCAGGAGG TGCCTATCTA 240 

TGGAAGAACA TTACTACGTA AGTATTGATA TTGGATCATC AAGCGTAAAA ACAATAGTAG 300 

GCGAGAAATT TCACAATGGT ATAAATGTGA TAGGTACAGG ACAAACCTAC ACGAGCGGTA 360

TAAAAAATGG TTTAATTGAT GATTTTGATA TTGCGCGACA AGCAATCAAA GACACAATTA 420

AAAAGGCATC AATCGCTTCG GGTGTTGATA TTAAAGAAGT TTTCCTGAAA TTACCTATCA 480

TTGGAACGGA AGTTTATGAT GAATCAAATG AAATCGACTT TTATGAGGAT ACAGAAATCA 540

ACGGTTCACA TATCGAAAAA 6TATTAGAAG 6TATTAGAGA AAAAAATGAT GTGCAAGAAA 600 

CAGfSuSTAAT TAATGTGTTC CCGATTCGTT TTATAGTCGA TAAAGAAAAT GAGGTTTCAG 660 

ACCCTAAAGA ATTAATTGCC AGACATTCAT TAAAGGTTGA AGCAGGCGTA ATTGCTATTC 720

AAAAATCGAT TTTAATTAAT ATGATTAAAT GCGTAGAAGC ATGTGGTGTT GATGTATTAG 780 

ATGTTTACTC TGATGCATAT AACTATGGTT CAATCCTAAC AGCTACTGAA AAAGAGTTAG 840 

GTGCATGTGT CATTGATATT GGTGAAGACG TTACGCAAGT TGCTTTTTAT GAACGCGGTG 900 

AATTAGTAGA TGCTGATTCT ATCGAAATGG CAGGGCGTGA TATTACaGAC GATaTTGCAC 960 

aAGGrTTaAA CACTTCTnAT GAAACTGCTG nAAAAAGTTA AACACCAATn TGGTCATGCA 1020 

T 1021 
(2) INFORMATION FOR SEQ ID NO: 168: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 7963 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168:



TAATCTATTA TAAAAACTGT CCATACCCTT TGATTACCTT CTCTTCAGGT ACAGGCCACA 60

CTTGAQGCCA TAAGCCATAT GCTT6CTGTG AATAAAATTG TGCCATTTGT AACAATATAA 120

TATATACAAA TAAACACCCA ATAATTGCTG TCACTAATGG CAAACCATTA 180

ATAAAACTGC AATAATTACT AACCTAAAGA TAATATTAAA TGCGTCTCTC CCTCTTATAA 240

AGCTTCTAAT AAATAAGAAT AAATACATCG CATTAGAGTT AAATTTACTA CCCTTTGGAA 300

CTGGTAAAAG TATATCTAGA TAACTTCTTC TGACTGCAQA TTCTTTCAAA TGTTTTACAT 360

CGGTGAACAT ATTAACAAAT TTATAATAAT TCATATGATG TCGATGTTCG ATTGCAATCA 420

TTTTCTCCCA AGGATACAAA AAGCCTGGTT TATATlTi'lT AACTAAAAAT TCTATTAACA 480

CAGGCAAAGC AACCATCACA AATGCGATGT ACCATTTTGG AGCTAATAGT AAGTAATATG 540

TTAGAGCAAA GGTGATGAAT GATATTAAAT TAACTTGCCA TGTTTTAAGT CCCGATTGAT 600

ACCATTGCCA TCTTAAGCGT AAACCAACAT ATGGAAAAAT TAATGCACTG ACTCCAAAAC 660

AAATATAAAA TGCCACATTA TGTTGATTAA TATTGTAAAA CAACGGGAAC ATTACAATAA 720

CAAXAAXvjAu TTQGAITAAT ATGCGCGCAA AGTAACTATA TAAAATCGCA TGACGCATAA 780

Ai IXxAuAUAT GTGTTTTTCA AATGGTAATA AAAAGATTTT ATCCgCTTCT TTTAACAGTG 840

\>iL.SU1ii ITiv? AAAAATAGrX GTCAACGCAA CAATCACTGC TGCTATTaAT GAAAAATTGa 900

AATAT61TTT AACuAi i CAL CATATCwUTA AATAAATGCA WWCAGCAAAA

TAASTAAAAA GACCATGAAA TGACCATTAA ATATAAACTT ATTATAATAA TTTTtCTCTT 1020

TACGAAGGGC ATGTAATCTT TTATTAAATA atgtggtAgc TTGGTTACGC ATGTACATCT 1080

CCACCTTGCG TCACATGAAT ATATATATCG TCTAATGTTT GATTATGTAA GCCAGTTTGT 1140

TGTCTCAATG CTTCTAAATC TCCAAATGCA ACGACTTCAC CTTCGTCTAG TATGaTAAAA 1200

GGATCACAGT AACGTTCAGC TGTTGCTAAA ATATGTGTAC TCATTAGAAC GGTTCTACCT 1260

TCGTTTTTCT TTTCAACCAT TAAATCTAAC ATGGATTGAA TTCCTAATGG ATCTAGGCCA 1320

AGGAATGGTT CGTCTATAAT ATACAATTCG GGATTAACGA TAAACGCACA AATAATCATG 1380

ACTTTTTGTT TCATCCCCTT AGAAAAATGA CTCGGAAAAA CTTTCAACTC ATTTTCTAAA 1440

CGGAATGTCT TTAATAATGG CATTGCTCGA TTCATCGTTT CATCACGATC AATATCATAT 1500




TCCGGAATAT AAGATAACTT TCTTCTATAA GCCTCTATGT CATCATTAAT GTTGATATCT 1620 

GAAATTGATA GAGATCCTTC CATAGGTGTA AGCAATCCTA GCATATGTTT AATCGTTGTA 1680 

CTCTTACCAG CGCCATTAAG GCCAATAAGT CCAACAATTT CGCCTTTGTT TAATTCAAAA 1740

TTTATATCTT TAATTACAGG GCGTTTTCCA TATCCACCTG TAAGCTGTTC TACTTTAACT 1800 

GTCATAAGGC ACCTCCATGA CTTATATTGT ACCAAAAATT ATAAAATGCT CATATTAAAT 1860 


ACACATGTCC TAATATCGAA TTTTTAGCGA CAATGTTATA ATGAATGGTA ATACTAGTTG 1920 

AAAAGGAGTG TAGTCATCAT GTCAGAAACA ATTTTCGGCA AAATTTTAAC TGGAGAAATT 1980 

CCTAGCTTTA AAGTATATGA AGACGATTAT GTCTATGCCT TTTTAGATAT ATCACAAGTT 2040 


ACTAAAGGAC ATACGTTATT AATTCCTAAA AAAGCTTCTG CTAATATCTT TGAAACTGAT 2100 

GAAGAAACAA TGAAACATAT CGGTGCAGCA TTACCTAAAG TAGCAAATGC TATTAAGCGT 2160 

GCATTTAATC CTGATGGTTT AAACATTATT CAAAATAATG GTGAGTTTGC AGATCAATCT 2220

GTATTTCATA TTCATTTCCA CTTAATTCCT CGATACGAAA ATGATATTGA TGGATTTGGT 2280 

TATAAGTGGG AAACACATGA AGACATTTTA GATAACGATG CAAAACAACA AATTGCTGAA 2340 

CAAATTCAAG CACAATTTTA AATGTATGCT TAATCTAAGC TCGAACGGGT ATAATATGAT 2400

TAATATTATA ACAATTGCGT TTGAAGTGAT AACATCAAGG TTAGCAATTT TAAACAAAAT 2460 

GAGTTATCAA GATAACAGAT GTTAAAAGTG AGGAGAATAT AAATGAAAGC ATCACGCATT 2520 

CTATTCGGTA TCGGTGTTGG CX3TAGCAGCT GGTTTTGTAG TTGCACTTCA AGGACGTGAC 2580 

GACAAAAGTG TCAAGAACAA CACGATCGAT CGTACTGCCC CTACTGGTTC AAAATCAGAA 2640 

CTACAACGTG AATTTGAAAC GATTAAACAA AGTTTTAATG ACATTTTAAA CTATGGTGTT 2700 


CAAATTAAAA ACGAAAGTGC GGAATTTGGT AGTTCAATTG GTGGTGAAAT TAAGTCATTA 2760 

CTTOGAAACT TCAAATCTQA CATTAATCCT AATATTOAAC QTTTACAGTC ACACATCGAA 2820 

AATTTACAAA ATCGTGGCGA GGATATTGGA AACGAAATTT CTAAGTAGCA GGTTACGTTC 2880 


TCX5ATCACAA CTATTTTTAT TAGTAACAGC ATATTTATTT TTTAAAATTA AATGCCAAAT 2940 

AAACGAGATG ACATTAGAAA TTAGATATTT CTTGTCATCT CTTTTTTAAA ACTCAAATGA 3000 

ACTTATGrrr ACAAATTATA GGAAGACATT GTTTGTAGTG ATTTTCGCTT AAATCATATT 3060

TATGAATTGA TTGAAAACAT TGCTTAGGAT TCATTGTGTT ATCCtTGCAC TTTGATTACG 3120 

CTTTACTTAA ATCATTATCG ACAAACAACA TACTTATATT TTCATTGAGC CGAACCTTAT 3180 

ATACACATTA CATATACCTT ACTTGCACAA ATTATTAATC TGGTGTTTAT TATAATTACA 3240

TATCACTATA TTTTTAGCAT TTGTATAACT TAGTTGGTCA AAAGATGCTT TTGCATATGC 3300 
TTTCATAAGT GATGCTTTAT TAGCAAGAAT ATGTGTTCGC AGAAATTTGT TCTGCATTCT 3420

ACTTCTACGC TAGTCAATCA GACAATTTTA CCAATCCCCA CTTTCGCGTT TCAAATCAAA 3480 

CAATACGTCG CTCCTTTCTT CTTATATAAC AATTCTTCTA ACATGATATG TTACTATTGA 3540 

ATTACTGAAC CTGAGTTAGT TATAATCTAA CTTATATTGA AAAGAGATGA GGCGTAAGAT 3600 

ATGTTTTTAT GTAAAAGACA AATTGATATC AATGCACGAT TTGGTTTGCC TAGAATTGCA 3660 

TTTATGAGTG CAGTTGCAAC CATCATTATG TTTTTAGTTA GTTATGAAGT AATGTATTTT 3720 

TTATCTAATA CGCCATTATC AGATAGACAT TTTCTCATCT TTTTATTACT TGTATTTATG 3780 

ACGTATCCAT TACATAAAAG TATACATTTA TTATTTTTCT TACCATATAG AAAATCGTTT 3840 

AAAGTTCATA AGTTAACTAA AAGAAAATGG CTTATATTCT ATAATACCTA CGTCAATCAA 3900 

CCTGTACACA AATTTTATTT TTGCATTAAC TTAATATTGC CGTTAATTAT CTTATCTGCA 3960 

ATGTTCGTTT ATCTAACAAT TTCATTCCCG CAATATGGAC ATTATTTTAT GTTCTTATTG 4020 

GCATTGAATT TCGGTATTTC CATTACAGAT TTATTATATT TAAAAATAAT TATATTTTCT 4080 

AATTATGGAC AATATATAGA AGAACATAGT ACAGGTATTA ATATTTTGAA AAAAATTAAA 4140 

AATCCATATC ATTTATAACA AAATAATTAT AGCAAGGTGT TATTATTTGT TTTTAGGCTA 4200

TGTAATAgcT tACAATCAAA TGTATATAGA CCTrGTTTTT TTATTTTCAT CAATTTCTAC 4260 

CCCTAAACCT AATGCTCTAG TCTGATGTCA TGGGTTATTG ATTGGTGATA ATATAAAACT 4320 

ATGTTATATT CACGATGATT AACTTACAAA GGAGTTTCAA CTATGAAGAT GATAAACAAA 4380

TTAATCGTTC CGGTAACAGC TAGTGCTTTA TTATTAGGCO CrTGTOGCgC TAGTGCCACA 4440 

GACTCTAAAG AAAATACATT AATTTCTTCT AAAGCTGGA6 ACGTAACAGT TGCAGATACA 4500 

ATGAAAAAAA TCGGTAAAGA TCAAATTGCA AATGCATCAT TTACTGAAAT GTTAAATAAA 4560 

ATTTTAGCTG ATAAATATAA AAATAAAGTT AATGATAAGA A6ATTGACGA ACAAATTGAA 4620 

AAAATGCAAA AGCAATACGG CGGTAAAGAT AAATTTGAAA AGGCCCTTCA ACAGCAAGGT 4680 

TTAACAGCCG ATAAATATAA AGAAAATTTA CGTACTGCTG CTTATCATAA AGAATTACTA 4740 

TCAGATAAAA TTAAAATCTC TGATTCTGAA ATTAAAGAAG ACAGCArGAA AGCTTCACAC 4800 

ATTTTAATTA AAGTTAAATC TAAGAAAAGC GACmAAGAAG GCTTAGATGA TAAAGAAGCG 4860 

AAACAAAAAG CTGAAGAAAT TCAAAAAGAA GTTTCAAAAG ATCCAAGTAA ATTTGGTGAA 4920

ATCGCTAAAA AAGAATCAAT GGATACTGGT TCAGCTAAAA AAGATGGCGA ATTAGGTTAT 4980

GTTCTTAAAG GACAAACTGA TAAAGATTTT GAAAAAGCAC TATTTAAGCT TAAAGATGGT 5040

GAAGTATCAG AGGTTGTTAA ATCAAGCTTT GGATATCATA TTATTAAAGC TGATAAACCA 5100 
AAAAATCCAA AATTATTGAC TGATGCATAC AAAGATCTAT TAAAAGAATA CGATGTTGAC 5220

TTTAAAGATC GTGATATTAA ATCAGTTGTC GAAGATAAAA TCTTAAACCC TGAAAAACTT 5280 

AAACAAGGTG GCX3CACAAGG CGGACAATCC GCCATGAGCC AATAACACAA AACCGAGCGA 5340 

CCGTGGTTCA AAAATCATAC CACGGCCGCT CGGTTTTTTC GCATTAAAAA TCGGACAGAT 5400 

GAGCTCATGT TTCAGTATAC TCATCTGTCC GATATCTTTT AATTCTTAAT CGAGTGATTC 5460 

AGGATTGTAG AATCTACGAT TTTCAAGACC AAATATTTTA TCTGTAAACT GACCCTTGTC 5520 

AGTTTTTTTA TATGCCTTTT CAAACATATT CATTCTAGCA TCGATATTAT CGATATAGCA 5580 

TAAAATTTCT GCTTCTTTTA AGTATGGCAG TTTTGGAGAA CCATACTCTA ACTTACCATG 5640

ATGAGATAAA ATCATATGTC TTAACAACAT GATTTCTTCT CCTTCAATGT TCAATTCACG 5700 

AGCTGCTTCA ACTACTTCAT CACTCGCAAT CGAGATGTGT CCTAATAAGT TACCTTCGAC 5760 

TGTATACGAC GTCGCAACAG GACCACTCAA TTCTCTAACT TTACCAATAT CATGCAAAAT 5820 

AATACCACTA TATAACAAAC TTTTGTTrAA CAATGGATAA ATGTCaCAAA TTGATTTTGC 5880

AATACGTAAC ATCGTTAATA CATGATAGCT TAAGCCACTC GCAAAGTTAT GaTGATGAGA 5940 

ACTAGCAGCT GGATATGTGT AAAATCGTTC TTGATATTTT TTCAATAAAT GACGTGTGAT 6000

ACGTTGTAAA TTAGCATTTT CAATATCTAG CAAATAATGA GAAATCTCTT CTTGTATTTC 6060 

TGCCGGTGAT AAAGGTGCAC CATCTACAAA TTGTTCTGTT TTTAATTGAT CTTCAGTTGT 6120 

CGCTAGTCTA ATTTGGTTGA CTTTCATCTG TTTATTTCCG CGATAGTTTA TGATGTCACC 6180 

TTTAACATGT ACAATTTCTT CAGGCTTGAT TGTTGCCATA TCATTTTTTG TAGCCGTCCA 6240 

AAATTTCGCT TCAATTTCAC CACTTTTATC TTGCAAATGT AATGTCATAT AATCTTTACC 6300 


TTGTGCTGTT ACACCCTGT6 TAGCTTTATG CACTAAGAAA AAGTGATCAA CTGAATCTCC 6360 

GGGA^TTAGA TTCTCTATAT TTCTCATCGT TTCCCGCCTT CCTCTA'l"i"l"i' GTTTAATGTA 6420 

ATCACTTCTT TTGATGGAAC AATATTATCT TTTACACATG TAAAGTATAG TACTTGATAG 6480 

TGTTCTGATA ATGATCGTAA ATAATTCAAC ATTTTTTCAG TACGTTTTTT ATCAAAATGA 6540 

ACAAATGCAT CATCAACAAT TAATGGGAAC GGATAATATG GTCTTAGTAC CTTAATTAAA 6600 

CTGATACGTA AAGCTACATA AAGTAATTCT TTTGTAGATT GACTTAGTTC AACAGGATCA 6660 

TATAATTGAC CATTAACATG TTTAACCGTA ATTGAATCTT CATTATAGTT AATCATCGTA 6720 

TATCTGCCAT CTGTTAAATG CTTCAATATT TCTACCGCTT CATTAATAAC TTGAGGCAAA 6780 

CGTTTATCTT TAATTTGTTT AATGTGTTCA TCAACTAAAC TTTGTAAATA ACTTAAACTT 6840

GCCCAATCTT TTGCGATATC ATTAAGTTGA TTTTTAAGAC TGTGATATTC ATGTCTTAAA 6900 
GCTTGCATTT CAAGATATTG CTCATTATAT TCOTCAACTT GAGTAGCCAA TAAATGATCT 7020

TCTTCTTCAA GTTGTOCAGT TOTTTTTTCA CTTAAACTAG AACTTAATTC ATAAGAATAG 7080

TTTTGGTTCT CAAGATATTT AGTTAAATCA TTAAAACGAC TCAAATTACT AGTATAAGTT 7140

TGGTAATCTT CATGATGTTG GTAAAAATCT TCTTCAGTAC CAACATTGAT AAAATCGAAT 7200

AGTGCT6TAA TTTCTTTATT ATTTTCTTCT AATTGAGCAT TTAAATGATT TAATTCATTT 7260

GTAACAAGTT TGGTATTTTC AGCATTAATA CGCCATTTTT CATTCGTGTC TTCAGCTGAT 7320

TTCAACCATT GTtGCACATC GTGGAATAAA GATAATTTGT TGAAATAAAC AAATTGTGAT 7380

TTTGTAACAG CTTCAGCATG ATTGTAGAAT GTATCTAATT CTTGAACCAA TTGCTGGCGT 7440

TGTTGATTTA AATCACTGAT ATGTTGATCT AATGCTTTAA TATTCGCCAT TGTAGAAATA 7500

CTATCAACAA TTAAATCATT TGAAATTTTA GATGATAAGT ATAATTCATC CTTAACGTTC 7560

TCAACTGTCG ATTGTAATTC ATCATGACGC CCTTTCGCAT CATTTAAACG ACCTTCAATA 7620

TACTGACGTT TCTCTTCTAA AATATCTTTA TTTTTCAAAG CTTGTTGCCA GTGATCACGA 7680

ATGCGATATT GCTCATCAAG ATCAAAATCT AAGTCATAAT TTTCATCTAA AATGGCTAGT 7740

TGTGCTTTAA TTTCTTCGAT TTCATCTGTG ATGGCCTCGC TATAATCTAC TTCTTTTGAT 7800

TTAGACATGA TGATACCGAT AACAAATACT AAAGTTAATA CTGCX3AAAAT AATACCAAAC 7860

AACATGTTGT TTGAAATAAA TGAGAAGGCA GTTAAACCAA TACCTACTAA TGTTAAAAGr 7920

ATAAACXmXS TTCXSkAACAA TTTTTGACGT TTTTGttTTT CTT 7963



(2) INFORMATION FOR SEQ ID NO: 169: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3958 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS : double
(D) TOPOLOGY: linear


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 169: 
ATATTGTCTT TACAATAGTT TGCTATGGAG GTAATTAACC AATAGGAGGA ATTTATAATG 60 
GCAGTAATTT CAATGAAACA ATTACTAGAA GCGGGTGTTC mCttCGGTCA CCAAACACGT 120 


CGTTGGAACC CAAAAATGAA AAAATATATC TTCACTGAGA GAAATGGTAT TTATATCATC 180 
GACTTACAAA AAACAGTGAA AAAAGTAGAC GAGGCATACA ACTTCTTGAA ACAAGTTTCA 240 
GAAGaTGGTG GACAAGTCTT ATTCGTAGGA nCTAAAAAAC AAGCACAAGA ATCAGTTAAA 300
TCTGAAGCAG AACGTGCTGG TCAATTCTAC ATTAACCAAA GATGGTTAGG TGGATTATTA 360 
GAAGATGGTT TATTCGAAGT ATTACCTAAA AAAGAAGTAG TAGAACTTAA AAAAGAATAC 480

GACCGTTTAA TCAAATTCTT AGGCGGAATT CGTGATATGA AATCAATGCC TCAAGCATTA 540

TTCGTAGTTG ACCCACGTAA AGAGCGTAAT GCAATTGCTG AAGCTCGTAA ATTAAATATT 600

CCTATCGTAG GTATCGTTGA CACTAACTGT GATCCTGACG AAATTGACTA CGTTATCCCA 660

GCAAACGACG ATGCTATCCG TGCGGTTAAA TTATTAACTG CTAAAATGGC AGATGCAATC 720

TTAGAAGGTC AACAAGGCGT TTCTAATGAA GAAGTAGCTG CAGAACAAAA CATCGATTTA 780

GATGAAAAAG AAAAATCAGA AGAAACAGAA GCAACTGAAG AATAATCAAC TGTTGAATCT 840

QACTTAGATA TAGTTTAAAT GGGTGATAAG ATATTAATGC TTATCACCTT TTTTAAAAAG 900

AAAATCGAGG CAAATTACAA ATATTCAATT AGAGTATTGG CAATCTTGCC TATAATAATG 960

CTAAAATCAT AATATATAAn ATGATAACTT ATTGGAGGAA TAATGAATGG CAACTATTTC 1020

AGCAAAACTT GTTAAAGAAT TACX3TGAAAA AACTGGCGOG GGTATGATGG ATTGTAAAAA 1080

AGCGCTAACT GAAACTGATG GTGACATCGA TAAAGCX3ATT GACTACCTAC GTGAAAAAGG 1140

TATTGCTAAA GCAGCTAAAA AAGCAGACCG TATTGOGGCT GAAGGTTTAG TACATGTAGA 1200

AACTAAAGGT AACGACGCAt TATCGTTGAA ATCAACTCTG AAACAGACTT TGTTGCTCGT 1260

AACGAAGGTT TCCAAGAGTT AGTTAAAGAA ATCGCTAATC AAGTATTAGA TACAAAAGCT 1320

GAAACTGTTG AAGCTTTAAT GGAAACAACT TTACCAAATG OTAAATCAGT TGATGAAAGA 1380

ATTAAAGAAG CAATTTCAAC AATCGGTGAA AAATTAAGTG TTCGTCGTTT TGCTATCAGA 1440

ACTAAAACTG ATAACGATGC TTTCGGCGCT TACTTACACA TGGGTGGACG CATTGGTGTA 1500

TTAACAGTTG TTGAAGGTTC AACTGACGAA GAAGCA6CAA GAGACGTTGC TATGCATATC 1560

GCTGCAATCA ACCCTAAATA TGTTTCTTCT GAACAAGTTA GCGAAGAAGA AATCAACCAC 1620

GAAAdAGAAG TTTTAAAACA ACAAGCATTA AATGAAGGTA AACCAGAAAA CATCGTTGAA 1680

AAAATGGTGG AAGGACGTTT ACGTAAATAC TTACAAGAAA TTTGTGCTGT AGATCAAGmT 1740

TCGTTAAAAA CCCTGATGTA ACAGTTGAAG CTTTCTTAAA AACAAAAGGT GGAAAACTTG 1800

TTGACTTCGT ACGCTATGAA GTAGGCGAAG GTATGGAAAA ACGCGAAGAA AACTTTGCGG 1860

ATGAAGTTAA AGGACAAATG AAATAATCTG TCATAAAGTA AAACAAGGAA GAAGACACCT 1920

TTAATGTTGC TTTATTAAAA TGTAAATCAT TCTAATAAAA CGACAACTGT GTCTTCTTTA 1980

CTTGTATATG TTACATATAT TCACGATAGA GAGGATAAGA AAATGGCTCA AATTTCTAAA 2040

TATAAACGTG TAGTTTTGAA ACTAAGTGGT GAAGCGTTAG CTGGAGAAAA AGGATTTGGC 2100

ATAAATCCAG TAATTATTAA AAGTGTTGCT GAGCAAGTGG CTGAAGTTGC TAAAATGGAC 2160
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5333 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170: 

ATTAAAACAA CTTAATATAC CTATTTATGG TGGTCCTTTA GCATTAGGTT TAATCCGTAA 60 

TAAACTTGAA GAACATCATT TATTACGTAC TGCTAAACTA AATGAAATCA ATGAGGACAG 120 

TGTGATTAAA TCTAAGCACT TTACGATTTC TTTCTACTTA ACTACACATA GTATTCCTGA 180 

AACTTATGGC GTCATCGTAG ATACACCTGA AGGAAAAGTA GTTCATACCG GTGACTTTAA 240 

ATTTGATTTT ACACCTGTAG 6CAAACCAGC AAACATTGCT AAAATGGCTC AATTAGGCGA 300 

AGAAGGCGTT CTATGTTTAC TTTCAGACTC AACAAATTCA CTTGTGCCTG ATTTTACTTT 360

AAGCGAACGT GAAGTTGGTC AAAACGTAGA TAAGATCTTC CGTAATTGTA AAGGTCGTAT 420 

TATATTTGCT ACCTTCGCTT CTAATATTTA CCGAGTTCAA CAAGCAGTTG AAGCTGCTAT 480 

CAAAAATAAC CGTAAAATTG TTACGTTCGG TCGTTOGATG GAAAACAATA TTAAAATAGG 540

TATGGAACTT GGTTATATTA AAGCACCACC TGAAACATTT ATTGAACCTA ATAAAATTAA 600 

TACCGTACCG AAGCAT6AGT TATTGATACT ATGTACTGGT TCACAAGGTG AACCAATGGC 660 

AGCATTATCT AGAATTGCTA ATGGTACTCA TAAGCAAATT AAAATTATAC CTGAAGATAC 720

06TTGTATTT AGTTCATCAC CTATCCCAGG TAATACAAAA AGTATTAACA GAACTATTAA 780 

TTCCTTGTAT AAAGCTGGTG CAGATGTTAT CCATAGCAAG ATTTCTAACA TCCATACTTC 840 

AGGGCATGGT TCTCAAGGTG ATCAACAATT AATGCTTCGA TTAATCAAGC CGAAATATTT 900 

CTTAeCTATT CATGGTGAAT ACCGTATGTT AAAAGCACAT GGTGAGACTG GTGTTGAATG 960 

CGGCGTTGAA GAAGATAATG TCTTCATCTT TGATATTGGA GATGTCTTAG CTTTAACACA 1020 

CGATTCAGCA CGTAAAGCTG GTCGCATTCC ATCTGGTAAT GTACTTGTTG ATGGTAGTGG 1080 

TATCGGTGAT ATCGGTAATG TTGTAATAAG AGACCGTAAG CTATTATCTG AAGAAGGTTT 1140 

AGTTATCGTT GTTGTTAGTA TTGaTTTTAA TACAAATAAA TTACTTTCTG GTCCAGACAT 1200

TATTTCTCGA GGATTTGTAT ATATGAGGGA ATCAGGTCAA TTAATTTATG ATGCACAACG 1260 

CAAAATCAAA ACTGATGTTA TTAGTAAGTT AAATCAAAAT AAAGATATTC AATGGCATCA 1320 

GATTAAATCT TCTATCATTG AAACATTACA ACCTTATTTA TTTGAAAAAA CAGCTAGAAA 1380

ACCAATGATT TTACCAGTCA TTATGAAGGT AAACGAACAA AAAGAATCAA ACAATAAATA 1440 
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GCTTTTTCTT TATATATGAT GAGCTTGAGA CATAAATCAA TGTTCAATGC TCTACAAAGT 1560 

TATATTGGCA GTAGTTGACT GAACGAAAAT GCGCTTGTAA CAAGCTTTTT TCAATTCTAG 1620 

TCAGGGGCCC CAACATAGAG AATTTCGAAA AGAAATTCTA CAGGCAATGC GAGTTGGGGT 1680 

GTGGGCCCCA ACAAAGAGAA ATTGGATTCC CAATTTCTAC AGACAATGTA AGTTGGGGTG 1740 

GGACGACGAA ATAAATTTTG AGAAAATATC ATTTCTGTCC CACTCCCGAT TATCTCGTCG 1800 

CAATATTTTT TTCAAAGCGA TTTAAATCAT TATCATGTCC AATCATGATT AAAATATCAC 1860 

CTATTTCTAA ATTAATATTT GGATTTGGTG AAATGATGAA CTCTTTGCCT CGTTTAATTG 1920 

CAATAATGTT AATTCCATAT TGTGCTCTTA TATCTAAATC AATGATAGAC TGCCCCGCCA 1980 

TCTTTTCAGT TGCTTTCAAT TCTACAATAG AATGCTCGTC TGCCAACTCA AGATAATCAA 2040 

GTACACTTGC ACTCGCAACA TTATGCGCAA TACX3TCTACC CATATCACGC TCAGGGTGCA 2100 

CAACCGTATC TGCTCCAATT TTATTTAAAA TCTTTGCATG ATAATCATTT TGTGCTTTAG 2160 

CAGTTACTTT TTTTACACCT AACTCTTTTA AAATTAAAGT CXITCAACGTA CTTGATTGAA 2220 

TATTTTCACC AATTGCCACA ATGACATGAT CAAAGTTACG GATACCTAAA CTTTTCATAA 2280 

CTGCTTCATC TGTAGTGTCT GCAACAACCG CATGAGTAGC GATATCACTA TATTCATTCA 2340

CTCTATTTTC ATCATGGTCG ATGGCCATTA CATCCATGTC TAATGCATTC AACTCACGAA 2400 

CGATACTACC TCCAAAACGA CCTAGACCGA TGACTACATA TTCTTTACCC ATACTCGCCC 2460 

TCCATTAAAT GATTTTCATC AATTCATTGA AAATATAAAT TTAAAATTAT TATAAATGAG 2520

TACCCCAACT AAATTATCTA AATGCAGTAA TGCAAGTAAA TGAAAGTTGG GGTATCGTCT 2580 

CAACTTATGA TTTCTTTCCT TCAACATATT CTTTGTCGAA AACAAATAAT CTTAATAATA 2640 

ATATTAACGA TGGAAGTAAT AAAAGTAAAC CTAAAATAAA GACAATCACT AATGTCCAGC 2700 

CCA'm'CTGG ATTAACATAT GCATCTGTAA TTTTTACAAA CGGATATAAA AGGTATGGCA 2760

ATTTACTAAT TCCATAGCCA AAGAACGCGA ACATCATTTG TAAAATAACA AATACAAAAG 2820 

CCAAACCATG TTTTTTCTTA AAGAATGTTA ACAATGAAGC TAATGCAAAG AATAAGAAAC 2880 

TTATACCAAA CATCCACCAA TAGTCAAAAA CAGCTGAATA AAAATGTTCA GAATTTTGAA 2940 

TGCGTAATGA TAGAAATACG AATAAACAAA TGATAATCAT CGGCGGCCCT AAAAATATGT 3000 

GCCATTGTCT TGTTAAATTA TATGCTGGTT CGTCATTTGC TTTTTTAGCA TAATATGTCA 3060 

AAAATCCTGA TGAAATATAT AAAACTGAAA TAATTGCCAA GAATACTACA GACCAAGCAA 3120 

ATGGGCTTAA TAATAACTGC ACCCAATCTA GATCGATAAC ATTGTTTCGA ACATTAATAT 3180

AGCCACCTTC TGTAATAGTT AAAGCAGTAG ATAATGAAGC TGGAATTAAT AATCCACTTA 3240 
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AACTGTTTCT CAACGATATC ATAATCAGTG CTATTGAACC TGGTATTAAC AATACCGTGC 3360 

CTAAATATTT GATTGACTCT GGAAAGAAAC CTACGAATCC TACGAAGAAG AAAACAAAGA 3420 

ATACATTCGT AACTTCCCAA ACTGGGTTTA AATAACGTGA AATTAAGTGA TTAATTTTCT 3480 

TTTCATCACC AGTTAACTTT GAATGCAATG CGAAGAAACC TGCCCCAAAA TCTATAGAAG 3540 

CAATAATGAT ATAGCAAAAT AAAAACAACC ATAACACTGT TATACCTATA AATGCATAAA 3600 

TCATTTTTCT ATTTCTCCTC CTTGCTTCTT GGCTAAACGA TTTACATCTT CATACGCCGG 3660 

TTTATTTTTA AACATACGAA TTAATACX5TA TGCACATGTA TACATTAAAA TGATGTACAA 3720 

TATGCCAAAT AAAATTGTAA CGAaGGTTAT TCCGCCTGCT TGTGTTGCTG CTTCTGCCAC 3780 

GCGCATATAA CCACGAACAA TCCAAGGCTG TCTACCCATC TCTGTTAAGA ACCATCCAAA 3840 

TTCTATAGCT AGCATTGAAG CTGGGCCTGT TAATAATATT CCATAAAGCA TCCATTTATG 3900 

AGTAGAAAAC TTTCTAAGCT TTTTAAACAT TAAAGTTAAG ACATAAACAC CTGAAATGAC 3960 

AAAACATAAA ATTCCCATCG TTACCATTAA ATCAAAGAAA TAATGOACGA TCATAQ6CQG 4020 

ATGTAAACTT TTTGGAAAAT CATTTAACCC TTGTACTTTA GmTGACAC TATTATCT6C 4080 

TAAGAAACTC AATAGTCCAG GTAATTCAAT CGCACCTTTA ACTTGCTGAG TCTTTTCATC 4140

TAACACACCA AATAATAATA ATTTGOCATG GGAA6ATGTA TCX3AAATGCC ATTCATAAGC 4200 

TGCTAATTTT TCAGGTTGGA ATTTATGCAA AAATTTTGCA GATAAATCCC CTGCCAACAT 4260 

AOAAAGTAAT GTTGAAAAGA ATCCAACTAT CATAGACATT TTCAAAGCTT TCTTATGGTA 4320

GACAGTATCT TTAGGTTGAC GATTACGCAA TAATTTAAAA GCTGCTATTG ATGCAATAAC 4380 

AAATGCCATC GTCATACCGG CTGTAGTAAT TACGTGAAAT GATCGAACTA TAAACGATGG 4440 

GTTAAACATC 6CTTCTATAG GTTQAACATT GACCATCTTT CCATTCTTCA ACTCAAAACC 4500 

TGCA6GCGTA TTCATAAATG AATTCACTGA AGTAATGAAG AATGCTGAGA AAGAGCCACC 4560 

AATAATTACT GGTATACTAA TTAAGAAATQ TGTCCATTTA TTTTTAAAAC GATCCCAAGT 4620 

ATATAAATAT ATACTTAAGA AAATAGCTTC AAAGAAGAAC GCAAATGTTT CCATAAATAA 4680 

TGGAAGTGCA ATAACGTGTC CACCCATTTC CATAAATGTA GGCCAAATCA ATGATAATTG 4740 

AAGTCCTATA ATTGTACCTG TAACAACTCC CACTGCTACA GTAATTGTAT AAGCTTTAGC 4800 

CCATCTTTTG GCCATAGCTA TATATTGAAG ATCATTTTTG CGAATACCTA AAAATTCTGC 4860 

AATTGCGAAC ATTAAAGGCA TACCAACACC AATCGTTGCA AAAATGATAT GAACTGCTAA 4920 

AGTCATAGCT GTCAAAAACC GACTGATTTC AACTGTATCC ATTTAAAAAC ATCACCTTTT 4980

TCTTTTTTTG ATGACAACAC AATGAACTTA ATTATAATTG CTATAATGTG TATTTTTAAA 5040 
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GAATTTCAAT GTATAATTGT GTATATTACA TTAGAATAAA GCACGAAGGA GCATGATACA 5160 

TGTCAGAAAT AATCGTTTAT ACGCAGAATG ATTGTCCACC TTGTACATTT GTAAAAAATT 5220 

ATCTAAATGA GCATCACATT GATTTTGAAG AGAGAAATAT CAACAATCAA CAATATCGAA 5280 

ACGAAATGAT AGATTTTGAT GCTTTTTCAA CTCCGTTTAT TTTGTTGAAT GGC 5333 
(2) INFORMATION FOR SEQ ID NO: 171:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11126 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171: 



ATACGTGACC CTTTATCCGA AAATTTCTTT TCATATTCTG TTAAAATATT ACTGCCATCG 60

TCTTCTTGAT GTAAATTTAG ATTTATTTTT GTAAAATACA TTCCAAATTO AGACATAGTT 120

TCTAAACTGT AGGCAAATAG TCCTCTGTTA TCAGTTTTAA AATGTAAATC TCCTTCATCA 180


TTTAAGATTT GTTGATACAA CGCTAAAAAC GTATGATACG TTAAAOGTCG TTTTGCATGA 240

CGATTTTTTG GCCATGGATC TGAAAAGTTC AAATAAATAC GCGAAACTTC GCCGTCTTTA 300

AAATATTCAT TTAATTCAAT GGCGTCATTA CAAATAATCT TTAAATTTGT TAAACCCATC 360


TCTTTAACTT TATCCAATAC TTTATAAACG ATACTTTTCT CACGTTCCAT TGAAATATAG 420

TTAATATGAG GATTTTGAGC AGCTAATGTT GTAATAAACT GCCCCATACC CGAACCAATT 480

TCAATGTGTA TCGGTTGCGT TTTaTCAAAC CATTCAGTCA TTTTCCCTGc ATGTTGACCG 540


TCCATGTCAA CCAATTCAGG ATGATCTTTT AAATAATCTT CAGCCCATGG TTTGTATCGA 600

ACTCTCATAT TTTATTCTCC TCTTAAATAA ACATGTTACT ATTCATAACT TCATTTAGGA 660


ATTTAAGCCA AGTGTTCATA TCCTTATATC TTTTTTGCTC TTCATACCAT TGAACAAGAC 720


CTATAGATTG AATTACCGTA TACCATTTCA TACGTTTATT TAAATTCAAG CTCTCTTGAA 780

CACCATATGT TTCAAGCCAT TCAGACCATT AACATAGTTG TAAAGCAGCA 840


TTCCGATATC AATTGCCGGG TCTGCAATCA TTGCACCTTC CCAATCAACT AAAAATAGTT 900

CATCTCGATC GGATAATAAC CAATTATTAT GATTCACATC ACCATGTACA ACAGTGAAAA 960

AACGCGAATC TAAACTCGGT ATATGCTCTT CTAAATAGGT TAATGATTTT CTCACAATAT 1020


GATGTGTTAA AACTTCTCTT GATAAAGAGG CATTAATTTT ATTAAGCATA ATCTCAGGAG 1080

TAATAGGTTC CATTTCCATA CGCTTTAACA TACTTAATAA AGGTCTAGAA TTGTGTATCT 1140
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TTTTCCAATG TTGTGCTGTA ACAACCTCGC CTGTTTCTAT GCGTTTCGTC CATACTAATT 1260 

TGGGCACAAT ACCTTCTGCT GATAATGCCG CAATAAATGG ATTTGAATTT CGTTTTAAAA 1320 

ACAACTTTTG TCCATCTTGT TCAGCCATAT ATGCTTCACC AGATGCACCA CCTGCTGAAT 1380 

CAAGTGTCCA CCCTAATTGA TAAAACTGCT CCAACTCGTC CACCTCACTT TCAATTAGAA 1440 

AATGGCTCTA GAAATAGGTT TTTCAAGAGC CATATATTCT AATTTATAAC ACCATACTGG 1500 

TACAAATATT ATGTCCAGAT AATTATTGTA AATCCTCAAC CAATGCCTAC ATTACACGAC 1560 

TAAATTTAAA TCGTAATGTC TGTCATTGAC ACCATACATT CTATAGTCAC TTACTTGACA 1620 

TATAATGTTA CCGTGTCTAA AACTACATGT TTTTGAATCT CTGTAGGCGA TAAACTcTAG 1680 

TTTTCAAAAT AATTGCTATC CCATTTTCAT GGTTAOCATA AATTTATQAA CTGTAACATT 1740 

TACGTACTTA GTAAAATATG ATGCACATCA TATTTGTrAC TCATAGAAAA TTTTATAAtT 1800 

TTTATCATTA TATTTCAACT GAAAATGAGA AACAAAATGG CACTTTTTAC TAATATGTGT 1860

TTTCTAAACA ACACTTTTAA GCTTCGTTTT AAATTATAAC ATAATTCACT TACGAAAGTT 1920 

C3ATAAATTTA AGTAATTTAA TCTAAAAATA TGATGAAAGA ATTTTAAATA CTGTGTGACT 1980 

CTATATACTT TTCAAATCCT TCTTGTAGTT GACGTGTAAT TOGOCCAACT TTACCATCAT 2040

TAACTGGTTC ACCATCTAAT TTAATAACAG GTGTAACCTC AGCTGAAGTA CTTGAAACAA 2100 

TAACTTCATC TGCGTTTTTC AAGAAATCTA CAGTAAACGT TTCTTCTTTA AATGGGATGT 2160 

TATAGTCTTC GGCAATTTTT TTAATTACAA TTCGTGTAAT ACCATTAAGA ATATAGTTGT 2220 

TAATCGGATG TGTATAAATC ACACCXSTCTT TAATTGCATA AGCATTACTT GAAGATCCTT 2280 

CAGTTACAGT TtCACCTCGA T6TTGAATTG CTTCAACTGC ATTATATTTC ACAGCATATT 2340 

CTTTTGCTAA TACATTcTCC TAATAAGTTC AAGCTTTTAA TGTCGCAACG TAACCATCGG 2400 

ATATCTTCAA CGGTAACACC ATTCACACCA TTTTCTAAAT GATCATAAGG ACGATCATAA 2460 

CTCTTTGTAT AAGCAACAAT TGCTGGTTCT ACTTCAGGTG TCGGGAAGCT ATGATTCCTT 2520 

TCAGCTACAC CACGCGTTGC TyGAATATAA ATTGCCCCAG TTTCAATTTG ATTCATATCA 2580 

ACTAATTTAC GAGATAGTTC AATTAATTCT TCTACAGAAT AATTTAAATC TAAACCAATC 2640 

TCATTGGCAC TACGTwAAAw TCTTTCATAA TGTTCTGTTA CTGTAAATAA CTTACCATTA 2700

TATACTCGAA TGTATTCATA AATACCATCG CCAAATACGT ATCCTCTGTC GTTGTATGAA 2760 

ACCTTTGCTT CACTTGGACT TACAAACTCA CCATTTAAAA AAATTTTTTC CATATATTAT 2820 

TCCTCCACGC ATAATGAATA AATTGCTTCT AAGTAAATAC TAGTTGCGTT AAATAACTGT 2880

TTTTTAGTGA TATATTCATT TTTCTGATGC ATTAAATCTT CAGAATCACT AAACATTGCG 2940 
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TCAGTCATAT CATTTGTTTG ATTTCTATAT GCAGTAACTA ACTTTTGTAC AAAAGGATCA 3060 

TTTTTATCAA CATAATGTGG TGGTTGGACT TTACCTAATT TCACTTCAAA GCCATATTGT 3120 

TGAATCTCAT TTGCAAAACG ATCCATAGCT TTTTCAAATT CAAATCCTTC TGGGTAGCGT 3180

AAGTTGATAC CGAAAAGACC TGCGTTTTCA TTATCATATG TAATAACACC AATGTTAGTT 3240 

GTCACGTCAC CCATGACATC TGTATGGAAT TTCATTCCCA TCTTTTCACC AAAATCTGAA 3300 

TTAAATAAGT AGCGATTACT AAATGCTACA AACGCTTGTG CATTATTATC AAGATTTAAT 3360 

GATGCTAAGA ATTTTAGTAA GTAAAGACCC GCATTCACAC CGATAGATGG ATCCATACCA 3420 

TGAACCGCTT TACCTTCAAC TGTTAAAACT AGAATGCCAC TATCAACAGT ACTATCACCT 3480 

TGTAAATGAT TTTGTTCTAA AAAGTACTCA AAGTCTTGAA TAACATCTGT CATATTTTCT 3540 

TTAACAAGCA CTCTTGCTTC TGCATGATCA GGTACCATGT TGTAACGTTC ACCAGATTTA 3600 

AAAGTTATTA ATTCATAATC AGGTTCATCT TGATCTTCAG TAAGTTTATT TTGAACTAAA 3660

TCAAATGTTG TAATGCCTTT TTCACCATGA ATACATGGAA ATTCTGCATC TGGTGCAAAA 3720 

CCTAATGTTG GCATTTCTTC TGTTTTAAAA TAGCGATCCG TACATTTCCA ATCAGATTCT 3780 

TCATCCGTAC CAATAATCAT ATGAATACGT TTCTTCCAAT CCACATTCAT ATCTTCTAAT 3840

ATCTTAATTG CATAATAAGC AGCAATTGTT GGACCTTTGT CATCAAGTGT ACCTCTAGCT 3900 

ATGATAGCAT CTTCTGTTAC AACCGGCTCG AACGGATTAC TATCCCATCC ATCACCAGCA 3960 

GGAACAACGT CAACATGACA TAAGATACCT AATACGTCAT TTCCTTTACC TGCCTCAATT 4020

CTTCCTGCAA TATGATCCAC ATCATGTGTt GTAAATCCAT CTCTATGTGC AATTTCATAC 4080 

ATGTAGTCTA ATGCCTTACG AGGACCTGGA CCAACTGGTG CGTCTTCTGA TGCTTTTGCA 4140 

TCATCTCTCA CACTTTCAAT TGCTAATAAT CCTTTTAAGT CATTAATGAT TTGATCTTCG 4200 

TATT6TTGAA CTTTTTCTTT CCACATTCGA AATCGACTTC CTTTTTTCTA TAAGTTAAAT 4260 

TCTATTTTAC ATGAAAAGAT ATAAAAACTA CAATAAGATG TCAGAAAATA ATAAAAAGGA 4320 

ACAAAACGAT GCTATTGATA TGACACAAAT CATAAATAGC TGCTTTGTTC CTTTTTTAAT 4380 

TTATATATTT AAAATACACA TATTCAAGAG CTCGAGATAT AAGTCAATGT ACTAGGCACA 4440 

CAATTTAATA TTGACAGTAA TTAACCGAAC GAAAATGCGC CCCGGGGCCC CAACATAGAG 4500

AATTTCGAAA AGAAATTCTA CAGACAATGC AAGTTGGCGG GGCCCCAACA TAGAAGCTGG 4560 

CCAATAGTTA GCTTTCAATA ATGTGCAAGT TGGGGTAAGG GCCCCAACAC AGAAGCTGGC 4620 

CAATAGTCAG CTTTCAATAA TGTGCAAGTT GGGGTAAGGG CCCCAACACA GAGAATTTCO 4680

AAAAGAAATT CTACAGACAA TGCAAGTTGG CGGGGCCCCA ACACAGAAGC TGGCCAATAG 4740 
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TAAAGAAATA CGTTTTCTTT 


AGATATTAGT 


ATTTCTTATG 


AATGAGTTTC ACGCATGTAT 


4860 




TCTTCTTTCT ATATGCATAT TAGCTATQAC TAACGATAAA GAACCTGAAA CACTAATAAA 4920

TGTCCTATAG TTTACAATAT TATATTGGCA GTAGTTGACT GAATGAAAAT ACGCTTGTAA 4980

CAAGCTTTTT TCAATTCTAG TCAACCTTGC CXSGGGTGGGA CGACGAAATA AATTTTGCTA 5040

AAATATGATT TCTGTCCCAC TCCCTTATCA TTTCTGTCCT ACTCACATCT TATTCTTTAT 5100


CAGATAATGC ATTTTTATTC TTTTTTAAAT CTTCTTCAGT GACGATACGT AAATTATTAT 5160

TTGGTGTGCG CCACCTTCAT CATCAAATTT ACCTTTTTCA ATACTTTCGT CAGTCTTATT 5220


GTCATATTCG GTAAATTTTG ATTTTTCTTC TTTGAAAAAT GCTTTTGGAT TATTTTTTAA 5280

TCTATTAGCA TATTCTTTCG GA'iTixyrrrr TACTTCTTTA ATTGTTTCAT TAGCAATTGT 5340

TCCTAATTGC GTCGCTTTAT CCTTAGCATT ATCTTTATAG CTTTGAGGAT CTTGTTTATA 5400


TTTATTATAT TCcTGcTTTC AGCTTGTCAC GACTATCTTT ACGTGTAACA AGTACAGCTG 5460

CTACAGCGCC ACCTATACCT AAAATCGCTT TAAATAAATT ACCTTTTGCC ATATCAATCG 5520

TCTCCCTTTT ATTTATAATT TAATTTGTCA AAATCATTTT CAGTTAATAA ACGATATTCT 5580


CCTGAATCTA AATTGCTGTC CAATTCTAAA TCAGCAATTT T6ATACGTCT TAAATGTAAT 5640

ACCTCATTTT GAATGCTATG AAACATTCGT TTAACTTGAT GATATTTTCC TTCATAAATT 5700

GTTACX5TGTG ACGTTTGATT ATCAATATAA GTTAATATTG CAGGCTTAAC CTTGCCATCA 5760


GTCAGTGTtA CACCCTCTTT AAAAGCTTGA ATGTCGTCTT CAGTGATAGG ATTTGCTGAA 5820

ATAACTTCAT ATTTTTTAGA AACATGTTTG TTTGGACTCA TTAATTCATG ATTAAAATCA 5880

CCATCATTCG TTATCAATAA AAGCCCTTCT GTATCTTTAT CAAGACGACC AACCGGAAAA 5940


ATATTTAGAT GTTGGTATTC AGGTATTAAA TCAATAACGG TTTTTGAATG ATGATCTTCA 6000

GTTGCTGATA TATAACCTTT TGGCTTATTT AACATAATAT AGACATTTTC AATGTATTCT 6060


ATTAATTCTC CACGAACTGT TATCTTATCG TTTTCTGGTT CTATATGTGT TTTTGGTGAT 6120

TTAATTACTT GTTCGTTGAC ATTTACAAGG CCTTTTTTAA GTAACTGTTT GACCTCATTA 6180

CGTGTACCGA CGCCCATATT TGCTAAAAAT TTATCTATTC TCATCGTAAA AACCTAACTC 6240


TACGTCTTAA TTTTTCAGGA ATTTCACCTA AGAATTCGTC CGCAAGACGC GTTTTAATTG 6300

TGATTGTACC GTAAATTAGA ATACCTACTG TAACACCTAA AATAATAATG ATTAAGTAAC 6360

CAAGTTTAGT AGGTTCTAAG AATAGATTTG CAAGGAAAAA TACTAATTCT ACACCTAGCA 6420


TCATAATAAA TGAATACAAG AATATTTTTG CAAAATGAAT CCAACTATAG CTGAATTTAA 6480

ACTTCGCATA TTTTTTAAGA ATATAGAAAT TACATCCAAT TGCAAATAAT AATGCOATAC 6540
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ACTTQATAAC TACAGAAGCT AAAATAACAT AAACTGTTAA TTTCTGTTTA TCTATACCTT 6660 

GTAACATTGA TGCCGTTACA CTTAATAGTG AAATTAGTAT TGCTACAGGC GCATAATAGA 6720 

ATAATAAGCG ACTACCATCA TGGTTAGGGT CATGACCTAA AACAATTGGA TCGTAACCAT 6780

AGAAAACTGT GAATAATGGT TGTGCCAAGG CCATAATTCC AATACTAGCT GGAACAGTTA 6840 

TAAACATTAA TACACCAATA GATGTTCTAA TTTGATGATG CATTTCATGT AAGCGACCTT 6900 

CTGCAAATGT TTTTGTAATA TAAGGAATTA AACTCACTGC AAAACCAGCA CTTAATGATG 6960 

TCGGAATCAT TACAATTTTA TTAGTrGACA TATTTAGCAT ATTAAAGAAT ATATCTTGTA 7020 

ACTGTGAAGG TATACCAACT AAAGATAAAG CACCGTTATG TGTAAATTGA TCTACTAAGT 7080 

TAAATAATGG ATAATTCAAA CTTACAATAA CGAACGGTAT ACTATAAGCA ATAATTTCTT 7140 

TATACATCTT GCCATATGAC ACATCTATAT CTGTGTAATC AGATTCGACC ATACGATCAA 7200 

TATTATGCTT ACGCTTTCTC CAGTAATACC AGAGTGTGaA TATACCAATA ATCGCACCAA 7260 

CTGCTGCTGC AAAAGTAGCA ATACCATTGG CTAATAAAAT AGAGCCATCA AAGACATTTA 7320 

GTACTAAATA ACTTCCGATT AATATGAAAA TCACGCGTGC AATTTGCTCA GTTACTTCTG 7380 

ACACTGCTGT TGGCCCCATA GATTTATAAC CTTGGAATAT CCCTCTCCAT GTCGCTAATA 7440

CAGGAATAAA GATAACAACC ATACTAATGA TTCTTATAAT CCAAGTAATA TCATCGACTG 7500 

ACCAACCGTT TTTATCATGA ATGTTTCTAG CTAATGTTAA TTCAGAAATA TAAGGTGCTA 7560 

AGAAATACAG TACCAAGAAA CCTAAAACAC CGGTAATACT CATTACAATA AAACTCGATT 7620

TATAAAATTT CTGACTTACT TTATATGCCC CAATAGCATT ATATTTCGCA ACATATTTCG 7680 

AAGCTGCTAA TGGTACACCT GCTGTCGCAA CTGCAATTGC AATATTATAT GGTGCATAAG 7740 

OGTATOTGAA CGGCGCCATA TTTTCTTGTC CACCAATTAA ATAGTTGAAT GGAATGATaA 7800 

AAAGTACGCC CAATACCTTG GTAATTAATA TACTAATGGT AATTAAAAAG GTTCCACGCA 7860 

CCATTTCTTT ACTTTCACTC ATTACGAATC TCCCTATCTC ATGTTTATTA AAGTTTTGTA 7920 

AACTAAAAGC TGTTTCTCTG TAAAATCATT TTTCATTATT ATGAATATAT CACAAAACTT 7980 

TATTTCATTG TCGTATATTC AATGAATTAT CATAACAAAA TTATCAACAC ATTGTCATTG 8040 

AATACTAGAT TTTGATTAGA ATATTACGAA ATTTCATATA AACATTATAC TACTATTTGA 8100

GATGAACATC GCATAACAGT AGAAAAATCA TTCTTATCAT ACACATACAT CTTCATTTTT 8160 

TATGAAGTTC ACATTATAAA TATATTCAAC ATAATTGTCA TCTCATAACA CAAGAGATAT 8220 

AGCAAAGTTT AAAAAAGTAC TATAAAATAG CAATTGAATG TCCAGTAACA AATTTGGAGG 8280

AAGCGTATAT GTATCAAACA ATTATTATCG GAGGCX3GACC TAGCGGCTTA ATGGCGGCAG 8340 
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GTAAACTCAA AATATCTGGT 


GGCGGTAGAT 


GTAAC6TAAC 


TAATCGATTA CCATATGCTG 


8460 




AAATTATTAA GAACATTCCT GGaAATGGGA AATTTTTATA TAGTCCCTTT TCAATTTTT6 8520

ATAATGAATC CATCATAGAT 'rnTri-GAGT CTAGGGGTGT TAAATTAAAA GAAGAAGATC 8580

ACGGGCGTAT GTTTCCAGTT TCCAACAAAG CACAAGACGT GGTTGATACA TTAGTQACAA 8640

CTATCGAACG CCAACATGTA ACGATTAAAG AAGAAGAAGC TGTTAGTAGA ATCGAAGTTA 8700


ATACAGACCA AACTTTCACT GTACATACTC AAAATAATAG TTATGAAAGC CATTCGCTAG 8760

TGATTGCTAC AGGTGGTACA AGTGTCCCTC AAACTGGTTC AACTGGTGAT GGTTATAAGT 8820


TCGCACAAGA TTTAGGTCAT ACCATTACTG AGTTATTCCC GACCGAAGTT CCAATTACAT 8880

CAGCTGAACC TTTCATCAAA TCCAATCGTC TAAAAGGTTT AAGTTTTJUIA GATGTTGAAT 8940

TGTCAGTACT TAAGAAAAAT GGTAAAAAAC GCATCAGTCA TCAAATGGAT ATGTTATTTA 9000


CTCATTTTGG TATCAGTGGT CCAGCTGCAT TAAGATGTAG TCAGTTTGTT TATAAAGAAC 9060

AAAAAAATCA AAAGACACAG CACATTTCTA TGGCAATCGA TGCATTTCCT GAATTAAACC 9120

ATGAACAATT AAAACAACAC ATCACATCAT TATTATCGGA CACACCAGAT AAAATCATTA 9180


AAAACAGTTT GCATGGTCTA ATTGAAGAGC GCTACTTACT GTTCATGCTG GAACAAGCAG 9240

GAATCGATGA AAATACCACA TCACATCACT TATCAAATCA ACAATTGAAC GACTTAGTAA 9300

ATATGTTTAA AGGGTTTGTA TTTAAGGTGA ACGGGACATT ACCTATAGAT AAGGCATTTG 9360


TCACAGGTGG TGGTGTGTCA CTTAAAGAAA TTCAACCTAA AACAATGATG TCTAAATTAG 9420

TTCCGGGATT ATTTTTATGT GGTGAAGTAT TAGATATACA TGGTTATACT GGTGGTTATA 9480

ATATTACAAG TGCACTCGTA ACAGGACATG TCGCTGGATT ATATGCCGGA CATTACTCAC 9540


ATGCATCAAT GGAATAATAG TATAAAATTT GGTTCGATTC TCTTTAGTAG ATCAACTTTT 9600

TCATTCAAAT AAAAATGACC TTAATATAAC TGAGTCACTA AAAAGTGTCG TTATATTAAG 9660


GTCAlTTCGT TAATTATGAT TCTTTTTCGT TTTTAGTACG TCTTCTAGCT AACAAAGCCG 9720

CACCTGTAAT CAGTGCAAAT TCTTTCAATG GTAAATCCAT TCCTTCAGAA CCTGTATTTG 9780

GAAGTTCTTT TTCAACTTTG CGCGATTCAT GTGTCTCTTC TTTTTTAATA GGCGTACAAA 9840


CTTTTGGAGC TGGCTGAATT TCTTTTGGTG ATACTTTCGT CGCTTCAGCT GGTAATTTAA 9900

TTGCTAAAAT TTCATCAACA ATGAATTGCG TGTGTTGTTT GATGTCATTT AATGTCGCAT 9960

CTTCATCAAT CATTCTATTG CCATCTGCAA CATATTGATC AATTAATACT TTTACTTTAG 10020


CTAATTGTTC TGGTGTTGCG ATCGCTTTGA ATTTC6CATA TGTTTGTTGA GCAATGTTAT 10080

CAATTCGCAG TAAGCTATTT TCTTTTTCAG TAATTACTGC TTCTATATCG CTTAATGCAA 10140
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5 



CATCCATTTG TAATTTTAAA GCAGTTATAG CTTTTAATGC ATCAGCCTTA TTACGATTAC 10260

TTACTTTTCG ATAATTTTGC ACTAAAGCAG TGACGCGTGC AA6ATCATCA TTAATCGTTT 10320


TTTCAGCATC TGGCTTTTTA ATAGGAT6TA CATCTAAATC ATGTATTGTT TGTAGATTTA 10380

ATGATGCTGT TTTATCAACT TGTGCATTGC TACGATCTTG ATCAATTTGT CCAATAGCAG 10440

TGTCATAAAT ATTTTGTAAC TGTGCTAATA TACTATTTCT TTCTTCTACC GTTGCTTGAA 10500


TATTCGCTTC AATTGCTTGT TTTTTATCGT TGAATAATGT TGTCAATTGT TCTCGAGCAG 10560

ACGCCTTTCT GTTAATAACA GGTTCGATTT CACGAATTTC GTTTTTCTCA TCATGCAATA 10620

AATATGCCAC ATCTGCATTA GTCACTGCAC TAGCAATTTG TTGTTTAGCT TTAATTAACT 10680


CTTTTTCAAC TTGTGCTATT GCAATATTTT GTTCTTCATC TGTCGCTTCG TTATTTGCTT 10740

TAATTAAATT AATTTTATTT GTAGCGATAT TTTGAATTTG TTGTAATGCT GTTGCTTTAA 10800

CTGTTGTCGC TGGTTTAATT TTTGAAATAA TATTTTGAGC ATTTATACTA TCTTGATTAA 10860


CTTGGGCAGT CTTATCTGCA TGATTGATCT GATCAATAGC CTGATTAAGT GCTTGTTCTA 10920

CTAAATGTTT AGCAGCTAGT CTTTCTTCTT CAGTTGATAA ATCGCTTTGA TCGATTAGT6 10980

CATTTTGAGC TTCGGCTTTT ACACCAACAG ATTGACGCGC TGCTGGTTTA ACTTGAACTT 11040


TAGGTAAAAT CACTTTGATG TTGTCGTTGC CATCAGTCnC AGltlCGATCC ACTTCTGCAT 11100

TCGTTTTGTT TTGTGCAATG TCATTT 11126



(2) INFORMATION FOR SEQ ID NO: 172: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3660 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172:

TTGCCCCGCA CGGCGGTGTG nTTCCTAGAA ATAATGAATA TAAAGaGAAA TATATAACAA 60

CGATTTTGAA TTATGAACCT GGTGATATCG TTACAATCAA AC6TGTGAGA GATAAGACGG 120

ATTTGCTAAT ATATTTGTCT AGTAAAGATA TTTCTATTGG TAATGAAGTG GAAATTGTAT 180


CGAAAGATGA AATGAATAAA GTAATTATCA TTAAACGTAA TGATAATGTA ATTATTGTCA 240

GTTACGAAAA TGCAATGAAC ATGTTTGCTG AAAAATAAAA TAAAGAAGCC ATAAAGATAT 300

CCATGATTGA ACTGATAAAG ACATATGGAT AATTGCTTTA GGCTTCTTTT TTATTAGTTA 360

ATTTATCAAG TGAGTATATT TGAGTAAAAT ATTCACTGCA TAAAGATTGA AGATAATCCA 420
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CTGTGGACTC GGACGCTGGA AAGTCAATTT AGCAATCGTC CAACTAGATT GTAGAACTTC 540 

GCCTAATAAT ACACCTAAAA TATATTGATA ACTCATTGTG ACAACTAGTT GAATTTCTAC 600 

TATATTTTCA TCTTTTAATA TAAAATACAA CATGATAGAA ATTAAAGTTA TAACAACAAT 660

GGGTGAGCCT TTTCtAGATG TTAAAATTAA AAAATAAATA AATATCAATA AATAGGTAAA 720 

TATAAAGAAA CTAGGTATCT GATAATGGCT CGACGCTAAA CCTATCAATA ACATAATAGG 780 

TGGCATAAAA TAACCACCAA TCGTTGTAAG CCATTGGCCT GCTAGATGTC TAGATTGTQT 840 

AATTGCGAAT CCTTGTTGTA ATGTCTGTTG TCGCTCTCGT GGACTTGTTA CAATGACTAA 900 

ATCTTTTGCA CGGCCACCAG CGAGTTTATT AAACAGTACA TGACCAAATT CATGTGTTAA 960 

AACAGGGATA TAGTTTAAAA TGACATCTAA ATAGTTCAAA ACAGGCTTAT GTCTATATTG 1020 

ATGAATAGCA ATATAACAAG CTGCAACAAT AACGATAATG TATATATTAA GTTGAATTGT 1080 

CGTATTAAAA AAGTTTGATA AATAATTCAT TGTTAACCTC ATATAAGATA TTAATTTAAA 1140 

GTTTGCTTAT CACTTATTAT AAATGATATT GGCATCAATA GCGTTAGACT TTAGACTTAC 1200 

CTTAGTTAAA CTAATTTTAA TTTTTGAAAA GGTGAATATG TGTTAAAATA AAGCAAAATC 1260 

ATTTCGATAT AAATAGGATG AATATAAATA CTGTTAATAT TGATTACACT AACATAATAA 1320

TGAAATAAGA TAGGAGATTC CTGTTATGAC TGTTGAAOAA AGATCCAATA CAGCCAAAGT 1380 

TGACATTTTA GGGGTCGATT TTGATAATAC AACAATGTTG CAAATGGTTG AAAATATTAA 1440 

AACCTTTTTT GCAAATCAAT CAACGAATAA TCTTTTTATA GTAACAGCCA ACCCTGAAAT 1500

AGTGAATTAC GCGACGACAC ATCAAGCGTA TTTAGAGTTA ATAAATCAAG CGAGCTATAT 1560 

TGTTGCTGAT GGGACAGGAG TAGTCAAAGC TTCGCATCGT TTAAAGCAAC CTCTAGCGCA 1620 

TCGTATACCT GGTATTGAGT TGATGGATGA ATGTTTGAAA ATTGCTCATG TAAATCATCA 1680

AAAASTATTT TTGCTAGGGG CAACTAATGA AGTTGTAGAA GCGGCACAAT ATGCATTGCA 1740 

ACAAAGATAT CCAAACATAT CGTTTGCACA TCATCACGGT TATATTGATT TAGAAGATGA 1800 

GACAGTAGTG AAcGnAnTTA AACTGTTTAA ACCTGATTAC ATATTTGTAG GTATGGGATT 1860 

CCCTAAACAA GAAGAATGGA TTATGACACA TGAAAACCAA TTTGAATCTA CAGTGATGAT 1920 

GGGCGTAGGT GGTTCTCTTG AAGTATTTGC TGGGGCTAAA AAGAGAGCGC CTTATATCTT 1980 

TAGAAAATTA AACATTGAAT GGATATATAG AGCATTAATA GATTGGAAAC GTATTGGTAG 2040 

ATTAAAGAGT ATTCCAATAT TTATGTATAA AATAGCCAAA GCaAAAAGAA AAATAAAAAA 2100 

GGCGAAATAA TCATGATGAC AAAAATAAAA CCGAGGAAAT CCTTAAATGG AGATTCTCGG 2160

TTTTTTCGGT TTATTTAATA ACGAAGCGGG ACTCATCGAG TTTGTTTCTA AATTCTTTTT 2220 

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5 



CATCAAGTTC ACCGTAATCT TTTAACTTTC CX5CCTTCAAT CCAAGCAATC TTAGTACAAA 2340

ATTGTCTCAC TTGTCCTAAG TTATGACTAA CGAAAAAGAT GGTTTTGTTT TGCTCTTTAA 2400

ACTCGTAAAT TTTATCTAAA CATTTTTGTG CAAAAGTTTG GTCACCTACA GATAAAGCTT 2460


CGTCAATGAC TAAGATATCT GGATTAACTG TGATATTAAT TGAAAAACCA AGTTTTGCAC 2520

GCATACCACT TGAATACTTT TTAACTGGTT GATAAATAAA CTCACCAAGT TCACTAAATT 2580

CAATAATCTT AGGTGTCATC GCTTTAATTT CTTTTCGCTT AAAGCCCATA CATAACATTT 2640


TAAATTCGAT ATTTTCAATC CCTGTAAGTT GTCCACTCAA GCCAGCACTA ATTGCGATAA 2700

CGCTGACTTC ACCATTACGA TCCACTTTGC CAACAGTAGG CGACAAAGAA CCX5CCAATGA 2760

TATTGCTCAA CGTTGATTTG CCGGAACCAT TGATGCCAAC AAGCCCTATG ACGTCACCTT 2820


CATAT6CTTT TAAACTAATG TCATCTAAAG CGAAAAATGT TTTGTTTTTA TGnTGGGAA 2880

TGAGCGCATC TTTCATACGT TCTTTATTTG TACGATAAAT ACGATATTCT TTTGTTACAT 2940

TTTTAATGTT TACCGAAACG TTCATTTGTA GACCTTCCTT ATTCACATTT ATCTAGATTA 3000


TAATATACTA CTCAACAGTT GTTAAATTTT AAAACCTGTT GTAAAGTGTA TAGAAGATTT 3060

TGTTATTATC AGAGTGGGTG TTTTGACACA AAATGTTAAT CATCAATGAT AACAATGATA 3120

TTTAAAAACT AAACTTATTT CAACTTACAT GATTGTATAC TATAATGTAT TTGTAATAAA 3180


CTAATATTTT AAAGAACTAG ACAATAATTT TGATAGCATC CATGTATAGT GATAGTATTT 3240

ACAACAATTA TTATAATACT ATTTAGTTAA GTAGAGAAAT AGTTAAACAT TTGAAAGTGT 3300

GGTTTAATGG AATGTCAGCA ATAGGAACAG TTTTTAAAGA ACATGTAAAG AACTTTTATT 3360


TAATTCAAAG ACTGGCTCAG TTTCAAGTTA AAATTATCAA TCATAGTAAC TATTTAGGTG 3420

TGGCTTGGGA ATTAATTAAC CCTGTTATGC AAATTATGGT TTACTGGATG GTTTTTGGAT 3480

TAGCSO^TAAG AAGTAATGCA CCAATTCATG GTGTACCTTT TGTTTATTGG TTATTGGTTG 3540


GTATCAGTAT GTGGTTCTTC ATCAACCAAG GTATTTTAGA AGGTACTAAA GCAATTACAC 3600

AAAAGTTTAA TCAAGTATCG AAAATGAAcT TCCCGTTATC GATAtACCGA CATATATTGT 3660



(2) INFORMATION FOR SEQ ID NO: 173: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 13868 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173: 
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ATTAATCACT TGTTGTGTAG AGTCTTGTCC GTTTrGGTTA TGATTGTTAG CCATGATATA 120 

CCTCCCTTAC AACACTCGTG GACCAGAAGT TTTCTGATCT CTCACATTAA CTTCTAACTT 180 

ACGTACTGGC ATTTCTGTGA AATATTCTAC ATTCTTTTTA ATATCCGAAC GAATTGCTTC 240

AGTTAAAGAT TGAACTTGAA CATTATTTGG TACGAAAAAG TCAGTTTTAA TGTCGATATA 300 

AGATTTATTT TTTTTGTTAT ATAGTTTCGC AACTACATTT GGTTGTCTTA CTTGATCATA 360 

TTTTGCAACC GTATCGAATG CCGTCTTTTC AACAGCTTTA CGAGATACGT AAACATGACC 420 

ATCATCGAAG TCTTTGTATA ATCCAGGTTT TCGATGCGTA GGTTTGAAGA TACTAAATAC 480 

TAATATAAGA CCTATTAATA TCAATAGTGC AGCAAGTGAA ATAAGTAATG GTTGGAACCA 540 

TTCAAATTGA AGGAAGTAGT CTTGATATTC AGTTATACGT CCATCTTGGA TATACATGAA 600 

TAACAGGAAC CCCACGaTTA CTACTATTAA TAAGCCAAGG ATAAAGTTTT TAA6T0GTTT 660

CACCCCTAAC GACACCTCCT TAGTTAAAGT TAATTTAAAA ACATATTAAA TATGTACCCA 720

TCAGTTTTTT TCTTAAACAT AATAAATTAA TAACTTTAAA TTTATTTTTA ATATATAAGA 780 

TGAAGTACCA TTTAGTAATA TATTCCCTAG TTTTTGTAAA TAAAACCTCA TTATTAATTA 840 

ATTyTCGTCA ATATGTTTTG AAGAACGATA TTCTAAAATA TCTGGGTCAC GATGTTTAAT 900

TAAAACCTTA TTACTATTTC TCGGTTTCTC CTCACTCAAA GATTTTATAA GCGACCATAT 960 

CATCGCTATA ATGACCACGG AAAATGGTAA CGCAGCAATG ATTAATAAAT TTTGAATTGC 1020 

TTGAGTACCA CCTGTGTAAA TCATGATGAT TGCAAATAAT GCCATAATGA TACCCCAACT 1080 

CACTTTGACA AATGACTTCG GATTAATATC ACCACTTGAA CTCAACATAC CTAAAACATA 1140 

AGTTGCCGAA TCCGCTGA7G TAACAAAGAA AATCATAATA ACAAGTAAAG TAATTAAGCT 1200 

TAATACAAAA CCTAGCGGAT AAT6TTGTAG CGTCGCAAAA GTTGCTOTTT CTGTCGCAGC 1260 

TTTAScAATA TCGGCAATAT GATTATCTTG TAAGTAAATT GCTGACGCGC CGAATACCGC 1320 

AAAGAATATA AAGCAAACTA ACGCCGGGAC AAAAAGTACA CCTAGAATAA ATTCTTTAAT 1380 

CGTACGTCCT TTTGACACAC GTGCAATAAA TATACCTACA AATGGTGCCC AAGATATCCA 1440 

CCATGCCCAG TAAAAGATTG TCCAATTTTG TAACCATTGG AATTTTTGAC CACCTGTCGG 1500 

AATGCGTAAA CTCATACTAA AGAAATTTGC AATATAATTA CCTAGACCAT TCGTAAATGT 1560

ATTTAAAATG TATAGCGTTG GCCCAACAAT AAAAAGACCA ATAAGTACTA CAAAAGCAAG 1620 

TAACATGTTG ATATTACTCA ACGTTTTGAT ACCTTTATCG ATACCTGACC ATGCTGACCA 1680 

AGTAAATAAT ATGGTTGCAA TGACAATCAA GATTACTTGC ATCX3TGAAGT TACTCGGTAC 1740

ATTAAATAAA AAATGTAAAC CTTCGTTTAT TTGCAATGCA CCGAAACCTA ATGTTGCAGC 1800 
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CATTGCCTTT TCACCTAATA AAGGCGTCAA TGTAGCGCTG ACTAAGCCAG GATATCCTTT 1920 

ATGAAAGCTA AAATATGCAA ACACTAGCGC GACAATACCA TAGACTGCCC ATGCATGAAT 1980 

CCCCCAATGG AAAAATGAAA ACTGCATTGC ATCATTAATT GCAGATTGCG TGCCAGCTTT 2040

ATGAATAGGC GTTAATTTGA AGGCATGACT GATTGGTTCT GCCGTTGTCC AGAACACAAG 2100 

TCCTATTCCC ATACCAGCAC TAAATAACAT AGCAAACCAA GACGGCAATG AGAATTCAGG 2160 

ATCTTCGCCT TCTTCACCTA ATGTAATGTT TGCGTATCTC GAAAATAAAA TATACACACA 2220 

GACAAATAAA ATAACTAAAA CGAGCAATAA ATAATACCAA GAAAAATGTA GCGCAATAAA 2280

TGTAGTAATG TTTTGCGTGA GTTTTrCTAA CTGTTTCGGA AATATTGCTC CAAAAGCAAC 2340 

AAATATCGTA CATATCACTA AAGATACCCA AAACACTAGA CTTACTGATT TATTTTTCAT 2400 

AAATACAAAC CCTTTCTGTG TAATQGTAAG TTCATACCCA TAACTGCAAC ATTTTAATCA 2460 

TTTGTAATTT TATATAGACA CAATTAATAA TGCCTCATCT TTTAAAAATG ATATATAAAA 2520 

CACACTCAAA TTATTTATCA TTGAGCAACA AAGTATTTTA TTGTATTTAA GTAATGCCTT 2580 

TCTAGTGCAT TATTGATTTG ATACCTGCAA AGTTGCCATA TTTCCGTTTA GAATCAATAG 2640 

TCGCTAGACA CAAAAAATAA GTCGCCTATA CAGTATTTTC TGCATAAGGC GACTTTACTT 2700

ACTAATCTAT ATATTAATTA CTAATTTTCC AATCATTQAT TGTTTTTCCA ACAATTGATO 2760 

TGCTTGATAT AAGTTTTCAG GTGATAAACC TTCAAAAACT TGTGTCGTTG TTGGTTGGTA 2820 

ATGCCCTGAT TCTATATTTT TCGTAATATC TTCTAAATAC TCATGTTGTT TAATCATATC 2880 

AGGCGTTCGA TGAATTGGAC GCGCAAACAT AAATTCATGT GTAAATGTTA TACTTTTTAA 2940 

TTTTAATGCA TTTAAATCTT GATCTTCATT AAAAGCTACG ATAGTCGTAA TATGCCCTAA 3000 

TGGnTTATC AGTTCAATCA TAGTATTGTA ATACAAGTCT GTATTATAGG TGCAAAATAT 3060 

ATAATCTACT AATGGAATTT CTTTAAATTG ACGCACTAAA TCCTCTTTAT GATTCAATAC 3120 

GATATCTGCG CCCATCTTTT CACACCACTC TGTTGTTTCT TGTCGTQATG CTGTTGTAAT 3180 

GACAGTTAAA CCATACCGTT TAGCAATTTG AGTGGCTATA CTGCCTACAC CACCGGCACC 3240 

ATTAATGATT AAGACAGACT TCCCTTCGTT TTCAGCAGGA TTCGTAGAAA TTTTAAATGT 3300 

ATCAAAAAAC GTTTCATATG CCGTAATACC AGTTAGCGGT AGACTAACCG CTTCATTAGC 3360

ACTTATGTTG TGTGGTGCTT TTGCAACTAT AGCTTCTGAC ACCAATTGAT ATGTCGCATT 3420 

TGATCCTTGT CTATTTGGCG ATCCAGCATA AAATACAACG TCACCCGGAC TAAATAATGT 3480 

AACGTCTGGT CCGATAGCTT CAACAGTACC AATAGCATCA AACCCAAGTA CACGAGGTGC 3540

TTGAGTGACT TCCATTTGTC GTTGCTTTGT ATCTACAGGA TTTACACTAA TGCTATTTAC 3600 
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ATTTCCTTCT 


TCCAATTTAA AGGGCTTCTC AAATCCTATC ATTTTCATAT CGTTTCACCT 3720

CATTTATGAA CTTATTTCrr ATTATACAAA ATAGAAGCCA TGTGTGCTTA TATCGCAGCA 3780

TCATGACTCC TTTTTCATTT GAATATATAA ATAATTACAG ACGACTTTCG TATTAAATTT 3840

TAGACTTATT TCTACCATGT TGCTGAACAA ATTTACTTTA GATAAAAAAT TATTAAATTT 3900

TGGTCAATTA ACAAAGTTAG TTTGTTAAAA CGTgATACTT TATTATTCCG TTACTTTAAT 3960

AACTTGTTTA CCAAAGTTAT CGCCAGTaAA TAAATTTTTA AATGCATGTG GCGCATTTTC 4020

AAAACCATCT TCAATGGTTA CTTGTGACTG AATTTTACCT TCTTGAACCC ATGTTGCAAG 4080

CTGTTCACTA GCTTCTTTAA AAGCATTAGC GAATTCACTT ACCAAGAAGC CTCTCATCAT 4140

TACTTGCTTC TTAATAAGCG TACCTTGAAT ACXSTGGTCCG ATATCGGCTT CAGGATGATT 4200

ATATQACGAA ATTGCGCCAC ATACTGGTAC ACXSTGCAAAA GGATTTAAAT GCTTAAATAC 4260

TTCATCGCCA ACTGTTCCAC CAACATTTTC AAAATAAACA TCAATACCAT CTGGTACTGC 4320

TTGTGCTAAC GCTTCTGCAA AATCCTCTTT CTTATAATCA ATACCAGCGT CAAAGCCCAG 4380

TGTCTCTGTT AAATAATTTA CTTTTTTQTC GCCACCCGCA ATACCTACTA CACX3GCAACC 4440

TTTAATCTTA GCAATTTGAC CTACAACTGA ACCTACAGCA CCAGATGCAG CTGAAACCAC 4500

AACAGTATCA CCGGCTTTAG GTTGTCCAAT ATCAAGCA6A CCATGATATG CTGTTTGTCC 4560

TGGCATTCCT AAAACACTTA AATATAAATC AAGTGGTACA TCTGTCGTTG GAACTTTAGT 4620

AATTTGATCC GCTTGGACAT GATTAATGAT TCGCCAAGGC AACATACCTA CAACGACATC 4680

TCCTTTTTTA TAATCTGCGA GTGTCGAATC AATTACTTTT GGAACGACAT GGCTAACAAT 4740

CGGTTTACCA ATTTCAAAAG GCTGTACATA CGAATCTGCC TTAGTCATAC GTCCTCTCAT 4800

ATATGGATCC ACTGAAATAT ACAGCGTTTG TACAAGTACA CCATCGCTCT CAAGTTTaGG 4860

CGTGTCAATC TCTTCaATTT TGAATGTATC CTCTTGAGGC ATGCCkTCTG GTATTTTGTT 4920

AAAAAGAATT TGTTTATTTT GCATCATTAA TCACCTTTCT TTATTTGAAA CTTTTACTTA 4980

TTTGTTACTT AAGCGTTAAG TTTGAATTGT GTCtTCGTGA TGTCTGTATG CAAATACATT 5040

CTTAGTTGTT ATATTTTGAC TTAAGCACTG ATTCATTCAT GTAACTTCAA CCACATTATA 5100

TTTGCTATAA TCATAAATTT AAAATGTTAC GACTTAGACA TTTTATGGAA ACTCTCAAAC 5160

AATAGATAAT TTTTGAAAAG CTCTAATATT ACAAGCTTTT TTGCCCCAGA AAAACTAGCA 5220

GTTGCTTTAT TTTTTCCATA AGAAGTCXSAT TAACTCATTA GCAACATTTT CATTCTCATG 5280

AAGCTGACTA T6TTGTGCAG GCTCACCTTC ATATTTAGAT TCTCGATAAC TTTTOGGACT 5340

ATTTCCCAGT AAATATTTTA ATGATTTCGA AGAACTATTA GACACTCTGC CGTCTGAATG 5400
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CTACTAAATA TTGACCATCA CCAATAGGTC 
CTTCTGGGAA TTCTTTAAAA CAATATTCAG
GATTTAGAAA ATCGCCATCT TTGTGTAaCA
CAAATGGTTC AACATGACGT TTATGACACT
CTTTAGGGTT AACTTCGCCA TATCCATTTG
GGATACCATT AACACCGAAT TTTTCACTTA
AATCACTCGC TGGTTCTGCA GTGTACCAAT
CX5CGACCTTT ACCATCXGTA TTTGTCGCAC
CAAAAATAAA ATCTTCAGGT AATGTTTTAG
AAATTGTTGT TGAACGAAAT CAAGGGCTGC
AGCACCTTGA GTCTTCGCTA ATTTAATACC
ATAGTTAGAC GCAACTTGTG GCGCTAAAAT
ATGTGCGCCA TATCC6CCAG cTGCCGCTTT
ATTAAGTGCA TTGGCTAATA ATCCACTTGT
TGTTTGTTCT GTGATATTTG AAGCTTTAGC
AATTGAATCA GCTTTTTTCG TATCAAAGTT
ACTTTCTTTA CGTCCTTCTT CTTCATCAAG
GTAGTAAATA ATAATATCTA CAACAATCAA
AAAACCTGTA CCCATGATAA TGCCCAATGG
AAAACTATTC ATTCCTAACA CTTCAACGAA
TGCniATACA AATGGAATAA AGAACACAGG
TGGTTCGTTT ACACCAAAGA ATGTTGGTAC
TTTAGATTTC GTCATCCACA TAAACATGAA
AAATGTAACG ATAAACATTT GTGTACCTGA
TTGAAGCAAC TTGAAGTTCG CTTCGATATT
TACAATTGAC GGACCATGAA TACCTACAAA
TGTGACACCA ATCCATCCAT CTGCTGCTGT
TTCCX5CTACA TTTGATTTAA AGCTGTTGCG
GATTACCGCT GAAAATGGAA TTAAGTCCTT
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CAATTTCATT GAATGTAGTC CAATATTTTA 7320 

CATAATCTAC AAAGTAGTCA ATCGTTTTAC 7380 

CTTCTGGTGT ATCAAAATGA TGCAATGTTA 7440 

CTGCAAATAA CTTATGGTAA TACTCAACAC 7500 

GGAAGATACG AGACCATGCA ATTGAAATTC 7560 

ATTCTAAATC CACTGGATAT CTGTTATAAA 7620 

AGTTTTCTTC TAAATACGTA TCCCATGCTA 7680 

CTTCTGCTTG ATATGCTGCT GTTGCTCCAC 7740 

TCATATGAAA AACTCCTATT CTTAATTTTC 7800 

TTGGCCATCT CGTGTCAATT TGATATATTC 7860 

TAATCTATCT GTATCTTGCT TAATATCTTC 7920 

GATTAATTGG TACTCTTTCA TAATGTCCAT 7980 

CACTGGCACA TGATATTCTT CAGCTGCTTT 8040 

ACCACCACCG GCACAAAGTA CTAAGACATT 8100 

TGCATCGTCT GATACACCAC TTGCCGCTAA 8160 

TGCTGCAACT TTTTCTTTTA AATCTGAATT 8220 

AATTTCACTA TCATAAACTT TTAGGAATGG 8280 

A6TAATAGCT AGTACGAATG ACCATAAACC 8340 

ACCTGGTGTT GTCCAAGGTA AATTCACACT 8400 

AAGTTTGAAA ATCCATACGT TAACAATTGG 8460 

ATTCAATACT AGTGGTGCAC CAAATAAAAT 8520 

AACTGATGCA CGTCCAATCG CTTTGTTTCG 8580 

CGGGACGACC AATGTTGCAC CCGTACCTCC 8640 

TGTAATAATT TTATCTGCGT GTTCTCCAGC 8700 

CGCATATGTA ATGGCTGCAA TTGCTGGCTC 8760 

CCAGAATAAT GCAAAGGCAC CAAAGATAAT 8820 

AAATAATGGT TCGAATAATT TTAAAATACC 8880 

AATGACTAAA TCTAATGCAT AAAGAATGAT 8940 

AAATACTTGT GAAATATTGG GCX3GTACTTC 9000 



AAATGCTGAT AAGAATGCTG TTAATAAACC
ACCATCTTTG GCTGGATCAG AAGCTAAGAA
AGAAATAAAG TTAATTTGAT TTGTACTTTC
TTTCGCTGTC GTTCCTGCTA CTAAAAATGC
CATTAAAATG GCTTCCATGC CTTTATCCCA
AATTAATAGA AAGATACTTG AGAATAAGAT
GATGGCTCTT AAATATATGT TACGTGATAA
TTGTGCGATC AATTTTTGCA TCATTGTCAT
TTCATTAAAT CTTTCAGTAA AATTGTTGTC
ACACTGTATG CAATATCATC ACCTTGCGCT
TGCGCTTCCG CAATGCAATT GTTTCCTTCT
CCAGCTTGAG CTGCTGTTAA TGCTTCTAAA
ACAATTTCAA AACCTAATAA TTGGACTTCT
ACTTATTTTC TTTGTTTCCA AGATGTCGCA
ATATTTTTGA AACCAGTTGT ACGTAACCAT
TATACTTGAA CAGCACCAGA CCATGTA6CA
CCAGCTTCGT GAGCAAATTT TAAAGTTTCT
AAGTAAATGT ATQGTAAATG AGTT6CTGCA
TCTTTTGTAT AAACCACTTC ACCTTCAGCA
ACTTCAACTT TCAATACATC AACGTTAAAG
TTAACCTTTC TAGGCTTAAC TTTTGCGAAT
TCG^ATGTTA ATACTTCTAA AAAGAATGGA
CTTTCAATGT ATGCTTTCTT TTGAATGTTA
AGTAAGAATT TAACAGCATT TGCGCCTTGT
AAACAGTCAG GTAAACGACC TTTAGCGTTT
AATAATCCAC AATCTTTGTT ACGTGCATCT
AAAATTGAAG ATGCATATTG TGTTAATTCT
TGAGCTACAG TrGGTTCTTC AGTTTGATGT
TGGTCAAATO CTAATGCAGA AATGATACCT
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TTTAGTTCCC ATAAATOCAC TTAAGAATCC 9120 

CAAGAATCCA CACATCGCTG CTAGCATTGT 9180 

TAGCTTACGG TTAAATGAAT CTGTTAAAGA 9240 

TACAAGCCCC ATCGTATAGT TATATGGTTT 9300 

TTTAAAACCA AATATATTTG GCACATATGC 9360 

GACAGGCATT GCAGAAATAA ATCCATCACG 9420 

TTTCTCGAAA AATGGCTTCC CTTTTTCAAT 9480

CACCCTCTTT TATAAAATTC TAATAAATGC 9540 

ATTAAATGAT CTTGACCATG CATCATCXSTT 9600 

TCTTTAGCCA ACAGGCTTGT TTGTGCTCTA 9660 

TCAATCAGTG CATCTGCTTT TGCAAAATCT 9720 

AACTTAGAAC GTGCATCCCC TGCAAATGCA 9780 

TCTCTATTCA TAGCATTAAT CCCCTTTTAA 9840 

GTATCTTTTA ATACTTTATT TAAGTCATCA 9900 

TCACGAGCAG CATCTTCACC TTGTTCAATG 9960 

CGGCCACAAA GTACCCCGTT GAATTTAGCA 10020 

TGGAATAATT CCGCAGAAAC ACCAGCACTT 10080 

TCTTGATCTT TAAAGTGTTG TGCCGCTTCC 10140 

AAGCCTTCTA CATATTTCAT GTTTACTGGT 10200 

CGTGGTTCTG AGAATAATTT CATTGCTTCG 10260 

TcAACAcTAC CGTTATcAGG AATGTTGTCA 10320 

ATATCTTCTG CAACACATTC TGAACCGATT 10380 

ATTTCTTCAG CATCATCAAC ATCATAGTAA 10440 

TCTTTTAAAC GTTTTGCAGA CCACTCTACT 10500 

ACGTCATATC CAGTTTTTTC ATAAGCAAGT 10560 

GAAGCTGGTA AACCATATTC AGGATCTAAT 10620 

TCCGCAACTA ATACTTTTAA TTGTTCAATT 10680 

TTTGCCATCA TGCGTTTTAA AGCACCACGT 10740 

TCGTTGTTAC TTAATTGTTC AATTGATGCG 10800 
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TCATCATi^ : TATTTAAATT GACATAACCT GTTTGTGCTT CTTGTGCATT CAGCATGCCT 10920 

AAAGTATl i CTTTTTTTAG TAAATCGTGG TCGTTTTCAT GATTAAGAAT TGCTGAAGTA 10980 

^ ATTCCAGC ; CTGTAGAATC ACCTGAACCA ACCGGATTTA ATACACTTAT TGTCGGAATA 11040 

TTCACTCT ^ AGAATGTATG ATTGTGCTTA GCGAATGCAC CTTGTGCACC TAAAGACACA 11100 

ATAATCC^.. - CAATCCCTTC GAATAAGGGT TGTGACACTG CCTGTTTCAA ACTTTCTAAA 11160 

CTTTCATC \ GTGGCTGGTT AAGCAATTGA TATAGTTCAG AAATGTTTGG TTTAATGACT 11220 

GTAGGTr: . ^ ATGGATTTTC CAAAACTGTT TGCAAAGTtG CACCCGAGCA ATCTAATATC 11280 

ACAGGCAC: : CTTTGTTTTG GCATCGTTCA ATGATTTGTG CATAATAATC TTGATTTAAT 11340 

CCTTTAGC" * AGCTACCTGA AATAGCAACT GCTTCAACTT TTTCTAATAA TTGTTCAAAA 11400 

TGTTTAAT^ \ ATCCTGCAGC CTCTTGATTA TCAATCTCCG GTCCCTGCTC TAAAATTTCT 11460 

^ GTTTGTTG;>: CTTCATGTAA AATTGCAATG CAGTTTCGTG TTTCACCCTT AATGTTATAA 11520 

AATGCATGCr TGATGTOGGC ATGATCTAAT TTTTTAGCAA TAAATT6ACC TAATTCACCG 11580 

CCAATAAA/. : CACTCGCAAG GACTGGCTCA CCTACTTGCG CAAGTACTCT TGTTACATTT 11640 

AAACCTTTAC CACCAGCTGT TTTACTTACT TCTTGAACAC GATTAACATC ATCTAATTTC 11700

AATGCTGT7A ATGGGTATGA AATATCAACG GATGGATTTA ATGTTAAAGT TAAAATCATA 11760 

TGTGTCGTCC CTTAATCGTG GTATTCGCCT CTGTCCCATT TTTCTAAGTU^ TTCATCAAAG 11820 

AAATGTGGAT CAGCTTGATC TGCATTGCTT GTTTCTAAAT GTTTAATTTT AGCGATTAAT 11880 

TTTTTGTTCT CTTCAGTTGG TTTATATTCA GCATTAATAA ATGCATCGAT AATATCX3CAC 11940 

ATTAATAACT CACCTATAAT ACGTCCACCG AAGCCAATAA CGTTCGCATT TAATTCTTCT 12000 

TTAGCGTATA ACGCTGATGT CATATCACX3T ACTAGTGCTG AACGAACGCC AGGTACTTTA 12060 

TTTJS&AGCAT TGTTAATACC AACACCTGTT CCACAAATAC AAACACCTAA GTCTGCATTA 12120 

CCGCTAACAA CTTGTTCGCC AACTTTTTTA CCAAAAATTG GATAATGTGT TCTTGTGAAA 12180 

TCX5TATGTTC CTACGTCAAT GACTTCATGT CCTTTTGATT TTAAAAATTC AGATACACGC 12240 

AnTTTGTAT CTGTAACAAT ATGGTCGCAT CCTAATGCAA TCTTCATAGT AATTTTTCCT 12300 

CCTTAGCACA TTTTATTAAG CATATCTACG CGGATTTGGT GTCTACCACC ATCGTATTTA 12360

CCTTCAACAA AACCTTTAAC GACATTTTTC GCTAATGTGT CTCCAACAAT TTCAGATCCC 12420 

ATAGTGATCA TTCTTGAATT GTTATGGCCT CTAGTCATAT ATCCAGAGCG TTCATCTGAT 12480

ACTTCAGCAG CAATCATGCC TTTGATTTTT GTAGCAACCA TAAAGCTACC TGCACCAAAT 12540

GCATCX3ATAA CAATACCTAA GTTACCTTCT TGACTTTGAA CATCTTTTGC TACAGCCAAA 12600 
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5 



TCTAATAAGT ATGATTTGAT GACTTCTTTT AATCGTTTGC CAGCTTCATC TGAACCAATA 12720

ATAATCGCCA TAATAAGACT CCTTTTTACT TTAATTTTGA AATACCTTTC TTAAAATGT6 12780

ACATATTTAT TTGTAGGTTA TGAAAATCTT GAGAAAAGGC TTTCAATTTG ATTACGTTTA 12840

AATTATAAAC ATAAACAAAC AATAAATCAA CATAATATGT TTATAATATG TTTGTTTATG 12900

ACGTATTTTC AAACAATAAG TGAACATTCA TATTGTGGTG TTGTTTTAAT TAGGTATTCG 12960

TCTGAAATTG TAGTAAAACT TTGTCGAGGT TCCCGTTGaC ATAAATTTGC ATAAAAAAtA 13020

GCCCATAAAT GAATGCAAAT TCACATTCAC TTATGAGCAT ATAGATACAT ATTTTAACAA 13080

TGCAGTTATA CTTTTAATTT AGTCGACTAC TTCAATATAT GTTTTAATCG TTTCTACTTT 13140

TTCTTCATCT TCATAGTCCA TGACCACTGC AGTCAATTCG TTTAACTGAC AAAATGATGT 13200

AAAATCTTCT TTGCCAACTT TCGTATGATC GATTAACAAG TATTTTTCAA TTGAATTACT 13260

TAGTGCCAGT TGTTGCGTAT AGGCTTCATC TAATGTAGAT GTCATCACAG CACCTTTATT 13320

TACTGCGTTA CTACTAAAGA ACATCTTGCT AAATCTTA6T TTTTCCAACA TGGCGTTCGC 13380

CATTTCACCT ACAAATGCTT CTGTAATATG GCGCATTTCA CCACCAATTA AATAGACACX3 13440

AAAATGTGCT GnTGTTTTT CTAACAAAAT TTTATACACC GGCAAACAAT TCGTAATAAT 13500

TGTGAGCGTA TGATGATTGA CTTCTTCTGC TAATAGTTCC CTGGTCCGAA 13560

AAACAAAGTA TCCCCATCTT CAATTAATGA TGCAGCTTTT TTAGCTATAA ATCGTTTTTC 13620

TGCAATTTGA CGGGTATGTT TTTCTTTATG CGATATTTCT TTATACTGAA ATGTTGAATT 13680

ACTGCGTGCA CCACCATGAA TCTTCGTTAA AATCCCTTTA TTTTCCAATT CAATTAAATC 13740

TCTTOGAACT GTCATATCAG ACACATTTAA ACCTTCGACG ATTTCATTCG TTCTTATCGT 13800

GCCCTTTTTA TTCACTAGTT TAGCAATTTC GTCCAAACGT TCATGTTTAT TCAATGTAAA 13860

ATTGCSrrC 13868



(2) INFORMATION FOR SEQ ID NO: 174:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4549 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174:
TTAAGTCAAC TTTGTCTATA CGGTTTGGAT CtTCTaCCCA ATGTCTTATA AAAGACAATC 60
CCGCACCTGA AACATAACTC ATGAAATAAG AAAATGOTAT ACCATTAATT TGATCATTTT 120 

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10 






AATCTTTACC CATACGAAAC ATCAATTCAT AAAATGCGAT GTCTTTTTCT ATCATTTCTA 240 

TTAAAACGGT CATAATTTGA TGTATGTTAT CCGTGGATAA CTTAACTGCT CCATTTAACT 300 

TCTCATCATG AATGAAGTCT CTTATTTCCT CCAACTGCTG GTCCTCTAAT TTTTCAAGCA 360 

AATCATACTT ATCATAATAA TGCGTATAAA ATGTACTACG GTTAACATCA GCTAAATCTG 420 

CAATTTGTTG CACAGTAATC TCTTCTAATT GGTGTTGATG TAAAAGTTCA ATAAATGCAT 480

TTCTCATTGC AACTTOTGAT TTTCTAATAC GTCGATCTAT AGTCATTTAT ATCAAGTCCT 540 

CCCCAATGAT TATAAACGTT ATGTTCATTA TCCCACAAAT CTCCAACATT GATGATTGGC 600 

ACACAATGTT TACCTGTTTA ATATAGGTGA TACAAACAAA CAGAAAAAGG TGATAACAAT 660 

GAACCAACAT TTACTAGGAA ATCCAAAATT AACTGTAACT CATGTCAATG AAGTTAAAGC 720 

CGGTATTAAC CACATCGTTG TCGACAGTGT TCAATATGGA AATCAAGAAA TGATTATGGA 780 

AAAAGATGTC ACTGTGGAAA TGCGCGATGG CGAAAAATTA TATATTAATA TTTTCAGACC 840

AAATAAAGAT GGCAAATTCC CTGTAGTTAT GTCTGCAGAT ACTTACGGTA AAGATAATAA 900 

GCCTAAAATC ACAAATATGG GTGCCCTTTG GCCAACATTA GGTACCATTC CGACATCTAO 960 

TTTTACACCT GAAGAATCAC CAGACCCAGG ATTTTGGGTG CCAAATGATT ATGTTGTAGT 1020

TAAAGTTGCA TTACGCGGTA GTGACAAATC CAAAGGCGTC TTATCTCCAT GGTCAAAAAG 1080 

AGAAGCGGAA GATTATTACG ArTGATTGAA TGGGCAGCAA ATCAGTCATG GAGTAATGGA 1140 

AATATCGGGA CAAATGGTGT TTCTTATCTT GOGGTGACTC AATGGTGGGT CGCATCATTA 1200 

AATCCACCAC ATTTAAAAgC AAtGATTCCT TGGGAAGGCT TAAATGATAT GTATAgAGAA 1260 

GTAGCCTTTC ACGGAGGTAT mCCAGATACT GGCTTTTATC GTTTCTGGAC TCAAGGTATT 1320 

TTTGCGAGAT GGACAGATAA TCCAAATATC GAAGATTTGA TTCAAGCACA ACAAGAACAT 1380 

CCTCTGTTCG ATGATTTTTG GAAACAGCGT CAAGTGCCAT TATCACAAAT TAAAACACCT 1440 

CTACTAACAT GTGCTAGTTG GTCTACACAA GGTTTGCACA ACCGTGGCTC TTTTGAAGGA 1500 

TTTAAACAAG CTGCATCTGA AGAAAAATGG CTATATGTGC ATGGACGTAA AGAGTGGGAA 1560 

AGTTACTACG CTAGAGAAAA TCTCGAACGC CAAAAATCAT TCTTTGATTT TTACCTTAAA 1620 

GAAGAAAATA ACGATTGGAA AGATACGCCT CATGTCATTT ATGAAGTTAG AGATCAATTT 1680

TATAAAGGCG AATTCAAATC AGCGTCACGT GTCCCTTTAC CTAACGCAGA ATATACACCA 1740 

TTGTATTTGA ATGCTGAAAA TCACACATTG AATCATGCAA AGATTAGTAG CGCGCATGTC 1800 

GCACAATATG ACTCTGAAGA TAAACAACAA GATGTAAGTT TTAAATATAC GTTTGACAAA 1860

GATACTGAGT TAGTTGGAAA CATGAACTTA AAACTATGGG TAAGCACTAA AGACTCAGAT 1920 
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CCTGATTTTA ATCATATTGA AAATGGTCAA GTAGCTACTG GTTGGTTACG CGTATCACAT 2040 

CGTGAATTAG ATCAAGAAAA ATCCTCAATC GCGCAACCTT GGCATAAACA TGAAACAGAA 2100 

TTAAAGTTGT CACAAGATGA GATTGTACCT GTTGAAATCG AATTOTTACC TTCAGGCACG 2160

CTATTTAAAC AAGGCGAAAC ATTGGAAGTT GTTGTAAAGG GTAGTGAAAT TGTAATTGGT 2220 

AATAGTACTC CTGGCATGAA AACACGTTAT GAACATGAAG AAACOGTAAA TAAAGGCATG 2280 

CACATGATTT ATACTGGTGG TAAATATGAT TCACAATTAA TCATTCCTAT CGTTAATTGA 2340 

TATGCAGCAA TTACX3GTCGC TTTTGATTAA AAGTGACATA GTGATAGGAC TGTATAACAA 2400 

GAGAAAGCCA CACGCTTGGA ATCTTAAACC AAGGTGTGGC CCTTTTTATT ATTGATGGCT 2460 

ATTTAATTTT ATAACACTAT CGTATTTTCT TTTTCATGAA TCATTTCAAT AATGACATTA 2520 

TCTTCATTCA TTACTGCTAC TTTAGGTGCA TGGTTTTTAA TTTCTTCTTC ATTCAACTGT 2580 

GCATAAGTCA TGATTATGAC TACATCGCCT ACTTCAACAA GTCTTGACGC TGCACCGTTT 2640

AAACAAATTT TACCACTACC TCTTTCACCA GCTATTACGT ATGTTTCAAA ACX3TGCACCA 2700 

TTATTATTAT TCACX5ATGGC TACTTTTTCA TTTGGCAAGA TGTCTACCGC TTCCAATATA 2760 

TCTGAATCAA TCX5TAATGCT ACCTACATAA TTTAAATTTG ACTCAGTCAC TCTTGCTCTA 2820

TGAATTTTAG CATTCATCAT TGTTCTTATC ACTITATTCA GCTCCAATTA TTATATTATC 2880 

TATTAAACGC GCTTTTGAAA ATTTAACAGC TAAcGAGATA AATATGCGTC CAGTTATTTC 2940 

GTGTTGTTCT ACTAATTGAG GATAACTATA AACAGCAACT TCTTCAATGC GTrCACTTAT 3000

ATGTGATTCA AGATATTCAG TAACCCTGTC TATAATTACT TTACTTTQAC GTTCACCGTC 3060 

TTGATACAAC GCTTGTGCTA ATAGCAAACT TTTACTTAAA TGTACCGCTT CTTGTCGTTC 3120 

TTGCTCCGTT AAATAAACAT TTCTTGAACT TTTCGCCAAA CCATCTQCTT CTCGAACGAT 3180 

ATCA&TACCA ATAATTTCAA CGGCATGATT GAAGTCTTTT ACCnOTTGCT CGaCAATAGC 3240 

CAATTGCTGG GCATCTTTTT TACCAAAATA AGCATAATCC GGCATAACAA TATTAAATAG 3300 

CTTATTAACT ACTGTTACCA CCCCATCAAA ATGCCCTGGr CCXStTCXSCTC CTTCTAACAC 3360 

ATCAGCTAAT GGGCCTACTT TGACATCAAT ACCTAATTCA CCTGGATACA TATCTTCTAC 3420 

TGCAGGATGA AAAACAATGT CCGCTCCTAC TTCTGATACT AATTCTAAAT CTTTATCAAT 3480 

TTGTCTCGGA TAAGCATCGA AATCTTCGTT TGGACCAAAT TGTAATGGAT TAACAAATAC 3540

ACTCACAATT GTAATATCAT TTGTACTAAC TGATTCGCGT ACCATCGTTA AATGTCCATC 3600 

ATGTAAGGCA CCCATTGTTG GGATAAAACC AATCGTTGTG CCTGAGCGTT TGGCTGCTTT 3660

AACAATGTGT TGCATCTCTT TTACCGTAGT AATCAGCTTA GTCATTGTTA TTAACCTCAT 3720 
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5 



GATCGTATTG TTTTAAACCA TCCACACCAA CACTAAAATC AGCAAATTGC TTCACAAATT 3840

TCGCTTTATG TTCAACACCA TAATTTAACA TATOGTGATA AACCAATACT TGACCATCTG 3900

TACCTTTTCC TGCACCAATA CCAATGACTG GAATTGTTAA GTGCTTGCTA ATTTCTTCTG 3960

CTAAATCATT TGGAATTGCT TCAAGTACTA ACGCAACTGC ACCAGCTTGT TCTACATTTT 4020

TCGCGTCTAA AATAAGTTGC TCCGCTGCTT CTTTCGTTGC ACCTTGTAAT TTATACCCCA 4080

TAACGCCAAC ACTTTGAGGT GTTAATCCTA AATGTGCAAC AACAGGAATA CCAATTGCCG 4140

TTGCTTTTTC AATAAATGGT GTAATATGCG CTCCTTCTGC TTTAATTGCA TTTGCATTCG 4200

TCTCCTGATA AAGCTTTAGA GCATGATTTA AGTCTTGTGT CATAGAGATG CCTACTGCAC 4260

CAATCX3GCAT ATCAACAACT ACAAATGTAT TTGGTGOGCC TCTTCTTACT GCACGACCX^T 4320

GATGAATCAT ATCTGCTAAC GTCACTTGTA CGGTACTTTC ATAACCTAAT ACAGTCATAC 4380

CAAGTGAATC CCCAACAAGA ATCATATCAA TACCCGCTGC TTCCACTTGT TTAGCACTTG 4440

GAAAATCATA AGCTCrrACC ATAGAAATTT TAGTTTGCTT TTGTTTCATA TCTATTAATT 4500

GACTTACTGT TTTCAATGTT ATTCAACCTC TTTTTGCAGT ATnATTAGA 4549



(2) INFORMATION FOR SEQ ID NO: 175:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 8339 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175:

TTATCTTTTG TTGTTTCCTT AGACAAACGA CTAACCACAT TATAATGGAC TAATTTATTA 60 

ATTCTATTTA ATTCCATTAA GTTATCCGTA ACACTAAGTG AAGATGCGGA GTTCACTCTC 120 

GTTTGTACTC TTCGTTTTAA TAAAGCACCT CGTAATAATA CAATCATTCT TCTTATTAAT 180 

GATGCTTGTC TATATACCTG TGTTCTTTCA GCATAACGCA TATAGTTTTC AAGTACACTA 240 

TTCGTTATTT GTCCTTCATC TACTAAAGAC TCTAATGTTT TkGTTTCTAC ATTAAAAGCA 300 

ATTTTTTGTA GACGTTCTAA TTCTTTAGAG TTTTCATCAT CTTTCTCTAC AGTTTTTAAA 360

AATGCTAATT TATCATGATA TTCTTTAATC ACGTTACCAT ATTTAAAACT TGTTTCGAAA 420 

GTAGATTTTT GATTTAGATA ATCAATAACT TGTTCTAATA TATAAATTCT AGCAACTTTA 480 

AACGACATAT TGCCAATTAC TGTTTTAGGT GCAGGTTTCG TTAATAATGG CAATAATACT 540

TGCGCAACTA CCAAACTAAT AATAACCATA CCAGATGCAA TAAATAATAA GTCGTTTCTA 600 

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ATTGTTCCAT GCACACCACA TAACGTCATA 
TTCTCAGTCG TTGGATTATC ATCATCATTT
GCTAAATAAA AATAAGGATA TAAGACATAA
AAAGCAACAA CAATAGTGAT GCCTATTAAA
ATTTTAATAA TAACTTCAGG TACTAAAAAT
ACATAACCTA GTATATTCCA TGTATGATTG
ATAATTCTGT CACGTTCGAA ACCATGTACA
GATGCGTGaA ACAATTCAGC AATTAAATAC
GTAAACATAT TAATGTTTTC ATATCCTCGA
GCCATACCTA TAAGTAAACC AACCACTGCG
ACAGCATCAA CAAGTGAAAA AGCACCTGTA
ATAATACCAG CAGCATCATT CAATAATGAC
GGCAAGACCT TTCCTTTAGT GATTGCTTGC
GCAGCAATTG CAAATGCTGC TCCAATAGGT
CCTACACCTA TCACAGTAGT AATGACTAAT
TATTTCCTTA AATGGACTCT AGAAACATTA
ATTGTTACCA TAAACAATTC AGAATCAAAA
AGTAACATGC CCAAGAAAAT TTGTATAAAT
ACAAACGAAC TTAGTATCAC AACAGCTATA
CITTCACCTC TCTAAAAAGT ATTGTTTAAT
TATACTTTTAG AGGATAAATT GAGTTAGCGA
CTACGATTGC AGTACTTAAA TTTGCAATTA
ATAAATAAAC AACTTGCTTT CACATAACAA
TTAAAATCGA CTAACCAATT TCaAAGTACT
TCaAAAGCAT TGTTATGCTT AACAATTTAG
TTACGATGCT CTTCTCGTTT TTCAGCACGT
TTTGAACTTA ATAATATTGA TGCATGTGTG
CCGTTGCGAT AAGCAGCGCG AGCGACTAAG
AAAACAAGTG ACAGTAATAA ACGCACACTQ



ATTAAAGCGT ATAAACTTCG CTTTGGTGGT 720

TTAGTCATCA TTTTTTGGAA TGGACTGATG 780

ACCCAAACAA ATCTAAATAG ATAGACAGCT 840

AAGATTAAAT TGTGCGGTTC TGTTTTGATA 900

CCTAATATTG AAAAAACAAA GCCATTTAAA 960

TAACTCATTT GCAGTTGTGT ACGTACTTGC 1020

AGTCCTGCAA CTACTGCTGC AATGATTCCT 1080

GTAACAAATG GTGTTAACAA TTGAATAATT 1140

CX3CATCAATG TTAATCGGAA CCTTACTAAT 1200

CCACCAATTG ATGCAATTAA AAACAACTGA 1260

ACTAATACTC CAACAGCTAT TTTAAATGAA 1320

TCACxrrrcAA gaattgtcat TGCTCCTTTT 1380

ACTGCTACTG CATCAGTAGG ACAAAGAATG 1440

AAATCTGGCC AAATCCAATG AATAAATAAA 1500

CCTAATGCCA TCATCATCAC TGGCTTAATA 1560

ACACCTTCTA CAAATAACAA AGGCGCAATC 1620

TTAAATTGAA CAGGGATTGG GGTAATAAAT 1680

GCTAGGGGTA CTTTAGGTAT GAAAGTATGG 1740

AATATAAGAA TTGTTTCAAA TATTTCCAAA 1800

TGAAAATTAA GTATCACATC TCGTTGTAAT 1860

CCACAAAAGC ACTTTAATAT AGATATATGT 1920

TTTAATTTTA TTTTATCACT AATTGTTT6T 1980

CATTAACTTA TAATACAAAA AATGAGCACC 2040

CTTTTAATGA TTAATTTTGA AAACAGATTT 2100

CCAACACTTC AATCGTTTTG ATACCATTTC 2160

AATTGTAATG CTTCTGTAGA GTTTTGTTCA 2220

TGAGCATCAT TTTTTCGATA CATATAAGCG 2280

TGCATGCCGA CTGGTGAAGT TAAATTAATA 2340

AAAAATCCTG TATTCACAAT AAAATAAATT 2400
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CTTAAGAAAA CATCTTGGAA TTTCACGATA CCTATTGCAC TAATAAGAGC AATAAAACTA 2520 

CCTAACAACA ACATCACAGC AGCAATAAGA CTAAAGATTT CTTTTGTTAT TTCCATTAAA 2580 

CACATGCCCC CCACCAATAA AGCGTGATAT TGAAACAGAA CTTACAAAAG ATATAATGGC 2640 

AATGAGCATG ATTGAATCTA AGAAAGAAAC GGTGCCCATA AGTACACTTA ACACACCCAC 2700 

AATTGACATT ACGACAGCAC TTGTTGTATC AAATGTAACG ACACGATCTG CTGTTGTAGG 2760 

TCCCTTGATT AATCTAAATA AACAGATGAT TAATGCAATT CCAAAAATAA TGAGTGAACT 2820 

AATAATCATA ATATGTGTTA TTGTTTGTAT CATCGCGACA CCTCCAATAT TAAGTCTTCA 2880 

TAATGCTTAA TACTTCTTAA CAAACTATCT TTTTCTTTTT CTGACACGTC GATACTATGA 2940

ATAAAAAACT TTTTAGAGTC TTGAGAAATT CGTATTACTG TAGACCCTGG AGTTATAATA 3000 

ATTAAAATTG TTAAAAATGT TATTGACCAA TCACTTGTTA GTCTTGTTTC ATATGAAAGT 3060 

AATCCAGGGT TCATATCTTT TGTTTTAAAA AGAATATAAT TAATCGTGCT AATGCTAGAT 3120

GTTATTAATT GATATAAATA AACACCTAAA AATTTAATAG CTACCCATAT TTTTCTAACA 3180 

TAAAAATCAT CGCTGAAAAA CCTGTGTAAT ATATAAATGA CAATTAAACC AATTAGATAT 3240 

CCAGAAAAGA AAGTCGAGAA TTTAAAATGA TCTTCATCTT GAAATAATAC CCATAAGAAT 3300 

GCAATGATAA TATTTAAAAC TATTTGATTC ATTTAGTCCT CTCCTTTCAA ATGCGGATTT 3360 

ACAAGTTTTT GATATAATTG ATCACTCGTG TTCAACTCAG TTGCATCACT TGTAACATTT 3420 

AACACAACAG GTGCAGCAAT TCCGATTGCG ATAACCACAA CTACTAAAAT ACTTAAAATT 3480 

CTTTTTCGAT ATAGCGGGAT TTTCTTAAAA TTAACTTCCT CCCCATCTTT ATCTCCAAAA 3540 

TACATATAAA AAAGTATCCT AAATAAACTG TACATTGCAA TTAGACTAGT AATAATCATT 3600 

AACGCTAGTC CAATATAATT GCCATTTTGC AATGCACCTT GGAAAATAAG TACITTCCCC 3660 

GGAAAGCCAC TAAATGGAGG CACGCCGCCA ATAGCAAAAA TCATTATAAT AAACGCAACT 3720 

CCAAATAAAG GTTCTTrTTT AGCTAAGCCA TTCAAATATT GATATTGTCG ATAGCCTGTA 3780

ATGTAAACTA AACTACCAAT AATAAAAAAT AGCAATGTTT TTACAACAAT GTCATTTACC 3840

AAATAAAATA TTGCACCATT AATACCTGCA AACGTGTTTG TTCCTAAACC TAAAATGATA 3900

45 AATCCTATTG AGATTATGAC TTGGTAAGCT GCAATCTTTT TAATATCTTT ATAAGCAATG 3960 

ACACCTATAG CGCCGATGAC CATAGTTATA GCAGCCATAG TTGCTAGCAA TGGATGTATG 4020 

AGATCATTAT GTTGATCAAA TAGTAAAGTG AAGAATCGAA TTAATGCATA GGCCCCTACT 4080 

TTGGTCATTA ACGCTGCAAA TAATGCTGCA AGCTcAGTAT TTAACACAGC GTAQGCTTTG 4140

GGTAGCXACA TAAAAAGGAC CAGCGCTGCT TTCGCACTAA ATGCGACTAA GAAGATTAAT 4200 
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AAGTTTAATG TACCTACTGT TTTATAAAGT AAACCTATAC CTAATAAGAA TAGCCATGAA 4320 

CCAATAATAT TCAAGACAAC ATAAATAATT GCAGCACGTA ATTGTTCTAC AGATTGTCCA 4380 

AGTGTAATGA GTACAAATGA CGCTAGTAAC ATAATTTCAA ACATGACGTA TAAATTAAAT 4440

AAATCTGATG TTAGAAAAGA GCCTATCACG CCAACACTTA AAAATAATAT GAACGATGGC 4500 

AAGTGATAAC GATTTGCTTT ATGTTCGCCA CGCCCAAATC CGTATGCCAT AATTAAAGTA 4560

ATCACAAACG AAGCGGTTGT AACCATAATT AAACTTAAAG AATCTCCTAA AAACTGTATA 4620 

CCAAAGGGCG CTGACCATCC TCCAAAGTCT AGCGTAATTG GACGGTGACG CTGAACATAA 4680 

ATTAATAGCA TTAATGAAAT AATTGTGGTG ATAGTCATTG TACCTAAGTA TAAATATTTA 4740 

GAAATACGAT CATTATTTTT TAAAAATACA AGGATTAAGG CACAAAGGAA TGGTAATAAC 4800 

ATTGGTAAAA TCAATAAGTT ACTTAGCATC ATCTTCCCCC CTTAGGCCTT CAATTTCATC 4860 

TTCTXTTGrr ACTTTATAAG TTCTATAAAC AAGTACAAGT AAAAACGCAG TCATCCCAAA 4920

CCCTATAACT ATTGCAGTTA GTACAATAGC TTGTAACAAG GGATCAACAA ACAATTGGTT 4980 

TCCACCAGTT ATTAGTGGTT CTGATCTACT AGAACCATAC GTTCCCATAC TCATAATAAT 5040 

GAGATTACCA GCATGAGTAT ATATTGAAAT TCCQATTACA ATACGAATTA AATTGATTQA 5100

TAAAATCATA TATGTTCCTA TAAACACTAA AAATCCTATA ACTAGTAATA ATATTAAATT 5160 

CATGATCGAC CTCC6CTAAG CGACAACATC ACTGTGACAA TAACACCAAC AACTGAGAAT 5220 

AAAATACCTA ATTCAAAAAG TGTTATTGTA CTTACATGAA TTTGTCCTAA AATTGGAAGT 5280 

ATCCAAGTTG TTTCATATTG AGACAAAAAT GGTTTTCCAA AAAACATAGG TATTATCGCA 5340 

GTAATAGATG ATACCAATGC TCCAATAATC ATTAAAATTC TAAAATCAAT CGGTAAACTT 5400 

TCTAAAACCT CTTCAACATT AAAAGCCAGA AACATTAAAA TAAACGCTGA ACTAAATATT 5460 

AAACCACCAA TAAACCCACC ACCAGGATTA TTATGACCTG CGAAGAAGAC ATAGAATCCG 5520 

AAAGTCAATA AAATAAATAC AACAAGTTTC GTGACCGTTC TTAACACGAC ATCATTCTCT 5580 

TTCATCTTGT CCCCTCCGAT CTTGATAATT TAATAATGtg TAAATACCTA GCCCAGTAAT 5640 

AATTAACACT AATCCTTCAA ATAATGTATC TAATGCTCTA AAGTCACCAA GTATCGCATT 5700 

TACAATATTT TTACCACCTG TTAGTTTGTC AGCTTTTAAA TAAAAGTCTG ATATTGATGA 5760

TAAACCATCT GTTTGTTGTG TAATAAAAAT TAATGATACA ACAATAAGTG CCATCAAGAG 5820 

TGATACAGAA ATTTTAATTA TTTCTCTTTT TTTGTTAGCX5 TTAGATCTTG GCACGTTTGG 5880 

TAATCTTGAA AAACTGACAA TAAATAGTAT CGTCGTTATT OTTTCAACTA CTAGCTQAGT 5940

CAATGCTAGA TCAGGGGCTT TCATTGCTAT AAAGAATAAG GTCACAACAA ATCCX3ATGAC 6000 
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GACAGTTACG ATTGCTAATA TAATTTCTAA TGCCCCAAAT TCAGAAACAT GTAACTGATG 6120 

TACrrTAGGA AGTCCaATTC GAATATAACC ATATCCAATG ATAATCATAA ATATGCCTAA 6180 

GGTCATAATA ATGTACTGGT TTAAACGATC TTGCATAACA C6TTTAAATC GCTTCGTAGC 6240

AAACTTTTCA AAATGTCGAT ATACCATCTC ATAGCTTTTT GAAACTGAAA TCTGTCTAAT 6300 

TTTACCTGTG AACACTTTTT TCCAATCTAC TTTGATTGCT AGTACACTAC CCAATAAAAT 6360 

AATGATGATG GTTAAAAGAA GCGGTATGTT AAATCCATGC CATTGCGAAA CATGTGGTGC 6420 

CAATTGATCA ATTTGATGAT TACCACCTGA TACAGCTCTT AATGCnAGAA CGATAATCCC 6480 

CTTCCCAAAT ATATnTGGTA CAAAAAAGAT TACAGGTACT AGCACCATTA aTATAAGAGA 6540 

TGGTAAACTA aACAACCATG GTTCGTGGAT ATTTTTTTTA GTAAAAACCT TAGAATCATA 6600 

nTTGtCCAA AATACTTCTT TTACCATGTA TAGTGCATAT GTGAATGTAA AAACACTCGC 6660 

AATAACACCA ACAAACACXyi TAGCTATCAT TGAAATCAAA CTAAATTGGG ATAATTGTCC 6720 

AGTTTGTGTT AATGCATCTA AAAACATTTC TTTACTTAAA AATCCATTTA AAAATGGTAC 6780 

TCCAGCCATA GATAGAGCCG CTATCGTCAT QACTAGATTC ATTTTAGGAA ATAGTTGACG 6840 

CATTCCACTT AAAATTCGTA TATCCCTTGA ACCTGCTTCA TGATCTAAAA TACCTACTCC 6900

CATGAAAAGC GCACATTTAA AGATGGCATG ATTCATTAGa TGAAATAGcG CACCArATAA 6960 

TACtnAATACA TAAATaGATG CTATTGCGTC TTGTTGGTGT TGAGCATATC CGCCACCTAT 7020 

ACCCACCATA GCCATAATCA TCCCAAGTTG ACTGATTGTA GAGTACGCTA GGATACCTTT 7080

TAAATCCCAT TGTTTTAAAG CTGTAATTGA ACCAAATAAC ATTGTTATTA AACCAACAAA 7140 

CGTAACGATA TATACGTACA TATTGCTAnG ACCTAATAAT GGTGTAAATC GAAGTAATAG 7200 

AAnGATACCA GCTTTTACCA TCGTGGCTGA ATGTAAATAA GCACTTACAG GTGTAGGTGC 7260 

AGCATTGCT CTAGGTAGCC AGTATGAAAT GGAraTTGTG CTGATTTTGT AAATGCACCT 7320 

AATAAAAACA TAAAAATCAT AGGGATAAAC AATCCATGAT TCTTAATATG ATCTGCTTGT 7380 

CCTAATATCT CTGTGATGTT ATTCGTTCCT GTCATGATAT ACAGCATAAT AAAACCAACT 7440 

AATAACGCCA ATCCACCAAA TACTGTAATC ATAAATGATT GAATCGCACC AAATTGACTG 7500 

TCACCATTGT TATACCAATA TGAnATCAAT AAAAATGATG ATmCACTCGT TAATTCCCAA 7560 

AAaATGTACA TCmATATCGT ATTGTCTGAT AATACaaTAC CAATCATACT GAACATAAAT 7620 

AACGTTAAAT AAAAATAAAA CCTTGGTAAA TTGTCTTTTC GAGAGGATAA ATATTGAGTT 7680 

GCATAGAAGA ATACTGCAAT TCCAATAAGT GAAATAATAA GAGAAAACAT TAAACTTAAA 7740

CCATCTAAAC GTAAATCTAA ATTAATATCT AATGTCTTAA TCCATGGAAT AGAGGTAGAA 7800 
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GGTGCAACCA ACGCTATGTA CCCX3GCATAT TTAGCCAATG CTCTACGTTT AGACATTAGA 7920 

AGTATCATCG CCATAATCAC AAGTATAGCA ATTAATAAAT AAACCAAACT CATTATTAGC 7980 

CTCCTTTGTT TCTATAATTG TAATGAAATA TAAATACTAT GTTCACACTC ATTTTCTAAA 8040 

CCGATAAAAT TTAGTGTTTC AATAGCAGAT TGATGCCCTA AATACTTTTG AATGACTGGT 8100 

ATAAGTATAC CTTTTTGATA AGCATGATAT GCAAATGTCT TACGCAATGT CGTTAGTCCT 8160 

ACATTATCTA TACCAGCTTC AATTGATGCT TGGTGAATTA TTCGATATGC TTGCTGTCTA 8220 

GATAATACTT GATTTGTTCG TAGTGATTGA AAAAGAACGT CTTCATTCQA AAGACTCCTG 8280 

TCCTCTATAT ATTGAAOTAG TTCTTTCGAT AATGTTTCTG GTAACCTAAT TTTAATCAA 8339 

(2) INFORMATION FOR SEQ ID NO: 176: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 588 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176:

CCCGATTTTT TTACGTAATC TAATACATAC GGCAAAATCA ACTTTAATCA AAAAAGACTC 60 

ATACACAATG CCTTTAAAGC ACATGTATGA GTCCTTTTTA GTAGTTTATA TCAAAAAATA 120 

GTTTAATGTA TAAATTAGTT TTTGTTTACA GATGCGTCGT AGATTGATTC TACAGCATCA 180 

CCTAAAGCTT TATCGAATTC TTCTTTAGAT TGATCAGCTC TTAAATCACT AGCTAATGCA 240 

CGTGAGAAAC TTGCGATAAG TTCAGCGTTA TCTTTAAGTA ATTCATTTGC TTTTTCTCTG 300 

CTGTAACCAC CTGATAATAC AACGACACGA ACAACATTAG GATGTTCAGC TAACTCTTTG 360 

TATAAGTTTG GTTCAGTAGG AATTGTTAAT TTCAACATTA CTAATTGATC AGCATTTAAG 420 

CTATCTAAAC CTTTTTTAAG TTCAGCTTTT AATACTTTTT CAATTTCAGC TTTGTCTTTT 480

GCATTAATAT TAACTTCTGG TTCGATAATT GGAACTAAAC CTTTAGCAAT AATTTGTTTA 540 
GCAACTTCAA ATTGTTGTTC AACAACGTCT TTGATACCTT GCTCATTT 588 
(2) INFORMATION FOR SEQ ID NO: 177:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2841 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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ATAGAGTnCT GGnACTTACT ATGACATATG GCGCTAGAAT GGCTGAGCCA GGTGAATTTA 60 

CAAAACX5TGC CTTTTTAAAT GGTCGTATTG ATTTATCTCA AGCTGAAGCA GTTATGGACT 120 

TTATTCGCTC GAAGACAGAT AGAGCTTCTA AAGTTGCGAT GAATCAAATT GAAGGTC6TC 180 

TAAGTGACTT AATCAAAAAA CAACGTCAAT CTATATTAGA GATACTCGCT CAAGTGGAAG 240 

TGAATATTGA TTATCCTGAA TACGATGATG TTGAAGATGC GACTACTGAA TTTCTTTTAG 300 

AGCAGTCTAA AGAAATCAAA CAGGAAATTA ATCGTTTATT AGATACCGGT GCGCAGGGTA 360 

AAATTATGCG TGAAGGTTTA TCTACAGTTA TTGTTGGTAA ACCAAACGTA GGTAAATCAT 420 

CGATGTTAAA TAATTTAATA CAAGATAATA AAGCGATTGT AACTOAGGTA GCAGGTACTA 480 

CTAGAGATGT CTTAGAAGAG TAC6TCAATG TTCGTGGCX5T GCCATTAAGA TTAGTTGATA 540 

CTGCTGGTAT ACGTGAGACA GAAGATATAG TAGAGAAGAT TGGTGTTGAA CGCTCTAGAA 600 

AGGCTCTTAG CCAAGCAGAC TTAATTTTAT TTGTATTAAA CAATAACGAA GCATTGACwC 660 

AAGAAGATTA CACATTATAT GAAGTGGTTA AAAATGAAGA TGTAATCGTA ATTGTTAATA 720 

AAATGGATTT AGAGCAAAAC ATAGATATTA ATGAAGTTAA AGATATGATA GGTGATACGC 780 

CATTAATTCA AACTTCAATG TTAAAACAAG AAGGTATTGA TGaATTAGAA ATACAAATTC 840 

gAGATTTGTT CTTTGGTGGa GAAGTACAAA ATCAAGATAT GACTTATGTT TCTAATTCAA 900 

GACATATTTC ATTATTAAAA CAAGCAAGAC AAACGATACA AGATGCGATT GATGCAGCAG 960 

AATCTGGTGT GCCTATGGaT ATGGTACAAA TTGATTTAAC TAGAACTTGG GAAATATTAG 1020 

GAGAAATTAT TGGTGAGACT GCAAgTGATG AACTCATCGA TCAGTTATTC AGTCAATTCT 1080 

GCTTAGGTAA ATAGTAATTG AAATAGACGG AATACC6TCT TAAGAAGGCT AGTAAGATAT 1140 

CAAATAAGGA GGTTTATATT GTGGTTCAAG AATATGATGT AATCGTTATA GGTGCGGGAC 1200

ATGCAGGTGT AGAAGCAGGT TTAGCATCTG CAAGACX5TGG TGCTAAAACA TTAATGCTAA 1260 

CAATAAATTT AGATAATATT GCATTTATGC CATGTAACCC ATCTGTAGGT GGACCAGCTA 1320 

AAGGTATCGT TGTTCGTGAA ATTGATGCTT TAGGTGGACA AATGGCAAAA ACAATCGATA 1380

AAACACACAT TCAAATGAGA ATGTTAAATA CAGGTAAAGG ACCTGCTGTA AGAGCACTAA 1440 

GAGCGCAAgc AGaTAAAGTA CTTTATCAAC AAGAAATGAA ACGCGTGATT GAAGATGAAG 1500 

AAAATTTGCA TATAATGCAA GGTATGGTAG ACGAACTTAT TATAGAAGAT AATGAAGTTA 1560 

AAGGTGTACG TACAAATATT GGTACAGAGT ATTTATCTAA AGCAGTAATT ATTACAACGG 1620 

GAACATTTTT ACGTGGTGAA ATCATTTTAG GTAATATGAA GTATTCAAGT GGACCAAATC 1680 

ACCAATTACC ATCAATCACA TTATCAGACA ATTTAAGAGA ACTTGGTTTT GATATTGTTC 1740 
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20 



AAATACAACC AGGTGACGAT GTAGGTCGTG CATTCAGCTT TGAAACAACA GAATATATAT 1860

TAGATCAATT GCCATGTTGG CTAACGTATA CTAATGCTGA AACACACAAA GTTATCGATG 1920

ATAATTTACA TCTATCTGCA ATGTATTCAG GGATGATTAA AGGAACCGGG CCACGTTATT 1980

GCCCTTCAAT TGAAGATAAA TTTGTTCQAT TTAATGATAA GCCGCGACAT CAACTTTTCT 2040

TAGAGCCTGA AGGTCGTAAT ACAAATGAAO TATATGTGCA AGGATTGTCT ACAAGTCTTC 2100

CTGAACATGT GCAcGTCAAA TGTTAGAGAC GATACCAOGT CTTGAAAAAG CAGATATGAT 2160

GCGTGCCGGC TACGCAATTG AATATGATGC GATT6T0CCA AC6CAGTTAT GGCCTACACT 2220

TGAAACGAAA ATGATTAAAA ACTTATATAC TGCAGGTCAA ATTAATG6TA CATCTGGTTA 2280

TGAAGAAGCA GCAGGACAAG GATTGATGGC AGGTATTAAC GCTGCAGGTA AAGTGTTAAA 2340

CACAGGCGAA AAGATATTAA GTCGTTCAGA TGCATATATT GGTGTCTTAA TCGATGATCT 2400

TGTAACTAAA GGTACTT^TG AACCTTATCG TTTACTAACA TCACGTGCAG AATATCGTTT 2460

GTTAcTACGT CATGATAATG CTGATTTGAG ATTGACGGAT ATGGGATATG AACTTGGTAT 2520

GATTTCTGAA GAAAGATATG CACGTTTTAA TGAAAAACGT CAGCAAATTO ATGCGGAAAT 2580

TAAGCGTTTA TCAGATATTC GTATTAAACC AAACGAACAT ACGCAAGCGA TTATTGAACA 2640

ACATGGTGGT TCTCGCTTAA AAGATGGTAT TTTAGCTATC GATTTATTAC GCAGACCTGA 2700

AATGACTTAC GATATAATTT TAGAACTTTT AGAAGAAGAA CATCAATTGA ATGCAGATGT 2760

TGAAGAACAA GTAGAAATAC AAACAAAATA TGAAGGTTAT ATCAATAAAT CACTACAACA 2820

AG7TGAGAAA GTTAAGCGTA T 2841



(2) INFORMATION FOR SEQ ID NO: 178: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3025 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178:



ATCTAATTTC AAACCCOGTG ATAAATTGCC AAGCGTGACG CAATTAAAAG AACGTTATCA 60

AGTAAGTAAG AGTACTATCA TTAAAGCATT AGGCTTATTG GAACAAGATG GTTTGATCTA 120

TCAAGCACAA GGCAGTGGTA TTTATGTGAG AAATATTGCT GATGCCAATC GTATCAACGT 180

CTTTAAGACT AATGGTTTCT CTAAAAGTTT AGGTGAACAC CGAATGACAA 6TAAGGTACT 240

TGTTTTTAAG GAGATTGCAA CGCCACCTAA ATCTGTACAA GATGAGCTCC AATTAAATGC 300
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CGAATATTCT TATTATCATA AAGAAATCGT GAAATATTTA AATGATGATA TTGCTAAGGO 420 

CTCTATCTTC GACTATTTAG AATCAAACAT GAAACTTCGT ATTGGTTTTT CAGATATTTT 480 

CTTTAATGTA GATCAACTCA CTTCAAGTGA AGCTTCATTA CTACAATTGT CTACAGGTGA 540 

ACCATGTTTA CGTTACCACC AGACTTTTTA TACAATGACT GGCAAACCCT TTGATTCATC 600 

TGACATCGTA TTTCATTATC GTCATGCACA GTTTTATATT CCTAGTAAAA AGTAATAAAT 660 

ACATAAAAAC GTCTATATCC CAGTTATAAA CTGGAGTATA GACGTTTTTT TACGATAATA 720 

ACAATGGCTC AAATTGCTAT TATCTTGCTT AGGTTTTTCG TTTTAGAAGA ATATTGCTAC 780 

AAAGACAGGC ACAACTGCTA CAACAACTAC ACCAACTAAC ACTAAAGCTA TACTTGCCAT 840 

TGATTCTTCT ACAGGTCCTA ATTCTTTGGC TGGTGCTACA CCTAATGTGT GACCACTTGT 900 

TCCAAGTGCT AATCCTCGGG CAATAGGGTT AGTAATTCGG AAAAGCTTTA AGAATTTATT 960 

ACCTAGGGCA TAAATAATGA CACCATTTAA AATAACTGCT AATGATGTTA ATTCTTTTAT 1020 

ACCACCGATA CCAGCTGATA CTGGTAACGC AATCGCTGTA GTrGCTGCTT GAGGTAACAT 1080 

TGATAAAATA ACATCATTGG CAAATTGTGC TAACTTCGCA AAAGTTAAAA TAATTAATAA 1140 

CGCTACAACT GTACCGATAC CAATACCTCC GATGATACGA TGCCAATGTT TAACAAGCAC 1200 

TTCACGCTTT TTATATAACG GAATCGCAAA ACAGATTGTT GCCGGTTCTA AGAAGAAGTA 1260 

AATAATGTCT CCACCTAITT TGTAAGTCTT ATACGGAATG CCTGTTAAAT AGAGGAA6GC 1320 

CACACCAAAT ACCATACTGA CAAATAGCGG TGCGAATAAG AAGAAACGAT TAGTTTTTTC 1380

AAATAATATG GTCGCTAAGA AAAATGGTAT AACGGATAAC AGTATTCCGA AGTAAGGTGT 1440 

GTTTaGTGCT AAGTGGTTAA TCaTGAGCTT GTGCCTCCTC TATTTTGATC TTTTTTGTGA 1500 

CTTTGTCACC TTTAGATCTC GAAGTAACTT TCATAATAAT TTgTGTQACA TAGCCAGTAC 1560

AAATJ^GTAA TAGTATTGTT GAGACGATTA TTAGTCCAAT GATTAAAAAT GGTGCTTGGC 1620 

TAATCACACC TAAAGAGTTA ACAACTGAGA TACCG6CTGG TACGAAGAGT AAGCCAATGT 1680 

TATTTGTTAG TGTCGTTCCT ACTTTTTOGA CTTCGCCTAA CTTAACAGCA CCAGTACATA 1740 

ATAATACAAA TAATAATACT AAACCGATTA CTGATGCAGG CATAGGAATT GGCATAAATG 1800 

ATTCAATTAT TTTCXIATACA AAGAGTACTA AAGCAATTAC AATGACTTGG TGAAAAAAGT 1860 

GTGCTGGTTT TGATGCGTCT TTTTGTTGTT TCACGACCAT TGCCTCCTAC GTTTGATTTA 1920 

ACTAAAGTAT AGATGGCTCA CTTCGAITTG CGTGATT T TT AGTCCGAAAT ACAAAATATC 1980 

ATAGGTAAAA TGCATAAAAA AAAGGATTAC TGTTAAAGTA ATCCTATCXSA CGCTTTAAAA 2040 

TCTTTCATAA ATGAACGTCC AACTTGCATC TTGACACCAT TTGTCAATAT TACCATATAA 2100 
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25 



TGAATACGTA TAAAATAAGT GGGATTCAAT CGmrrcAT aacgattcaa tggctctgtt 2220

GTTTOGTATT TATGATT06T TGTATGTATG GTTGTAATAC CATTATGTGT GCCAATCCCA 2280

ATAATATTTT GTTGCTTTAA CATGTGAATT ttatcgtcaa TTTCAACAGG TAAGCTTTGA 2340

TCAAAATTCG CCGACATATC ATTCGCAATT GCACTTGCGT TATTATCATC TTTGGCTTTA 2400

GTCGCACGCA CTTTATTGAC TGCTTGTTCA ATACGTTTTT GACCAAACGG TTTCAAAATA 2460

TAGTCTGTCG CATTTAATTC AAATGCCTGT ACTGCGTATT GGTCATGTGC AGTTGCAAAA 2520

ATAATCGCAG GTGGCTCTTT CATCTTTTGA atcttagctc CTAATTCGAT CCCATTTTCA 2580

TCCATTAAAT TGACATCTAA AAATATAATG TCATATTGAT TGATCAGTAG TGCTTCCAAT 2640

GTTTCTTTTA CATTTTCTGC CTCATTAATT TCTTCAAAAC CACCAATTTC ATTTAATAAA 2700

TATGTTAATT CATTACGTGC TAATGGCTCA TCATCTATGA TTAATGCTTT CATATTTATT 2760

accaaaaagt GGTACCGCTC GATGTCGATT

CAAATTGTAA TGCTGCGGAT TTTCCAAATA ATCCTTTTAG GCGTAAGTTT AAATTTTCTA 2880

AAGCACTACC AGTTCCAGAC TCTGATTCTA CAGATGTnTC TCCCaACAAA TGCATTTTAT 2940

CTTTAGAAAT ACCCTGACCA TTATCTTGTA CAATAATACG TACATGTGTT GCAGTiTClT 3000

TAATCACTGA CACGTCAATA TCGTT 3025



(2) INFORMATION FOR SEQ ID NO: 179: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1689 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179:



ACAGAATTTC ACAGCATTTT TAGATGAAAA AATAAGCCAG TCATAGOGTT GATTTAACAA 60

ATGAATATCA AAATTTAGTG GCTTTATATC AATAAAGGGT TTGTGAATAA TTGATACTAA 120

ATCACTTTGC ATGTCAiTltj TTTGTGTCAT AACTACAACT GGCTTCATAT TTAAACGTCA 180

CTCChTThTT TAATGTTGTT CATTTAAGCG TTITATAATT TCATAAGCAC CTTGCTCTTT 240

TAATTTGTTA CTCACTGTTT TGCCTAACTC AACCGGATCT GTTCCGTTCA TTGTATATTC 300

AAATCGTTCT TTACCATCTG GGGTCATAAT TAAACCTGTA AATTCGATTT CGTTTTGATC 360

TGAGATTGTA GCATATCCTG CAATTGGCAC CTGACAACTA CCATCCATTT CTGCTAAAAA 420

CGTTCGTTCA GCAGTCACAC ATTTTGCAAC CTCATCATTA TGTACTTTGC TTAATAATGT 480
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TAACAATGTA TCTCTATCAA GATAAGATGT TnCAATATCA TCTGACCAGC CCATTCTTCT 600 

TAAACCAGCT GCAGCTAAAA TAATCGCATC ATAATCTTCA GTTTGTAACT TTTCTAATCG 660 

TGTATCTATA TTACCTCTAA TCCATTTAAT CTCTAAATTA GGATACTTAG ATAATATTTG 720 

TGCACCACGA CGTAATGAAC TAGTACCAAT AATACTGCCT TCTGGCAATT GGGATAGTGG 780 

TGTATGTGTT TTAGAAATAT ACGCATCAAA AGGTAATTCT CTATCAGGGA TACAACCTAA 840 

TGTTAAACCT TCCGGAATTA CACTTGGTAC GTCTTTAAGC GAGTGTATTG CCATATCGAT 900 

ATTTTTTrCA AAAAGTTCAT GTTGTATTTC TTTAACAAAT AAGCCTTTGC CTCCGACTTT 960 

AGACAATTGT TTATCTACTA TACX5ATCGCC TTTCGTGaCA ATTTCTTTAA TTTCAATTTC 1020 

TAGATTTGGC TCGACAGCTT TTAATTTATC AATAAATTGC TGGCTTTGTG TTAAAGCTAA 1080 

TTTACyTCTT CTGGAGCCAA CGACTrATTT ACX5CATGTTC AATTCCTCCT AGGAACGGAT 1140 

TGCTCTAGAT TATTTTCTCA ATTCACAAAA TGTGTTGCAA AAAATAAATT AATCATATTT 1200

AAGCAAAATA AAATAATGTT ATAGTATATT AAATATCTTQ AATTCAACCA TTTGTTGATT 1260 

CTAAGTAAAA TATAACTTCC ATATAATACT GTAATAATTG AAGAGAGTAT TACCTTCGGQ 1320 

TCAATGAATA TACGTTCACC AACTGAAATT ACACCCCACT GTGTACCTAA AATAATACTA 1380

AATATGAGAA TTATCCACCC ACTTAACGTT GAGTAAAACA CAATTGATTC AAGTGTAGCA 1440 

ACGCTACCAA TTCTAAAGTA TTTTTGATCA AAACGTTTTT CCTTCAAATT ACGGTATTGC 1500 

ATGATATACA GTAATGCATT GACAAAAGCT AAGGCAAAGA AGACATAACT TAACACAGCT 1560

AGACCGATAT GGACTAACAG TAACTCGTCT ACAACAGCAA TTTTCTGAAC CTTATTAGTA 1620 

TAATGTGTCG GTTGAAATGT ATTCATCCCT AAnAGTGTTA ACCCTATTAA ATTCCAAGGA 1680 

AAAACACA6 1689

(2) INFORMATION FOR SEQ ID NO: 180:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1209 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180: 

nTGGnTGGCT TTTCCTATTG GACCAAATGG ACCnTTTACC TGGCCnTTCC CAGGACACCC 60 

CGCTTGTGCC CACATTCCAA TCGGAAAAGG TGTATGTGGT ACAGCCGTTT CAGAACGTCG 120

TACACAAATT GTAGCTGATG TTCATCAATT CGAAGGACAT ATCGCTTGTG ATGCTAATAG 180 
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IS 



CGATGCCCCT ATAACGGATC GATTTGATGA CAATGACAAa GAaCATCTTG AaGCAATTGT 300

TAAAATTATT GAAAaGCAAC TCGCATAAAA GGACATCAGC ATTTTCAATA AAGTGTTGAC 360

AGTTAGCAGG AAAATGTTAC AATAATCTTT GTGTGAATTA ACGAAAGTAG CAGTTGTATA 420

TTATTGAGCG CTATGTTGTT CCCAATGCGG ACGTGTCACG TAACTGTCGC TATAAGGTGA 480

AGACACATAA AACAATATAT CTTAGTAAGC ATGCAACACT CTTTTTTGTT TATTCATAAC 540

AACAAAAAAG AATTAAAGGA GGAGTCTTAT TATGGCTCGA TTCAGAGGTT CAAACTGGAA 600

AAAATCTCGT CX3TTTAGGTA TCTCTTTAAG CGGTACTGGT AAAGAATTAG AAAAACGTCC 660

TTACXtCACCA GGACAACATG GTCCAAACCA ACGTAAAAAA TTATCAGAAT ATGGTTTACA 720

ATTACGTGAA AAACAAAAAT TACGTTACTT ATATGGAATG ACTGAAAGAC AATTCCGTAA 780

CACATTTGAC ATCGCTGGTA AAAAATTCGG TGTACACGGT GAAAACTTCA TQATCTTATT 840

AGCAAGTCGT TTAGACGCTCt ATTAGGTTTA GCTCGTACTC GTCGTCAAGC 900

AC6TCAA7TA GTTAACCACG GTCATATCTT AOTAGATGGT AAACGTGTTG ATATTCCATC

AAACCTGGTC AAACAATTTC AGTTCGTGAA AAATCTCAAA AATTAAACAT 1020

CATCGTTGAA TCAGTTGAAA TCAACAATTT OGTACCTGAG TACTTAAACT TTGATGCTGA 1080

CAGCTTAACT GGTACTTTCG TACGTTTACC AGAACGTAGC 6AATTACCTG CTGAAATTAA 1140

CGAACAATTA ATCCGTTGAG TACTACTCAA GATAATACGG TCAATACCAA CACCCACAAT 1200

TGTGGGTGT 1209



(2) INFORMATION FOR SEQ ID NO: 181:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 698 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181:



AAATCCCTTt GTtaAAgTsC AAAtTTTTCc AACrgCTTTA AtArGACCCA TATTACCtTC 60

TTGGATTAAA tCmAGGaATG AcATACCACG ACCaCGTATC TTTTAGCAAT ACTTACAACT 120

AAACGTAAGT TCGCTTCTGC AAGTCTTGAT TTTGCTACTT CATCACCTTG TTCAATACGT 180

TTGGCTAATT CGATTTCTTC TTGTGCACTT AATAAGTTAA CACGCCCAAT TTCTTTAAGG 240

TACATACGAA CTGGGTCATT TATTTTAACA CCTGGAGGGG CACTAAGATC ACTTGGATTC 300

AGTTTCTCGT CAGTATCTGA ACTATCTTTT TCATTAACTA GTGAAATATC ATTATCATTT 360
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GCAATTTCTT CATGACTTAA ATGACCCTCT TTTTTACCTT TTTCAATTAA TTGCTTCTTA 480

ACATCTTCTA ATGTTAATGT CGGATCAATT GTTTGTmT TAATTTTAAC TGTGTTATCA 540 

GACATGAAAC GGCCTCCCGA TTTTAAATAT GAACATTCGA AATTTATTCA ATATTGCTAT 600 

TTTAAACGAA ATTCTTAATT AATTCCATCC ATATTTTnAA TTTTATTTTA CAAATTGGGA 660 

ACTAAATCCC CAATATTTAT TTTTCAATAG TGGTGGTT 698 

(2) INFORMATION FOR SEQ ID NO: 182: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5147 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182: 





ACTTGATGAT GTATACAATG TATTTCAAGA ATATTA r WtA AAAACATCTA ACATTAAGTT 60

TTGTAGAATT CACAATTCTA GCTATTATCA CTTCTuAAAA TAAAAACATC GTTCTTCTTA

AAGATTTAAT TGAAACAATC CACCATAAAT ACCCTCAAAC TGTTAGAGCT CTCAATAATT 180

TAAAAAAGCA AGGCTATCTA ATAAAAGAAC GCTCAACTGA AGATGAAAGA AAAATTTTAA 240

TTCATATGGA TGACGCGCAG CAA6ACCATG CTGAACAATT ATTAGCTCAA GTGAATCAAT 300

TATTAGCAGA TAAAGATCAT TTACATCTTG TTTTTGAATA ATATCTCTAT TACGCAAGTG 360

TGCTGTATTC TAAAGTGCAC TTGTGTTTTC TATTTTTTAA TAAAACCTCA GCACATAATG 420

AACAACTTTC TATTTTCTAT ATCACTTAAA ACCATTTCCG AAATTAAACC TCAGCACATT 480

CAAAGCCCCA CTTTATTCTT AAAAATATTT TTTAACTCAT ATGTATTAAA CCGCTTTCAT 540

TATAAAAAAT ATCTCTATAT TtTATCTGtT TtTATTAATC GAAATAGCGT GATTTTGCGG 600

TTTTAAGCCT TTTACTTCCT GAATAAATCT TTCAGCAAAA TATTTATTTT ATAAGTTGTA 660

AAACTTACCT TTAAATTTAA TTATAAATAT AGATTTTAGT ATTGCAATAC ATAATTC6TT 720

ATATTATGAT GACTTTACAA ATACATACAG GG6GTATTAA TkTGAAAAAG AAAAACATtT 780

ATTCAATTCG TAAACTAGGT GTAGGTATtG CATCTGTAAC TTTAGGTACA TTACTTATAT 840

CTGGTGGCGT AACACCTGCT GCAAAtgctG CGCAACACGA TGAAGCTCAA CAAAATGCTT 900

TTTATCAAGT CTTAAATATG CCTAACTTAA ATGCTGATCA ACGCAATGGT TTTATCCAAA 960

GCCTTAAAGA TGATCCAAGC CAAAGTGCTA ACGTTTTAGG TGAAGCTCAA AAACTTAATG 1020

ACTCTCAAGC TCCAAAAGCT GATGCGCAAC AAAATAACTT CAACAAAGAT CAACAAAGCG 1080
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AAAGTCTTAA AGACGACCCA AGCCAAAGCA CTAACX5TTTT AGGTGAAGCT AAAAAATTAA 1200 

ACGAATCTCA AGCACCGAAA GCTGATAACA ATTTCAACAA AGAACAACAA AATGCTTTCT 1260 

ATGAAATCTT GAATATGCCT AACTTAAACG AAGAAC7VACG CAATGGTTTC ATCCAAAGCT 1320

TAAAAGATGA CCCAAGCCAA AGTGCTAACC TATTGTCAGA AGCTAAAAAG TTAAATGAAT 1380

CTCAAGCACC GAAAGCGGAT AACAAATTCA ACAAAGAACA ACAAAATGCT TTCTATGAAA 1440 

TCTTACATTT ACCTAACTTA AACGAAGAAC AACGCAATGG TTTCATCCAA AGCCTAAAAG 1500 

ATGACCCAAG CCAAAGCGCT AACCTTTTAG CAGAAGCTAA AAAGCTAAAT GATGCTCAAG 1560 

CACCAAAAGC TGACAACAAA TTCAACAAAG AACAACAAAA TGCTTTCTAT GAAATTTTAC 1620 

ATTTACCTAA CTTAACTGAA GAACAACGTA ACGGCTTCAT CCAAAGCCTT AAAGACOATC 1680 

CTTCAGTGAG CAAAGAAATT TTAGCAGAAG CTAAAAAGCT AAACGATGCT CAAGCACCAA 1740 

AAGAGGAAGA CAATAACAAG CCTGGCAAAG AAGACAATAA CAAGCCTGGC AAAGAAGACA 1800 

ACAACAAGCC TGGTAAAGAA GACAACAACA AGCCTGGTAA AGAAGACAAC AACAAGCCTG 1860 

GCAAAGAAGA CGGCAACAAG CCTGGTAAAG AAGACAACAA AAAACCTGGT AAAGAA6ATG 1920 

GCAAO^CC TGGTAAAGAA GACAACAAAA AACCTGGTAA AGAAGACGGC AACAAGCCTG 1980

GCAAAGAAGA TGGCAACAAA CCTGGTAAAG AAGATGGTAA CGGAGTACAT GTCGTTAAAC 2040 

CTGGTGATAC AGTAAATGAC ATTGCAAAAG CAAACGGCAC TACTGCTGAC AAAATTGCTG 2100 

CAGATAACAA ATTAGCTGAT AAAAACATGA TCAAACCTGG TCAAGAACTT GTTGTTGATA 2160

AGAAGCAACC AGCAAACCAT GCAGATGCTA ACAAAGCTCA AGCATTACCA GAAACTGGTG 2220 

AAGAAAATCC ATTCATCGGT ACAACTGTAT TTGGTGGATT ATCATTAGCC TTAGGTGCAG 2280 

CGTTATTAGC TGGACGTCGT CGCGAACTAT AAAAACAAAC AATACACAAC GATAGATATC 2340

ATTTTATCCA AACCAATTTT AACTTATATA CGTTGATTAA CACATTCTTA TTTGAAATGA 2400 

TAAGAATCAT CTAAATGCAC GAGCAACATC TTTTGTTGCT CAGTGCATTT TTTATTTTAC 2460 

TTACTTTTCT AAACAACTTC TGAAACGCCT CAACACTTTC TACTCTGATT ACATATATGA 2520 

CATTTTTAGG CATTAAAAAA TCGAACTAGA CAAGATGCTC ATTGCATTTC GTACTAGTTC 2580 

GATTCATGAA TAATTAGATT TAAAATGTCA TTTGAATCCA AGTGACAACA TTATTTATAT 2640 

TTAGAATATT AACGTTAGTA TAAACGTCCA AACACAAATA AAAGCAACAA ATATAATACT 2700 

GTATTTTAAC GTCATTTTTA ATAATGCAGA TTCTTCACCA ACTTTTTTAA CAGCTGCAGT 2760 

CGCAATGGCA ATTGATTGTG GTGAAATAAG TTTCGCTGCT ACACCACCTG CAGTGTTAGC 2820 

TGCCACAAGT AATGAACCGC TTGTTGAAAT TTGTTGTGCC ACTGTCGCTT GAATAGGTGC 2880 

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TC3GAGAGAAT AATGGGAAAA TTGCTCCCGC TTTAGCAATA CCTTGTCCAA TTGCTACAGT 3000 

CAAACCACCG TATGTCATAA CTTTAGCAAT AGCTAGGATA GCTGAAATTG TAAGGATCGG 3060 

TAACCATAAT TCTTTAATTG CTTCGACCAA TAAAGCACCT GCACTTTTCC ATTTTAACTT 3120 

CGTAATTAAA ATTGTAATAA TTACTGTTAA TAAAATCGCT GTCCCAGTTG CACCAATTAA 3180 

ATCGAGACGC AACGCAATTC CTTTAGGCGA TAAATCACTC ACAGTATTTG GAATTGGCAA 3240 

TTTTATTACT AAACTTTCAA GT6CACCTCC AGGTTGGAAT AATTTTTTGA AGAATGGTGC 3300 

ACTCCATACT AATACAAAGG CAGTTAAAAT TACGAACGGA CTCCAAGCAA AGACAATTTC 3360 

TTTAGGCGTT CX5TTTTTGAA TTTTATGTTC AGACGCTTCC AATCTGAAAA TGTTTTTCXSG 3420 

TTTAAATTTA CGACAAACAA ATGCTAACAC CACCATTGTT GCTAGTGATG GAATAATGTC 3480 

TGCTAGTTCT GGACCATGGA ATATTGTTAA TAATAATTGT AATCCAGTAT ATGTACCACT 3540 

CACTGTTAAA ATGACAGGTA AAATTTCTTT AATACCTTTC ATACCATCTA CAATGAATAC 3600

TAAAACAAAT GGAATAATAA AGTTTAAAAT TGGAAGTGTT AATGCTGAGT ATCTCGCAAC 3660 

ATCTAATGTT GTAACGCCTC CACTTAAGTT AAACGTATCA ATAATACTAA CTGGTAAACC 3720 

AATTGCACCA AAGGCACCCG CCGCACCATT AGCAATTAAA CATAACATCG CTGCTTTTAA 3780 

TGGTTCAAAT CCAAGTTGAA TTAATAATAC TGCACAAATC GCAATTGGCA CACCAAATCC 3840 

TGCTGCACCT TCTAAAAATG CGTTGAAACA AAATCCAATT AATAATAGTT GGATTCTTTG 3900

GTCCACTGAA ATACTTGCAA TACTATCTTG AATAATAGAA AATTGTCCTG TTTTAATAGA 3960 

AACTTTATAT AACCAAACTG CCATTAAAAC GATATATCCT ATTGGGAAAA TACCGGCAAC 4020 

AACGCCTTCT GTAATCGCAC CTGCTGATAC ACGCGCTGGT AATTCAAATA CAAATAAAGC 4080 

CACAATCAAT GTAACAACCA AAGTTGTCAA TGCTGCATAA ATGCCTTTCA TTTTAAAAAC 4140 

GGTTfiAGCAT AATAAAAATA AAATAATAGG TACTGCTGCA ACTAAGGCTG ATAATCCGAC 4200 

ATTATCX3AAT GGATTTACAG TAAGTAGTGT CATAATGACT CCCTCTCTTT ATATAAAATA 4260 

TTTATCATTC TGATTAATCT ACAACCTATT TCAACTTATA TTTTGCGATG ATCACATATT 4320 

TAAAATGTAA CACTCCTATA TGTGACAGGC AATCGAATTT TTACAAAAAG TTCACAAAAT 4380 

ATACACAATA TTTAACTATA ATAmATAATA TATCaTlltTA ATTATAAATA CTAGATATTA 4440 

TTTATAATAA TCTCAGGAAT TCGCTTCAAA ACTGCATCAT GAGAGTTTAT ATTTTTATTG 4500 

AGAATCTCTC ATTTTATGAA TTGTAGGAAG TAAACAAAAT ATGACAAGCG TCAAACCAAT 4560 

GATAATGATA AATATCATAT TAAACCATAG TAAATTGAAT TGATGATGGT GTTGTATTTG 4620 

CCAAATTTCT AATACTCTGA AGATAGACAT ATAGCTCATA ATCTCTAAAT TTAACGTACT 4680 
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10 



AAATCGTTCA TAGTATCTAC CTGCAATGAA AAATATAAGC CAAATCACTA TAAATGCGCT 4800 

ATTAATCAAA AGCAGCACCC ATTTATCAGC AAAATTATCA GCATCCCCTG CTAAATTATA 4860 

ATGAATAGGC ACTTTGGTTG GTAATTTTGG ATAGGTCACT ACTGTATAGC ACATCATAGC 4920 

TAAGTAAATA AGTAGACTTA ATATTGTAAA AGACCTGATT TTAGACATTC TATCGCCTcT 4980 

TcTTTACATT TTATGTATAA CACTCTGCCT ATTTTACCTT TTAATaCATT ACCCCAAcGA 5040 

TtAAaCAATA tGTAaTGATA CTATAATTGC GTCAGGAGTA TCCGCTTGTT AAATGTGCAT 5100

AGCTTATATT TAGCTGTTTA ACATGCCACA TAATGATTCG AATTATT 5147 
(2) INFORMATION FOR SEQ ID NO: 183:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1312 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183:

CACTTACTTC CACCATTATC ATAACTTTAA AATGGATATA nTTCATCAAA CATTATCTAA 60 

AGGCGTCGCA CCTACACCAA CACCATCCAA CAATTAACTT ACAACTCTGC GATTACTTCT 120 

TCAGCAGCAA CTTTCACnTG CGTAATACAA TCAGGTAGTC CAACCGCTTC AAAAGATGCA 180 

CCAGTTACTC TAAGTCGTGG ATATGTTTGT TTAATATGTG CTTGAATCTG TCTAATTTGT 240

TGAATATGAC CGACATGGTA CTGTGGCATA CTTTTCGGCA AACGATTGAC AATTGTAAAT 300 

TCAGGATCAC CTTTAAATGT CATCATTTGA CTTAAATCTC TACGTACAAT CGATACTAAT 360

TCATTATCTG TATGATCATC AACCACAGTA TCACCTGGTT TACCTACATA CGCACQAATC 420 

AAAJS^CTTAC CTTCCGGTGT AGTAAATGGC CATTTTTTCG ATGTCCAAGT ACATGCGGTA 480 

ATGtCTGTAT CACTCGTTCT CGCAATTACG AAGCCAGTAC CATCATGGGT ATTTTCAATG 540

TCTTTTTCAT CAAATGCCAA TACAACAGTT GCAACAGTCG TACTATCCAT CGTTTTAAAG 600 

TAATCAAATG CTGGATCTTG TCCGAACCAA TTTAAAAACA CTTGATGTGG TGTCGTTACT 660 

AATACGCCAT CATACACTTC TTCTAGTTGA TCATTGTAAA CAATTTTATA TTGTTTTTGA 720 

GATGTAATTA TATCATCCAC TGACGTATTG TAGOGTATTG TCACACCTTT ATTTTTAACA 780 

TCTTGTTCTA ATGCTTCAAT AAATGAGCTT AAACCATGCT TAAATTGTTT GAATTGTCCT 840 

TTCGGTGCGC CAGGATATAA TTGTCTTTGT TTCAGACGCT TATTTTTCTC ATCCTTCATA 900 

CCTTTTATCA GACTTCCGAA TGCCTCTTCT TTTTCTTTAA AATTAGQAAA CGTACTCATC 960 

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TCAAGTACCT CATTACCTAA TCTTGCTCTG AAAAATGCAC CAACAGAAAT GTCACCATCC 1080 

TGCATTTGAG TAGGTTTTTT TAATAAATCA AACCCTGCTC TTAATTTACC AAGTGGCGAT 1140 

ATTAATTTTG TAGTAACAAA TGGTTTAATA TCTGTTGGAA TACCCATAAT TGAACCACCT 1200 

GGAATCGGAT ATAATTTATT TTTCGCAAAA ATATATGATT GTCCAGTCGT ATTTGTAACA 1260 

ATATCTTGTT CTAATCCAAT ATCTTTCGCT AATTCTGTCA TAATCGTTTT TC 1312 
(2) INFORMATION FOR SEQ ID NO: 184: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6157 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184:

TTTTACAATA AAAATATGAT ATACTACTTG TCGTATATAA GGAACGGAGG ACAATTTATG 60 

CATACATTTT TAATCGTATT ATTAATCATT GATTGTATTG CATTAATAAC TGTTGTACTA 120 

CTCCAAGAAG GTAAAAGCAG TGGACTTTCA GGTGCCATCA GTGGTGGTGC TGAGCAGTTA 180 

TTCGGTAAAC AAAAACAACG TGGCGTCGAT TTATTCTTAA ATAGATTAAC AATTATTTTA 240 

TCAATATTAT TTTTTGTACT TATGATTTGC ATAAGTTATC TTGGTATGTA AGGTCCGGCG 300 

ATGTAAATGT CGGGCTTTTT TATTTATAAT TAAGAATGTA ATAGTTTAAC AATAAGCTAT 360

GTAAAATATA TAGCCTAGTT AAGTATGCAA AGGGAGCGTT AGATTTATGC AGATAAAATT 420 

ACCAAAACCT TTCTTTTTTG AGGAAGGTAA ACGTGCCGTG TTATTACTAC ATGGTTTTAC 480 

AGGCAATTCG TCTGATGTTC GTCAATTAGG TOGATTTTTA CAAAAGAAAG GTTATACATC 540 

ATAICCACCG CAATATGAAG GCCACGCGGC ACCACCAGAT GAAATACTGA AATCTAGTCC 600 

TTTCGTTTGG TTTAAAGATG CGTTAGATGG TTATGATTAT CTTGTTGAAC AAGGTTATGA 660 

TGAAATTGTT GTTGCTGGTC TATCATTAGG TGGGGATTTT GCTTTAAAAT TAAGCTTAAA 720 

TAGAGATGTA AAGQGTATTG TAACGATGTG TGCTCCTATG GGTGGCAAAA CTGAAGGTGC 780 

CATTTATGAA GGCTTTTTAG AATATGCACG CAATTTTAAA AAGTATGAAG GTAAAGATCA 840 

AGAGACTATT GATAATGAAA TGGATCATTT TAAACCAACT GAAACTTTAA AAGAACTAAG 900 

TGAAGCATTA GATACGATTA AAGAGCAAGT TGATGAAGTG TTGGATCCTA TTTTAGTGAT 960 

TCAAGCAGAA AACGACAATA TGATTGATCC ACAATCCGCA AATTATATAT ATGACCATGT 1020 

AGATTCTGAT GACAAAAATA TCAAGTGGTA CAGTGAATCT GGACATGTTA TTACGATTGA 1080 
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AGAATAAAAA GAGATTTTAA CATTAGAAAG GAGGGGCATA ATGAATTTAA AGCAATCTAT 1200 

AGAAGAGATT ATTAATCAAC CTGAATATGA ACCTATGTCA GTGTCAGATT TTCAAGATGC 1260 

ATTAGGTTTA AGCAGTGCCG ACTCGTTTAG AGATTTAATT AAGGTGCTTG TGGAGTTAGA 1320 

ACAATCAGGA TTAATCGAAC GTACAAAAAC AGACAGATAC CAAAAAAAGC ATAGTTATAG 1380 

AGGTCAATCA AAATTGATAA AAGGAACGTT AAGTCAAAAT AAAAAAGGCT TTGCATTCTT 1440

AAGACCTGAA GATGAGGATA TGGAAGATAT ATTTATTCCC CCGACGAAAA TTAATCGTGC 1500 

CTTGGATCGA GATACTGTTA TTGTAGAAAT CCATCAATCA AAAGGTGAAC ATAAAGGTAA 1560 

AATCGAAGGG GAAGTTAAGT CGATTGAGAA GCATTCTGTA ACTCAAGTTG TTGGTACGTA 1620 

TAGTGAAGCT AGACATTTTG GCTTTGTTAT TCCX3GATGAT AAACGTATTA TGCAAGATAT 1680 

TTTCATTCCT AAAGGTCAAA GTTTAGGCGC AGTCGATGGT CATAAGGTAC TTGTAGAAAT 1740 

TACTAAGTAT GCTGATGGTT CAGATAATCC AGAAGGACAT ATTTCTGCTA TTTTAGGACA 1800

TAAAAATGAT CCTGGCGTAG ATATTTTATC TATTATCTAT CAACATGGCA TAGAAATTGA 1860 

ATTTCCTGAT GAAGTGTTAC AAGAAGCTGA AGCAGTACCT GATCATATTG AAAATACTGA 1920 

AATTAAAGGC CGTCATGATT TACGTGATGA ATTGACAATC ACAATTGATG GTGCTGATGC 1980 

TAAAGACTTA GATGACGCAA TTAGTGTTAA AAAGTTAGCG AACGGTAATA CQCAATTAAC 2040 

TGTAAGTATT GCTGATGTCA GCTATTATGT AACAGAAGGT TCTGCATTGG ATAAAGAGGC 2100 

ATATGATAGA GCGACAAGTG TATATCTTGT TGACCGTGTA ATTCCAATGA TTCCACATCG 2160 

ATTAAGTAAT GGTATTTGTT CATTGAATCC TAATGTTGAT CGTTTAACTC TAAGCTGTCG 2220 

CATGGAAATC GATGCTAGTG GTCGCGTTGT TAAACATGAA ATTTTTGATA GTGTTATACA 2280 

TTCTGATTAT CGAATGACGT ATGATGCGGT AAAT»GATT ATTACTGAAA AGGATCCTAA 2340 

CATTCGCGAA CAATATAATG AAATTACX5CC TATGCTAGAT TTAGCACAAG ATTTATCTAA 2400 

TCGTTTGATT CAAATGAGAA AACGACGTGG TGAAATCGAT TTTGATATTA GTGAAGCAAA 2460 

AGTATTAGTT AACGAAGACG GTATACCAAC AGATGTTCAA TTAAGACAAC QTOGCGAGGG 2520 

TGAACGTCTA ATTGAATCAT TTATGTTAAT TGCAAATGAA ACAGTTGCTG AACATTTTAG 2580 

TAAGTTAGAT GTACCTTTTA TTTACCGAGT GCATGAGCAA CCTAAATCAG ATCGCTTAAG 2640 

ACAATTCTTT GATTTTATTA CAAACTTTGG CATCATGATT AAGGGTACTG GCGAAGATAT 2700 

TCATCCAACA ACACTTCAAA AGGTTCAAGA AGAAGTAGAA GGTCGACCTG AACAAATGGT 2760 

CATTTCAACA ATGATGTTGC GTTCAATGCA ACAAGCGCAT TATGATGATG TGAACTTGGG 2820 

ACATTTTGGC TTATCAGCTG AATATTATAC GCATTTTACA TCACCAATTA GACGTTATCC 2880 

AGAAGTCSAAG CGTTGGGAAG ACAAATTGCC TGAGTTAGCT GAACATACTT CTAAACGTGA 3000

ACGTCGTGCT ATTGAGGCAG AACGTGATAC TGATGaATTG AAAAAAGCAG AATATATGAT 3060

TCAACATATT GGTGATGAAT TTGAAGGTAT TGTCAGCTCA GTAGCTAACT TCGGTATGTT 3120

CATTGAATTG CCAAATACGA TAGAAGGTAT GGTTCATATT GCGAATATGA CTGATGATTA 3180

TTACCGTTTT GAAGAGCGTC AAATGGCATT AATTGGTGAG CGTCAAGCTA AAGTATTTAG 3240

AATTGGTGAC ACAGTTAAGG TTAAAGTGAC GCATGTTGAT GTAGATGAAC GATTAATTQA 3300

TTTTCAAATT GTAGGTATGC CTTTACCGAA AAATGATCGA TCACAGCGCC CAGCGCGAGG 3360

TAAGACAATT CAAGCCAAAA CGCGTGGTAA ATCATTAGAT AAATCAAAAT CTGATGATAA 3420

GGGTCGTAAG AAAAAAGGTA AGCAACGTAA AGGTAAAAAC CAACGTAATA ATGATAAAXC 3480

AGGTAATAGT AAGCATAAGC CATTTTATAA AGATAAAAGT GTGAAAAAQA AAGCACXSTCG 3540

TAAGAAAAAA TAAGCAGCAA TGAGGTGAGT ATGAATGGCT AAGAAGAAAT CACCAGGTAC 3600

ATTAGCGGAA AATCGTAAGG CAAGACATGA TTATAATATT GAAGATACGA TTGAAGCGGG 3660

AATTGTATTG CAAGGCACAG AAATAAAATC AATTCGCCGA GGTAGTGCTA ACCTTAAAGA 3720

TAGTTATGCG CAAGTTAAAA ACGGTGAAAT GTATTTGAAT AATATGCATA TAGCACCATA 3780

CGAAGAAGGG AATCGTTTTA ATCACGATCC TCTTCGTTCT CGAAAATTAT TATTGCACAA 3840

GCGTGAAATC ATTAAATTGG GTGATCAAAC AOGTGAGATT GGTTATTCGA TTGTGCCGTT 3900

AAAGCTTTAT TTGAAGCATG GACATTGTAA AGTATTACTT GGTGTtGCAC GAGGTAAGAA 3960

AAAATATGAT AAACGTCAAG CTTTGAAAGA AAAAGCAGTC AAACGAGATG TTGCGCGCGA 4020

TATGAAAGCC CGTTATTAAG CGATTTAGTT GCTTAATCGG GCTATATTTG ATATAGTTAT 4080

ATGTGCTTTT GTAAATTACA AAAGTATGAT TTGTTTGATT TATTATTTCG GGGACGTTCA 4140

TGGAfTCGAC AGGGGTCCCC CGAGCTCATT AAGCGTGTCG GAGGGTTGTC TTCGTCATCA 4200

ACACACACAG TTTATAATAA CTGGCAAATC AAACAATAAT TTCGCAGTAG CTGCCTAATC 4260

GCACTCTGCA TCGCCTAACA GCATTTCCTA TGTGCTGTTA ACGCGATTCA ACCTTAATAG 4320

GATATGCTAA ACACTGCCGT TTGAAGTCTG TTTAGAAGAA ACTTAATCAA ACTAGCATCA 4380

TGTTGGTTGT TTATCACTTT TCATGATGCG AAACCTATCG ATAAACTACA CACGTAGAAA 4440

GATGTGTATC AGGACCTTTG GACX3CGGGTT CAAATCCCGC CGTCTCCATA TTTGTAGCCT 4500

ACAGCCTTTG TGGTTGTGGG CTTTTTTATT TTGTGTTTTT CAGGGGATAA TGCATTGCAG 4560

AATTTGTTGT GAGTATTGAT ATAGCAGTGT TTGTATAGGT GTTTATTTGA TGGAGGAAAG 4620

AGTAATAAGT GATTATGAAT TAGTTTTTGA GATATAAGGG GACAGTGATG TGTGTCAAAT 4680
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CATTTGTTAT TCTGAGTAGC CAATTTGGCA AAGATGAACA AACGTCTGAA CAAACGTATC 60

AAGTTGCAGT CGCATTAGAG TTAATTCATA TGGCAACACr TGTTCATGAT GACGTTATTG 120

ATAAAAGCGA CAAGCGTCGA GGCAAGTTAA CCATATCAAA GAAATGGGAT CAGACAACTG 180

CTATTTTAAC TGGGAATTTT TTATTGGCAT TAGGACTTGA ACACTTAATG GCCGTTAAAG 240

ATAATCGTGT ACATCAATTG ATATCTGAAT CTATCGTTGA TGTTTGTAGA GGGGAACTTT 300

TCCAATTTCA AGACCAATTT AACAGTCAAC AGACAATTAT TAATTATTTA CGACGTATCA 360

ATCGCAAAAC AGCACTGTTA ATTCAAATAT CAACTGAAGT TGGTGCAATT ACTTCTCAAT 420

CTCATAAAGA GACTGTACXSA AAATTGAAAA TGATTGGTCA TTATATAGGT ATGAGCTTCC 480

AAATCATTGA TGATGTATTA GACTTCACAA 6TACCGAAAA GAAATTAGGT AAGCGGGTCG 540

6AAGTGATTT GCTTAATQGT CATATTACX3T TACCXaVTtTT ATTAGAAATG CGTAAAAATC 600

CAGACTTCAA ATTGAAAATC GAACAGTTAC GTCGTGATAG TGAACGCAAA GAATTTGAAG 660

AATGTATCCA AATCATTAGA AAATCTGACA GCATCGATGA GGCTAAGGCA GTAAGTTCGA 720

AGTATTTAAG TAAAGcyrrG AATTTGATTT CyGaGTTACC aGATGGACaT CCGaOAtCAC 780

TACyTTTAAG TTTGACGAAA AAAATGGGTT CAAnAAACAC GTAGTATTTA TGnAAAAGTA 840

TTGAAAGCGC TTTACCAACC TGTTAATATA TAATAGTAAT ATAC 884



(2) INFORMATION FOR SEQ ID NO: 186: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6876 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186:



AATTTCATCT GCTCGTGCAA AATCmGTT TTTCCTTGCT TCATTAOGCT CTTCGATTAA 60

TTTTTCAACA TCTTCATCCA ATAATTCATC TGCATTTTTA GATTTTAACG GTACACCTAA 120

AACATCGCTG AAAATTTGAT AAACTGCTTT AAATTTATCA ATTACTTCTG TTGATGTTGT 180

GTTCTCTAGT ACATATTTAT TCGCAAGTkT TGCTAAATCA TACCAAGCTG TAATTGCATT 240

AGCTGTATTA AAATCATCAT TCATAACTGT TTCAAAACGA TTTAAAATCG CATCAATTTG 300

ATCAATATAT GTCTGTTGAT TTTCAATATT AGTAGCAATT TGTGCGCGCT CTTCAATTAA 360

TTGATAACTA TTGCGAATAC GCTCTAGTcC aCTACGTGCT GATTCTACCA ATTCTAGATT 420

ATAGTTAATT GGGCTTCTAT AATGTACGCT AATCATAAAG AATCTTAGTA CATCTGGATC 480
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ATTATCAATA TTAATGAAAC CATTATGCAT CCAATAATTA GCAAATGGCG CATGATTATG 600 

TGCTTCTGAT TGTGCTATTT CATTTTCATG ATGTGGAAAT TGTAAATCTG AACCACCCGC 660 

ATGTATATCA ATTGTAGGTC CTAGCTCATG AAATGCCATT ACAGAACATT CTATATGCCA 720 

TCCTGGTCTA CCTTCACCAA ATGGGCTATC CCAACTAATC TCGCCAGGTt CGCTTTTTTC 780 

CACAATGTAA AATCAAGTGC ATCTTCTTTA TGCTCTCCTG CATCTATACG AGCACCCACT 840 

TTTAAGTCAT CTATGGATTG ATGACTTAAT TTACCATAAC CTTCAAATTT ACGTGTTCTA 900 

AAGTAAACAT CGCCACCACT TTCATATGCA TAACCTTGAT CCACCAAATC TTTAATAAAT 960 

TGAATAATGT CATCCATATG GTCCATTACC CrTGGATTTG AAGTCGCTTT TCTAACATTT 1020 

AACGCACCAA CATCTTCATG AAAAGCAGCG ATATATTTTT CTGCAATTTC GGGAACAGAC 1080 

TGATTTAATT CTTGAGAACG TTTAATTAAT TTATCATCTA CGTCTGTAAA ATTTGATACA 1140 

TATTCTACAT TATATCCTTG GTATTCAAAG TAACGTCTCA CTACGTCATA ATTAATTGCw 1200

GGTCTTGCGT TACCAATATG AATGTAOTTA TATACAGTAG GACCACATAC ATACATTTTT 1260 

ACTTTCCCTG GTTCTATAGG CTTGAACACT TCTTTTTGAC GTGTAAGCGT ATTATATAAT 1320 

GTAATCATCT TGAATCTCTC CATTCCTAGT CTTTTCAAGT TGTCGTTCTA AATGCTTAAT 1380 

TTGTTCATAA ATTGGATCAG GTAGATGGCG ATGATCAAAT GTTTTTCCAA CTCGAACACC 1440 

ATCTTGCTTA ACAATATGTC CTGGTATACC AACAACCGTT GAATAACTTG GAACTGATTG 1500 

TAAAACAACT GAATTTGCAC CAATATTTAC ATTTGAATTT ATTTTAATAT TTCCTAAAAC 1560 

TTTCGCACCG GCTGCTATTA AAACATTGTC TCCTATATCT GGGTGTCTTT TCCCTCTTTC 1620 

TTTCCCTGTC CCACCAAGTG TCACGCCTTG ATAGATTGTC ACATTATCAC CAATTGTACA 1680 

TGTTTCTCCT ATTACAACGC CCATACCATG ATCTATAAAT AGACGCTTTC CAATTTTAGC 1740 

ACCTGGATGG ATTTCTATAC CTGTGAAAAA TCTTQAAATT TGAGATATCG CGCGTGCTGC 1800 

AACATATTTT TTTTGGTTGT ATAACTTATG TGCAATCAAA TGACTCCAAA CTGCATGTAA 1860 

ACCTGCATAC GTTGTAATGA CTTCTAATGT TGAACGTGCC GCTGGATCCT GCTCAAATAC 1920 

CATTTTTATA TCGTCTCTCA TTCTTTTTAA CAAGATCATT TCCTCCTCAA TGATTGAACT 1980 

ACXrrAAATAC ATAATTGAAG TACCTGCGAA ATTAAATATC AAAAAAGCAC CACTAACATA 2040 

CAAATTGTAT TGTTAGAGGC GCTTCCGCAC GGTTCCACTC TGAATTTAGC GAATAACATT 2100 

AATAATATTG CGGGCGCTTC CAAATTATCA AGGAAACTAA GTCAACTTAA TGCTCATCAC 2160 

TCTCATTATA TATTTAATTC ATTTTACGAA GGTGCATTCA TTAATTTCTA CGTTGTACTC 2220 

ACAGCAACCG TACACTCTCT GCATCGTATA AATTTAATTA CTAATCCTTC GTTTTATATA 2280 



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ATAAAATTCA AGTATATACT ACCTTGATCT TGTCTATTTC ATTACTTATA TTGTTTTAAA 2400

CGGTTTAGCA CTTTTTCTTT ACCAAGTACT TCAATTGTAT TTGGTAATTC AGGACCATGC 2460

ATTTGGCCTG TTACAGCAAC ACGAATAGGC ATAAATAATT GCTTGCCTXT TATTCCTGTT 2520

TCTTTTTGAA CTTCTTTAAT TGTCTTTTTA ATTTCAGCCG CTTCAAATGG TTCAAGTGCT 2580

TCTAATTTAC TGAATAAGTG CGTCATTAAC TCTGGTACTT GCTCTCCATT AATCACTTGT 2640

TGTTCTTCTT CACCAAGAGC TGGCATTTCT TTAAAGAACA TTTCTGATAA AGGTACAATT 2700

TCACCGGCAT AACTCATTTC TTTTTGATAA AGCGCAATTA ATTTGCX3TCC CCAAGATAAA 2760

TCCTCTTCTG ACGGCACCTC AGGAATCAAA TTTGCTTTAA TTAAATGAGG TAATGCTAAT 2820

TGGAATACTG TTTCAGTATC TTTTTGTTTC ATATATTGGT TATTAACCCA TGCTAATTTT 2880

TGCTTATCGA AAAATGCTGG TGATTTTGAC AAACGCTTTT CATCAAAGAT TTTGATAAAT 2940

TCTTCTTTAG AAAAGATTTC TTCTTCACCT TCAGGA6ACC AACCTAATAA CGCAATAAAA 3000

TTAAATAACG CTTCAGGTAA ATAACCTAAG TCACGATATT GCTCAATAAA TTGTAAAATT 3060

TGCCCATCAC GTTTACTTAA CTTTTTACGT TCTTCATTAA CAATTAATGA CATATGACCA 3120

AAACGAGGTG GCTCCCAGCC AAATGCTTCA TAAATCATAA TTTGTTTAGG CGTGTTTGAA 3180

ATATGATCAT CACCACGAAT TACATCTGAA ATTTGCATGT AATGATCATC TATAGCTACT 3240

GCAAAATTGT ACGTTGGAAT GCCATCTTTT TTTACX»ATAA CCCAGTCACC AATACCATTT 3300

GAATCAAATG AAATATTTCC TTTTACCATA TCATCAAATG AATACGTTTG GTTTTGAGGT 3360

ACTCGGAAAC GAATTGATGG TTGGCGTCCT TCTGCTTCAA ATTGTTGACG TTGTTCTTCA 3420

GTCAAATGCG CATGTTGACC ACCATAGCGA GGCATTTCAC CACGAGCGAT TTGCGCTTCA 3480

CGTTCAGCTT CTAATTCTTC TTCTGTCATA TAGCATTTAT ATGCTTTATC TTCTGCTAGT 3540

AACTCATCTA TTAATGGTTG QTAGATATGT TGACGTTCAG ATTGACGATA TGGTCCGTAG 3600

CCATTGTCTT TATCTACAGA CTCATCCCAA TCTAATCCTA ACCATTTAAG ATTATCAAAT 3660

TGTGATGTTT CTCCATCTTC TAAATTACGT TTTTTATCAG TATCTTCAAT TCGAATCACA 3720

AAATCTCCGT TGTAATGTTT AGCATACAAG TAATTGAATA ATGCTGTTCT TGCATTACCA 3780

ATATGAAGAT ACCCAGTTGG ACTTGGTGCA TATCTTACTC TTATACGATC GCTCATTTTT 3840

TTCACTCCTA AATTAAATAT CAGATTTTCA AGTTAGTTCA TATAAATTGT TCATTTGCTA 3900

TCTTCGACCG TCATAACAAA TGTCTAACTC GTCTTATTGT TAAAACGAAA CAATGCTTTT 3960

TAACATGACC TTAAAATAAT TTCATTGTTT AATCATAACA TAATTCCCTG GGTAATATGC 4020

TTAAATTTTA AATAGAAAGC TGTTGTTTTT TCAACACTTT AAAAAAGCTA TCCCTAAGAA 4080
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TTAAACTTCA AATTAACTAT TCAAATACGT TAAAATTGAT TCTAATTTTG TATGTCTTGA 4200

TTGCTATAAG AATAACTTTA TTAATATCTA AAATTTAACA CTTAATGAAC TTGTTTCAAT 4260

GATATATTAG CACTATTTGT ATTTTTTGAT AACTAATATG TTTTGCATTT ATTTATAGTT 4320

ATACTTCAAA TTACAAACTt CGCCATTTCA TATACCTTTT AATATCTATT TTGTTTTCGT 4380

CAACTACAGT TTTTATAATG ATACTGTATC TTCGATTTTT TTAGCAAAAA CAATTCTTCC 4440

TGAAGATGTT TGCAATAAGC TGACTACTTC TAAATTGACA TGACTGCCAA TAAGATTTTT 4500

AGCATTATCA ACAACTACCA TCGTACCATC ATCTAGATAT CCTACTGCCT GACCAGGCtC 4560

CTTACCCATT TTTGTCAGTA AAATATGCAG TTGATCACCT TGATGTACAT TAGGTTTGAT 4620

TGCTTCTGAT AAATCATTAA CATTTAATGC TTTGATACCA TGTACATGAC AAACTTTATT 4680

TAGGTTGAAA TCTGTCGTTA TAATACTTGC ATGATATTGT TTTGCTAATT TTAATAACAT 4740

CGTATCAATA TCACTATGTG TTTTAGTTGG ATGTATAACC TTTGTAGGAT AGTCTAAATC 4800

ATACAATTCA TTTAAAATAT CTAAGCCTCT TTTACCCTTT TCaCGTTTAA CACTGTCATT 4860

TGAATCTGCA ACAATTTGTA ATTCATTAAT AACACCTTGT GGAATTAAAA TATTGCCATC 4920

GATAAAACCG CAACGAATGA CTTCTAAAAT ACGACCATCA ATAATTQCX3C TTGTGTCGAT 4980

AATTTTTGGC GTAgcaCTTT TaGTATGTTG TGACATGGAA CGCGCTATAT TCTCAGGTAA 5040

AAACATTAAC ATTTCATCTC GTTTTTTAAG GCCAAATTGG AAACCGAAAT AACATAGTAA 5100

TATCGTAATT ATGACAGGAA TGAAATGATT AAAAATAGAG TTGCCAATTG ATTCTAATAT 5160

AAACGACACC ATAACAGAAA TAAGTAATCC GATTATTAAA CCTATTGTTG CGAATAGTAT 5220

TTCAACAGCA CTTCTACGCA TAATAAAATG TTCTAAACCT TTTATAGCGT TAGTAACTCG 5280

TCTAATAAAT ACACCAAAAA TTAAGAACAT AAAAATACTA CCGATAATGC CATCTACATA 5340

GTGATTTTTT AAAAAGCTGG AGTTTTGTAA TCCAAGATCA TTTGCAATTT CAGGAATAAT 5400

AATTATTCCT AATGCGCTCC CAATAATTAA GTAAATAATA ATAACCATTA GTTTAACGAT 5460

ATTCACACAA TGTCCTCCTT TCTTGATGTT TTATGAATGA AGAGCAAATG ACAATACTTC 5520

ATGTACAGTA GTTACACCTA TTACTTGTAT ACCTTCAGGA TATGTCCATC CGCCTATATT 5580

ATTTTTAGGA ATAATTACAC GTTTGAAACC TAGTTTTGCA GCCTCTTGCA CGCGTTGTTC 5640

TATCCGAGAT ACAGGACGTA CCTCACCCGT TAAACCAACT TCTCCAATAT AGCAATCTAA 5700

TCCGTCGACA GCTTTATCTT TAAAGCTAGA TGCAGTTGCT ACAATTACAC TTAAATCAAC 5760

TGCTGGCTCC GTTAACTTTA CACCGCCAGC TACTTTGATA TAAGCATCTT GTTGTTGTAA 5820

TAGATAATTT TCTTTCTTTT CCAAAACAGC CATCAACAAA CTTAATCGAT TATGATCAAT 5880
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ATGTCCGTAA CTGATGTAGG TACTGTATTA CAAATTGGTG ATGGTATTGC ATTAATTCAC 540 

GGATTAAA1G ACGTTATGGC TGGTGAGCTA GTAGAATTCC ATAACGGCGT ACTTGGTTTA 600 

GCCCAAAACC TTGAAGAGTC AAACGTGGGT GTGGTTATTT TAGGACCATA CACAGGTATT 660 

ACTGAAGGTG ACGAAGTTAA ACGTACTGGT CGTATCATGG AAGTACCAGT AGGTGAAGAA 720 

CTAATCGGAA GAGTTGTTAA TCCATTAGGA CAACCTATTG ATGGAcAAGG ACCGATTAAC 780 

ACAACTAAAA CACGTCCaGT AGAGAAAAAA GCTACTGGTQ TAATGGATCg TAAATCAGTA 840 

GATGAGCCAT TACAAACAGG TATCaAAGCA ATTGATGCTT TAGTACCAAT TGGTAGAGGT 900 

CAACGTGAGT TAATCATCGG TGACCGTCAA ACAGGTAAAA CAACAATTGC AATTGACACA 960

ATTTTCAACC AAAAAGATCA AGGTAC3GATT TGTATCTATG TTGCTATTGG TCAAAAAGAT 1020 

TCAACAGTAA GAGCAAATGT TGAAAAGTTA AGACAAGCAG GCGCTTTAGA CTACACTATT 1080 

GTTGTAGCAG CATCAGCTTC TGAACCTTCT CCATTATTAT ATATTGCACC ATATTCAGGT 1140 

GTAACAATCG GTGAAGAATT CATGTTTAAC GGTAAACATG TTTTAATCGT TTA 1193 
(2) INFORMATION FOR SEQ ID NO: 188: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5549 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 188: 

TGCTAAGAAG TCAAAATAAA CTAACTATnA AACATCTAGT ACGATTATTA AAGTGACAGA 60 

TnATAAAATT OAATTATTnA GAGAAGGAGA TATAAAGTTT GAAGAAATAA AAGAAAGACT 120 

AGGTACAGGT ATTATTTATQ AATAAGTTAA TACriGGGAT TTATTTATAC CGAATTTTTT 180 

CACGAGCATA CTTTTATTTA CCGTTTTTAT TAATTTACTT TTTGATTCAA GGTTATTCCA 240 

TAATACAATT AGAAATATTA ATGGCGTCTT ATGGCATTGC AGCATTTTTA TTCTCTCTAT 300 

ACAAAGAGAA GTGTTTTAAA ATTTGTAACT TAAAAGATTC TAATAAATTA GTTGTTAGTG 360 

AAATATTCAA AATCATCGGT TTATTGTTGT TATTATATCA AAATCAATAT TTAATTTTAG 420 

TAGTGGCACA AATATTATTA GGGTTAAGTT ACTCAATGAT GGCGGGTGTT GATACCGCAA 480 

TAATTAAAAG AAATATAACA AATGAGAAAT ACGTACAAAA TAAGTCAAAT AGCTATATOT 540 

TCCTATCATT ATTAATTTCA GGGATTATAG GTAGTTATCT TTATGGAATA AATATTAAAT 600 

GGCCTATAAT AATGACTGGT ATATTTTCAA TTCTAACAAT TATAATTATT CGATGCACAT 660 

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CAACATGAAG CATTTAATTT TACAGTGATG ATTATAAAAT AATTGCCTTG ATACAAAGAT 2580

TACTCGTAAA TGACATCTTT GTATTAAGGC TTTTTCTAAA TTTAAAAGTG ATGGGTTAGA 2640

GGTCATTGAG CTTTAAAATA TTCAAAATAC AAAACATTAA TGGCCAAAAA TAAAAGCCGC 2700

CTTTATCTGG GCAGCTTCAA TAATAAGAAA GACATATTTC ATTTTATACT AAATAGTTAT 2760

TGTGATGAAT CTTTCGGCGG TTTAATTACT GCAGCAAAAA TTGCTGTGAA AATCGTGAAC 2820

AATACTGCCA TGATAATTGG ATTCACTACA TTTAAGCTGT CTCCACCTAC TAGGCTATTA 2880

AGTACAAAGT TAACCATTTO CATTAATAAT AATGCCCAAA AGAATGTTAC GAGGTGTTTC 2940

ATGTCATTCT ACCTCCACTT TAATTATATA TATTTTATTT TAAGTGAAAG TTAGAAATTT 3000

GTATA6TAAC ATCTCATATA TTTTGACCAT ATTATACAGT TTAAATAAAT GATTTTATCT 3060

GAATGGCTAT TCTAAATTAA GCGCATTAAA ACCAATTTCA TACTGAAATT TGACGATAAT 3120

AAAGCATTAA AATTTTATTA ACTAGTCAAT ATTCCTACCT CTGACTTGAG TTTAAAAAGT 3180

AATCTATGTT AAATTAATAC CTGGXATTAA AAATTTTATT AAGAAGGTGT TCAACTATGA 3240

ACGTGGGTAT TAAAGGTTTT GGTGCATATG CGCCAGAAAA GATTATTGAC AATGCCTATT 3300

TTGAGCAATT TTTAGATACA TGTGATGAAT GGATTTCTAA GATGACTGGA ATTAAAGAAA 3360

GACATTGGGC AGATGATGAT CAAGATACTT CAGATTTAGC ATATGAAGCA AGTTTAAAAG 3420

CAATCGCTGA CGCTGGTATT CAGCCCGAAG ATATAGATAT GATAATTGTT GCCACAGCAa 3480

CTGGaGATAT GCCATTTCCA ACTGTCGCAA ATATGTTGCA AGAACGTTTA GGGACGGGCA 3540

AAGTTGCCTC TATGGATCAA CTTGCAGCAT GTTCTGGATT TATGTATTCA ATGATTACAG 3600

CTAAACAATA TGTTCAATCT GGAGATTATC ATT^CATTTT AGTTGTCGGT GCAGATAAAT 3660

TATCTAAAAT AACAGATTTA ACTGACCGTT CTACTGCAGT TCTATTTGGA GATGGTGCAG 3720

GTGCGGTTAT CATCGGTGAA GTTTCAGATG GCAGAGGTAT TATAAGTTAT GAAATGGGTT 3780

CTGATGGCAC AGGTGGTAAA CATTTATATT TAGATAAAGA TACTGGTAAA CTGAAAATGA 3840

ATGGTCGAGA AGTATTTAAA TTTGCTGTTA GAATTATGGG TGATGCATCA ACACGTGTAG 3900

TTGAAAAAGC GAATTTAACA TCAGATGATA TAGATTTATT TATTCCTCAT CAAGCTAATA 3960

TTAGAATTAT GGAATCAGCT AGAGAACGCT TAGGTATTTC AAAAGACAAA ATGAGTGTTT 4020

CTGTAAATAA ATATGGAAAT ACTTCAGCTG CGTCAATACC TTTAAGTATC GATCAAGAAT 4080

TAAAAAATGG TAAAATCAAA GATGATGATA CAATTGTTCT TGTCGGATTC GGTGGCGGCC 4140

TAACTTGGGG CGCAATGACA ATAAAATGGG GAAAATAGGA GGATAACX3AA TGA6TCAAAA 4200

TAAAAGAGTA GTTATTACAG GTATGGGAGC CCTTTCTCCA ATCGGTAATG ATGTCAAAAC 4260
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20 



TGAACCTTAT AGCGTTCACT TAGCAGGAGA ACTTAAAAAC TTTAATATTG AAGATCATAT 4380

CGACAAAAAA GAAGCGCGTC GTATGGATAG ATTTACTCAA TATGCAATTG TAGCAGCTAG 4440

AGAGGCTGTT AAAGATGCGC AATTAGATAT CAATGAAAAT ACTGCAGATC GAATCGGTGT 4500

ATGGATTGGT TCTGGTATCG GTGGTATGGA AACATTTGAA ATTGCACATA AACAATTAAT 4560

GGATAAAGGC CCAAGACGTG TGAGTCCATT TTTCGTACCA ATGTTAATTC CTGATATGGC 4620

AACTGGGCAA GTATCAATTG ACTTAGGTGC AAAAGGACCA AATGGTGCAA CAGTTACAGC 4680

ATGTGCAACA GGTACAAATT CAATCGGAGA AGCATTTAAA ATTGTGCAAC GCGGTGATGC 4740

AGATGCAAT6 ATTACTGGTG GTACAGAAGC ACCAATTACT CATATGGCAA TTGCTGGTTT 4800

CAGTGCAAGT CGAGCGCTTT CTACAAATGA TGACATTGAA ACAGCATGTC GTCCATTCCA 4860

AGAAGGTAGA GATGGTTTTG TTATGGGTGA AGGTGCTGGT ATTTTAGTAA TTGAATCTTT 4920

AGAATCAGCA CAAGCTCGAG GTGCCAATAT TTATGCTGAG ATAGTTGGCT ATGGTACTAC 4980

AGGTGAT6CT TATCATATTA CAGCGCCAGC TCCAGAAGGT GAAGGTGGTT CTAGAGCAAT 5040

GCAAGCAGCT ATGGATGATG CTGGTATTGA ACCTAAAGAT GTACAATACT TAAATGCCCA 5100

TGGTACAAGT ACTCCTGTTG GTGACTTAAA TGAAGTTAAA GCTATTAAAA ATACATTTGG 5160

TGAAGCAGCT AAACACTTAA AAGTTAGCTC AACAAAATCA ATGACTGGTC ACTTACTTGG 5220

TGCAACAGGT GGAATTGAAG CAATCTTCTC AGCGCTTTCA ATTAAAGACT CTAAAGTCGC 5280

ACCGACAATT CATGCGGTAA CACCAGATCC AGAATGTGAT TTGGATATTG TTCCAAATGA 5340

AGCGCAAGAC CTTGATATTA CTTATGCAAT GAGTAATAGC TTAGGATTCG GTGGACATAA 5400

CGCAGTATTA GTATTCAAGA AATTTGAAGC ATAACTATAA nAATCTTCAG TAACGTTGTT 5460

TTAGTTACTG AAGATTTTTT CaOTTTCTTT ATACTAAGAT GAGCGACAcA CAATCGTCAT 5520

AATAAAATAT GAATATTTAT TAATAATAA 5549


(2) INFORMATION FOR SEQ ID NO: 189: 







(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4832 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 189: 

AGATTATAGT AAGATTGATA GTTTGGCGAC TGaAGCgCGa GaAAAATTAT CAGaAGTAAA 60 
mCCTTTAAAT ATTGCACAAG CTTCTAGAAT ATCAGGGGTA AATCCA6CAG ACATATCTAT 120 

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TGGTTAGCAG AACAATTAAA AGAACATAAT ATTCAATTAA CTGAGACTCA AAAACAACAG 240 

TTTCAAACAT ATTATCGTTT ACTTGTTGAA TGGAATGAAA AGATGAATTT GACAAGTATT 300 

ACAGATGAAC ACGATGTATA TTTGAAACAT TTTTATGATT CCATTGCACC TAGTTTTTAT 360 

TTTGATTTTA ATCAGCCTAT AAGTATATGT GATGTAGGCG CTGGAGCTGG TTTTCCAAGT 420 

ATTCCGTTAA AAATAATGTT TCCGCAGTTA AAAGTGACGA TTGTTGATTC ATTAAATAAG 480 

CGTATTCAAT TTTTAAACCA TTTAGCGTCA GAATTACAAT TACAGGATGT CAGCTTTATA 540 

CACGATAGAG CAGAAACATT TGGTAAGGGT GTCTACAGGG AGTCTTATGA TGTTGTTACT 600 

GCAAGAGCAg TAGCTAGATT ATCCGTGTTA AGTGAATTGT GTTTACCGCT AGTTAAAAAA 660

GGTGGACAGT TTGTTGCATT AAAATCTTCA AAAGGT6AAG AAGAATTAGA AGAAGCAAAA 720 

TTTGCAATTA GTGTGTTAGG TGGTAATGTT ACAGAAACAC ATACCTTTGA ATTGCCAGAA 780 

GATGCTGGAG AGCGCCAGAT GTTCATTATT GATAAAAAAA GACAGACGCC GAAAAAGTAT 840 

CCAAGAAAAC CAGGGACGCC TAATAAGACT CCTTTACTTO AAAAATAATG CATAATCCTT 900 

TACAACTAAC ATAAAAGGAG CGAATGGATA ATGAAAAAAC CTTTTTCAAA ATTATTTGGT 960 

TTCAAAAACA AAGATGACAT CATTGQACAT ATTGAAGAAG ATCGCAATAG TAATGTTGAA 1020 

TCCATTCAAA TTGAACGTAT CGTTCCCAAC CGTTATCAAC CAAGACAGGT GTTTGAACCA 1080 

AATAAAATTA AAGAACTTGC TGAATCAATA CATGAACATG GTTTACTACA ACCTATTGTT 1140 

GTAAGACCGA TTGAAGAAGA TATGTTTGAA ATTATTGCTG GAGAGCGCCG ATTTAGAGCA 1200 

ATACAATCAC TAAATTTACC TCAAGCAGAC GTTATTATTC GTGATATGGA TGATGAAGAG 1260 

ACGGCTGTTG TTGCATTAAT TGAGAATATT CAAAGAGAAA ATTTGTCTGT TGTTGAAGAA 1320

GCGGAAGCCT ATAAGAAATT ATTGGAAATT GGTGATACAA CGCAAAGTGA ATTGGCAAAA 1380 

AGTTfAGGTA AAAGTCAAAG CTTTATTGCA AATAAGTTGC GTTTATTGAA GTTGGCGCCG 1440 

AAAGTACTAC TTCGCTTAAG AGAAGGTAAA ATTACTGAAC GTCATGCGAG AgcGGtATTA 1500

TCATTGTCTG ATAGCGAACA AGAAGCGTTG ATTGAGCAAG TCATTGCACA AAAGCTAAAT 1560 

GTGAAcAGAc TGAAGATAGA GTACGCCAAA AAACGGGGCC CGAAAAAGTC AAAGCACAAA 1620 

ACCTTCGCTT TGCACAAGAT GTCACTCAAG CACGAQATGA GGTAGGCAAA AGTATCCAAG 1680 

CGATTCAACA AACAGGATTA CATGTTGAGC ATAAAGACAA AGATCATGAA GATTATTATG 1740 

AAATAAAAAT TCGAATATAT AAACGTTaGT AGTAGQATGT CGTATACATG ATGACTAACA 1800 

CATAAAAGAC AAAGCTAAGA TCATAACAGC TTTGTCTTTT TTTTTTGTTT TACGTGAAAC 1860 

ATAAAAATTT ATATTTATAT GTTGATCAGG CTGGTACATA AATCAATGTT CTATGCTCTA 1920 

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TTCTAGTCAA CCTTGCTGGG GTGGGACGAC GAAATAAATT TTGCGAAAAT ATCATTTCTG 2040 

TCCCACTCCC TAATTTGAGC TGGATATACT TTCATTTGAA CCCTTTATTG CTAGTTTATG 2100 

AAAGTATCAT GAAAGCTTTA TGAACATCX3C TTGAGTTGCC TTTACAGTAG AAAATTTAAG 2160 

TTTTACACTT TGTGTGAATG ATACGTTTTG TATTGAATTA ATTATAGAAA GGTACGTTGA 2220 

AGATGTTTTC AATTGGAAGT GCAATTCTTC ATrTTGTCAT TGGTGGTATC GCTGTTGCAT 2280 

TAGCTTCAAT TATTGCTGAT AAGGTAGGTG GTAAGTTAGG AGGTATTATA GCTACTATGC 2340 

CGGCAGTCTT TCTTGCGGCT ATTATCGCAT TAGCTTTAGA TCATCGTGGT ACGCAATTAG 2400

TGGAGATGTC QATGAATCTT AGTACTGGAG CAATTGTCGG TATTCTGTCT TGTATATTAA 2460 

CTGTATTTTT GACATCTCTC TACATTAAGC ATAAAGGTTA TCGGAAAGGC GCAATATTCA 2520 

CAGTTGTTTG TTGCTTTGTC ATTTCCCTCG CAATATTCAG TATTAGACAT TTATAGTTTG 2580 

OAAAATGCGT GATAATTAGT TGTATTCAGT TATTAAGTAA TAAATTATTG GAGGCAGAAC 2640

ATCATGAAAT TAACATTAAT GAAATTTTTT GTGGGGGGAT TTGCAGTATT ATTAAGTTAT 2700 

ATTGTATCTQ TAACACTACC TTGGAAAGAA TTTGGCGGTA TATTTGCaAC GTTTCCGGCA 2760 

GTATTTTTAG TGTCTATGTT TATTACAGGT ATGCAATATG GTGATAAAGT CGCTGTGCAT 2820

GTAAGTCGTG GOGCAGTGTT TGGTATGACA GGGGTATTAG TTTGTATTrT AGTTACATGG 2880 

ATGATGTTAC ATATGACGCA CATGTGGTTG ATTAGCATTG TTGTTGGTTT CCTAAGCTGG 2940 

TTCATCAGTG CAGTATGTAT TTTTGAAGCG GTAGAATTTA TAGCACAAAA AAGATTAGAA 3000 

AAGCATAGTT GGAAAGCTGG AAAATCGAAT AGTAAATAGT GTGAACGTAA TCTCTTAACT 3060 

AGGACTAACT TTGCAAGCAT TGAATAGCAT GGAAAAGTTG CATCATTAAT AAGTGAAATT 3120 

CAAGTTGGCA TTGAGAAAAT TACAAGCGCG TAATCATACa GGTCTGTCTT AAGGGAGTCT 3180 

TCG;ACCCCG ATGTTGTCGT ATGTCAAAAC ATTTAGTCAA TCATAAAGGT GACTTGATTT 3240 

AACTTTATCT GATAGTCTGA TTGTAATGAT TGTACTAATT GACTGGAGGC GTATGTAATT 3300 

GAATCTGAGT AAACAAATTA AAAAGTATAG GGAACGAGAT GGTTATTCAC AAGAATATCT 3360 

TGCTGAAAAG TTATATGTAT CTAGGCAGAG TATTTCTAAT TGGGAAAATG ACAAAAOCTT 3420 

ACCAGACATA CATAACTTAT TAATGAyGTG TGAATTGTTC AATGTAACTT TAGATGATTT 3480

AGTAAAAGGG ACCATTCCAT TTGTACCTGA TATTAAAGCG CAACGAAGTC TTAACTTATO 3540 

GACATATGTG ATGCTTATTT TCATGACATT AGCIGCAATT TTAATGGGAC CTTTAGTTGT 3600 

TTATTGGAAT TGGACTTGGG GTGTAAOGGT GGCAATCATT TTGGGAATAG GTTTTTATGC 3660 

ATCTATGAAA ATAGAAGATT TAAAAAAAGT GCATAAAATG GACAACTACG ATCQAATTGT 3720 

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GACAAATGCG CTTTCTATTA TATCAGTAAT TGGTATACTC AGCCTCATAA TTTTCCTTAG 3840 

TGTGTArrTG GCAAATAAGT TTTTATAAAT CATCGTGGTA TCGTCTCATA TTATTTATAT 3900

TATCCAAAAT AGCATAAAAA AATACCAACA AGATTTAGAA CCTTTGTTGGT AATCAAAGCG 3960

aTTCATTTAT AATGAGTCGT TTTATGTTGT AAGATTAAAC AGTTTGTACG TTAACTGCTT 4020 

GGTCTCCACG TTGACCTTCA GTOATTTCGA AAGTAACTTT TTGACCTTCT TCTAAAGTTT 4080

TGTAGCCATC GCTAGCGATA CCTGAGAAAT GTACGAATAC GTCTCCGCCA TTTTCTTGTT 4140 

CGATGAAACC AAAACCTTTT TCTGCrTTAA ACCATTTwAC TGTACCGTTA TTCATATwGA 4200 

AwACCTCCGT gTGCTTTTGC ACTTAATATT TGTAACAAAT TCATAACTAA AAAAQAGGAT 4260 

ATTCTAAACA AATACACTAC AATTTAATTC ACX3AGCTTTT ATTACXTTAAG ACCAACTATA 4320 

CGCTCATATT GGCATAATGT ACAGTGTTTT TTGAAAATAA ATTAAAAAAG ATTTTTAAAA 4380 

ACCTTAGAAA CGTTGATTTA AAGGGGTTTA TAAAAATwAw AAAATTGTAG TCTTTTATGG 4440

TGTTTGCTAG TTTTCAAAGT GACATATOGT TTAAACATGA TGATTTTATA AGCAATCCAT 4500 

AAAAAACAAG CAGCGATAAA CX3CTACTTGT TGATATTAAA ATCTGACTTG AAAGGTCATA 4560 

GCAATCTTCT ATACCGATGG AATGTGCTTA CTTGCCTTTT TCTTCACGAC GTTTTAAATA 4620 

ATAAGAGCCA CCTAATAAAC CAGCTGGAAT GCCTATCATT GGTGTTGTGA ATGAGCTTAA 4680 

TACAATAACA AGTATTGTTA AAGCAATGAC GTTATACCAA GTTACAGTCA AATTTTTCAA 4740 

ATCCTCATAT GATTGTTTTA CTAATTCTCT AAATTTCATG ATTCAATCTC TCCTTTTTTA 4800 

TAAATCTTTA GATTGTCAAA TTAAGCTGGA CA 4832 

(2) INFORMATION FOR SEQ ID NO: 190: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5727 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS : double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 190:

CAAAGCTGTT CAAAAGGCTT ATAATTTAAA TTTAGATAAC ATACGTACAA TGGAACCTAA 60 

GTTCAGATAT CAAGCGATCA ATAAAGGTAA TATTAATTTA ATAGATGCAT ATTCAACTGA 120 

CGCTGAATTA AAACAATATG ATATGGTTGT GTTAAAAGAT GATAAGCACG TATTTCCACC 180 

ATATCAAGGA GCACCATTAT TTAAAGAAAG CTTTTTAAAG AAACATCCAG AAATTAAGAA 240 

ACCGTTAAAC AAACTAGAAA ACAAAATATC TGATGAAGAT ATGCAAATGA TGAACTATAA 300 
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GTTAATCAAA TAACGACCAA CGCCACATAA GATGCGTAAC ACCAAATTAT ATCTTATGTG 420

GCGTTGTTAT ATTTAAATCT ATAATTATGT TCAATTTAAA CATGCAATAA TGATTAAAAA 480

ATATGACATG TTAAACACAA TGTAAGCTAT TATGATGTGA AAATAGTAGC ATTGCATTTT 540

AGAAACATAG AGCGATATAA TGAATATAAG TTTTTTGAAA TTTCAGTTAA TTCTAAGGAG 600

GTTGTTTTTA TTATGAAAGA ACAACTTAAT CAACTATCAG CATATCAGCC TGGTTTATCT 660

CCAAGGgCAT TGAAAGAAAA GTATGGCATT GAAGGAGATT TATATAAACT TGCATCAAAT 720

GAAAATTTGT ATGGACCATC GCCTAAAGTT AAAGAAGCGA TATCAGCACA CTTAGATGAG 780

TTATATTATT ATCCTGAAAC AGGATCACCG ACATTAAAAG CGGCGATTAG TAAACATTTA 840

AATGTAGATC AATCACGCAT TTTATTTGGT GCX3GGATTAG ATGAAGTTAT ATTAATGATT 900

TCTAGAGCTG TATTAACGCC AGGGGATACT ATTGTTACAA GTGAAGCGAC ATTCGGTCAA 960

TATTATCACA ATGCGATTGT TGAATCAGCT AATGTGATAC AAGTACCTTT AAAAGATGGT 1020

GGCTTCGATT TAGAAGGTAT TTTAAAAGAA GTTAATGAAG ATACGTCATT GGTATGGTTA 1080

TGTAATCCAA ATAATCCTAC AGGTACATAT TTTAATCATG AGAGCTTAGA TTCGTTTTTA 1140

TCTCAAGTAC CTCCACATGT ACCAGTAATT ATAGATGAAG CTTATTTTGA ATTTGTGACA 1200

GCAGAOGACT ACCCGGATAC ACTTGCTTTG CAACAAAAAT ATGACAATGC TTTCTTATTA 1260

CGTACATTTT CAAAGGCGTA TGGATTAGCG GGTTTACGTG TAGGATATGT GGTAGCAAGT 1320

GAACATGCGA TTGAAAAATG GAACATCATT AGACCACCAT TTAATGTGAC ACGTATATCT 1380

GAATACGCAG CAGTTGCAGC ACTTGAAGAT CAACAATATT TAAAAGAGGT AACACATAAA 1440

AATAGTGTTG AACGCGAAAG ATTTTATCAA TTACCTCAAA GTGAGTATTT CTTGCCAAGT 1500

CAAACGAATT TTATATTTGT AAAAACmAAG CGGGTAAATG AACTTTATGA AGCACTTTTA 1560

AATGTAGGGT GTATTACGCG ACCATTTCCA ACTGGTGTTA GAATTACAAT TGGTTTTAAA 1620

GAACAAAATG ATAAAATGTT AGAAGTTTTA TCAAACTTTA AATACGAATA GTAAGTGGGG 1680

AGTGGGACAG AAATGATATT TTCGCAAAAT TTATTTCGtC GTCCCACCCC AACTTGcATT 1740

GTCTGTAGAA ATTGGGAATC CAATTTCtCT TTGTTGGGGC CCCGCCGGCA AGGTTGACTA 1800

GAATTGAAAA AAGCTTGTTA CAAGCGCATT TTCGTTCAGT CAACTACTGC CAATATAACT 1860

TTGTAGAGCA TTGAACATTG ATTTATGTCT CAAGCTCAAT GCAGTGTGAA TGATGAGGTG 1920

AGAGTATTCA GTGTAAAAAG CAACAATAGA TGATATTGTT TTGTATCAAT TGCTTTTTTG 1980

CTATACTGAA TCAATACTGA TATTTTCAGG AGAAGATTAA AATGACCCGT AAATCAATOG 2040

CGATTGATAT GGATGAAGTA TTGGCAGATA CATTAGGAGA AATCATTGAT GCTGTCAATT 2100
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TTCCTGAACA TGATGGATTA ATTACAGAAG TATTGAGAGA ACCAGGCTTC TTCAGACATC 2220

TTAAAGTGAT GCCGTATGCA CAAGAAGTTG TGAAAAAATT AACTGAACAT TATGATGTAT 2280

ATATTGCTAC AGCAGCAATG GATGTACCAA CATCATTTAG TGATAAATAT GAATGGTTAC 2340

TAGAGTTCTT TCCATTTTTA GATCCTCAGC ATTTTGmT TTGTGGTAGA AAAAACATCG 2400

TTAAAGCTGA TTATTTAATA GATGACAATC CTAGACAGCT TGAAATTTTT ACTGGTACAC 2460

CGATTATGTT TACAGCAGTG CATAATATTA ATGATGATCG ATTTGAACGC GTAAATAGCT 2520

GGAAAGATGT AGAACAGTAT TTTTTAGATA ATATTGAGAA ATAAAATATA TCACTTGAAA 2580

AATTTCATGT AGAAAAGATG ATGGATAGGC TATAAAGTAA TTGTGACTGA GATGAACTTT 2640

TATGTCTTAG ACACTACAAC ACTATATTGG CAGTA6TTGA CTGCGGGGCC CCAACATAGA 2700

GAAATTGGAT TCCCAATTTC TACAGACAAT GCAAGTTGGG GTGGsCCCCA ACATAAAGAA 2760

ATACTTTTTC TTTAGAAATT AGTATTTCTT ATGCATGAGT GTAACTCATG CATTCATATT 2820

TTTAAGTACA CATTAGCTGT GACTAATGAT AAAGAATGGC TACATAATCA ATCATTAGTC 2880

GTTCTTTATC ATTTCCGTCC CGCTCTCAAT AAATGTTAGT CTATCTTATT ATTATAAATC 2940

GGATGAATGT GTTAATCTAT GGCAGATTAC ACGTCATCCG ATTTTTTATA GAATTTGAAA 3000

AAQACGCATA AACCACTATG ATTTAAAATA CAACATCAAT CATTTTAGTG gCATGCGCCA 3060

AAATTATATG TCTGTTTTTG AAACAGGGTA ATAGCTTAAA GCTAATAAAA ACGAATATAA 3120

GGTGCQTTOA ATCTTATGAT TACACTCCAA ACCTAATATA ATATCGGGTT AAGATCATTC 3180

CGGATGCTTA CAAATCATTG ACAGTAAGTA ACTGAATGGC ATTTGGTATA ACCTCAATAT 3240

CAATAGGTGT TTCTAATGAA ATTTCGCCAT CAATATCAAC TTTCATTGCT GGATCTGTTG 3300

TAAGTGAAAT CTTTTTACCA GGTATATGCT CAATACCTTG AGTAATTTCA TTCCaATTCA 3360

TGCTATCACG CTTTTTAAAA ATATCATTTA AAATACTGAA ACTTTGTTCA TTAAAAATGA 3420

AAGTGTTCAG TTCACCATCT TGAGGAGACA AATCAGTCaA TGGTATACGA CTACCACCAA 3480

TGAATGGACC ATTTGCTGTT AGTATCATGG TCGTTTCGCC AGAATATGTC TTATCATCTA 3540

TTGATAATTG ATAATTAAAT TGTGTTGGAT TTAGCAGTGT TTTGACAGTT GATCCAATAT 3600

AACTCAATTT ACCAAATATA TCTTTTGAAC CATCTTGTAC GTTTTCAGCG TTTTGAACAA 3660

TGAGACCTAA GCCAACAAAG TTQAGTGCAT ATTGATTATT TATTTTAATT ACATCGTATG 3720

TACCAACTTG TGCAGAAATC ATTTGTTCAC TAGCTTGTTT ATGATTAGGT GCTATATTTA 3780

GCGTTTTTGT AAAATCATTA AAAGTACCGC CTGGTAAAAT GCCAATAGGG AGTTGAAGGT 3840

CATGTGTCAT AACACCGTTT ATAAGTTCGT TAACCGTGCC ATCACCGCCA AGAATAAATA 3900
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CACCTTCGTT TTCACTCAAT TGAATAGAAA GATGCTTACA AATTGAACTT AATGCTGTTG 4020 

TAACTTCCCC AATACCTTGA TTAATATTTT TTAATCCACT GTGTTCATGG TAAAAGAGGA 4080 

CACCATGTGT ATATTTATTT TCCATAGTTT AGCCTACTTT CTAAAAATTG GTTCATTAAA 4140

TATATATACC CACTTTTAAT TGTTAATACC AAAAATATGT TTTTAAATAG AGAAAATGGT 4200 

AATAAATGAA ATTGATTTCT ATAGAGTGGG ACGAGAAAAT ATAGTTATAG CTGTCTATAA 4260 

TGAGCATATT AAGTTTTTAT TTATACTGAT ATCTTGAATT TAATTAATAG AAACCTATAA 4320 

AAAAACAGTA AGCCATTTAA ATGACTTACT GTTTTTTGAA TTAGGCCAAC AATATTAACG 4380

TATACCTTTC ATCGCTTTGA TGATTAAAGG TGAGAATGCT AATACAATTG TTGTAACAAT 4440 

AATTGCAACA ACACCTAGGA AAATAAAGTA ATTTGTTTGA CCTAGTGGTT CTATTAACTT 4500 

AACTAAAGTA CCATTGATTG CTTGTGCAGA AGCGTTAGTT AAGTACCAAA TACTCATCAT 4560 

TTGGGCATTA AATGCTTTAG GTGCTAACTT AACAQCAGCA CTATTACCCG TTGGTGATAA 4620

GGATAGCTCA CCGATAACAC AAATAAT6TA CGATAAAATA ACCCAGTTAA CTGAAAAGTT 4680 

TGATGAACCT GATGCATAAC CTACAATACC AATTAGTATG TATGACGCAC CTGCTAAGAA 4740 

CGTACCAATT GCAAATTTTA CTGGCAGGCT AGGTTGTTTA GTTCCAAGCT TTTGCCATAA 4800

AAGTQAAATA ATTGGAGCTA GTAATAAAAT AAATAATGGG TTAATTGATT GGAAGATCGC 4860 

TTCACCAAAG TTTGTTTTCC AACCAAATAA GTTTAATTTC ATATCTGAAT GTTCAATTCC 4920 

ATATATGTTT AATACATTAG ACCCTTGTTC TTGAATAGCC CAGAACACCA TTCCAAGAAT 4980 

AAATAATGGA ATAAATGCTT TAACACGAGA ACGTTCAGTA TCAGTGACAT CTTTACTTCT 5040 

AATAATTAAA GTGAAGTAAA TGAnTGGTAA TGCAATACCT AATACTAAAA CAGTATTACT 5100 

AACTAAGTTA AATGATAATG AGTTAGTTAA TGCACCAATA ACGATAATTA ATACAATTGC 5160 

TAAiJACAACA CTTCCGATAA TAAGACCATA CTTTTTCTTT TCAGCTGGTG TCAATGGGTT 5220 

AGTAGGTTTC ATACCAACGC TACCTAAGTT TTTGCGGTTG AAAAGTACAT ACCATACTAA 5280 

ACCTAATGCC ATACCAACTG CTGCAATCAA GAATCCGCCG TGGAAGTTTT TAACATTAAC 5340 

AAAGTGTTGC AAAATAATAG GTGATAATAA TGCACCCATA TTAACTGACA TATAGAAAAT 5400 

AACAAAACCT GCATCCATAC GTCTATCATT TTCAGGATAT AAACGGCCAA CGATATTTGA 5460

AATGTTTGGC TTCATTAAAC CTGAACCAAT AATGATGAAG AACATTGATG TGAATAAGCC 5520 

GATTAATGCA AATGGTAAGC TTAAACAAAT ATGTCCGATA ATAATAAAGA CTGCACCTAA 5580

TAAAGTAGCG CCTCTAGTGC CTGTAATTCT GTCAGCAATC CATCCGCCTG GTATTGATGT 5640

CATATAGATT AATGAACCAT ArACTGACAT AATTGACATA GCrGTTGTTT TATCAATTCC 5700 

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(2) INFORMATION FOR SEQ ID NO: 191: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14078 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191: 

TGGACTATTA ACGGCGaAGA AGATTTAACG AAATACTTAC AAACCAATGT TGATGGTATT 60 

ATCACAGATG ACCCAGCATT AGCTGATCAG ATTAAAGAAG AAAAGAAAGA CGAAACATAC 120 

TTCGATCGTT CTATAAGAAT TTTGTTTGAA TAATATAAAC AAAGACCTCT AAAGTTATCA 180 

AGATGATACC TTCAGAGGTC TTTTTAATGT TGCCATCTAT GGGATAGGCA ATCGTTTCAT 240 

TCGTTTATAT TCATATGACA AGTATTTGTA TGGCAATTTG GCGTCACAAA CACTTACATG 300

ATTTATTGGT GAATTATTAA TTGTTTTGTG AATGCAAAGG GTTAGAAATT GAATTGTAAA 360 

TACTTTCTAA TCTTTGTTTC GCTTTAGTCA TTTGATCCAA ATTTTTAGTG OGTATAGCGG 420 

ATTTTGCAAT ATAGTGCGCA CTAAAATATC GCGTTTTTGA AACGCATCTA AATTTAGGTA 480

CGATAATTTA TTTAAGTCAG TGTTTGCTAT TAATTCATGT AATTGATCTA CAAOCGCTTG 540 

ATGTTGATAC GTATGTGATG TAGTTTCAGA TTTGCTTGCT AATTTAATAC CAGTCGTATC 600 

AAGGAGCGCC GCTTTAATAC CAGCAACTAA ATATGTTTTG ATTTTCATTT GTGTTGTCAT 660 

GCTTTGTTAC TCCTTTGATG TACATTAATC AAAAAAATTA TACACTATTG TATATTGCAA 720 

AGCTAATTAA CTATAACAAA AAGATAGTTA ATGCTTTGTT TATTCTAGTT AATATATAGT 780 

TAATGTCTTT TAATATTTTG TTTCTTTAAT GTAGATTGGG CAATTACATT TTGGAGGAAT 840 

TAAAAAATTA TGAAAAAGCA AATAATTTCG CTAGGCGCAT TAGCAGTTGC ATCTAGCTTA 900 

TTTACATGGG ATAACAAAGC AGATGCGATA GTAACAAAGG ATTATAGTGG GAAATCACAA 960 

GTTAATGCTG GGAGTAAAAA TGGGACATTA ATAGATAGCA GATATTTAAA TTCAGCTCTA 1020 

TATTATTTGG AAGACTATAT AATTTATGCT ATAGGATTAA CTAATAAATA TGAATATGGA 1080 

GATAATATTT ATAAAGAAGC TAAAGATAGG TTGTTGGAAA AGGTATTAAG GGAAGATCAA 1140

TATCTTTTGG AGAGAAAGAA ATCTCAATAT GAAGATTATA AACAATGGTA TGCAAATTAT 1200 

AAAAAAGAAA ATCCTCGTAC AGATTTAAAA ATGGCTAATT TTCATAAATA TAATTTAGAA 1260 

GAACTTTCGA TGAAAGAATA CAATGAACTA CAGGATGCAT TAAAGAGAGC ACTGGATGAT 1320

TTTCACAGAG AAGTTAAAGA TATTAAGGAT AAGAATTCAG ACTTGAAAAC TTTTAATGCA 1380 

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GTTGTATCAT ATTATGGTGA TAAGGATTAT GGGGAGCACG CGAAAGAGTT ACGAGCAAAA 1500 

CTGGACTTAA TCCTTGGAGA TACAGACAAT CCACATAAAA TTACAAATGA ACGTATTAAA 1560 

AAAGAAATGA TTGATGACTT AAATTCAATT ATTGATGATT TCTTTATGGA AACTAAACAA 1620

AATAGACCGA AATCTATAAC GAAATATAAT CCTACAACAC ATAACTATAA AACAAATAGT 1680 

GATAATAAAC CTAATTTTGA TAAATTAGTT GAAGAAACGA AAAAAGCAGT TAAAGAAGCA 1740 

GATGATTCTT GGAAAAAGAA AACTGTCAAA AAATACGGAG AAACTGAAAC AAAATCGCCA 1800 

GTAGTAAAAG AAGAGAAGAA AGTTGAAGAA CCTCAAGCAC CTAAAGTTGA TAACCAACAA 1860 

GAGGTTAAAA CTACGGCTGG TAAAGCTGAA GAAACAACAC AACCAGTTGC ACAACCATTA 1920 

GTTAAAATTC CACAGGGCAC AATTACAGGT GAAATTGTAA AAGGTCCGGA ATATCCAACG 1980 

ATGGAAAATA AAACGGTACA AGGTGAAATC GTTCAAGGTC CCGATTTTCT AACAATGGAA 2040 

CAAAGCGGCC CATCATTAAG CAATAATTAT ACAAACCCAC CGTTAACGAA CCCTATTTTA 2100 

GAAGGTCTTG AAGGTAGCTC ATCTAAACTT GAAATAAAAC CACAAGGTAC TGAaTCAACG 2160 

TTAAAAGGTA CTCAAGGAGA ATCAAGTGAT ATTGAAGTTA AACCTCAAGC AACTGAAACA 2220 

ACAGAAGCTT CTCAATATGG TCCGAGACCG CAATTTAACA AAACACCTAA ATATGTTAAA 2280

TATAGAGATG CTGGTACAGG TATCCGTGAA TACAACGATG GAACATTTGG ATATGAAGCG 2340 

AGACCAAGAT TCAATAAGCC ATCAGAAACA AATGCATATA ACGTAACAAC ACATGCAAAT 2400 

GGTCAAGTAT CATAOGGAGC TCGTCCGACA TACAAGAAGC CAAGCGAAAC GAATGCATAC 2460

AATGTAACAA CACATGCAAA CGGCCAAGTA TCATACGGAG CTCGTCCGAC ACAAAACAAG 2520 

CCAAGCAAAA CAAACGCATA TAACGTAACA ACACATGGAA ACGGCCAAGT ATCATATGGC 2580 

GCTCGCCCAA CACAAAACAA GCCAAGCAAA ACAAATGCAT ACAACX3TAAC AACACATGCA 2640 

AACGGTCAAG TGTCATACGG AGCTCGCCOG ACATACAAGA AGCCAAGTAA AACAAATGCA 2700 

TACAATGTAA CAACACATGC AGATGGTACT GCGACATATG GGCCTAGAGT AACAAAATAA 2760 

GTTTGTAACT CTATCCAAAQ ACATACAGTC AATACAAAAC ATTACGTATC TTTACAACAG 2820 

TAATCATGCA TTCTATGATG CTTCTAACTG AATTAAAGCA TCGAACAATC GGAAGCATAT 2880 

TTCTAAATTA TTTATTCATT ATAGTCTTAA ACATAACATG ACCTAATATA TTACTAACCT 2940 

ATTAAAATAA ACCACGCACA TCTAAGTGAT ATACGACAAT CACAGCAATA ATAATTGCTT 3000 

TAGAAAGTCG TGCCGAACTG GAACTTACAA GTCTAGTTCG AACACACACT GATGTGAGTG 3060 

GTTTTCTTTA TTTTAAACAT GAACAATCAG ATAAGTTACT AGCATTAGCA AATATTATTA 3120

AATCAAAGGG CTTCGATTCA TAAAATTTAA AACAATGATT AAAATTAGAC GTGTAAATGT 3180 
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TATTTCACAC AGCTTCATTA ATAAAACGAA ATTGCTTCAA CCCGCTTCAA CTTCAACTGG 3300 

CTTCAACTTC AGCCTACTTC ATTCAATAAC AAAACX5AATC CGCTTCATCC AAAATCAACC 3360 

ATTCTAACGC ACATATTCAA ATATAGCAGC TGCACCCATG CCGACACCAA TACACATCGT 3420 

AACCATGCCG TAACGGCTAT CGGGACGTCT ACCCATTTCA TTAAGTAAAC GOGCGGTTAA 3480 

CATTGCGCCT GTAGCACCTA ATGGATGACC TAAAGCAATA GCGCCACCAT TCACATTCGT 3540 

ACGTGATATA TCTAGACCTA CTTCTTTAAT AGATGCAATC GTTTGAGAAG CAAATGCTTC 3600 

GTTCAATTCG ATCAAATCAA TGTCTTCAAC AGATAGATTG CTGAGTGACA ATACTTCAGG 3660 

AATCGCATAT GCAGGCCCAA TACCCATAAT TTTCGGGTCA ACGCCTACTG CCTTAAAACC 3720 

AACGAATCGT GCAATAGGTG TCACGCCGAG TTCTTTCACT TTATCTCCAG ACATTAAAAC 3780 

TACAAATCCT GCACCATCAG AAAGTGGGGC AGATGTTCCT GCAGTCATAG TGCCGTCAGC 3840 

TTTAAATACT GTACX3TAATT TGGCTAATGC CTCCATCGTG GTGTCAGGGC GTATAAATTC 3900 

ATCTTGGTCA AAGATATTTG TGTGTACTTT TGGTCCTGCG TTTGTATATT CAACTGAGTT 3960 

TACTTGTATT GGAATAATTT CATCTTTGAA CCGACCATCA CGTTGTGCGT CATAGGCACG 4020 

TTGATGACTT CTGACAGCAT AAGCATCTTG ATCTTCGCGT GATACGTCAA ATTGGGATGC 4080

TACATTTTCA GCAGTTAAAC CCATAGGATA TGACGCACCT ATATCATCAT ATTGTAAGGT 4140 

TGGATTGTTT GTGGGCTCGT TGCCACCCAT TGGTACGGCA CTCATCAATT CAACGCCACC 4200 

AGCTACAAGT ATATCTCCTT GACCAGCCAT AATTTGATTG GCTGCAATCG CGATGGTTTG 4260

TAATCCTGAT GAGCAGTAGC GATTCACTGT TTGACCCGGT ACCGTGTCAG ATAATCCCGC 4320 

ACGCAATGCA ATCGTTCGTG CAATGTTTTG GCCTTGTAAT CCTTCTGGAA AAGCCGTACC 4380 

AACAATGACA TCTTCAATCA TATTCTTATT GAATTTTCCO TCAATACGTT TCAATACGCC 4440 

TTGliATACT TTGGCTGCGA CATCATCAGG TCTTTCGTGG AATAATGCGC CTTGCTTTGC 4500 

TTTCGCTGCG GCTGAACGCC CATAAGCTAC AATGTATGCT TCTTGCATGO TTATCATCCT 4560 

CTCTTAATGA CTATCTTTTA ATTACGTAAT GGCTTACCAG TTTTTAACAT ATGTGCAATT 4620 

CTTTCATATG ATTTTTTAGA TTTTAGTAAG TCAATAAAGC CAATTTTCTC CAACGATTGA 4680 

ATGTAACGTT GATTGATAAA TGTATTTCTT GGTAAATCAC CACCCGCTAA AATTGTGGCG 4740 

ATATTTAAGG CAATATGATA ATCATGGTCG CTAATAAAAT GACCCCGTCT TTGCGCATCT 4800 

AATTGTCCTT GGATCAATGC TTTGAAGTCT TCACCTAAAG CGATATATTG ATGTCTAGGA 4860 

TTCGGAATAT AGTTTGTTTC TGCTTCATAT TTCGCACGTT TGAGCGCAAC TTCGACACGT 4920

TGTGCTGTAT TQAAAATAAT CGTATCTGTA TCACGTAAAT AACCATAACG ACGTGCCTCA 4980 
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GTGATTCCTC CAATTTAGTT GAGGATAAGA TAACCATTAA GATAATTGGA ATAACGTTGC 6900 

TATTTTATAA AATTAATTAA GTATCTTTGA CAGTCATCTT AGCCTCTTAT TTAAGGAAAA 6960 

AGCTTTATGC TTAAAATAAG TCTTTTTTAG TGAAATTAAT GCATCTCATA TAATTATTTG 7020 

CTATTTATAC GAAAGCAGAA TCTCCAGTCA AAGCGCGTCC AATTACTAAG GCATTAATTT 7080 

CATGTGTACC TTCGTACGTG TAAATCGCTT CTGCATCAGA GAAGAAACGT GCAATATCAT 7140 

AATCGTCAGC TAGTATGCCA TTACCACCTG TAATACCGCG GCCCATAGCT ACTGTCTCAC 7200 

GCAAACX3TAA GGCATTCATC ATCTTCGCCG TTGAAGTTGC AACCTCGTCA TATTCACCAT 7260 

GTGCTTGCAT ATTAGCTAAT TGAGCACATG TTGCCATTGC TTGAGCTAAA TTACCTTGCA 7320 

TCATTGCTAG CTTTTCTTGT ATTAACTGAT ATTTACTAAT TGGTTTGCCG AATTGcTTAC 7380 

GCTCAGTGAC ATAATCTAAT GTGGCACX3TA AAGCGCCAGC CATACCACCT GTAGCCATAT 7440 

AAGCAACGCC TGCTCTCGTT GAATAAAGAA TTTTGGCAAT ATCTTTAAAG CTTGTTATGT 7500 

TTTGTAAGCG ATCCGCTTCA TCTACTTTGA CATTAGTTAA TTTAATTAGG GCGTTAGGAA 7560 

CAATGCGAAG TGCGATTTTA TTATCAATGA CTTCAATATC GACGCCATCT TGTTCTGGTC 7620 

TGACTACAAA GCAATGGGGT TTGCCAGTTT CTTTATTTAC TGCGAATACT GGAATGACAT 7680

CAGATACATO TGCACCACCA ATCCATTTCT TTTCACCATT GATAACCCAA GTATCGCCTT 7740 

GGCGTTCAGC GACTGTTTCA AGACCTCCCG CAACGTCCGA ACCGTGTTCT GGTTCAGTTA 7800 

AAGCAAAGCA TGTACGCAGT TCATGTGACT GTAATTTAGG TACATATTTC GCAATTTGTT 7860

CTTTGCTACC TCCGAAATAG AAAGTGTTAT GCCCTAAACC TTGGTGAACA CCGAGTAGGG 7920 

TAGCTAAGGA AATATCAAAT CGCGCGAGTA GGTAAGACAT GAAAAACTGA AATAGTTGAC 7980 

TAGGCAmr GGCGTTTGGA CGATCCTTGT AAAGTAATGG ATTGTTAAAA TAATTTAATT 8040 

CTCfiCAGATC TTTAAAATAG TCXTTCGGGTA CAGTAGCGTC TATCCAATGT TGATTAATAT 8100 

TTTCACGGTA CTTACTTTCT AGCAATGAAT CTACTTGTTG TAAAAATTCG ACTTCACCGT 8160 

CTGTTAAACC TTTAGCAATA CTAAGTACAT CTTCAGGAAA TAATGTTTTT AAGACCGTTT 8220 

CTTTTTCAAA TGTCATATAA ATTCCTCCTA AAAATAATAT GAATACTAAT GTGAAATGCA 8280 

TTTAATTCAA AAACAACAC6 CTTTATTTGT AAACGCTTAC ACTAAATGTC AAAAATTTTT 8340 

ATCACCTTTA AAGTGTTTGC GAGACTTTGT CATTCATCAT TTGTCGAATC GCAAGTTTAT 8400 

CTGGTTTCTG CGTACTGTTT AACGGCATAT GTGTCACTGG TACATACATT CTTGGGACTT 8460 

TATAACCTGC TAAACGACTt CGCATATGTT GATTTAAAAT TTCAGCGTAA TGAGGTTCAT 8520

CTTCGCGAAG TATAATGGCT GCAGCAATTG ATTCACCATA TTTTGGATGA TCATAGCCAA 8580 
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AGACATTTTC GCCACCAGTT ATGATTAATT CTTTTTTGCG GTCAATAATA AATATATCGC 8700 

CATCGTTGTC CATCTTCGCT AAGTCACCAG TTAATAAATA TCX5ACCATGA AATGCTTTGG 8760 

CAGTCTCTGC TGGTTTATTC CAATATCCTG GCGTGACATT TTTAGCCTTA ATTGCAAGTT 8820

CGCCAATCTC ACCAGTAGGT ACTTCCTCAC CGTTATCATC AAGGATACGT GCATCAACGA 8880 

ACATGACTGC TTTACCAfVTA CTCATTGGCT TACGTTTTGA ATTTTCCGGT GTATTAACAA 8940 

GTACAAGAGG TGCTTCAGTT AAACCATAGC CGTTAATAAT GTTTATGCCA TATTGTTTAA 9000 

AAGCTGCTTG GATACTTGGT AATGGTTGTG AACCACCTTG GATGATATAA TCCATAGCTC 9060 

TAAAATTTTC AGGATTAAAA TTACTAGCAC GTAGCGTACT ATAATACATT GTCGGAATCA 9120 

TGATAATAAA TGTAGGGTGA TATTGTGCAA TCATGTCATT CAATTCTTCG CCGTTAAAGT 9180 

AACGTTGAAG AATAAGTGTG CCACCTGACA TTAATACTGG TAATACAGTA TCGTTAAACC 9240 

CTAAAACATG GAACATTGGT GTTGATACAA TCGTAATATA GTTTGAATTG AACTTATAOG 9300 

TCAGCTCTAA GITTGCACCG TTATGAACAA ATGATTCATA TGAGAACATC ACACCTTTAG 9360 

GTGATCCX3GT TGTACCACTT GTATAAATTA ATGCTQCAAG ATCTTGTGGT TCAACAGGTG 9420 

TTGCTTGAAA AGGTTGGTGA TAATCTGGAT TTACGATTTC ATCATATTGC GCCACATCAA 9480 

TATCCATATG CAATAAGTTT TGGTCAATAT CGGTGAGTGA ACTTAAATGT TTTTCAGCAT 9540 

AGAAGAGCAG TTTTAATTGT GCATCTTCCA CAATGGCTOC AATTTCTTTT GGGTTAAGCC 9600 

GCCAATTCAA TGGTAAAAAA ACCX^KZACCTG TTTTAAAACA AGCAAACAAT AAATCTAATA 9660 

TTGCAATATC ATTTGGCGCA AAAATACCGA TAACATCX5CC TTTTTTAACA CCTTGAGATG 9720 

TTAAATAATG TGCCATATTA TCAG0GCX5TG CATTGAGTTG TTGGTATGTC CAAGATGTTT 9780 

GTTTTGCGTG ATCAATAACG GCAGGCTTGT CATCATCGAA GTCTGAACGC GTTTTTATCC 9840 

AATCGAAATT CATTAGTATA CCCCCTTTAG CTTCACTTTC ATACTTTATG AATTGATTGT 9900 

TTAAGTTGTC CCCATTTTTC TTTGTAAATG CTGGTATCAA TTAATTTTAA ATGATCAGCA 9960 

ATAATTGGTT TAAAAGCCAT TTGATTCAAA ATATCTTTAT GCAAATCAAG ACCTGGTGCA 10020 

ATTTCAATTA GTTTCAAGCC TTGATTGGTG AGTTCGAATA CTGCACGATC AGTAACAAAA 10080 

TAGATTTCTT GCTCGAGTGA TTGTGAATAT TGTGCATTAA AGTCGATATG GCTCACATCT 10140 

GATACAAATT TCTGGTTTTG TCCTTCAGTT TCAATGTTTA ATCGTTGATT ATGGCATGAG 10200 

ACATGACTGC CAGCTACAAA AGTACCTGAA AAGATAATTT TATTTACAGA TTGCGTAATG 10260 

TCTATAAAGC CACCACATCC ATTTAGTCGG TCATTGAAGT AAGACACGTT GACATTGCCG 10320 

TATTGATCAA CCTCAGCAAA GCTAAGATAG GCAACTGATA CACCATTGTT ATAAATAAAA 10380 
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CGACTCCCAA CGAATCCACC GAAAATGCCA ACATCTAAAA TCGGTTGCAC ATCATGTTCA 10500 

ACACATTCTT CATGCAATAA ATTAGAGAGT TCATTATTGA TGCCATAACC GATGCTAATT 10560 

GTATCGCCAT AAGTTAAAAA CTGAGCAGCA CGTCGGAGAA TCAATTTGCG ACTATTAAAA 10620 

GGTAATGCGG GTTCAGGTAT TCCATCAATT CGTTCTTCTC CAGACAAGGC TGGTAAATAA 10680 

TGACTTTGAA TTACTTGGCG GTGATTCTTT TCATCTTCTG TGACGTATAC ATAATCGACA 10740 

AGATTTCCTG GGATAACAAC TTCATTCGGT TTTAGTTGAT AGTCXSTCAAC TAAAGCTTTA 10800 

ACTTGTACAA TAACTTTCCC ATGATTGGCT TTCGCGTTTA ATGCGACATG ATAACACTCG 10860 

CTCAAGTACG CTTCTTGAGT TAAATAAATG TTACCTTGTT GATCTGCGTA TGTTCCTCTC 10920 

AGTAGTGCCA CATCAACGCT AGGGAATGTG TAATGTAAGT ATGTTTCATC GTTGATGGTT 10980 

ACTAATGAAA CTAAATCATC CGTTGTTCGT GTATTTACTT TACCXJCCACC GTATCTAGGA 11040 

TCAACAGCTG TGTTTAATCC GATTTTAGTA ATAACTCCAG GTAATAATTG ATTACTCTGA 11100 

OGATAATGAG TTGCAATGAT ACCTTGTGGT AAAAAATAAG CTTCAATGTC ATTATTTTTC 11160 

ATTGCTTGTG CCGTTTTGGA AGAAGCCGTT AAAATACTCA TAATGACACG TTTAATCATG 11220 

OGACGTTCTA TAAAATCATC TAAATCCGGT GCGGCACCTA AACTATGAAT ATCATTCGCT 11280 

AATATAAACG TTAAATCATT GGGCGTATGA TATGTGTCAT GTTGCGCTAA CACAGCACGT 11340 

AGAACTTCGG CGGGTAAGTT GGCTACAGCT AATGCTGGTA AACCAATCAC ATCACCATCT 11400 

TTAATGATAT GTTGTAAGTC GTGCCATGTG ATTTGTTTCA AGCAAGTCAC CTCCATCACA 11460 

TTTGATAAAA TATAGCGTTT TTACACTTTG TGTAAACCCT TaCAAGAAAT ATAACATAAC 11520 

GACGTTTAAA ATCAATTAGA AATATCTTTT TATTCTGATA ATAGACACAG TATAGACACA 11580 

TTTTGATGGT CGATAACAAT TGTAATATCA AGGGTTTGTA ATGAATTGAA TATCATTAAA 11640 

ATACTTATAT AAAAATATTG TTCGGAATAT AAAAAGTTAA ATAGGTTTTO ATTTTTAAAT 11700 

ATGAAATACA AAGTGCCCAA TOGAACAAAG TATTTATATT AAAATATGGA AAATCCATCA 11760 

ATATTAAATT AAAATAGTTT TATTATGAAA AGTGAAAGTA GGTAAGTCTA TGGAAGGTCT 11820 

TAATCATCGA AGAAATACAG AAAAAGAAGA GACAACACAA ACGCAATCaG TTGCACCTAA 11880 

TACAGGTGAA GAGGGGATGT CATCAGCAAG TACACAATCA ACTAAGACGT CCGACATACA 11940 

TAATGAATCT ATCGATAAAC AAATGGAAGC TAAAGCGCAT GAAACAGCGC AAAATACAGA 12000 

TTTAAAAAAC GAAGCAAGAA GTTTATTTGA TAATGCAACC AAATCAATCG GTAGACTAGC 12060 

GGGCAATGAT GAAAGCTTAA ATCTTAATTT AAAAGATATG CTTTCTGAAG TATTTAAGCC 12120 

GCATACTAAA AACGAAGCAG ATGAAATATT TATAGCGGGT ACTGCTAAAA CTACGCCAGC 12180 
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TTTCACAGTA ACATTTATTG GATTATGGGT CATGGCAGCA ATTTTTAATA ACACTAACGC 12300 

GATTCCGGGT CTCATTTTTA TAGGGGCTTT AACAGTACCA TTATCGGGTT TGTTCTTCTT 12360 

TTATGAATCA AATGCGTTTA AAAATATTAG CATTTTTGAA GTTATTATCA TGTTCTTTAT 12420 

TGGCGGCGTA TTTTCATTAC TAAGTACGAT GGTATTATAT AGATTTGTCG TTTTTAGTGA 12480 

TCAATTCGAA AGGTTTGGTT CTTTAACATT TTTCGATGCA TTTTTAGTAG GATTAGTTGA 12540 

AGAAACTGGA AAAGCACTCA TTATTGTTTA TTTCGTCAAT AAATTGAAAA CAAATAAGAT 12600 

TTTGAATGGA TTATTAATCG GTGCTGCTAT TGGTGCAGGG TTCGCAGTTT TTGAATCAGC 12660 

AGGTTATATT TTGAATTTCG CTTTAGGAGA AAATGTCCCA TTATTAGATA TTGTCTTCAC 12720 

ACGTGCGTGG ACTGCGATTG GTGGTCATTT AGTTTGGTCA kCGATTGTTG GTGCTGCAAT 12780 

AGTTATTGCG AAAGAACAGC ATGGCTTTGA ATTCAAAGAT ATTTTTGATA AACGCTTTTT 12840 

AATATTCTTT TTATCAGCCG TTGTTTTACA TGGCATTTGG GATACATCTT TAACTGTACT 12900 

TGGCAGTGAT ACGTTGAAAA TATTTATTTT AATCGTTATT GTGTGGATAC TTGTATTCaT 12960 

TTTAATGGGG GCAOGTTTAA AACAAGTGAA TTTACTGCAG AAAGAATTTA AAGAACAACA 13020 

C3AAAAAAGTA GACGAATAAT AATTAAAGCT TATGTTGCTC ATATGTTT6T GACATAAGCT 13080 

ATTTTTATAA TTTGTCTTTA AAAGAGTGGA ATAGGAATAC TTTTTGGAGT TAAAAAAGTG 13140 

TTtCACGTTA AACAAATAGT GACAATTAGA TTTATATAAA ATGAACATGA TTCACTGAAA 13200 

GTATGTAATA ATCATTTTAT TGAAATTCAT CAAACAGAAA TTAATACAAT CATATAAGCA 13260 

AATTAAACCA CGCCATAATC ATATTGGATG ACTTCGGCGT GGTTTTTATA GTTGAAGCAG 13320 

GGCTGAGACA TAAATCAATG TCCCACACTC CCTTATCGTT CAATCGTTGT TCGATAATCG 13380 

ATTAAATAGA TACCTTCAGG TGTTACTTTA TAATTTTTAA CCTTAGAGTT AGCAGCGACT 13440 

ATTD&ATCX5T TGTAAGCAAT ATAACTGTTT GGTACATCTC GACTTGATAA TTTAATAATA 13500 

TCAtTAGAAA TATTGTGACG TTCCTTAACA TCTACAGTAT GATTCAATTG ATTAATTAAA 13560 

TCATCGACGT TGCTATTATT GTAGTCTCCT TTATTAATAG CACCATCTTT TTTATATGCT 13620 

TGATTAAAGA AATAACCTGT ATCTCCACGA GGAATTGTTC CGAAACTATA CATCGTTGCA 13680 

TCCCATGCAG AACGGTCTTT TAAGTAACCT TCTATGTCAT CAACACTTTT AATGTCGATT 13740 

TCAATATTTG CTTTTTTAGC ATCTGATTGT AATACTTGCG CAATTTTCGA TAGCTCTGGA 13800 

CGACCGTCAT ACGTAATTAA CTTAATTTTT AAAGGGTGTT CTTTTGTATA ACCATCTTTA 13860 

GCTAATAACA TTTTTGCTTG TTCGATATTT TGTTTGGTTA ACTTAGGTTC TTTAATATAT 13920 

GGAATTTTAT CATTAAATGG ACTCGTTGCA GGTTTCGCAT AACCTTQATA AATATGATCT 13980 
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TTATtAGTAT GATTATACAT AAGTaAGAAG TTCTAAAn 14078 

(2) INFORMATION FOR SEQ ID NO: 192: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 486 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192: 

TGAAAACTAA AGTGTTTCTA ATGCGTGACT AAAATTAGTA ATAATTAAGT TCTCATGATA 60 

ATAGGTATTT TTGAAAAATG GAGGAGTCTA TAAATGGGTA AAAAAATGGG TCTAGGTTTA 120 

TCTATTGCAT TGGTTGTTAT TGGTATTGCC GTTGTATGTT TAATGATTTT TTCTAGTCAA 180 

AAAACGACTT ATTTTGGTTA TATQAATAGT AATACAAATG CAGAAAAAGT TGTCAGTGAA 240 

AAAGATGGAT TAGTCAAACA TAATATCAAA GTAGAACCAT CTAATGATTT CAAGCCGAAA 300 

AAAGGAGACT TTGTAAAATT AGTTTCTAAA GATGATGGGA AGACATTTTA TAAACAAGAG 360 

ATTGTTAAAC ATGATGACXTT CCCACACGGT TTAATGATGA AAATTCACGA CATGCATATG 420 

AATTAATAAA AAA6CATCTA TAACGTAATT TTGAAGAAGT AGAGTTATCT TCTTATGCGT 480 

TTTAGA 486 
(2) INFORMATION FOR SEQ ID NO: 193: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1626 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193: 

GAGGTCTATA TACAATTATG GTTGTTCCAG TTAAACGAAC TGATGGCTTT ATTACTAAGT 60 

TTAATAGATT AATTGAAAGA CGATTATTAC GTCATTTCAG TAAAAAAGGT TATATCACAT 120 

GGGAGGAAAA TTGATTGTCT GACATTTTAA AATGTATCGG TTGTGGTGCG CCACTTCAAT 180 

CTGAAGATAA AAATAAACCT GGTTTTGTAC CAGAGCATAA TATGTTTCGT GATGACGTGA 240 

TTTGCAGACG TTGTTTCCGC TTGAAAAATT ATAACGAATT CAAGATGTAG GATTAGAAAG 300 

TGAAGACTTT TTAAAATTAT TATCAGGACT TGCGGATAAA AAGGGTATTG TCGTCAATGT 360 

CGTGGATGTA TTTGACTTTG AAGGATCATT TATTAATGCA GTTAAACGTA TTGTCGGAAA 420 
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TCGAGTTAAA GAATGGTTAA AACGAACAGC AAGAAAATAT GGTTTGGAAG CTGACGATGT 540 

CGTATTAATT TCAGCTGAAA AAGGCTGGGG CATAGACGAC TTATTATCAT CAATTGCGAA 600 

TATTCGA6AA AATGAAGATG TGTATATTGT AGGGACAACG AATGTTGGGA AATCTACATT 660 

GATTAATAAA CTGATTOAAG CTAGTGTTGG TGAAAAAGAT GTAGTAACAA CTTCAAGATT 720 

CCCTGGAACA ACTTTAGATA TGATAGATAT TCCTTTAGAT GAAACATCAT TTATGTATGA 780 

TACACCAGGT ATTATTCAAG ATCACCAAAT GACGCATTTA GTTAGTGAAA AAGAATTGAA 840 

AATTATTATG CCTAAGAAAG AAATAAAACA ACGCGTATAT CAATTAAATG AGGCGCAGAC 900 

ATTATTCTTC GGCGGTCTAG CGCGCATAGA TTATGTATGA GGTGGTAAAC GTCCGTTAGT 960 

TTGTTTCTTT TCTAATGACT TGAATATACA TCGTACTAAm ACGGAGAAGG CTAATGATTT 1020 

ATGGCGTAAT CAACTTGGCG ATTTATTAAC GCCACCTGGA AATCCACAAA ATTTTQATCT 1080 

TAATGAGGTA AAGGCTGTTA GACTTGAAAC AGGCAAAGAG AAACGCGATG TTATGATCTC 1140 

TGGTCTAGQC TTTATAACTA TAGGACCAGG GGCTAAAGTA ATCGTTCGTG TTCCTAAAAA 1200 

TGTTGAkGTT GTATTAAGAA ATTCTATTTT ATAAGGTGaT TAAAAAAATG AAATTTGcAG 1260 

TTA7AGGAAA TCCTATTTCA CATTCCTTGT CGCCCGTTAT GCATAGAGCA AATTTTAATT 1320 

CTTTAGGATT AGATGATACT TATGAAGCTT TAAATATTnC CAATTGAAGA TTTTCATTTA 1380 

ATTAAAGAAA TTATTTCGiAA AAAAGAATTa GAAGGCTTTA ATATCACAAT TCCTCATAAA 1440 

GAAOGTATCA TACCGTATTT AGATTATGTT GATGAACAAG CGATTAATGC AGGT6CAGTT 1500 

AACACTGTTT tGATAAAAGA TGGCAAGTGG ATAGGGTATA ATACAGATGG TATTGGTTAT 1560 

GTTAAAGGAT TGCACAGCGT TTAnCCAGAT TTAGAAAATG CATACATTTT AATTTTGGGC 1620 

GCAGGT 1626 



(2) INFORMATION FOR SEQ ID NO: 194: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 635 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194: 

AGGGTTAATT GTCGGTTTAA TTGCAATGAA TAAGTTCCAT GTATTAGCTG GCTATAGAGC 60 

GAAATTCATC TTAATGGTGA TTTTAACTAT GATGGTCTTC GTACTTATTA ATACGTATTT 120 

ACTAAGACAG GTAAAATCTA TCGGTATGTT CTTAATGATT GCTGCATTGG GTCTATACTT 180 

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GTCTTATATC GATAACATGT TCTTCAATTA TTTAAATGCA GAGCATCCTA TAGGCTTGGT 300 

GCTAGTAATA TTAACAGTAC TTGTGATTAT TGGCTTPGTA CTGAACATGT TTATAAAACA 360 

CTTTAAGAAA GAGAGATTAA TCTAATGTTG ATGAATAGCG TGATTGCTTT AACTTTTTTA 420 

ACAGCATCTA GCAATAATGG CGGACTTAAT ATTGATGTGC AACAAGAAGA GGAAAAGCGA 480 

ATCAATAATG ATTTAAATCA ATATGATACA ACGCTATTTA ATAAAGACAG CAAAGCGGTT 540 

AATGATGCGA TTGCTAAGCA GAAAAAAGAA CGACAACAAC AAATAAAAAA TGATATGTTT 600 

CAAAATCAAG CGAGTCACTC GACTCGCTTG AATGA 635 



(2) INFORMATION FOR SEQ ID NO: 195: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13715 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 195: 



CTGAAATGGG TATTATTTGT CTTCTTCATC ATAAAGTAAT AAAGATTGTT CATCATTGCG 60 

ACGTTGCCAA TTTTCATTTG GCGATCTTGG AACGTATAAT AATTGCCTAG TAGCATACGC 120 

TTTGATTGAA ACATACAAGT CATTCCTTGA ACTTGCACCT CAATTTCCAC ATTTGAATTT 180 

TCTTTTATAT TAATAATTTC ATCCAAATTC AGCTCACGTG CTAAGACAGC TCTTGATGCG 240 

CCTCTTTTAC CCCAGTAATT ACATTGAAAA TGATTAGTTA CTAACGTCTC TGCATTCCAA 300 

TGAAGTGGTA TTGGATTTTC TTGCGCCTTC ACATACATTA CTACTGCTGO ATCCCCGAAA 360 

ATAATTCTGT CAACTCGTAT TTCATGTAAA AAATTAATAT AATCTTCTAC AGCATCTAAA 420 

tgatJuvttat GAAATAATCC ATTCACTGCC GCATATACTT TTTTATCGTT TTTGTGAGCT 480 

AATGCGACAG CCTCTGTCAT TTGTTGTCTA TTGAATTCCC CTGGAAGTCT TAAACCAAAC 540 

TTTTGCTCGC CAATTACAAA AGCATCTGCA CCTAAATCAA TAAGTGTTTC CATATGGCTT 600 

AATGACTTGG GTGTGACAAG TAATTCTGTC ATAGTCATTC TCCTTTAATT GAAATCGCTA 660 

ATCCATCGTC TATATTTAAA AAATTCGTTG TATATCCTGG TTGCTTTATT AACCACTCAT 720 

TATAATCTTG AACCTTTTTA ACCATTTGTC TTACATTTCT CGATCTAACA ATCCCAATAT 780 

CCGATACAAA ACCGTGATAT AAAACATTAT CTGTAATTAC GAGACCTTGG TGCTTTAAAA 840 

GTGGTGTATA TATTTCAAAA AATTTCTTTG ATTGCGCTTT TGCTGCATCA ATAAATATCA 900 

TATCATAAAC TTTGTCATTT ACATTTTCAA ATTGCTCTAA AGCATTACCT TCAATAATTC 960 
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CATTACGCTC TATCGTTGTG ACATGAATGT CATCAGATAT AGAAGCGAAT TGCATAGAAC 1080 

TATAGCCGAT TGCTGTACCA ATTTCTAAAA TATTTTTAAC ATTATTCATA CGAATTAATT 1140 

GCTTAATTAA ATCTAATGTT AAACGATCTA CAATTGGCAC TTCATTTACC TCGGCAAATT 1200 

CACGCAAAAC TTCGATTGAA CTATTTTGAT GTTGATGTAA ATCTATTAAA TATm-riAT 1260 

TTAGGTCATC CATGTTTTAA ACTTCCTTTA TGTAAAATAA GTCAATATGA TTATGACAAT 1320 

AAAATAAATC AGCCTTCACA ATTGATTATA ATTTTGCCAA CCAATTAAAT GACTGATTTC 1380 

GTGTTAGACG CAAAGCTATT TTATTTATAG AAGCGAATCA TTCATATAAA ATTTAACTTT 1440 

AGATATTTTA CCATATTTTC AATAAAATTA TAAGCGTTAA TTATTTATAC ATTGCTTGAC 1500 

TTAAAAAATA CTCTTGCCTC CCCATCTTTA AGGTTAGCAA GAGTAAAATC TTTTTAATTA 1560 

TTCTTCCATT TCAGTATTTA CAACTTCTTC AATCATGTCC CATTCTTCAT CAGTTTCGAT 1620 

TGGTACTAAC TTACCACCGT CACCTGACTC ATCTGGTTCA TTGATCATTG GTACAAGCTC 1680 

AATCATATCG TCTTCATCTG ATTGAGCACC TTCTTCAGCT AAGATAACAT ACTCTTTTTT 1740 

GAATTCAGGA TGATAAAATT CTAAAACTTT TCGGTATAAA ACTTCATTTC CCTCTTCATC 1800 

GAATAAAGTT AATAATTCTT CTTCGTTATT AATTTCTAGT TGTGAATCAT GATTATGTTC 1860 

AGTCATAGTA AAATCTCCTT TTAATGTAGT GAATCTAAAT AGCCTTGTAA AATAAATACC 1920 

GCTGCCATTT TATCAATCAC TTTTGTCTTG AAACATCTGC TTCTAATAAT 1980 

GATCGTTCAG CAGCCATTGT GCTTAATCTT TCATCCCACA TCACAATCTC AATAGAAGGA 2040 

TAAGCTTCTA ATAATTTTTC TTTATATGTT AACGAAGCTT CGCCTCGAAA TCCTATTGAA 2100 

TTATTCATGT TTTTAGGTAG TCCTATTACG ACTGTACCCA CATTATGTTT TTTAATAATG 2160 

TCTACTAATT GGTCAATACC TAATTCATTA TTTTCTTCAT TGATTCGGAG TGTGTCTAAT 2220 

CCTT&TGCCQ TCCAACCCAT TATATCACTA ATTOCAATTC CTACCGTTCT ACTACCXSACA 2280 

TCGAGTCCTA AAATTTTATG TTGTAACATA AATTATTTAT TTTGCTCTTT TAAATAGTAA 2340 

GAAACAAGCT CTTCCATAAT AACATCTCTA TCAATATGAC GAATTTGATT TCTTGCTTCA 2400 

TTTTGGCGTG GAATATACGC AGGGTCACCT GATAATAAAT AACCTACAAT TTGGTTTACG 2460 

GCATTATATC CTCGTTCATC TAATGTTCGA TAAACATTAT TTAAAACATC TCTTACATCT 2520 

TGCX3TTGGAA GTTCTTCATA GTCGAATTTC ATTGTTTTAT CAAAGTTTTC CATTTGCGAC 2580 

ACTCCTTTAA TTACAAATAT AACTCACTAT CATCATACAA TATTATGGCT TTAAATTATA 2640 

GATTTTTAAT GTAATCTTTA ATAAAGCTTA ATGATTTTGA GATATTTTCA GGTTGTGTAC 2700 

CGCCACCTTG AGCCATATCT GGACGACCGC CACCTTTACC ACCAACGATT GGTGCCATTT 2760 
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AAGCCCAAAT TGTATCTTGC ATCAACATGA TTTTTATCAA TCGTTAATAC ATGTTTAAGT 8280 

TGAGTTATGG CTTCATTAAA CATTTCTAAT TGACATAATA CAAGACCATA TTGAAATTGA 8340 

ACTTCTGCAT CTTTGTCTTT ATCTAGTTCC GCAGCAGTCA TTAAATACGG CAATGCCAAG 8400 

CTTAAATGAT TCTAACTGAT TAAACGCCAT ACCGATCATA TAATTACAAT CAACTTGTTC 8460 

AATCTCTGTT TGTAATGCTT GTTGATATAA TTTAATAGCT TCTTGATAAC GTTGCTGATT 8520 

ATAATATACA TTTGCTAGAT TAAAAAATAC GACGCCATTC TTCX3GATCTA TTGTnAAAGC 8580 

TTTTTGGAAA AAACGCTCTG CCTTTyCAAy CyCATTCgCA TCAGCAAGTA CGATmCCaGC 8640 

ATTAATATAA TTTTCAATAA TTGTAGGATT TTCTTCXiATA TITCCGAACA ATGCTTGTAA 8700 

OGCTTCTTCT ATTTTTCCAT TTTGTATGTA TTGATAAATT GTTTGTTGAT CTATCATTTA 8760 

CGAACCTCAT TTCTCATCAA TTATAACATC TTGATAAATT GTATGTCTCG AATCACTTAA 8820 

CAACGAATAA AATATAATCT AATATCATCT TCATTCATGA AAAAGCGGGA ATGGAATAGA 8880 

AATGCTTAAG AACCATTAAC GGTTTATTAT GTAATGGTTC TTCCACATTA GCCACCACTA 8940 

TTATGTACTT AAAAATAAGA ATACATAATT AGATTCATGC ATAGGGAGTG GGACAGAAAT 9000 

GATATTTTAA CAAAATTAAA TTCGTTATCC CCAACTGGCA TTGCCTGTAG AATTTCTTTA 9060 

CGAAATTCTC TATGTTGTGG TCCCGCCAAT ATAACATTGT AGAGCCTAGG ACATTGTGAT 9120 

GTCCCAGACT CTATCCTCAT GAATTATTCT CATCAAAAAC TGTCTTTCGT CATTTTCAAC 9180 

GTTGAAACTT CAAATAAGTA ATTTATTGTT GCCATTGTTT ATACAACATA ATTTAATTGA 9240 

CCTTCATTTT TGAACACATC GTCAATTGTT GCACCACCAA GACACACATC ACCTTGATAA 9300 

AAAACAACTG CTTGTCCAGG TGTGATTGCT CTTACTGGCT CAGCAAAAGT AACACGTAGg 9360 

CAtGGtCGTT TTCACGTTTC ACAAAAACTT TCGTATCTTT TTGGCGATAT CTAAATTTAG 9420 

CTGtACATTC AAAACCTTGA TCTAAGTCAT TATCTTCTGG ATTTACAAAT GAATAGTCTG 9480 

AAGCAATTAA GTAATCACTG TATAATGCAT CXSTGATGGAA TCCTTGTTCT ACATATAAAA 9540 

CATTATCTTT TAGGTTTTTA CCGACAACAA ACCAAGGATC GCCATCTCCA CCTATACCTA 9600 

ATCCATGTCT TTGTCCTATT GTGTAATACA TCAAACCACT ATGTTTACCC ATTTTCTTAC 9660 

CATCAAGTGT TATCATATCA CCCGGTTGTG CAGGTAAATA TTGT6ATAAA AATGTTTTAA 9720 

AGTTTTTTTC GCCGATAAAA CAAATGCCTG TAGAATCTTT TTTCTTAGCA GTAACAAGTC 9780 

CTTGTTCTTC AGCAATTCGA CGCACTTCAC TCTTTTCGAT GTCGCCAATT GGGaACATCA 9840 

CTTTTGAAAG TTGTTGTTGA GATAATTGAT TCAAGAAGTA TGTTTGATCT TTATTATTAT 9900 

CTACACCACX; TAACATTTCA ACATGACCAT CTTCATGACG ATGTATGCGT GCGTAATGTC 9960 
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TTTCTTTATT ACACATAACG TCTGGATTTG GAGTACGACC TTTTTTGTAT TCATCTAAGA 10080 

AATACGTAAA GACTTTATCC CAATATTCTT TTTCAAAATT AACAGCGTAA TACGGAATGC 10140 

CAATTTGATT ACACACTTCA ATAACATCGT TGTAATCTTC AGTTGCAGTA CATACGCCAT 10200 

TTTCGTCAGT GTCATCCCAG TTTTTCATAA ATATGCCAAT GACATCATAA CCTTGTTCTT 10260 

TTAAGACGTG GGCTGTTACA GAACTATCTA CACCGCCTGA CATACCAACG ACAACACGTA 10320 

TATCTTTATT TGACAATTAT GACTCCTCCT TAAATTTAAA ATATATTTTA TGAATTTCAG 10380 

CTACAATTGC ATTAATTTCA TTTTCAGTAG TCAATTCGTT AAAACTAAAT CGAATCGAAT 10440 

GATTTCATCG CTCCTCATCT TCGAACATTG CATCTAAAAC ATGCGACX3GT TGTGTAGAGC 10500 

CTGCTGTACA TGCAGATCCA GACGACACAT AGATTTGTGC CATATCCAAC AATGTTAACA 10560 

TCGTTTCAAC TTCAACAAAC GGAAAATATA GATTTACAAT ATGGCCTGTA GCATCCGTCA 10620 

TTGAACCATT TAATTCAAAT GGAATCGCTC TTTCTTGTAA TTTAACTAAA AATTGTTCTT 10680 

TTAAATTCAT TAAATGAATA TTGTTATCGT CTCGATTCTT TTCTGCTAAT TGTAATGCTT 10740 

TAGCCATCCC AACAATTTGC GCAAGATTTT CAGTGCCTGC ACGGCGTTTC AATTCTTGTT 10800 

CACCGCCAAG TTGAGGATAA TCTAGTGTAA CATGGTCTTT AACTAGTAAT GCACCGACAC 10860 

CTTTTOGTCC GCCAAACTTA TGAGCAGTAA TACTCATTGC GTCGATCTCA AATTCGTCAA 10920 

ACTTAACATC AAGATGTCCA ATTGCTTGAA CCGCATCAAC ATGGAAATAT GCATTTGTCT 10980 

CAGCAATAAT ATCTTGAATA TCATAAATTT GTtGCACTGT GCCAaCTTCA TTATTTACAA 11040 

ACATraTAGa TACTAAAATC GTCTTATCTG tAATTGTTTC TTCAAGTTGA TCTAAATCAA 11100 

TAGCACCTGT ATCATCAACA TCTAGATATG TTACATCAAA ACCTTCTCGC TCTAATTGTT 11160 

CAAAAACATG TAACACAGAA TGATGTTCAA TCTTCGATGT GATAATGTGA TTACCCAATT 11220 

aTTC&TTTGC TTTTACTATG CCTTTAATTG CCGTATTATT CGATTCTGTT GCGCCACTCG 11280 

TAAATATAAT TTCATGTCTA TCTGCACCAA GTAATTGTGC AATTTGACXjT CTTGACTCAT 11340 

CTAAATATTT ACGCGCATCT CTTCCCTTAG CATGTATTGA TGATGGATTA CCATAATGCG 11400 

AATTGTAAAT CGTCATCATC GCATCTACTA CTTCAGGTTT TACTGGTCTG GTCGCAGCAT 11460 

AATCTGCATA AATTTCGATQ TTTGGACACT CCTCACAATT TTATCAATGT TCCAATAATA 11520 

GCACCTTACA TACTATTTTT CTACTTTTCT GTTTAACTTT ATTTATAATG TTTTTAATTA 11580 

TATTTTACCA TTTTCTACAC ATGCTTTTCG ATAGGCTTTT TTAAGTTTAT CGCTTTATTC 11640 

TTGTCTTTTT TATAAATTTT AGTATTTGCA GATATTTTTT TATTTGTAAA ATGTAACX3TA 11700 

CTATTATTTT GGTTATGAGC AATTTAATAT TTATCTGGTT ATTCGATTGG TATACTTCTT 11760 
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35 



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GTCCCTATTC GAGAAGGTGA AGATGAACAA ACAGCAATTA ATAATATGGT TAATCTCGCA 11880 

CAACATTTAG ACGAATTATC ATATGAAAGA TATTGGATTG CTGAACACCA TAACGCTCCC 11940 

AACCTAGTAA GTTCAGCAAC TGCTTTATTA ATTCAACATA CGTTAGAACA TACGAAACAC 12000 

ATACGTGTAG GTTCTGGAGG CATCATGTTA CCTAATCATG CTCCATTAAT CGTTGCGGAA 12060 

CAATTTGGCA CGATGGCAAC ATTATTTCCA AATCGTGTCG ATTTAGGATT AGGACGTGCA 12120 

CCTGGAACAG ATATGATGAC CGcAAGTGCA TTAAGACGAG ATCAACATGA TGGTGTTTAT 12180 

AAATTTCCAG AAGAGGTTTC ATTATTACAA CAATATTTCG GCCCTGCTCA CCAACAAGCA 12240 

TATGTTCXJTG CTTATCCAGC AGTAGGTAAA AATGTGCCTT TATACATTCT TGGTTCTTCA 12300 

ACAGATTCTG CACATTTAGC TGCTCGCAAA GGGCTTCCAT ATGTGTTCGC TGGACATTTT 12360 

GCACCTCAAC AAATGAAAGA AGCTATCGAA ATTTACAAAA CGTTATTTGA ACCTTCTGAT 12420 

GTATTAGACG AACCTTATGT TATTGTATGT TTAAATACAA TCGTTGCTGA AAATGATGAC 12480 

GAAGCACAAT ATTTAGCTTC ATCTATGGCA CAAGTAATGG TTAGTATCAC TCGTGGCAGA 12540 

ATGCAGCCCG TTCAACCGCC AACACATGAA CTACAAAATA TATTAACGCC GAGAGAATAC 12600 

GCGATQGCTA TGGAAAGACA GAAAATATCA TTAATAGGTT CAGAAAATAC TGTTCAACAA 12660 

AAAATTCAAG ATTTTATGGA AACTTATGGT GAAGTCAACG AAATTATGGC AATAAGTTAT 12720 

ATTTATGATA AAGATATGCA ATTAGACTCT TATCX5TCGGT TCAAGAATGT TATAAATCAG 12780 

ATAAATGAAA AAAACACTTT ATAATGTGAT AAATAAACTA AGTGAAAGTA TGTATCCA7A 12840 

ATATTAATAA AAATATACAG TAACAGCATT TTQAATGAAA GATGTCTTTA TTGTTCAATC 12900 

ATTTATTTTA GTAATGATTC AAATTCACTT AAAATyCTAA tGCAAATATG AAAGCGCCCC 12960 

TTCAcTTTAC ACTGTGTAAG TGTTTATTTG ATGGGGCGCT TTCAAAATAT TGAAAAGCAT 13020 

ATCCSAAATT TAAAGAAATT TATTTCTCTT TATCTTCATT TTCTTTTTTC TCTTCGTTAT 13080 

TCGATCCTGT ATATTCATTT ATCTTATCTT TTACATTTTT AACTTGTTCA TTATCGCTAT 13140 

TTTTAAATTT TTCTACGCGT CTTTAGCTTT ATCCATAAAA CTCATATTAA TCGCTCCTCT 13200 

TATATTTGAT TAGTTTAATT GAACTTATTT TTTAAGTTTA TCAATTGCAT CAGTTATTTT 13260 

GTTTTTAGCA TTTTCAACAA CTTCTTTTGC TTTaCCAGTC GCTTTATCTT GCTGACCTTC 13320 

TTTTTCTAAT TCTTTGTTAT CAGTAACGTT ACCTACTGTT TCTTTAACAT TTCCTTTAAA 13380 

TTGATCGAAC TtACTTTCGT CTGCCATAGT GAAACCTCCT TGGATGTATA TATTTATATA 13440 

CCACTAAGGA GGTTCGCTmm mCAyymyAAT ATGAAGTTTT TATGTTATAG TATAGTATTT 13500 

ATACGATTAA ATATAAAACA TGTATCCGTC TAAATCTTCA CTTGTATCTA CATATTCCGC 13560 
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TAGTTGTTTT TGCGCAGGTG GTTCTGATTC AATACTTTCA ACAAATGTAA TTGGACCTTC 13680 
TAACAGTCTT ATAATATCCC CTGCTGAGAT TTCTT 13715 
(2) INFORMATION FOR SEQ ID NO: 196: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 873 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196: 

AAATCCATAA TGTCATGATA ATCTGCATAT GCTTCATATA ATTCAATCAT TGTGAATTCA 60 

GGGTTATGTC TAGTTGATAC ACCTTCATTA CGGAATACTC TACCAATTTC ATATACTTTT 120 

TCAAGTCCAC CGACAATTAm ACGTTTTAAA TGCAACyCAA TAGCAATACG CATGTATAaC 180 

GTTGCATCTA ATGCATTATG ATGTGTTACA AATGGTCTAG CAGCTGCTCC ACCAGCAATT 240 

TGGTGCATCA TAGOTGTTTC TACTTCCAAG AAACCTTTAT TATTTAAATA ATTACGCATT 300 

TCTTGAATGA TTTTACTACG ATTAATAAAT GTACGAGTGC TATCTTCGTT CGTAATTAAA 360 

TCTAAATATC TTTGACGATA tCTCTGTTCA ATATCCTGTA AACCGTGGAA TTTATCCGGT 420 

AATGGTCGCA ATGATTTAGT TAGTAGCGTG AATTTCTTCG CTTTAACCGA TAATTCGCCA 480 

GTATTTGTTT TGAACATTAC ACCTTCAACA CCAACGATAT CGCCTAAATC AGCATTTTTC 540 

CATAAATCAA ATTCGTCATC GCCAACTTGA TCTTTACGAA CGTAAATTTG AATTTGTCCA 600 

GCTAAGTCCT GAACGTGTGC AAATCCTGCT TTACCTTTAC CACGCTTAGT CATTAATCGT 660 

CCAGCTATAG CGACATGACT ATCCGCTTCT TTTTCTACCA ATTCTTCTTT AGAATACTGG 720 

TCCS^CTCTT CTTTCAAATC ACTAGATAAA CCTGAACGGT CAAATTTAGA ACCAAACGGG 780 

TCTATACCAA GATCATATAA TTCTTGTAAT TtTTGACGTC GAACCAACAT TTGGTCATTC 840 

ATTTCTTCTG ACATAACTtT CTCTCCTTTA ACT 873 
(2) INFORMATION FOR SEQ ID NO: 197: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 452 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197: 
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ACCATAATAT GAATGGCTTC AGGATCAAAA TAAAGACCAA CTTCACTGCC TACTTCAGCT 120 
TTTTTAGTCG TTTGTATTAC CCATTCATAA CCTTTATTGT CTATACAACA TATTTCATAG 180 
TGGACCCCTC TAAATAACAT AGAATCAACA GTTGCTTTAA ATAATCCTTC TTCAGCTTTG 240 

ATTAATGATA TATCTTCTGG TCGAATAACG ACTTCTACTT TTTTATTTTC AGGAATACCC 300 
ATATCGACAC ATTCGAAATC TTGCCCATAA ATATTCACGA CATAATCTCT AACCATGCGC 360 
CCTTCAACAA TATTAGATTC TCCAATAAAA TCAGCTACAA ATCGATTCAC TGGTTCGTCA 420 
TaTATATCTG TTGGTGTGCC AAATTGTTGA AT 452 
(2) INFORMATION FOR SEQ ID NO: 198: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2308 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198: 



TAGGTTGGGT TCTAACATAC GATAAAGCTC AACAAATCAA CACAGCTTTC TTTGTAAAAT 60 

TGTTTAATAC TGCATTAGCA GAACGTGATT ATTATTTTAA TATA6ATGGA ACAAATGCTT 120 

TTAGATTATT TAATGCTGAA GGTGATGGTG TTGGGGGATT AACAATCGAC AATTACGATG 180 

GTCATTTGTT GATTCAAT6G TACTCAAAAG GTATTTATAA ATTTAAATAT GCCATTCTTG 240 

AAGCGGTTAG AAAAGTATTT GATTATAAAT CTATTTACGA AAAAGTAAGA TTTAAAGACA 300 

GCGAATATAG TGGTGGTTTT GTTGAAGGAG ATGCACCTGa GTTTCCAATT GTTATCGAAG 360 

AAAACTTCAC ATTTTATAAT GTAGACCTTG AAGATGGTTT GATGACAGGT ATCTTTTTAG 420 

ATC/SAAAAGA AGTGCGCAAG AaATTAAGGG ATCAATATGC CAAAGAACGC CATGTTTTAA 480 

ACTTATTTAG TTATACAGGT GCTTTTTCTG CAATAGCAGC AAGTGAGGCA TCTTCAACAA 540 

CAAGTGTAGA TTTGGCTAAT CGTTCTCGTA GTTTAACTGA AGAAAATTTT GGATTAAATG 600 

CTATTGATCC TAAATCCCAA TATATTTATG TCATGGACAC TTTTGATTTC TATAAATATG 660 

CTGCACGACA TGGACATAGT TATGACACGA TCGTGATTGA TCCACCTAGC TTTGCGCGTA 720 

ACAAAAAACG TACATTTTCA GTGCAAAAAG ATTATGACAA ATTAATTAAT GGCGCCTTAA 780 

ATATCTTATC ATCTGAAGGA ACATTATTGT TATGTACAAA CGCAAGTGTA TATCCATTAA 840 

AGCAATTTAA AAATACTATT AAAAAGACGC TTGAAGAGAG TGGCGTTGAT TATGAATTAA 900 

CTGAAGTTAT GGGATTACCA AAAGATTTTA AAACGCATCC ACATTATAAG CCATCTAAAT 960 
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TATTGAGAAA AAGAAGGGTG ATAATATTAT GGGATTCAAA AACAATTTAA CATCAAATTT 1080 

AACAAATAAA ATCGGTAATT CAGTCTTTAA AATAGAAAAT GTTGACGGAA AAGGTGCAAT 1140 

GCCAACGACG ATTCAAGAAT TGAGAGAAAG ACGACAACGT GCTGAAGCAA TTGTAAAGAG 1200 

AAAGTCTTTA ATGTCATCAA CAATGAGCGT TGTTCCAATT CCGGGTTTAG ATTTTGGTGT 1260 

TGATTTAAAA TTAATGAAAG ATATTATCGA AGATGTTAAT AAAATTTATG GTTTAGATCA 1320 

TAAGCAAGTT AATAGCCTTG GGGATGATGT GAAAGAAAGA ATTATGTCTG CAGCAGCAAT 1380 

TCAAGGTAGT CAATTTATTG GTAAAAGAAT TTCAAATGCA TTTTTAAAAA TTGTAATTAG 1440 

AGATGTAGCT AAACGTACTG CTGCAAAACa AACAAAATGG TTTCCTGTTG TAGGACAAGC 1500 

TGTGTCTGCA TCTATTAGTT ACTATTTTAT GAATAAAATT GGAAAAGATC ACATTCAAAA 1560 

ATGCGAAAAT GTTATTAAAA ATGTCATGTA GGTGCTATAA TAGTTTTGCA ATTTGCAAAT 1620 

TTTACTGAAA CCGGTTTTAA ACGAATTGAA TTTAAAGcAT GGTTTTGGTA AAGTTAATGT 1680 

ATAAAACTAA GTTAGyATTG TAATAATATk GAAGATTCTA ACTATACGAA GGAGAAATGT 1740 

AATTATGCSAA CAAAATTCAT ATGTAATCAT CGACGAGAmT GGTATTCACG CTAGACCAGC 1800 

AACAATGTTA GTACAAACAG CTTCAAAATT CGATTCTGAT ATTCAATTAG AATATAACGG 1860 

TAAGAAAGTA AACTTAAAAT CAATCATGGG TGTTAT6AGC CTTGGTGTTG GTAAAGATGC 1920 

TGAAATTACA ATTTATGCTG ACGGTAGTGA TGAATCTGAC GCCATTCAAG CAATCAGTGA 1980 

CGTCTTATCA AAAGAAGGAT TGACTAAATA ATCATGTCTA AATTAATTAA AGGTATTGCC 2040 

TGrTAAAGCT TATTTATTAG TTGAGCCAGA CTTAACATTC 2100 

GACAAAAATG AAAAAGTCAC TGATGTTGAA GGAGAAGTTG CAAAGTTCAA TAGCGCTATC 2160 

GAAGCTTCTA AAGTTGAGTT AACTAAAATT AGAAATAATG CAGAGGTTCA ACTAGGTGCT 2220 

GATMAGCTG CTATCTTTGA TGCAcaTTGG GGGGTGGTAG ATGACCCTGA ATTAATTCAA 2280 

CCAATCCAAG ATAAGATTAA AAATGAAA 2308 



(2) INFORMATION FOR SEQ ID NO: 199: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5559 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199: 

AAGTAATAAA TCGTCTCATT TGGCAACTGA CGCATAATTT CTTTAGCTAC TGTCAAACCT 60 
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TTTAATTTTT AGTTTATCAT AACTAAGCAT TGGATTTTAG TATTATGCAC TGTGTTTACC 180 

ATTTTTGTCA TTATAATATT TATTTTAAAT CAGCCCACTA TCATATTGTC ATGTAATCTG 240 

CTTATTAAAA AAATCCCTTC CAAGTTATTG TGTATCTCCA TTCAATTTAA TTTTGAAAGG 300 

AACATAACWT TTTAACTCAA AAGGGATTAA TTTnTAnTCT ACTTCATGGT CTGAACCAAA 360 

GAATGATTTA AACATGTGGA ATGTTGTTTC TCTGTTCATT GCTGCAATGG ATGTTGTTAA 420 

TGGAATACCT TTAGGGCAAG CATTAACACA GTTTTGTGAA TTACCACACT GCTGTAAGCC 480 

ACCAGTACCC ATTAATGCAT TTAAACGTTC ATCTTTAGTC ATAGATCCTG TTGGGTGCAA 540 

ATTAAACAAA CGAACTTGCG AGATTGCTTG TGCACCAACG AaTTTATTAT TTTCAGTAAC 600 

ATTAGGACAA ACCTCTAAAC ATACACCACA TGTCATACAT TTAGATAATT CATAAGCTGT 660 

TTGACGTTTT TTCTCTGGCA TACGTGGTCC CGGACCTAAA TCATACGTTC CATCAATTGG 720 

GATCCATGCT TTCATACGTT TTAAGTTATC GAACATTCTA GAACGATCAA CTTGTAAGTC 780 

ACGGATAACT GGGAAAGTAT TCATTGGOTC TAAACGAATA GGTTGTTCTA ATTGATCAAC 840 

AATCGCAGAA CAAGATTGTC TTGCACX3ACC ATTGATAACC ATAGAACATG CTCCACATAC 900 

TTCTTCTAAG CAGTTCATAT CCCAGACAAC AGGTGTTGTT TTTTCACCTT TAATATTAAC 960 

TGGGTTACGT CTAATTTCCA TTAAACAAGC AATOACOTTT AAATTTTCAC GATATGGAAT 1020 

TTCAAATGTT TCTTCATAAG GCTTAGAATC ACTTGTATCT TGTCGTTTAA TAATTAATTT 1080 

TACTGTTTTT TGTTTCGGTT TAGATTGTGT TTCATGTTGT GGAGTCTTTT TCACTGATTG 1140 

TTCAGTCATT ATTTTTTACC CCCTTTAGAC TTACTTGTGT AATCACGTTT AGGAGGTGGT 1200 

ATTAAACTCA CATCGACGTC ATCATAAGTA AACTGCGGTT TTTCAAATGC GCCTTGGAAT 1260 

GAGGCCATTG TCGTTTTTAA CCACTCTTCA TCATTACGCT CTGGGAATTC TGGTTTATAA 1320 

TGGGCACCGC GTGATTCGTT ACGGTTATAT GCACCAATCG TAATAACACG TGCAAGTACT 1380 

AACA'TGTTCC ATAGTTGACG GGTAAAGAAT ACCGCTTGGT TACTCCAAGT TTGAGTATCT 1440 

TCCATATCAA TATCTTCATA ACGTTTCATC AATTCAACAA TCTTTTTATC TGTTTCTAAC 1500 

AGTTTTTCAT TTTCACGAAC AACAGTTACA TTTGCTGTCA TAATTTCACC AAGTTCACGG 1560 

TGTAATTTAT ATGCATTTTC TGTACCGCGC ATAGCTAATA ATTTATCAAA ACGTTCTTGC 1620 

TCTTCAGCTT TACGCTTTTC AAAAATACTT TCGTCCATAT CAGTATATGA TCGATCAATA 1680 

TTTGAAATAT AATCAATCGC GTTTGGACCT GCTACTGTAC CACCATAAAT CGCTGATAAC 1740 

AATGAATTGG CACCTAAGCG GTTACCACCA TGTTGAGAGA AGTCACATTC TCCAGCTCCA 1800 

AATAACCCTT TAATATTTGT CATTTGATCA TAATCTACAT ATAGACCACC CATTGAATAG 1860 






CGATCACCAA ACTGCATGTC GAACAATGTA ACATTTGGAT TCGATATTTA AAATTGCTTG 3780 

TGATGATAAA CTTTCTCATT TAGAAAACGC TTCCACGTAC ATTCAAAAAA ATAACTTTGT 3840 

TAACCATATT GTAACATTAT TTCATATATT TTGGGGCATG AGAATGATTC TCACGCCCAG 3900 

TAATTTATTT ATGCAATTGT TCATGTAGGT TCTTTGCGAC GTTTTCAGGA ATACCTATAT 3960 

TTTTAAAATC TTCAAGTGTA GCTTCCTTCA TTTTCTTGAT TGAACCGAAT GAACGCAATA 4020 

ATAATGTTTT ACGTTTGTTA CCGATACCAT CTATATCATC AAGTATTGAT TTCAAGCCTG 4080 

TCTTTTGACG TGTTTGTCTA TGAAATGTGA TTGCGAATCT GTGAACCTCA TCTTGGATAC 4140 

GGTGCAACAA ATAAAATGCC TGGCTATTTT TCTTCAGTGG TACAATTTCT GCACTAGCGC 4200 

CATATAATAA TTCAGATGTT TGGTGTTTAT CATTTTTCTG CAAACCTGCA ACAGGGATAT 4260 

CAAGACCTAA TTCGTTTTGT AGCACATCAA TAACCCCGTT CATATGTCCT TTACCACCAT 4320 

CTACTATTAT TAAATCAGGT AATGGTAATC CTTCGTTTAA AACGCGAGAA TATCGTCGTC 4380 

TTACTACTTC TCTCATTGAT TTGTAATCAT CTGGACCTTT AACCGTTTTG ATTTTATACT 4440 

TTCTATAATT TTTCTTATCT GGTTTACCGT CGACAAATGT AACCATTGCT 6ACACTGGAT 4500 

CCACACCTTG AATATTAGAA TTATCGAATG CTTCAATTCT AATTGGTGTT TGAATTCCCA 4560 

TTTGTGTTCC AAGTTCTTCA ATAGCTTTAA TCGTTCTGGA CTCATCACXST GATATTAATT 4620 

CAAATTTATT ATTTAAGGAT ACTTTAGCGT TATGTGCAGC TAGGTCAACC ATATCTTTTT 4680 

TGGGACCTCG CGCGGGTTGA ACGATTTTAG TGTCCACAAC AGATTGAATC ATTTCTTTAT 4740 

CCAAATTACG TGGTACATGA ACTTCCTTAG GTAAAATATG TTGGTTTAAG CTATAAAATT 4800 

GTCCAATAAA TGTATAAAAT TCTTCTTcTT CTGTTTGCTG TAATGGAATC ATCGTTGTAT 4860 

CTCX5CTTTAT CATATTACCT TGTCGTATAA AGAAAACTTG GATACACATC CATCCTTTAT 4920 

CAACACTATA ACCAAAGACA TCACGAATCX; TTTTATCTGA TGACATAATT TTTTGTTTGT 4980 

TTGtCAGATT TTGAATATGT TGAATTAAAT CTCTATATTC TTTAGCCCGT TCAAAATCAA 5040 

GTGATTCACT TGCAGTTAAC ATTCGCTCTT CTAAACTTTT TAAAATTGTT TTGTCTTCCC 5100 

CATTCAGAAA ATCAGTAATT TCCTTCX5TCA TTTGTGCGTA TTTACTCAAA TCAACGTCAT 5160 

ATACACATGG TCCTAAACAT TGTCCAATAT GGTAATAAAG ACATAATTTA TCTGGCATCT 5220 

TATCACATTT GCGATATGGA TATATTCTGT CTAATAACTT TTTAGTTTCT TGAGCAGAAT 5280 

ATGCATTCGG ATACGGTCCG AAATATTTGC CAGTACCTTG TTTTACAGTT CTCGTCACTA 5340 

GTAGTCTAGG ATATTTCTCC TTCGTAATTT TAATAAATGG ATAACTTTTA TCATCCTTTA 5400 

ATAATATATT ATATCTTGQT TGATATTGTT TAATCAGATT CAATTCCAGT AAAAGTGATT 5460 
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GTTTGACAAT GAAACTGTAA AATTAGGTGA AACGTTGTCT AAAGATTTAT ATGGTAAGGA 1440 

AATCAATGCC AGTGTATCCC GTTTTGAAGG TTATCAACAA TGCCCATTTA AACACTATGC 1500

^ GTCACATGGT CTGAAACTAA ATGAGCGAAC GAAGTATGAA CTTCAAAACT TTGATTTAGG 1560 

TGATATTTTC CATTCTGTTT TAAAATATAT ATCTGAACGT ATTAATGGCG ATTTTAAACA 1620 

ATTAGACCTG AAAAAAATAA GACAATTAAC GAATGAAGCA TTGGAAGAAA TTTTACCTAA 1680 

10 

AGTTCAGTTT AATTTATTAA ATTCTTCAGC TTACTATCGT TATTTATCAA GACGCATTGG 1740 

CGCTATTGTA GAAACAACAC TAAGCGCATT AAAATATCAA GGCACGTATT CAAAGTTTAT 1800 

GCCAAAACAT TTTGAGACAA GTTTTAGAAG GAAACCAAGA ACAAATGACG AATTAATTGC 1860 

IS 

ACAAACATTA ACGACAACTC AAGGTATTCC AATTAATATT AGAGGGCAAA TTGACCGTAT 1920 

CGATACGTAT ACAAAGAATG ATACAAGTTT TGTTAATATC ATTGACTATA AATCCTCTGA 1980 

2^ AGGTAGTGCG ACACTTGATT TAA06AAAGT ATATTATGGT ATGCAAATGC AAATGATGAC 2040 

ATACATGGAT ATCGTTTTAC AAAATAAACA ACGCCTTGGA TTAACAGATA TTGTGAAcCA 2100 

GGTGGaTTAT TATACTTCCA TGTACATGAA CCTAGAATTA AATTTAAATC ATGGTCTGAT 2160 

2S ATTGATGAAG ATAAACTAGA ACAAGATTTA ATTAAAAAGT TTAAGTTGAG TGGTTTAGTT 2220 

AATGCAGACC AAACTGTTAT TGATGCATTG GATATTCGTT TAGAACCTAA ATTCACTTCA 2280 

GATATTGTAC CAGTTGGTTT GAATAAAGAT GGCTCTTTGA GTAAACX3AGG CAGCCAAGTG 2340 

GCAGATGAAG CAACGATTTA TAAATTCATC CAACATAACA AAGAGAATTT TATAGAAACA 2400 

GCTTCAAATA TTATGGATGG ACATACTGAA GTTGCACCAT TAAAGTACAA ACAAAAATTG 2460 

CCATGTGCTT TTTGTAGTTA TCAATCGGTA TGTCATGTAG ATGGCATGAT TGATAGTAAG 2520 

CGATATCGAA CTGTAGATGA AACAATAAAT CCAATTGAAG CAATTCAAAA TATTAACATT 2580

AATGATGAAT TTGGGGGTGA GCAATAGATG ACAATTCCAG AGAAACCACA AGGCGTGATT 2640 

TGGACTGACG CGCAATGGCA AAGTATTTAC GCAACTGGAC AAGATGTACT TGTTGCAGCC 2700 

40 

GCGGCAGGTT CAGGTAAAAC AGCTGTACTA GTTGAGCGTA TTATCCAAAA GATTTTACGT 2760 

QATGGCATTG ATGTCGATCG ACTTTTAGTC GTAACGTTTA CAAACTTAAG CGCACGTGAA 2820

ATGAAGCATC GTGTAGACCA ACGTATTCAA GAGGCATCGA TTGCTGATCC TGCAAATGCA 2880 

45 

CACTTGAAAA ACCAACGCAT CAAAATTCAT CAAGCACAAA TATCTACACT CCATAGTTTT 2940 

TGCTTGAAAT TAATTCAACA GCATTATGAT GTATTAAATA TTGACCCGAA CTTTAGAACA 3000 

SO AGCAGTGAAG CTGAAAATAT TTTATTATTA GAACAAACGA TAGATGAGGT CATAGAACAA 3060 

CATTACGATA TCCTTGATCC TGCTTTTATT GAATTAACAO AGCAATTGTC TTCAGATAGA 3120 

55 
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AATCCTACAA ATTGGTTGGA TCAATTGGTG ACACCATACG AAGAAGAAGC ACAACAAGCG 3240

CAACTTATTC AACTACTAAC AGACTTATCT AAAGTATTTA TCACAGCTGC TTATGATGCT 3300

TTAAATAAGG CGTATGATTT GTTTAGTATG ATGGATAGCG TCGATAAACA TTTAGCTGTT 3360

ATAGAAGATG AACGACGTTT AATGGGGCGT GTTTTAGAAG GTGGCTTTAT TGATATACCT 3420

TATTTAACTG GTCACGAATT TGGCGCGCGT TTGCCTAATG TAACAGCGAA AATTAAAGAA 3480

GCAAATGAAA TGATGGTCGA TGCCTTAGAA GATGCTAAAC TTCAGTATAA AAAATATAAA 3540

TCATTAATTG ATAAAGTGAA GAGTGArrAC TTTTCAAGAG AAGCTGATGA TTTGAAAGCT 3600

GATATGCAAC AATTGGCX3CC ACGAGTAAAG TACCTTGCGC GTATTGTGAA AGATGTTATG 3660

TCAGAATTCA ATCGAAAAAA GCGTAGCAAA AATATTTTGG ATTTTTCTGA TTATGAACAT 3720

TTTGCATTAC AAATTTTAAC TAATGAGGAT GGTTCGCCTT CAGAAATTGC CGAATCATAC 3780

CGTCAACACT TCCAAGAAAT ATTGGTCGAT GAGTATCAAG ATACGAACCG AGTTCAAGAQ 3840

AAAATACTAT CTTGCATCAA AACGGGTGAT GAACATAATG GTAATTTATT TATGGTTGGA 3900

GATGTTAAGC AATCCATTTA TAAATTTAGA CAAGCTGATC CAAGTTTATT TATTGAAAAG 3960

TATCAACGCT TTACTATAGA TGGAGATGGC ACTGGACGTC GAATTGATTT GTCX3CAAAAC 4020

TTCCGTTCTC GAAAAGAAGT ACTGTCAACG ACTAACTATA TATTCAAACA TATGATGGAT 4080

GAACAAGTCG GTGAAGTAAA ATATOATGAA GCGGCACAGT TGTATTATGG TGCACCATAT 4140

GATGAATCGG ACCATCCaGT AAACTTAAAA GTCCTTGTTG AAGCGGATCA AGAACATAGT 4200

QATTTAACTG GTAGTGAACA AGAAGC6CAT TTTATAGTAG AACAAGTTAA AGATATCTTA 4260

GAACATCAAA AAGTTTATGA TATGAAAACA GGAAGCTATA GAAGTGCGAC ATACAAGGAT 4320

ATCGTTATTC TAGAACGCAG CTTTGGACAA GCTCGCAATT TACAACAAGC CTTTAAAAAT 4380

GAAGATATTC CATTCCATGT GAATAGTCGT GAAGGTTACT TTGAACAAAC AGAAGTCCGC 4440

TTAGTATTAT CATTTTTAAG AGCGATAGAT AATCCATTAC AAGATATTTA TTTAGTTGGG 4500

TTAATGCGCT CCGTTATATA TCAGTTCAAA GAAGAC6AAT TAGCTCAAAT TAGAATATTG 4560

AGTCCAAATO ATGACTACTT CTATCAATCG ATTG 4594



(2) INFORMATION FOR SEQ ID NO: 201: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6313 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

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10 






GGTTTTCnTG GAAAGATAGT GAAAATCTCG TGTTTTTTGG TTTTgAGGTG TTGTTTGTAT 60 

TTTaTAAAAT GGCTTACATA TATGAAGCGT TGATTAAGTA TGGAATTGTT AATTAATTGA 120 

ACCTATTTAG CTTTAAGAAG GCATAACAAG ATGACCTTAT TTTATGCTAT AATATTTCTA 180 

TTATGCGAAG ATTAAGGTGA GTAGTAAATT GGATAAAAAA GTAAGTATTC AAACAAAGCA 240 

AGTGTTGAAA CAGCACAACG AAAAAGAAAA ATTTGAATTT ACTACTGAAG GAACTTGGCA 300 

ACAAAGGCAA TCTAACTTTA TTCGGTATGT AGAACAAATT GAGGATGCAA CAGTTAATGT 360 

TACAATAAAA GTGGATGATG ATAGCGTTAA GTTGATTCGT AAAGGCGACA TTAATATGAA 420 

TTTGCATTTT GTTGAAGGAC AAACGACAAC AACmTTAC GATATATCGG CTGGACGAAT 480 

TCCACTAGAA GTTAAAACAT TACGCATTTT ACATTTCGTA AGTGGAGACG GTGGCAAGCT 540 

AAAGATTCAT TATGAATTAT ATCAAGATAA TGAAAAAAT6 GGTTCTTATC AATATGAAAT 600 

TAACTATAAQ GAGATAGGCG AATGAATATT ATTGATCAAG TGAAACAAAC ATTAGTAGAA 660 

GAAATTGCAO CAAGTATTAA CAAAGCAGGA TTAGCAGATG AGATTCCTGA TATTAAAATT 720 

GAAGTTCCTA AAGATACAAA AAATGGAGAT TATGCTACTA ATATTGCGAT GGTACTGACT 780 

AAGATTGCAA AGCGTAATCC TCGTGAAATT GCTCAAGCGA TTGTTGATAA CTTAGATACT 840

GAAAAAGCAC AT6TAAAACA AATTGACATT GCTGGTCCAG GATTCATTAA TTTTTACTTA 900 

GATAATCAGT ATTTAACAGC AATTATTCCT GAAGCAATTG AAAAAGGTGA TCAATTTGGA 960 

CATGTAAATG AATCAAAAGG TCAAAATGTA TTGCTTGAGT ATGTTTCAGC TAACCCTACA 1020

GGAGATTTAC ATATTGGTCA TGCTAGAAAT GCAGCAGTTG GTGATGCTTT AgcTAAtATT 1080 

TTAACTGCAG CTGGCTATAA TGTAACACGT GAATATTATA TTAATGATGC TGGTAATCAA 1140 

ATTACTAACT TAGCGCGTTC GATTGAAACA CGTTTCTTTG AAGCTTTAGG TGACAATAGT 1200 

TATTCAATGC CAGAAGATGG CTATAATGGA AAAGATATTA TTGAAATAGG TAAAGATTTA 1260 

GCAGAGAAAC ACCCTGAAAT TAAAGATTAT TCTGAAGAAG CACX3TTTGAA AGAATTTAGA 1320 

AAATTAGGCG TAGAATACX5A AATGGCTAAA TTGAAAAATG ATTTAGCAGA GTTCAATACG 1380 

CATTTTGATA ATTGGTTTAG TGAAaCATCT TTATATGAAA AAGGAGAAAT TCTTGAAGTT 1440 

TTAGCAAAAA TGAAAOAATT AGGTTATACG TATGAAGCTG ATGGCGCTAC ATGGTTACGT 1500 

ACAACTGATT TTAAAGACGA CAAAGACAGA GTATTAATTA AAAATGACGG TACATATACG 1560 

TATTTCTTAC CAGATATTGC GTACCACTTC GATAAAGTAA AACGTGGTAA TGACATTTTA 1620 

ATCGATTTAT TTGGTGCTGA TCATCATGGT TATATTAATC GTTTGAAAGC ATCTCTTGAA 1680

ACGTTTGGTG TAGATAGTAA TCGTTTAGAA ATTCAAATCA TGCAAATGQT TCGTTTAATG 1740 



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ATTATGGACG AaGTTGGCGT TGACGCTGCA CGTTATTTCT TAACTATGCG TAgTCCTGAT 1B60 

AGTCACTTTC ATTTTGATAT GGAATTAGCG AAAGAGCAAT CTCAAGACAA TCCAGTTTAC 1920 

^ TATGCTCAAT ATGCACATGC GCGTATTTGT TCAATTTTAA AACAAGCGAA AGAGCAAGGT 1980 

ATTGAAGTGA CTGCTGCGAA TGATTTTACA ACGATTACTA ATGAAAAAGC GATTGAATTG 2040 

TTGAAAAAAG TAGCTGATTT CGAACCTACA ATTGAAAGTG CTGCTGAGCA TAGATCGGCA 2100 

10 

CATAGAATTA CTAATTATAT TCAAGATTTA GCTTCTCATT TCCATAAATT CTATAATGCT 2160 

GAAAAAGTGT TAACAGATGA TATTGAAAAA ACAAAAGCAC ATGTTGCTAT GATTGAAGCG 2220 

GTCAGAATTA CATTGAAAAA TGCATTGGCA ATGGTCGGTG TAAGCGCACC TGAATCAATG 2280 

IS 

TAAGAACATT TATATACACT CCAACGTAGA GTTTCTCGAA AGATACTTTG TGTTGGAGTG 2340 

TTTTTTTTAG GTATGTGACA TATTGGGGAA TGCTTAGTAT GTGAATAAGG TTAA6AGGAA 2400 

^ CACAGTTGGA TGCTCTGCAC AACTGCATAA GAGAGCCTGA GACATAAATC AATGTTCTAT 2460 

GCTCTACAAA GTTATAATGG CAGTAGTTGA CTGAACGAAA ATTCGCTTGT AACAAGCTTT 2520 

TTTCAATTCT AGTCAACCTT GCCGGCGGG6 CCCCAACAAA GAGAAATTGG ATTCCCAATT 2580 

2S TCTACAGACA ATGCAAGTTG GGGTGGGACX3 AGGAAATAAA TTTTACGATA ATATCATTTC 2640 

TGTCCCACTC CCTCTAAAAT GGAGGGTGTA AATGTTAGGA ACTGATGAAT TATATAAAGT 2700 

TTTATATGAA CATCTCGGAC CACAATTTTG GTGGCCTGCT 6ATAATGACA TTGAAATGAT 2760 

^ GTTAGGTGCA ATTTTAGTTC AAAATACTAG ATGGCGAAAT GCAGAAATTG CATTGAATCA 2820 

GATTAAAGAA CATACGCATT TTAATCCAAA TCATATATTA GAACTACCTA TTGAAACGTT 2880 

ACAATCATTG ATACATTCAA GTGGCTTTTA TAAAAGTAAA TCACTGACGA TTAAAACATT 2940 

^ ATTAACATGG TTAGCACGAC ATCATTTCAA TTATCAAGAG ATTAATGAGC GATATAAAGG 3000 

TGGATTAAGA AAAGAATTAT TATCTTTGAA AGGTATTGGA AGTGAAACAG CAGATGTCTT 3060 

ACTTGTTTAT ATATTCGGAC GTATTGAATT TATTCCAGAT AGCTATACAA GAAAAATATA 3120 

40 

TGATAAATTA GGATATGAAA ACACTAAAAA TTATGATCAA TTAAAAAAAG TAGTCaCATT 3180 

ACCAAATCAT TTTACAAATC AAGATGCTAA TGAATTTCAT GCTCTGTTAG ATGTATTTGG 3240 

TAAACATTAC TTTAGAGACA AAGATATAAA GAATTATGAT TTTTTAGAAC CTTACTTTAA 3300 

45 

AAAGTAAACG CTGTGAAGTT AGATAGATGA GTTTATATGA AATATAAAAA ATAATTTACT 3360 

ATTTTCTTTT AGTATGTGGA CTTATATAAT AAATAGAAGC ATATAAAGAA AAAAACAGTT 3420 

SO GTTTGTTTGT GCAGCAACTG CATAAGAGCC CCTAATCGCT AAAGCTCAAG GGGAGTAAAG 3480 

GAATACAGTT GTTTGTGCAG CAACTGCATA AAAGCCTCTA ATCACTAAAG GTGAAGAGGA 3540 

55 
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AACGCAGTTG GATgCTACCG CACAACTGCA TAAATCCCTC TaATCgcTAA AGCGAAAAGT 3660 

GGGATTAAAA AGGAGATGTG ATAGTGTGAA QAAATCGTTA ATTGCTTTTA TTTTGATTTT 3720 

TATGCTTGTC CTGAGTGGCT GTGGTATGAA AGATAATGAT AAACAAGGTA GCAATGATAA 3780 

TGGCTCGTCT AAATCGCCGT ACCATAGAAT TGTTTCGTTA ATGCCTAGTA ATACTGAAAT 3840 

TTTATATGAA TTAGGATTAG GTAAATACAT AGTTGGTGTT TCAACGGTTG ATGATTATCC 3900 

AAAAGATGTG AAAAAGGGTA AGAAACAATT TGATGCTTTG AATCTAAATA AAGAGGAACT 3960

TTTAAAGGCA AAGCCAGATC TAATTCTTGC GCATGAGTCG CAAAAGGCAA CTGCTAATAA 4020 

AGTATTGTCA TCATTAGAGA AACAAGGCAT CAAAGTAGTG TATGTTAAAG ATGCACAATC 4080 

AATTGATGAA ACTTACAACA CATTTAAGCA AATTGGGAAA TTAACGCATC ATGATAAGCA 4140 

GGCTGAACAA CTTGTTGAGG AAACTAAAGA TAATATCGAT AAAGTCATAG ATTCAATTCC 4200 

TGCTCATCAT AAAAAATCAA AAGTATTTAT TGAGGTTTCA TCAAAGCCTG AAATATATAC 4260 

AGCAGGGAAG CATACATTTT TTAATGATAT GTTAGAAAAA TTAGAAGCCC AAAATGTGTA 4320 

TAGTGACATT AATGGTTGGA ACCCTGTAAC GAAGGAAAGT ATTATTAAAA AGAACCCAGA 4380 

25 TATATTAATT TCGACGGAAG CTAAGACAAG ATCAGATTAT ATQGATATCA TCAAAAAAAG 4440 

AGGTGGATTC AATAAAATTA ATGCTGTCAA GAATACACX3T ATTGAAGTTG TAAATGGTGA 4500 

TGAAGTATCA AGACCAGGTC CACGTATTGA TGAAGGATTA AAAGAATTAA GAGATGCAAT 4560 

30 TTATAGAAAA TAAACCATTC TAATTATGCC CCTTATTGCT ACATGTAAAA AATACATGTT 4620 

TGAGATAAGG GGTTTTTaAA ATATATTTAG TGAATGATAG CAACGCGAGT ATGTGATTGC 4680 

TATAATGAAT GTAATTATCG ATGAAcaaAA GAGAATGCTA TGACATTTAA TAAAGTATTA 4740 

TTGAGCTGGa TAGTCmTATT GATTATAACA ACTAGCATAT ATCTATTTTG GCAGTTGGGC 4800 

GATJ&CAATG ATGTATTTAA CCAOTCTATT TTAATCAATG TTAGATTACC GAGATTATTA 4860 

GAAGCATTGT TGACAGGTAT GATATTAACT GTTGCAGGCC TTATATTTCA AACAGTTTTA 4920 

AATAATGCAT TGGCAGATAG CTTTACATTA GGATTGGCAA GCGGCGCTAC ATTTGGTTCA 4980 

GGATTAGCAT TATTTTTAGG TTTAACAACG TTATGGATTC CTGTATTTTC AATAACATTT 5040 

AGTTTGATAA CATTAATAAC TGTATTAGTC ATTACGTCGG TATTGAGCCA AGGCTATCCA 5100 

GTTAGAATCT TAATATTAAG TGGTTTAATG ATTGGTGCGT TATTCAATTC ACTTCTATAT 5160 

TTTTTGATTT TATTAAAACC TCGCAAATTA AATACAATTG CCAATTATCT GTTTGGTGGT 5220 

50 TTTGGTGATG CAGAATACTC AAATGTATCT ATAATAGCAA TCACATTTAT CATTGCATTG 5280 

TTTQGTATAT TTATCATTCT TAATCAACTA AAGTTATTGC AATTAGGAGA ACTAAAAAGT 5340 



SS 



40 
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IS 



ATAA.CX3GCGA TAAATGTCGC ATATGTTGGC ATCATTGGAT TCATTGGTAT GGTGATACCG 5460

CAACTCATTA GAAAATGGCA GTGGAAACAA TCATTAGGAA GACAATTGGC TTTAAATATT 5520

GTAACTGGAG GACAAATAAT GGTTATGGCA GATTTTATTG GTAGCCATAT ATTGTCACCA 5580

GTACAAATAC CGGCAAGTAT TATCATTGCA TTAATTGGTA TACCAGTGTT AtTTTACaTG 5640

CkAAwAtCtC aGTCgAAAcG GTTACaCTAG CACACGACaT TTGCTAAAAT AAAAATAACT 5700

ATAAACATAA AGAGGGCATA AGCGATGGAT TTGAATCAAA TTAAAGCAGT TGTATTTGAT 5760

TTAGAAGGTA CGTTGTTGGA CAGAGTTAAA TCTCGAGAGA AATTTATCGA AGAGCAATAT 5820

GAACGATTTC ATGACTACTT AATTCATGTT CAACTGGCAG ATTTTAAAAA AgCATTTATT 5880

GAGCTAGATG ACGATGAAGA TAATGATAAA CCTGATTTAT ATAAAGAAAT CATTAAACGT 5940

TTCCATGTAG ATAGGTTAAC TTGGAAAGAC TTATTTAATG ATTTTGAAAT GCATTTTTAT 6000

CGTTATGTAT TTCCTTATTA CGATACTTTG TATACACTAG AAAAgCTATC GCAAAAAGGC 6060

TTTCAAATTG GTGTTATCGC AAATGGTAAA TCTAAGATTA AACAATTTCG ATTACATTCA 6120

CTTGGTTTGA TGCATGTTAT TAATTATTTA TCAACATCAG AAACAGTTGG ITTTCGTAAA 6180

CCACATCCTA AAATTTTTGA AGATATGATT GATCAACTAG GGGTATTACC TGAGCAAAT7 6240

ATGTATGTTG GCX^TGATGC GTTAAATGAT GTAGCTCCAG CACGAGCTAT GGGCATGGTT 6300

AGTGTATGGT ATA 6313



(2) INFORMATION FOR SEQ ID NO: 202:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2174 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 202:



CCGTAAACAC ATCAACAAAA GAAGGCTATA TTACAAAAGA AGACTTGGAC TTATGCTGCA 60

CGTCGCTCTA ATTCAGCTGG AATGCAAGTC ACCGGACGAC TGGCTTACAT TGAACCTTAT 120

GGGGCAACAA GTCGCACAAA ATAAACGCGC GAGAAGCaAG AATAGGAAGT GATATCTATG 180

AAATGGTTAT CACGAATATT AACAGTAATA GTGACCATGT CtATGGcGTG TGGTGCATTG 240

ATATTTAATC GTAGACATCA GCTAAAGGCG AAAACGCTGA ACTTCAATCA TAAAGCATTA 300

ACAATTATTA TTCCGGCTAG AAACGAAGAA AAAAGAATAG GTCATTTACT ACATTCGATA 360

ATACAACAGC AAGTTCCAGT AGATGTCATT GTTATGAATG ACGGATCGAC AGATGAAACA 420
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AAATGGTATG GGAAATCACA TGCTTGTTAT CAAGGTGTGA CX5CATGCATG TACGAATCGC 540 

ATTGCCTTTG TAGATGCTGA TGTAACTTTC TTAAGGAAAG ATGCTGTTGA AACGTT6ATT 600 

AATCAGTATC AATTACAAGG TGAAAAAGGA TTGTTAAGCG TACAGCCTTA TCATATAACA 660 

AAGCGTTTCT ACGAAGGGTT TTCAGCGATA TTTAATTTAA TGACAGTCGT TGGTATGAAT 720 

GTATTTTCTA CCTTAGACGA CGGTCGGACT AACCAGCATG CATTTGGACC GGTGACATTA 780 

ACAAATAAAG AAGATTATTA TGCAACTGGA GGTCATAAAA GTGCAAACCG TCATATTATT 840 

GAAGGATTTG CTTTAGGAAG TGCATATACT TCACAATCAT TGCCCGTAAC AGTTTATGAA 900 

GGGTTTCCAT TTGTTGCATT TCGCATGTAT CAAGAAGGAT TTCAGTCATT ACAAGAAGGA 960 

TGGACAAAGC ATTT6TCAAC TGGGGCAGOT GGCACAAAGC CTAAGATCAT GACAGCAATT 1020 

GTCTTGTGGT TGTTTGGTTC TATAGCGAGT ATTTTAGGGC TATGTCTTAG TTTAAAATAT 1080 

CGCCAAATGT CTGTAAGAAA AATGGTAGCA CTTTACTTGA GCTATACTAC ACAATTTATT 1140 

TATCTGCATC GAAGGGTCGG CCAATTTTCT AATTTATTAA TGGTATGTCA TCCATTGTTA 1200 

TTTATGTTTT TTACTAAAAT TTTCATCCAA TCTTGGAAAC AAACGCATCG TTATGGTGTA 1260 

25 GTTGAATGGA AAGGTCGTCA ATATTCTATA TCTAAAGAAC AATAAATCAA GGTAATGGCA 1320 

TTTCAATATA GGAGGACTAG TATGACAATG ATGGATATGA ATTTTAAATA TTGTCATAAA 1380 

ATCATGAAGA AACATTCAAA AAGCTTTTCT TACGCTTTTG ACTTGTTACC AGAAGATCAA 1440 

AGAAAAGCGG TTTGGGCAAT TTATGCTGTG TGTCX3TAAAA TTGATGACAG TATAGATGTT 1500 

TATGGCGATA TTCAATTTTT AAATCAAATA AAAGAAGATA TACAATCTAT TGAAAAATAC 1560 

CCATATGAAC ATCATCACTT TCAAAGTGAT CGTAGAATCA TGATGGCGCT TCAgCATGTT 1620 

GCACAACATA AAAATATCGC CTTTCAATCT TTTTATAATC TCATTGATAC TGTATATAAA 1680 

GATCAACATT TTACAATGTT TGAAACGGAC GCTGAATTAT TCGGATATTG TTATGGTGTT 1740 

GCTGGTACAg TAGGTGAAGT ATTGACGCCG ATTTTAAGTG ATCATGAAAC ACATCAGACA 1800 

TACGATGTCG CAAGAAGACT TGGTGAATCG TTGCAATTGA TTAATATATT AAGAGATGTC 1860 

GGTGAAGATT TTGACAATGA ACGGATATAT TTTAGTAAGC AACGATTAAA GCAATATGAA 1920 

GTTGATATTG CTGAAGTGTA CCAAAATGGT GTTAATAATC ATTATATTGA CTTATGGGAA 1980 

TATTATGCAG CTATCGCAGA AAAAGATTTT CAAGATGTTA TGGATCAAAT CAAAGTATTT 2040 

AGTATTGAAG CACAACCAAT CATAGAATTA GCAGCACGTA TATATATTGA AATACTGGAC 2100 

SO GAaGTGAGaC AGGCTAACTA TACATTACAT GAAC6TGTTT TTCTGCaTAA GAGGAAAAAG 2160 

GCAAAGTTGT TTCA 2174 



SS 
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ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4715 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 203:

GAAnCAGnTA GACAAATTAT GGaAAmCGGT GTGAATCaAG GATTCtTTGG TGTAGCTGGT 60 

TTTGACCTAC TCGTCGATGA GGATGATAAC GTTTATGCGA TTGATTTAAA CTTTAGACAA 120 

AATGGTTCaA CGAGCATGTT ATTACTTGCT AACGAGTTGA ATTCAGGATA TCAAAAGTTT 180 

TATAGTTATC ATTCAAAAGG TGATAACACA CATTTCTTCA ATACGATTTT GAAATATGTC 240 

AAAGAAGGTA GTTTATACCC GTTATCTTAT TATGATGGTG ATTGGTACGO TGAAGATAAA 300 

GTTAAATCAA GGTTTGGCTG TATTTGGCAT GGTGATTCAA AAGAAACAGT ACTGGAGAAT 360 

GAAOGCGCAT TTTTAGCTGA ACTTGAACAC TATTAGAGTT CGGAACATAA GGCGCTACAA 420 

TGTTGTGTTG CCAGTAGTTG ACTGAATATG CGTTTGTAAC AAGCTTTTTT CGATTCTAGT 480 

2s CAACAGTAAT TAAATTTATG ATATGGCAAT ACTTTGTAAT ACTAATATTA AATGGCGACT 540 

TTTATTTCAC TATGTTATAA GAGTTGCCAT TTTGTTGATA AAGGTATACT AAAGGTTATC 600 

GTTTTGAAAT TTTTAGTAAC TAGATATGTT TCGTGTTATA GACC6AATTT GTGTATACGT 660 

30 AAAATTTAAT GCTATTGAAT TTTTAAAATG AAAAACATGA CATTAAATTG AATTCATAAT 720 

ATGTCTAATT GACTAACTTG TTGGAGTCAT TTACTATTTT ATGTATGACA TATTTTAAAA 780 

AGTGAGGGTC AAGCATGTCT TATAAAGCAT ATCCATTCTT TAGAGATATA TTAATAAATG 840 

AATGTATTTA TTTCGCCTCT AAAAATAAAA AACTAGTACG CCTAAATTAT AAAAGTGAAG 900 

CGnAtGTAGG CGTTTGGACA GAAGAAAGTG TGGCCGTATC ATTTTTAACA AGTCGTGATA 960 

TTCCATTTGA TAAAGTTGTA AAAATGGACG TTGATOGTTT TGCTACTTAT GAATTAGATG 1020 

AATTGTTTGA TGAACAAGAC CATATTATTA TGAATCAAAC AATGGAAGAw GAAGGGCATC 1080 

TACTAAACGT TGTAGCTGTT ACACAAGAAG TGATGACGGA ATTAGATAAA ATTAGAATCA 1140 

AAGAATTTGT CCAAGATGTA GCGAAATATG ATGAAGTATA CGGCTTAACT AAAAAAGGTA 1200 

GTAAGCAGTT TATTCTCATT AGTGAAAATG ATAGCGACGA AAAAAAGCCG CATATTATGC 1260 

CTGTATGGAG TATTAAAAAC AGAGCGTTAA AAGTTCGAGA TGAAGATTTT GAAGAGTGTG 1320 

SO ATTTAATTAC GATTGAAGGT TCTGTTTTCG GAGAATGGCT AGATGAACTT AGAGATGATC 1380 

ATAAAGCCGT TGCGATAGAT TTAAAAACTG GCGTQGTTGO TACAATTOTT TCAGCGCAAA 1440 
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ATGGAACAAT ACGTATTCAA AACACTTAGA 
TTATTTAAAA CAACGCGCAT ATAAATGGTC 
^ ATAGCACCTG nA?U^TGCGC CGTTTTCAAC 

ATAATCTTCC ACAACTTTGC GTAATAAAGC 
GATATTTTCA GTTTTAAATT CACGTGCAAC 

10 

AACGACACCA ATAACGGCTG CTAATTTATT 
TAGTACATGA CCAAAATTAG CTGCTGTTAA 
ATAAATATGT CTAACCTTTA AATCGATAGT 

IS 

TAACTGTTTA TAATCAGTGA TTTGACTTAG 
ACCTGTACCO ATACCGCCTA CACGACGTTG 

2^ ACCGGTACCA ACATTTGCAA AAATATAATC 

AATACCTAGT CCTTGAGATG CAGCATCAAA 
GTTTTCAGCA ATCACACCTG CATTACCTCC 

2S GTTTAACCAT TCCACAACTT GATCAATATT 

GTTATCTTGC TCTTGAACGA TTTTAATTAG 
AAGATTCCCA CCTCATTATT AATGTCTATC 

^ AAACAAGTAA TAATAGTAAT TATTAACAAA 

CGTATCGCAA TTAAtGTTTT ACAAACGTGG 
ATTTTGAAGT TGTATAGGAT TGTTAGTTAG 

^ GTGTATTTGT AAGTCTCTGA TTAAAATGAT 

AATn*AAAAA ATATAAATAA GTAATTTATT 
AAATAATAGG TAAAATAATA TGGGGATTCA 

40 

TATATGGCTA AAGAACTTTG TTTTGAAGGT 
CGTTCAGCAA TTAATGATTT TGACTTGAAT 

^. AAAGAAGTTA TTGATGATGC AATTAATGAT 

GATAAAAATG AAGTGGTGGG CTTTTTrGTA 
GATACACCTG AAAATGTCGT TTATATTCGT 

SO TTTGGATATG GCACGAAAAT AATGATGTCA 

GATTTTAATC ATCTATATCT AGTAGTAOAT 

55 



CCATAAAATA AAAGGCCATT TATATAGCGT 1560 

TTTTTCTATT TTTCTAAATA TAATGCACCA 1620 

ATAGTACGGT TTGCAACCGC GTAACACAGT 1680 

GTTATTATGA AATGAAGAAC CGATATAAAC 1740

AGTAATGGCC ATTGTCGTAA CAACTTCGCC 1800 

GCTAGGTGTA AAATCAGCAT CTAAATGATG 1860

ATCACCGGGA ATGGGTGGTT CGGTATCTTT 1920 

6TTACGATCA CCGTGTTGTG CCATGTCAGT 1980 

TAAATAACCG AGTCCTTGAA TCATGCCTCC 2040 

TGATTGGCCG TCGAAATAAT GTAGTGACGT 2100 

TGCTAAGTCA TGGCCTTGCT CTTTTAACAA 2160 

CTCTACAAAA ATTTGTGCAG GAATGTTGAT 2220 

AGTTAAGCAT AATTTTTCAA TTTGCTGTTG 2280 

TTTAGTTAAT TCAGTTTTAA AAGTACGTTG 2340 

TGTACCGCCA GCX3TCAATGC CAACTTTCAT 2400 

CTTAAATAAT AGTATAGTAA AATGACTAAA 2460 

TTTGATGCCa TTGCATTTCA ACATTGTAAG 2520 

ACGTTAAGTt ATATATATTA TTTTCTAGGA 2580 

TGACGCAATA TTAAAAGTAG TTCGTACGCA 2640 

AAGTAATGAG GAATAGTACA TTAATTTTGA 2700 

TAACTTAGAG CAAATAATGG TATCGTAGTG 2760 

TGCTTCATAT ATAAAAAGAT AGGGGTTAAA 2820 

ATCACTTTAA AAGCATTTGA TGAACAATAT 2880 

GAAAGACAAC AAATATATTC ATCTTTACCT 2940 

GCTGATAGGA TTGCTAACGT AGCAwTAAmC 3000 

TTACATCGTT ACTATCAGCA TGAAGGT^AT 3060 

TCATTATCGA TTAATGAAAA ATATCAAGGT 3120 

TTGCCGCAAT ATGTTCAAGG TGTATTTCCT 3180 

GCGGAAAATG ACAATGCTTG GAACCTATAC 3240 
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CTATATTACT TGGACTTAGA TTCAAAACAT GTTTCATCAT TAAAGCTTGA AGAAGAAAGT 3360 

CX3TTCAGAAG TGACCAATGT ACATATCATT AATTTAATGA TTGATGGCCA AAAGGTTGGC 3420 

5 

TTTATCGCAT TGGAGCAGAT TGGTGAACGC ATGAACATTG CTGCTATTGA AGTGGATAAA 3480 

TCATATCGCT TTAATGGTAT TGGTTCAAGT GCTCTGCGAC AATTGCCAAC TTACTTAAGA 3540 

AAAAACTATG ACAACCTTAA TGTGATTACG ATGATTCTGT TTGGAGAGAA TAATGATTTT 3600 

10 

AAACCATTAT GTTTAAATAG TAATTTCGTT GAAATCGAAC AAACTGATGA TTATGTCGTT 3660 

TTCGAAAAAT ATTTAAATTA CTAACAGTGA TTGCGAAATA TGATATTGTC ATTTATAATT 3720 

TAGTTTTGTT ACTATATATA AATGAATTCA GACGTATAAA TTTAGATTAT ATCCTTOQAA 3780 

AGGAAQTATT GGGCAATGAA AATTCAAGAT TATACAAAAC AAATGGTTGA TGAAAAATCA 3840 

TTTATTGATA TGGCTTATAC ATTATTGAAT GATAAAGGCG AAACAATGAm (rtlTATATGAT 3900 

20 ATyATCGATG AATTTAGAGC GTTAGGTGAT TATGAGTACG AAGAAATTGA AAATCGTGTT 3960 

GTACAATTTT ACACGGATTT AAACACAGAT 0GTCX3TTTTT TAAATGTTGG AGAAAATTTA 4020 

TGGGGATTAC GTGATTGGTA TTCGGTAGAT GATATIGAAG AGAAAATCGC ACCAACTATT 4080 

2S CAAAAATTCG ATATTCTGGA TGCAGATGAT GAAGAAGATC AAAACTTAAA ATTATTGGGC 4140 

GAAGATGAAA TGGATGACGA CGATGATATT CCAGCTCAAA CAGATGATCA AGAAGAACTA 4200 

AATGATCCAG AAGAT6AGCA GGTTGAAGAA GAAATCAATC ATTCGGATAT AGTCATTGAA 4260 

30 

GAAGATGAAG ATGAACTAGA CGAAGACGAA GAAGTGTTTG AAGACGAAGA AGACTTCAAC 4320 

GATTAATTTT TTGTTTGACT TTTAGTTGAA AGATGATAAA ATTTTATTCG GGCTCCTTTA 4380 

AATAGGACAC GTGTATAAAA TTTATACGCT CCCCTTACAG AATTTGTGAG AGGGAGCGTT 4440 

35 

TTTTtATTTA ATTGAGTAAA TCAAGAAATG ATAACGCAAA AATCAAAGTT GTAAATGATA 4500 

TAO^AGTGA CATAGCAGTA TGGAAACXX3T AAGTAAACAG AATTTAATTT TGTCGAtTCG 4560 

ACAAtAAaCA aCTtGAaTGA GCTTGCTTTA ATGTTATGTn nTACGTAATT TTTACAATTG 4620 

40 

ATGAGGAAGC ATTCCCTTTA ATAATTAGGA GGTCAAGACA TGACAAAATT TATmTGTA 4680

ACAGGTGGCG TAGTTTCATC CATTAGGGGA AGGGT 4715 
(2) INFORMATION FOR SEQ ID NO: 204:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 918 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear
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25 



ATAATAACTG AAATTAAAAT TGCTAAATmG TGTTaAgCTA TCGCmACAAT GAAAATwCCG 60

ATTTTGCGTT GTTGAAAATA TCTTTCCAAA CCAAGAATCG ATAATGGCAA TAAATATAAT 120

AAATTTCCAT AAAATGACCA AGTAAAATTA AAGTATATAA CGACAGTTGA CATGCCGTAT 180

AAAATCGTAG CGATCATATT TGCTGAGCGT TTAAAGTGTA ATATTTTAAA TAAGTAGAAG 240

GTCACGACAA ATGTTATGAT AGCTCGTATC ATAGCCATAA TAAGTTGGTT TGTCGGCCAA 300

AAATGTATTQ TCGTCGGATT AAATATACCA ACCGTTTCTC CTATTTTAAT GAAkAGAAAA 360

TTTAGCCACA TTAAAGGTGA CAGCGAATAA TAATnTGATA GTCCTTTCAT ATAATCGCCA 420

CCTAmTCCAA ACGATGCATC ATrTAAACTA GAAnAACTAC GTAGATGTTC ATACAnATAC 480

ATTTGAAATG GCATCATTTG ACGGAATCCA TCTCCAGCCC CGCTAAAAAC AGTACCATTC 540

ACAATATAAT CATAGATATG AGTAGAAAAT AAAATAAGCG TTAATATTAC ACTAATGAAA 600

GTTATAACAA AGAATTGTTT GACGTTTGAA TTTAGCCACT TTTTTAACAC AACATTATCC 660

TCAACTTTCA AATTTAAAAT TAAGTTTAAC TGAAACTAAA GTTAATGAGG TTCTTGATAG 720

GTAAAGACGA AGATGACTGT GGAACAGATA CCTTATCATA GTTACTTAAA CTTTGGATCA 780

TTTTCAGTTT ATCATTAAAC AAATATATTO AATAATAAAa aTGTCATACT GATAAAGATG 840

AATGTCACTT AATAAGTAAC TTAGaTTTAA CAAATGATGA TTTTTAATTG TAGAAAACTT 900

GAAATAATCA CkTATACC 918



(2) INFORMATION FOR SEQ ID NO: 205: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16397 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205: 

TCGCCCnATA ATCAATTTAT TTTTCATGTG CCACTCCTAT ACAAGCTnAC AATGCTTCTT 60

CAGTTAAGGC AATATCTTTT AATTTTGTTT GATATTTTTG TTCAAAGTCA TATTGTAACT 120

GAACAATTTC TGGCAAACCA ATATGCCAAT CCGCCAATTT TTTTTTAyCT TtGAAGAGCT 180

CTTTTGGTGA TGkTTGcGAC ACTATACTAC CTTCTTTCAT AACGATGACT TCATCTGCAT 240

AACGCGOGAC TTCATTCATA TCATGTGAAA TTAGGATAAT TGCCTTATTT TCATCTGTTT 300

GTAGTOACTT TAGTAATCTC ATTACTTGTC GTTTACTTTG TGGATCAAGT CCTGCTGTAG 360

GTTCATCAAC CACQATAATA TCAGGATTCA TTGCCAATAT CGATACAATC GCTATTTTAC 420
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AATCCATCAA CAGACGATGG GCATAGTTTT 
TAGGTCCAAA TATCATTTCA CGCTCTACTG 
^ GAAATACCAT TCCAATTCTT TTTCTTACAG 

TAATAGTAAT GTCATCAACT GTAACTGTCC 
TTTGTATCAA CGTTGATTTA CCACTACCCG 

10 

GTTCAAATTC TGTATTAACA TCATGAATAG 
AGGTATAACT TACATTGTCA AACCGTATAG 
TAAGAATGAT GTTTGGTGTC CCAGCATTTG 

IS 

ACCTATTCTT GTTAACTCTT CTGCATTGTC 
GACAGTCCCT TTATTCATAA CGATAACATG 

^ C3GTAATAGAA ATGATTGTAA TATTATGTTC 
ATTTTGACGT GCATCAGGAT CTAACATAGA 
GTTAAGTGCT AATACACTTG CTATAGCCAC 

25 AGGTTCATAA TCTGCACGTT CTAACATATC 
ATGCATTTCG TCATATGGAA CCGCATGATT 
AATTGAACCA ACAAATTGAT TATCCGGATT 

30 CTTTTCAAAA TTATCATCAG TTATAGCTTG 

TTTCTCTATG CCAATCATTA ACTTGGcAAT 

AATAGATGTC CACTGACCTT TAGGTATATT 

35 

ATQ\CmGA TATTGAAATG AAACATTTTT 

TTGTCTCTCT CCTTTACGAT T06TGTATCT 

TATATGACAT TGACTGGGTT CTCTATATAT 

40 

TTAAAGTGTT GTTTAACAAC TAATGATAAG 
AAACGTTAAT AAAATAAATG ATTTACTAAT 
ATACTTTCAC TTCATTCAGT TCTAGTGAAA 

45 

ATTCGCACTC AAAAAAGTAA ATATAAAGAA 
TCCGATATTT TATTCAATAA AAAAGCGCGC 
TTAAATCTTT ATTTAGTTGA TGGGGTACTC 
ATTAtCGTTG CACTCATTTG CTTTATATAA 



TGGCTTCATC TAAATTCATT TTAAAGTTTT 540 

TGTCCTCAAA TAATTGAGAT TCGGGAAATT 600 

GTCTAATATA TTTATCTTTG GTCTTATGTG 660 

CAGTAGTCGG CTTTAACAGC GCATTAATAT 720 

TTTGTCCAAC GATGGCGTAA TATTTACCTT 780 

CTTGATGCTG ATATGGTGTC CCTTTTTGAT 840 

TCATAGTTGA TCCACCAGCC CTTCATAAGT 900 

ATTTATTTTG ATTGGGAATG GCAAATCTAG 960 

GAAAATTTCA GTCGCTGTGC CTTCTTTATA 1020 

ATCTGCTTCC ATCGCCTCAG ATAAATCATG 1080 

TGATTTAACT TTTCTCACTA AATCCAATAA 1140 

AGTCGCCTCA TCTAATATAA TGACAGAGGG 1200 

ACGCTGCTTC TGTCCCCCCO ATAATGCATT 1260 

AACTTGTTTA AGTGCTTCGC TGACTCTTCT 1320 

TTCGAGTCCA AATGCCACAT CGTATTTTAC 1380 

CTGAAATACA ATTCCTATGT CTTTTCTTAA 1440 

ATTATTATAA AAAATTTCTC CAGATTTAAC 1500

TGTAGATTTT CCAGAACCGT TATGACCAAC 1560 

AAAAGAAACA TCTTTCAATG TGAAGGATGC 1620 

AAATACAATA ACTGAATTCT TATCCTCCAC 1680 

ATCATATTTT ACAATATTTA TAAATCGCTG 1740 

TACTAGTATT TTCTGACTCA TTTCTAGTCT 1800 

GACTTTTATT CCTCTCTAAC AATTATGTAT 1860 

ATAGGGGTGG TCGCGTTTGA TTCAACGATA 1920 

TTGATCAAAC TAGCTTCATC ATATTTTTAG 1980 

ATCGGACTTA AAAACATTTC TGTTCATAAG 2040 

ACCCCATCAT AAGTTTGTTG AGTTCACGCT 2100 

TGAGCTAGAC AATATTTGTA TGTGGCAAAC 2160 

AAGTAGTTAG TGTATTTATA TAAATTCTTA 2220 
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ACGAGTGTAA CCACCTTGAC GTTCTGTGTA AcGCTCTGCQ ATTTCACCAA ATAATTTTTG 2340 

AAGTGCAGTT TGTGTAGTTT CATCTTCGTT TAAGATTTCA ACATTACGTA AAGTTTTAGC 2400 

TGCATTACGA CGAGAAGCTA AATCTCCTTT TTTACCTAAA GTGATTAATT TCTCAACAAC 2460 

ACTGOGAACT TCTTTTGCAC GAGCTTCTGT AGTTTCAATA CGTTCACTAA TAATAAGTGA 2520 

TGTAGCTAAG TCACGTAACA TAGCTTTACG TTGATCAGAA GTACGACCTA ATTTTCTGTA 2580 

ACCCATGAGT TAACCTCCTT TATCAATCTT CTTTTCTTAA TCCTAATCCT AAATCTTCTA 2640 

ATTTGTATTT AACTTCTTCT AAAGATTTAC GACCTAAATT ACGCACTTTC ATCATGTCAG 2700 

CTTCAGATTT GTCAGCTAAC TCTTGAACAG AATTGATTCC TGCGCGTTTT AAGCAGTTAT 2760 

ATGAACGTAC AGATAAGTCT AATTCTTCAA TAGACATTTC TAATACTTTT TCTTTTTGAT 2820 

CTTCTTCTTT TTCAATCATG ATTTCAGCGT TTTGCGCTTC ATCAGTAAGA CCAACGAAGA 2880 

TATTCAAGTG TTCAGTCATT ATTTTTGCTG CTAATGAAAC TGATTCTTGT GGTGTGATTG 2940 

AACCATTAGT CCAAACATCC AATGTTAATT TATCAAAATC ACTGCTTTGA CCTACACGTG 3000 

TATTTTCAAC AGTATAGTTC ACACGTTCAA CAGGTGAATA CAATGAATCA ACAGGGATTA 3060 

2s CACCAATTGG TAAATCACTA GTATTATTTT GTTCTGCTAA TGCX3TAACCT CTACCCTTGT 3120 

TAGCAACTAG ACGAATTTTT AAGTGACCAC CTTTAGATAC TGTTGCAATT TTAAGCTCTG 3180 

GGTTTAAAAT TTCAACATCA CTATCATGTG TAATGTCGCT TGCTGTTACT TCXKrCTTCAT 3240 

30 CACGTACATC AATTTCTAAA GTTTTATCTT CTTCAGAGTA AATTTTCAAT GCTAATTGTT 3300 

TAATGTTCAT AATAATTGTA GAAACATCTT CAACTACATT GTCTACTGCT GAGAATTCAT 3360 

GTAAAACTCC CTCAATTTCA ATATACTTAA CGGCTGCACC TGGTAATGAA GATAGTAGGA 3420 

TACGACGTAA GGAGTTTCCT AGTGTAGTAC CGTAGCCACG TTCTAGTGGT TCAACAACGA 3480 

ACTEACCGAA TTTAGCATCT TCACTAATTT CAATTGTCTC AATTCTAGGT TTTTCGATTT 3540 

CTATCAITTA AATATCCTCC TTATATACGT CGACTTAATT TAAAATGTTT GCTCAGTGAC 3600 

CTGTAACAAT ACCATCATAA ATTATACACG ACGACX5TTTT GGTGGACGAC AACCGTTATG 3660 

AGGTACTGGA GTAACGTCTC TGATCGCAGT TACTTCTAAA CCTOCAaATT GTAATGCACG 3720 

AATAGCTGAT TCACGACCTG GACCAGGTCC TTTAACTGTT ACTTCAACTO TTTTTAAACC 3780 

ATGCTCCATA GCTGATTTAG ATGCAGTTTC AGAAGCCATT TGTGCTGCAA ATGGTGTTGA 3840 

TTTTTTAGAT CCTTTGAATC CTAATGCACC AGCTGATGAC CATGATAAAG CATTACCGAA 3900 

CTCATCAGTG ATAGTTACAA TAGTGTTGTT GAATGTTGAA CGGATGTGTG CTACACCATT 3960 

TTCAATATTC TTTTTCACTC TACGTTTACG AGATACTTGT TTACGTGCCA TTTAAAATTT 4020 
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CCAGGTCTAA TACCTGGAAC ATAGCTACCT TGTTTCTTAA GGTTATCAGC CATTTTTTCC 5940 

GGATTAACTT GTACAAATGC ATAGAAGTAT GTGAATAGTA TAATTAGTAC AATATATACA 6000 

^ ACCATACCAA CATTACTTGA AGGATTTGCA GCATTCGCAA TGTTTTGTGC CCATTCTTTA 6060 

TCTGGATAGA ACAACGTTAA TGTTCTAGGC AGTAAGAAGA ACGCCATTGC AAAGATTACA 6120 

GGAATAACAC CGGCTGAGTT CACTTTTAAA GGTAGATAAG TTGCCTGTGA ACCTAATCTT 6180 

70 

TGAGCAGTTT GTTTCTTAGC ATATTGAATC GGAATTTTAC GAACGGCTTC AAGTACATAA 6240 

ATAGCACCTA CAGTTAATAG TATCAGTGAC ACTAAAAGTC CTAATACTTT CAACCATGCT 6300 

AATGATGTAT CTTCTTGCCC AACX3AACGCA tTTGTcCAAA TTGAATTAGA CTGGCTGGCA 6360 

75 

ACGTTGATAA AATACCCGCA AATATGATAA TAGAAATACC ATTACCAACA CCGAACTGAG 6420 

TGATTTGATC ACCAAGCCAT ATTAAGAAAG CAGTTCCTGC TGTnCAAAAC TAGTGCTATT 6480 

20 AATAAATAAC TCATAATTGA CTGATTGATA ATCAGCGCAC CTTTGAGATA ATTATTAAAT 6540 

TGGAATGCCA TACCTATAGA TTGGATAAAT GCTAAAGAAA TTGCTAAATA ACGAGTAACG 6600 

TTATTTAACT TTCTTCTACC TACTTCACCT TGTTTTGCCC ATTCTGAGAA TTTAGGGACA 6660 
ATATCCATTT GTAATAATTG CATTACGATT GATGCAGTGA TGTAGGGTAC AATACCCATT 6720

GCAAAAATAG AAAATCGTTT cAAGGCTCCG CCACCAAAAG TATTTAATAA CTCAGTGGCA 6780 

CCTTGAGAAC CTTGGGGATT ATCAAAAGCT GCAGGATTTA CTCCTGGAGC TGGTATATAA 6840 

30 

GTCCCTATTT TAAAAATTAC TAACATTGCT AGTGTGAAGA AAATCTTGTT ACGAACCTCT 6900 

TTTGTTCTAA AGAAGTTCAC AAGGGTTTGA ATCATTAGAT CACCTCGTGT GcTCCACCTT 6960 

TAGCATCAAT AGCTTCTGCT GCTGAAGCTG AGAATTTATG AGCTTTCACT GTCAATTTCT 7020 

SS 

TATCAAGTGA ACCATTACCT AGTATTTTGA TACCAGATTT TTCATTCTTA ACAACACCAG 7080 

ATTCTACTAA TAAAGCTGGA GTTACTTCAG TACCATCTTC AAATTTATTA AGTTGGTCTA 7140

40 AGTTAACAAT AGCATATTCT TTACGATTTA TGTTAGTAAA ACCACGTTTT GGTAAACGAC 7200 

GGAATAATGG TAATTGACCA CCTTCAAATC CTGGTCTTAC ACCACCGCCT GAACGAGCTT 7260 

TTTGACCTTT GTGTCOGCGA CCACTTGTTT TACCGTTACC TGTCGCAACA CCACGTCCAA 7320 

^ CACGATTGCG TTCTTTACGT GAACCTTCTG CCGGTTTTAA CTCATGTAAT TTCATTTOGG 7380 

CACCTCCTTG ATTATTTTTC TTCTACTGTT ACTAAGTGCT TAACTTTGTT GATTTGCCCA 7440 

CGAATAGCAG GGTTATCTTC AACAACTACT GAACTGTTAG TCTTTTTAAG ACCTAAAGCT 7500 

SO 

TCAACA6TTT TACGTTGTGT TTCAGGACGA CCAATAACAC TACGAGTGAG GGTAATTTGT 7560 

AATTTAGCCA TAACTAGTTT TCCCTCCTTA ATTGTATAAT TCTTCTACTG TTTTGCCACG 7620 

SB 
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CATXSTTGATT GGTGTGTTTG ATCCTAATGA TTTACTTAAG ATATCAGTGA TACCTGCTAA 7740

TTCAAGTACG GCACGAACAG GACCACCAGC QATAACTCCT GTACCAGGTG CAGCCGGTTT 7800

CATAAATACG CTTCCTGAAC CGTAACGGCC AGTAATTGTG TGTGGAGTTG TACCTTCAAC 7860

ACGTGGAACA ACTACTAAAT CTTTTTTAGC TGCTTCAACA GCTTTTTTGA TTGCTTCTGG 7920

TACCTCTTGA GCTTTACCAG TACCGAAACC TACACGACCA TTTTTGTCTC CAACTACAAC 7980

TAATGCAGTG AAACGGAAAC GACGACCACC TTTTACAACT TTTGCTACAC GGTTGATTGT 8040

AACAACGCGT TCTTCAAATT CTTTCGTCTC TTCTTCtCTA CGAGCCATGT ATTTGTCCCT 8100

CCTTTAAATT AAAATTCTAA TCCGCTTTCT CTTGCTGCTT CAGCTAATGC TTTAAcACGT 8160

CCGTGATATA AATATCCTCC ACGGTCAAAT ACXSATTTCTT TAATGCCTTT QTCAGCAGCT 8220

TTTTTAGCAA TTGCTTCACC GACTTTAGTT GCTAATTCAA CTTTAGTTGC TGTAGTAGCA 8280

ATGTCGCTGT CTTTTGAAGA AGCTTGAGCT AATGTTACGC CTTTATTATC ATCAATAATT 8340

T6AGCGTAGA TATGCTTGTT TGAACGATAT ACGTTTAAAC GTGGCTTTTC AGCTGTACCT 8400

GATAAGTTAG TACGAACACG AGCATGTCTT TTTAAACGCA CTTTATTTTT ATCAATTTTA 8460

CTGATCATTT CAATACTCCT TTCTTTAGAG TTTATCTATT ATTTACCAGT TTTACCTTCT 8520

TTACGOCQAA CGTATTCACC TTGGTAACGA ATACCTTTAC CTTTGTAAGG CTCTGGAGGT 8580

CTTACTGAAC GGATGTTAGA TGCTAATGCT CXAACTTGTT CTTTTGAAAT ACCTTCAACT 8640

TTAACGACTG TGTTTTTCTC AACTGAGAAA GTAATGTTTT CTTCAGCTTT AATTTCTACT 8700

GGGTGAGAAT AACCAACX5TT AAGGATTAAG TCTTTACCTT GCATTTGAGC ACGGTAACCT 8760

ACACCAACAA GTTCAAGTAC TTTTACGTAT CCTTGAGAAA CACCTTGTAC CATATTGTTT 8820

AATAAAGCAC GAGTTGTACC ATGGTTTGTT CTATCTTCTT TAGAATCAGA TGGTCTTACA 8880

ACTTCAATTG TGTTTTCTTC TTGTTTGAAT GTCATTCTTT CATTTAAAGT TCTTGATAAT 8940

TCACCTTTAG GACCTTTAAC AGTTACATGA TTTCCATCAA AAGTTACTGT TACGTCACTA 9000

GGGATGTCAA TAATTTTCTT ACCAACACGA CTCATGTTAT GGCACCTCCT TATTTTTTAT 9060

TACCAAACGT ATGCGATAAT TTCTCCACCA ACATTACGTT TTCTTGCTTC TTTGTCAGTG 9120

ATTACACCTT CAGAAGTTGA TACTAATGCA ATACCTAAAC CATTTAATAC TTTAGGCATT 9180

TCGCTAGCTT TTGCATAAAC ACGTAAACCT GGTTTTGAAA TACGTTTTAA TCCTGTGATA 9240

ACACGCTCAT CGTTTTGACC ATATTTTAAG AATAAACGAA GTACACCTTG TTTATCATCT 9300

TCTACGTATT CAACATTTTT AATGAAACCT TCACTCTTTA AGATTTCAGC AATTTCTTTT 9360

TTAATATTTG ATGCAGGTAA TTCTAACTTC TOOTGACGCA CCATOTTTGC GTTTCTTACA 9420
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TCTTTTTTAT TACCAGCTAG CTTTACGAAC GCCAGGGATT TGGCCTTTGT AAGCTAATTC 9540

ACGGAAACAA ATACGGCATA ATTTAAATTT ACGATATACA GAATGTGGAC GGCCACAACG 9600

TTCACAACGA GTGTATTCAC GAACTGCATA TTTTTGTTTT TTTTGTTGCT TAGCAACCAT 9660

TGAAGTTTTA GCCACTTAAT TAGCCTCCTT TAAATAATTA TTTACGGAAT GGCATACCGA 9720

AGTTAGCTAA CAATTCACGA GCTTCTTCAT CAGTGTTAGC AGTCGTTACG ATAACAATAT 9780

CCATTCCTCT AACTTTACTT ACTTTATCAT AGTCGATTTC TGGGAAAATT AATTGTTCTT 9840

TAACACCTAA AGTGTAGTTA CCGCGTCOGT CAAATGCTTT TTTAGAAACA CCTTGGAAGT 9900

CACGTACACG TGGTaATGAT ACTGAAATTA ATTTGTCTAA GAATTCATAC ATTCTTTCAC 9960

CGCGAAGTGT TACTTTCX3CA CCGATTGGCA TACCTTCACG TAAACGGAAA GTCGCGATTG 10020

aTTTTTTAGC TTTAGTTACT AATGGtTTTT GACCAGTGAT CAATTCTAAT TCTTCAACAG 10080

CATTGTCTAA TACTTTAGAA TTTTGTACTG CGTCACCTAC ACCCATGTTC ACAACGATTT 10140

TATCTATTTT TGGTACTTCC ATTACTGAAC TATAATTGAA TTTTTTCATT AAGTTTTCAG 10200

TAACTTCAGT GTTaAACTTT TCtTTTaAAC GGTTCaAAGT GGGATCCTCC TTTCaACTTG 10260

TtATTAATTA TTAGAkTTAA TTTCTTCGCC AGATTrrTTA GCGATACGAA CTTTTTTACC 10320

ATCAACAAAT TTGTAACCTA CACGAGTTGG TTCGTTTGTT TTAGGGTCCA ATAATTGTAC 10380

ATTAGAAACA TGGATTGCTG CCTCTGTTTC TAAGATTCCA CCTTCAGGAT TTAATTGAGT 10440

TGGTTTTTGG TGTTTTTTCA TAATGTTAAC ACCTTCCACA ACGACACGGT CTTTTTTAGG 10500

TAGAGTAGCA ATTACTTTAC CTTCTTTACC TTTGTCTTTA CCTGCGATAA CTTTAACGTT 10560

GTCACCTTTT TTGATATGCA TGTGGGCACC TCCTTATTTG TATTGGTTGT TATTAATTAA 10620

AGTACTTCTG GTGCTAATGA TACGATTTTC ATGAAGTTAC CTTCACGTAA TTCACGAGCA 10680

ACAQ^TCCXSA AGATACGAGT ACCACGTGGG CCTTTGTCAT CAOGGATGAT AACACATGCA 10740

TTTTCATCAA ATTTGATGTA TGAACCGTCA TTACGACGAA CACCTQACTT AGTACGTAC6 10800

ATTACAGCTT TGACAACGTC ACCTTTTTTA ACAACGCCAC CTGGTGTTGC ATTTTTAACA 10860

GTACATACGA TAACATOGCC GATGTTTGCT GTTTTACGAC CAGATCCACC TAATACTTTG 10920

ATTGTAAGAA CTTCACGAGC ACCAGAGTTG TCraCTACTT TCAAGCGTGT TTCTTGTTGG 10980

ATCATTAGTT AAACCTCCCT TATCTCTAAA CTTGTATTAA ATAATTACTG ACTCTTCAAC 11040

AATCTCTACT AAACQAAAAC GTTTTGTTGC TGATAAAGGA CGAGTTTCTT GAATTTTAAC 11100

AATGTCTCCT AATTTAGCTG AATTGTTTTC ATCATGAGTT TTGTATTTTT TAGAGTATTT 11160

TACTCGTTTA CCGTATAATT TGTGTGTTTT GTAAGTTTCA ACAAGTACTG TAATAGTCTT 11220
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

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

TTTTGTAACC TCCTCTTACT TAATTATTGA TTAGCCTTAC TTTGTTCAAT TTCTCTTTCA 11340

CGAGCAACAG TTTTTAGACG TGCAATCGTT TTTCTTACTG TACX5AATACG TGCAGTTTCT 11400 

TCTAATTGAC CTGTAGCTAA CTGAAAGOGT AGGTTAAAAA GCTCTTCTTT TGAAGATTTG 11460 

ATTTGTTCTT CGATTTCTGA AGTGGTTAAG TCTCTAATTT CCTTAGCTTT CATTTGTTTC 11520 

ACCACCCAAT TCCTCACGTT TTACAAACTT AGTmTACT GGAAGTTTGT GACTTGCTAA 11580 

ACGTAGTGcT TCACGCGCAA CTTCTTCAGA AACGCCAGCA ACTTCGAATA AAATTCTACC 11640 

TGGTTTAACA ACTGCGATCC AGCCTTCAAC CGCACCTTTA CCAGCACCCA TACGTACTTC 11700 

TAAAGGTTTT TTAGTATATG GTGTATGTGG GAAGATTTTA ATCCAAACTT TCCCGCCACG 11760 

TTTCATGTAA CGTGTCATTG CTATACGAGC AGATTOGATT TGACGAGATG TGATCCAAGA 11820

CGTTGTTGTA GCTTGTAAAC CAAACTCACC AAATGTTACG TAtTACCGCC TTTAGAACGA 11880 

CCAGTTGTTT TAGGACGATG TTGACGACGA TATTTTACAC GTTTTGGTAG TAACATTATT 11940 

ATTTTCCTCC TCCACTAGTG TTCTTAGTAG GAAGAACTTC TCCACGATAA ATCCATACTT 12000 

TAACGCCTAA TTTACCGTAA GTAGTGTCAG CITCAGCGTG tGCATAATCG ATGTCAGCAC 12060 

GTAACGTAT6 AAGTGGAACA GTTCCTTCTG AATATTGTTC AGCACGAGCG ATGTCAGCTC 12120 

CGCCTAAACG ACCAGATACT TGaGTTTTGA TACCTTTAGC ACCAAGTTTC ATAGCTCTAG 12180 

TGATTGCTTG TTTTTGTACA CGACGGAATG AAGCACGGTT TTCTAATTGA CGTGCGATGT 12240 

TTTCAGCTAC TAAACGAGCG TCAAGATCAA CTTTTTTGAT TTCAATTACG TTGATGTGTA 12300 

CTTTTTTATC AGTTAACGCA TTTAATTTGT TGCGTAATTT TTCGATTTCT GAACOGCCTT 12360 

TACCAATTAC CATACCAGGT TTACCAGTAT GAATTGCAAT GTTGATACGG TTTGCAGCAC 12420 

GTTCAATCTC TACGTGAGAA ACTGATGCTT CTTTTAATTC ATTATCAATA AATTTACGGA 12480 

TTTTTAAATC TTCGTGTAAA AGTGAAGCGA AGTCTTTTTC AGCATACCAT TTAGCTTCCC 12540 

AATCACGGAT AATACCAACA CGAAGTCCGA TTGGATTAAT TTTTTGACCC ACAGTATTCC 12600 

CTCCTTAAAA GTTAATTAAG CTTCTTTAGC TTCTTCTTTA CCGTCACTTA CGACGATTGT 12660 

AATGTGGCTT GTACGTrTGT TAATCGCACT TGCACGACCT TGCGCACGTG GACGGAAACG 12720 

TTTTAATGTT GGTCCTTCGT TAGCATATGC TTCTTTAACT ACTAATTCAT CTGTGTTCAT 12780 

GTCATAGTTA TGTTCAGCAT TAGCTAAAGC GGACATTAAT ACTTTTTCAA TTACTGGTGA 12840 

TGAAGCTTTG TTTGTTAATT TTAAAATTGC AATAGCTTCA GCAGCATTTT TACCTCTGAT 12900 

TAAGTCAAGA ACTAGTCTTA CTTTACGAGG TGCGATTCTT ATTGTTCTAG CAACOGCTTT 12960 

TGCTTCCATT AGGATGTCCT CCTCTACTTA ATAGATATTA TCTTCTTGTT TTCTTGTCGT 13020 
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TATCTTCAGT TACATATACA GGTACGTGTT TACGTCCGTC GTATACTGCA AAAOTATGTC 13140

CGATGAAATT AGGGAAAATT GTAGAACGAC GTGACCATGT TTTGATTACT TGTTTCTTTT 13200

CGCTTCCTTC TTGAGCTTCA ACTTTTTTCA TTAAATGCTC ATCGACGAAA GGTCCTTTTT 13260

TAATACTACG AGCCATTTGG GCGCCTCCCT TCTTATTATG TGCGTGCAGC 7TTAAGCCGC 13320

ACACCCAAAT AAGTTGATTA TATTATTTTT TCTTACGTCC ACGAACGATA AGTTTGTCTG 13380

ATGATTTTTT ACCACGACGA GTTTTCTTAC CAAGCGTAGG TTTACCCCAT GGTGACATTG 13440

GAGATGGTCT ACCGATAGGA GCACGACCTT CACCACCACC GTGTGGGTGA TCGTTAGGGT 13500

TCATTACAGA ACCACGAACT GTTGGACGGA TACCTTTCCA TCTTGAACGT COGGCTTTAC 13560

CAACGTTAAC TAATTCGTGT TGTAGGTTAC CAACTTGACC GATTGTAGCA CGGCAAGTAG 13620

ATAAGATCAT ACGAACTTCA CCAGATCTTA ATCTQATTAA TACX5TATTTA CCTTCTTTAC 13680

CAAGTACTTG AGCACTTGCA CCAGCTGAAC GAGCGATTTG TCCACCTTTA CCAGGTTTAA 13740

OCTCGATGTT GTGTACTACT GTACCAACTG GAATGTTTTG TAATGGTAAT GCGTTACCAA 13800

CTTTGATGTC AGCTTCAGCA CCACTTTCAA CGATTTGACC TACTTCTAAT CCTTTAGGAG 13860

CAATOATATA TCGTTTTTCA CCGTCTGCAT ATACAACTAA AGCGATGTTT GCTGAGCGGT 13920

TTGGATCATA TTGAATAGAA TCAACTTTTG CATTGATACC ATCTTTGTTA CGTTTGAAAT 13980

CGATAACACG GTATTGACX3T TTGTGTCCAC CACCATGGTG TCTTACAGTC AATTTACCTT 14040

GGTTGTTACG TCCCGCTTTT TTCGGTAGCG GTTTTAATAA TGACTTTTCA GGTGTAGTTT 14100

TCGTGATTTC TGCGAAATCT AACGAaGTCA TATTACGACG ACCATTTGTT ATTGGCTTAT 14160

ACTTTTTAAT AGCCATTGTC GCTTACCTCC TTAATGGTAA TTGTTTTATT AGTTAAATAA 14220

GTCGATTGAT CCTTCTTTAA GAGTTACAAT CGCTTTTCTT CTTTTGTTTG TATAGCCTTG 14280

GTAACGGCCC ATACGTTTTT TCTTAGGTTT GTAATTCATG ATATTAACAC TTGCAACTTT 14340

TACGTTGAAG ATTTCTTCAA CTGCCATTTT TACTTGTGTT TTGTTAACAC GAGTATCAAC 14400

GTCGAAAGTG TATTTGTCTT CAGCCATTGC TTCAGAAGAT TTCTCAGTGA TTACGGGGCG 14460

CTTAAGAATA TCTCTTGCTT CCATTATCCXj AGCACCTCCT CAACTTTTTT AGCAGCAGCT 14520

TCAGTAATTA CTAAGCTGTC AGCATTAGTG ATATCTAAAA CATTTAAACC TTGAGCAGTT 14580

GTCACTTGAA CGCCAGGGAT GTTGCGTGCT GATAATTCAA CATTTACATC TTCGTTTTCA 14640

GTAACTACTA ATACTTTTTT AGGTTGTTCT AATGTAGATA ATACATTTTT GAATTCTTTA 14700

GTTTTTGGAG CTTCGAAGTT GAATGCGTCA ACTACAGTTA AGCCATTCTC TTGAGCTTTG 14760

AAAGATAATG CTGAGCGTAA AGCTAAACGA CGCATTTTCT TAGGCATTTT GTATGCATAA 14820
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CCTTGACGAG CACGACCTGT TCCTTTTTGC TTCCATGGTT TACGTCCGCC ACCGCTTACT 14940

GCTGAACGAT TCTTAACAGC ATGCGTACCT TGACGTAATG AAGCACGTTG TAAATTAATA 15000

GCTTCGAATA AAACGCTATT ATTTGGCTCA ATACCGAATA CTGCATCGCT TAATTCGATT 15060

GAACCTGATT TAGTTCCGTC TAATTTTAAA ACATCATAAT TAGCCATTAT GCATTTCCTC 15120

CTTTCACTTC TTATTATTTA TTACCTTTTT TAATTGAAGT TCTGATTTCT ACTAAACCTT 15180

TTTTAGGTCC AGGTACGTTA CCTTTTACTA AGATAACTTT GTTTTCTGTG TCAACTTGAA 15240

CTACTTCTAA GTTTTGAACA GTTACAGTGT TTCCACCCAT ACGTCCTGGC ATTTTTTGGC 15300

CTTTAAATAC TCTAGAA6CA TCTGAAGCCA TACCTACAGA ACCTGGTGCT CTGTGGAAAT 15360

GAGAACCGTG TGACATAGGT CCACGAGATT GTCCGTGGCG TTTAATTGCA CCTTGGAAAC 15420

CTTTACCTTT TGATAGGCCT GTTACGTCAA TAACX3TC6CC AGCTACAAAA GTATCTACTG 15480

AGACTTCTT6 AaCCTAcTcG TAAGCATCCA CGTCTACATT GCGGAATTCA CGAATGAAGC 15540

GCTTAGGTGC TGCGTCAGCT TTTTTAGC6T GACCTTCAGC TGGTTTATTA GCATATTTAT 15600

TAGATTTTGC ATCTTTTTTG TATGCTTTTT TGTCTTCAAA TCCAACTTGG ATTGCGTTGT 15660

ATCCATCAAC TTCTACA6TT TTCTTTTGTA ATACAACATT TTCTTTAGCT TCTACTACTG 15720

TTACAGGGAT TAATTCACCG TTTTCTCCGA ATACTTGTGT CATCCCAATT TTTCTTCCTA 15780

AGATTCCTTT GGTCATCGAA AGTCCACCTC CTAAAATTGT CTATTATAAT TTGATTTCGA 15840

TGTCTACACC AGATGGTAAG TTTAAQCCCA TTAAAGCGTC AACTGTTTTT GGTGTTGGGT 15900

TTACAATATC GATTAAACGT TTGTGTGTAC GTTGTTCGAA TTGTTCACGT GAATCTTTAT 15960

ACTTATGCAC GGCACGGATG ATTGTGTAAA CTGATTTCTC AGTTGGTAAC GGAATTGGTC 16020

CAGAAACATC TGCACCAGAA CGTTTCGCTG TTTCTACAAT CTTCTCTGCT GATTGATCAA 16080

TTACSCGGTG ATCATAAGCT TTTAATCTGA TTCTGATTTT TTGTTTTGCC ATAATTTTCC 16140

X WAX Jtunv* TTCTCCACGA AAACTATCTT ACACAGCGCC 16200

ATGGCAAAGC GGCCGGGTGT GTCAGTAACC TTTOGCTTCA TCGCTTTTCT TAAAGTCCAA 16260

CGTTAGTTAT ATTACACGAA AAACATCGAT AAATCAAGGC TTTTCACATA ATTTTTCTAT 16320

CTGTCTAACA CATACITTTA TATTTnACTT TATATACTTA GTCAGTTCAA CTATTTTCGA 16380

GATATTTTnA ATTTCCn 16397



(2) INFORMATION FOR SEQ ID NO: 206: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29555 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 206: 

TnAGTTGTTT CTGCCACGAA AGATTCAATG GCTTTTCTTG CTTTACGCTT TTCTTTCAAT 60 

GGCAAATCAC CAATCATTTT TTTAAGGTGA TGTGGGTTTA CAACACCACT ATACTGGTAG 120 

TCATTTGAAn TTGTTTTTAG GGCTTGTTCA TCGATAGATC TCTCTCCAGC AAATCCTTTG 180 

AACTCCGCTT CTTTTTTAAT ACTTTCGAAA TTAACATATT CTTGATCGAT ATCATCATCC 240 

TTATTTAAAG AAGGTACAAC ATTGTCGATG AATTCTCTAA TTAGATCTCG TTTTAACCTC 300 

AATGtCGGAT CATCTGCATG ATCTAAAATG CGTCTAATTT GTTCTTGGTT ACGACGTTGT 360 

TCCGCTTTGT CTTCAAGATC AATTTGTCTC AATATATTCA TAATATAATT CACATTAATC 420 

GTATCATTAC GCATCATTTC TATTTCGAAA TCAATATCAT TTAAAATGGA TACTTTATTT 480

TTCTCAGCCG TCGCTCTTTT TACTTGATCG TACACAGCTA AATATTTACT TTTATAGTCT 540

TCATTCTCTT GTTCATCCAT TCCAATTTCA TCAATTOTAA ACTCAAACTC GTCAAATGCT 600 

TTTAAACGTA ATATTATTTT AGCTAATAAA CGATAAGCtT CAACAAAGCG CTTTAGCTCT 660 

TCTTCATCtT GaATGtCATC AACCATGTGT GGTGTCGGCA CAATCATTTT AAGCTCACGA 720 

TAAGCGTCCA TAAATTCTTT TTTATAGTCT TCATAACTGC GCATTAAAAT TGTATCCGTA 780 

TCATTTGTTT GTGAGAATAC TCTCAGTGCA TCGTCTGTCT CTTTTTTCAA GTCACGATAG 840 

TTTACAATTT TACCAAATGG CTTTGATTCT TTTTCAACCC TATTTGTACG TGAATACGCT 900 

TGAATTAAAT CATGATACAT TAAATTCTTA TCAACATATA AAGTGTTCAG TACTTTACTA 960 

TCAAAACCAG TTAAGAACAT ATTAACAACG ATTAAGATAT CAATTTTACT ATCTTTAACG 1020 

CCCTTTTTAA CGTTTTTtQA AATATGATTA AAATACTCAT TAGTTGtGGC TGnTGaAAAA 1080

TTCCTCTCGA ACTTTTTATT ATAATCACTA ATCATTATCT CTAATTTTTC ACGTGAATGA 1140 

TATGGCACTT CACCATCACG ATCATCTTCA TTAGGTTTAA ACGTAAATAT ACCAGCTATC 1200

GTTAACGOTT GTTCCAACTT TTTGTTAAGT CQCTTAAATG TCTCATAATA TTTAATAAGC 1260 

GCGTGAATAC TTTGGACTGT AAATATACTT GAATATTGAC GATTAC6TGT ATATTTATCA 1320 

TGATTATTGA TGATATGTCG TGriACTAAT TCCACACGTT TATCCGCTAA CCATACTTCT 1380 

TCCGTATCAA TTGCTTCAAC CATOCtOTTA TCTTCTGCTT TTAAAQCTTT ATTTTTAAAA 1440 

GTATTAATAT AGTCAACTGA GAAACCAAGT ACATTACCAT CATGAATGGC ATCTCTAATT 1500 

AAATACGTAT GTAAGCATCT ACCGAAAATA TCTGCAGTTG TTCTACCATC TTGACTACTA 1560 

TTTTCTGOAA AACGTGGCGT ACCAGTGAAT CCAAAGTATT GGGCATTTTT GAAATGTTGT 1620 
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GTCGCAAGAT GCTTCCTGTA ATTATCAAGT GCCATTTTCG ATTGGGTTAT ACAATCTAGA 3540 

ATCGCATGAT AATTTAATGC TACAAATCGA TAGTACAATA TATCTACCGT GAATAACTGT 3600 

GCAAATAGTG ACGTTGTAGC CGCCATACGC ATTTCATTTT CATCAGTTCT GCCATAAATC 3660 

AATGCATAGT CTGCAATTTG AGCCACTGGA TTATTAGCTG TACTAGATAT AGTTATGATG 3720

GGAATACTGT AATGTGTGGC CACCTGTGCA ATTGACTGCA ATTCACTATG ACTACCTTGA 3780 

TTCGTCACAA AAATCATGCA ATCTCTATCA TCATGCGTCO CAAATGTTGA CACAAGTAAA 3840 

TGCGTTTCAT GTAATAACCT GACATTTAAG CCAATACGAG ATAACTTTTG AAAAAGATCA 3900 

CCAATAGTCA AACTCGATGC GCCAAATCCA AATAAAAATA TTGTCCTGGc ATTTTTCAAC 3960 

ACATCACAAA TTGCATCAAT TTGCGCATCC ATAATATTAG TAGCTACAAA TCGCATCGTA 4020 

TTCGTTGCTC TAGCAATCAT TTTATTTTTC AAAGTTTCTA CAGATTCATT TTCAATCAAT 4080 

TCTAAATGTG GATTGGTTGC AATATCTTCG GGTAAGTATC GAGATATCGC AATCTTTAGC 4140

TCTTGAAAAC CTTGATGTOT CATTTTCCX3A CTAAATCTTU^ CAATTGATGC TGTACTAACA 4200 

TTCGTAACAT CTGCCAAATC ATTCACAGTC ATATCAATGA TTTTATGTGG ATTCTTTAAA 4260 

ATGTAATCAG CGATTATCTT TTCTGTCTTC GTAAAATCAC TCAACTGCTT ATCAATGCGA 4320 

TATAAAATAT TTGTCATCAT TAATCACCCA ACAAATCTGT CTGTCGCATC GCCTTTGTCG 4380 

TTCCAAATAA ATATGTACAA ACGAATCCAC CAGCATACGC AGCAAGTAAT CCTGCAATAT 4440 

AACCTAAATA CATATTATCT GAGATTAATG GTAATAGTGA CACACCACTT GGGCCTATTG 4500 

CTTTGGCACC AATATGTCCA ATTCCACCTA TTACAGCGCC ACCAATACCA CCACCAATAC 4560 

AAGCAGTTAA GAAAGGTCGA CCTAATGGCA AAGTCACACC ATAGATTAAT GGTTCTCCGA 4620 

TACCTAGGAA ACCAACTGGC AATGCACCTT TTAAAGTATT ACGTAATGTT GTGTTGCGTT 4680 

TACATCTTAC CCAAAGTGCT AATGCGGCAC CTACTTGTCC AGCACCAGCC ATCGCTGCAA 4740 

TTGGCAATAA GTAAGTAGCA CCTGATTGGT TAATCATTTC TATATGAATT GGCGTAAAAA 4800

TATGATGAAG CCCTAACATA ACTAACGGTA QGAAOCTTGC ACCAATGATA AATCCACTAA 4860 

ATACGCXJVCC AATACTAATA ATTCCGTTAA CTACTGAAAC TAAACTGTCT GAAACAAAAC 4920 

CTGCTAATGG CATAAAGATA AAGATAGTTA ATAGTCCTAC AATCAACAAT GCAATAGTGG 4980 

GCGTTACAAT AATATCAATC GCATTTGGCA CAATTTTATG TAATCTCTTT TCGACAATAC 5040 

TTAAAATCCA AACX3GCAAAA ATAACGCCAA TAATCCCACC TTGTCCAGGT TGCAATGGTT 5100 

CTCCAGTGAA GACATTCATT AAAATATTTT TACCAGCAAT ACCCGTTAAT AACGTTGTAC 5160 

CACCAATCAC GCCACCAAGT CCTGGTGTCG CACCAAATTC TTTAGCCGCA TTAATACCAG 5220 
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GCGTAATCCA AGCACCTGAA ATATAGCCTC CCACCATTAA GTTACTCAGT ACTGCTGCAA 5340

TACCACCAAT TAATCCAGCT CCAATAAATG CAGGAATCAA CGGTATAAAG ATATTGGCAA 5400

TTGATTTCAA TACTTTATTC AACTTACCAT TCTTTTGTTT TGCTTTATGC GCTTCCTTAT 5460

TCGCCTTTGC TTTATCAGCT GCATATGATT TATAGTCCAT TTTTTCACTA TCATTGTGAT 5520

GGTGTGGTAT TGGGTCACCT AGTTTAACAC CACTTAATTC CGCCATATGA TTAGCCACTT 5580

TATTGatGTA CCAGGTCCAA CCACAACTTG AATGCGTTCA TCGTGTATAA CACCCATGAC 5640

ACCATCAATA TGCCTTAGTT CTTGGTCATC TACTTTATTC TCATCTAATA CTTTAATACG 5700

CACACGTGTC ATACAGTTCA TGACACTATC TATATTATCC ATACCACCTA CTGCAGCAAT 5760

AATTCGTTCT GCAAGTTGTT GTTCTTTGGT CATTTAAATC CCTCCTAAGG TTGTCTATCT 5820

CTGATTGCTC GTTTAAaATG TCACCATTGT TTAATAACCG TCTTGTTGCT TCTTCCTTAG 5880

AAATGCCACA CATACCCATA ACTGTCGCAA CTTTCACATC ATGCTCAOAT ACCTGATATA 5940

ACGCCATTGC TTCATCATAT GTGATAGCAC ATATTTCTTG AATAATACGC ACTGAACGGT 6000

CGATCAGTTT TTGATTGGTT GCTTTAACAT CAATCATGAG GTTATCGTAA ACTTTTCCGA 6060

CACCAACCAT TGTGATGGTT GAAATCATAT TTAAAATTAA CTTTTGTGCt GTACCAGACT 6120

TTAAACGTGT TGAACCAGTT AATACTTCTG GACCAACTTT AACTTCTACT GGATAcTGCG 6180

CAATTTCACT TATAACTGCA TGTTCATTGC ATGAAATAGA TACTGTTGTA GCACCGATTG 6240

TGTTAGCAAA TGTTAAACCG CCTATAACAT ATGGCGTTTT GCCACTCGCG GCAATTCCTA 6300

TAACGACATC TTTTGATGTT AAATCTATAT TTTTCAAATC TTCTTCCGCT AATTTTTTGT 6360

GATCTTCCGC ACCTTCTACA GCCATCGTCA TAGCATGTTG TCCACCAGCA ATAATACCTA 6420

TAATTTCATG AGGGTCAGTA TTGAATGTAG GTACACACTC CGCTGCATCT AAGACACCCA 6480

ACCTTCCACT TGTACCTGCA CCGATATAAA TCAATCGTCC ACCCTTTTTA TACTGTGCAA 6540

TrGTriTTTT AATTACTTTT GTCAATTGTG GTATTGCCTT TCGAACTGCT AACGGGACTT 6600

GCTGATCTTC TTTATTCATC GTAATTAAAG CCTCTTCCAC AGTCATTTCA TCAAGATGCA 6660

TCGTCGCTTC ATTACGCGCT TCGGTCGTAC TATTTTCCAT CACTTCTTAC ACTCCCTAGT 6720

TTTTTGAAAA TCAAATGTAT CATTCXX3CTC GATACAACTT AACA6TGGTA AOTCTTCTTT 6780

AATAATTTGT GCAaCAACAT TCACATTGTC ATGTGCACTA AGCGTTTGTC TCACAATTTG 6840

CATTTCGCCT TGATAACGTC CGTTATTCAA ATTATCAACG GTTACTGAAC CAATGCGTCO 6900

TTGCGTCX3TA AACTGTGGTT GAATCGAATG TGGACATATT TGTCTTGACG TTTCCGAACG 6960

AATGACATTT TCCGGATTAT CCX3GG0GTAC TTTATGACAC ATATCGAAAA GGTAAGTCAC 7020
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AAGTTGTTTT GCCTGCCTCA TTTCAATCAA TGAGTCTCCA ACTAACACTT CAGATACACC 7140

AGTTTCTTGT AATAATTTAG CTGCAACGAC AGGATGACTA TGTCTCGTTG CTTCAATTGT 7200

TGGCAAGCCT TTATGCAAAG GACCTCGCAA ATCACrCCCT ACAATAAAAC CATATATTTG 7260

TGCCTTTGGA TTAAATTGAT AAATGAGTTC ATTTTTCTTA TTGACCAAGT CAACAGATAA 7320

TCCCGTATCT GGTCTTGGAT AATAGTTATG ACAAAATGAA AGTAATGTAA AATCATTCAA 7380

TTGTTGATGT AAGCTTGTTA ACAATTCCCG GGAAATAATA CTTGCATTCA AACAGCACTT 7440

TAAACCCTGT GCCATTATCG CTTCGATTGC CTCAATTGAT GTACTATGAT CGATACX5AAT 7500

CATAAATTGT GCATCATATT GTCGAAGATG GTCATAAAAA GATGGTGTTA AAATAGATGG 7560

ATTAGCATCT ATGAGGTAAG TCACTTGTTC ATGTTTTAAT AAATTGAGTA GTTTTGTGAA 7620

ATAATGATAT TTTGTCTCGT CA1CTVCVVC TGGTATTTGT ACAGATGTAA AAATCATTTG 7680

GTAACCTTGT TTAATCATTC GCTTAATATA CGCTTCATCT AAAGGTTGTC CTAAATACAC 7740

TGAAAAGCCT GTCAAAGTAG CCCTCCTTAA CAATATAATT ATTAGGAAAA TATAGTTGAT 7800

TTGTGTAATC GCTTACATTT TACTATAAGA GAAAACACAT TACAATATTA ATCAGTTAAA 7860

GCCTGTTCAT TGTAATAATC TTACATATTT CTGTCACAAG TTAATTATTA CACCATCAAA 7920

GATTATCCTT TCTTTTAAGT GCTGATAATA GCTGCTACTG CTGGATTATT ACAATAACTT 7980

TTATACATTT TATTCAGGAT TATCTTATAT TATGTTTTAA TAATAATCTG TGAACAATTA 8040

AGAGATTTGA AATTGAATTT AATAATTGTA TTGAAAACGC ATACTTCACC ATGCTAAAAT 8100

AGGAGTCGCA AACAAATAAG ATTCAATAAG ATGTGATGGT TACCAACACA GTCTATTTGC 8160

TCGTGTCTTT TTTTATTGAA TCTTAAATAA TAAATACAAC TTTGGAGGTT GGACAAGTGA 8220

GGAAGAAACT TTTCGGTCAA TTGCAACGTA TTGGTAAAGC GCTAATGTTA CCTGTTGCGA 8280

TTTT^CCAGC AGCTGGTCTG TTATTAGCTA TCGGTACAGC TATGCAAGGT GAATCATTAC 8340

AACACTACTT GCCGTTTATA CAAAATGGTG GCGTACAAAC TGTCGCTAAA TTAATGACAG 8400

GTGCTGGTGG TATCATXTIT GATAACTTGC CTATGATTTT CGCATTAGGT GTCGCAATCG 8460

GATTAGCTGG CGGTGATGGC GTAgcAGCTA TCX3CAGCATT CGTCGGTTAC ATAATCATGA 8520

ACAAAACAAT GGGCGACTTT TTACAAGTTA CACCTAAGAA TATTGGTGAT CCAGCGAGTG 8580

GTTACGCTAG CATTTTAGGT ATCCCAACAT TACAAACAGG TGTGTTCGGC GGTATTATAA 8640

TCGGGGCCCT GGCAGCTTGG TGTTATAACA AGTTCTATAA CATTAACTTA CCATCTTATT 8700

TAGGTTTCTT CGCTGGTAAG CGTTTCGTAC CTATTATGAT GGCTACAACA TCATTTATTT 8760

TAGCATTCCC AATGGCATTA ATTTGGCCAA CX5ATTCAATC AGGATTAAAT GCATTCAGTA 8820
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TATTAATTCC ATTCGGTCTA CATCACATTT TCCACGCACC GTTCTGGTTC GAGTTTGGTT 8940

CATGGAAAAA TGCAGCTGGT GAAATTATTC ACGGTGACCA ACGTATCTTT ATCGAACAAA 9000

TTCGTGAAGG CGCACATTTG ACAGCTGGTA AATTCATGCA AGGTGAATTC CCTGTTATGA 9060

TGTTCGGTTT ACCTGCAGCA GCTTTAGCAA TTTATCACAC AGCTAAACCT GAAAATAAGA 9120

AAGTAGTAGC AGGTTTAATG GGTTCTGCTG CTTTAACATC ATTCTTAACT GGTATTACAG 9180

AACCATTAGA ATTCTCATTC TTATTTGTAG CACCATTATT ATTCTTTATT CACGCaGTAC 9240

TTGATGGTTT ATCATTCTTA ACATTGTACT TATTAGATCT TCATCTAGGT TATACATTCT 9300

CAGGTGGTTT CATCGACTAC TTCTTACTCG GTATACTACC TAATAA6ACA CAATGGTGGT 9360

TAGTCATTCC TGTAGGTCTT GTATACGCAG TTATTTACTA CTTCGTATTC CGATTCTTAA 9420

TTGTAAAATT AAAATACAAA ACACCAGGTC GTGAA6ATAA ACAATCACAA GCGGCTACTG 9480

CTTCAGCAAC TGAATTACCA TATGCAGTAT TAGAAGCTAT GGGTGGCAAA GCAAACATTA 9540

AACATTTAGA CGCTTGTATC ACACGTCTAC GTGTTGAAGT TAACGACAAA TCTAAAGTTQ 9600

ATGTTCCTGG TTTGAAAGAT TTAGGCGCAT CTGGTGTATT AGAAGTCGGC AATAATATGC 9660

AAGCAATTTT TGGTCCTAAA TCTGACCAAA TCAAACATGA AATGCAACAG ATTATGAATG 9720

GTCAAGTAGT AGAAAATCCT ACTACTATGG AAGACGATAA AGACGAAACT OTTGTTGTTG 9780

CAGAAGATAA ATCTGCAACA AGCGAATTGA GCCATATCGT GCATGCACCA TTAACTGGTG 9840

AAGTAACACC ATTATCAGAA GTGCCTGATC AAGTGTTCAG CGAAAAAATG ATGGGTGACG 9900

GTATCGCTAT CAAACCTTCA CAAGGTGAAG TTCGTGCACC ATTCAACGGT AAAGTACAAA 9960

TGATTTTCCC AACAAAACAT GCAATTGGTC TTGTATCAGA TAGTGGTTTA GAACTATTAA 10020

TCCACATCGO TTTAGACACT GTTAAATTAA ACGGAGAAGG CTTTACTTTA CATGTTGAGG 10080

AAGGTCAAGA AGTTAAACAA GGTGATTTAT TAATCAACTT TGATTTAGAC TACATCCGCA 10140

ATCATGCAAA GAGTGATATT ACGCCTATTA TCGTGACACA AGGAAACATT ACAAACCTTG 10200

ATTTTAAACA AGGTGAACAT GGCAACATTT CATTTGGCGA TCAATTATTT GAAGCTAAAT 10260

AATGCTTACT ATAAACAGGT GCGTATACCT TCATAAGGTG ACGCGCCTGT TTTTTCTTTG 10320

CTATTGTATT TTGCAGCATC ATTGATAGTT CGCTCTCCCC TTAAATTTTG AATTTTAAGA 10380

TCATCAATTA AAGCCCCCCT TCATACTCAT TTCCTAAAAA ATATTAATTG TTCACTATTG 10440

TTAGCGTTTT CACAACAAAG TCAACTTCCT TGACCTTACA CTATATTCGA GGCTATCATT 10500

TTAAGTGTAA ATATAGAGAA AAGGTGGCTT TTTTTATGAA ACAACGCATT GGAGCTTACT 10560

TAATTGACGC TATTCATCGA GCAGGCGTCG ATAAAATTTT TGGTGTTCCT GGTGATTTTA 10620
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ATGAATTAAA CGCAAGTTAC GCAGCXX3ACG GTTATGCCCG TCTTAATGGA CTCXSCTGCAT 10740 

TAGTTACTAC ATTTGGTGTT GOCGAATTAA GTGCCGTCAA CGGTATCGCA GGTTCATATG 10800 

CTGAACGCAT ACCTGTCATT GCGATTACAG GTGCGCCGAC ACGTGCTGTT GAACAAGGCG 10860 

GTAAATATGT ACATCACTCA CTTGGTGAAG GTACATTTGA CGACTATCGA AAAATGTTTG 10920 

CACATATAAC CGTTGCACAA GGTTATATCA CACCTGAAAA TGCAACAACC GAAATACCAC 10980

GTTTAATTAA TACAGCAATC GCCGAAAGAC GCCCAGTTCA TTTACATTTA CCAATCGATG 11040 

TCGCAATCTC TGAAATTGAG ATACCGACAC CATTTGAAGT GACGGCAACT AAATATACGG 11100 

ATGCATCAAC ATATATAGAG TTATTAGCAA CTAAACTGCA TCAAGCGAAG CAGCCTATCA 11160 

TCATTACTGG ACATGAAATT AACAGTTTTC ACCTCCATCA AGAATTAGAA GATTTTGTAA 11220 

ATCAAACACA GATACCAGTA GCACAACTTT CATTAGGAAA AGGTGCTTTT AATGAGGAAA 11280 

ATCCATATTA TATGGGTATT TACGATGGGA AAATTGCCGA AGATAAAATA CX3AGATTATG 11340 

TGGACAACAG CGATTTAATT TTAAATATTG GAGCCAAATT AACAGATTCA GCAACAGCAG 11400 

GTTTTTCATA CCAATTCAAT ATCGATGATG TCGTTATGTT AAATCATCAC AATATCAAAA 11460

TTGACGATGT TACAAATGAT GAAATATCTC TACCATCATT GTTAAAACAG TTATCCAATA 11520 

TTTCATATAC GAATAACGCA ACGTTCCCTG OGTATCATCG TCCAACATCA CCCGATTATA 11580 

CTGTTGGCAC AGAACCATTA ACACAACAAA CTTATTTTAA AATGATGCAA AATTTCTTAA 11640 

AACCAAATGA TGTCATCATT GCTGATCAAG GTACATCATT CTTTGGTGCT TATGATTTAG 11700 

CATTATACAA AAACAATACT TTTATAGGGC AACCGTTATG GGGTTCTATC GGCTATACAT 11760 

TACCTGCAAC ATTAGGTTCA CAATTAGCAG ACAAAGATCG TCGTAACTTA TTATTAATTG 11820 

GTGATGGCTC ATTGCAACTA ACTGTTCAAG CTATTTCAAC TATGATTAGA CAGCATATTA 11880

AACCGGTATT ATTTGTGATT AATAATGACG GCTATACGGT AGAACGACTT ATTCACGGCA 11940 

TGTATGAACC TTATAATGAA ATTCACATGT GGGATTATAA AGCTTTACCA GCTGTATTTG 12000 

GTGGTAAAAA TGTTGAAATT CATGACGTTG AATCATCAAA AGATTTACAA GACACGTTTA 12060 

ATGCAATTAA TGGTCATCCC GATGTGATGC ATTTTGTCGA AGTCAAAATG GCTGTCGAAG 12120 

aCGCACCGAA GAAACTCATC GATAtCGCTA AAGCTTTTTC ACAACAAAAT AAATAATTTC 12180 

ATCGTATACA GGGTATAAGT TTAAGCGAAT ACTTTATTAA ACOAATAGGA CTCTGATATA 12240 

AGATGATTAA TTTTAATAAA ACCGCTTTAG TGTTAATCGA CCTGCAAGAA GGTATTCTTA 12300 

AAATGGATTA TGCCCCATAT ACAGCTGAAA ATGTCGTTCA AAACGCTAAT AAATTAATAG 12360 

ATGTTTTTAG AAAAAACAAT GGCTTTATCG CTTTTGTTCG CGTGAATTTC TATGATGGTA 12420 
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GCACATCATT CTC7VCAAACT TTGATTAATT TCAGCCAAAA ATATTTACCA TTATCAC3ACA 14340 

TTGGTATGGG CTGGGTTGTT CTCAGTTTGA TTGGTTTCAT TATCGGCTTC ATTATTTATA 14400 

AAATTAAGCA TCGTAAAATT CCACAAGCAT AATACTATGC CACAGTCATA TGTTAAACAT 14460

ATGCTTGTGG CATTTTTTAT TCATACTACA TTAAACTGCA ATCGTATACA TACATATCAA 14520

TGATTATCCA CAAAAAATAT TAGTACTTTC ATTTTACAAA TCACATTAAT ACAAACACAA 14580 

CCTTATCTTT ATATTATTAA ATTTATATTT GACACTTATA TTGAACAACT GTAATATATT 14640 

AATATTAATT CTTTAAAATG TATAAATATA AAGGAGGGAG ACCGATGaAT TCAATCATTG 14700 

aATTAACTGA TTATTATAGC TCTAATAATT ATGCACCACT TAAGCTTGTC ATTTCTAAAG 14760 

GTAAAGGT6T CAAAGTTTGG GATAGTGATG GCAAACAATA TATAGATTGC ATTTCGGGTT 14820 

TTTCAGTTGC AAACCAAGGC CATT6TCATC CAACAAITGT TAAAGCGATG ACAGAACAAG 14880 

CTTCAAAGTT GTCTATCATT TCACGTGTCC TTTATAGTQA CAATCTCGGG AAATQGGAAG 14940

AAAAAATTTG TCATCTTGCT AAGAAAGACA AAGTACTCCC CCTTAACTCT GGTACTGAAG 15000 

CTGTTGAAGC AGCCATTAAA ATTGCTAQAA AATGGGGCTC TGAAGTTAAA GGCATTACTG 15060 

ACGGACAAGT TGAAATCATC GCTATGAATA ACAATTTTCA OGOTCGTACA CTTGGCTCAT 15120

TATCACTATC TAACCAOGAC GCATATAAAG CAGGATTTCA CCCCCTACTT CAAGGCACTA 15180 

CAACAGTAGA TTTTGGAGAC ATTGAACAAT TAACACAAGC TATTTCACCG AATACAGCAG 15240 

CAATTATTTT GGAACCAATT CAAGGTGAAG GTGGCX5TTAA TATACCACCG AAAGGATATA 15300 

TTCAAGCTGT GCGTCAACTA TGTGATAAAC ATCAAATATT ATTGATTGCA GATGAAATTC 15360 

AAGTTGGTCT TGGTAGAACT GGGAAATGGT TTGCTATGGa ATGGGAGCAA GTCGTTCCAG 15420 

ACATTTATAT TTTAGGTAAG GCATTGGGTG GCGGCTTATA CCCTGTATCT GCTGTACTTG 15480 

CAAMAATGA TGTCATGCGT GTTCTAACAC CAGGTACACA TGGTTCAACA TTTGGTGGTA 15540 

ACCCTTTAGC CATTGCAATA TCGACGGCAG CGCTTGATGT ACTTAAAGAT GAACAACTGG 15600 

TTGAACGATC AGAACGCTTA GGTTCATTTT TATTAAAAGC GTTGCTACAA CTTAAACATC 15660 

CTAGTATTAA AGAAATTAGA GGTCGTGGTT TATTTATAGG CATAGAGCTT AACACAGATG 15720 

CTGCACCTTT TGTGGATCAA CTGATTCAAC GTGGAATCTT ATGCAAAGAC ACGCATCGTA 15780

CTATCATTCG ATTGTCTCCA CCTCTAGTCA TTGATAAAGA GGAAATCCAT CAAATTGTTG 15840 

CAGCTTTTCA AGACGTTTTT AAAAATTAAC AATTAATCAT TTATATATGA CATAGGAGGG 15900 

ATTCATGATG ATTAAAGTAG GTATCGTTGG CGGTAgcGGT TATGGC3GCAA TTGAATTAAT 15960 

TCXUVTTGTTA CAAACACATC CTCATGTAAC GATTGCACAC ATCTACTCAC ATTCAAAAGT 16020 
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ACTTACAGTG GaTAATAATG ACTGTGATGT AATTTTCTTT GCGACACCAG CACCCGTAAG 16140 

TAAAACATGT ATCCCTCCCT TAGTAGAAAA AGGTATTCAT GTTATCGATT TATCTGGCGC 16200 

ATTTAGAATT AAGAATCGTG AAATATATGA AGCATATTAC AAAGAAACTG CTGCAGCACA 16260 

AGATGATTTG AATCATGCTA TTTACAGCAT TTCAGAATGG CAATCGTTTG ATAACAATGG 16320 

AACGAAGCTC ATTTCTAATC CTGGCTGTTT CCCTACAGCA ACATTATTAG CATTACATCC 16380 

ACTTATTAGC GAAAAAATAG TAGATTTGTC ATCTATTATT ATTGATGCTA AGACCGGCGT 16440 

GTCAGGTGCT GGTCGTTCAT TATCACAACG gTTcATTTTT CAGAAATGAA TGAAAATCTA 16500 

AGCGCTTATG CAATCGGAAA CCATAAACAC AAACCGGAAA TCGAGCAATA TTTATCTATC 16560 

ATTGCGGGTC AAGATGTATC AGTCATATTT ACACCACATC TCX3TACCAAT GACACGAGGT 16620 

ATTTTATCAA CAATATATCT CAAATTATCA TCTGAATATA CGACTGAATC ATTACATAAA 16680 

TTAATGACCT CTTATTATGC TAATCAGCCA TTTGTCAGAA TTAGAGATAT TGGGACTTTT 16740

CCAACCACAA AAGAAGTACT CGGTAGTAAC TACTGCGATA TCGGCATCTA TGTAGATGAA 16800 

ACAACGCAAA CAGCAATTTT AGTATCAGTG ATTGATAACC TTGTCAAAGG CGCAAGTGGG 16860 

CAAGCCATTC AAAATTTAAA TATATTATAT GATTTTGAAG TGACGACTGG CCTAAATCAA 16920 

TCACCAGTTT ATCCATAAGG GGTGTTA6AA TGAAACATCA AGAAACGACA TCACAACAAT 16980 

ATAACTTTTC AATTATTAAA CATGGCGATA TCAGTACACC TCAAGGCTTC ACGGCTGGTG 17040 

GTATGCACAT CGGTTTACGC GCTAACAAAA AAGACTTTGG GTGGATTTAC TCATCGTCTT 17100 

TGGCAAGTGC AGCTGCCGTA TATACTTTAA ATCAGTTTAA AGCTGCACCA CTTATTGTCA 17160 

CTGAAGACAC TTTACAAAAG TCTAAAGGAA AATTACAAGC ACTTGTTGTT AATTCAGCTA 17220 

ATGCAAATTC TTGTACCGGT CAACAAGGCA TAGATGATGC ACGACAAACA CAAACATGGG 17280 

TTGCTCAACA ACTTCAAATA CCATCTGAGC ATGTTGCTGT TGCTTCAACT GGGGTCATTG 17340 

GTGAATATTT GCCTATGGAT AAAATTAAGA CTGGGACCGA ACATATTAAG GATGCTAATT 17400 

TTGCAACGCC AGGTGCGTTT AACGAGGCAA TTTTAACAAC TGATACCTGT ACAAAACATA 17460 

TCGCTGTATC ACTAAAAATC GATGGTAAAA CCGTTACAAT TGGTGGTAGC ACCAAAGGTT 17520 

CAGGTATGAT TCACCCAAAT ATGGCTACCA TGCTTGCTTT TATAACAACC GATGCATCGA 17580

TTGAATCGAA TACACTTCAT CAATTATTAA AATCTTCGAC TGACCATACA TTTAATATGA 17640 

TTACTGTTGA TGGCGATACA AGTACAAATG ACATGGTATT AGTCATGGCA AATCACCAAG 17700 

TTGAACACCA AATACTTAGT CAAGACCATC CACAATGGGA AACATTTGTT GATGCATTCA 17760

ATTTTGTCTG TACATTTTTA GCTAAAGCTA TAGCCAGAGA TGGCGAAGGC GCAACAAAGT 17820 
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CTATCGTAAG TTCAAATCTA GTAAAATCAG CTATTTTTGG CGAAGATGCC AATTTTGGTC 17940 

GAATCATTAC AGCTATTGGC TACAGCGGAT GTGAAATTGA TCCTAACTGC ACATATGTTC 18000 

AACTGAACCA AATACCTGTC GTTGATAAAG GTATGGCTGT ACTATTTGAT GAGCAAGCTA 18060 

TGTCGAATAC ATTAACTCAT GAAAATGTCA CAATTGACGT TCAGCTTGGT TTAGGTAACG 18120 

CTGCAGCGAC TGCATACGGT TGTGATTTAT CCTATGATTA TGTGCGTATC AACGCATCAT 18180 

ATCGAACATA AGGTGGTGTT GGTTAGATGA AATTTATTGT CATTAAAATT GGTGGCAGTA 18240 

CACTTAGTGA CATGCATCCA TCAATTATTA ACAACATTAA GCATTTACGA TCAAACAACA 18300 

TCTACCCCAT TATCGTTCAT GGCGGTGGCC CATTTATTAA TGAAGCATTA TCAAACCAGC 18360 

AAATCGAGCC ACACTTTGTT AATGGCCTAA GAGTGACTGA TAAAGCAACC ATGACCATTA 18420 

CTAAACACAC GCTCATTGCA GACGTTAACA CTGCATTAGT AGCTCAATTT AACCAGCACC 18480 

AATGTTCTGC AATAGGCTTA TGTGGTTTGG ATGCACAGCT GTTTGAAATT ACATCTTTTG 18540 

ATCAACAATA TGGATATGTC GGTGTTCCGA CCGCTTTAAA TAAGGATGCT TTACAGTATT 18600 

TATGTACTAA ATTTGTACCT ATCATCAATT CGATTQGTTT CAATAACCAT GATGGAGAAT 18660 

TTTACAATAT TAATGCTGAC ACGCTTGCCT ATTTTATTGC ATCATCATTA AAAGCGCCTA 18720 

TTTATGTATT AAGTAATATT GCAGGTGTAC TCATCAATGA TGTTGTTATA CCTCAATTGC 18780 

CATTAGTOGA TATTCATCAA TATATTGAAC ATGGTGATAT TTATGGAGGT ATGATTCCCA 18840

AAGTGCTAGA TGCCAAAAAT GCGATTGAAA ATGGCTGTCC TAAAGTTATC ATTGCATCAG 18900 

GAAACAAGCC AAATATCATT GAATCTATTT ACAATAATGA TTTTGTTGGC ACAACAATCC 18960 

TTAATTCATA ACTATGAAAT TAAGGCCTAA CAAGTTTTGA CACGCGAGAT GATTCCAGTT 19020 

CGATTATCCA TTGCGCTAAA ACATTTATTT ACCXSTTCATC TCGTTAACAA TTTTGAATAC 19080 

AGTfiCGATAC AATATGAGAT GTAAAAAACT AATAACCTTT TACAAATTTG TTTATCAAAA 19140 

TATTTTAAGT TTTGCAAAGC TTTTTATTGT GATTATTTTC ACAAAATACT ATAATGAGGA 19200 

TAGTAAATAG AGAGGAGTCC TTAAGTTGAC GAAACGACAA ATGGGTATAT TCATTTATGC 19260 

TGGAATTATC GGTGGCTTGT TATCTGGAAT TGTAAAATTA GGTTGGGAGG TCATGTTTCC 19320 

ACCTCGCACA CCAGAACGTA ATGCAACGAA CCCACCTCAA GAGTTATTGC AACAATTAGG 19380 

ATTTAGTAGT GAGTTTACGC ATCAAACATA TACATTTTCA AATATGGAAT TGCCTTGGGT 19440 

AAGCTTTATT GTCCACTTTA GTTTrTCTAT CGTCATTGCA ATTATTTACT GCATATTAGT 19500 

TAAAAAATAC GCTTACTTAG CAATGGGACA AGGTGCTGTT TTTGGTATTG CTATTTGGGT 19560 

ATTATTCCAC CTTATCATTA TGCCAATCAT GCATACTGTA CCTGCTGTGT GGGATCAACC 19620 
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AGTGCGACAA CATTTTGTCT ATCGCTATAA ATTAAATTAA TACACTGACT AACATTAACG 19740 

TGAGTTTtAA ATCATCGTTT GAGTATGATG ATTGATGCTC ACGTTATTTT ATTAACTGAC 19800 

ATGATATGAT TCCAGCCAAC TTACGTGAGC ATTAAAGTCT CAAATGCX3TC GTAACAAACT 19860

ATTATTTTCG GTAATTTCAA TATTGCTCAG TATATTTTTA CCTTATCACT TACTTTAATC 19920 

TCGTCATGAT TTTGAATGAT GCCATCGTGT ATTCACCTTT CATTTTTCCA ATAAAAAAAC 19980 

ATCTAACAGT AAACATTTAG GCAGTATAGT TTAAAATCAC TGCGCAATGA TACTGTCAGA 20040 

CGTCATATTA ACTACTCAAT AACTGAAATA CAGACACTTT TTTATAACCC CAGGGTGCCT 20100 

GTCCTAAQAA ACATACCTGT ACCATAAACT GATCAAAAAT AAATTGTTTG AACTTCACTT 20160 

CACGTGATTG ATAAAAGTGT GATTGTGTCA TATCATAAAT GTCCAATCCT TTGATTAAAC 20220 

CTTCACCAAT CAATTTTGTA AAACTTTCTT TTTGTGTCCA TATTTGATAA AAATCATTTA 20280 

AACTACATAT TTGATGTGCT TCX5TTT0TAG AGAAACACGT CACTAACGTA CGCCAGTCTA 20340

AACGTTGTGA TATCTTTTCG ATATCAATAC CAACTGGTTC TTTATCXavCA ACACACACGA 20400 

TATAAGGATA ACTATATGAT AAGCTCACAT AGATGGGCTG TCCATCACGA TTGTGTTGAA 20460 

CAATATCTGC CTTACCTCGT GGCGAAATGT GATAATGCCA TTCATGTGGT AATAAACCTO 20520

TGTCATGTTQ AATTCCATAT TGCACTAAAA TATCTCCCAA TCTGTGCATG AGTTTATCTT 20580 

GATTGTATCT ATAGTTGACT GTACX5CGGTT TTTTATATGA CCAACGACTT TGTGATATTA 20640 

ATTCTTCAAT ACTTTTCAAG TTACTCTGTA ATTGCATTAC AAATACTGTC ATAACTTTCC 20700

CTACTTACTT ATTGAATATT GTTTTGATAT ATTGTGCCCA ATGATACAGC CAATTGTTAG 20760 

TTATCGTTGG CCATTTTTCA CTGATGTGAT TCATTATTTT TAATGTTAAT GTTGTATCTA 20820 

TCATTGCTAG TTGTTGTTCA CGGTCAACAC TAGTTAATCC AATCGTTTCG TACATGTCTT 20880 

GTTTCTGTAA AATTTCATTA AATGATTCAT CGCTGACX3AG TTCAATTTCT TTGCGCTTAA 20940 

CGCATTCTAA CAAAGATTTC ACCGGCATTT TATTAGGTGA TAGCACATGG TAAATGATTT 21000 

GTGGTGTGTT GACCTGTGCT AATGCGACAA TTTGTCTTGC AGTCGTATCC ACAAAAGAAA 21060 

AATCTACAGG CATTTCAGCC ATGCTAACCC CGATACAATC CAGTTGTAAC AAATCATTCA 21120 

TTACCATTGA AAAACGGTTA GTCTTTATAT TTCTCATATG CCATCTTCCA TTGTAAGGAT 21180

TCGTCAAATT ACCAACACGT ACAATCCGAC CATCTAAGCC ATTATTTACA GCTTCTAATA 21240 

CTTTTAATTC ACTATAAAAT TTGCTCCGTG TATATGGTGA TGTTAGTAGT TGCCCTTTAT 21300 

AGACATCCGC TTsTGAAAAT GTCACATCTT CTGTGTCTAT ATCAAAATAA GTTCCCACAC 21360

TTATCGTAGA CACATATATT AACCTTGCAT GATGTTGTTG TGCCAAACGT ATGACATCAA 21420 
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CACCTGCATG AATAATCGTA TCCATGTTTT CTGGTAAAAC AACATCATCC ATACACTCGA 21540 

AATCACCAAC AATGACTTCA ATGTTTGATA ACATTATTTC AACCGTCTCT TCTGAAAAAT 21600 

AATCATTTAA ATTCGTCATC AACTTATACC ATGCTATTTC CTCATTATCA GCACGTATGA 21660

AACAATAAAT GCGATGACTG TATCCTTGTA GTACTTCAAT CAGATAAGCA CCTAAAAAAC 21720 

CTGTCGCGCC AGTCAATAGT GTATTTCCTA GAGGTCGATG ACTTAGACTA TCCTCTAAAA 21780 

TACCCAAGTT ATAACGAGAC ATAACAATCT TTTGTAATTC CGAAAGATTA TCCGGTAATG 21840 

CAACTAATGA TTGTTGATTT TGGTACATAT AATTAACAAT CTGTCGCACG GTTTTATATT 21900 

GGTATAATGT CTGCATTGAA ATATGATGGC CAAATCX5TTT TAAATGCGAG ACAACTAACA 21960 

TCGCCTCTAA TGAGTTACCA CCAAGTTCAA AGAAATCATC GTCAACACCX5 ACATCATTTT 22020 

GTTTCAATAC CTCTCCAAAT ACATCAACAA ATGTCTGCTC AATTTCATTA GAGGGTTCGC 22080 

TATACACTTT ATTAGACTGT TGTATAGGTG ATGGATTTGG CAAACGCGTA GTATCCACCT 22140

TGTCATTCGT OGTTAATGGC ATACAATCGA TATGC6TTAT AGTCTTAGGA ATCATATACT 22200 

TAGGCAGCTG ATCATTTAAA TATTGCTTCA AATCCTGTTC CACTTGTTGC TCTCCGACAT 22260 

AATAAGCATT CAATATATCA TGCGTATCAA AGTGACTTAC TGTTACAACA CAATCAGATA 22320 

TACCACGAAT AGCTAATATT GCATTTTCAA XTTCATCAAG TTCAATACGG TACCCGTTAA 22380 

CTTTCACTTG TTTATCTATT CTTCCTAAAA ATTCAATTTG ACCATCAGAT GTATAACGTG 22440 

CTAAATCACC ACTATGATAC AACTTTCCTT TACCAAATGG ATTATTTTGC CATTTATCAG 22500 

CCATTAATTC TGGACGATTA ATATATCCTA TCGCTAAACT ATCACCTGCA ATACACAACT 22560 

CGCCTGGCAT ACCAATACCG CATAACAAAC CATCTGACAT AATATACACT TGQATGTTAG 22620 

ATAAGGGTTT GCCAATTGGA ATCGTCTCAG GTATCAAATC ACCACAATGA TGTGACCAAT 22680 

ACX3ATGTGAT GACTGTTGAC TCAGATGGTC CATAGGCATT GAAATACGTG CCACAATGCT 22740 

TCTCAATATA TTTAACAAAG GATGCCGTAC TAGTTGCCCC GCCTGTAATC AACTTTTCT^ 22800 

TATAAAAGTC TTCCATAACA CTACACATCT GTAACGGAAT CGACGCAACC GTCACACGAT 22860 

GCTTATTAAT GAGTTGTTGT AACTGTTCTG GATTAACACG TTCCTCTCTA TCTGGAATCA 22920 

CAAGCGTATG ACCATTTAAC AAACAACAAT AAATCTCCAT AACTGATGCA TCAAAAACAA 22980

TATTTGCATG TTGCAAAAAT ACTTCATTGT CGCCTAATTG CAATTCAGTT GACCATGCAT 23040 

GCACTAAATT CAACAAATTT CGTTGTCGTA TGGCAACCCC TTTAGGCATC CCGGTCGTAC 23100 

CAGATGTGTA AATAGCATAC ATCTCATTAT CTAACATCGC TGTGTTTTCA AGTTGATTGC 23160

CATGTAAATC ATCATATTGT TCATTTTCCT TTGATTCAAC AAAGCCTTTA GCATTTTCCA 23220 
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TAGCATCCTC CAAAATTGCA CCTTGTCGTT TATTCGGAAA ATCAATATCG ATAGGTATAT 23340

AAGATGCACC TACTTTAACT GTCGCCAACA TCGCCGCAAT CATTTCAAAA CTACGTTCTG 23400

TAAACAAGGC AACCCGTTGA CCATTGCCCA CACCATTTGA TAGGAGCATG TGCGCAATGG 23460

CATCCACATA GTTGCGTAAT GTTTCATACG TCATTGTCAA ATCATTCATG ACTAGCGCAA 23520

CATGATTACC TTGTCGTGAG ACAACTTCAT TAAAGTAACT TATGATAGAT TTATTTCCCQ 23580

GGACATTAAG CATTCGATCG TTAACATGCG TATTGACCCA ATTTAGAAGT TCCTCCGTGC 23640

CGTTTGGTAT ATCACAAATT TGTAGTGTAT CTTGATGCTT CAAAATATAA TCAATCATAA 23700

TCATACATTG ATTACCCATG TGACGAACTG TTTCTGAGTG ATATAAATCG GTATTATACT 23760

CGATATTGAT TGTATAGTCA TCGCGATCTT CTTCAATGAT GAAAGATAAA TCAAATTTCG 23820

CCGTCACTGA TTTGGGTTGA ATGTGTGTTA ATTTACTATG CCCAAAATGA GCATGATTCG 23880

TTTCATTGTT TTGTAGTACT AACATGACAT CAAATAATGG ATTCCX3TGAG GCATCATGTG 23940

ATTGATCTAA GTCATTTACT AAACATTCGA ATGGGTATTC TTGATGCTCG TATGCCTCCA 24000

AACTCATTTC CTTAACCTCT TGTAAAAACT GTGTCCACAT TTTATCAGGT GACGGTTGCC 24060

CTCTATATAC CAACGTATTA GCAAACATGC CTAGCATTTG CTCCXSTGCCT TTATGCATAC 24120

GCGCACTCAT CACACTACCG ACAACAACAT CATCTTTTCG AGCATATCTA CTTAACAACG 24180

TCATGACCAC ACTCATAAAG AACATAAAAT CAGTAATTTG ATGCTTTTCT ACATACTTTT 24240

GAAGTAGCTG TCTCATTTGT TGATTCATTG TAAATGACAT CATTGCTCCA TTTGTCX5TTT 24300

TAATATTTGG TCTAACATAO TCTGTCGGTA AGCTTAAAAT AGGTACTTCA TCTTTGAATT 24360

GAGATAACCA ATATTGTCTA TGTTTCGTCA TATCACGATG CGACATCCAC TCACTATAGT 24420

CTTTATATTG CAATTTAAGT GGTAACAATA ATTTATGTTG ATAAAGTGCG TTAAGATCAT 24480

TCATTAATTG TATATTACTC ATACCGTCAT TAATGATATG ATGCGTATCT ATAAAGAGGT 24540

ATGCATGTAA GGGACTTCTA ATGTATCTCA CTCTAATTTG ACTTGGCTTT TCCAAATTAA 24600

AAGGTGCTAC AAATTGGCGC ATGATTTCTT GTTCATCCGT AAAATGCGTG TTAACTTCTT 24660

CAAAGTCAAC TGCAACATCT GCCACAATAC GTTGTCGAAC CTCATCATCT ACAACAATAT 24720

ATTGTGTTCG TAAAATCTCA TGTCGCGCTA TCAAACGCTG CACTGCTTGT CGCAATTGAG 24780

CTACATTAAG TTCTGATGAT AACCGCCATA AAAAAGGTAC GTTATACACC GTATCTTTAT 24840

GGTTTGATTT CCATAATAAA TACATACGCT TTTGTGCAGA GCTCAGCACA TAATCATCTT 24900

TAACTATAGT TTCTGGAATC ACTTCATAGT TTTGTTCTTG AACCTTAGCA ATCGCTTGTG 24960

CTAGTTCAAA TACAGTTGGC TTTTGTAATA AATCACCAAT TTGTAATCGT TTCCCAGTAG 25020
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AATTATCATG AATACCTACT TGATTCACAT GTAAAATATC TGCAAAAATT TGGCATAGCA 25140 

AGTGTTCGGT ATCTGTACTC GGTGCTACAT AGGCATCCGT ATCGACATAG TCCATGATAG 25200 

GCAATGCCTT CTTATCTAAT TTCCCATTAA TAGTAATAGG AATTTGCTCA ATATGCATGA 25260

AATTAACTGG TATCATGTAC TCCGGTAAGG TCATACGTAA TTGTGATTTA ATCTTATTAT 25320

GTGATAATGT ATGCATCGCT TCATAATAAG CAACGATATA CTGATCTTGA TCATGATTTT 25380

GAACAATAAC AACTGCTTTA TTAATACCTT GTATACGCTC GAGCGCATGC TCAACCTCTG 25440 

ACAACTCAAT CCTAAACCCT CGAATCTTAA CTTGTTTGTC CTTTCGATAT AAATAATCTA 25500 

TGTTGCCATC GGGTAACAAA CGAACGATAT CACCACTTCT ATACATCAGC TGaTTTATAT 25560 

TTGAATCTTT GATAAATTTA TCTGCTGTCA ATTCTGGCTO ATTTAAATAA CCTGCAGCTA 25620 

ACCCAAAGCC ACTTGTACAT AATTCTCCAG QAATACCAAC GCCACACCGA CGCTCGCCTT 25680 

GCATGATATA AACATGAGTA CCCAGAATCG GTTTACCAAT AGGAATACGA TTTGGAACTT 25740

TGTTAOGTAT ATTATACGTC GTTGTAAATG TTGTATTTTC AGTTGGTCCA TAACCATTAA 25800 

TAATTTGAGG ATGCTTCGGT TTTTGATTAA GCAAATCCAC CCACTTAGCA TTCAATACTT 25860 

CTCCACCAAT TAATAAATAC TTTAACGGTA CCAATACTTC TATTCGTTCA CTAGCAATCT 25920

GATTAAATAA TGAGGAGGTT AACCACATAG TATTAACGTC ATTTTCATTG ATTAATTGTT 25980 

CTACCGCTAT TGGATTTAAT AATTGTTCTT TTTTAGCAAC AATCAGCTTT CCACCATTGA 26040 

GCAATGCACC ATATATTTCA AATGTTGCAG CATCAAAGGC TATAGTTCCT GATAACAAAA 26100 

TCGTCGTCTC TTCATTTAAT GGTACATAAT GATTTTGATG GACCAAGCGA ACAATACCTC 26160 

GGTGCGGAAT TAGTGTCCCT TTAGGGTTAC CAGTTGTCCC CGACGTGTAA ATAACATAAG 26220 

CATGATCTTC TAACGTGTTA CATTTAGAAA GATTATCAAT ATTTTTCCAC GCTATCTTAT 26280 

TCAAiTCAAT GTGATTAATA TTTTGTTTAC CATTTTCATA TAAAGCTTGG TACGTTATTA 26340 

CAACTTTAGG CGTTACATCT TTTAAAATGT ACTCCTGACG ATCACTTGGA TAGTTCGGAT 26400 

CAATTGGCAC GTAAGCCCCA CCAGCTTTCA ACACACCTAT CATCGCTATT ATCATCTCAA 26460 

TACTTTTTTC AGCTATGACA GCGACACGAT CATTAGGTTC AACACCATAC TGGTTTCTCA 26520 

AACGGTGTGC TAAATCATTC GCGCGTGCAT TCAATGTTTG ATATGTTATA AACAcTCCGT 26580

CAAATTGCAC AGCGACATGA TTCGGCGTTG CTTCAACTTG TTGCTCAAAT AAGGTAACAA 26640 

CTGTTTGCGC ATCATCTATC TCAGGCAAAC TTAAATTGAT ATCGTCATAT AATTGAATAT 26700 

CAOGTTCTGT CATCAAATTA AGTTCATCTA CAGTTGTTCG TTTATTTCCA TTTTCTTCAG 26760

TAATTTCCAA ATAAATATTT CGAACTAAGT CACTCAGCGT CTCGATTGAG AGCAAATCAT 26820 
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CTAAAGATGT ATGTGCATCA TGTATTTGAT GTACATCCTC AATAACATCA TTACAACAAG 26940

ACATCATATG ATGATAACAA TGAAAAATAG TCTCTAGTGA AAGCGAAGAC TTCGCGCACT 27000

GTAATTGCGA CATATTTTGC AACACACATT TATTAAAATC TGTTGTAAAA CGTTGACATA 27060

CATCTTTTGC ATCGATTGTT AACGTTAACG GCACAATATT TCCGTGTAAA TCATTTGGTA 27120

AATGT6ATGG TACATGTATA CCTAATGTGA CATCATGTTG TTGACTCATT ATATGATTAG 27180

CTAAATACAC ACTAACAGCC AACGATGCCA TATCTATTGA TGTCATATCA TCAATCAAAT 27240

AC6TTTGATA TAAAGCTTGT TCAAATGGAT GCTTAATTGG AAAATAACTA TCAATATGGA 27300

TGTCAGAGTT ATTCTCTAAC CGAAAATAOT TTGAGTCTAA TGCTATATGC GATGCATCTT 27360

GATTGTCTTT ATCATCATTT CTATTTATAT GTGCATGCTG TCGAGTATTG TTAATAACAG 27420

TATTGCCACG ATATGCATTG CATAAATCAT CAAGAAnAAT ATCAATTTGA CTATCATCga 27480

AAATGGaCAC ATGAAAATCT aATAGTATAT ATGCAGCATC AGCGAACTGoi AACAATTTAA 27540

CTTTGAATAA AGGTGAATCA TTAAAATGGT AAGTACTTAA TTCTTGCTTA AAAAAAGCTT 27600

CTAAATCATA GTTTGCGGAA GAAGATGGAA CTTGTTTTAT CTCAATAAAA GGCAGAAATT 27660

CATGAAGTAT CATTGTTAAA TTGTCATCGG TAGTAACATC AAAAAAATGT CTTATAGATG 27720

CATGTTGTgC ACAATTGTCG ATAATGCATA CATCATTTTA GTAGCTTCAA CATTTTTAGC 27780

GAGTTTAACC CAATACGCAT TACGGTGTGT CGTTGATTCT GTATTATTTT TGTATATACG 27840

AAAATATTCC TGTTGAAATC TCAAATTACC CATAATCATA AAAAGTCCTT CTTTCATATC 27900

ATAATACTCA TTACTTACTG AAATTGCATG ATGATATGAT AACCGACGAA ATGTTAATTA 27960

ACTCGTTATG TAATGaTTAA TATaAAACAC CATTCGCAAC ATATGAGCGA TATATTCTAC 28020

CCTAAAATAC ATCTTGTATC ATCGTTACAA TTGGTATATT TTTCAATGTA AATTACATAC 28080

ATCTTCGATA AATAGCACAC TACAAATCGT TAATCACTTT CTGTTGTTCA CATCTCATTG 28140

CAAACTCAAT ATTGTTGTTA CAAAATATCC ATGAAGCAAG TTTATATTAA ACAAACAACT 28200

CGCATAAAAC AATTGTTATC CTTAAATTTT AACAAATTCT TAATAAATTT ATCTCTATTT 28260

TAATTACGAC CAAATTAATA GGTTTTCCAT ATAAAAAGAT GCATAAAATA AATATTTAAA 28320

TAAATTCAAT TTGTATTCAC TTGTTTTTGT CCCCCAAATA CACCAGCAAC AAGCATGCTA 28380

GCACCAATTG TTAAAACGAT AAACATATAC AGTCCCATTT GTAATGACGT TAAGAAAACA 28440

CCCAACACAA TCCCTAACCT AGCTAGTGTT TCTGAAAAAT GAATACCTAA TGCATTAACT 28500

GCACTATATG TTCCTCTTTT AGCTTTAGGA ATAATTTTAA AGCGTTGTTC TGAAACTATA 28560

GGCGAATAAA TAATTTCACC TACAGTCGCA ATTATCATAA AAACAACTAA TAAGCCAAAC 28620
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GCTTTTTTAA AATCTATTTT CAATACAACT TTCGAGATTG AATACGTGAG TAaAATGACG 28740 

ACGACCGTAT TAATCATTAG CAAGATTGCT AACATCTTAG CACCTGTAAT ATCATATGAA 28800 

CCTATACTTA TTGTrrCAAA CT6ATCCTTT AGTCTAATAG CAATATATGA GGAGATTGAA 28860

AATTCACCCA TCATGATGAT ACTGAACCCC GAAATCAATA ACATATAATT ACGGTCTTTC 28920 

AAAACTAATT TATAACTGCG AAATATATTC ATTATTTGTA ATTTTTGATA ACGACTTGCA 28980 

TGCCTCTTGT CATCACTTTG CTTTACTTGA TTTCGGTCTT GAGGTAACCA AATATATAAA 29040

ATAAAGAGTA CAATTAAAAA TATACAAGCT GCTATTAAGA AAAGTAGTAA CATACTGTAG 29100 

CCATACATCA AGCCACCTAA CAATGCCCCA ATAGCTACCG ATAAGTTTGT CATCCAATAG 29160 

CTAATCTTGT AAATATAATG TTCCACGTCT TCGGTAATTG CATCCATAAT TAATGTGTCC 29220 

ATAACTGGAA ATTGTAATCC CCAAACGATT GTAAATATGG CATATGCAAC ACAAAAACCA 29280 

ATAATTTGCC ACAATTGATG TGACCCAAAT ACGCCCATGA ACACAAGCAT TATCACCATC 29340 

GTCGCTTGAT AAATAAGTAC TAGCAACTTT tTCGQAAATA TCTCAATAAG GTAACCAGAT 29400 

ATAATGQACA ATGGAAATTT nAGAACCACT AAACCAACAA GATATATACC GACAATTGAT 29460 

TGACTTAACA TATCTGTTAA ATATAGT6CT ATAAACGGTA TAAATGCTGT CGTAATAATT 29520

AGCTGTAAAA nATTGCTAAT CAATCQTACT TTCAA 29555 
(2) INFORMATION FOR SEQ ID NO: 207: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1539 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 207:
AAAAAAAAAA AAAAAnGGTG AATCTTTAAT TAAACACTAA TATTGTAAAA GATGTTAAGT 60 

AAACGCTTAA TGACACTTAT TTTTTGAAAA TAATAGTAAT ATCATTTTGT TAAATGAAAG 120 
AATAAAGCTA TAATmATTAT AGAATAACTA TTTAAAGGAG ATTATAAACA TGCCAATTAT 180 
TACAGATGTT TACGCTCGCG AAGTCTTAGA CTCTCGTGGT AACCCAACTG TTGAAGTAGA 240 

AGTATTAACT GAAAGTGGCG CATTTGGTCG TGCATTAGTA CCATCAGGTG CTTCAACTGG 300 
TGAACACGAA GCTGTTGAAT TACGTGATGG AGACAAATCA CGTTATTTAG GTAAAGGTGT 360 
TACTAAAGCA GTTGAAAACG TTAATGAAAT CATCGCACCA GAAATTATTG AAGGTGAATT 420 
TTCAGTATTA GATCAAGTAT CTATTGATAA AATGATGATC GCATTAGACG GTACTCCAAA 480 
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AGCTGACTTA TTAGGTCAAC CACTTTACAA ATATTTAGGT GGATTTAATG GTAAGCAGTT 600 

ACCAGTACCA ATGATGAACA TCGTTAATGG TGGTTCTCAC TCAGATGCTC CAATTOCATT 660 

CCAAGAATTC ATGATTTTAC CTGTAGGTGC TACAACGTTC AAAGAATCAT TACGTTGGGG 720

TACTGAAATT TTCCACAACT TAAAATCAAT TTTAAGCAAA CGTGGTrXAG AAACTGCAGT 780 

AGGTGACGAA GGTGGTTTCG CTCCTAAATT TGAAGGTACT GAAGATGCTG TTGAAACAAT 840 

TATCCAAGCA ATCGAAGCAG CTGGTTACAA ACCAGGTGAA 6AAGTATTCT TAGGATTTGA 900 

CTGTGCATCA TCAGAATTCT ATGAAAATGG TGTATATGAC TACAGTAAGT TCGAAGGCGA 960 

ACACGGTGCA AAACGTACAG CTGCAGAACA AGTTGACTAC TTAGAACAAT TAGTAGACAA 1020 

ATATCCTATC ATTACAATTG AAGACGGTAT GGACGAAAAC GACTGGGATG GTTGGAAACA 1080 

ACTTACAGAA CGTATCGGTG ACCGTGTACA ATTAGTAGGT GACGATTTAT TCGTAACAAA 1140 

CACTGAAATT TTAGCAAAAG GTATTGAAAA CGGAATTGGT AACTCAATCT TAATTAAAOT 1200 

TAACCAAATC GGTACATTAA CTGAAACATT TGATGCAATC GAAATGGCTC AAAAAGCTGG 1260 

TTACACAGCA GTAGTTTCTC ACCGTTCAGg aAACAGAAGA TACAACAATT GCTGATATTG 1320 

CTGTTGCTAC AAACGCTGGT cAAATTAAAA CTGGTTCATT ATCACGTACT GACCgTATTG 1380

CTAAATACAA TCAATTATTA CGTATCGAgA TGAATTATTT GAAACTGCTA AATATCACGG 1440 

TATCAAATCA TTCTATAACT TAGATAAATA ATTTTCTnTA TAATCAAATG CTGACATAAT 1500 

TTTAGTTGAG GATTATTATG ACGGTATAAA TAAATAAA6 1539
(2) INFORMATION FOR SEQ ID NO: 208: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 846 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 208:
CAATTTCTAT CTATCAATGA TGTGCATACT TCCAnTTAAA TTAAtCGAAA TGaATCAAGG 60 
TATATCATTC CTGCCTCTTT ATATAACaAC AAATAGTGAT TACAATATTT CGGTTATTAA 120 

CACGAAAATT TTACAAGCAC CTATTTCATT TACATATATA TACAGCAAAA AAGAAAGCCC 180 
AGAAATATTG GTGTTTATTA AATCATTTAA AAAGTATATT GCCAATGAAC AATTATAATA 240 
AATTTCAAAT CTAAAAAACC AAGAATGCGA 1TAATCATCA CATTCTTGGT TCAATTTTAT 300 
TCATGAATTT TTTCAACATT AAACGTTAAG TTATTGTCTG AATTTAAATT AACTTTAATC 360 
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CGTTGTACAA AACGTTTTAA TGGTCTTGCA CCGTATTGAG GTTCATAAGC TTCTTGACCT 480 
AGCCAAGCTT TAGCATCATC AGAAACTTCA ATTGAGATTC GTTGTTCTAA TAATCTTATA 540 
TTTAATTGCG TTAAGATTTT ATCTACAATC ATACTCATGT CATCAATAGA TAATGGTTTA 600

AATAATACGA TATCATCCAT ACGATTCAAA ATTTCTGGTT TGAAATATGC ATTTAAACTT 660 
GTCATAACAG CTTTTTCTGT TGATTCTGTA ATTTCACCAG TCTCTTTTAC GTTTTCTAAT 720 
AAAACTTGAG ATCCAATATT ACTTGTCATA ATAATAATAG TATTTTTAAA ATCAACGCTA 780 
CGTCCTTTAG AATCAGTTAA ACGGCtTCAT CTAAAATTTG CAATAATACA TTAAAGACGT 840 
CAGTAT 846

(2) INFORMATION FOR SEQ ID NO: 209: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1674 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 209:

nTGGGAACAG TAAGCCAGTA TTTTTAGAAA GTTGCCATAC ATGAGCGTCG ATTTTTCCAA 60

TATGGCTATG ACACTA6AAC AATGGGAATT TGGAGGAAAA GTAAATGATT AAACCTAAAA 120

TAGCATTAAC CATTGCAGGT ACTOATcCaA CAGGTGGTGC CGGCGTAATG GCTGATTTAA 180

AATCATTTCA TTCATGTGGT GTATATGGTA TGGGCGTCGT TACAAGTtnTT GTTGCTCAAA 240

ATACATTGGG CGTACAACAT ATTCATAATT TAAATCATCA ATGGGTAGAT GAACAACTTG 300

ATAGTGTCTT CAATGATACC TTACCTCATO CTATTAAAAC GGGGATGATT GCTACAGCAG 360

ATACTATGGA AACGATTCGT CATTATTTAA TGCAACATGA ATCTATTCCA TATGTAATtG 420

ATCCT6TTAT GTTGGCGAAA rCggTGATTC ATaATGGwTA ATGACaCAAg CaAAACTTGC 480

AGCATaCGTT ATTGCCATTA GCTGACGTAG TAACACCGAA TTTACCAGAA GCTGAAGAAA 540

TAACGGGACT AACCATTGAT AGTGAAGAAA AAATTATGCA GGCTGGCCGC ATCTTTATTA 600

ATGAGATTGG TAGTAAAGGT GTCATCATTA AAGGOGGTCA TTCAAATGAT ACTGATATA6 660

CAAAAGATTA TTTATTTACT AACGAAGGTG TTCAAACATT TGAAAATGAA CGATTTAAAA 720

CAAnACATAC GCATGGAACA GGGTGTACAT TTTCAGCAGT TATAACGGCA GAACTTGCAA 780

AAGGTAGACC ATTATTTGAG GCTGTACACA AGGCTAAAAA GTTTATTTCA ATGAGTATAC 840

AATATACGCC TGAAATCGGC CGTGGTAGAG GTCCAGTGAA TCATTTTGCA TATTTAAAGA 900
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TGTTATACAA ACGATGTAGT TAAAAATTTT ACAGCGAATG GTTTATTAAG TATTGGTGCT 1020 

AGCCCTGCAA TGAGTGAAGC TCCCGAAGAA GCTGAAGAAT TTTACAAAGT TGCACAAGCG 1080 

CTATTAATCA ATATCGGTAC TTTAACAGCA GAAAATGAAC AAGATATTAT TGCGATTGCT 1140 

CAAACGGCAA ATGAGGCAGG CTTACCTATT GTATTTGACC CTGTAGCTGT TGGTGCTTCT 1200 

ACATATCGAA AGCAATTTTG TAAATTATTA TTGAAATCAG CGAAAGTATC AGTAATTAAA 1260 

GGCAATGCAT CTGAAATATT AGCGTTGATT GATGATACAG CAACTATGAA AGGTACAGAT 1320 

AGTGAT6CTA ATCTTGATGC GGTTGCAATA GCGAAAAAGG tTACGCAACA TATAAAACTG 1380 

CAATAGTAAT CACAGGTAAA GAGGACGTTA TTGtTCmAGA TAATAAA6CC TTOGTATTAG 1440 

CTAATGGATC TCCATTATTA GCACGAGTAA CTGGAGCTGG TTGTTTATTA GGAGGCGTTA 1500 

TTGCTGGATT TTTATTTAGA GAAACAGAAC CAGACATAGA AGCGTTAATT GAAGCGGTAA 1560 

GCgkATTTAA TATTGCTGCT GAGGTAGCTG CTGAAAATGA AAATTGTGGT GGTCCTGGTA 1620 

CGTTTTCACC ATTGTTGCTT GATACX3TTAT ATCATTTAAA TGAAACAACC TATC 1674 
(2) INFORMATION FOR SEQ ID NO: 210: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2232 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 210: 

ATGAGTTGCC GATGAATTTA GCACCACCAA CGATTGCnTT TGATACTGTG TCCCAACCAG 60

CTTGTTTAGC ATATTTAATA CCTTCACGTA AAGGATCGTT ATCATATGCA GCAATACCAA 120 

ATACCJTTATG GTATTTCGTG TTTGAGTTAG TTACAACTTT GTnTTGCACT ACATCTGCAC 180 

CTTTCGCTAA TTGAGAAGTA CCGTTACCTG TTTCTAATAG GGCATGTGAG ATAAGATAAA 240 

CTTCATTAAT GCCATACATT TGAGCAGCTT TGTTAAATGC AGCACCTTGG TTTTCTAATA 300 

CACCTTTACC TTTTAAGAAT TGATTAATTT TATCAATAGA AATATTTTGT GGTTGGTCTA 360 

AGCGTAAGAA TTGATATTTT AATGCTGGAT CTTGAGCTAA ACGCTTCGTA TCCAITGCAT 420 

GCTTAACATC ATTAAATTTA GCATCTGTCC ACTTACCTGG TACACGTTGT ACTTGTGGTT 480 

TATATTGTAA ACCAGCTTGT ATTTGAGCAA CTTGGTTTAA TGTCATACCT GTTTGATTAT 540

ACTTAATTAA TTCTTTAGCT AAATCAGTTG ATTTAATCCA TGCtAATTTA CCGTTAGATA 600 

ATTTACCATA GTACCAAGTT TGTCCATTAA TGACTTGTTC TTTAACAACT GCGAATGGTT 660 

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10 



AACCATTACC ATTTTTAATT ACATAAGTGT AGTTATAATC TTTGGcAGCT GATGTAGTTG 780 

GTTTCACAGC AGTTGGTGCA GTTAAATCTT TTGCATTTAC CCAACCAGTG CGGTTATTAA 840 

TAGTACCGTA TAAATAAACA TCTTTGCCTA CAGATACTTG TTTCGTTGCA TTAAATGTAC 900 

CTTGAGCAAT GTTATTGCCT GTTAAAATGA CTTGGTTTTT AGTACCCCAA GGAACCATTG 960 

ATAAGCCGTT ATTTGATTTA TTAACAGTAT ATTTTTGAGT CGTTTTAACT TCTTTGCCTA 1020 

AQTTTTGAAC ATTTAAGTCT TTTACATTGA ACCAACCTAA TGGGATGTTA TGGCTTGTAT 1080 

TGTTTAATAA TACATACGTT TCATTACCAT GAGCACGCTC TTTTGTTACA TAGAACGTAC 1140 

GGTCTGCATA TTTCGCACCG TTTTTCGCTG TTTTTTCATA AACAGAAGCA CGAATACCAG 1200 

TGTTGTTTGG TTTAACTTGA GCAATCTTGC TAACTGTTTG AGTCGTTTGT GGTTTAGTAA 1260 

CAGTATAAGC TTTTACAGCT GTTTTTGGTT GTGCTACTGC TTTTTTAGGT GCAGCAGGTA 1320 

CAGCTAAATA TGCTTTACTT ACCCAACCAG ATTTACCATT TACAGTTCCA AATAAATAGA 1380 

TAGATTTATC AATTTGTTGT TGCTTAGTCG CTTTAAAAGT TTGGTTACCT GTACCAGAAA 1440

CTGCACCAGC TTCTTGTTTA TAAGTGCCCC AAGGTACTGA ATATAATTTA GTGCCTGGgT 1500 

TTACTGTATA TGTTTGCATT ACATTTACAG 6TGATTTTGC ATtGtTATAA ATACGTCACC 1560 

TTGTTTAACC CAACCAATTA AAGTTGGACT ATTGTAATCT TTAACTAAGT AGAATTTGTT 1620 

TCCACCTAAA CTTGCTTCTT TTGTTACAGC AAATGTTTIT TGAACTTCTT TCGTTGGCTT 1680

ACCAGTTTTG TCATAAACTG TAGTGAATAA GCCATTGTTT TTAGCATTAA TTTGAGCAAC 1740 

ACCGTTTAAT GATGAAACTG TTAATTTATT ATTTGTTGTA GGTGTTGATG GCTTAGGTGT 1800 

TGGTGTAGGC GTAGGTTTAG CAGTATCAAC TAAATATGCT TTACTTACCC AACCAGATTT 1860 

ACCATTCACA GAGCCATATA AATAAATTGA TTTATCAATT TGTTGTTGCT TTGAAGCCTT 1920 

AAATCTTTGG TTTCCAGAGC CAGACACACT ACCAGCAACT TGTTTAGATG TACCCCAAGG 1980 

TACTGTATAA AGTTTCGTAC CAGGTTTGAT TGAATATGAT TGATTTACAT TTACAGGTGA 2040

TTTAGCTGTG TTGTAAACCA CATCGCCTTC TTTAACCCAA CCAAATTTAT TACCAGAATT 2100 

GTAATCTTGA ACAAGATAGA ATTTTTGATT ACCTAATGTA GCTGTTTTAG ATACAGCAAA 2160 

TGTTTTTTGA ACTTCATTAG TTGCTTTACC AGTTTTGTCG TATACAOTAG TATATAAACC 2220 

ACTATTTGTT GG 2232 
(2) INFORMATION FOR SEQ ID NO: 211: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2082 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 211:

GATTTAAATA AAATTAATGG ATATCGTGAT CGTACGATGT TAGAACTTCT GTACGCAACG 60

GGAATGCGTG TATCTGAATT GATACATTTA QAGTTAGAAA ACGTGAACTT AATAATGGGA 120

TTTGTACGCG TATTTGGTAA AGGCGATAAA GAAAGAATTG TACCATTAGG CGACGCAGTC 180

ATTGAGTACT TAACTACTTA TATTGAAACG ATTAGACCGC AACTTTTAAA AAAGACTGTT 240

ACTGAAGTCT TATTTTTAAA TATGCATGGT AAACCTTTAT CACGACAAGC AATATGGAAA 300

ATGATTAAAC AAAATGGT6T AAAGGCAAAC ATTAAAAAGA CGTTAACGCC ACATACGTTA 360

CGCCACTCTT TTGCGACACA TTTATTGGAA AATGGCGCAG ATTTAAGAGC AGTGCAAGAG 420

ATGTtAGGtC ACTCTGaCtnT ATCTACTACC CtaaCTCTATA CmCATGTTTC GrAATCTCAA 480

ATTAGAAAAA TGTATAACCA ATTTCATCCT AGAGCATAAA GTGAACAATA ACTCAAAAGT 540

CACAATACAC ATGACTAAAA ATGTCTGTGC TATTGTGGCT TTTTTAAATT GGTTGATTAA 600

TTACGTCTAT GTTTTCTTAA TTGAATCGCT TCTTCTTTTG CTGCAATCAC TTCTGAACGA 660

TCACGGCGCA TGTGATGGTC TACAATAAAA GGATCTGTTG CTGTTTCCTG ATTATAATCA 720

TAGTCTGGAT AGTTGGCCTT GATGATGCGT TCAAAGACTG GAGTTATTGG TAATATAACA 780

GATGAAAAAG GCTTTGCTGC ATTCAATTTT GCAATCTGTT GCTCAATTAA CAACTGATAA 840

TCATTTAAAT TAAGGTATAA CGCATCTCTA TCTTTAGCAT TTTGTATTAT TTCTTTAGAT 900

TTATTAAAAG ACTTATAGGC 6CCTTTTAAA TTATTGCGGC GATAATGGTA ACAAGCAGTT 960

GCAAACAAGA TTAAACTAAC AACTGCATCT TGCTTACTGT AGTTATTTTC AGCTTTCCAT 1020

GCATCTTCTA AAATGTCATG ACATAGGAAA TAATGTTGCT TAGTATGAAA TTGATAATAG 1080

AAATTTATCA GTGCCTGTTG CATTTTGTTA TCACCCCAAT TTAAAAGTAA GTTATTTTCA 1140

TGCTATAATA TTTTAGAGAA TTATGCACAT ATGACGCAAT ACGAGGTAGA TATTATGTAT 1200

GAAGTTAAAT TAGATGCTTT CAATGGACCA TTAGATTTAT TGCTGCATCT TATCCAAAAA 1260

TTTGAAATAG ATATTTATGA TATTCCTATG CAAGCATTAA CAGAGCAGTA TATGCAGTAC 1320

GTTCATGCAA TGAAACAGCT TGAAATTAAT ATTGCAAGTG AATACCTAGT ATTAGCGTCA 1380

GAACTCTTAA TGATTAAAAG TAAGATGCTA TTACCACAAT CAACATCAGA TATGGATGTT 1440

GATGATGACC CACGGGAAGA TTTAGTtGGG CGTTTAATAG rATATCaAAA TTATArAGAA 1500

TATACTGCtA TTTTAAATGA CATGAAAGAA GAAAGAGATT TTTATTTTAC CAAAAAGACC 1560

GACAGATTTA TCtCATTXGG AAAcAGATGA ATCyTGGGAT CCaAATCATA CGATTGATTT 1620
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ATCTGTTGAA ATCCGAAAAG AGACATTTAC CATTCAACAA GCTACAGAAC AAGTGACATC 1740 
GAGATTGAAA GATAAAGATC ATTTTAACTT CTTTAGTCTG TTTACGTTTT CTGAGCCAAT 1800 

TGAACAAGTA GTCACTCACT TTTTAGCTAT TTTAGAGATG TCAAAAGCAG GAATAATTAA 1860
TATTGAGCAA CAACGTAATT TTGAAGATAT TAACATTATT AGAGGAGTGA ACTACCATTT 1920 
TGGATAATCA TGGTATATTA GAGTCGCTTT TATTTACAGC TGGCGATGAA GGTTTAGATG 1980 

AAAAACAACT ATTAGAAATA TTAGATATGT CGAAAGACCA ACTCGTTGAA TTAATTGAAA 2040 
ATTATTCATC ACATGGATTA ATGATACAAC GATTTG6AAT GA 2082 
(2) INFORMATION FOR SEQ ID NO: 212:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4219 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 212:

TCTATTCTCG TTCTTCCAAG ACCCTGaATT AGAAGTTAAG AAAATCGAAG AAGATGAGAA 60

AGAATCTATT AAAAAAGCTC AAAAAGGTAT TTATAAAGAC CCTAGAGACA TCAATGATGA 120

CGAACAAGAT GATGATACAA AAGATACTGT TGATAAAAAG GAATGATTGT AATTGCCTAA 180

CAAAAACACT CAAGAATATT GGGAAGAACG CGGACGCAAA GCAATCGAGA ATGAGTTGAA 240

GCGTGATAAA ACTAAAGCTG AAGAAATAGA ACGTATATTG AATATGATGA TTAAGCGCAT 300

TGAAAAAGAG ATCaATGCGT TTATTGTCAA GTACGGAGAT TTTGCAGGCG TTACATTACA 360

AGAA6CACAA AAGATTATTG ATGAGTTCGA TGTAAAAGCG TTTCAAGAAG AAGCAAAAAG 420

ATTGCStCGAA AACAAGGAGT TTAGCGATAG AGCAAATGAA GAATTAAAGA AGTATAACAC 480

GAAAATGTAT GTATCTAGAG AACAGATGTT AAAGATTCAA ATAGAATTCT TAATTGCTTA 540

TGCAACAGCT CAAACAGAAT TATCGATGAG GGAATATTTC GAATCAACAG CTTATCGTGT 600

GTTCAGTGAT CAAGCGGGTA TTTTAGGTGA AGGTGTACAA GTAGCTAAAG AAGTTATAGA 660

TACAATCGTT GATACACAAT TTCATGGTGT CGTTTGGTCA GAGCGATTAT GGACTAATAC 720

CGAAGCAATG AAACAAGAAG TAGAAGAAAT AATTGCTAAT GTAGTTATTA GAGGTCGACA 780

TCCTAATGAA TATGTTAAAG ATATGCGCAA CACTTAAATA AATTCGAAGG CACAGCACGA 840

CAAAAGACCG CAGCAATTAA ATCATTGCTT TATACGGAAT CGGCACGTGT TCACGCACAA 900

TCAAGCATTG ACAGCATGAA AGAAATTTCA CCGGAAGgAT ATTATATGTA TATTGCAAAA 960
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TGATTCGTTA GACGGTCTAC CTGTGGTTAA CCTTAAATCA AGCAACTTAA AACGTGGTGA 2880 

ATTAATCACT GGTGACTTCG ACAAATT6AT TTATGGTATC CCTCAATTAA TCGAATACAA 2940 

AATCGATGAA ACTGCACAAT TATCTACAGT TAAAAACGAA GATGGCACAC CTGTAAACTT 3000

GTTTGAACAA GACATGGTGG CATTACGTGC AACTATGCAT GTAGCATTGC ATATTGCTGA 3060 

TGATAAAGCG TTTGCTAAGT TAGTTCCTGC TGACAAAAGA ACAGATTCAG TTCCAGGAGA 3120 

AGTTTAATAA ATAATTAGGA GTGGTAACAT GCCCGAAATC ATTGGAATTG TTAAAGTAGA 3180 

TTTTACAGAT TTAGAAGATA ACAGACATGT CTATATGAAA GGGCATGTCT ACCCTCGTAA 3240 

AGGTTATAAT CCTACAGATG AACGTATCAA AGCTTTAGCT AGTGTTGAAA ATAAACGCAA 3300 

CAAACAAATG ATTTACATTG TAAATGACAA ATTAACCAAA AA^GAACTTG TCGAAATAGC 3360 

AAGTGTT6CT GGCTTACAAG TTGATGAAAA ACAAACAAAA GCTGAAATTA TCAATGCTTT 3420 

TGAGTCACTA GAGTAGGTGG TTATATGACT ACGCTAGCTG ATGTAAAAAA ACX3TATTGGT 3480

CTTAAAGATG AAAAGCAAGA TGAACAATTA GAAGAAATCA TAAAAAGTTG TGAAAGCCAG 3540 

TTGTTATCAA TGTTACCTAT TGAAGTTGAA CAAATACCGG AAAGgTTTAG TTACATGATT 3600 

AAAGAAGTTG CAGTTAAACG CTACAACAGG ATTGGTGCTG AAGtATGACA TCAGAAGCGG 3660 

TTGACGGACG TAGCAATGCG TATGAATTGA ACGATTtCAA GGAGTATGAA GCTATTATTG 3720 

ATAATTACTT TAATGCTAGA ACGAGAACTA AAAAAGGAAG GGCTGTGTTC TTTTGAGATA 3780 

TGAAGATAGA GTTATTTTTC AATTAGAACA AGTAGCAACT TACAATCCTA AAACTAGCAA 3840

AAAAGAAAAC ACACTAATCA CTTATGATGC GATACCATGC AATATTAACC CCATTTCTAG 3900 

AGCAAGAAAG CAACTTGAAT TTGGTGATGT AAAAAACGAT GTAAGTGTTC TGAGGATAAA 3960

AGAATCAATA TCTTACCCTG TTAGCCACX3T GTTGGTTAAT GGCATTCGCT ACAAGATAGT 4020 

TGAl&CAAGG ATATACAGAC ACGAAAGGTC ATATTATATC GAAGAGGTCA ATTGATGAAT 4080 

ATAGATGGAT TAGACGCACT GTTAAACCAA TTTCACGATA TGAAAACCAA CATTGATGAT 4140 

GATGTAGATG ATATTTTACA GGAAAACGCC AAAGAATATG TAGTACGAGC TAAATTGAAA 4200 

GCTAGAGAAG TAATGAATA 4219 

(2) INFORMATION FOR SEQ ID NO: 213: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1999 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

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GCTTACAAGT ATATTCATAA TTACATATTC AAGGTCCTTG CATGTGGTAT TTTGCTATGG 60

yCtTTaACTA CAACGGGGTC TAAGACTGCG TTTATCATAT TAATCGTCTT AGCCATTtAT 120

TyCTTTATkA AAAAGTTATT TAGTAGAAAT GCGGTAAGTG TTGTGAGTAT GTCAGTGATT 180

ATGCTGATAT TACTTTGTTT TACCTTTTAT AATATCAACT ACTATTTATT CCAATTAAGC 240

GACCTTGATG CCTTACCGTC ATTAGATCGA ATGGCGTCTA TTTTTGAAGA GGGCTTTGCA 300

TCATTAAATG ATAGTGGGTC TGAGCGAAGT GTTGTATGGA TAAATGCCAT TTCAGTAATT 360

AAATATACAC TAGGTTTTGG TGTCGGATTA GTGGATTATG TACATATTGG CTCGCAAATT 420

AATGGTATTT TACTTGTTGC CCATAATACA TATTTGCAGA TCTTTGCXXyi ATGOGGCATT 480

TTATTCGGTG CATTATTTAT CATATTTATG CTTTATTTAC TGTTTGAATT ATTTAGATTT 540

AACATTTCTG GGAAAAATGT AACAGCAATT GTTGTAATGT TGACGATGCT GATTTACTTT 600

TTAACAGTAT CATTTAATAA CTCAAGATAT GTOGCTTTTA TTTTAGGAAT TATCGTCTTT 660

ATTGTTCAAT ATGAAAAGAT G6AAAGGGAT CGTAATGAAG AGTGATTCAC TAAAAGAAAA 720

TATTATTTAT CAAGGGCTAT ACCAATTGAT TAGAACGATG ACACCACTGA TTACAATACC 780

CATTATTTCA CGTGCATTTG GTCCCAGTGG TGTGGGTATT GTTTCATTTT CTTTCAATAT 840

CGTGCAATAC TTTTTGATGA TTGCAAGTGT TGGCGTTCAG TTATATTTTA ATAGAGTTAT 900

CGCGAAGTCC GTTAACX3ACA AACGGCAATT GTCACAGCAG TTTTGGGATA TCTTTGTCAG 960

TAAATTATTT TTAGCGTTAA CAGTTTTTGC GATGTATATG GTCGTAATTA CTATATTTAT 1020

TGATGATTAC TATCTTATTT TCCTACTACA AGGAATCTAT ATTATAGGTG CAGCACTCX3A 1080

TATTTCATGG TTTTATGCTG GAACTGAAAA GTTTAAAATT CCTAGCCTCA GTAATATTGT 1140

TGCX3TCTGGT ATTGTATTAA GTGTAGTTGT TATTTTTGTC AAAGATCAAT CAGATTTATC 1200

ATTCTATGTA TTTACTATTG CTATTGTGAC GGTATTAAAC CAATTACCTT TGTTTATCTA 1260

TTTAAAACGA TACATTAGCT TTGTTTCGGT TAATTGGATA CACOTCTGGC AATTGTTTCG 1320

TTCGTCATTt AGCATACTTA TTACCAAATG GACAGCTCAA CTTATATACT AGTATTTCTT 1380

GCGTTGTTCT TGGTTTAGTA GGTACATACC AACAAGTTGG TATCTTTTCt AACGCATTTA 1440

ATATTTTAAC GGTCGCAATC ATAATGATTA ATACATTTGA TCTTGTAATG ATTCCGCGTA 1500

TTACCAAAAT GTCTATCCAG CAATCACATA GTTTAACTAA AACGTTAGCT AATAATATGA 1560

ATATTCAATT GATATTAaCA ATACCTATGG TCTTTgGTTT AATTGCaATT ATGCCATCAT 1620

TTTATTTATG GTTCtTTGGT GAGGAATTCG CATCAACTGT CCCATTGATG ACCATTTTAG 1680

CGATACTTGT ATTAATCATT CCTTTAAATA tGTTGaTAAg CaGGCAATAT TTAtTAAtAG 1740
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TATGTAyTAT TTTGATATAT TTTTATGGAA TTTACGGTGC TGCTATTGCG CGTTTAATTA 1860 

CAGAGTTTTT CTTGCTCATT TGGCGATTTA TTGATATTAC TAAAATCAAT GTGAAGTTGA 1920 

ATATTGTAAG TACGATTCAA TGTGTCATTG CTGCTGTTAT GATGTTTATT GTGCTTGGTG 1980

TGGTCAATCA TTATTTGCC 1999 



(2) INFORMATION FOR SEQ ID NO: 214: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7769 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 214:

TCATTATTAA GACTATTATA TATAATGAAT TTTAACTGGT TTATTAAACG AGAACGTCGG 60

GAATTAAGTA ACTACAATAA AAATAAGATA TGACAATAAG GAGACTACAC GCGTGATCAT 120 

TGCCATAATT ATATTGATAT TTATTTCGTT TTTCTTTTCA GGAAGCGAGA CGGCATTAAC 180 

GGCTGCCAAT AAAACAAAAT TTAAAACTGA AGCTGACAAA GGTGATAAAA AAGCAAAAGG 240 

CATTGTAAAG TTACTTGAAA AACCAAGTGA GTTTATTACA ACGATTCTAA TTGGGAATAA 300 

TGTCGCGAAT ATTTTATTAC CAACACTTGT TACAATTATG GCTTTACGTT GGGGGATTAG 360 

CGTTGGTATT GCATCAGCTG TTTTAACAGT TGTTATCATT TTGATCTCCG AAGTGATTCC 420 

CAAGTCTGTC GCTGCAACAT TTCCAGATAA AATAACAAGG CTTGTATATC CAATTATTAA 480 

TATTTGTGTC ATTGTGTTCC GTCCTATCAC ATTACTTTTA AATAAGTTGA CGGACAGTAT 540

TAATCGAAGT TTATCTAAGG GCCAACCTCA AGAACATCAA TTTTCAAAAG AAGAATTTAA 600 

AACAATGTTA GCAATTGCTG GACATGAAGG TGCTTTAAAT GAAATTGAGA CGAGTA6GTT 660 

GGAAGGTGTC ATTAATTTTG AAAATTTAAA AGTAAAAGAT GTAGATACAA CACCTAGAAT 720

TAATGTGACG GCATTTGCTT CAAATGCGaC ATACGAAGAA GTTTATGAAA CGGTTATGAA 780 

TAAGCCATAC ACTAGATATC CAGTGTACGA GGGAGATATT GATAACATTA TTGGGGTGTT 840 

TCATTCTAAA TATCTGTTGG CTTGGAGTAA TAAAAAAGAA AATCAAATTA CAAACTATTC 900 

AGCTAAGCCA TTATTTGTGA ATGAACACAA TAAAGCTGAA TGGGTATTAC GTAAGATGAC 960 

TATTTCTAGA AAACATTTAG CAATTGTGTT GGACGAATTT GGTGGTACTG AAGCGATAGT 1020

GTCACATGAA GACTTAATTG AAGAATTATT AGGTATGGAA ATTGAAGATG AGATGGATAA 1080 

AAAGGAAAAA GAAAAACTTT CTCAACAGCA AATTCAATTT CAACAACGGA AAAATCGCAA 1140 

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GTATTGAATA TCCAATTATA CAAGCAGGTA TGGCAGGAAG TACX3ACACCG AAATTAGTTG 1260

CATCAGTAAG TAACAGTGGT GGGTTAGGCA CAATAGGCGC AGGTTACTTT AATACGCAGC 1320

AATTGGAAGA TGAAATAGAT TATGTACGCC AATTAACX3TC AAATTCTTTT GGCGTAAATG 1380

TCTTTGTACC AAGTCAACAA TCATATACCA GTAGTCAAAT TGAAAATATG AATGCATGGT 1440

TAAAACCTTA TCGACGCGCA TTACATTTAG AAGAGCCGGT TGTAAAAATT ACCGAAGAAC 1500

AACAATTTAA GTGTCATATT GATACGATAA TTAAAAAGCA AGTGCCTGTA TGTTGTTTTA 1560

CTTTTGGAAT TCCAAGCGAA CAGATTATAA GCAGGTTGAA AGCAGCGAAT GTCAAACTTA 1620

TAGGTACAGC AACAAGTGTT GATGAAGCTA TTGCX3AATGA AAAAGCGGGT ATGGATGC7A 1680

TC6TTGCTCA AGGTAGTGAA GCAGGTGGAC ATCGTGOTTC ATTTTTAAAA CCTAAAAATC 1740

AATTACCTAT GGTTGGAACA ATATCTTTAG TGCCACAAAT TGTAGATGTC GTTTCAATTC 1800

CGGTCATTGC CX3CTGGTGGA ATTATGGATG GTAGAGGAGT TTTGGCAAGT ATTGTCTTAG 1860

GTGCAGAAGG GGTACAAATG GGCACCGCAT TTTTAACATC ACAAGACAGT AATGCATCAG 1920

AACTACTGCG AGATGCAAT7 ATAAATAGTA AAGAAACAGA TACAGTCATT ACAAAAGCGT 1980

TTAGXGGAAA GCTTGCACGC GGTATCAACA ATAGGTTTAT CGAAGAAATG TCCCAATACG 2040

AAGGCGATAT CCCAGATTAT CCAATACAAA ATGAGCTAAC AAGTAGCATA AGAAAAGCCG 2100

CAGCAAACAT CGGCGACAAA GAGTTAATAC ATATGTGGAG TGGACAAAGC CCGCGACTAG 2160

CAACAACGCA TCCCGCCAAC ACCATCATGT CCAATATAAT CAATCAAATT AATCAAATCA 2220

TGCAATATAA ATAATCGACC GCAATCCACA AAAGCACAAG CACCCCCAAA CATTATTTTA 2280

GTGCTTGCCA TTTTTGTGGA TTGCGTTTCT ATTTTACCAA TTTAATCAAA CGAAAACATC 2340

AAGCTGAAGA TCGCXTGAAAG ATTTTAATCA AGCAAAAACA TCAAACTAAA GTTCGCTGAA 2400

ATGATTATGA TAAAAGTTAT ATGGTATGAT GACATTGGTG ATATATATGA TAAACATCGG 2460

ATTAACAGGT TGGGGTGATC ACTATTCATT ATATGAAGAT TTAGAACGCC AAACCGATAA 2520

ACTTAAAACA TATGCTGGAC ATTTTCCGGT TGTCGAATTA GATGCGACAT ACTATGCGAT 2580

ACAACCGGAA AGAAATATAT TGAAATGGAT AAAAGAAACG CCTGATACAT TTGAATTTGT 2640

GGTCAAAATT CATCAAGCAC TcACATTGCA TGCAGACTAC AAAACATTTG CAGATACAAG 2700

GCAAGAACTA TTTGATCAAT TTAAGAATAT GTTAGAGCCC TTACATACAC AGAAAAAATT 2760

AGCAATGGTA TTGGTTCAAT TTCCGCCATG GTTTGACTGC AATGCACAAA ATATCAAATA 2820

TATTTTGTAT GTAAGACAGC AATTACAAGC ATTTCCAATG TGTGTAGAAT TTAGGCATCA 2880

ATCATGGTTT AGTGATGCAT TTAAAGAACA AACATTGGCA TTTTTAACAG AACATCAAAT 2940
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AATCACAAAT GAAATTGCGT TTGTACGTTA TCATGGACGT 
AATCATTACG GTTGGACTAA 3060

GAAAGATATG TCAGATCAAG AATGGCGCGA TGTACGCTAT TTATATGATT ATAATGAGCA 3120

AGAATTAATA GACTTGGCAC AAAAGGCACA AATATTAGCA CAAAAAGCTA AGAAAGTTTA 3180

CGTCATATTT AACAATAATT CTGGTGGTCA TGCAGCAAAT AATGCCAAAA CATATCAGCG 3240

ATTATTGAAT ATAGAATATG AAGGGTTAGC ACCACAACAA TTAAAATTAT TTTAAGAGGC 3300

GACGACTATG TTATTAACAA TTACATTATT AGTTTTAATC GGAGGTTTGT CAGCGATTAT 3360

AGGGTCTATC GTAGGCATTG GAGGCX3GTAT TATTATCGTT CCAACAATGG TTTACCTCGG 3420

TGTTQAACAT GGATTACTAC ATAATATTAC AACACAAGTA GCGATAGOGA CGTCTTCAOT 3480

CATTCTAATT GTGACAGGAC TTTCTTCATC ACTTGGATAT TTAAAAACAA AACAAGTTGA 3540

TATTAAAAAT GGTTCCATCT TTTTATTTGQ ACTATTACCA GGTTCATTGC TTGGGTCCTT 3600

CATTAGTAGA TATTTAACAT TTGAGTCATT TAATTTATAT TTTGGTATCT TTTTAATTTT 3660

CGTAGCCATT TTATTAATGG TAAGAAATAA GATTAAACCG TTTAAAATTT TCGATAAACC 3720

CAAGTATGAA AAGACTTATG TAGACGCTAA AGGTAAAACA TATCATTATA gTGTTCCACC 3780

ATTGTTTGCT TTTATTACAA CGTTTTTAAT TGGTATATTG ACAGGTTTAT TTGGTATTGG 3840

AGGTGGCGCA CTAATGACGC CACTAATGCT TATTGTATTT AGATTTCCAC CTCATGTAGC 3900

TGTTGGAACA AGTATGATGA TGATTTTCTT TTCAAGTGTC ATGAGTTCTA TAGGGCACAT 3960

TGCTCAAGGT CACGTAGCTT GGGGTTATGC AATCATnTTA ATTATTTCTA GTTATTTTGG 4020

TGCGAAAATC GGTGTCAAAG TGAATCAATC AATTAAGTCA GATACGGTAG TAACATTATT 4080

GAGAACAGTA ATGTTGTTAA TGGGTATATA TTTAATTATT CGTGCGTTGA TTTAATACAA 4140

CTTTAAAAGG AGGACGTCAA TTTGAGGCTT ACAATTTATC ATACGAACGA TATTCATAGT 4200

CATTOACATG AATACGAACG CATTAAAGCA TATATGGCAG AACATCGGCC ACGACTTAAT 4260

CATCCTTCTT TATATGTTGA TCTAGGTGAT CATGTAGATT TATCCGCACC TATAACTGAA 4320

GCAACTTTAG GTAAAAAGAA TGTGGCATTA CTAAATGAAG CAAAATGTGA TGTTGGAACA 4380

ATCGGTAATA ATGAAGGGAT GACCATTTCA TACGAAGCTT TAAATCACCT TTACGACGAA 4440

GCAAAATTTA TAGTGACATG TAGCAATGTT ATAGATGAAT CAGGTCATTT ACCAAATAAT 4500

ATCGTTTCTT CTTATATTAA GGACATAGAC GGTGTGAAAA TACTATTCGT TGCAGCGACA 4560

GCACCTTTTA CCCCATTTTA TCGTGCACTA AATTGGATTG TTACCGATCC ACTTGAATCT 4620

ATAAAAGAAG AAATTGAACT TCAACGAGGT AAATTTGATG TATTAATCGT GCTAAGTCAT 4680

TGTGGCATTT TCTTCGATGA AACATTATGC CAAGAATTGC CTGAAATTGA TGTCATTTTT 4740
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GCAgCTGGAA AGTATGGTAA TTATCTTGGA GAGGTTAATT TAACTTTTGA GGCACATAAA 4860

GTAGTACATA AAACTGCAAA GATTATTCCT TTAGAAACAT TACCTGAAGT TGAAACTTCA 4920 

TTTGAAGAAG AAGGAAAAAC GTTAATGTCC AATTCAGTAA TTCAACATCC AGTAGTGCTT 4980

AAGCGTAGTA TGAATCACAT AACTGAAGCT GCATACTTAT TAGCTCAAAG TGTTTGTGAG 5040 

TATACACATG CACAATGTGC CATCATCAAT GCTGGCTTAC TCGTTAAAGA TATTGTAAAA 5100 

GATGAAGTGA CAGAATATGA CATTCATCAA ATGTTACCGC ATCCGATTAA TATGGTAAGG 5160 

GTTAGACTTT TTGGTGTGAA ATTAAAAGAG ATTATAGCTA AAAGTAATAA ACAAGAATAT 5220 

ATGTATGAAC ATGCACAAGG TTTOGGTTTC AGAGGGAATA TATTTGGAGG ATATATTCTT 5280 

TATAATTTAG GGTACATTCA TTCTACAGGG CGTTACTATC TGAATGGAGA AGAAATOGAA 5340 

GACX3ACAAAG AATATGTACT AGGTACGATA GATATGTATA CGITCGGTCXS TTATTTCCCA 5400 

ACATTGAAAG AATTACCAAA AGAGTATTTA ATGCCAGAGT TTTTAAGAGA TATATTTAAA 5460

GAAAAATTAT TGGAATATTA AAAAGTAAGA TTATTGGATT TTCATTTGTC ATGAATTTCG 5520 

ATATAATGTT TAAAGATACA CTTAACAGGA GGGTATGTGT TGITATGGCG ACAAAAAACG 5580 

AGGAAATATT ACGTAAACCG GATTGGTTGA AAATAAAATT AAATACCAAC GAAAACTATA 5640 

CAGGACTTAA GAAGATGATG AGGGAAAAAA ATCTTAATAC TGTATGTGAA GAAGCTAAAT 5700 

GTCCTAATAT ACATGAATGT TGGGGTGCAC GTCGTACAGC GACATTTATG ATTTTAGGTG 5760 

CCGTATGTAC AAGAGCTTGT CGTTTTTGTG CGGTTAAGAC AGGTTTACCT AATGAACTTG 5820 

ATTTAAATGA GCCTGAACGT GTAGCTGAAT CAGTTGAATT AATGAATTTG AAACACGTTG 5880 

TTATCACTGC TGTTGCGCGT GATGATTTAA GAGATGCTGG TTCAAATGTT TATGCTGAGA 5940

CAGTACGTAA AGTTAGAGAA AGAAATCCAT TTACAACGAT TGAAATTTTA CCATCAGATA 6000 

TGGGCGGGGA CTATGATGCG TTAGAAACAT TAATGGCGTC AAGACCTGAC ATTTTAAACC 6060 

ATAATATTGA AACTGTTCGT CGCTTAACAC CGAGAGTTCG TGCXXTCTGCG ACTTACGACA 6120

GAACATTAGA GTTTTTAOGT CGTTCAAAAG AATTACAACC GGATATCCCA ACTAAATCAA 6180 

GTATTATGGT TGGATTAGGT GAAACTATAG AAGAAATTTA TGAAACXSATG GATGATTTAC 6240 

GTGCGAATGA TGTAGATATT TTAACGATTG GTCAATATTT ACAACCTTCA CGTAAACATT 6300 

TAAAGGTTCA AAAATATTAC ACGCCTTTA6 AGTTTGGTAA ATTAAGAAAA GTGGCAATGG 6360 

ATAAAGGGTT TAAACATTGC CAAGCTGGAC CTTTAGTACG TAGTTCTTAT CATGCGGATG 6420

AGCAAGTAAA TGAAGCTGCT AAAGAAAAGC AACGCCAAGG TGAGGCACAG TTAAATAGTT 6480 

AATATTTAAC CATTAATAAG GCATAAAGGC TTAGTTTGTA CAAAACGAAC GTGTCATAGA 6540 

EP 0 786 519 A2 

AGGTGAAGAA TTTGATAAAA GTAGATCAAC ATTACTTTGA ATTAATAGAA AATTATCGCG 6660 

AATGTTTTAA TOAAGAACAA TTTATTGCTA GGTATTCAGA TATTTTAGAT AAATATGATT 6720 

ACATAGTTGG TGACTATGGT TACGATCAAT TACGATTAAA AGGTTTTTAC AAAGATTCTA 6780

ATAAAAAAGC AGAGATGAGT AAACGTTTTT CAAATATTCA AGATTACATA TTTGAATATT 6840 

GTAACTTTGG TTGTCCTTAC TTTGTATTAA GACATTTGTC TAAACAAGAG GTTAAAAAGT 6900 

TAATCGAAGA AGTTCATCCG TCTGATGTGA TAGATGACGA CAATAAACTT CAAGATGTGA 6960 

AGATTAAGCC AACCATTCAA GATACTGAAC ATTAATAAAA CCCTTAGCTA GATTGAAAAT 7020 

GGGAATCATG CAATTCAAGC ATGGACCTGT AATCTAGTTA GGGGTTTTTA TCTTTAATGA 7080 

atgacttcat ttaaatactc agtaatttca tcgccttctt cagcatttac acctaaaata 7140 

tgagcgatat agccttcttc ttttaaatca tca6taccga taataccgaa tttatttgtt 7200 

tgcatattaa gtacgagtgt cttaccataa tgtctatttg tatggactaa catcaaatca 7260 

tatcgactat gctcgccaac aaaaccaaca aactgaactt gactctcttc gttgtcatca 7320 

tataaataca tatcaatcat tttgtagcga ctccttttaa aagtagtaaa gttagtataa 7380 

cgacaaatga agtatactgc aaaattatga taatatataa gtqagaggtg acaaggaatg 7440 

tattttgtag acaaagataa actaactcag aaattagcct atttacaagc attaactgat 7500 

gattatcatg agagcaagca caatcattat gcatttgaac gcattgctca aatgttgata 7560 

gaatcatcgg tagatatagg gaatatgatt atcgatgcat ttattttaag ggatcctggt 7620 

aattataaag atgtgattga tatattagaa ctagaaaatg ttattactaa agaaacacag 7680 

caggcgatta ataaaactgt cggtattcgt aaacaattta catatqatta cacagcctta 7740 

gatgttgaga ttatcatgcc aatgtttga 7769 
(2) INFORMATION FOR SEQ ID NO: 215:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 644 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215:

ACCGCCACCC ATTAATGATT GCTTAAAATC AATAGTCGTA CCATTTAATA CGGGTGCATC 60 

TTTTTTGTCT ACTAATACTT TTAATCCAAA GTATTCTAAG ACTTCATCAT TTTCACCAGG 120 

CGCTTCTTCT GCACCCATAC CGTATGTTAA ACCAGTGCAC CCGCCACCAT TCACTTTAAT 180 
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TGCTTCTGTT AATATAACTG TTGGCATGAT AACTCCTCCT TAAAAAATCC AAGTTTCTTT 300 

TATATGTGCA TATATATTTT GTAATAATTC TTCCGGCGAA TCACCTTCAA CAATATCACC 360 

ATTTACTAAA GCATACAACC CGGCTGAACA TATACCACAA TGTGTCAGGC AACCATACTC 420 
TAACACATCG ACATCTGGGT CATTTTCCAG TTGATTAAAA ACATAATCTC CACCTTTTGC 480 
CATGTTAGAG AGACAAAATT CTACX3ATCGG ATTCATACTT CACCTTCTTA TTTCATTTGT 540 

TACAATATTA TAGCATTTTA AAACTGGTAT TTTAACATGA TGTGCTCAAT TAGCAACAAC 600 
TGATGTTTCT TATCCCAGTT ATGTAATAGT GCCTTAGTTA GTAC 644 
(2) INFORMATION FOR SEQ ID NO: 216:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1578 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216: 

GAATGATGAA AGGAATAGAA AAGAAAAGAT AAATAATGTA ATAGATTTAT CCGAGAAAAT 60

TGAAAGAACA AAAGATATGC CAATCAAGAA TACTATAACT ACTCAATTAG GAAATAAACT 120

TATTGGCACA AAAAAAGCTC GTTTTGATGA TAAGAAAGTA GTGTCGTTTG GAGCATTTGA 180

AGATGAATAA AATAAATGAT AGAGATTTAA CAGAATTGAG TAOCTATAGG GTITATCAAG 240

ACATCAATAA AGATAATGAC TTTACAGTTA ACGAAAAACG ATTTAAGCAG GCAGATGTAT 300

TTGAAGATTT ATATAGAGAG AAACTAAAAG ACACAAATAA ATTAAGAGAG TATAATTATT 360

TACAAAATGA AACTTTTAAA AGCGCATAAA TAGGTGATGA GATATGCTTA AAAAAGCAAA 420

ATTTfiTCTTA ATGGCAACGA TACTACTATC AGGATGTTCA ACTACCAATA ACGAATCCAA 480

CAAiCGAAACA AAATCTGTAC CAGAAGAAAT GGATGCTTCA AAATATGTAG GACAAGGATT 540

CCAACCACCT GCAGAAAAAG AT6CGATTGA ATTTGCAAAG AAGCATAAAG ATAAAATTGC 600

TAAGCGAGGC GAACAATTTT TTATGGATAA CTTCGGTCTA AAAGTTAAAG CTACAAATGT 660

TATAGGTAGT GGCGATGGTG TAGAAGTATT CGTGCATTGT GATGACCACG AyATCGTATT 720

TAATGCGAGT ATTCCATTTG ATAAATCAAT wATTGAsAGT GATAGCTCAT TAAGAAGTrA 780

GGAyAAAGGy GATGATATGA GTACTTTAGT TGGTGCAGTA CTCAGTGGGT TTGAATATCG 840

AGCACAAAAA GAAAAATATG ATAAATTATA TAAATTTTTC AAAGATAATG AAGAGAAATA 900

TCAATATACA GGATTTACAA AAGAAGCAAT TAATAAGACG CAAAATAGTG GTTATGAAAA 960
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to 



ACCATTGTTA AACAAAAGTG ACAGTGAATT TTCAAAAGAA TTGTCAAATG TTAAGAAGCA 1080

ATTAAAAGAT AAGTCTAAAG TTTCGGTAAC TACTACTCTA TTTAGTAAAA AAAAGAACTA 1140

TACTAAAAAA AGTAACAGTG AAAATGTAAT AAAAATGGCA GAAGAAATAA AAAAAGATAA 1200

AGAGATACCA AACX3GTATAG AGCTTAGTAT AAAATTTTCG GACAATAAAA TAAATACGGT 1260

TAAACCAAAT TTTAACGGTG aAAGCACTTC AGAATATGGT GTGTTTGATC AAGAATAAAA 1320

TTAATGATOa AAATTTAACG GAGAATAGTG TATATTGAGT AGATCmAGAA TAAAAAGATA 1380

ATTCTACTAT TGTTGTGAAG GCAAATAAGT AGAAGATTTT AAGTGTAATT TCTGGTGATT 1440

TAAATAATAA TATAnATGGn AGTACTGATA TAAnACTTTT TAACCTACTA GATTCTTATA 1500

ATTTGCTTTC CATTTTATGA CGATTTTTAC TCCAATTGAG TGATAGAATC CAAAAAAGCC 1560

ATCTCCAAAA ATTAATCC 1578

(2) INFORMATION FOR SEQ ID NO: 217:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5137 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear
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40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217:

TGTTTTCCTT GGGTTAAAAC ATGCTTGCTA TGCGTTTGTA AATATGACTT GCTGTTTTnA 60

CCroxiATACC CGTCACACCA TGGAAGTAAA AATGTTTCTT GCTCTTGGCT TACAATTTTA 120

GCTTTAATCG CTTCATATGC TTTATATTGG TCTTCTGTTA ATTGCTGTTT TGATTCTTGT 180

TCGAAAACAC GATCTTTAAA TGGGTCTCTT TCAACAACCG CGTCATATTT TTCAACATAA 240

CCrmTTGA TAAGTCCATC TAAACTGGAT TTTGAAAAGC CCATATCCTC AATATCAGTT 300

AAAAATATTG TTTTATGTTG TTCTTCAGAC AAGTAAGCAT ACAAATCGTA TTGTTTAATA 360

ACTTTCTCCA ACTTAGCTAA TACTTCATCA GGATGATACC CTTCAATGAC ACGAACAGCA 420

CGCTTGGTTT TTTTAGTTAT ATTTTGTGTG AGAATCGTTT TTTCTTCAAC GATATCATCT 480

TTTAACAACT TCATAAGCAA TTGAATATCA TTATTTTTTT GCGCATCTTT ATAATAATAG 540

TAACCATGCT TATCAAATTT TTGTAATAAA GCTGAAGGTA GCTCTATGTC ATCTTTCATC 600

TTAAATGCTT TTTTATACTT CGCTTTAATA GCACTCGGAA GCATCACTTC TAGCATAGAA 660

ATACGTTTAA TGACATGAGT TGAACCCATC CACTCACTTA AAGCTATTAA TTCTGATGTT 720

AATTCTGGTT GTATATCTTT CACTTCTATG ATTTTTTTTA ACTTOGAAAC GTCAAGTTGT 780
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ACAATTACAC GCACACCAGG TTGGATGACA GATTCGAGTT GTTCGGGAAT TATATAATCA 900 

AATTTATAGT CAACGCTCTT CGACGCGACA TCGACTATGA CTTTCGCTAT CATTATTGCC 960 

ACCTAGTTTC TAGTTCATCT AAAATTTGTG CAGCTAATAC TACTTTTTTT CCTTTCTTGA 1020 

TATTTACTTT TTCATTATTT TTAAAATGCA TTGTCAATTC ATTATCATCA GAACTAAATC 1080 

CGATAGACAT ATCCCCAACA TTATTTGAAA TAATCACATC TGCATTTTTC TTGCGTAATT 1140 

TTTGTTGTGC ATAATTTTCA ATATCTTCAG TCTCTGCTGC AAAGCCTATT AAATACTGTG 1200 

ATGTTTTAT6 TTCACCTAAA TATTTAAGAA TGTCTTTAGT ACGTTTAAAA GATACTGACA 1260 

AATCACCATC CTGCTTTTTC ATCTTATGTT CTAATACATC AACCQCyrGTA TAGTCAGATA 1320

CX»3CTGCTGC TTTTACAACA ATATCTTGTT CGTCAAATCG GCTTGTCACT TGTTCAAACA 1380 

TTTCTTCAGC ACTTTGAACA TGAATAACTT CAATATCTTT TGGATCCTCT AGTCTTGTAG 1440 

GACCAGCAAC TAACGTCACG ATAGCTCCTC GATTTCGCAA TGCTTCAGCT ATTGCATAGC 1500 

CCATTTTTCC AGAAGAACGA TTGGATACAA ATCTGACTGG ATCGATAACT TCAATAGTTG 1560 

GTCCTGCTGT AACCAATGCG CGTTTATCTT GAAATGAACT ATTAGCTAAA CGATTACTAT 1620 

TTTGAAAATG AGCATCAATT ACAGAAACGA TTTGAAGCGG TTCTTCCATA CGTCCTTTAG 1680 

CAACATAACC ACATGCTAGA AATCCGCTTC CTGGTTCGAT AAAATGATAC CCATCTTCTT 1740 

TTAAAATATT AATATTTTGC TGCGTAOGTT TATTTTCATA CATATGCACA TTCATAGCAG 1800

GCGCAATAAA TTTCGGTGTC TCTGTT6CTA GCAACGTTGA TGTCACCAAA TCATCAGCAA 1860 

TACCTACACT CAATTTTGCA ATTGTATTTG CCGTTGCAGG TGCAACAATG ATTGCATCTG 1920 

CCCAATCACC TAATGCAATA TGCTGTATTT CTGAAGGATT TTCTTCTATA AAAGTATCTG 1980

TATAAACAGC ATTTCGACTT ATTGCTTGAA ATGCTAATGG TGTCACAAAT TTTrGTOCGT 2040 

GATTeCTTAA CATAACGCGA ACTTCATACC CAGATTGTGT TAACTTACTT GTCAAATCAA 2100 

TTGCTTTATA TGCCGCAATG CCACCTGTAA CGGCTAATAA TATTTTCTTC ATATTCAATC 2160 

TCCCTTAAAT ATCACTATGA CATTTAOGCT TTACATCATC ATATGCGCAC AAATGCTCAT 2220 

TACTTTTTTA TAGATACAAA TTTAGTATTA TTATAACATC AATCATTGGA TAAACTAAAA 2280 

AAACACACCT ACATAGGTGC GTTTGATTTG GATATGCCTT GACGTATTTG ATGTACGTCT 2340 

AGCTTCACAT ATTTTTAATG GTCGAAACTA TTCTTTACCA TAATAATCAC TTGAAATAAC 2400 

AGGGCGAATT TTACOGTCAG CAATTTCTTC TAACGCTCTA CCAACTGGTT TAAATGAATG 2460

ATATTCACTT AATAATTCAG TTTCAGGTTG TTCATCAATT TCACGCOCTC TTTTCGCTGC 2520 

AGTTGTTGCA ATTAAATACT TTGATTTAAT TTGTGaCGTT aATTGGTTtA AAgGTGGATT 2580 
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TTTArGTGcT CAGCTTCTAC AATACATTGA ATTCtATTCy TCGcAAGtTC TACTTCAtCA 2700 

TTAACTACAA cGTAAyCGTA TAAATTCATC ATTTCrACTT CTkTACGCGC yTCGTTAATA 2760 

CGACTTTGTA TTTTCTCATC AGATTCTGTT CCTCTACCTA CTAATCGCTC TCTCAAGTGT 2820 

TCTAAACTTG GAGGTGCTAA GAAAATAAAT AGCGCATCTG GAAATTTCTT TCTAACTTGC 2880 

TTTGCACCTT CTACTTCAAT TTCTAAAAAT ACATCATGAC CTtCGTCCAT TGTATCTTTA 2940 

ACATATTGAA CTGGTGTACC ATAATAGTTG CCTACATATT CAGCATATTC TATAAATTGG 3000 

TCATCTTTGA TTAAAGCTTC AAACGCATCC CTAGTTTTAA AAAAGTAATC TACOCCATCA 3060 

ACTTCACCTT CAC6CATTTG ACGTGTTGTC ATTGAAATAG AATACTTATA TGATGTACTT 3120

GGATCTTCAA ATATnCGTnT TCTAACAGTA CCTTTACCTA CTCCAQATGG TCCTGATAAA 3180 

ACGATTAACA ATCCTTTTTC ATTATCCATG CCTTACTGACC TCTCTAAGCT AATCTTCTAT 3240 

TATTTAAATA TGATATCACA TTGTTCTTTA TATTGTATAG CATATTTGAA ATTGCATGCC 3300

ATAATTTCTA TTAAGTCTAA CAATATCGTT ATATTGCACG ATTAATTTTA ATTAAATAAA 3360 

TTGAATTGCA AACTTTTAGA TAATGTAAAA TGTATGGCAT AATGTATGGT TCAATAACTA 3420 

TACTGAAAAG TTACAATCAT GTTAAAATGA AACGAATGAT ATGAAGAAGG TGGAAGATAA 3480

ATTATGGCTT ATGATGGCTT ATTTACAAAG AAAATGGTTG AGTCTCTACA ATTTTTAACA 3540 

ACAGGACGTG TTCACAAAAT CAATCAACCT GATAATGACA CGATACTAAT GGTTGTACGT 3600 

CAAAATAGAC AAAACCATCA ATTGTTATTG TCAATCCATC CAAACTTTTC AAGATTACAA 3660 

TTGACTACTA AAAAATATGA TAATCCATTT AATCCACCCA TGTTTGCGCG TGTTTTTAGA 3720 

AAACACTTAG AAGGTGGTAT TATCGAATCG ATTAAGCAAA TTGGTAATGA TCGTCGCATT 3780

GAAATCGATA TAAAGAGTAA AGATGAAATT GGCGATACTA TTTACCGCAC TGTCATCCTT 3840 

GAGATTATGG GTAAACATAG TAACTTAATT TTAGTAGATG AAAATCGCAA AATAATTGAA 3900 

GGATTTAAAC ACTTAACACC AAATACX5AAT CACTATCGTA CAGTAATGCC AGGATTTAAT 3960 

TATGAAGCAC CACCTACTCA GCACAAAATA AATCCGTATG ATATTACAGG TGCAGAGGTG 4020 

TTGAAATATA TCGATTTTAA CGCAGGTAAT ATTGCTAAAC AATTATTGAA TCAGTTTGAA 4080

GGATTTAGCC CTTTAATTAC GAATGAAATC GTTAGTCGTC GTCAATTTAT GACTTCATCA 4140 

ACATTACCAG AAGCATTTGA CGAAGTAATG GCAGAAACCA AGTTACCACC TACTCCTATT 4200

TTTCATAAAA ATCATGAAAC AGGTAAAGAG GATTTCTATT TTATAAAGTT AAATCAATTT 4260

AATGATGATA CAGTTACATA CGATTCATTA AATGATTTGC TTGATCGTTT TTATGATGCG 4320 

CGTGGCGAAC GTGAACGCGT TAAACAACGT GCGAATGATT TAGTTCGATT TGTTCAACAG 4380 
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ATAAAGATAC TGAACAGTTA TATGGTGAAT TGATCACTGC TAATATATAT CGAATTAAGC 4500 
AAGGCGATAA AGAAGTGACG GCATTGAATT ATTATACGAA TGAAGAAGTT GTCATTCCTT 4560 

TAAATCCTAC AAAATCCCCA TCAGCAAATG CTCAATATTA TTATAAACAA TATAAyCGTA 4620 
TGAAAACGAG AGAmCGTGAA TTACAACATC AAATTCAATT GACGAAAGAC AATATAGATT 4680 
ATTTTTCAAC AATCGAACAA CAATTACATC ATATTTCTGT CCATGACATT GATGAAATTA 4740 
GAGATGAATT AGCAGAACAA GGCTTTATGA AACAGCGTAA AAATCAAACT AAGAAAAAGA 4800

AAGCX3CAGAT TCAATTACAA CATTATGTAT CAACTGATGG CGACGATATA TATGTTGGTA 4860 
AGAATAACAA GCAAAATGAT TATTTAACAA ATAAAAAAGC TAAAAAAACT CACACATGGT 4920 
tACACACAAA AGATATTCCT GGTTCACATG TCGTTATATT TAATGATGCA CCAAGTGATA 4980 
CGACAATCAA GGAAGCGGCT ATGTTAGCAG GATACTTTTC AAAAGCTGGT AATTCTGGAC 5040 

AAATACCTGT TGATTATACA TTAATTAAAA ATGTGCATAA ACCATCaGGT GCAAAGCCTO 5100 
GGTTTGTAAC ATATGACAAT CAAAAAACTT TGTATGC 5137 
(2) INFORMATION FOR SEQ ID NO: 218: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2267 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218: 

GTTTTATCGC AGCAGTAAAG CTATCAATCG GCGGTTCAAT TGATGATGCA TTAGCAGAAA 60

TCAnACAATC ATTTTAGTTA AAATTTACTA ATAATGAaAA ATGTAAACCT TTTTCAAATG 120 

AAACTTTATa AaAAATATGA TAGTATATAT GTAAATGTTT AATAAAATCT GGAGAAATAG 180 

GAGGACATTG CCATGCAACA CCTTATAAAA AAACATGTAT TGAATGGCGA GTTTGATTTA 240 

GTACGACAAT TGATGTCCGA AACAGATTTT ATGGAATTTG AAGAAGCATA TATTTCAAGT 300

GCGCATGAAG TAGAAAGTAT GATGTTTTAT ACATGTATTT TAGATATGAT TAAGTACGAA 360 

GAATCATCTG AAATGCATGA CTTAGCATTT TTATTGCTTG TGTATCCACT AAGTGAATAT 420 

GAAGGTGCTT TGGATTCTGC TTATTATCAT GCAGACGCTT CCATAAAACT TACTGACGGC 480 

AAAGAAGTTA AAAGTTTGTT ACAAATGTTA TTATTGCATG CGATACCAAC ACCTGTTATT 540

TCAGATAAGA AGGCTTTTGA TATCGCCAAG CAAATTTTAA AATTAGATCC TAATAATAAT 600 

GTTGCTCGTA ACGTCTTAAA AGACACTGCC AAACGTATGC gACAaCGTTG TTGTTGATAT 660 

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AGTTTTAACA TTTGGTTGGG TTGGGCATAT GTTCCAGCCT TTTTTAATAC TTAAAAACTA 780 

ACGAAgTATA CTTGTGTGCA CAAATGGTTT TTATACAACA TTTTATAAAT TTATACATTT 840 

TAATAAAGAA CATACGATAG ATGGTTTAAA CCTTGTTAAC TGAGAAATTT TGATATGTAT 900 

TCTTCGAAAT TTAACTAAAT ATACGAAATT CAAGAAGCAC AATAATTAAT CATTTTTCCT 960 

ATACAAAAGT TCGTATGACT GCATTATAAA AGCATAAATT TATAATTTTT TTAAATGTCA 1020 

TTQAAGGTGA TAATGTGAAT GGATTGAGCA AriTTGAAAA AGTGAAAAAT AACCTATGCG 1080 

ACTTGCAATT AATTTTCAGT ACGTTATAAT GCACACTGT6 CAAAATTAAG GAGGTCTATT 1140 

ATTCACATGA TGATGAaTAA AGAAGCAACA AAAATTGGAT TTGCCTACGT CGGCATTGTA 1200

GTGGGCGCAG gATTTTCAAC TGGACAAGAA GTTATGCAAT TTTTCACTAA ATATGGCTTG 1260 

TGGGCTTATT TAGGTGTTAT TATATCTGGT TTTATTTTAG CTTTTATTGG GCGCCAAGTA 1320 

GCAAAAATTG GTACTGCCTT TGAAGCGACA AATCATGAAT CAACATTACA ATACGTATTC 1380 

GGTGAAAAGT TTAGTAAAGT CTTTGaTTAT ATTTTAATCT TCTTCTTATT TGGTATAGCT 1440 

GTAACCATGC tAGCTGGTGC AGGCGCAACA TTTGAAGAAA GTTATAACAT ACCTACATGG 1500 

CTAGGTGCTT TaATTATGaC ATTAGCGATT TATATTACGT TGCkATTAGA CTTTAATAAA 1560 

ATAGTACGTG CACTAGGTAT CGTTACACCA TTTTTAATTG TTTTAGTTGT ATTAATCX3CT 1620 

GGCGTTTATT tATTTAAAGO TCATGtTTCA TTAGCAGAAG TTAACCAAGT AGTGCCtGAA 1680

GCAAGTATTT GGAAGGGAAT CTGGTTTGGT ACAATATATG GTGGATTAGC TTTTTCTGTA 1740 

GGTTTTAGTA CCATCGTAGC AATCnGTGGG GATACTGAAA AGCGTACAGT GTCAGGTGCA 1800 

GGCGCGATGT ATGGTGGTAT TATCTATACT GTATTACTAG CATTGATCAA CTTTGcATTG 1860

CAAGTGaATA TCCAACTATT AAAAATGCCT CAATTCCTAC ATTGACGTTA GCAAATAATA 1920 

TCCAffCCTTT AATAGCAACA GTGJcTATCTG TTATTATGCT GGCGGkTATG TATAATACTA 1980 

TTCTAGGACT AATGTATTCA TTTGCAGCAC GTTTTACAGA ACCATACAGT AAAAATTATC 2040

ATATCTTTAT TATTATAATG ATGGTAGCAG GTTATTTATT AAGTTnOGTA GGATTTGCTG 2100 

AATTAATTAA TAAGTTATAT ACnATTTATG GGATATGTAG GCTTATTnTA TTGTAGTAGC 2160 

TGTAATTATn AAATATTTCC AAACGTAAAA ATGGCGGATA AAAAACATAT TGCTTTAATA 2220 

TCATATGGAG GGGATATCCG AAACTTTACA ATTTGAATCA CTTTGGT 2267 
(2) INFORMATION FOR SEQ ID NO: 219:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6336 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 219:

GTCAATGTAA CCTAATAGTT TATGTCTATC TTGTGTACCA ACTACTACAT CGACACCAGG 60

AATTTCCATA ATTTCAGCTG ATGAAGTTTG CGCATAACAA CCTGTTACAC AGATTACAGC 120

ATCAGGATTT TGTCTTATTG CACGTCTAAT TATTTGACGA CTTTTTTTAT CACCCGTATT 180

CGTTACTGTA CAAGTATTAA TAACAAATAC ATCAGCATTC GCTTCAAAGT CAACGCGCTC 240

ATAGTTTGCT TCTTTAAATA ATTGCCAGAT TGCTTCAGTT TCATAATGGT TTACTTTACA 300

ACCTAATGTG TGaACGCAAC TGTTGACATA AATATTCACC CCATTAATTC TTTTTCATAA 360

CTTATTGCAC TTAACGCATA CAATGGCGCA GTTTCTGCCC GTAAAATTCT CGGCCCAAGA 420

CCAACAACTG TACTAGTATT ACTAAATAAT GAAATTTCAT TTTCTGACAA ACCACCCTCA 480

GGaCCAAAAA TCATCAACAC TTTATCCTGA GCATTGAATT GTTQTAAAGT TTGCTTGAAA 540

TTGCTTAACT CACCATCTTT TGCTTCCTCT TCATATGCAA TAAGAATATA GTCATAATTA 600

TCAATAGTAT CACAAATTAA TTTTAAATTC GACTCGAATT GAATAGATGG AATCACTAAA 660

CGATAGCTTT GTTCAGCAGC TTCTTTAATT ATTTTTTGCC AACGCTCTAT CTTTTTGGCA 720

ACTTTTGCCT CGTTTAATTT AACAATTGAA CGTTCCATGC TCACAGCTAT AAATGATGAA 780

GCACCCAATT CAGTAGCTTT TTGTAGCAAC CACTCATATT TGTCAGCTTT GATTAGTCCA 840

CTGCAAATCG TAACATCAAC TGGCAATTCT GTATTAATAT TTTGTTTTTC TTTTAAATCA 900

ACTTCAATTT TATCACTTGT TATGTCAGCA ATTTCACATA AATAAACTGT TTGATCATTA 960

AAAGTTAAAA TAATTTTACT ACCAACATCA TATCTCATTA CATTTGTTAT ATGATGAATA 1020

TCTTCTTTTT TTGTAATAAA AAAACGCTGA CTTACATCAG CGTTTTGGnT CTATGAAATA 1080

ACGTTGCACA TTATTCACTC ACTTTCTGGC CAACAAGACA AACCCAACCG TTGTCATGTT 1140

GTTCTGAAAT AATTTTAAAA CCTACACGCT CCATATGTGA CTGTATACCT TCATACTTCT 1200

CTTTTATAAT ACCAGAAGTA ATAAAATAAC CGCCTTCATT TAGAOTATTA TAAGCATCTT 1260

CAATCATTTC ATCAATAATA TGCGCTAAAA TATTTGCTAT TACAATATCA AATTTTTCTG 1320

TTTCGTCTTT CAATAAGTTA CCTGGAACAG CTTCAATTAA CGTTTCACAA TGATTTCTTC 1380

TGAAGTTTTC TTTAGCTACA CTCACTGCCA TTTCATCAAT ATCCAACGCT TTAATACGTT 1440

TTACACCGAT TAGATGACTT GCAATACTTA ATATACCTGA GCCAGTACCA ACATCAATTA 1500

CTGAATGCTG TGGCAATACA TATGTTTCTA TTGCCTTCAA ACACATACTT GTAGTCGGAT 1560

GATCACCTGT TCCAAAAGCC ATACCTGGGT CGAGCTCAAT GCAAAGCTCT TCATCCGCTT 1620
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GGAAATAGTT TTTCCATTCA TTTTCCCAAT CCGTCTCTGC AATAATTTGC TCACTGAATT 1740 

GAACGTTATG TTGATCAAGT TCATCTAAAT TTAATAACTC ATCTTTAATT TGCTGTCGCA 1800 

ACTTATCATC ATAAGTCATT TCATTAAAAT AGGCTTTCAA TCTTACTCCC TTATCTGGAT 1860

AATCCTCTTT TTTCAAAGCG TAAATTTCAC CGTATTTATC TTCTGGTTGG TTAATTAAAT 1920 

CATCTGAATC TTCTATCACG ACACCATTTG ATCCATGATT TTCAAGTATA TTGGTAGCCA 1980 

ATTCTACTGC TTCATGATTA ATAATAATTG AAAGCTCTGT CCAGTTCATA CTTTATTCTC 2040 

CCTTAAAGAA TCTTTTTGCT CTATCTTTAA AATTCGAAQG TTGTTCATTA ATTTCTTCAC 2100 

CATTTAATTG GGCAAATTCT TTCATTAGTT CTTTTTGTCT ATCTGTTAAT TTAGTAGGCG 2160

TTACTACTTT AATATCAACA TATAAATCTC CGTATCCATA GCCATGAACA TTTTTTATAC 2220 

CCTTTTCTTT TAAGCGGAAT T6CTTACCTG TTTGTGTACC AGCAGGGATT GTTAACATAA 2280 

CTTCATTATT TAATGTTGGT ATTTTTATTT CATOOCCTAA AGCTGCTTGT GGGAAGCTAA 2340 

CATTTAATTT GTAATAAATA TCATCACCAT CACGTTTAAA TGTTTCAGAT GGTTTAACTC 2400 

TAAATACTAC GTATAAATCA CCAGCAGGTC CTCCATTCAC GCCTGGAGAG CCTTCACCAG 2460 

CTAATCTAAT TTGTTGTTCA TTGTCX3ACAC CTTCAGGTAC TTTCACTTCT AATTTAACTG 2520 

TTTTATTTTC AGTACCTTTT CCGTGACATG TTGGACAAGC TTCTTCAAAT TCTTGACCAC 2580 

TTCCATTACA TTTAGGACAA ACTTGTTCAG TACGAACTCT ACCTAAAATT GTGTTTTGTT 2640 

CTACAGCTAC ATGACCAGCG CCATTACAGT AACTACAAGT CTTTTTACTT GTTCCAGGCT 2700 

TTGCACCATC ACCATGACAT GTTTCGCATG TTACATCTTT ACX3GATTGAA ATTTCTTTTG 2760 

TTGTACCAAA TACCGCTTCT TCAAATGTTA ATGTCATTGT ATACTGAAGA TCATCACCTT 2820

TTTGCGGTGC ATTTGGATCT CTTrGTCTGC CGCCACCGAA GAAAGAGCTA AAGATATCTT 2880 

CAAA5CCGCC GCCACCGAAG CCACTAAAAC CXXTCAAAGTC AGAGCCATTG AATCCTTGTC 2940 

CACCAAAACC TTGTGGACCA TCATGTCCAA ATTGATCATA GcTTGCGCGT TTATTATCAT 3000 

CACTTAAAAC TTCATAGGCT TCAGAAATTT CTTTAAACTT TTCATCTGCA CCTTCTTCTT 3060 

TGTTAATATC TGGATGATAT TTTTTCGAAA GCTTTCGATA CX3CTTTTTTG ATTTCATCTT 3120 

TTGAAGCATC CTTACTAATG CCTAAAACTT CATAATAATC TCTTTTGGCC ACAGCTATCT 3180 

CTCCTTTTCT TAATTAACTC ATATAGTTTA ACGTAATATG TCATACTATC CAAATAAAAA 3240 

GCCAAAGCCA ATGTTCTATT GACTTTGACT TTTCAGATCA TGACAACATT CTAATTGTAT 3300

TGTTTAATTA TTTTTTGTCG TCGTCTTTTA CTTCTTTAAA TTCAGCATCT TCTACA6TAC 3360 

TATCATTGTT TTGACCAGCA TTAGCACCTT GTGCTTGTTG TTGCTGTTGA GCCGCTTGCT 3420 

55 
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TATCTTCTAT ATCTTGACCT TCTAAAGCAG 
ATTTTTTATC TTCTTCACCG ATATTTTCGC 

5 

ATACTAGACT GTCAGCTTCG TTTCTTAAGT 
CAGCGTTAAC TTCAGCATCT TTTACCATAC 
TTGATTGAAT TGTAATTCTT TGTTCTTTAT 
CAATACCGTT TTTATCGATA TCAAACGTTA 
GTGGAATATC AGTCAATTGG AATCTACCAA 
CACCTTGTAA TACGTGTACA TCTACTGATG 
GAGATTTAGA TGTAGGAATC GTAGTGTTAC 
AAATTTCAAT ACCTAAAGAT AGTGGTGTTA 

20 

CTGTQATAAC GCCACCTTGG ATTGCAGCTC 
CTTTGTTAGG CTCTTTACCG ATTTCmTT 
TTGATCCACC AACTAAGATA ACTTCATCGA 

25 

TTGCTTGGCG TGTAGGTTCC ATTGTTCTTC 
TAGAACGAGT TAAGTTTACT TCTAAGTGTA 

30 GTAATGAGAT TTGAGTTTGT GATACACCTG 
CTTTCAAACG TTGTAATGCC ATTTTATCTT 
ATTCTGCAAC TAGGTAGTCA ATAATTACTT 

^ CACCGGCTGT TGATAGTACT TCGAATACAC 

ATGTACCGCC ACCTAAGTCA AAAACAAGAA 

CATATGCTAA TGCTGCAGCT GTTGGTTCAT 

40 \ 

TTTTACCAGC ATCTTTAGTT GCTTGACGTT 

TTACAGCTTT GTCAACTTTC TCACCTAAaA 

^ AAATCATAGC TGAGATTTCT TGTGGTGTGT 

CAGTACCCAT ATGACGTTTA ATAGATTGAA 

TTGCTACTTC aCCAACTTGA GTTTCTCCAT 

50 GTGAACcTTC AGGGTTTTGA ATTACTTTTG 

AATTTGTTGT ACCTAAGTCT ATACCAATAA 

TCATTAAATT AATTTAATTT TAAACAATGT 

as 



TTTTAAGAGC GTCTTTTTTC TCTTCAGCAG 3540 

CTAAATCAGT TAAAGTTTTT TCAACTTGGA 3600 

CTACTTCTTC ACGACGTTTT TTATCTGCTT 3660 

GGTCGATTTC TTCGTCTGAT AATGAAGAAC 3720 

TTGTACCTAA GTCTTTTGCA GTTACATTTA 3780 

CTTCAATTTG AGGTTTACCA CGTTCAGCTG 3840 

GTGTTTTATT ATCCGCAGCC ATTGGACGTT 3900 

GTTGATTATC TACTGCTGTT GAATAGATTT 3960 

GTTCAATTAA OGTATTCATA CGTCCACCTA 4020 

CGTCTAATAA TACTACGTCT TTAACX3TCAC 4080 

CCATTGCCAC TACTTCGTCC GGGTTTACTC 4140 

TGACAGCTTC TTGTACTGCT GGAATACGAG 4200 

TATCTGAGTT TGTTAAGCCA GCGTCTTTCA 4260 

TAATTAATGA ATCTGATAAT TCTTCAAATT 4320 

ATGGACCGTT TTCACCAGCT GAGATAAATG 4380 

ATAAGTCTTT TTTAGCTTTT TCAGCAGCAT 4440 

GAGATAAGTC TACGCCATTT TCTTTTTTGA 4500 

GGTCAAAATC ATCACCGCCA AGTTTGTTGT 4560 

CGTCACCTAA TTCTAGGATA GATACGTCAA 4620 

CTTTTTCATC TTTATCAGTT TTGTCTAAAC 4680 

TAATGATACG CTCAACTTCT AAACCAGCAA 4740 

CAGCATCGTT AAAGTATGCA GGTACTGTAA 4800 

TAgTTTCAGC TGTATTTTTT AAGTTTTGTA 4860 

ATGATTTACC TTCAATATCT ACTTTATAAT 4920 

CAGTGTTTGG GTTTGTAATA GCTTGACGTT 4980 

TTTTGAAAGC TACAACAGAT GGTGTTGTAC 5040 

GCTCATCGCC TTCTAATAcT GTnACACATG 5100 

TTTTACTCAT AATAAAATTC CTCCATTTAA 5160 

CTTTTOGCCA AATTTAAGTT ATTGGTTTAC 5220 
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AGTGATTTCG CCAGATTCAA AATCAGGGTT ATCATCTTGA ACTACAGCTT GGTGAATATT 5340 

TGGATCAAAT GCTTCACCTT CAGTTTTAAT AACTTCAAGA CCATTATCTT TTAGTGCGTT 5400 

AATCAAACTT TCATGCACCA TTTGTACACC TTTTTGAAGA GATTTAAAAG TCTCATCATC 5460 

ACCTTCAATT TGAAGTGCAC GTTCTATATT GTCTATTGCT GGTAAAATAT CTGTTAACAC 5520 

ACGTTGTGCT TGATATGTTT TGTTTATTTC ATTTTCTTTT TGAATTCTAC GCTTATAATT 5580 

TTCAAACTCA GCXSTAGAGCC TTAAATATTT CTCTTCGTTT TCATCTGCTA ATTGTT6AAG 5640 

TTCATTAATT TTTTGATCTT TTGGATCTAT TTCTTCAATA ACATTCTCGT CAGACGTTTC 5700 

TTCTATTGCT TCATCTTGTA AATGACCTTT ACTTTCTTCA GCTTGTTCAA CTGAATCATC 5760

AATATTTTGT TTGACGTTTG TTTCTTCAAC TGTTGATTCA GTGTTTTTTT CAACTGATTC 5820 

GTCTTTATTT GTCATTTTCT GTCCTCCAAT ACTTTCTAAT CCATCATTAC CAAATTCTAT 5880 

TTAATAATTG AATGACATTT TGATAATGCA TAGCTGTAGG TCCAATCACA GCGATTTGAC 5940 

CTTTTAACGT TTCATCAAAA TGATATTGAC TTGTTACAAT TGAAATATCA CTTAAGCTGT 6000

CATCAATTTC ATTACCAATT TTTACATTAA TATTTGGTGA AGATATATCT TGTAATAATT 6060 

CTGCAATTCT ATTTGATTCT ATATATTGTA GAATGGGCTG AATTGAAGAT ACATTACTTT 6120 

CATTCAATGC ATCAATAAGT TTAACCTTTC CACCCATATA AATGCTATTA CTTTGATTAG 6180 

AAATATGATT ATTCATCGTA TTTAACAATT TATTGATAAA AATTTCTTCC TGCTCTGATT 6240

GAaCAAAAGA GACAATATCA TCTTGTAAAT TCTGATTAAA CTCAGTTAGT TTGTTTGTAA 6300 

CAAAATTTGA TATTGTATTT AGTTTGTCAT TATTAA 6336 

(2) INFORMATION FOR SEQ ID NO: 220:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13059 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220: 

TTCATGATTA TTATCTGTTG TAGACACTGC TGGATCTTCC GATGTATCTT TCGATGCATC 60 

TTTCGATTTG TGTATTTGCT GATTCAAATG GTCTAGGTCT TCTAACGCCT TATTTACCAT 120 

TGCTTCATCA TTTTTATCAT CTTTTTCTCC ATGTTTTGTT GTAGCCGTTT GTGACATATC 180

ATTTTTCATT GCATTAAGAT CGTCCTCGCC ACTTTGTTGA CCCCTATCAA CATTTGAAGA 240 

AACCTCATTT AAATCTTTAA GCAATTGATC TAATTTACTG TCTATATCAC TTTGACCGTT 300 
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IS 



TTCATCTATT TGCGATGCTG TTTTCGCTTC ATTTAGTTGT GCTTTATAAT GTTCTTTAGA 420 

TGAAGCCGAT AACTGTTTTA ATTGCTCAAT TTGACGAATT GCCTTGTCAA CTTTGTCTAA 480 

TAAATCTTGC TTAGATAATA TCTCTTTTGT AATTTCAGTA TCCTTTTCAG ATGCAGCTTG 540 

GGCATCGTAC GGCAAGATAT TCX5TTAAAAT GATACTTGTC GCCATCATTG TCGAACACGA 600 

TAACTTTACA TATAATTGAA ACX3GTTTCCC TCGATATTTA GCCATCAACA TACTCCTTCC 660

TCACTTACTT CCTTCAAAGA ATTACATACT ATTATATACC TGTTTACAAG AAATTTACAC 720 

TTATCTATCT AGTTATTGTT GTTAGTAATT ATCTACTTAT TACTTAGCTT ATATTTAAGT 780 

AAACAAAACA AGCATGACGT AATATCATAT TGTCCATGTC GCTAACATCA TATTACGTCA 840 

AATCTTTTAT ATTAAATOAT GTTTTATTTT AGACTGCTTT TTCCTTTTAG CTTTCGAGCG 900 

CCTGTTTAAA AACTTGCTCG AATTGTTCAC GCGAGATTTC GTGTGCATGT GCTTTTTGTG 960 

CTAATAAAGC ATCTCGAAAC TGTTGTTGAT CTTTCAAACT TTCTAACATT TGTATTAATT 1020 

GGTCTTTACT TTCCATTGTT ATCTCATCAT TATGCTCAAA TAAGTGCTCT GATAATGTTA 1080 

CTTTAGCATG GTGTGCGGTT TGACGATAAC CTAAAATCAA CAACTCATAG TCAAACGCTT 1140 

GTTCCACCGC ATTTAAAATT TCATTACCCT CATTGATATC AAGATAAATA TCACATAACT 1200 

GGTATAGTTC ATTTACCCTG TCAATATTAA TAGATGGGTA TAAATGCACA TTAGCATATT 1260 

GATCAAGTTG CATTAGCTTA TCAGACATCT CTGTAATAGC AGOGATGTGn AACTTAAAAT 1320

CTGGTAAAGt TyCAACCAAT ACCTTGATGT TACXSAatTGa TCOgAGTTAG TTAATATTAC 1380 

AATTTCTTTA GTATATCTAT TACGACTACG ATAGTTATAT AGATATCCGC CTTGTAAAAT 1440 

ACGAGATTGA ACCTTTGCGT CTGCTATATT GAGCATCGTT TCATATTCGT TTTTATCTGG 1500 

AATAATAATA TTACAATGTC GTTTCATATC ACCTTTACAC ATCAATTGCA TATTTCCCGG 1560 

GACATTACCA TTACAGTGTT CTTGCCATAC CAAAACATCA CTACCTTTTG ATGGCAAATT 1620 

ATATAACACT GAAAATGGTA GGGCTAGTGA GTTAATAACG AAATGATGTT CCGTAATTTC 1680 

AAGTTGCTTG ATAAAAAATA ATGCGAATGC GAGCTTTGAA GGGAAAAAGT AAGACTTCCC 1740

TTGCCAATCC AATATGACAT CAGATGTTAC AAAATTTTCA TAAATCACTT CTTTACCTTC 1800

TGCTGTCATA TATTTCTTCA AGATCGCTTT ACGATTTAAA TOSTAAACAG TTTGTGCAAA 1860 

TTTAATACCA TTCTTAGAAT AATAATCGAC AAATCGGACA CGTTGTTGGT CATCAAACCA 1920 

TTCGACACGA CTAACAATTC TAGGGCGCTC TCCACTTTGA yAAAATATTT TACCTCGyAG 1980

ACGTCCCATA TCATTaATTG TAGCCGAATT GTTOTTACCT TTAATTTCCC AAAAAGCTGG 2040 

TACAGTAACC TGATTAAAAA ATCGTGGTTT CATATTTTCT GTATTATGAT TATCTGCAAA 2100 

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TAAATCTTCT TCCAACTTAC TGGCTTTAAA AGACTCATAT AACTTTCGTG AATGATCGTT 2220

AAAGTAATCA AATAATTTAA TCATGTAGCA CCTCTTGaAC TAATGTTTCC CATTTTAAAA 2280

TAATATCTTG AGTCATAAAT TGCTGTGCCA CTTCATAAGA GATGTCATGT GGTGTCTGGG 2340

GACCATTGTT AAAATACATT ACAATGGcAT GAGCTAGTTT TGCGATAACA TCATCCACAC 2400

TATCTTCGTC GGTATCAAAA GGTACCAAGT AGCCATTTTC CCCATCTCGA ATAAAGGTT6 2460

GGTTACCATA ATTCACATTT AATCCAATCA TACCTAGTCC TGAGCCTACC GCTTCCATTA 2520

GTGTTAACCC AAAACCTTCG CTAGTTGATG CAGAAAGAAA TAACTCATAA TCATTATAAA 2580

TTTCATCAAG TTTAACATGC CCTAGTAAAC GAATATAATC TTGTGCGCGG TGTGTATCAA 2640

TAATTTTACXr CAGTCGC6TC TTCTCGCTAC CTTCTCCATA AATATCAAAT GTTAATTCTG 2700

GCACTTGTCG TTTAGCCACG ATAACC6CTT TGACAAGCCA ATCAATATGT TTCTCATTOG 2760

CTAAACGAGA TGCACTAATC ATCGCATATG GCTTTCTTGA TAATGTTGGA TATGATAATG 2820

CATCAATGCT TCCCACAGGA ATAGTATAAA CACGTGGGCG ATAACCTTGA TATTGCTCAA 2880

ATTGTCGACA AACCATATGA TTTTGAATAT CTGTTGCTGT AATAAAGAAA TCAATGTATT 2940

TAGCTTTTGA AAATTGATAT TCATAATAAT TGTTCCATAG TATATGCTGC TCACTCATCA 3000

TATTATTACT ATAATGATCA GCATGAATCA CAACACCAAC TTTACTATCA CCTTTATGCT 3060

GCAAAACAGC CTGACCAATA TCAGAAGCGC GGTCTAATAT GACAATATCG TCTCGGGTTA 3120

AATTCAATCG TTGTAAAAAG TATGCAATAA ATTCCGTTTT GTTATACAAC ACCGCATCTT 3180

CAAACACATA TATAGAGCTG TCTCCATCAA TATATTCGTT ATAAGCGATG GAACCATCTT 3240

GATTATAAAA TTGTCGCATA TATAATTTCG CTTTATTATC AGCTGGTGCA TAATACTCAG 3300

AAAATATGCG CGTATAACTA TAAAAATCTT TACX3TACTAA CATACTATTA ATTACAAATT 3360

CTGCftCGATC CACAATATCT TTTTGTTCAT TTTGCAGATA ACATGTTACA AATGATGATT 3420

TCCCATTAAA ATATAGGCGG ACTATCTTAC CATTTCTTTC TCTAAAACTA ATGTCATGAC 3480

CAAGCTCACG TTCAATGTCA TCTAACGTGT ACGTTGTTGG TGCTAAAGAA ATATCACTAA 3540

AATACTGATA CAACCAAATA ACTTCTTGAT CTTTAAACCC AATGTTTTGC GTTAATGTCT 3600

GTATGTTCTC TGACTGTATA AAATCTAAAA ACACAAATTT AGTGTCTTGA TTTGTACGTC 3660

TCAATAATTT AGCACGGTAA GCTTGTGCAT ATTCAACACC GCTACTCGCC CAGCCTATAC 3720

CAAAGTTTAT ATTATATATT GTCATGOGCT ACCCCTTTTC ATTTATGGAA AATGTATAAC 3780

TGGCATACCC TCTTTATCAA ATGTAATCAT GCTTT6ACAA ATATTTTTCA CCATTCTTTT 3840

TTTGATATTT CGTGTCATAA CTTCAAATQA ATCTAAGGCA ACTCTATGGT ATTCAAAAAT 3900

GACTTGTTCT AACCAACATG AATCAATTGC TTTCAAAAAG ACTTTTTGAA CGAAAATATT 4020 

ATAATAATAT GCACTTTGCA TGTTTTTACG ATTCAAAGCT AATTGCTTTT CAAATTGCTC 4080 


TAATAAAAAT GTCACTACTG CTTGCTTATC TTTAAAATTA ACACAAGCCA CATCTTTATT 4140 

AAATTGGAAA CTTAAATTTT GATAAATATA CTCGACAACA CGCGATTTTG TTAGCACCTT 4200 

TTCCTCATTT ACAAACATTT CAAATACATC TTTAGCTAAC GCTTTAAAAT CTTGATTCTC 4260

AGCATCATCT ATTTCTAAAA CTCGATTGOG TTCCTCGTAT ACAAGATCTC GCTGTATACT 4320 

AATGCTTTTT TCAAATTCAT TAGCCATTTC ACGAGCTTTA ACCCCTTGTT CTTCOGAGAt 4380 

aCGcTGCGCT TTAACTACAA TTTGCTTAAC TTTGCGATTA AACAAATTAC TTTGCX5ATAA 4440 

TCGTTGTGCA TCTAATGAAT ATAATTGATT ATTTTCCGCT AAATTACTAT CGCTCCATCG 4500 

CTTAACTAAA TAATCATCTA GTGAAATATA TATACAAGAT GATCCCGGAT CCCCTTGTCT 4560 

ACCAGAACGA CCACGTAATT GCCTGTCTAC ACGGCTATTT TCCATATGTT CATGAATAAT 4620 

AACAGCTAAT CCACCTAATG CTTCGACACC TTCACCAAGT TTAATATCTG TGCCTCGACC 4680 

TGCCATACTA GTCGCAACAG TCATGGAACC AATTTGCCCT GCTTCAGCTA TCATCTGCGC 4740 

TTCTTTTGCA ACATTTTGCG CAATGAGTAA ATTATTAGGA ATATCCATTT GGAATAATAC 4800 

TTTCGAAAAG TATTCAGCCG CTTCAGCAGT TCTCGTTATG AGTAAAACCG GTCGCCCCGT 4860 

TTCATGAAGT TCAACTATAT CATGAATCAT CGCGATGTTT TTCTCATCAA CTGAACGAAA 4920

CACTTTATCT GGTTCATCGA TACGTTGAAT CGCTTTATCA GTTGGTACTT GTACGACTAT 4980 

TTTTGAATAC AAATCAAAGA ACTCTGATTC GCCTAATTTT CCTGTAGCTG TCATACCTGA 5040 

AAATGATTCA AAAAGTTTAA ATAAATTCTG GAAGGTAATT GTTGCCATAA CACTTTTATC 5100 

TGTTGAAACC TCCATACCTT CTTTCGCTTC AATAGCTTGG TGAAGTCCAG CTTGCAACTT 5160 

AGTTCCCGGT AACATACGAC CTGTAATACG GTCAATTAAA ACAATATCAC CATTATATAC 5220 

AAAGTAATCG ACATTAGATT CAAACAAATA TTGTGCX5CGC AGTGCTAAAT TAATATTACG 5280 

CACTAGGACC ATCGCTTGTT CGCTATATAA ATCTTCAACA TTAAAGTATG ATTGTGCCGC 5340 

TTCAATACCT TGATTTAACA GCCATATTTC TTTTTTGGTC TTCTTCATTT TAAAATGCAC 5400

GTCTTCAATC AATGTATCTA CAAACTCTTT CACAATATGA AATAGATTTG ATTGTAATCT 5460 

TGGTGCACCC GAAATAACTA ATGGTGTTTG AGCAGCATCT AAAATGATTG AATCCACTTC 5520 

ATCAATAATA CCGTAATTTA ATTGTGGTAA AAATTTCCCT TCCGCACTAT CAGCCAAATT 5580

ATCAATTAAA TAATCAAAAC CGAGACGTCC ATTAGTTGTA TATATAATAT CATGTTCATA 5640 

TATATTACGT TTTTCCCCTT TTTGATACTC ATAATCCACA ATATCAACAA AACCTAATGA 5700 
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TAATCATTCG TTGTAATTAA ATATGTTCCT TTTCCCGAAA GAGCATTTAA ATATAAAGGC 5820 

ATCGTTGCCG TTAATGTTTT ACCTTCGCCT GTTTGCATCT CCQCAATGTT ACCTTCATGC 5880 

AATACAATCG CTCCGATTAA CTGAACTTCT TTAGGATACA TACCTAATAC TCTCCAGCTC 5940 

GCTTCACGTG CCACTGCATA AGCTTCAGGT AACAATGTAT CTAGTGTATC AACTCCTGAT 6000 

GCTAAACGTT CTTTAAATTC TATTGTCTTT TGTTTTAACG CATCATCAGA ATATGATTTA 6060 

ACTTCATCGC TCCATGTATT GaTGsGTTcA CTATTTTTCT AATCGACTTT AGTCTTAATT 6120 

CGTTTATCGT AACATCTAGT TTATGTTTCA TTTACTTCCC CACCATTCAG TTTCGATACA 6180 

TCTAAGTAAT CTAAAAATCG TACTGGATTC ATTAAACGTG ACATATAATT TAGATGTTTG 6240 

TCTTGCTCTT CTTTAAAATA AACCTCGACA TTTGTATCTT TTAGTTCATQ ATTTCCTGGG 6300 

ACATGTTCTG TAAGCCATCC TTTTAAATCA TCATCTTCAT GGCTTGTACG ATACACTTTG 6360 

CAACCCAAAT GCTGA6CGAC ATAAGTTGCA AAAACATTTG ACTTTGACCC ATAACTAATC 6420

AAATTAATAG CCTTTAGGGT ATCTTQACTT TGCAAATCAT TCTTTAGTTG CTTAATATTT 6480 

CCCTCGATAT TGTCGTCCAT CCAACGTTCA ACGAGCCAAA CATGACCAAA CAGTTTCAAA 6540 

AAATjCATTCG AAATAGTTGG ATAGGTGTCA GATGGTTCTG CAATAATGAC ATTGATCATA 6600 

TCATTTCCAT ATTGGTCATC GCCTATCTTC GTCACCCGCA TGCTTTTATA CTCTAAATCA 6660 

TATTGATGCG TCATCTCTGT GATTGTTAAA CATCTAAATA TAAGACTCX3T CGATGCTGCA 6720 

TTCATCATTT TTATTTTATA AGCATAGGCk TCATCAGGAT ATTGAATCX3T AATACTATTT 6780 

GACTTTACAA TCTCAGTACT TAGTTTTGTG CCATTTTTAT TATAAAAAAT GATGATAAAA 6840 

TACACTGAAC CAGCAGGCGT TGCATCAAAA TCAAAATGCA ATTTATAATG CTGTCCTCTA 6900 

CGCAAAATTG GkAAACTTGG CGcACTTTTA TATTTTGAAA ATTGCTTTAA CATCAACCAC 6960

TCATSAATCG GTAATCCAGA GGGCATCAAA GGATTTATAA AAGTCACTTC ACCATTTGAA 7020 

AATGATACTT TAGAGCCATA CATAAATGTA GTTTGTGAAA TATAATTCCA AGTAACTTTA 7080

AATGTTTTGT TTTTCAGCAT GTTGAACTCT CCCAAACTTG TCTTCCAAAA TAATGTTGTA 7140 

AAAATTAACA AACCAACTTG CAATGGTAGG TGAATCATCA TTATGTCGCC CAGGAATACT 7200 

GCGATTCATC ACTCTTGCTT GGTGTGCTOT CAATACAGGT AATAGCTCTT GAAATGCATG 7260 

TGGATCATAA TCATCATGTT GCATATATGC TATGGCAAAA ACAGTTTGTG ACAATGATTy 7320 

CTTTTGAAAT GTTTGCCAAA ATTTTTGATT TAATGCCTGT ATCGACX3CTT GAGATGTATC 7380 

ACCTTCATTA GACACCAGGA CGTCTAATGC TGTACCOAAC TCTTCTGGTC TAAGTAATCG 7440 

CATATGTTCA GCAATCGTTC CAATATTAAC AAGTGGTTTA CCAACAATAA TTGCCTGAGG 7500 
TAATTCATGT GATTTAAAAT TCAGCTTTTC TAATGTCTCG TCAATAACAT TGATAATACC 7620

TTGTTCATAT TCAGATGAAC CGATATAAAA ACTACCACCT TCAACACGAG GATCGCCGAT 7680

AAGTAAAAAC GGTGCATTCA TACGTTTCAT CATATAATAT CCTTCGAAAC CTTCCGCTGT 7740

TCGATAACCA CTAAAATATA CGTTTAGTGG CGGTTTCATA TCACCAGGGT GGAAATAATA 7800

AATAAATTCC TGTCGTTGAC TATCTACGAA ACGACTACCA CCAAGTAAAA ATTGACCCAT 7860

GTCTAATCTA GACCATCGTT TGTGTATAGG TCCTAAATGT ACCGTCCCGT TCCCACGCGC 7920

CTTAACAGTT ACACTTATAT AAGCATCAAA TGGTTTCGCA GGTATCTCTA AAGGACTGTC 7980

TAACATATCA TCASTCAATA CGATTTGTTC AATTAATGCA CCATCAGCGC CAGTCTGAAT 8040

CAATCTAAAT GTATATTGCA ACTCGACCGC ACCATCAATA TCAAATTCTG GCCATATTTG 8100

AATGACTTTA TCTTTATCGT AAACGAGATT ATTTTGCCAA GATGCGATAG GTTTAAATTC 8160

TTTCCCAAAT TCTCCACTCA ATGTGAGCTC TGAATTACCT TGGTAAACGA CATCTCCTTT 8220

AAAATTCGGA TGCACAAGTG CTAACTTAGQ AGAAACCTTA TCTCCATACT GTCCTGAGAA 8280

GCTAACTGCC TCTAATTTAT TATTACGTTC TTCAATATTC CGGTAATGTA ATGGTTGAAC 8340

AACGTATTTT TGGACATTTT CGTCTTGTTC ATATTCAACT GACCAAAATG ATTCATCAAC 8400

ATAC6TATTG TATGGTTCGC TTATCATTTG TAATAAATTC GTTAATGTCT CCGAGTATGG 8460

TGCTTGAATA TAGATAAAAT CAAAGCGCCC TTCTGCTTCA ACAATCGCTT CAATAGCCTC 8520

TACATAACCA CTATCAAATT CAAACAATCC AATATCGAAG TAATCCCAAC TCACACCTTT 8580

TTTGTGTTGA AAAATAGGTT CTAAATCGTC TCCTCCAATT TGCAAAACTC TAAATTTACG 8640

TGGCATCATT TTCACCTTCT ATTAACTCAT CGAGCTGATT AATAATATTC TTAGAAGCAT 8700

ATGCATCTAT TAATTTTAAA GAATAGGCGT ACGCATAATT CCAATTTTTC AAATAAAATA 8760

AATAATAATT TAACGCATCA TCTAATTCAT CAACTGTATT TATAATACGG CCATTQTCAT 8820

AATCAGAGAC GTAATCTGTT TGTTGACCAT TAATTTGTGG AATCCCAGCG CTAATTGCAC 8880

TAATTTGTAA ATACAAGTCA GGTTCTTTTG ACATATCTAT CACAAGTCGC AACGTCCGCA 8940

ATGCTTCTAC AACATCATGT TCAGCATGTA TCGTCTTAAC AGCAATGATG TCATCTTGAT 9000

CTTCAGGTGT CATTAATGCT GAAACATTAA CATCCGCATT CTGTTTAGCT TGGTATTCCT 9060

CATTTACCGA CGTAATACAT TCACGAAGCC ACATCGGTAT GTCATTTTGA TGGCGCGATA 9120

ATAAAATTAA ACGGTAATAA TCTTCCTGTG CGATATAATC CACAAGTCGT TQCATCATTT 9180

GTTGCAAATC AGCGTCACTC ATACCATCTA TCCATACACC TATAAATGTT TCCATCAATT 9240

GACTACTTAT ATTAGGTGAT TGTCTCGTTT CAAATGGTGT GATTCGAATC ATTGTATTCT 9300
TTAAATGGGC ATTCTTTACG ATAGATTGAT ATTCCTCATC TGACACAGTT TCATTTCTAT 9420

TTTTAAAAAA TGAATAACTT AATGATTTCG CTGGAATATG ATTGGCTATT TGTCGATTGT 9480

GCCTAGCATC TGAAGCCACA ATCACATGAT CATCTTCATG TATTTGTTGT GCAATCATTG 9540

CTTGAAATTT TTCTTCAATT AGTTGAGCCA TATTGTTATA TTCTGTTTGT TGATAGTGAT 9600

GTTGATATCT TTTTGAAACA GTGACTCTGC CATTTTTCAA ATCTTCATGA AGTACACAAT 9660

CTCCATTAAT CGTTAAATAT TCTTGGTAAG AAGCCTCTCC CTGATCATCA AAATAACGTA 9720

TCGCTGATAA ATAACCTCTG TCATCAAAAA TATAACGCCG TTGTAACTGA TCTCTTTCAA 9780

ATTCTTCAAA CCAAATTGAA TACCCTTCTT GACTAAAATA AATATTTGTA TAGGTCTGTT 9840

CACTCGTCAC ACATTTTAAT AAATACGGTX5 TGTACACAAA CTCAACATCA TCCGGCCATT 9900

TTAAGTGATG ATAATTAATC GCTTGTGGCG CATGGTGACT GAATCCTTGA ATTTCATCAA 9960

ACACA6ACGA ATACTTTGTC TCATATAAGT CATATCGATG TAAAAATGTT CTTAAATTTG 10020

GTGCATGATT GAGAACAATC AGTTGATAAT CTAAGTCATT TTCAAGGTGC ATTCCCATTA 10080

AACTAATCAT ATCGTCAAAT TCCGTCTTAT TTTGTAGTTG ATAATACGGC ACAGTCGTGT 10140

CTTGCCACCA TCGTTGGTCA TCGTACCAAG CTGGAATAAA GTATTTCATA ATTACXTTCCT 10200

TACCAATACT GGTTTAAAAA TGGCTTATAT TTATCAAAAT ATAAATATGT ACGAATTGTT 10260

TCT6CAATAT TAATACTGAT GTAAACTAAT ACAATCAGTT GTACTGAGaA ATAAATTTCA 10320

GTAGATAAAT GCGGTACAAA CAATGTGAAA TAAAGCGGTA TACCAATAAT GACTGTAACT 10380

AATGCCAATC CAAACCAACA TACX3CGTCGT GCTTGATAAT TTAAATAACG TTCTGTATCC 10440

TTACCAGGTT TaACTCCTGA AAAATAATTG CCACTCTTTA AGAAATCTTT GGATTTTTGT 10500

TTAGTATTGA TTAAAAATCT CGATAAAAAA TAACCCAATA ACATTTGAAT CACTAAATAT 10560

ACTGAAATAC CTACTGGACT ATCAAATGTC AGCATTGGCA TGTCATCTGA TATGCTTTTA 10620

TTAAACATAG ATAAAATAAA ATGAATGCCA CTTTTTAAGA AAACAAAAGC TGAAATACTC 10680

ATCATTAAAG TAATACTGCC TGCAGGGTTA ACTTTCCAAG ATAAATAAGA TTTCATATTT 10740

GTTGCGGAAA CGTTCATTAA ATCKATATAT GGTATTCTCA CTTCTACTAA TTCAATAAAT 10800

AATAAGATAA ACAATGTGAT TATCACAAGG ATGATTAACA ACGCAATCAC AATATGACTT 10860

GCATCTATAT ATTCCATTTT TTGATGCATC ATTGATTTAA TAATACTAAC CATTACAATC 10920

GGCATTGGTC CTGCGATGCC GTAGCGACTA TTTTTGTCAG CTAACCAAAC TAATAACATC 10980

GTTCCAGTAA CCAAAATCAA TATTGTTAAG TAAATATTGT CTTGATGAAC ACGTTCTTTC 11040

GAAACATATT CATGAATCAC AAAATAACTT TGAATAACAC TTAAAATTAA TGTTAAGATG 11100
GAAATCAGCA TCAAGATAAT CATTGATGTT AACCACGGAC CTAAtCCTAA AGTGAAAATG 11220

TTTAAAGTAT TAACGTCTCC ACCCATATTA GAAATAGCTA TTTTAAAAAA TGACTCATGT 11280

TTTACTTGCA TATCGTTaTA GGAAACGATG GAAATGTTTG TGCCTAATAT ATAAATAaAC 11340

AAGATAAAAC ATGTGTATAG CATACGTTTA TATATAATTT TATATTCGTA TTGTTGTAAA 11400

AGTTTTAACA TGTTGCACCT CTTTTATATC AAAAACATTA AAAAGACTAA GGGTTCATCA 11460

CTAATTATTA AAATCCTATA TCGATTTTTC TAGTGATTGG TGCCTCAGTC TTTTTAATTT 11520

TAGCCAGCTA TAAATTCAAT TTATGCTTGA GAATCATCTT GATCATTTTC ATCTTTCTTT 11580

TTCTTTCTCT TCATTAAACC TAAACCAACT AATAATGTCA TAACGCCACC TAGTAATCCA 11640

TTTTGTTTTA TTGAGTCACC TGTATCTGGC AATCTTTTTT CACTTTGTGC TGGTGTGCCA 11700

TTATGTTTAO TCACTTCAGA TGTTGCACTT AATGTA6ACT GAGATTCACT CGTGCTCGTT 11760

GTTGCTTCAC TTGATAAGCG AGATGTGCTC GTQCTGTGAG TATGATGCAT ACTCATTGAG 11820

TCTGACGGAT GCATTGAGTT AGATTCAGAT GTACTTGTTG AGCCGGACAT ACTTGTTGAT 11880

GTTGAGTCAG AAATGCTTTG TGAACCAGAC ATAGATGTAC TCAGTGATTC GGATGTGCTT 11940

GTCGAATCGG ATGTGCTCAA TGACX3TTGAT GTGCTTGTTG ACACTGATTC TGAGTCACTA 12000

ATTGATGTTG AGTCGGATTT GTCTTGTGAC ATTGAAACAC TCGATGAATT AGATTCACTC 12060

ATTGATGTTG AGTCAGATAC GCTCGTTGAA CCTGAACCAG ACGTACTTAA TGATTCAGAT 12120

ATGCTTGTTG AAGTTGAACC ACTTGTTGAG TCCGATGTAC TTGTCGATGT CGAGTCTGAA 12180

TCTGATGTAC TCAATGATTC TGAGTCACTG ATAGAAGTTG AATCACTTGT AGATTCTGAT 12240

TCTACTGTAC TTTGTGAACC ACTGATACTT ATTGAAGTAG AATCACTGAT ACTGTCTGAT 12300

GTTGATAATG ATGTCGACAC CGATGTGCTT TGTGATGACG ATGTACTAGC ACTCATTGAC 12360

ATTGM'GTTG ATATCGATGT ACTTAAGGAA CCAGATGCAC TTGTACTTGT TGACTGGCTT 12420

TGTGACATTG AATCACTTAA TGATGTAGAT GTGCTTGTTG AGCTCGAGTC ACTTACACTT 12480

GTTGAACCTG ATATTGAGTC ACTTAAACTT GTCGATGTTG AAACTGAtwC GCTTCCGCTC 12540

ATTGAGTCAS ATGTTGAAAG TGATGTACTC GTTGAATTTG ATCCACTGAT GCTAGACGAA 12600

TCACTTGTAG ACATTGAGTC GCTTTCTGAT GCACTGATGC TCATAGAGTC AAATTGACTA 12660

TTACTTGTTG AGCTTGACTG CGAATCGCTC ACACTTGTTG ACGTTGATTC TGATCCACTC 12720

ATACTTTGCG AGCTACTCAA TGATTTTGAA TCACTTAATG AATCCGAAGT GCTAAGACTT 12780

GTX^GAACCAC TTAAAGATAT TGATCCACTT AATGAGTCGG AGTCACTTGT ACTAGTAGAA 12840

TCACTCATTG ATATTGAATC ACTTAGcGAG GTAGACTyGc tTACGCTTTC TGAACCACTT 12900
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TTTGAATCAC TTAATGAATC AGATTCACTC ACGCTTTCTG AACTTCTTAG TGACGTCGAT 
ACACTTAATG ATGACGAATC GCTTGTGCTT ACTGAATCG 
(2) INFORMATION FOR SEQ ID NO: 221: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10758 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221: 

AGGGATGGCC TTACCTAAAA AACCGGGnAA ACCCTCCAAA ACCCATTAAA AGGnTGGnTA 60 

CCCTTTAAAA TGGTAGCATT TAACCGCCAC CCGCCAAGGT GGGTGGTTTA TTCTTCCGTT 120 

ATTTAAATTA GTACACCATG CAGATTCTGT AGTTGAGGGA TATTTTAACG AAAGCTTATT 180 

AGCAACTGAT AAAAAAATAC GTCCTAAGGC ATATATTGCT TCATGGAAGG ACATCGAGCC 240 

GGCTAAGAAA ATAGAATTTA AAATTAAAAA AGGTATTAAA TGGCATGATG GTAATGAATT 300 

GAAAATTGAT GATTGGATTT ATTCAATTGA AGTCTTAGCT AACAAGGACT ACGAAGGTGC 360 

TTATTATCCA AGTGTAGAAA ATATCCAAGG TGCGAAAGAT TATCATGAAG GAAAAACTGA 420 

TCATATTAGC GGATTGAAGA AAATAGATGA CTACACTATG CAGGTTACAT TTGATAAAAA 480 

ACAAGAAAAT TACTTAACAG GATTTATTAC TGGACCTTTA TTAAGTAAAA AATATTTATC 540 

AGATGTACCA ATTAAAGATT TAGCGAAATC AGATAAAATC CGAAAATATC CTATTGGTAT 600 

TGGACCGTAT AAAGTTAAGA AAATCGTTCC AGGTGAGGCT GTTCAACTCG TTAAATTTGA 660 

TGATTATTGG CAAGGTAAGC CTGCACTAGA CAAAATCAAT TTAAAAGTTA TTGATCAAGC 720 

GCAAATTATT AAGGCAATGG AAAAAGGCGA TATTGATGTT GCGAATGATG CTACCGGTGC 780 

AATGGCAAAA GATGCTAAGT CATCTAATGC TGGTCTCAAG GTATTATCTG CGCCAAGCTT 840 

AGACTACGGT TTAATAGGtT CGTATCTCAT GATTACGATA AAAAAGCTAA TAAAACTGGT 900 

AAAGTGAGAC CAAAATATGA AGACAAAGAA TTACGTAAA6 CAATGCTTTA TGCAATTGAT 960 

AGAGAAaAAT GGATCAAAGC GTTTTTCAAT GGTTAOGCTA GTGAAATCaA TAGTTTTGTA 1020 

CCATCTATGC ATTGGATAGC AGCCAATCCT AAGGACCTAA ATGATTACAA ATATGATCCT 1080 

GAAAAAGCTA AAAAAATCTT AGATAAGTTA GGTTATAAAG ATAGAGATGG TGACGGATTT 1140 

AGAGAAGATC CTAAAGGTAA TAAATTTGAG ATTAACTTTA AACATAATTC AGGTTCTAAT 1200 

CCTACTTTTG AACCAAGAAC TGCTGCGATA AAAGATTTCT GGGAAAAAGT TGGCTTGAAA 1260 
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AATACGATTC CTGTTTATAT GCCATATATC ACATCTTATT TCATGACGCG TGCTATCGGC 1380 

GACAGACCTT TAGTCGTCCC GCATCAATCT CAGAACTTAG CATTTATTGG TAACTTTGCA 1440 

GAAACAGAGC GAGACACTGT ATTTACAACA GAATATTCGG TTOGTACTGC CATGGAAGCT 1500 

GTTTATCAAT TACTAAATAT AGATCXyTGGT ATTCCAGAAG TCATCAATAG TCCATTTGAT 1560 

CTTCGCGTCT TAATGGATGC CATATACGAA CTGAATGACC ACCAAGATTT GCGTGAGATT 1620 

ACTAAAGATT CGAAAATGCA AAAACTCGCA TTAGCAGGAT TCCTTAAAAA GATAAAAGGT 1680 

ACGTACATTG AGTCATTATT AAAAGAACAC AAATTGTTAT AACGAAAACC ATTAATAGAT 1740 

TTTTATTTGG TGATTTCAAA TCATGAGACT GGGACAGAAA TGATGTTTTC ATAAAAATTA 1800 

TTTCQTTGTT CCACTCTCAT GATTTTTTTG ATGAAACATA ATTACATGAT TGATTGCATC 1860 

ATTTTGTTAA ACAAGTGATT GCAAACCTGC CATTTCACAC TGAAAATTTA CATAATAAGT 1920 

GACGATATTT TACAAGTCAT ATACAAATAA CATATATTGT TAAATAATTT TACCTAATCT 1980

TAACATTAAA TTTACAATTA TAAGCGATAA TCTAAATATA AAGCTTATTT GAGGTGAAAT 2040 

AATGGAAATG TCGGTTACAG AAGTCATTTT CTCCTTTTTA GGTGGTTTAG GTATTTTCCT 2100 

TTACGGCTTA AAAATCATGG GAGACX3GGCT TCAAGCATCA GCAGGAGACA GGCTACGAGA 2160 

TATTTTAAAC AAATTTACAT CAAATCCAGT ATTAGGTGTT ATTGCAGGTA TCGTTGTAAC 2220 

TATTITAATA CAAAGTAGTT CAGGTACGAC AGTTATCACA ATCGGACTGG TAACAGCTGG 2280 

ATTTATGACA TTGAAACAAG CCATTGGAGT GATAATGGGT GCTAATATCG GAACAACGGT 2340 

AACTGCATTT ATTATCGGTA TAGATTTAGG CGAATATGCA ATGCCAATTT TAGCATTAGG 2400 

TGCATTCTTA ATCTTTTTCT TTAAACGCTC TAAAATCAAT AACATTGGCC GCATACTATT 2460 

CGGTTTCX5GT TCACTATTCT TCGGTCTAGA ATTTATGGGT GATGCCGTTA AACCTTTAGC 2520 

ATCMTAGAT GGATTTAAGC AATTAATGCT TGATATGTCT ACAAATCCAA TACTCGCTGT 2580 

CATTGTCGGC GCAGGGTTAA CAGCACTAGT TCAAAGTTCA AGTGCGAOGA TTGGTATTTT 2640

ACAAQAATTT TATCAACAAG ATTTAATTAG CTTAAACGCA GCAATCCCTG TGTTACTAGG 2700 

CGATAACATT GGTACCACX3A TTACAGCTAT CTTAGCTAGT TTAGCCOGCT CAATCGCTGC 2760 

AAAACGTGCG GCGCTTGTAC ACGTCATCTT TAACTTAATC GGGGTAATTA TCTTCACAAT 2820 

TTTCTTGCCA GTTGTGATTC ATTTGATTAG TTTGTTACAA GATTTATGGC ACTTAAAACC 2880 

AGCGATGACG ATTGCAGTAT CACATGGTAT CTTCAACATA ACAAATACTT TGATTCAATT 2940 

ACCATTTGTA GCAGGTTTAG CATGGATTGT TACAAAGCTT GTCCCAGGTA AAGATATTGC 3000 

TGATGACTAT AAACCTCAGC ACTTAAACAA AGATCTTGTT TATCACGCAC CTGGTGTTGC 3060 

AGACATTCGC GAAATTACAA AAGAOGATAA AAAATTGATC AAAAAGCTTG AACAAAAGCA 3180

TCAAGCTGTT GAAACAATCA ATGATAGCAT TCGAAATTAT TTAGTTAGAA TTTCTACAAA 3240

AGCCATTACG AAGGCAGACG TTGAGCGTTT AGCAGTTATG TTTGATGTCA ATCGCTCTAT 3300

TTTAAAAGTA GCAGAGCTAA CAGAAGAGTA TGTCGCTCAA TTAAAACGCC AACATGATGA 3360

AGATATTCGC ATTACAGAAG ATGCACAACG CGGTATGGAT AAATTATTCA ACCATGTTGC 3420

TGAGTCATTT GATAAAGCCA TCGACATGTT AGATGTTTAT GACAAAACGA AAAAAGATGA 3480

AATTGTAGAA CGTAGTAGAG AATCATTTAA TATTGAACAT AAACTACGCA AAGGTCATAT 3540

TAAACGCCTT AATCGTGGTG AATGTACAAC AAAAGGCGGA TTACTATATA TCGATATGAT 3600

TGGTGTTCTT GAACGTATCG GTTATCATTC ACGAAATGTT TCTGAAGCAC TTGTTGGCCT 3660

TAACGATGAT GTACCTACAG ATGAAGAAAT TGCAACAACT GAAATTTAAT TTTTACTGTC 3720

TTATTTATAT TCATATTTTT TTAAAATTAO AGATTCAGAT GCATGTAAAA AGCCAATCCA 3780

ACATTCATGG GTTGGCTTTT TTGTTTAGCA AAATTTATTA TCTTAAATCG GCTATAAACA 3840

CTGATATAAT AATGCTTCAT TAGTATGCGG TAAGCATGAC GGACACTGTT CTCGGAGTCT 3900

GACCCCGAAA CGTTTAATAT ACACTTTTAC ACGTCGCCTT CATTGAAGCG AATTGCCATA 3960

ACCTTCACAT TATATATAGT TCTTTCCATA TAAATGTCCA AATTTTTAGA ACAACGCAAT 4020

AAATAACCAT CCACCTAACT TATCAAAAAT TTAAGTGGAT GGTTTTTCAT TTTCATTTAT 4080

ATTTATATTA GTGTTAATCC AATCATAGAT TTATCTATAT GCACTGCTCT ATACATTTCC 4140

TCATTTAATT TGCTTTACTT TCATTTATAT CATTATCAAA ACACTTGGCG TGTCATCGTT 4200

ATTATTTCGC ATCTTTGACA CGTTTATCAT CATTAGGAAT CGCGAATAAA ATTGCGATAA 4260

ATGCCATGAT TCCCATTAAT ACGTTAACCC AAAGTGCAAT CATCGCACCT GTATGAATGC 4320

TCGTOCAGC AACTGCACCA GCATATACAG CACCACTAAT TGCGACACCG AATGCGCCAC 4380

CAA6TGATGA AGCCATTTTA TAAATACCTG AAGCAACGCC AACTTTATCT AACGGTGCAT 4440

TCGAAATAGC TGTATCTGTA GAAGGTGTTG CATAAATACC TAAGCCTAGT CCGAAACATA 4500

AATATCCTAC GACACAACTG ATAACATAAA ATATGCCTGG TAAGAATACT AATGAAATAA 4560

GTGCAATACC AATGACCACA ATGAATGTAC CTAATAACAT TGGTCGCTTA GAACCCATTT 4620

TTTGTAATAA TTTTTCACCA ACTCGAATCA TCAATAACAC CATGATTAAA TAAGTAATTG 4680

ATAAGTATCC TGCCTGCAAT GCTGTATAAC CTAAACCTTG TTGCACGAAT GTATTCGCTA 4740

CAATTAATGT ACCTGCAAAA CCGTTTAATA AGAAGTTCGA AATCGTTGCA CCTGTATATG 4800

GTTTATTTTC AAATAATTTA AAATCAATAA GTGGATTATC TACTTTTTTC TCAACATTTA 4860
AACCAAGTGC TGCACCTTTA GTAATGACAA CGTTTAAACT TAGCAACATA ACTACTAGAA 4980

CAATTAGCCC TGCAACGTCA AATTTATGTG TATTGGTAAT TTCTGATTTC GTTTCAGGCO 5040

TCCCTTTGAT GAGTAACATT GAAAGTACGG CAACGATAAT TGAGAAGATG AAAATCCATC 5100

TCCAACCCAT AGTTGTCGCA ACTGCACCAC CGAAGAGTGA ACAGATACCA CTGCCACCCC 5160

AAGAACCGAT AGACCAATAA CTTAAGGCAC GCTGACGTTC AGCACCCTGA TAATAAGTTT 5220

TCATAATGGC CAATGTAGAA GGCATAATAC ACGCTGCTGA TACACCTTGT ATAACACGAC 5280

CTAAAATTAA TAATGCCGGT AAATTCGTAA TAATAATTAA TGCTGAACCA ATAATACTTA 5340

ATAATAAACC GATATTCGTC ATTTTCACGC GCCCAATTTT ATCTGCCAGA CCACCTGCTC 5400

CAACAACAAA CATGCCTGAA AATAGTGCAG TTAGACTGAC CGCAATACTA ATTGTCCCCA 5460

TGTCTGTACC AAAACTTTGT TGTAAATTCG GTACAACATT TACAAGTGAT TGTGCAAACA 5520

ACCAAAATGT AATAACACCT AATACAATAC CTAAGATTAA CTTGTTGCCC CC6CGATACG 5580

TTTCATTCAT GTTAGTTATC TCCTTTAAGG TAATCTAAAA CAACTGTCCC TACTGCTTCT 5640

6CAGAAATAA GTAATGATTT TTCTGAAATG TTAAATTTAG GATGATGATG TGGGTAAATT 5700

TCACCATTTT CCACCGCTGC ACCPGTATAA ATAAAGGCAC TTGGGCGTTC TTTAGCATAA 5760

TATGCAAAGT CTTCTGAAGG TGGTTGTGGT TCACACATTT CAACACCAAA ATCAAGGTTT 5820

GCTTCTTTCA ACGTCTTAGC CACGTACTCA GTAAACTCTG GATCATTATA TAATGCTGGA 5880

TAATCATCGT TATATTCTAA GGTGCAAGTt ACACCATACA TATCCTCTAA TCCTTTTGAT 5940

AAACGTTTAA TTTCTTTTTC AATTGTTGCT TTTGTAGCAT CTGTTAATCC ACGTACATCA 6000

CCTTCAATTT CAACAACATC TTTAATGACA TTGAATTGAC CTTTACCGTC AAATGAACCG 6060

ATTGTGACAA CACCGGTTTC AAATGGACTT AGTCX3TCTAG ATACAACTGT TTGTAACGCT 6120

GTGACGAAGT AGCTACCTGC AACAATGGCA TCATTGGCCA TATGTGGTGA TGAACCATGA 6180

CCACCTTTAC CTTGAACTTT CAATTTGAAG AATGCGCGTC CTGTTTGAAC ATAACCAGGT 6240

CTGTAATACA CTTTACCTGT TTTCATTGTG CTCATGACGT GTACACCTAA TACATGATCA 6300

ACACCGTCTA ATACACCATT TTCAATCATT GTTTTAGCAC CACCTGGTGG TACTTCTTCA 6360

6CTGGTTGAT GTATCACAAC GACTTTTCCT GTAAAACTAT CTTTCATTTC AGCAAGCGTC 6420

TCTGCTAATA CAAGCATGTA TGCTGTATGT GCATOGTGAC CACATGCGTG CATAACACCT 6480

TTATTTTGTG ATGCAAAAGA TAATCCTGTA TCTTCAGTAA TGGGTAATGC GTCAAAGTCT 6540

GCACGGATTG CTAATGTTTT ACCAGGTTTC CCTGAATCAA TCGTTACTTT AATTCCACGT 6600

GGTCCGACAT TCGTTTCTAC TTCCACATCT TTACCTTTGT AAAATTCAGC GATGTATTTC 6660
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ATCATTTTGC CTTCTTTAGA TTTTAAAGTT TCAATTAATT GTTGATTCAT ATCCTTCATC 6780 

TCCTTAGTTA CATCATAAAT GATTAATCAT TATTTATATT GCCAACAACA GAOATGTTAA 6840 

CCATTAATTT TTTGCAATTT TAGCTTTCAA TATAAAAAAT CACAAATTAT GTATATCAAA 6900 

ATTTGTGATT TGTGATCATT TTATGAACTT GGGTAACGTT TTACTTCAAT TAAGTGAATC 6960 

CCATTCGTAA TCATTTTAAT GTTTAATGCC AGTGTGTCCG TGATATCTAT ATCATATACT 7020 

TCTAATTTCG GAAAACTCAT TCGATTAACG TAATCTATAG AGTCCTTGTC CATGCCATGT 7080 

ATCGTATGAT GTTTGCGCCA AAGATTAAAT AACGCACCAT TTTCTTTATC TAAGGTAAAA 7140

TGTTTAATCT TATACATACC TTCTTCCAGG GCATTAATGT TCAAATGAAT CATTTCCGTC 7200 

GCACGCATAT TCATTTGATT GTCCAACX3CT AAGTACGGAT TAAAATGCTT TGCATCATAT 7260 

AACAATATTT GAAAATTTGA ATCAGTCCCC GTGACAATAC ATGTATCATC AGAATACAAA 7320 

ATATTGCTTG TTAATTTATT AAATAGCAAT GCCGTGAAAT AGACCGGACG TTTTCCATTA 7380

TATTGATGAA ATAGTTCAAT AGAATTCATA TAATCCCGTT CATTTTTACA ATGACTGACG 7440 

TGCAAATCAT AATTCAACCA ATACCCGATA CCCTCTACTT TAGAACTTAA TTTTAATAAT 7500 

TGCTCAATGA TGATACCACC TCTAAAATAT TCGCCGTTTG TAATAAATGT ATCACCCGTC 7560 

AATGTATTCC AATTGAGTAA AATGAGTGGA CGCTTTAGGC GATGACGATG CATTAAGTCG 7620 

ATAAGGTAAT TCGTTTTATT AATAATCATT TGACTCGCGG TTTTAAATTC ATCATCATTC 7680 

ATTTTATTAA AATCAACAGC GTCATTTGAA TTGGCATCAA ATACAAAATG GTCGATGTGT 7740

GGCTCAAGTC GTTTCAATAA TGGTAGATGT CTTTCCGTAG CTTGATCTAA GTGAATGTAC 7800 

AAGCCACCAT TAGGGAATAA TGCTTTAAAA TAATCAATCA TTTCAATCAA AGACGTGTGC 7860 

AATGTCGTCA CATACAAGTT GAACTTCAAA TCTTTTCTAT GACTGACATG CAGGGCAACG 7920 

TGATGGATAA AAATTTTAAA TGCATCGATA TAATCACGTG AGTCATACT6 ATCCAAATGC 7980 

ATGGTCAAAC TAAAGTTATG ATCTAATAAA AAGTCTAAAC ACAAATCAAT ATCATAAAAT 8040

ATATTCGAAA TTTCTGCATC ATACGTGAAT GGCGCATTGA GCTTTTTCAT GATATATGGA 8100 

ATCACATCAT ATGCTAATAC TTCATTGACT TGAAAATCAT OATGACATQT AAGCAACTGT 8160 

GATTGATACT GTGTATTGAG CAAATTCCTC AAATAGCCCA CTTGAATAAT ATGATTAAAT 8220 

TGATTTAGTT GGTGATTGGT TGGTTGAAAG GCAATCTCTT TATAGTTCAT CTTTTCAATA 8280 

TCTTCAATAA AATGATTCAT TTCTTCAATG TAGTCATTTA AAAGTAATAT CAATTCACGG 8340 

TCGTGATAAT CATGTTGTGC CGATTGCTGG TTTTCAGTGA TTGCTGGACG ATCACCTCGA 8400 

TATTGTTTAG GTGTTTGATG CGTAAAGTGT TTAAATGTTC TOGCAAAGCT CGCTGCACTT 8460
TTCGCATGCT CAATTCGCGT CGTATTTAAG AAATGATGGA ATCCTACACC TAGCGATTCT 8580

GTAAACTTTT TAGACAGATG GCTCTCTGAC CACCCAACGT ATTCGCTTAA TTCTGAAAGG 8640

CTTAAATCTT CATGAAAATG TAACTCGATA TAGTCGCATA CTTGATTCAC TTTATCATCA 8700

TTTAAGATAC TTTGGTTCGA ATGATATGTA CGCX5GGACAT AATGAATCAT ATGCATAAGC 8760

AACTGAATCA CAAGTTGTTG CTCAGTCAAT TTAGACAACT CATTATGTCG GATATGTGTT 8820

GAAACCAGTC TTGCCATTAT ATTTCTCAGT TGATGTATAT TCTTTGTTGT GGTCGCATCT 8880

GTTAAGTGAA AATATAGACA ATGCACATCA TCAAACTTGT CTGCTAAATA TTTCATTTGG 8940

AATTGGATAT AACATATGAT GCCATCTTGT T6AAGTTGAA ATCGATACAA GTG6CGGTGG 9000

TTAATGATGA AAATGTCGCC ACTGTTGCAT TGCGTCATAT TATTTTCATC ATAAATGTGT 9060

GCCTCnCCTT TAATAACAAA ACCAATCATT AAACTATTGA GCCTTTTGAA ATCTGACATA 9120

CTCTCAGTTT CTACTCGAAT TAAATAATCA CGTTGCATAC TATCCCTCAA TTCAGTAATA 9180

TGAATACGTT TATTTTACAT TATTTTACAG CAACATATTT GAATTTCATA TTGAATCGTG 9240

TGTGTGGATG ATTATTTATC CTCACTCGGT TCAAGATGTA GACTATCAGT AAAAAAAGTA 9300

TTTTCACCTT TTTTCTCCAC AAAAGTAAAT TCAATGTCTT TATATCCAAC TGrTGaACCT 9360

TTTAAGTCTC CCgAACCTTT CaACaATAAC TTTGGTGCTT TATTCGTTGG TATTTTATAT 9420

CTTTTTCGTA ATTGTTTTAC ATTATAGTCA TCATTAGTTA ATTGATATTT TGCTGAATAA 9480

CTCGGTACCT CTGGATTATA TGATATATCG CCGTCTTTGT ACTTCGACAA ATCTTTAAAG 9540

CTGCCATATT GCGCGAAGAA CTTAAAATTC TCGATTTCTT TTTTTATATT TTCGTCTTTG 9600

ATACCTTTAG TTGGAATGAT TTTATTGTCT ACCATTTTAA CGGGATATTC TTTATCTTTA 9660

CTCTTAGGTC TACCATCTTC ATCATGAAGT GTTTCACTCA CTATATACTT CCCGGTTGTA 9720

GTCTTAGTGT TTCTATTCAT ATATAGAACC ATACCTTTTG ATTTCATACG TTCCCCTTTA 9780

GGTTGAACAA CCATTTCAGA ACCAATAATC CATGTACCTT TATCATTTTT ATCAAATTCG 9840

TCATCACGAT AACCTTCTTT ATCGTATAAA TCCTCTAGAT TTTTAATCGG ATACATACTC 9900

AATGTTTTTT CAAAGCTTTT CTTAACTTCC GCTTCTTTAC CTATGCCACA ACCAGCAGTG 9960

AAACTAATGA CTAATATCAA AAAACTAATA TACAATACCA ATTTGTTTAA TOGTTTCATA 10020

ATTTCACAAT CCTATTCTTC TTATTATCTT TCCTGGATTG ATTTCATATT TTGATCGAGT 10080

CATGATTATT TATCCTCACT TGGTTTAAAA ATTAACCCAT CACTAAAGTA AATGTTCTCT 10140

TCTTTTTTCT CTACAAACGT AAATTCAATG TCTTTATATC CAACTGATGA ACCTTTTAAA 10200

TTCCCTGTAC CyTTCAACAA CArCTTCGGy GCTTTATTTG TTGGTATGTC ATATCTTTTA 10260
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ACCTCTGGAT TATATGATAT ATCTCCATCT TTATAATTCA TTAAATCTTT AAAATTGCTA 10380 

TATTGCGCAA AAAACTTAAA GTTTTCGATT TCTTTTTTTA TGTtTTCTTC TTTAACTTCC 10440 


TCAGTAGAAA TGAATTTATT ATTAATCATT TTAACTGGAT ATTTTTTTTG ATTATCCTGA 10500 

GCTACTTCGT ATTTCTCCGT CTTTAtTTCA TTAGTATAGT AAAAtCCTTT TGCACTTCTT 10560 

GTATTTCTAT CTATCTTCAA AAGCATGCCT TTTATTTTTA GAGCTTCTCC TTTATTTTGA 10620 


ATTGCCATTT GAGAATTTAC AATCCATGTT CCCTTATCAT TTTTATCAAA TTGATCATCA 10680 

CGATATCCTT CTTTATCGTA TAAATCCTCT AGATTTTTAA TCGGATACAT ACTCAATGTT 10740 

TTTTCAAAAC TTTTCTTT 10758

(2) INFORMATION FOR SEQ ID NO: 222: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1109 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 222:
nTaTCaACTT TGGaATTTAA AgTCAATAAC TTTTTTAAAA ACTTTTTGTG TTCACAACCC 60

GCTTCTTTTT CAACGCGTTT ATTGCTTAAC ACAAGAACTT ATTTTACCAG CATTCCAAAA 120

CAAATCAACA TAAAAACGTA CAAAATAAAA GTAATTTTGT ACGTTTAGCA TATATTATAC 180

CTATTTATTT GTAGCAGCTA TAACTTTTTG TGCAATCGAG CTATAAATTT TACCTAGACG 240

ATCATCTGAT TGATATATTG ACGGTGCAAA ATCTTTTGGA TTCCAAGATG GTTGCTCTAA 300

AGGTAATTCC CCAAGTAATT 6AGTATTAAG TTCATCAGCT AACTTAGTAC CQCCACCTTT 360

GCa$AAGACA TATTCTTTAT TACCCGTCTC TTTACTTTCA AAATAACTCA TGTTTTCAAT 420

TACGCCAAGA ATAGAATGAT CCGTATGTTT TGCCATCGCA CCTGCGCGAg CTGCAACAAA 480

TGCTGCTGTA GGATGAGGTG TCGTTACAAT AATTTCCTTA CTTGAAGGTA ACATCGTATG 540

AACATCTAAA GCTACATCTC CTGTTCCAGG TGGAAGATCG AGTATTAAAT ATTCAATGTC 600

TCCCCATTTA ACTTCTGTAA AGAAATTCGT CAACATTTTA CCTAACATTG GCCCTCTCCA 660

TATAACTGGC GCATTTTCTT CCACAAAAAA GGCCATIGAT ATAACTTTAA CGCCATGACG 720

TTCAACTGGA ATTACTTCCT TCCCTTTAAT TCCAGGCTTT TCATCAATAC CCATCATATC 780

TGGTACACTA AATCCATATA TATCGGCATC TACTAATCCG ACTTTTTTCC CTTCACGAGC 840

TAAGGCAACG GCTAAATTTA CTGCAACAGT AGATTTACCG ACACCACCTT TACCGGAGGC 900
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ATnTCTTCT TTTGGTTTAA ATTGATTTAC TTTTTCTTCC GGCAATGTTT CAAATCGTAT 1020 

ACOGACCGTT TTCGCACCGT TTTCTTTTAA TGCATTAACA ACAGCCATCT GTAAATCTAA 1080 

aTTGCGtCCA CCACCTAATT GTGCCATTG 1109

(2) INFORMATION FOR SEQ ID NO: 223: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3997 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 223: 

TCTTTATTTA AAAAAATGAT TGTCTAGTTT GTATCTCTCT GAAGATTTGG CAATAAATAA 60 

AAGCCGATAA CCGTATAATG ATTATCGACT TAAAGTTTAT GTGGCATTTT TTACTTTTGT 120

AATTTCAGGT GAGTTAGATG ATTATTATCA GATAGATTAT TGCITATAAT CATATGATGT 180 

TTGAATGATA TCTTTGATTT CACTGATTAG TGCTTCTTTA GGATTAGCAG TTGTACATTG 240 

ATCTTCAAAT GCGAGCTCTG CCATTCTATC AATTGACTCA TTTAATTCTT CTTCAGACAC 300 

ACCTTGTGAT TTCAAATTCA TTTCAATTCC GACTGATTGA CCTAATTCGT AGACAGCTTT 360 

AgCTAATGAT TCTACGAGTG CTTCTGTCGT ATTACCTTTT AATCCTAAGA ATTTGGCAAT 420 

ATCTGCATAA TCTGTATCTG CTCTGAAGAA CTCATATTTA GGGAATAATG CATGTTTTTG 480 

CGGGTCTTTG GCATTATAAC GGATAATATG CGGTAGTAAT ATCGCATTCG CTCTACCATG 540 

CGGAATACCA TATTCGCCAC CAATTTTATG CGCAATTGAG TGTGCAATGC CTAAGAATGC 600 

ATTTGCAAAT GCCATACCAG CCAAAGTTGA TGCGTTATGC ATTTTCTCTC TTGAAACTTT 660 

ATCACCCTTT TCAACAGATG ATTTTAAATA TTCGAACGTC AATTTAATCG CTTGTAGACT 720 

CAAACCTCTT GTGTAGTCTG AAGCCATTAC AGATACATAT GATTCCATTG CATGCGTTAG 780

TACATCCATT CCTGTATCTG CTGTAACGCT TTTTGGCACA CTCATCACAA ATTGAGGGTC 840 

AATAATTGCA ACGTCAGGTG TTAAAGCAAA ATCAGCCAAC GGATATTTTA CATTTGTTTC 900 

ACTATCTGTG ATAACTGCAA ATGGTGTTAC TTCTGAACCT GTACCTGATG TCGTAGGGAT 960 

ACAAATQAAC GTCGCATTTT CAGGCATGCC TATTTTATAA GTACGTTTAC CGATGTCTAG 1020 

GAACTTTTGT TTAGCACCGA AGAATGATGT CTCAGGGTGT TCAAAGAACA TCCACATTGC 1080 

TTTTGCAGCA TCCATCGCTG AACCACCACC AAGTGCAATG ATTGTATCCG GTTGGAAATC 1140 

AACCATCATT TCCAGACCTT TATATACTGT ATTAGTTGAT GGGTTCGGTT CGACTTCGCT 1200 
ATAACCGAAT TCTACCATAC CAGGGTCACA GACAATCATC ACTTTTTCAA TCTTGTCCAT 1320

TGTTGTTAGA CTCATGATTG CATTTTCTTC AAAATAAATT TGAGCAGGCA CCTTGAAAAT 1380

TTGAGTATTA TTACGTCGTT TAGCAATCGT TTTAATGTTT AATAAATCTG TCGCACTAAC 1440

ATTATGTGAA ATTGAGTTTC TACCGTAGaA CCACAACCTA ATGTTAAAGA CGGAATCAAT 1500

TCGTTATACA TATCACCAAT ACCTCCAACC GCTGATGGTG TATTTACAAG TACACGACAA 1560

GCTTTCATTC TTAGTCCAAA ATCTnTTGT AATGTTTCAT CTTCTGTATG GATAACGGCT 1620

GTGTGTCCTA ATCCACCAAA ATGTAGTGTG TCTTCACAAA TTTGAAATGC TTGTTTTGTA 1680

GATTGGGCTT TTACTAAGGC TAATACTGGA GATAATTTTT CACGAGATAA CGGATAGTCT 1740

GAACCTACAC CGCTAATTTC GGCTATGATA AGTTTTGTAT TTTCGGGGAC AGGTATACCT 1800

GCTAATTCAG CTATTTCAAC TGCAGATTTA CCGACAATAT CAGGCTTAAT ACCTGTTTTT 1860

TGTTCATTcA TAATTGCATT TTCTAAGCGT TGTAATTCAT CTTTTTTAAC AAAGTATGCT 1920

TGATGTGCTT TAAATTCATT AGTAACATCT TTATAAATTT CTTTATCAAT GACTACAACT 1980

TGTTCAGAAG CACAAATCAT ACCATTATCA AATGTTTTTG AACCAATGAT ATCATTTACT 2040

GCACCSTTTAA TGTGTGCTGT TTTTTCAATG TAAGACGGCA CGTTACCTGG TCCCACACCT 2100

AATGCCGGTT TGCCAGTTGA ATATGCAGAC TTAACCATGC CCGAACCACC TGTTGCTAGA 2160

ACTAATGCAA TACCTTTGTG ATTCATTAAT TGTTTTGTTG CTTCGATAGA AGGCACTTCA 2220

ATCCACTGAA TAATATCTTT AGGTGCACCT GCCTTCATTG CCGCTTCTAA TACAACTTCT 2280

GCTGCACGCT TCGACGATTC TTGTGCACTT GGATGGAATG CAAAAATGAT TGGATTTCCT 2340

GTCTTAATTG CAATCATCGC TTTAAAAATA GTTGTCGACG TAGGATTTGT TGTTGGCGTA 2400

ACACCACAAA TAACACCAAT TGGTTCCGCT ACATACGTTA ATCCTTTTTC TTTATCTTCA 2460

ccaaSaatcc CTACTGTCTT ATTGTCTTTT ATTGAATTCC ATATATATTC AGAAGCGTAT 2520

AAATTTTTAA TCGCTTTATC TTCGTATATA CCTCTTCCAG TTTCTTCATG TGCTAATTTT 2580

GCTAGCACCA TATGTTGATC AACAGCTGCT AAGCTCaTTT GATGAACAAT ATGATCAATT 2640

TCTTCTTGTG actttttaga taatgcttct AATGCTTTTT TCCCTTTGTC AGCTAGA6CA 2700

tcaatcataa ttgccacttc ttgttctttc GATCCACGAT TTTCTTTTTC AGGTATAGTT 2760

AACATATACA ACCACTCCTT TATACTTTGT GAATTATTTC ACAAACATTA TAGTACATGT 2820

CTCTCAGGAT ATAAAGAAAA TTCTATACAA AAAAGTTTAA TTTCGAATAT TATTTGAACA 2880

AATATCAAAT TTTAAAATAA ATGTTTTCAT GAAATCATTG TTATTTCGGT GTTTTTAGAA 2940

TGATTTTATA ATCATAATTT TTTCAATGAC ATAATTTATT CATAATTATA TATTTAATTC 3000
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20 






TCCTTGTCGA 


TACCTATCAA 


CAGATGTTAC 


AAATAAAAAC CaCCCGTGTG AACGGGTX3GT 


3120 


TTGTTCTGCG 


gCTATAAGCC 


TTCCTTACTG 


GCCaGCCCTA AAAGGGCACT GACAAGTCAG 


3180 


CCAACTGCAC 


TACTATTCCA 


GCAATCCTAA 


AGGTTTACTC TTl'mC'lTi' CTl''l"rrrTAT 


3240 


TTTTCTCTCC 


AGTGAAAGGA 


TCTAAATATT 


CTTCCATTGA AATTTCATCT GCAACGATAT 


3300 


CCTCTTGTAA 


TTGATTACGA 


ATATAATTTT 


CAATCACTTT TTTATTTCTA CCTACTGTAT 


3360 


CCACATAAAA 


TCCTTTACAC 


CAAAACTTTC 


TATTTCCATA TCTATACTTT AAGTTAGCAT 


3420 


GTCTATCAAA 


TATCATTAAA 


TTACTTTTCC 


TTTTAAATAG CCAACAAATG ATGATACCCC 


3480 


AAGTTTGGGT 


GGTATACTTA 


CTAACATATG 


GATATGATCT TTACATACCT CTGCTTCAAT 


3540 


TATCTCTACA 


CCTTTTCTTT 


CATATAATTG 


ACGTAATATA ATCCCTATAT CrmTlTAT 


3600 


TTTTCCATAT 


ATCGCTTGTC 


TTCTGTATTT 


AGGTGCAAAG ACAATATGTT ACTTACAATT 


3660 


CCATTTCGTA 


TGTGCTAAAC 


TGTTTGTGTC 


AGATGACATT AAATAGCATC TCCTCGTGTT 


3720 


GATTATTTTG 


GTTGGCTGAC 


CAATATTTAC 


TCTAACATGT AGAGATGCAT TTTTTTGACA 


3780 


ATGGTAGAAC 


CTTTTCTGGG 


GAGTGGGACA 


GAAATGATAT TTTCX5CAAAA TTTATTTCX3C 


3840 


CGTCCCACCC 


CAACTTGCAT 


TGTCTGTAGA 


AATTGGGAAT CCAATTTCTC TTTGTTGGGG 


3900 


CCCCGCCCCA 


ACTCGCATTG 


CCTGTAGAAT 


TTCTTTTCGA AATTCTCTGT GTTGGGGCCC 


3960 


CTGACTaGAA 


TTGAAAAAAG 


CTTaTTaCAA 


GCGCATT 


3997 




(2) INFORMATION FOR SEQ ID NO: 224: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1391 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224: 






GnGCGAGACA AACACACtTA TTGGTGCCAT TATtaCcTAGA 


ATGaATTCaT 


ATGCAGTAGA 


60 


TGAaaCAATC AAAGGATTGG CAAAACAATG CCAAAAATAT 


GAATCaCAAT 


TAATTTTAAA 


120 


TTACACAGGT TTAAATATCG AAGCAGAAAT ACAAGCGCTT 


GAAACATTAG 


CACGCaGTAA 


180 


AGTAGATGGT ATTGTTTTAA TGGCTACAGA CATAACAGAG 


AGACATATTG 


AAGTCATTAA 


240 


TAAAATGAAT GTACCAATCG TTATTGTTGG TCAACAACAT 


GAACAACTTC 


ATAGTATTGT 


300 


GCATGATGAT TATAAAGCAG 6TCAAATTAT AGGCGAATOG 


ATTGGTCAAC 


AGGGATATCA 


360 


ACAAGTTGAA GTGTrTAGTG TAAGTGAAAA AGATATTGCA 


GTTGGTATAC 


ATAGAAAACG 


420 
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GAACAACTTT GATTCTGTCG ACTCCAATGA TITTATCAAA TTCATTTAAT ACX3CCATCAA 540 

AGTCCTCTTT AACATTATAT CCAGCATCAT GCGTATGACA TGTATCAAAA CATACTGATA 600 

AACGTTCGTT ATTATGAACT CCATCAATAA TACGTGCTAA CTCTTCAAAT GAGCGACCAA 660 

TCTCTGTACC TTTACCTGCC ATCGTTTCAA GCGCAATACG TACATTATTG TCATTCGTTA 720 

AAACTTCATT TAATCCTTCA ATAATCTTAT TAATTCCGGC ATCAACACCA GCTCCAACAT 780 

GCGCACCTGG ATGTAATACa ATATCTTTAG CCCCTATAGC TTGCGTTCTk TCeiATTTCTT 840 

GTTGCAAGAA ATCTACACCA AGATTAAACG TTTCTGGTTT GGTTGTAttG CAATaTTaAT 900

GATGTATGGT GGCATGAACA ACAATATTAG 930 
(2) INFORMATION FOR SEQ ID NO: 226: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1984 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 226:

TGACGCACCA ATTTATAACG CAATTGACAA AACAATTAGA TATACCTGTO AAATTTGTAC 60 

CTGGAAACCA TGATTTATGG GAAGTTGAAA GTATGACTAC GCAAGACATT TGGAATAATT 120 

ATAAGAGTAT GTCACAGTGC TTGGTAGGAA AACCATTTAT AGTAAATGAA GAATGGGCAA 180 

TCATAGGACA TACTGGCTGG TATGATTATA GCTTTGCAGC ACAACGATTT TCATTAGATG 240 

AGTTACAAAA AGGAAAACAT TATGGTGCGA CTTGGCAAGA TAAAGAACGA ATATCTTGGG 300 

GCATATCAGA TCAAAATTTA TCTAAAATAG CGGCTGAACA A6TGAAGAAA GATATATTAG 360 

AAGTAGGAAA TAGACGAGTG ATTTTAGTCA CACATGTTGT GACGCACCCT GATTTCATTG 420 

TTCCTATGCC GCATCGTATA TTCGATTTTT ATAATGCATT TATTGGGACA AGTGATTTCA 480

ATCCTTTGTA TGCGATGTTC GATATACCAT ATAGTATTAT GGGCCATGTT CATTTTCGTA 540 

AAAGTGTGAT AGATGATGGC AGATGTTATC TCTGTCCGTG TCTAGGCTAT CCAAGACAAT 600 

GGCGTTCAGA AGATATTTAC CAGGAAATAA ATGAGACGAT ACAAATAATA GAAATTTAAA 660 

ATGCGCAAAC CTGACCCAGT TTGCGCATTT TATGTTTTAC ACACGCGAGT AATGTGTTTA 720 

CTTACGTGTG TTTATTTTGT TGCTGATTTT CAATTGTATA TGAATGTGGT TGCACATAAA 780 

TGCACTTTCT TCCTGGTQAA TTAAAGCTGT ATTCCATTTT CTCTTTACGA ATTTTAATAA 840 

TTTGTTTGCG ATTTGGAATG ATGGCAGGTA AAACTAOGCC ACGACGAATA TGACTCCAAA 900 
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TTGAAACTTG TTTCGCTGGC TTGTTATCAA AGCGGAAAAC ACGTAGTAAT GGTTTAGAAC X020 
CAAGATTAGT ATGGTATATT AACACAGGTT GACCTTGATC GATAATACCT TTAAGATCTT 1080 
CTAACGATTT ACCAGTGCCG TCTACGATAT TAGGATTGTA TTTTTGTAAA AATGGTACAT 1140

ATGCTTCTGG AAATATCGTT TGATGATAAT TGCCAAGCTT AATGAATAAG TGATGTCCAA 1200 
CATAACCTTT ATGTGGATTG TTCGGATGTG TCGGCCAATG TCTCATAATT TCTGTAGCAG 1260 


GGATATGTTG GTTGTTGTAT TGCAACATCA TGGCTGCGGA AACACCTTCA CACCCCATGA 1320 
CCATAGGGAT AGGAAATAGC TGACTGATAG GTTTAACTGG TAATATTTTT CGGTTCATAA 1380 
TATAGTCCTC GCATTGATTC AATAAATATT TAATATAATT ATATAGCGTC AATGCAAAAT 1440 


GTCCTAAACA TATGTTTTAC ATGAGTGAAT AAAATTAATG GAGTGATAAA ATGGAATATC 1500 
AATTACAACA ACTTGCGTCG TTAACGTTAG TAGGTATTAA AGAAACGTAT GAAAATGGAC 1560 
GACAOGCrCA GCAACATATA GCAGGGTTTT GGCAAAGATG TTATCAAGAG GGAGTAATTG 1620
CG6ATTTACA GTTAAAAAAT AATGGTGATT TAGCCX3GGAT ACTTGGCTTA TGTATACCTG 1680 
AATTAGACXXJ TAAGATGTCA TATATGATTG CAGTTACCGG AGATAATAGT GCTGATATTG 1740 
AAAAATATGA TGTCATAACA TTAGCAAGTT CAAAGTATAT GGTATTTGAA GCACAGGGCG 1800 
CAGTACCTAA AGCAGTTCAA CAAAAAATGG AAGAGGTTCA TCACTACATA CATCAATATC 1860 
AAGCAGATAC GGTAAAATCA GCACCATTTT TTGAGTTGTA TCAGGATGGT GATACTACAA 1920 


GTGGAAAATT AATATTACCA GAAATTTGGG ATnCCTGTTA AAGOGGTGAT TGAAATAnGA 1980 
AlxTG 1984 

(2) INFORMATION FOR SEQ ID NO: 227: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6373 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 227:
GATTCCACGT GTGTTAAAAG AAGTTACAcC TTCAATGATG GTATTTACTA ATTTCTTTAG 60
AGATCAAATG GATCGCTTCG GTGAAATTGA TATTATGGTT AATAACATTG CAGAGACAAT 120 
TAGTAATAAA GGCATCAAAT TATTGCTAAA TGCTGATGAT CCATTTGTGA GTCGTTTGAA 180 


AATCGCAAGT GATACGATTG TGTACTATGG TATGAAAGCA CATGCCCATG AATTTGAACA 240 
AAGTACGATG AATGAAAGTA GATATTGTCC AAACTGTGGT C6CTTATTGC AATACGATTA 300 
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IS 






AAAATATGAA 


ATATCAAGTT 


TTGATGTGGC 


ACCGTTTTTA 


TATTTAAATA 


TCAATGATGA 


420 


AAAATATGAT 


ATGAAAATTG 


CAGGTGACTT 


TAACGCTTAT 


AACGCGTTAC 


AGCATATACT 


480 


GTTTTAAGAG 


AGCTAGGGTT 


AAATGAACAA 


ACAATTAAAA 


ATGGCTTTGA 


AACGTATACA 


540


TCAGACAATG 


GTCGTATGCA 


GTACTTTAAA 


AAAGAAC6AA 


AAGAAGCGAT 


GATCAATTTA 


600 


GCTAAAAATC 


CTGCAGGAAT 


GAATGCAAGT 


TTATCAGTTG 


GTGAACAATT 


AGAAGGCGAA 


660 


AAAGTGTATG 


TTATTTCGCT 


AAATGATAAC 


GCTGCAGATG 


GTCGAGATAC 


TTCATGGATT 


720 


TATGATGCAG 


ATTTTGAAAA 


ATTATCTAAG 


CAACAAATTG 


AAGCTATCAT 


CGTGACAGGT 


780 


ACACX3AGCAG 


AAGAACTTCA 


ATTGCGATTG 


AAGTTAGCAG 


AGGTTGAA6T 


ACCAATTATA 


840 


GTTGAGCGT6 


ATATTTATAA 


AGCAACGGCA 


AAGACTATGG 


ATTATAAAGG 


TTTCACAGTT 


900 


GCAATACCAA 


ACTATACATC 


ATTAGCX3CCT 


ATGCTTGAAC 


AATTAAACCG 


TTCGTTTGAA 


960 


GGAGGTCAAT 


CATAATATGC 


ATGAATTGAC 


TATTTATCAT 


TTTATGTCAG 


ATAAATTGAA 


1020 


TTTATACAGT 


GATATAG6AA 


ATATTATTGC 


TTTAAGACAA 


CGTGCTAAAA 


AACX3AAATAT 


1080 


TAAA6TTAAT 


GTCGTAGAAA 


TCAATGAAAC 


AGAAGGTATT 


ACCTTTGATG 


AATGTGATAT 


1140 


TTTCTTTATC 


GGTGGTGGAA 


GTGATAGAGA 


ACAAGCATTA 


GCAACAAAAG 


AATTAAGTAA 


1200 


AATTAAGACA 


CCACTTAAAG 


AAGCGATTGA 


AGATGGTATG 


CCGGGATTAA 


CGATTTGTGG 


1260 


AGGCTATCAA 


TTTTTAGGGA 


AAAAATATAT 


CACGCCTGAT 


GGTACAGAAT 


TAGAAGGGTT 


1320 


AGGTATTTTA 


GATTTTTATA 


CTGAATCAAA 


GACAAACCGA 


TTAACAGGAG 


ATATTGTTAT 


1380 


CGAAAGTGAT 


ACTTTTGGAA 


CTATTGTAGG 


TTTTGAAAAT 


CACGGTGGTA 


GAACATATCA 


1440 


TGATTTCGGT 


ACACTTGGTC 


ATGTTACTTT 


TGGTTATGGT 


AATAATGATG 


AAGATAAAAA 


1500 


AGAAGGCATT 


CATTATAAAA 


ATTTATTAGG 


TACTTATTTA 


CATGGACCAA 


TTTTACCTAA 


1560 


AAATTACGAA 


ATCACTGATT 


ATCTGTTAGA 


AAAAGCTTGT 


GAACGTAAGG 


GTATTCCGTT 


1620 


TGAGCCTAAA 


GAAATAGATA 


ATGAAGCGGA 


AATACAAGCG 


AAACAAGTAT 


TAATAGACAG 


1680 


AGCAAATAGA 


CAGAAGAAAT 


CTCGTTAACT 


CTGAACATCG 


CATCAATGGA 


TTTAATATTG 


1740 


ATAAACGATG 


AAGTTTAGTA 


ATTAATCATA 


TATGTATAAA 


CACACACATT 


ATTTTGGATG 


1800 


GAAACAACCA 


AATTGATGTG 


TGTTTTtTTG 


TTCTAGTGAA 


TAATTATTAT 


ACAATGAGTA 


1860 


TCTATCCTAG 


AATTATCAAT 


AGTAATGGTG 


ATTATGCAAC 


ATGAAAAAAT 


GAATGATGAA 


1920 


AGGAATTTGA 


CGATGAAGCC 


TACTAAAGTG 


ATATTAAAAG 


ATGCATCTTA 


TTTACATAGC 


1980 


AAAACATCGA 


TAACATTTAT 


TTTAAAAGAT 


GTAGTTATCG 


AAGAAGATAA 


TAAAATTTAT 


2040 


TATTTCGACA 


CTAGTGCAcT 


TTCGAAGATC 


AAGAAGTTAA 


ATTTGAATTT 


GCACTCTTTG 


2100 
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TTATAGAACC TGATTTACAT TTTACAATTA TTGATTTTAA TCAAGAACTG CTTTGTATTT 2220 

ATATTGATTT TGATTCTGGT TTAAGGCATT CAAACATGGC AACAGAATCT GGTATTTCAT 2280 

TAAGGATAAA TGTTGCTAAA TCAGATTTTA CTAAATTTAT TAATGAATTA GCCTCTTTAC 2340 

ATTAATGATT TAAATCTGAT ATGTAATTAC AATCAAAAAA GACAGCCACA TCCCTCCGTA 2400 

GTTTAGGCGT GTGGCTATAT TTGAGTCTGA ATATTTATGC TTGTAATTTT AAAAAGGGAC 2460 

ATGCTATATA CGATAAAAAG AGGCGGGGAC ATAAATCAAT GTTCTATGCT CTACXSAAGTT 2520 

ATATTGGCAG TAGTTGACTG AACGAAAATG CGCTTGTAAC AAGCTTTTTT CAATTCTAGT 2580 

CAGGGGCCCC AACAAAGAGA AATTGGATTC CCAATTTCTA CAGACAATGC AAGTTGGGGT 2640 

GTGGGCCCCA ACACAGAGAA TTTCGAAAAG AAATTCTACA GGCAAAGCQA GTTGGGGTGG 2700 

GACGACGAAA TAAATTTTAT GAAAATATCA TTTCTGTCCC ACTCCXIATGG TGCCAATTAG 2760 

CATAAGGTAC TTAAATTAAG CATATCTGCT GTCTAGCAGT CGATAAATCA TTAGAACTTC 2820

6TATAGTATA TGACTTTTAA TTTGATTTTC ACCACTAATT TCAAGTGCTT TTATAGTCGA 2880 

ACGTAAAGTT TCTACAGAAT CATCTTCTCT CTTAAAAGAA CCATCATAAA ATATATCTTT 2940 

GATGCTACTA CTAATTTTTA GCAATGCCAT TTTTTCGTCA CCTGAAAAGT TAACACGAGT 3000 

ATTTTTAGGC AAGTAAATGA TATTTGATAA ATGAGTGATA AACAAACQAT TCGTATATGC 3060 

ACGTTTAGTT AATTGATTGA GTAATTTCCA ATCACATTCT TTTTTCTTAT GATAGCTTAA 3120 

TTCATCACGT TGATAACTTA TTAACGTTTC AACTTGATTA TTTAAATTGA AAATATTTTT 3180 

ATATGCTTTT TCGCTTTTAT CAGATTGCAG TCTTGATAAG ATAAGTTCTT GGCAGCGATT 3240 

GTAAAATAAT TTATACATCA AGGCATCTGT CTTACTTAAT TTTTCTTCGA CCTGACCATA 3300

ATACTTAGGT GGAAACACCA TGAAGTTAAT TAAACCTGAT GTCACGAGTC CAATAATTGC 3360 

TGTcAtGTT CGAGACAAAA AGTTGAATAT GTAGGCATCA TGAATACCTG GAATCATAGC 3420 

TAATGATGTT AGTACAGCGA CATTCGTACC AACTTGCAAT TTGAGTTTTG TACAGAATAA 3480

AATCX5TGAAC GTTGCACTCA ATGCATATGT AAAAGGTGAT TGATCGCCGA ATAAATATGT 3540 

AAATAATACT GCAAAGCCTG CACCAATTAC CGTAGCAGGT AATCTACGAT AACCTTTAAT 3600 

AAGTGATGCC TTGGCAGTTG GTTCAATTGT GACTACAGCT GTTAAAATGG CATAGATGGG 3660 

TGTTAAATCT AGTGCCATAC AAAAGACAGC TGTTAAAAAA ATGGCAATAC CAGTTTTAAT 3720 

TGTTCTGGCA CCAATTAAAT GTTTATACCA TTGATCGTTC ATTTTTTAAC CTCTAATCAT 3780 

(XrTAAAATCT TAGCGAGCGC TTTATAATAA TAGTATCGTA CATTGGAAAA GTTCATGTAT 3840 

GTAAAATATT TGAAATAATC ATACATAAGC ATTACTTTGA TTTTCATATA CATTAATCAA 3900 
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CAAGCATTTT TCAATTATAG TCOGGGGCCC 
ACAGGCAATG CAGGTTGGCG GGGCCCCAAC 
^ AATGTGCAGG TTGGCGGGGC CCCAACATAG 
TGCAAGTTGG GGTACAACGA TAAAGAAATA 
CTTTCCAAAA TACTAAAGTA ACATCTTTAG 


ATAATTATAT AAAAAAGGAA CGGGATAAAA 
CGTTAAAAGA AATTGGATAC ATATGCGCTA 
AACCAGGTAT CACTACGAAA GAGCTTGATA 
GTGCTATTTC TGCGCCAATT CATGATGAAA 
ATGAAGAGGT GGCACATGGG ATTCCAAGTA 
20 ATATTGATGT ATCGGCTTTG AAGAATGGCT 
TTGGAGAATC AGATGATCCA ATGAAACAAA 
AGAATGCAAT TGCAAAAGTA AAACCGGGTA 
ATAATACAGC TAGACAAAAT GATTTGAAAG 
GTTTATCATT ACATGAAGCA CCAGCACATG 
CATTATTAAC TGAAGGTATG GTATTAGCTA 


TTGTTACAGA AGGTAAAAAT GAATGGGCTT 
AAATTGAGCA TACGGTTATC GTGACTAAGG 
AAGAATAGTT CAACATATAC TAAGACTAAA 
TCATATTGGT TTCGGAACTG TTTTATAATA 
AAATATGTTG TAACAAAGTA GTTTTTAAGC 

40 TAGCGCATTT GGTATTTTAA AACTTATTAT 
GTTATATTGA ACAGTTTTTG AGAGCAACAG 
TGCTAGATGA AAAAATGAAA AATTTAGATG 

^ AACAACTTAG CAAGTTAATT GACAGTCTAA 
TTGCTGAAGC ATTTCAAATT CAATGTGCAA 
TTAAATCAGA GTTGAATAAA GTTGAAGCAT 


AAACTTCAAC TGAAAAAATA GCAACAGAAA 
CTGTGGCATA GAAAGGCX3GC GAAACATGAC 




CAACATAGAG AATTTCAAAA 


AAGAAATTCT 


4020 


ACAGAAGCTG ACGAAAAGTC AGCTTACgAT 


4080 


AGAAATTGGA TCTACAATTT 


CTACAGGCAA 


4140 


TTTTTTCTTT ATCACACTAT 


GTCTCACTCA 


4200 


TATATCAAAG AATTTTTGCT 


ATAATAAGTT 


4260 


TGATTGTAAA AACAGAAGAA 


GAATTACAAG 


4320 


AAGTGCGCAA TACAATGCAA GCTGCAACCA 


4380 


ATATTGCGAA AGAGTTATTT 


GAAGAATACG 


4440 


ATTTTCCTGG TCAAACGTGT ATTAGTGTCA 


4500 


AGCGTGTCAT TCGTGAAGGA 


GATTTAGTAA 


4560 


ATTATGCAGA TACAGGGATT 


TCATTTGTCG 


4620 


AAGTATGTGA CGTAGCAACG ATGGCATTTG 


4680 


CTAAGTTAAG TAACATTGGT 


AAAGCX3GTGC 


4740 


TCATTAAAAA CTTAACAGGT 


CATGGTGTTG 


4800 


TACTTAATTA CTTTGATCCA AAAGACAAAA 


4860 


TTGAACCGTT TATCTCATCA 


AATGCATCAT 


4920 


TTGAAACGAG CGATAAAAGT 


TTTGTTGCTC 


4980 


ATGGTCCGAT TTTAACGACA 


AAGATTGAAG 


5040 


GTATGAACAT CATTTAGTTC 


CGGAGCCTAT 


5100 


ATTAAGAACA CAATCAATGC 


GTCATTTCAA 


5160 


AAACATATCA TCGACATCAA 


CGAAGATACA 


5220 


AAAAGGTGAT AGTTATGAAC 


TATGTTGAAC 


5280 


TAAGAAATAA TATCAAGGAC 


TACCTTTTAA 


5340 


ATTATATGCG TTATTTAATT 


ACTAAAAAAG 


5400 


TGCTAACATT AGAAAATAAA TATATTGATA 


5460 


GAGAAATCAA TAATCAAGAA 


ATTGAAAATA 


5520 


ATTATGCAGA AATTGAAACT 


CAAATTCAAC 


5580 


AAACATC6TA TCTAATAAAT 


TATATGAACG 


5640 


ACACAAATAT ATATCAACGC 


AAATGTTGAT 


5700 
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ACTATGCATG GTCTTTTTAA TCAACTTAAA CTCGGCATTA TTTCAATCGA AAACGCAGAG 2640 

CATACGCTTT TTACACCTTA TATGTTGGAA ACGCTCTCTT CCCTAGGCGT GAAAGACAGC 2700 


ATTGTCGATT TAATTCATAA AGGGACTGAA TTAGAAGACT TTGCGGCATT TAATTTATCA 2760 

ATTGAAGACA CAGTTACAGT CTGTTTACAA AGAACTGAAG AACTATTAAA ACAATACAAA 2820 

AATGTGGAAT TCAATGACAA AATATTAATC AATTGGCGTA TTATACAAGA GAAATAGACA 2880 


TATAAAAGTC GAATGTAACt ACGTGAGTAT TGATTTTATT CTTTGTAAAT TACAAGCATT 2940 

TCATATTATA AA6TTTGAAA AGAGGTATAT TGAAATGGAG AAAAATGAAT ATATAGCTAA 3000 

ATATAATGAA TATAGTCAAT TATTAGACGC TACATACTCG CAAGCTGTAG CATmCCTTTT 3060

AAGtAAATaT GGCGCTGTAA CCGATGATTA TTATAAAGaA AAATCATACA CGCGATTTTT 3120 

AAAtGGAGnA ATCAAAAGTA TTTCAAAAGG AAAATACACT AGAGCTAGCG AaGGATTATA 3180 

TTGCCATCAT ATAAGCGAGG ACAAATTCCA AAATCTATCT GATCTAAGAT TCATtTCCAA 3240

ATTTAAGTAC TCATACGACG TTCAAAAGAA AGAAAACTTA GTGTACTGTG ATCTAATCGA 3300 

GCATTTAATT TTACATGCAA TTATTACAAA AGAATCCCAT GGCCAATTTG GTGTAGCTGG 3360 


ATTATGTCAA ATGATCAAAC CAACAGTCAT TGATTGGTAC ATTGGCGAAT ATAATCCAAA 3420 

ACCAGCATGG ATGCAAGCCA CCAAAGCACG TGCCTATTTG CCTGGAATAT TAGTAGAGAA 3480 

ATTACTCATT AAAATTGACG ATATGTTAAA AGGAATAGAA ATATAAGATT TCCTTGAGTC 3540


TAGATAAATG ATTAATGTAG ATTTATTTTT TGCTGTTGAG ATTTTGTTAT AGATGTTTAA 3600 

ACCTGTAATT AAATATATTT TATAAAATAG ACCACGCATA CCTATCTATA AACGGrCAAT 3660 

GTTTATAAAT GAGTTTGCAT GGtCTTGAAT TGTATTAAAT TTCTTTTGGT TTTAATAAAT 3720

CGACTAGATT TTCACAATAT TTATCAAATA TGTATTCCTA AATTATACAG CCTTAATCCA 3780 

GCAC^ACTT TCGAAACTTC CAACTTAGTT GATATAAGGT TCAATAGTTT GTTTCGTTCT 3840 

TTTTCAGATA AACCAGAACT TAAATTGATA TTATTGACTT CATAAAAATT ATAGACTAAT 3900

GCCTCTATTT GCTTTTTAGG CATAAGTAAG TCGACTGAAA ACTGATTTAC GTCGCTTTCA 3960 

TAAATCATTT CATGTAAATT CTTTAGACTA TTATCX5TTAC TATCTCTCAT TAAGTCTGTA 4020 


TTTTTAAATA AATAACGGCC CAATTCACGA GCTATTGAAA ATCTTGTATT ATTAATCGAG 4080 

TGATTATTAT TAATATAGAT TGTTCTTCCA CTTAAATAAC CCGAAGTATT ACCCTCCATT 4140 

TTAATATATC TAACATTTAA ATTAAGTTGA AATAATAGCT TGTCTATGTC AATAGCAAAG 4200 


TGTTCAGAAG TAATAAAAAG TTGATCCATT TTGTCCTTTA TAAATGCCTG AAATAATCGA 4260 

ACTATTTTTG ATTCTAAAAT ATCTTCATAA TGAACTTTCT CAATAACTTT CAATTGATTC 4320 
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AACATTATTA AAATAAAACC CCTCTACTAC TATATCTAAC GAAOGGACAT GATTTCAAAA 4440 
TAAAATACCT TTTTTATAAA TnTATTATAA TATCCCCCAC TATACnAC 4488 
(2) INFORMATION FOR SEQ ID NO: 229:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 846 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 229:





TATGGCGCCA 


TATTAGTTGT 


AACTGGTTTA 


AGAGGTCcAa 


GAAaTATCaA 


ATAAAGTTGT 


60 




tCCTGGGCTT 


GGTACTGTTA 


TCTCaATATT 


GmwTGCaTTT 


GGTGGTCTAG 


CTTTTAATAT 


120 




TGGTAATATT 


GCTGGTGCCG 


GTTTAGGTTT 


AAATGCAATT 


TTTGGATTAG 


ATGTAAAATG 


180




GGGCGCAGCT 


ATTACTGCAA 


TCTTTGCAAT 


ATTAATCTTT 


GTAAGTAAAA 


GTGGCCAAAA 


240 




AATTATGGAC 


GTTGTTTCAA 


TGATTCTTGG 


TATTGTGATG ATTTTAGTTG 


TGGCATATGT 


300 




GATGTTTGTT 


TCTAATCCAC 


CTTATGGTGA 


TGCTTTTGTG 


CATACATTTG 


CGCCAGAACA 


360 




TCCAATGAAA 


TTAGTCTTGC 


CCATCATTAC 


GTTAGTTGGT 


GGAAcTGTar 


GTGGTTATAT 


420 




TACCTTTGCA 


GGTGCACATC 


GTATATTAGA 


CTCTGGCATT AAAGGTAAGC AATATTTACC 


480 


ATTTGTAAAT 


CAATCAGCAA 


TTGCTGGTAT 


TTTAACTACA 


GGTATTATGA 


GAACGTTACT 


540 




ATTCCTAGCG 


GTATTAGGAG 


TTGTTGTAAC 


AGGTGTGACA 


CTAAGTTCTG 


AAAATCCACC 


600 




AGCGTCAGTT 


TTTGAACACG 


CAATTGGACC 


AATTGGAAAG 


AATATTTTTG 


GTATTGTGTT 


660 




ATTTGCTGCA 


GCTATGTCAT 


CAGTAATTGG 


CTCAGCATAC 


ACAAGCGCAA 


CATTTTTAAA 


720 




AACftCTTCAT 


AAATCACTTA 


ACGAAAGAAG 


TAATTTAATT 


GTGATTGTGT 


TTATCGTTAT 


780 




TTCAACAATG 
GATAAA 


ATTTTCTTAT 


TTATTGGAAA 


ACCAATCAGC 


CTTTTAATTA 


TAGCAGGCGC 


840 
846 



(2) INFORMATION FOR SEQ ID NO: 230:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2072 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 230: 

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TCTTTTTAAA AGGTACTAAT ATTTCTTTAG TGAAAATTGA ATCACGGTCG TTTATTGGTG 120 

CCTTGAGTAT ATTATTATAG ACGGAATCTG ATCTAATAAT ATTGATTTTA TACATGATAA 180 

ACCTCCTTAT GTTGTCAGC7V TAAAGGATAA CGTAACGTGA TTTTCAAGCA GTAATTGTAA 240 

CTAATTGAmA AAAATTAAGA AAAGTATGTG AGTGTTCCTA AwTAATATGa TTAAAATGAT 300 

GGCGAATAAG TGTCTaAAAG CATCTTAAAG GGACATTGTA TAGGGTAAAT CACTTCATAA 360 

ATAAGGGaAA ATCCTTATGT TCACTTTTTC ACAATCATnA TAAAATATAT ATGTAGTCAA 420 

TACTTTGTCT ATATTGAATG TTTTCATATA AATGAAAGCA TTTTTAAATA ACATTGACCT 480 

CTAATATATA GGCAGAGTAT TGATATCTAT TAAAAAATAA ATGATTTTGA TGAAGGTGAA 540 

ACGTATGTAC AAAACAAAAG GTGGCTTTCA ACTTACATTA CAAACATTAA GTTTAGTGGT 600 

TGGGTTTATG GCTTGGAGTA TAATTGCGCC ATTAATGCCC TTTATTAAAC AAGATGTCAA 660 

TGTTACTGAA GGTCAAATAT CAATCATTTT AGCGATACCA GTTATTTTGG QATOGGTGCT 720

CCGTGTGCCA TTTGGTTATT TAACAAACAT TGTTGGCX5CT AAATGGGTAT TCTTTACTAG 780 

TTTTATCGTA TTGTTATTCC CGATATTTTT CTTAAGCCAA GCACAAACAC CGGGTATGTT 840 

AATGGCTTCA GGATTTTTCC TTGGTGTAGG TGGTGCAATT TTCTCAGTTG GTGTTACATC 900 

AGTTCCTAAA TATTTCCCTA AAGAAAAAGT AGGTCTAGCA AATGGTATTT ATGGTATGGG 960 

AAATATCGGT ACAGCAGTTT CTTCATTTTT AGCACCACCG ATAGCGGGTA TTATTGGTTG 1020 

GCAAACAACA GTTAGAAGTT ACTTAATTAT TATCGCTTTA TTTGCATTAA TTATGTTCAT 1080 

TTTTGQTGAC ACACAAGAAC GTAAAATTAA AGTACCATTA ATGGCtCAAA TGAAAmCATT 1140 

ATCTAAAAAC TACAAATTAT ATTACTTAAG TTATTGGTAT TTTATTACTT TTGGTGCTTT 1200 

TGTAGCATTT GGTATTTTCT TACCTAACTA CTTAGTAAAT CATTTTGGAA TTGACAAAGT 1260 

AGAxiCTGGT ATTCGATCAG GTGTATTCAT TGCGCTGGCA ACATTCTTAA GACCAATAGG 1320 

TGGCATTTTA GGTGATAAAT TTAATGCAGT TAAAGTATTG ATGATTGATT TTGTTGTTAT 1380

GATTATCX3GT GCCATTATTT TAGGTATTTC AGACCATATC GCATTATTCA CTGTAGGCTG 1440 

TTTAACAATA AGTATTTGTG CAGGTATTGG TAACGGCTTA ATCTTCAAAT TAGTACCATC 1500 

ATACTTCTTA AATGAAGCGG GATCCGCAAA TGGTATCGTA TCAATGATGG 6TGGTTTAGG 1560 

AGGATTCTTC CCACCACTAG TAATCACGTA CGTAGCTAAT TTAACAGGAT CAAGTCATTT 1620 

AGCATTTATT TTCTTAGCGG TATTnGGAnG TATTGCATTA TTTACCATGC 6TCATTTATA 1680 

CCAAAAAGAA TATGGCTCAT TGAAaAACGG TTGATATGTA ATACATGCCA TTcATTTAGT 1740 

TAAATACAAA GCCTTaATAT GATGCGCAAT ATTCGTAGCA TGACATTAAG GCTTTAGTAG 1800 




EP 0 786 519 A2 



CTTGGTTTGA TTTTAGGCAA GGTAATGGTT AATAACCCAT TTTCAAAACT AGCAGTAATA 1920 

TGTTGCTTAT CAACAGCTTC AAAATCAAAT TGACGCATTA ATGATTCGAA GTTACGCTCA 1980 

TCTAAAATGA GTTGTTCAGA TTTGTATTTT GCGCTTCTAG TAGCTTGAAT AGTGAGCGwA 2040 

TTAttATTGA AATCgATACT AATAtCTccC TO 2072 



(2) INFORMATION FOR SEQ ID NO: 231: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3159 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231: 






CGTCTTCTCT 


TGGTTATTCT 


CTGTGTTTTQ 


TCTTTGTTCA 


AATTCGATTT 


TTTGTTGTTC 


60 


GAATGAATGT 


AATACTTTAT 


rrrrriTix^G 


CACATAATCC 


AAGTnATTTT 


TAGGAATTAA 


120 


TATACGGTCT 


TTAAATGCTT 


CTTGTATTTT 


GCTCACAATC 


AATTGGTATA GTTGCTCTTC 


180 


TTTTGATAAA 


CGCACTTCTA 


GTTTTGTTGG 


ATGAACATTT 


ACGTCTACTA 


AGATTGGATC 


240 


CATTTCAATA 


TTAATATAAC 


AAATCGGGAA 


CCTACCTATT 


GTTAAGAGTG 


TATGATAGCC 


300 


TTCTAAAATC 


GCTTTATTTA 


GCATAAAGTT 


TTTAATGTAT 


CGTCCATTAA 


TAAAAATAGA 


360 


AATATAGTGC 


TTATTACTTC 


TAGAATGTTC 


AGGCTTTGCA 


ACAAAACCTT 


CAATGTGATA 


420 


ATCACTTGTA 


TCTCCAGATA 


TATGTACTAA 


ATCTCGTGCA 


ACTTTCATCC 


CATAAATCTC 


480 


TGCCATCACT 


TCATTAGTTC 


GTCCTGAACC 


ATTTGTACTT 


AACATTGTTT 


TGCCATCTGA 


540 


AATGAGTGCT 


ATTCGAATGT 


CCGGATGGCT 


CATTGCCATT 


CTGTTGACAA 


TATCTGTTAT 


600 


TTTACCTAGT 


TCAGTGTATA 


AACTTTTAAT 


ATATTTTAAA 


CGTGCTGGTG 


TATTATAAAA 


660 


TAATGATTCT 


ACAAGTATAT 


CTGTTCCTTT 


TTTCGCTTTT 


GCAGGCTTAT 


GATTTAATAT 


720 


TTCACCATTT 


TCTACATATA 


TTTCATTTCC 


ATTAGCATTA 


TCCGTGCAAG 


TCTTCAATGT 


780 


TACTTTAGCA 


ACTGATGAAA 


TACTGGCTAG 


TGCTTCACCA 


CGGAATCCTA 


ATGTCCTAAT 


840 


ATGAAATAAA 


TCTTCATCTT 


GATCTAATTT 


ACTAGTCGCA 


TGTCTATGAA ATACTAATCC 


900 


TAAGTCTTCC 


GCTTCAATTC 


CGCTTCCATT 


ATCGACTACG 


CGAATAGATT 


GGACGCCAGA 


960 


tTCCTCTACT 


TCAATGCTTA 


TTTCTGTAGC 


GCCTGCATCT 


ATAGCATTTT 


CCAACAGTTC 


1020 


TTTCACAACA 


GAACTCGGTC 


TTTCAACTAC 


TTCACCTGCT 


GCGATTTTAT 


TTGCTAATGA 


1080 


GGTTTGGAGT 


TCTTTAATTT 


TCCCCATTTT 


GCAACACCTC 


TATTTTAATT 


GATTTTGTAA 


1140 
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TTGTAGTTCA ATCTCGCTTT TTTGATCATT 
GTCTTTTTGA GATAAAGTAT CaGTTGTTTC 
^ AATTTCCAGG TTCATTTTCG ACCATTrTTA 

ACGCTTCAAA CTCGCTTAGA ATCACTTGTG 
CTAATTTCGC AACTTGAATA CCATAAtATC 

70 

GAATATAAGT TCACCTTTAT ATTCATTAGC 
TGCTTGATCT AATGTTGTCA ATTCATGATA 
TGTTTCAGCT ACATACTCTA TCATTGCCTG 
ACCACGTCCA ATTTCATCGA AAATAATCAA 
TTGTGCTTCT AGCATTTCTA CCATAAACGT 
20 ACCTATTCTA GTGAATATTT GATCAAATAT 
AGCTCCCATT TGGGCCATTA TACTAATTAT 
AGACATATTC GGACCTGTAA TTAAATATAT 
AGGCACATAG TCATTATAAT CCATTACTCT 
TTCTAATGTT TTATTTTCAC TAAATGAAGG 
TGCAAAGCTC TGTAAACAAT CTAGCTCTGA 

30 

AGTATATTTT TTAACTTCTT CACGTAGCTG 
TTTGTCTTCC GCACCTAAAA TGATATCTTC 

^ TTCAGCATTC GATAACGTTT GCTTCCTCAT 
CAAGTTGGCA CGTGTTATTT CTATAAAATA 
TGATTTTATT CCTGTACGTT GTCTTTCTTT 

40 GTTTTTTGAA GCTTCAAGA7 ATTCATCTAA 
GCCATCTTTA ACTGAAATTG GTGGTTCTTC 

ATCATCaAGG GGTTCTAGTT GATTAACTTG 

4S 

TAATGCTTTA ATATTCGGTA TTTCAGAAAT 

ATTAACATTT CCGTAACTAA CACGCCCAAC 

ATATGTTCTT AAGGTGTCTC TTTCTATGAA 

SO 

TCGTGCTTCA ATTTGTTCTT TACTTATTAG 
TGCTCCCATT GGTGTTTTCG TTTCGTCCAT 

55 



TTCAAACAAA TcAAATGATG CyTGTTCAAA 1260 

TTCaACACTT aAGTTTAAAT TTTCTTGATT 1320 

AATTTGATAT CGATGATtTT TCACCAGCAG 1380 

CTCTGCTAAT AACTTTTTCA GGTAAATCAG 1440 

GTCAACTGCA CCATCTTTGA CTTTATGCAA 1500 

AGCGACGTGA ACATTTTTTA GACTTGGTAA 1560 

ATGTGTTGAA AATAACGTTT TAGCATGTGA 1620 

CGCTAAAGCT AAACCX3TCAT ACGTT6AAGT 1680 

ACTATCCTCT 6TTGCATAAG TTAATGCCTT 1740 

ACTCTTACCT GAAACCAAAT CATCTGCCXK: 1800 

AGGTAACACT GCCTCTTTAC AAGGGACATA 1860 

GGCAACTTGT CTCATATATG TCGATTTACC 1920 

AAATGTTTCA TTATCTAATC GACAATTATT 1980 

TTCCACTACT GGGTGCCTAG ATTCCACTAA 2040 

CCTAGTGTAA TTATATTTTT GAGCAATTTC 2100 

AATAATTTTA GCTTGTTGTT GTAAACGTTC 2160 

AACAAATAAT TGATATTCTA ATTCGATGGC 2220 

TTTTTCTTTA AGTTCATCAG TTATAAAACG 2280 

ATAACCAAAT TCACTTGGTT CAAAATTTTG 2340 

ACCAAACACT TTATTAAAGC TTATTTTCAA 2400

GGCTTGTAAT TCTGCTAACC ATGTTTTTCC 2460 

TTGCGTATTA AAACCAACTT TGAATAGTCC 2520 

TACTAAACTC TGTTCTAATA TATCAAGTAA 2580 

TACAAGAGTA TTCTGATTCA TAGAATTTAG 2640 

GGAATGTTTA AGTTGAATTA AATCTCTCGC 2700 

AAGACGTTCA ATATCATACA CTTGATTAAG 2760 

ATGAGCACTA AATTCATCAA CGATATCTAA 2820 

TGGTCTATCT ATCCATTGTT TTAAGCGGCG 2880 

TAGCCAAAOT AGCGTTCCTT TTTTTGATTT 2940 
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ATCTATAGCT GCATATTGAA CAACATCCTC GATATGCGAT AAATCACGTT TTTGTGTATG 3060 

ATGAATATAA TCTAGCAATA ATTGTGTCGC TTGATACaTT AATTTATGTT CAGTTTGATT 3120 

CACACTATAG ATTtCTGATG ATAACX3TTTC CCTGACTGT 3159 



(2) INFORMATION FOR SEQ ID NO: 232: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1238 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232:






TAAGCGAGAA 


GCAATTGGTG 


TTATQTATGC 


TAGTGATAAA 


CCAACAGGTG AAAGTACAAG 


60 


GTCATTTGCT 


GTTTATTTCT 


CTCCTGAAAT 


TAAGAAATTT 


ATTGCAGATA 


ATTTAGATAA 


120 


ATAAATCATC 


CATCCATACA 


TTGATAAATG 


ATTTTyAGAA 


ATTAACAACA 


AAATCAACAA 


180 


TTTTAAACAT 


CTCTGTGATT 


CTATTTATTC 


GAAATGATTT 


AAAAAATAAA 


ACTTCAAAAA 


240 


CCTAACCTTA 


TATTTATACG 


AATACTTAGA 


GGAGCACAAA 


AATGAATAAA 


AATATAATCA 


300 


TCAAAAGTAT 


TGCAGCATTG 


ACGATTTTAA 


CATCAGTGAC 


TGGCGTCGGC 


ACAACA6TGG 


360 


TTGAGGGTAT 


TCAACAAACG 


GCTAAAGCTG 


AACATAATGT 


GAAACTAATC 


AAAAATACTA 


420 


ATGTAGCACC 


ATACAATGGT 


GTCGTTTCGA 


TAGGATCTGG 


AACAGGTTTC ATTGTCGGTA 


480 


AAAATACAAT 


TGTTACCAAC 


AAGCATGTCG 


TTGCAGGTAT 


GGAAATTGGT 


GCACATATTA 


540 


TAGCGCATCC 


CAATGGTGAA 


TATAATAATG 


GCGGATTTTA 


TAAAGTTAAA 


AAAATTGTCC 


600 


GTTATTCAGG 


TCAAGAAGAT 


ATTGCCATTC 


TACATGTGGA 


AGATAAAGCT 


GTTCATCCAA 


660 


AAAACAGGAA 


TTTTAAAGAT 


TACACAGGCA 


TTTTAAAAAT 


AGCATCAGAA 


GCTAAAGAAA 


720 


ATGAACGCAT 


TTCAATTGTT 


GGCTATCCAG 


AACCATATAT 


AAATAAATTT 


CAAATGTATG 


780 


AGTCAACAGG 


AAAAGTGCTG 


TCAGTTAAAG 


GCAACATGAT 


TATTACTGAT 


GCTTTCGTAG 


840 


AACCAGGCAA 


CTCAGGTTCA 


GCTGTATTTA 


ACAGTAAATA 


CGAaGTtGTA 


GGT6TTCACT 


900 


TTGGTGGAAA 


CGGCCCTGGA 


AATAAAAGTA 


CAAAAGGATA 


TGGTGTTTAT 


TTCTCTCCTG 


960 


AAATTAAGAA 


ATTCATTGCA 


GATAACACAG 


ATAAATAAAT 


CCTTACATAG 


ATAAATGATT 


1020 


TTAAAAATTA 


ACAACAAACT 


CAACaATTCA 


AATCATCTCT 


GTGATTCCAT 


TTATTCGAAA 


1080


TGATTAAAAA 


AAATAAAACT 


TCAAAAAGCT 


AACATTATAA 


TTATACAAAT 


ACTTAGAGGA 


1140 


GCAGAAAAAT 


GaATAAAAAT 


ATAATCATCA 


AAAGTATTGC 


AGCATTGGAC 


GATTTTTAAC 


1200 
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(2) INFORMATION FOR SEQ ID NO: 233: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6444 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 233: 
TGATAAGTCA TTTAAATTGT CACCTATTGA CATGACTTCT TTCATTTCAA TCCCTAATCT 
TTCGGCAATT GTTTCTAOCG CAATACCTTT TTGTGCATCT GAATGCGTTA TTTCTATATT 
TCCTCTCGAA GATGATGATA TAGCTAAATT CGGAGAkTCA GCTAAAATTT TACTAGCTTT 
GTCAATTTTT TCTAAATTTC CATCAAATGC TAATATTTTC ATAATTAATT CACCAGGTAT 
GTTTTCAATA GCATCATAAT TATCAACAAC TyTCAACGTA CCATTATCTA T6CGTCTTTG 
AATACCATTT TTAATACGCT CAACGTTTGC ATGTTGACCT GCACGCTCAG CAATATCTAT 
GTAAATGTCT AAATCTCTTT GTGOATCTTC AOTATAAATC GCACGACTCG TGTATACTTG 
ATAATAAATA CCTGCATCTT TTAAAACATT TGTAATTTTG TGTACTAACG ATTTATTAAG 
GTGTGAAGTG CTCATTACAT TGAAAGTTTC ATCACGTACT TCAGCACCAT TCAAACAAAT 
ATATGGTACT GTTAAATCTG TGTCAGCAAC TGGTGCTTGk GCTTCATAAA ATGCTCGACC 
TGTCGCGATA ACAACCGTTA TCCCTTGTTC TTGAGCGTAT TTAATCGCAT CAATATTAGG 
TTGAGAAATT TCATGTGCTG CATTAAGTAG CGTGCCATCC ATATCAGTGG CTATTAGTTT 
TATCATTATG TnACCTCGTT TCGTAAATnT AAAATCTTGT TCTTAAATAA GrATATATAC 
TCAGCGCACA TACTTTtCTA TTAmCATTTA TATkGTCATT aATTTATCAT ATAATGTAAT 
TCTc^CAAAT nTTAAtTAGT ATGTACTATC GTCTAATTGG TGGATTTCTT ATTGGCTCTT 
AAgTTTTTAA AAAATGTTGT TAATAATGTG CTACATGCTT CTTTAAGTAC ACCTTTATCA 
ACAATTGCAC GATGATTAAA ATTAGATTGT TGCAATAAAT TCATTAAACT GCcACTACAA 
CCACCTTTAG GATCATCTGC GCCATAGACG ACTCTTGGAA TGCGACTCAT TACAATTGTT 
CCTGCGCACA TGACACATGG TTCTAAGGTT ACATATAATG TGCAACCTTC TAAACGCCAA 
CTACCTAACA CTTTGGCTGC ACGTTCAATT GCAATATGTT CAGCATGCGC CGTTGGTTGT 
TGTAGTGTTT CTCTTaAATT ATGTGCTCTA GCGATAACTT CATCATCTTT AGTGATGATA 
GCACCTATAG GTACTTCGCC TAGTTGAGCT GCTTTTTTAG CTTCTTCAAT CGCTAATGTC 
ATAAAATATA TATCATTTGT CATTTATGTC CAGATACCTC ACTTATGGTA CAATACTCAA 
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CTATTGGCGT AGGTAAATCT TCACTTGCAC 
AAGAAAAAGA AATCATCACA GAAAATCCAT 
^ AATGGAGTTT TCAAACTGAA ATGTTCTTTT 

TAACACAACT AAATCAAGGT GTAGTTAGTG 

CTAAAAATAC TTTGAGTTCT GTTGAATTTC 

10 

CTGAAGATAT GATTATGCCG AATATGATTA 
AATCTAGAAT TGCTAAACGT AACCGTAGTT 
TTAAAAAAAG ATTATCGTGA GTATTATGAG 
TTAATCGATA CnACTTCTAT TGATTTTCTT 
CATATTATAT TACCTATGAT AGGAGATATT 
20 TGCCATTATA ACCATTGCAG GTACAGTTGG 
TGCAGATAAA TTAAACTTTA AAACGTCTTT 
TAAATTTTAC AGCGATTTTG AACGATGGAG 
ACGTTTTAAA GAACAAAAGC GTATGTTTGA 
AATTTATGAA GATGTTGATA TTTTTGCAAA 
AGAAGATTTC AAAACATATT CAGACTTATT 

30 

TAAACCTGAT GTAATGATTT ATTTAgAATG 
TGAACGTGGT CXSCGAAATGG AAATTAATAC 
ACGCTATGAC GATTGGATTA ATAGCTTTAA 

35 

TGAATATGAT ATCCATAAGG ACCCCGAATC 
AATTATTCAA ACATATCGAC AAGTAGATAC 

40 GTTTATATTC AATGTATATT CCATAGATAT 
CATTAATTCA CATATACACT ATGTCTTTCT 
GCACTATTTA AAATAATATC AGCTTCATGT 

^ ACACCTATTG CCATACCTAA ATTTGCATTA 
CCTACTATAG CTACTTTCTG AGGATCTACA 
TTAGGATTTG GCTTCTCATA GGCATCCGCT 

50 

GTAGCATTGG TATGTGCTAA AAATTGTTCT 
CCAAGTTGAT AGCCTTTTGC TTTCAAATCG 

55 



ACAAATTAAG TCAAACTTTA GATTTTTATG 1500 

TTTTATCAGA CTTTTATGAA GATATCTCTA 1560 

TATGCAATAG ATATAAGCAA TTTCAAGATG 1620 

ATTATCATAT ACATAAAAAT AAGATATTTG 1680 

AGAAATTCAG TAAAATTTAT GATATTTTAA 1740 

TCTTTTTAGA TGCAGACCTT GATGTGTTAA 1800 

TTGAGCATCA AATAGAaGtG AtAcTGTaAg 1860 

TCCTTACAAA GTAATGGTTC AAATGTAGTT 1920 

AAAAATGAAC AAGATTACX3A AGATATATTA 1980 

ACCAATGAAT AATTACGGTA TTCCACAAAA 2040 

TGTTGGAAAA TCAACACTAA SGCAAGCACT 2100 

TGAAAATX5TC GAACATAATC CATATTTAGA 2160 

TTTCCATTTG CAAATTTACT TCTTAGCTGA 2220 

ATATGGTGGT GGCTTTGTCC AAGATCGATC 2280 

AATGCATGAA GAAGAAGGCA CAATGAGTAA 2340 

TAATGCCATG GTCATGACAC CTTATTTTCC 2400 

TAACTATGAT GAGGTCATTG ATCGTATTAT 2460 

AGACCCTGAA TACTGGAAAA AGCTATTTAA 2520 

TGCATGTCCA GTTGTACGTA TCAATATTAA 2580 

TTTAAATCCT ATGATAAACA AAATTGCTCX5 2640 

ACGATAAAAG ACTAAAGACA TAGCGTATAT 2700 

TATCGATTAT TTTATCAATT CTATCGAATA 2760 

TTTTAATTTA AAGCTTCTAA AATATCTGCC 2820 

AATTCTTCTT TTGTTGCAAT ACCTGTTAAT 2880 

CTTGCTGTCT TCATATCATT AGCAGT6TCT 2940 

TTATATTGCT CAAATAAAGG CGATAATACT 3000 

TCGGTAGAAA TGATCAAATC GAACAACGAG 3060 

ACACCTTTTT TAGTATCACT CX5TAACAATA 3120 

ATAAGTGCTT CTTTAACACC TTCTACCCAA 3180 
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10 






GTATCTTGTC CCGTCACATC ATTAAATGCC TGGATAATTT GTTGTAAAGA TCCTGAACCC 3300 

ATCACTGATT TTGGATCAAT AGATTCTTTA ATGACACCGA GTTGTCTTAA AGCAGCTTCT 3360 

TTATTATGTA CTGGGAAAGT CTCAAGCAAT GATTGTACAA ATCGTACCCC TATTTTTTCC 3420 

CAACTTCTAT CAAATTCAAT TAACX5TACCA TCTTTATCAA ATAATATCCA TTCCATTGaT 3480

ATCAATACTC CTATTTATTT ATTTCGTATT ATGCTGATTC TATGATATTC GTTATCCCCT 3540 

GAAAATGAAC TCGTAGTATT GTTCTATTTA AATATTGaAT TAAATATAAT AATAAGTGAA 3600 

ATCCCCTTCA ATACTTAACA ATAAACATTG TAAACTTAAT TTATTACCAT GCTTCGCTTC 3660 

ATTGAAAGGG ATTTTAGTCA TGATTAACTT TTGCATATTG TTTTCATGAT TATATTCAAT 3720 

TTTTATTAAT ATTTTGGTAC AACGACTCTC CAACCATTTT TATCTTCTAA AGTACCATTT 3780 

TGAATACCAG TATAGACGTC GTATAATTTT TGAGTAATTT CACCAGTCTC ATTATTATTA 3840 

ATAACGATTT CACGATCTTC GTATCTCAAT GTACCCACAG GTGAAATAAC TGCTGCAGTA 3900 

CCACTACCAA ATACTTCTGT TAACTCACCT TTATCATATG ATTCGAATAA TTCATCGATT 3960 

GAAACGCGGC GCTCTTCGAC TTCATATCCT AAGTTTTTAG CTAATTCGAT AATAGATTTA 4020 

CGTGTAATAC CAGGTAAAAT ACTGCCATTC AACTCTGGTG TAATTACTTT GCCATTTTCA 4080 

ACGAAGAAAA TGTTCATGCT ACCAACTTCT TCGATATATT TCTGTTCAAC ACCATCAAGC 4140 

CATAATACTT GGTCATAACC TAATTTATTT GCATTAGTTT GTGCTAATAA ACTTGcCGCA 4200 

TAGTTACCTG CAACTTTTGC AAAGCCTACA CCGCCACGaA CAGCACGCAC ATATTCATCT 4260 

TCTACATAGA TTTTAGTTGG TTTTAAAGTT TCACCACCAT AATATGCACC TGAAGGAGAT 4320 

AAAATAATTA ATAATTTATA CTGATGTGAT GCACCAACGC CAAGTGCCCC TTCTGTTGCA 4380 

AAAACAAATG GACGAATATA TAATGATTGA CCTTCCCCTT CAGGAATCCA ATCTCTTTCA 4440 

ATATGAACTA ATTGTTTTAO CCCCTCTAAC AATTCTGCTT CGTCTACTTG AGGCATTTCT 4500 

AATCGTGCTA ACGAGTTATT AAGACGCTTA AAATTTTCTT CAGGACXK3AA AAGTGCAACT 4560 

TCCCCATCTC ITTTATATGC TTTTAATCCT TCGAATACCG ATTGACCATA ATGAACACCT 4620 

TGTGCAGCAG GTGAAATTTC AATAQGACCA TAAGGTACTA TCTTCAAATC ATGCCATCCT 4680 

TTATCTGCAT CATAATCATA ACTCAACATA TAATCAGTAA AATATTTACC AAAACCTAGT 4740 

TGAGATGTAT TTGGTTTTTG TTTTAATGTT TCTCGTCGTT CAACTTTAAC TGCTTGTGAC 4800 

ATGGTGATTG CCTCCTAATA ATATTGTATA AGAATTTGTT TAACTTAAAT TATAACAATC 4860 

CaTATTTTGC TGTTCAACAA ATTTTCTAAA AATTCAAAAT TAATTAACAG ATTTCTAGAA 4920 

AGACTATATC TTTTAGTATA AACX3TATTAA TTTCACAGAG ACAAGTAATC TGTGTTTTAC 4980 

TAAGTATAAT GAATAATATT AGAATTCATG CACTAGTTTA TTAAAATAAA GAGTAATTTA 5100 

AAATATCATT CCGTGTATTA AAGTGAATGG AAATGATTAG TTATTATTTT TAACAGTATC 5160 

TTTTTGTTCA ATAGCTTCTA ACATTAATTT AGTCATGCTC GCTAAATCAT ATTTAGGATC 5220 

AAATCCCCAT TCGCCACGTG AACAGCTTGT ATCAATAGAA TCCGGCCAAC TATTAGCGAT 5280 

ACCTTGTCTA ATAGGATCAA CATCGTAATC TAATGTAAAA TTGGGATAGT ATTCTTGAAT 5340 

TGCTTCTTTT ACCATCTCTG GATCAAAACT CATTGCGCTC AAATTATAAC CATTTCTAGT 5400 

TTCTAATTTA GCGTCGTCTG CTTCCATAAG TTTAATAATT GCTTCAATTG CATCATCCAT 5460 

ATACATCATA TCCATATACG TGCCTTTATC TATGAAGCTT GTATAATGAC CCTCTCTTAC 5520 

TGCTTTGAAG TATATTTCAA CAGCATAGTC TGTAGTACCG CCACCTGGCT CTTTAACATG 5580 

CGAGATTAAA CCTGGGAATC TAACACTTCT TGTATCTACA CCAAAACGTT TGAAATAGTA 5640 

TTGACACAAT AATTCTCCAG CTACTTTATT TACACCATAC ATTGTCGTAG GTTGCTGAAT 5700 

CGTTACTTGT GGCGTATTAA CTTTAGGAGT TGAGTCTCCA AATGCACCAA TTGAACTTGG 5760 

TGTGAAAAAG TGCAAATTAT AAGTTCTTGC AGCTTCTAAT GCATTCATTA ATCCACCCAT 5820 

ATTTAAATCC CAAGCTAGAA TTGGATTTTT CTCAGCAGTT GCTGATAATA ATGCTGCCAT 5880 

ATGCATTAGA CTATCCGCTT CAAAGTCCCT AACTAACTCA AACATACGGT CACGATCTGT 5940 

TACGTCTAAG ATTTCAAATG GTCCATTTTG TACAGGTGAG TCTGCTTCAG GTTCCCTAAT 6000 

ATCTGTAGCA AGAACATTAT CTGTCCCATA AATTTCTCTG CACTTAACAA CTAATTCTGT 6060 

ACCAATTTGT CCTAATGCAC CAGTAATCAT AATTTTTTTC ATAGAAATAT CTCCTTTGtC 6120 

TCTTCTATAT AGCTATAGTC CATCACAAGC GGaCATAATA TTCATTTTCA TAATAATTAT 6180 

AATATAAAAG CGCTTTCTTG TATATATGAC ATGTACATGT TGCTGATATk TCTGTAAATG 6240 

GAAJ^CTAG TTGTATTAAT TGATTTTAGT AATTTATAGC GTTTATTATT GCTAATTACT 6300 

GtCAAATTAA ATTTTTTATC CCTCAACTCT TAAACTCTGG ATATCTTTCA TTATATTAGC 6360 

TTTTTTATAA CCATGGATAT CATGTAAAGC CTTATAAGCn TTAAATAATG TTTCATACCT 6420 

TTGTACTTnT TCCGCTTCTG GATT 6444 
(2) INFORMATION FOR SEQ ID NO: 234: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4721 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 234: 

GCCCATGAGA CAATTTTACT TGCTTTTCCC ATTGGTTATC ACGTTCTTAT TACATAGATT 60 

TAAACCGAGA AATATTATTC AAACGCTATT TATTGTATCG TTGATTTCTT TAGGACTTAT 120 

GATAGTGATT CATTTCATCA CTGGAGATAA TTCACGTGTG TATTTTGGGA CAGATACACG 180 

ACTGCAAACT TTATTGCTTG GTTGTATATT A6CATTTATT TGGCCTCCGT TTGCTTTGAA 240 

AAAAGATATT TCTAAAAAGA TTGTCGTATC ATTAGATATT ATAGGGATAT CTGGTTTTGC 300 

GGTTCTAATG ACTTTGTTCT TTATAGTTGG AGACCAAGAT CAATGGATCT ATAATGGAGG 360 

ATTTTACATT ATATCATTTG CAACTTTATT CATTATTGCA ATTGCGGTAC ATCCTTCTAG 420 

TTTATTTGCT AAATTTTTAA GTATGAAACC TTTACTAATT ATAGGTAAAC GATCATATAG 480 

CTTATACTTA TGGCATTATC CTATCATTGT TTTTGTGAAC AGTTATTACG TACAAGGACA 540 

AATACCGGTA TACGTTTATA TTATAGAAAT TTTGTTAACA GCGTTAATGG CTGAAATTTC 600 

GTATCGCTTT ATTGAAACAC CTATACGTAA AAAAGGATTT AAAGCTTTTG CATTTTTACC 660 

TAAAAAGAAG GGGCAATTTG CTAGAACAGT GTTAGTTATC CTATTATTGG TTCCGTCTAT 720 

GGTTGTGCTC AGTGGACAGT TTGATGCACT TGGCAAACAA CATGAAGCXXS AGAA6AAAGA 780 

GAAGAAGACG GAATTTAAAA CAACGAAGAA AAAAGTCGTT AAAAAAGATA AGCAAGAGGA 840 

TAAGCAGACA GCX3AATAGCA AAGAGGATAT TAAAAAGTCA TCACCACTAT TAATTGGTGA 900 

CTCGGTCATG GTGGATATTG GTAATGTCTT TACTAAGAAA ATACCAAATG CACAAATTGA 960 

TGGTAAAGTT GGACGGCAAC TCGTTGATGC TACACCAATT GTGAAATCGC AATATAAAGA 1020 

CTATGCTAAA AAAGGTCAAA AAGTTGTAGT AGAGCTTGGT ACAAATGGGG CATTTACGAA 1080 

AGATCAATTA AATGAACTAT TGGATAGTTT TGGAAAAGCA GACATATATT TAGTTTCTAT 1140 

TAGAGTACCT AGAQATTATG AAGGTAGAAT AAATAAATTA ATTTATGAGG CAGCTGAAAA 1200 

GCGCTcTAAT GTACATCTAG TCGATTGGTA TAAAGCTTCT GCAGGTCATC CGGAATACTT 1260 

TGCATATGAC GGTATTCACT TAGAATATGC AGGTAGTAAA GCGCTGACTG ATTTGATTGT 1320 

AAAAACGATG GAAACACATG CTACAAATAA GAAATAATTT GATGCACTAA ACTTTTGAAA 1380 

TATTACATTA CTTCTGATAT TTATTATCAA AAATGATGTA TTTCATTAAA AGTTTAGTGC 1440 

TTTTTTATTT TCAAATCCCA TAGTAACGGT GCAGAAAAAG TGTTGTAAAC ATTCTAATTG 1500 

GTATATTACA TTCAATGAA6 CTTTATTAGG AACAGATTAC ATTATGATAA CAAAGCCCGC 1560 

AAGACACCTA ATCTCTGTTA TAGTTTOTTT TGTCX5CAAAA CTATAAAAGT TATAATTQTT 1620 

TGCATACTAA AAAAATAAAA AATATAAAAT TTAAAATAAT TGAGTCGCTA ATGACTATAT 1680 

GTATA6AGTG TTTTGATTAT TGGGA6GATA TTTAATTATG AAAAAAATCO TTACAGCTAC 1740 
ACAAAATAAC AATGGATATA ATTCTAATGA CGCTCAATCA TACAGCTATA CGTATACAAT 1860 

TGATGCACAA GGTAATTATC ATTACACTTG GACAGGAAAT TGGAATCCAA GTCAATTAAC 1920 

GCAAAACAAC ACATACTACT ACAACAACTA CAATACTTAT AGTTATAACA ATGCATCTTA 1980 

CAATAACTAC TATAATCATT CATATCAATA CAATAACTAT ACAAACAATA GTCAAACAGC 2040 

AACAAATAAC TATTATACTG GTGGTTCAGG TGCAAGTTAT AGCACAACAA GTAATAATGT 2100 

TCATGTGACT ACAACTGCAG CX5CCATCTTC AAATGGTCGT TCAATTTCTA ATGGTTATGC 2160 

ATCAGGAAGT AACTTATATA CTTCAGGACA ATGTACTTAT TATGTATTTC ATCGTGTTGG 2220 

TCGGAAAATT GGTTCAACAT GGGGTAACGC AAGTAATTGG GctAACGCAG CTGCATCATC 2280 

TGGCTATACA GTGAACAATA CACCAAAAGT TGGTGCTATC ATGCAAACAA CACAAGGCTA 2340 

TTACGGTCAT GTTGCTTACG TTGAAGGCGT TAACAGCAAC GGTTCTGTTC GTGTTTCAGA 2400 

AATGAACTAT GGACATGGTG CTGGTGTGGT TACGTCTCGT ACAATTTCAG CAAACCAAGC 2460 

AGGTTCATAT AATTTCATTC ATTAATCAAA TGTAAATCAA ATGACGTCAA TATTCTCTAA 2520 

CATGAGAGTA TTGGCGTTTT TGTTTTATAT AAATATAAAT GAGAGCGGTT TATTCACTGA 2580 

TCTTTAGGGA ACTAAGTAAT AAAGTGATAA TTTATACTAT GTCAGTATGA TTGAGAGTGA 2640 

TTCAATTTAG ATGAAAACCA TGAAAAAATA TATTAAAACA GCATTTTTTT GTAGTATGTA 2700 

TTGGTTAATT GTTCAACTAA ATATAGCAAA TTTAGGTACA AGAATTCCTG ATAAGTATTT 2760 

TCGTCAGAAG TACATAATAT TTAAATCATT TAACTTTGAG AAGCATGGAA AATTTTGGAA 2820 

CAAATGGTTT TACGTAAGAA AATGGAAACA TAAGATTTTA GATGGTCATC AGCTTAATCA 2880 

AAATATATAT GATCAGCGTC ATTTAATGAC AATCAATACT GATGAAATTG AAAAAATGAT 2940 

TATAGAGACA AAGAGGGCAG AGTTGATTCA TTGGATATCX3 ATACTTCCAO TCATCATATT 3000 

CAATAAAGGC CCTCGTTTAG TAAAGTATAT AAATATTTTC TATGCAATGA TAGCTAATGT 3060 

TCCAATCATT ATTGTGCAAC GCTATAATCG ACCGAGATTA ACGCAGTTAC TACGCATATT 3120 

AAAACGAAGA GGTGAACGTC ATGACTAAAC ATATCATCGT TATTGGTGGT GGCTTAGGTG 3180 

GGATTTCTGC AGCAATTCGA ATGGCACAAA GTGGCTATTC GGTCTCATTA TATGAACAAA 3240 

ATAATCATAT AGGAGGCAAA GTGAATCXyTC ATGAATCAGA TGGCTTTGGC TTTGATTTAG 3300 

GTCCATCTAT TTTAACGATG CCTTATATTT TTGAAAAATT ATTCGAATAT AGCAAGAAGC 3360 

AAATGTCAGA CTACGTTACA ATCAAGCGAT TGCCACATCA ATGGCGTAgC TTTTTTCCAG 3420 

ATGGAACGAC TATCX3ATTTG TATGAAGGTA TTAAAGAAAC AGGTCAGCAT AATGCGATAT 3480 

TGTCGAAACA GGATATAGAG GAACTGCAAA ATTATTTGAA TTATACAAGA CGAATCGATC 3540 
TTCATGGGCC ATTAAATGCT CTTATTAATT ATGATTATGT ACATACTATG CAACAGGCCA 3660 

TAGACAAGCG TATCTCGAAT CCATACTTGC GACAAATGTT AGGCTATTTT ATCAAATATG 3720 

TAGGTTCTTC ATCATACGAT GCGCCAgCTG TATTATCTAT GTTATTCCAT ATGCAACAAG 3780 

AGCAAGGCCT TTGGTATGTA GAAGGTGGAA TCCATCATTT AGCCAATGCC TTGGAAAAGc 3840 

tAGCGCGTGA AGAAGGTGTC ACAATTCATA CAGGTGCACG TGTGGACAAT ATTAAAACAT 3900 

ATCAAAGACG TGTGACGGGT GTCAGATTAG ATACAGGTGA GTTTGTAAAG GCAGATTATA 3960 

TTATTTCAAA TATGGAAGTC ATACCTACTT ATAAATATTT AATTCACCTT GATACTCAAC 4020 

GATTAAACAA ATTAGAGAGG GAATTTGAGC CGGCAAGCTC AGGATATGTG ATGCATTTAG 4080 

GTGTTGCTTG CCAATACCCG CAATTAGCAC ATCATAATTT crrrriTACG gaaaatgctt 4140 

ATCTCAATTA TCAACAAGTT TTTCATGAAA AGGTATTGCC AGATGATCCG ACCATTTATC 4200 

TAGTAAATAC GAATAAAACT GATCACACAC AAGCGCCAGT AGGTTATGAA AATATCAAAG 4260 

TCTTACCACA TATTCCATAT ATTCAAGATC AGCCTTTTAC CACTGAAGAT TATGCGAAGT 4320 

TTAGGGATAA AATTTTGGAT AAATTAGAAA AAATGGGACT TACTGATTTA AGAAAACACA 4380 

TTATTTATGA AGATGTTTGG ACACCGGAGg ATATTGAAAA AAATTATCGT TCTAATCGTG 4440 

GTGCAATATA TGGTGTTGTA GCAGATAAAA AGAAAAACAA AGGATTTAAA TTTCCTAAAG 4500 

AAAGTCAGTA TTTTGAAAAC TTGTACTTTG TAGGTGGATC AGTAAATCCT GGTGGTGGCA 4560 

TGCCAATGGT TACATTAAGT GGGCAACAAG TCGCAGcAAg ATAAACGCGC GAGAAGCGAA 4620 

GAATAGGAAG TGATATCTAT GAAATGGTTA TCACGAATAT TAACAGTAAT AGTGACCATG 4680 

TCTATGGCGT GTGGTGCaTT GATATTTaAT CgTAGACATC A 4721 



(2) INFORMATION FOR SEQ ID NO: 235: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3516 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235: 

TATTCGTGCG CAATGGGCTA AATTAGGTCT AGGTTTAGAT TATAGTAGAG AACGTTTTAC 60 

TTTAGATGAA GGTTTAAGTA AAGCAGTTAA AAAAGTTTTT GTTGATTTAT ACAATAAAGG 120 

AATTATTTAT CGTGGCGAAC GTATTATAAA TtGGGATCCn AAAGCACGTA CAGCTTTATC 180 

TGaTATTGAA GTAATACATG AAGATGTTCA AGGTGCGTTT TATCATTTTA AATATCCTTA 240 
AAAGCGAATT AGATAGAGTA GATAAAAAGC TCTCTAATGA AAACTTTGTA AGTAAAGCAC 2160 

CTGAAAAGGT TATAAATGAA GAAAAACGTA AAAAACAAGA TTATCAAGAA AAATATGATG 2220 

GTGTGAAGGC AAGAATTGAA CAATTAAAAG CATAGGAGTT AGTAACAATG AATTACCTAG 2280 

AGAGCTTGTA TTGGATACAC GAAAGAACTA AATTTGGCAT CAAACCAGGT GTTAAACGTA 2340 

TGGAATGGAT GCTAGCACAA TTTAATAATC CTCAAAATAA CATTAAGGGT ATTCATGTAG 2400 

GTGGCACAAA TGGTAAAGGC TCTACAGTTG CTTACCTTAG AACAGCTTTA GTTGAAAATG 2460 

GTTATGAAGT AGGTACATTT ACGTCGCCGT TTATTGAAAC ATTTAATGAA CGAATTAGTC 2520 

TAAATGGTGT GCCAATATCA AATGACGCTA TTGTAGAATT AGTATCACGT ATTAAACCAG 2580 

TAAGTGAAAT QATGQAAOGT GAAACAGATT TAGGTGTTGC AACTGAATTC GAAATAATCA 2640 

CAGCGATGAT GTTTTTATAT TTTGGTGAAA TACATCCTGT TGATTTTGTC ATTGTTGAGG 2700 

CTGGATTGGG TATAAAGAAC GATTCGACAA ATGTCTTTAC ACCGGTTTTA TCAATCTTAA 2760 

CTAGTATCGG TCTAGACCAT ACAGATATTT TAGGTGGTAC TTATCTAGAT ATTGCTAGGG 2820 

ATAAAGGCGC GATTATAAAG CCTAACGTTC CAGTGATATA TGCTGTTAAA AATGAAGATG 2880 

CATTAAAATA TGTTCGTGAA CGCGCAATTG AACAACAT6C AAAGCGAATT GAATTAGATA 2940 

GAGAAATTGT TGTTGTATCG CAAAATGATG AATTTACTTA CCGTTATAAA GATTATGAAT 3000 

TAGAAACAAT CATTTTAAGC ATGTTAGGTG AACATCAGAA ACAAAATGCT GCATTAGCCA 3060 

TAACAGCTCT TATTGAATTA AATGAACAAG GATTAATTGA ATTAGATTTC AATAAGATGA 3120 

TAGACGGTAT TGAATCAGTT CGTTGGACTG GACGTATTGA GCAGGTGCAT GACAAACCTT 3180 

TAATCATTTT GGATGGCGCA CATAATTCAG AGAGTATAGA TGCTCTAATT GATACAATTA 3240 

AACAGTACCA TGATAAAGAA AAAGTAGATA TTTTGTTCTC AGCAATAAAC GGAAAACCGA 3300 

TTAA&AGAT GGTCAAACAT TTAAGTTTAA TTGCGCATAC GTTTTATGCA ACTGAATTTG 3360 

ATTTTCCGAA AGCGTTACGC AAAGAAGAAA TTGTAGGTAG TATTGAAAAT GATGAAATAC 3420 

AATTAGTAGA TGACTACGTT GAATTTATAA AAAATTATCA AGGTGATACA TTAGTAATTA 3480 

COGGTAGTCT GTATTTCATA AGTGAAGTTA AATCAA 3516 



(2) INFORMATION FOR SEQ ID NO: 236: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7481 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 236: 

TGAGTGATAG AATCAAAAAA GCCATCTCAA AAATTAATCA AGCAAACAAC ATTCCAAACA 60 

ATGsTCGCAA ATCACCAATG TATCACTCTC CAATTACOTA ACTATGATTT AATTTAAGCA 120 

TAGTTATTGA GGTTTTGTGA TATATAGTAT AAAATTAATG AGAATTAAAT TTAATAATGT 180 

AAAATTCATm TTCgGGGTCG GGTGTAATTC CCAACCGGCA GTAAATAAAG CCTGCGACCT 240 

GCTAGTATGT ATCATATTAG TGGCTGATCT AGTGAGATTC TAGAGCCGAC AGTATAGTCT 300 

GGATGGGAGA AGATGGAGGT TTTTTGTTGT GCAATAATCC TCCTATTCTT ACGAGATGAA 360 

TGGAAGGAGA AAATTGAATA TGCAACAAAA TAAACXSTCTT ATCACAATAA GTATGTTGAG 420 

CX5CGATTGCG TTTGTGTTAA CTTTTATCAA GTTTCCTATA CCATTTTTGC CACCATACTT 480 

AACTTTAGAT TTTAGTGATG TACCGTCACT ACTAGCTACA TTTACGTTTG GACCAGTTGC 540 

CGGTATCGTA GTTGCACTGG TTAAAAATTT ATTGAACTAC TTATTTAGTA TGGGCGATCC 600 

AGTTGGACCA TTTGCTAACT TTTTAGCAGG CGCAAGTTTC TTATTAACTG CTTACGCCAT 660 

CTATAAAAAT AAACGTTCAA CAAAATCTTT GATTACTGGA TTAATCATTG CAACAATCX5T 720 

TATGACTATC GTGTTGAGTA TTTTGAACTA TTTCGTTCTA TTACCTTTGT AOGGTATQAT 780 

ATTTAACTTA GCTGATATCG CAAATAATCT TAAAGTAATC ATTGTTTCAG GAATTATACC 840 

ATTCAATATT ATTAAAGGTA TCGTTATTTC TATTGTATTT ATTTTACTAT ATAGAAGGCT 900 

TGCGAATTTC TTGAAAAGAA TTTAATCAAA TTAAAGCAAA ATAATATACA CATAATAATA 960 

AAAAGCAGGT GACTATCAAT AAACGATAGC TTGCCTOCTT TTTCTATAGA ACATTTGTCT 1020 

AAAAAATCAA TTATTCaAAT TTTAAAGCGT CCCCATCAAA TGATTCGTCT GCAATTTTAA 1080 

TAGAATCTGt AGGGCATCCa TCAATTGCAT CTTCCATATC TTCATATAAT TCCtCAGGTA 1140 

CTTCTGCAGT ACCTTGGTTA TCGTCAAGGA TTACGAAAGC AATACCTTCG TCGTCGTAAT 1200 

CATOTATATC TGGCGCTGCT GCACCGcATG CACCACATGC AATACAAGTA TCCATATCAA 1260 

CGATTGTATA TTTTGCCAAT GTCTTCGCCT CCTTTGATAA AAATGCTAAA ATAGTAATGT 1320 

GACTAAAATT TTAGACAGCA TCATTTTTAT TTTCAAATTA TCCGTTTTAC AGAGTGAGGG 1380 

TTAAATTTGC AACACATTAT AAAAACAGCA TTACAACAAA CATTTAACTA TAAAACAAAT 1440 

AAAAGTA1TT ACAATATCTT AGTTGGTAAQ AAATCTCACC AAACCTTTTT TGACGCTTGT 1500 

AGTCAACAAC AGTTGTCATT ATATCAGAGT TTACCACTAT TAAAATATCC GTCTTTTGAG 1560 

CTATTTCTAG AAAAAATCAA TGAATTTAAT GCTGAAATGG AAATCATGTT GCATCCTAGA 1620 

TATACATTTG aAAGCATGGG GCAAACATTT CAAGCAATTC AACTATTAGT GCAAaCCATG 1680 

TCTAATACCA AACAACATGT TTTTCATTTT GTACCAATCT CTCAAAATAA TAAGATACAm 1740 

AATGAACTAC ATAACTPGTT TAAAGCAATA ACTTTAAAAG GGCCATGTTA CTTACATTAT 1860 

TATTTGCAAG GCTATGATGA ACCAATGTAT ACGAGACAGC AAGTTAGTTT AATAGAAAAG 1920 

CTATCTCAAC AGCAATTGTT TGAATACGAA ATGAATAATT TAGTGACAAT GATGTTTGAA 1980 

TTAGAAAGTG GAGAATATAC TATTTTATCA AAAATAATAA TGAAACCTAC ATTATTAAAT 2040 

CAAACTTATA TTACTTATAC AAAATTGCTT GAACAATTCA CGATGGAAGA TATAGCGGCT 2100 

CAACAACAAG TTAAAATCAA TACTATCGAA GATCATGTAC TTGAAATCTT AATCAAAGGT 2160 

TACATGTCTA ATTACGATGA TTATGTTGAA CTAGAAGATC AACTCCAGTT TTTGAATTTT 2220 

TATCAACAGC ATCGTGGCGA ACX3ATTAAAA TTTTACAAAG AACAATTTGA CACGTTATCA 2280 

TATTTTCAAT TAAAAGTATT AATCGTTGGA TTTGAAAGAG QTGATCTGAA TGTTGCATGA 2340 

TATTTTACGA AACAAATTTG GATTCGAGAG TTTTAAACCG GGACAACAGG AAATTATAGA 2400 

AAGTATAATG TCTCAACAAC ACACTCTAGG TATACTTCCA ACTGGAAGTG GAAAGAGTTT 2460 

GTGTTATCAA ATACCTACGT ATTTATCAGG TAAGCCGAC7V TTAATTATCT CACCGTTAAT 2520 

ATCTTTAATG GATGACCAAG TTATGCAGTT GAAAATAAAT GGAGAAAAAC GTGTAACATG 2580 

TATTCACTCT GGTATGGATG AAATTGAGAA AAAGCATAAT ATTAAATGTT TACGACATAG 2640 

CCGCTTCATC TTTCTAAGTC CAGAATTTCT CCTGCAACCG TCAAATTTTA AATTAATATC 2700 

TATGATAGAC TTTGGCATGA TTGTTCTAGA TGAAGCACAT TGCCTATCTG AATGGGGATA 2760 

TGATTTCAGA CCACATTATG CTCTAATAGG AAAAGTAACA AAGCATTTTA AAGAAGCGGT 2820 

TGTCTTAGCA TTGACAGCAA CTGCACCACC GCATTTACAA QATGATTT6A CXXyUU^TGTT 2880 

AGCGATTCAA TTCAATGTTA TTAAAACTAC AATGAATCGC CCAAATATAA GCTTTAAGCA 2940 

TCTTAATTTT CATGATGATG AAGATAAAAT TGAATGGTTG CTGCCGTTTC TACAACAGTC 3000 

GGGAGCAACG ATTATTTATG TCTCATCGAA AAAGATGTGT CTGAATTTAG CGCAACTTAT 3060 

TTATGATTCA GGTTTTCTTA CAGGTATTTA TCATGGTGAT ATGAATTATC AAGAGCGACA 3120 

CACAGTTCAA CAACAATTTT TAAATAATGA TATTCCGATT ATAGTCGCAA CGAGTGCTTT 3180 

TGGTATGGGA ATTAATAAAA AAGATATTCG CACAATCATT CACTTTCATC TTTCAACAAG 3240 

TCCTTCTAAC TACATTCAAG AAATTGGCCG TGCGGGTCGC 6ATGGTGAAC TAAGTCAGGC 3300 

AATTAGTTTA TTCCAACXX3G ACGATAAATA TATTTTAGAA ACX3TTATTAT TTGCAGATAT 3360 

GATAACAGAA GAAGATGTAC AAAATTTCGA AATAGGAGAA TTTTTAGCTC CCGATAAACA 3420 

AGCCGTTTTG ACAACGTTGC AATCATTCTA TAGTATCGGC GCCTTGaAAC AGATATTTAA 3480 

GCAATCATTT AAACGAAAGC AATTAGGATT CTTTCGCATG ATTGGCTATT GCAAATTGGA 3540 
ATGTTGTGAC AATGATTCTA ATATAACTOA TATCGCAATT TTAAATAAOA AGAAGGTAAT 3660 

TAGAAGTATT GGATTTGATG AAAAGTTGCA AAATTTATTT CTCAGATAGT ATTACTTTAC 3720 

TAAAAGAAAA TTGACAAGCT ATAATTAGTG TATACACAAT TGAAAAATGA TTGAAATAAT 3780 

TTTGAAAAAT ATACATAAAC ATATGTCATG TGGGTATATT TTATGTAAAA TCATTGTAAT 3840 

AGAATAGAAA GGAAGATGGC TATGTCTAAT AATTTTAAAG ATGACTTTGA AAAAAATCGT 3900 

CAATCGATAG ACACAAATTC ACATCAAGAC CATACGGAAG ATGTTGAAAA AGACCAATCA 3960 

GAATTAGAAC ATCAGGATAC AATAGAGAAT AGGGAGCAAC AGTTTCCGCC AAGAAATGCC 4020 

CAAAGAAGAA AAAGAOGCCG TGATTTAGCA ACGAATCATA ATAAACAAGT TCACAATGAA 4080 

TCACAAACAT CTGAAGACAA TGTTCAAAAT GAGGCTGGCA CAATAGATGA TCGTCAAGTC 4140 

GAATCATCAC ACAGTACTGA AaGTCAAGAA CCTAGCCATC AAGACAGTAC ACCTCAACAT 4200 

GAAGAGGAAT ATTATAATAA GAATGCTTTT GCAATQGATA AATCACATCC AGAACCAATC 4260 

GAAGACAATG ATAAACACGA TACTATTAAA AATGCAGAAA ATAACACTGA GCATTCAACA 4320 

GTTTCTGATA AGAGTGAAGC TGAACAATCT CAGCAACCTA AACCATATTT TACAACAGGT 4380 

GCTAACCAAT CAGAAACATC AAAAAATGAA CATGATAATG ATTCTGTAAA ACAAGATCAA 4440 

GATGAACCTA AAGAACATCA TAATGGTAAA AAAGCAGCAG CTATTGGT6C TGGAACAGCA 4500 

GGTGTTGCAG GTGCAGCTGG TGCAATGGCT GCTTCTAAAG CTAAGAAACA TTCAAATGAC 4560 

GCTCAAAACA AAAGTAATTC TGGCAAGGCG AATAACTCGA CTGAGGATAA AGCGTCTCAA 4620 

QATAAGTCTA AAGATCATCA TAATGGCAAA AAAGGTGCAG CGATCGGTGC TGGAACAGCA 4680 

GGTTTGGCTG GAGGCGCAgC AAGTAAAAGT GCTTCTGCCG CTTCAAAACC ACATGCCTCT 4740 

AATAATGCAA GCCAAAACCA TGATGAACAT GACAATCATG ACAGAGATAA AGAACGTAAA 4800 

AAAGGTGGCA TGGCCAAAGT ATTGTTACCA TTAATTGCAG CTGTACTAAT TATCGGTGCA 4860 

TTAGCGATAT TTGGAGGCAT GGCATTAAAC AATCATAATA ATGGTACAAA AGAAAATAAA 4920 

ATCGCGAATA CAAATAAAAA TAATGCTGAT GAAAGTAAAG ACAAAGACAC ATCTAAAGAC 4980 

GCTTCTAAAG ATAAATCAAA ATCTACAGAC AGTGATAAAT CAAAAGAGGA TCAAGACAAA 5040 

GCGACTAAAG ATGAATCTGA TAATGATCAA AACAACX3CTA ATCAAGCGAA CAATCAAGCA 5100 

CAAAATAATC AAAATCAACA ACAAGCTAAT CAAAATCAAC AACAGCAACA ACAACGTCAA 5160 

GGTGGTGGCC AAAGACATAC AGTGAATGGT CAAGAAAACT TATACCGTAT CGCAATTCAA 5220 

TACTACGGTT CAGGTTCACC GGAAAATGTT GAAAAAATTA GACGTGCCAA TGGTTTAAGT 5280 

GGTAACAATA TTAGAAACGG TCAACAAATC GTTATTCCAT AATATAACTA TATAAATTGT 5340 
TAAATTGCGC TTATAAGTAT GTAGCGGTTT TrtCKTVVT^ CMAGTTTGT TATTTAACAA 5460 

GGTCTTGTCT CGAATATTGG CATATCAATT TAACTTTTTA AATAGTCATC AAAAAGATAA 5520 

AACACCACAA TCAACAAATT TAACGAGGAA GAATAAAAAA TAAATCAACA TATTAAATTQ 5580 

TAGTGTTATT CAACTCCGTA GCTAACAATT CTCTATTCAC ATTAAACAAA TTGTCAAAAA 5640 

TATATCATAA ATCTTCAAGC ACAGACTTAG CGCATCAATC ACTGAACTGT TATAATAGTT 5700 

TGGGATTAAA GGAGGCCGAA ACAATGCAAA AAGTTGAAAG TATCATAATT GGTGGAGGGC 5760 

CATGCGGATT AAGTGCGGCT ATTGAACAAA AAAGAAAAGG TATTGATACC TTAATTATTG 5820 

AAAAGGGTAA TGTCGTTGAA TCAATCTACA ATTATCCTAC TCACCAAACA TTTTTCTCAT 5880 

CAAGTGATAA ATTAAGTATT GGGcGAgTAC CGTTTATCGT TGAAGAAAGT AAACCAAGAC 5940 

GTAATCAAGC GCTAGTTTAT TACCGAGAAG TTGTAAAACA TCATCAATTA AAAGTAAATG 6000 

CATTTGAAGA AGTATTAACT GTTAAAAAAA TGAATAATAA ATTTACTATT ACTACGACGA 6060 

AAGATGTTTA TGAATGTCGA TTTTTAACAA TCGCGACAGG CTATTATGGT CAGCATAATA 6120 

CATTAGAAGT TGAAGGTGCG GATTTACCTA AAGTGTTCCA TTATTTTAAA GAGGCACATC 6180 

CGTATTTTGA TCAAGATGTT GTAATTATCG GTGGTAAGAA TTCGGCTATC GATGCTGCTT 6240 

TGGAGTTGGA AAAAGCTGGT GCTAACGTGA CGGTTCTATA TCGTGGTGGA GATTATTCGC 6300 

CTTCAATTAA ACCGTGGATA CTTCCAAATT TCACAGCATT AGTAAATCAT GAAAAAATTG 6360 

ACATGGAATT TAATGCT7\AT GTTACCCAAA TAACTGAAGA TACTGTGACT TATGAAGTAA 6420 

ATGGTGAAAG TAAAACGATA CACAATGATT ATGTATTTGC GATGATTGGT TATCATCCCG 6480 

ATTATGAATT TTTAAAATCT GTAGGCATTC AAATTAATAC AAATGAATTT GGAACAGCXSC 6540 

CTATGTATAA TAAAGAAACA TACGAAACAA ATATCGAAAA TTGCTATATT GCAGGTGTAA 6600 

TTGCTGCAGG GAACQATGCG AATACCATTT TTATTGAAAA TGGTAAATTC CACGGGGGCA 6660 

TTATTGCTCA AAGCATGCTA GCTAAGAAAC AAACGCCCTT AGAATCATAA AAATAAAGGT 6720 

CTATGTACTA AATAACTTAG TTTTACAACG ACTGACATTC ATGATATGTC AAATGAGGTT 6780 

GATGACTATT GATTGTACAT AGACCTTTTT ATGTTACGTA TTCATTATAA TTCAAAATAT 6840 

GATTTGATTT CAGCTTTATC TAAATTGTTG CTTAACGCGA CTAATAATTT TAATCTTGCT 6900 

TTTGGACCAT TCAAGCCGTT AGAAAAAATA AAACCTTGTT GTGCGAGTTG GTAACCACCA 6960 

CCATCGTATG CGTAAGTTGG ACTCACAATA CCATTAAAGG AACGTGAAAC TAGCACAATA 7020 

GGTATATTTA AAGATACTAA TTGTTGAATG CCTTCTAATG CGCTTGGAGG TATGTTGCCT 7080 

TGTCCTAACG CTTCAATAAC CATACCATOG ATACCTTCTC GACTATAAAA ACTAAAAATG 7140 
TCCAATGCTT GTTGGCGATA TGGCATATGA TGGAATTGCA CACGATCCTT TGTCAATACA 7260 

CCTAGCGGAC CATGATTTGG ACTTTGAAAT GTGTTTGTAT TAGACGTATG TGTTTTGGTA 7320 

ACATTACX3CG CCGTATGAAT TTCATCATTA AaTACAACCA TCACGCCTTT ATGACGGGCC 7380 

TTTTCATCAG AGGCAACGCG AATAGCGGAA ATATAATTAT ATAATCCGTC AGAACCAATT 7440 

TCATTAGACG AGCGCATTGG CCAAGAAnTA ACAACAGGTT G 7481 

(2) INFORMATION FOR SEQ ID NO: 237: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6346 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 237: 





ATGAATTATA CCTTCCACTA GAAGTGTCGG TATTTATGTG CCTGGTGGTA AAGCAAGTTA 60 

TCCTTCAACG GTATTAATGA CAGCGACGTT AGCACAAGTA GCAGGTGTGG AAAATATTGT 120 

CGTTGTGACA CCACCTCAAC CTAATGGTGT GTCTCAAGAA GTATTAGCAG CATGCTATAT 180 

TACGCAAGTT GATCAAGTAT TTCAAGTTGG TGGTGCTCAA AGTATTGCTG CGTTAACTTA 240 

TGGCACAGAA ACGATACCTA AAGTTGATAA GATTGTAGGT CCAGGCAATC AATTTGTTGC 300 

ATATGCCAAG AAATATTTAT TTGGACAAGT AGGTATTGAC CAAATAGCTG GACCAACAGA 360 

AATAGCACTT ATTATTGATG AAACAGCAGA TTTAGATGCC ATCGTATATG ATGTTTTTGC 420 

ACAAGCAGAA CATGATGAAT TAGCACGTAC ATATGCCATT AGTGAAGATG CGCAAGTCCT 480 

TAAAGATTTA GAATCACGTA TTGCTAAAGC ATTGCCTAAT GTGGACAGAT ACGACATTGT 540 

TTCTAAAAGT ATCGCTAATC AACACTACCT TATCCATGCT AGTAATTTTG ATGAAGCATG 600 

CCATGTCATG AATACAATCG CGCCTGAACA TGCGTCGATT CAAACAGTAA ATCCTCAACC 660 

ATATATTGAA AAAGTGAAAT ATGTGGGTGC ATTGTTTATT GGACATTATT CGCCAGAGGT 720 

CATAGGAGAT TACGTTGCAG GTCCAAGTCA TGTATTACCT ACAAATAGAA CAGCTAGATT 780 

TACCAATGGG TTATCGGTCA ATGATTTCTT AACACGGAAC ACGGTCATCC ATTTATCAAA 840 

AGATACGTTT GAACAAATTG CTGATTCAGC ACAACATATT GCTCATGTTG AAGCATTATA 900 

CAATCACCAG CAGTCTATTT TAATACGTCA GTCTTAGGGG AGTGTAATTG AAATGATTTA 960 

TATTGATAAA AATGAAAGTC CAGTTACGCC GTTGGATGAA AAAACAATGA CGTCTATTAT 1020 

TAGTGCAACg CnATATAATT TATATCCTGA TGCAGCATAT GAACAATTCA AGGAAGCTTA 1080 
GATTCAAAAQ TTAATGCTQA TCATGCCAGA AGGTCCGGCA TTAACGCTAA ATCCTGATTT 1200 

TTTTATGTAT CAAGCATATG CGGCACAAGT AAATCXSTGAA ATTGCATTTG TAGATGCAGG 1260 

ATCAGATTTA ACGTTTGATT TGGAAACCAT TTTAACC3AAA ATCGATGAAG TACAACCATC 1320 

ATTTTTTATT ATGAGTAATC CACATAACCC TTCAGGCAAG CAATTTGATA CGGCATTTTT 1380 

AACAGCTATT GCAGATAAGA TGAAAGCATT AAACGGATAC TTTGTCATTG ATGAAGCATA 1440 

TTTAGATTAT GGTACGGCAT ATGACGTGGA ACTGGCACCA CACATCTTAA GAATGCGTAC 1500 

ATTATCAAAG GCGTTTGGAA TTGCCGGCTT AAGATTAGGT GTCTTAATTA GTACTGCTGG 1560 

AACGATAAAG CATATTCAAA AAATAGAACA TCCATATCCA TTAAATGTAT TTACGCTAAA 1620 

TATTGCGACr TATATTTTTA GACATAGAGA AGAGACAAGA CAATTTTTAA CGAT6CAACX3 1680 

ACAfiTTAGCT GAGCAGTTAA AACAAATATT TGATACACAT GTTGCAGATA AAATGTCAGT 1740 

GTTCCCATCA AATGCTAATT TTGTACTTAC TAAAGGCTCA GCAGCGCAAC AATTAGGACA 1800 

ATACGTATAT GAACAAGGAT TTAAACCTCG CTTTTATGAT GAGCCG6TGA TGAAGGGCTA 1860 

TGTAAGATAC TCAATTGCAA CAGCATCACA GTTAAAGCAA TTAGAAGAAA TTGTTAAAGA 1920 

ATGGAGTGCA AAATATGATT TATCAAAAAC AACGAAACAC AGCTGAAACG CAACTAAATA 1980 

TTTCAATATC TGATGATCAG TCACCATCGC ATATTAATAC AGGTGTGGGC TTTTTAAATC 2040 

ATATGTTAAC CTTGTTTACA TTTCATAGCG GTCTGTCATT AAACATTGAG GCACAAGGTG 2100 

ATATTGACGT AGATGATCAC CACGTAACTG AAGATATCX3G CATTGTCATT GGCCAATTGT 2160 

TACTTGAAAT GATTAAAGAT AAAAAGCATT TCGTTCX5TTA TGGAACGATG TACATTCCAA 2220 

TGGATGAAAC ATTAGCACGT GTCGTTGTGG ATATAAGTGG GCGCCCATAC CTATCATTCA 2280 

ATGCATCATT AAGTAAAGAA AAAGTTG6TA CGTTTGATAC GGAGTTAGTA GAAGAATTTT 2340 

TTAGAGCGGT CGTAATCAAT 6CAAGATTAA CAACGCATAT TGATTTAATT CGTGGAGGTA 2400 

ATACACACCA TGAAATTGAA GCTATATTCA AAGCGTTTTC CCGTGCATTA GGCATAGCGC 2460 

TAACTGCAAC TGATGATCAG CGTGTGCCGT CATCGAAAGG TGTGATTGAA TGATTGTCAT 2520 

CGTTGATTAT GGATTAGGQA ATATTAGTAA TGTAAAACGC GCTATTGAAC ATTTAGGGTA 2580 

TGAGGTGGTT GTCTCAAATA CCTCAAAAAT AATCGATCAA GCAGAAACAA TCATATTGCC 2640 

CGGTGTCGGC CATTTTAAAG ATGCGATGTC AGAGATAAAA CGATTAAATC TCAATGCAAT 2700 

ATTGGCTAAG AATACTGATA AGAAGATGAT TGGTATTTGT TTAGGCATGC AATTAATGTA 2760 

TGAGCATAGT GATGAAGGCG ATGCATCTGG ATTAGGGTTT ATCCCAGGAA ATATTTCGCG 2820 

TATCCAAACA GAATACCCAG TGCCACACTT AGGCTGGAAT AATTTAGTGA GTAAGCACCC 2880 
AATTGCATAT GCGCAGTATG GGGCTGATAT TCCGGCAATT GTTCAATTTA ACAATTATAT 3000 

TGGTATTCAA TTCCATCCTG AAAAAAGCGG TACATATGGG TTACAAATTT TGCGTCAGGC 3060 

AATACAAGGG GGATTTATAA ATGATTGAAT TATGGCCAGC GATTGATTTG ATTGQGTCAA 3120 

CAAGTGTGAG GTTAACAGAG GGTAAATATG ATAGTGAAGA AAAAATGTCA CX3CTCGGCTG 3180 

AAGAAAGTAT TGCTTACTAT AGTCAATTTG AATGTGTGAA TCGTATTCAT ATCGTCGACT 3240 

TGATAGGTGC TAAGGCACAG CATGCCCGAG AGTTTGATTA TATTAAGTCA TTGAGGAGAT 3300 

TAACAACCAA AGATATTGAA GTAGGTGGTG GCATTCGTAC GAAGTCACAA ATCATGGACT 3360 

ACTTTGCCGC AGGGATTAAT TATTGCATAG TTGGAACGAA AGGTATTCAA GATACTGATT 3420 

GGCTTAAAGA GATGGCACAT ACATTTCCAG GTCGCATTTA TTTATCTGTT GATGCCTATG 3480 

GAGAAGATAT TAAAGTGAAC GGATGGGAAG AGGACACAGA GTTAAATTTA TTTAGTTTTG 3540 

TGAGACX3GTT ATCGGATATA CCTCTTGGCG GCATTATATA TACTGATATT GCTAAAGATG 3600 

GCAAAATGTC CGGACCTAAC TTTGAATTAA CTGGTCAATT AGTAAAGGCA ACGAOGATTC 3660 

CTOTCATTGC TTCCGGTGGT ATTAGACATC AGCAAGATAT TCAACGATTA GCGTCGCTAA 3720 

ATGTTCACGC TGCTATTATA GGAAAGGCTG CACATCAAGC ATCTTTTTGG GAGGGGCTAA 3780 

AATGATTAAA AAACGTATCA TTCCAT6TTT AGATGTCTU^ GATGGTCGTG TCGTTAAAGG 3840 

GATTCAATTT AAAGGATTAA GGGATATTGG GAATCCTGTT GATTTAGCAA TGTATTACAA 3900 

TGAAGCGGGT GCTGATGAAT TAGTATTTTT AGACATCTCT AAGACGGAAG AGGGTCATAG 3960 

CTTAATGCTA GAAGTGATTG AACAGACAGC GTCACGCTTG TTTATCCCTC TTACTGTAGG 4020 

GGGTGGGATT CAAAGTCTCG ATGATATTAC CCAATTGCTA AATCATGGTG CAGATAAAGT 4080 

ATCATTAAAT TCAAGTGCTT TAAAAAATCC ACAGCTCATT AAACAAGCGA GTGATAAATT 4140 

CGGTJ$GACAA TGCATCTGCA TAGGAATTGA TAGCTATTAT GATCCTGAAA GAAAAGCACA 4200 

TTATTGTTGT ACGACTGGTG GTAAAAAAAT GACAAATATT AAAGTATATG ACTGGGTACA 4260 

GCAAGTAGAA CAGTTAGGTG CAGGTGAGCT CCTCGTTACA AGTATGGGAC ATGATGGTAT 4320 

GAAACAAGGC TTTGATATTG AACACCTAGC AAATATTAAG TCTCTTGTAA ATATTCCAAT 4380 

CATTGCTTCT GGTGGTGGTG GCAATGCACA ACACTTTGTA GAATTATTTG ATCAGACGGA 4440 

TGTTTCTGCA GGTTTAGCTG CAAGTATATT ACATGATCGA GAAACGACGG TTCAATCTAT 4500 

TAAAGAAGTG ATACX^GCAAG GGGGTATAGC AGTAAGATGA CCAAATATAA AATTGATTTT 4560 

AGCAAAGGTT TAGTGCCAGC AATTTTACAA GATAATCAAA CAAAACAAGT ATTGATGTTG 4620 

GGTTATATGA ACCAAGAAGC TTTTGATAAA ACGA7AGAAG ATGGTGTGGT ATGTTTCTAT 4680 
AAAGATATTC ATGTAGATTG CGACAATGAC ACTATTTTAA TTGATGTCAT ACCAAATGGA 4800 

CCAACATGTC ATACAGGCAG TCAAA6TTGT TTCAACACAG AAGTTCCATT TTCAGTGCAA 4860 

ACATTAGCGC AGACAGTTCA AGATAGTGCC CAATCCAATA ATGAAAAGTC ATATACAAAA 4920 

TATTTATTAA CAGAAGGTAT AGAAAAGATT ACAAAAAAAT ACGGTGAAGA AGCTTTTGAA 4980 

GTCGTAATTG AAGCAATTAA AGGTGACAAA AAAGCATTTG TAAGTGAAGT AGCAGATGAA 5040 

CTTTATCATT TATTTGTCTT GATGCATGCG CTTGGCX3TCG ATTTTTCAGA AATTGAGGCG 5100 

GAATTAGCGC GTAGACATCA TAAGCGCAAT AACTTTAAAG GTGAAOGACA AAATATCGAA 5160 

CAGTGGTAAA GCAAGTATGG ACTAAGATAT AAGGAAAAGG ATCATGGCTT ATACACTTAC 5220 

AAATATTGTG GAAAACGTGA CATTTTCAAG TTTAAAATAC GACACCAACA TATTTTAACT 5280 

ATGAATGCTG TGATGGTACT AAAGTTGCGA ACTCGTTATA GATAAGTAGT GGATAATCAC 5340 

AATAGGAAAT CAAAAATAAT TATAAAAAGT AAATTGAGCA ACTCAQGAAT AGATGTCACT 5400 

GTTAAAGATG TCGAAAAGTA TATGAATCGA TATAATGAAG TTATGAAGGG AAAAAATGGC 5460 

GAAAAAGCTA AAGAG1TATG TTTGTCGTTA CTACCTATTA ATATCATAGT TCTCTTTACA 5520 

TTCTTTGTAT TTATACTATA AATACAAATA TATCTAGCCT GAAATAGAAA TGTCATAGCC 5580 

TATTTAAAAG ACAATCTCCA TTAGAACTAA GATATGCATC COGAAAGTTA GACTAAAAAA 5640 

CTAACTTTAT GGGATGTATT TTTATGCTAA TCATCATAAA TTCGAGATTA AGTTAAAGGT 5700 

AGTTCAAGAG TAATTAAACA ATAAATwAAA AATAGTAGGA TACTTACTTT GAGGGAAGAA 5760 

AATTAACTCT ATATATTTAG TTTAGGAACA AGTATTACGG TTTATCCTGA TACAATTATT 5820 

GTGGATGGGA TGATATTTTT AGGTTTAAAA TACGACACCA GCAAACATAA TAACTGTAAT 5880 

AGCTCATAAA TCTCCCCATA TAGCTAATCT AAAAAAATAA TACATCATTG GAATTAAGCC 5940 

CCAAGCATGT AAATATTAAA AATCAAAAmA GATATRlTGTA AAAaAGTTAC AATTtGCATA 6000 

ATTAAATTGT GTCTAATTAT TGACTAATTA AATTTTGCCA AATATAATAT TAATTAATAA 6060 

TTTGaAATGA TTAGCGTATA CACTTTAAAT TCTCTTTGGA GAATATATTT TTTAAATACA 6120 

AATGTAAACG CTTTCTCX3TC AAATTAAACA ATAGAAAGQA TGGTCATTAT GAGTGCTTGG 6180 

TTAAGTAAAT TATTTGAGTT TATTCCTCGA ATAATTATCA ATTTGTTTAT CTAAAATAAA 6240 

AAAATAGAGG TGCTGACAAT GATGAAAAGT CAAAATAAGT ATAGTATTCG TAAATTTAGT 6300 

GTAGGTGCAT CTTCCATTTT AATAGCTACA TTACTATTTT TAAGTG 6346 

(2) INFORMATION FOR SEQ ID NO: 238: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3775 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 238: 

TGATAATGAT TCTCATTGTC ATACATCACG AAGGAGGCTA ATTA6TCAAT GAATAAAGTA 60 

ATTAAAATGC TTGTTGTTAC GCTTGCTTTC CTACTTGTTT TAGCAGGATG TAGTGGGAAT 120 

TCAAATAAAC AATCATCTGA TAACAAAGAT AAGGAAACAA CTTCAATTAA ACAT6CAATG 180 

GGtACAACTG AAATTAAAGG GAAACCAAAG C6T6TTGTTA CGCTATATCA AGGT6CCACT 240 

GACGTCGCTG TATCTTTAGG TGTTAAACCT GTAGGTGCTG TAGAATCATS GACACAAAAA 300 

CCGAAATTCG AATACATAAA AAATGATTTA AAAGATACTA AGATTGTAGG TCAAGAACCT 360 

GCACCTAACT TAGAGGAAAT CTCTAAATTA AAACCGGACT TAATT6TCGC GTCAAAA6TT 420 

AGAAATGAAA AAGTTTACGA TCAATTATCT AAAATCGCAC CAACAGTTTC TACTGATACA 480 

GTTTTCAAAT TCAAAGATAC AACTAAGTTA ATGGGGAAAG CTTTAGGGAA AGAAAAAGAA 540 

GCTGAAGATT TACTTAAAAA 6TACGATGAT AAAGTAGCTG CATTCCAAAA AGATGCAAAA 600 

GCAAAGTATA AAGATGCATG GCCATTGAAA GCTTCAGTTG TTAACTTCCG TGCTGATCAT 660 

ACAAGAATTT ATGCTGGTGG ATATGCTGGT GAAATCTTAA AT6ATTTAGG ATTCAAACGT 720 

AATAAAGACr TACAAAAACA AGTTGATAAT GGTAAAGATA TTATCCAACT TACATCTAAA 780 

GAAAGCATTC CATTAATGAA CGCTGATCAT ATTTTTGTAG TAAAATCAGA TCCAAATGCG 840 

AAAGATGCTG CATTAGTTAA AAAGACTGAA AGCGAATGOA CTTCAAQTAA AQAQTGGAAA 900 

AATTTAGACG CaGTTAAAAA CAACCAAGTA TCTGATGATT TAGATGAAAT CACTTGGAAC 960 

TTAGCTGGCG GATATAAATC TTCATTAAAA CTTATTGACG ATTTATATGA AAAGTTAAAT 1020 

ATT^AAAAC AATCAAAATA ATTAAGGAGT TTTACGATGC TACTTAAACC AAAATACCAA 1080 

ATCGTTATTG CTGGTTTATG TCTTGCAATA GTAGCTATCT TAAGTTTAAT GATTGGAAAT 1140

ACGCTTGTGT CACCAGGTAC GGTGATACAG GCGTTATTCA ACTTTGATAG TGAAAACGAT 1200 

TTACATGATG TTGTCACTGG TGCACGGGCG TOGAGAACAA TCATTGCGTT ATTGACTGGT 1260 

GCTGCCCTTG CTGTCTCAGG TTTGTTGATG CAAGCACTTA CACGAAACCC AATAGCCTCA 1320

CCAGGGCTTT TCGGTGTCAA TGCAGGCGCA GTATTTTTTG TCATTTTTAG TATTACATTT 1380 

ATCCAAATTC AATCTTTTAA AATGATTGTA GTTATTGCAT TTTTGGGGGC TATTGTTGTT 1440 

ACTGTATTAG TTGTTGCACT AGGTATGTTT AGACAAACAC TATTCTCACC TCACCGTGTC 1500 

ATTTTGGCAG GTGCTGCGAT TGCGATGCTA TTTACAGCCT TTACTCAAGG CATACTTATT 1560 
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AATATTTGGG ATATCCCATG GATTATTCCG CTTGTATTCA TACTTATTTT AATTGCATTT 1680 

AGCATGGCTG CACACATCAA CATCTTGATG ACAAGTGACG ACATTGCAAC CGGCCTCGGT 1740 

CAAAACATAA AATTAATCAA ATGGATGATT ATTATGCTCA TCAGTATGTT AGCCGGTATT 1800 

TCGGTAGCCG TAGCTGGATC AATCGTCTTT GTGGGTCTTA TCGTACOGAA TATTAGCAAA 1860 

CGATTATTAC CACCAAACTA TAAGTATTTA ATTCCTTTTA CTGCATTA6C TGGAGCAATC 1920 

CTAATGATCA TTTCAGACAT TGTTGCTCGT ATAATAATTA AGCCACTAGA GTTGCCTATC 1980 

GGTGTCGTTA CCGCTGTCAT TGGCGCTATT GTCTTAATCT ATATTATGAA GAAAGGACGT 2040 

CAACGCTTAT GACCGAAAAG ATTAATAAAA AAGACAATTA CCATCTCATC TTCGCGTTAA 2100 

TCTTTTTAGC CATCGTTTCA GTGGTAAGTA TGATGATTGG TTCAAGCTTT ATACCATTAC 2160 

AACGCGTACT GATGTACTTT ATAAATCCAA ATGACAGTAT GGATCAATTC ACTTTAGAAG 2220 

TATTACGCTT ACCTCGCATT ACACTTGCGA TTTTAGCAGG TGCCGCACTA GGAATGAGTG 2280 

GTTTAATGTT GCAAAATGTA TTAAAAAATC CAATTGCCTC ACCTGATATT ATCGGTATCA 2340 

CAGGTGGTGC TAGCTTAAGT GCTGTTGTCT TTATTGCATT TTTCAGCCAT TTAACAATAC 2400 

ATTTACTTCC ACTATTTGCA GTATrAGGTG GCGCAGTTGC AATGATGATA CTATTAGTGT 2460

TTCAAACGAA AGGACAAATA CGCCCGACAA CACTCATAAT CATCGGTATT TCX3ATGCAAA 2520 

CGTTGTTTAT TGCGCTTGTC CAAGGATTAC TCATTACAAC GAAGCAATTA TCTGCTGCCA 2580 

AAGCTTATAC ATGGCTAGTC GGAAGTCTTT ACGGTGCTAC GTTTAAAGAT ACAATCATTT 2640

TGGGTATGGT TATTTTAGCT GTTGTGCCGT TGTTATTTCT TGTTATACCA AAAATGAAAA 2700 

TATCTATACT TGATGACCCT GTAGCGATTG GCTTAGGCTT ACATGTACAA CGTATGAAAC 2760 

TAATCCAATT AATCACTTCT ACTATACTCG TATCTATGGC AATCAGTTTA GTAGGTAACA 2820 

TTGQGTTTGT CGGTTTAATC GCACCACATA TCGCGAAAAC AATCGTTCGC GGAAGTTATG 2880 

CTAAAAAGTT ACTAATGTCA GCAATGATTG GTGCCATATC AATTGTTATT GCAGACTTAA 2940

TTGGGCGTAC CTTATTCTTG CCTAAAGAAG TGCCAGCAGG TGTATTTATT GCTGCTTTTG 3000 

GTGCCCCATT CTTCATATAC TTATTATTAA CCGTGAAAAA GTTATAACGA TATTATTAAA 3060 

ACAAAATGAC CTCACAACGA AGTTAGCTAA ATGATTCAGT TAACTAACCG TTGCGAGGTT 3120 

TTTTTATACA TATAGTTGTT GTTATTGTTA ACAAGCGTCO ACTTTCTTAA TTACATATTA 3180 

ATACTTTATA TACAAATAAC ACCGACTCAT ATTCTATAAT ATCAATCAAT ATTCTTCGAT 3240 

TTTTCAAATA TCGATAACTA TTTCTTATTT AAATATAGTG TTTGATAATG TCATTTATTC 3300 

AAAAACACAA ATTTTAATAA AAATATCATA TTATTTTTAA TTGTAAATTA TGGATTATTT 3360 
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ATATAATAAC CATGACAAAA TTAGAGATTT TATAATCATT GAAGCATATA TGtnTCGTTT 3480 

TAAGAAAAAA GTCAAGCCT6 AAOTCGATAT GACTATAAAA GAATTTATAT TACTGACTTA 3540 

TTTATTTCAT CAGCAAGAAA ACACACTTCC ATTTAAGAAG ATTGTTTCAG ATTTATGTTA 3600

TAAACAATCG GATTTAGTAC AGCATATAAA AGTACTTGTG AAACATTCAT ATATTAGTAA 3660 

AGTTCGAAGT AAAATTGATG AGCGTAATAC TTACATTTCA ATATCTGAAG AACAACnAGA 3720 
nAAAATTGCA GAACtiTGTTA CATTGTTTGA TCAAATCATT AAACAATTTA ACCTT 3775 

(2) INFORMATION FOR SEQ ID NO: 239: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1361 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 239:





AGAAAAATTA GCCTACCTAT GCAAGTTGTT ATTGCATTAG TGTTAGGTGT TGTCGTAGGA 60

CTTTXATTAT ATGGACAAGA AAACGTAGCA AATTACATTA AACCATTTGG TGACGTATTT 120

TTAAATTTAA TTAAAATGAT CGTTATACCA GTTGTATTTT GCTCACTAGC GCTTTCTATT 180

TCGAACGTTG GGGAATCGAA AACTGTAGGG CGTTATGGCT GGaAAACAAT TTTATACTTT 240

GAAATTATTA CAACAATCGC AATAGGTTTA GGGATTATCT TCGGTAACCT ATTTAAACCA 300

GGTGCTGGAT TAGACCCAAC AAAATTACCT AAAGGTGATA TTTCTAAATA TCAATCAACT 360

GCACATGCAG CAGAACAATC TACATATGGa AATCATTTTA TTGATACCAT TGTACATATT 420

ATTCCGACAA ACrrTTTTGA AGCTTTAAAT AAGGGTGAAT TATTACCTAT TATCTTCTTC 480

GCAGTATTCT TTGGATTAGG ATTAGCTGCr GTAGGTAAAA AAGCAGAACC AGTTAAAGAA 540

TTTTTAAGCG GATCGCTTGA AGCTGTGTTC TGGATGATTA ATAAAATTTT AAAATTAGCA 600

CCACTTGGAG TGTTTGCATT CATTTGTACT ACAATTATTA CATTTGGTGC ATCCGCATTA 660

TTACCACTAT TAAAATTAGT ATTAGTTGTT GTCTTTGCAA TGGTGTTCTT TGTATTCGCT 720

ATACTAGGAC TAGTTGCATG GATGTGTGGT ATTAATATCA TGAATATTAT TAGAATCTTG 780

AAAAGTGAAT TGCTTTTAGC ATTTTCTACA TCAAGTTCGG AAGCTGTACT TCCTGTAATO 840

ATGAAGAAAA TGGAAAACTT CGGTTCTCCA AAAGAAATTA CTTCTTTTGT TATACCAATT 900

GGTTATACGT TTAACTTAGA TGGATCAGCA CTTTATCAAT CTATTGCAGC ATTATTCGTT 960

GCACAGATGT ATGGAATGCA CTTAACATTA TCAGAGCAAA TTGTGTTGAT GTTAACATTA 1020
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ACATTAGGTG CCATGGGCTT ACCGOCACAA GGTTTAGCAT TAATTATTGG TGTTGACCGT 114 0 

ATCTTAGATA TGGTACGTAC ATGTGTAAAC GTTATTGGTA ATGCATTATC AACAATCGTT 1200 

ATAGCTAAAT GGGAAAACGT ATATCACAAA GCAAAAGGTC AAGAATATTT AAAATCAATT 1260 

TAAAAAATAC TATCTGACAT TTAArGnCCC TTACAACCTT TGGTTgTnAG GGCTnTTTTA 1320 

TGTCATGCGT CTTAAAGCCA GGCCGTATAn CGGTAAGCGT A 1361 



(2) INFORMATION FOR SEQ ID NO: 240: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1489 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 240: 





TCAATATGTA AATCGGCGAT TGTCCCTATC tTCATACCCT TACCTTCTTT ACTCAAATTA 60

AAATGTTATT TATTATGCCA TaGCTTATCT AATATATATA GTTAACTGCT TCATTTAGAT 120

GATTATTTTA TATTTTTGCA TAAAAACTTA TATCTTTTCA AAAAAATCGA TAAGTTTTAG 180

TTATCATACC CTTACCTATC AAATGTTTTT TCTTATATTT AAAAAAATAA TTGCTTTATT 240

AAATGGATTT CTTTAGTATT TATAATTAAG AAAACGCTTA CACACAACTT TTTTATTTGC 300

TTTATCCTGA GGAGGAAAAT TATGGCAAGA AAATTGCATA GAGAGTTGAA TAACAGACAC 360

ATCCAATTAA TAGCAATTGG GGGCGCAATT GGAACTGGGT TATTCCTAGG ATCAGGTCAA 420

ACAATATCTT TAACTGGTCC ATCACTGTTA TTCACATACA TGATTATTGG GGTTGTACTA 480

TTCGCTTTTA TGCGCGCATT AGGCGAATTG TTGTTGAGCA ATACAAGATT TAATTCATTT 540

GTTGATATTG CAAATGAATA TTTAGGCCCT TTTGGTGGCT TTGTCATTGG CTGGACTTAC 600

TGGTTATGTT GGATTGTATC AAGTATGTCA GACCTAACTG CGATGGGACA ATACTTTGCA 660

TTTTGGTATC CACAAGTCCC AAATTGGATT ACCGTGCTAT TTATTGTTTT AATCTTGATT 720

AGCTTCAACT TATTAGGTGC CAGATTATTT GGTGAACTGG AGTTTTGGTT CTCGATTATT 780

AAAGTTGTCA CAATTATTGC GATGGTTATC GTTGGTCTTG TATTAATCTT TTTCTCATTT 840

AAAACACATT ATGGACATGC ATCATTCACA AACTTAATCA GTCACGGTGG CATGTTCCCT 900

GGTGGAaCAT TTGGTTTCTT AATGTCATTC CAAATTGCTG TATATTCATT CATTGGTATT 960

GAACTTATAG GTGTAaCTGC TGGTGAAaCG AAAGATCCTG AAAAAACCTT ACCGAAAGCA 1020

ATTAATAATG TACCTATCCG TATTTTATTA TTCTATATCG OTGGTCTATT AGTAATTATG 1080
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TTAATCGGCG TACCATTTGC AGCAGGTGTC GTTAACTTTG TCGTGCTAAC TGCCGCGGCC 1200 

TCTGCTACAA ATAGTGGTAT CTATTCGAAT AGTCGTATCT TATTCGGACT GTCACAACAA 1260 

GGGTTAGGTC CTAAAGTTTT AAATAAAACG AATAGTCATG GCGTGCCTTA TTTATCAATG 1320 

TTAGTTTCAT CAATTGCATT ACTTATAGCA GCCTTGTTAA ACTACATTTT CCCTAATGCA 1380 

ATTCAACTAT TCATATACGT TACAACGTTA tCAACTGTGT TGTTTTTAGT TGTtTGGGCA 1440 

ATGATnATTG TCGCTTATCn AATGTATTTG GAAAAAGCAT CCTGAGGCA 1489 



(2) INFORMATION FOR SEQ ID NO: 241: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 5000 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 241: 





TTTTCCATCA TrATcACCAT TTAAGACAAT AGCTGTATGA CCATATCCTC CACCAAATCT 60

TCCACTAAAA ACCACTAAGT CCCCAGGTTC AGCTTTGAAA GTTGGTGTGT TGTGATAAAT 120

TTTAGCTTCA CTATTAAAAT TATTTGCATA TGGTATATCT TTAGCTCCAT ATCCTTTtAA 180

TCCAkOACCA TAAAGAkGAT TCCAATATAC ATTAACTAAA TCGAAACATT GCCATCCATA 240

ACTACCGTCG AAATCCCATC CTCTATTCTC TAATGTTTTC AAATAATTTA ATGTACTGCT 300

ATTACTATTA CTTTTATTAT TTGAAGACAC TGTTTTTGGT TTTGGTTCTA CTAATGGTGT 360

CATTGGCACT TTTAATTTTT GACCAATAAA TATTAAATTA GGATTTGCTA TATTATTTGT 420

ATTTTGAATA TTTGAAACTG TAGTTTTGTA TTTTAATGCT ATAGCACTAA GTGTGTCTCC 480

TTTTHTACA GTATAGATTT GTGTTTTTGG AGCTTCTCTA AAACTGTAGT AACCAAAGTA 540

ATTATTAGTA ACTTTATTCG TTTTCTTCTG ATTAGAATTT TGAGCTTCCA AGTTTGCAAT 600

TTTAATTTCT TTAGTAAGTT CATTGTTATT AATAACTAGA TTGTTACCTT GGCTTGAGTT 660

TTTCGGA6TA TTTGAAATCT TTATATCTTG ATTAATTTCA TTTCCGTTT6 AAATTGCTGA 720

TTTGTTGTCT AACTTTAAAC TTGTGTCCGA TGTTTTAACA GCACCTTCAT TTTTTATTTT 780

GTCTTTTGTC GTATTTTTAT TAGCATTTAA CTCTGATTTC GCGAATACAT TTTGCTCATA 840

CCCTCTTGTA AAATCTTTAG ATTTATCAAT TTCATCTGCA TATGCTTTGT TCGACATACC 900

CAATGCCAAA AACATACCTA TTGAAATTGA CAAAATTCCA ATACTAACTT TTCTAATTGA 960

ATAGCGTACT TTACTTTGTT GTTTATTCAT GATGAAACAC TCCTTACAAT AATATACAAA 1020
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GTAGGAGGTA AAATAATTAA CTTGTCTTTC CAAAATATGA AAAGTGTACT AAAaTTCATC 1140 

GCACGACAAA TAGCCCATTT CCGATACTTT TATAAAGTAT GGAaTGGGCT ATAGCCATTT 1200 

ATATCATCTT TTAACTTTAT TTATTAACAG TTAATAATGA TTCATAAATA CCTGCTTCTT 1260

TAGCAGCTTC AATTAATGTT GAACCAATTT CTGAAGGTGT TGCCGCTGTT TTCACACCAC 1320 

AACTATTTAA TGTTTTAATT TTCTCTTCAG CAGTACCTTT ACCACCTGAA ATGATTGCAC 1380 

CAGCATGTCC CATACGTTTT CCAGGAGGTG CTGTTTGTCC ACCGATAAAG CCTACAACTG 1440 

GTTTTGTCAT ATTCGCTTTA ATCCATTCAG CTGCTTCTTC TTCAGCCGTA CCACCGATTT 1500 

CACCAATCAT AACAACTGCT TTCGTTTCGT CATCTTCATT GAATGCTTTT AAAACATCAA 1560 

TAAAGTTTGT TCCGTTGACT GGGTCTCCAC CAATACCAAC AGCTGTAGTT TGACCAATAC 1620 

CTTCTTCAGT CAATTGGTGC ACTGCTTCAT ATGTTAATGT ACCTGAACGA GATACTACAC 1680 

CAACATGACC TTTTTTGTGA ATATAGCCAG GCATAATACC AATTTTACAT TCATCTGCTG 1740 

TAATCACACC TGGACAGTTC GGACCAACTA AACGTGTTTT TCTACCTTGT AAGTAGCGTT 1800 

TAACTTTAAC CATGTCTAAT ACAGGAATAT GTTCAGTGAT ACAAATAACC ATATCTAAGT 1860 

CTGCATCAGC TGCTTCTAAA ATTGAGTCTG CAGCAAATGG TGCTGGAACG TAAATGACTG 1920

AAACCGTTGC CCCAGTTTCA TTTTTAGCTT CTTCAACAGT GTTGAAAACA GGAACGCCTT 1980 

CAACAACTTG ACCACCTTTA CCAGOCOTCA CACCTGCTAC TATTTTCGTA CCATAATCAA 2040 

GCATTTGTTT TGTATGGAAA AGGGCAGTA6 ACCCTC3TAAT ACCTTGTACX: ATTACTTTAG 2100

TATTCTTATC TATAAATACA CTCATCTTA6 TGCTCCCATC CTTTCCTTAT GCTTCTTTGA 2160 

CTAGTTTAAC AATTTTTTGT GCACCTTCAG CCATTGTTGC TGCTGGTTCA ATTCCTAATC 2220 

CTGAGTCTTT TAAGATTTTT TTACCTAACT CAACATTTGT ACCTTCTAGG CGTACAACTA 2280

GTGGiTAAAGT TAAATCTACT TCTTTTACAG CTTCAACGAT ACCTTCTGCG ATAACATCAC 2340 

ATTTCATAAT GCCACCGAAA ATGTTTACAA AAATACCTTT AACATTTTCA TCACCTAAAA 2400 

TGATTTTAAA TGCTTCAGTT ACTTTTTCTC TAGTAGCGCT TCOGCCTGCA TCTAAGAAAT 2460 

TGGCTGGGTT TCCACCGAAA TGATTAATCG TATCCATTGT TGCCATGGCT AAACCTGCAC 2520 

CATTAACCAT ACATCCGATG TCACCATCTA ATGCAATGTA TOATAAATCA TGTTTAGACG 2580 

CTTCAATCTC TTTCGGATCT TCTTCTTCTA AATCAOGTAA TTCTACAACA TCTTTATGTC 2640 

TGAATAATGC ATTATCATCA AAATTAATTT TAGCATCTAA TGCCAATACA TCACCATCAG 2700 

CTGTTGTAAC TAATGGGTTG ATTTCTACGA TTGAACAATC TTTTTCAATG AATACATTAT 2760

AAAGTGCTAA TAAGAATTTA GCAGCTTTGT TAACAGATTC TTTAGGAATA TTAATATTAA 2820 
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AGATCTTTTC AGGAGTCTTC GCAGCAACTT CTTCAATCTC AGTGCCCCCT TCTTCAGACG 2940 

CCATCAATGT TACTTGGTCA GTCGCACGAT CAATAACGAA TCCAACGTAA TATTCTTTTT 3000 

GAATAGCACA ACCTTCTTCG ATATATAAAC GCTTAATTTC TTTACCTTCT GGACCAGTTT 3060 

GATGTGTCAC CAAAGTTTTC CCTAATAATT CTTTTGCATA TGTTTCTACC TCAGATAAAG 3120 

ATTTAGCAAT TTTTACTCCG CCTGCTTTAC CTCTACCTCC AGCATGAATT TGTGCTTTTA 3180 

CAACATAAAC ATCAGAATTT AATTCTTTTG CTTTCTCCAC CGCTTCTTCA GCAGTAAATG 3240 

CTACTCGTCC TTCTGGAACT GCAACGCCCA TTGAACGAAA TATTTCTTTA CCTTGATACT 3300 

CGTGGATATT CATCTTCCAT CCTCCTGTTA CTTAGGTTAA GTTCCCTTAC AATTATAAAA 3360 

AATGTAAGCG CTATTGTAAA CTTAAATGCT ACTTTTTTAT CATTTAATTG AATTTTACGA 3420 

TTTACAGTAA CGATTTTATA GGTTCAAAGC TTTTTCTATG CTCTTTCATA ATGCCAATAT 3480

CATCGATTGC TAGTAAATGT TGTTTGGTAC CGTAACCCGC GTTTTTTTCA AAACCATATT 3540

CAGGATAATC TTTAGATAAC TGTGTCATAT AATCATCACG AAAAACCTTT GCCATGATAC 3600 

TTGCAGCTGC AATGGACAcfA CTTCTTGCAT CACCCTTGAT TAAAGATACT TGAGGCAGTG 3660 

CATTATCAAG CGTCATCGCG TCTATCAATA AATGCGTTGG TTGTACTGAT AATCCATCAA 3720 

TAGCTCGCTG CATGGCGATT TGAGTAGCTT TATAAATATT AAATTCATCT ATTTCTTnCA 3780 

GtGTCGOGAT CCCATATGCA AAAGCAGTAA CTTCATTTTT TAGTGCTTCA TTTAATTCTA 3840 

GACGTTTCGT AACAGGTACT TTTTTCGAGT CATCAAGGCC CAAATAATTG TGATTTGAAT 3900

TTAAAATTGT TGCGCATGCA ACGACTGGAC CTGCTAAAGG TCCTCTTCCA ACTTCATCAA 3960 

TCCCACAAAT AATAGCATTA GGATGCTCTT TTAATATTTC ATTTTCAAAG TAAGTCATTT 4020 

CAACATACTT TTCTTTTAAA GCTTGTTCTT TTTCTAACGC TTTTCTGCGC CTAGCTATGG 4080

CATETTGAAC ACCTTTTCGC TCATCTAAAA AGCATTCATG ATTTTCTAAT TCTTCTATTG 4140 

TATTAACCGC ATTAATCAAC TGCGTAACTT CTTTAATTGT TAGCGTCATT TGCTAATTCC 4200 

TCAGTCATAT CTTTAAAAAT ATCAAAACAA TAATTTCCTA TTTTAGCATT TCGAATATCA 4260 

TAAATAATCA GTTCAATGAC TGCTTCGTAA TCAATTTCAT TACCACGTCG AATTAAGCCc 4320 

ACGTTTTTTC CCTATCGCAT CAAACCACGC TATGATTTCT GCAtCTTCAG GAACTTCAAT 4380 

ATTATAATGT GACTTTAATC GCGCTAAATC ATTTTGAATT AAAAAGTTTA ATCCATAGAT 4440 

GGCAACTTCA TCTAAGTGCA CAATACTATC TTTTATOGCA CCAGTTAAAC TCAACTTCTT 4500 

ACCGACTTCT TCATCTTCAA ATTTAGGCCA AAGTATCCCT GGTGT6TCTA ATAGTTGTAA 4560 

TCCATTACCA ACTTTAATCC ATTGTTGTTG TTTGGTCACA CCTGGTTTAT TACCAGTCTG 4620 
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AACGATCATT GCTCTTATCG CTCTAGGTTT AAGTCCTTTC GCTTTTTCGC GTTCAAATTT 4740 

TTCAGCAGTC GCCTTAATTG CTGCAGCTTC CACTTTCTTT AAATTTTTAC CGTGCTTAGC 4800 

ATCCACTGAT ACAGGATAgT AtCCTTTATC AATAAAAAAT tGTTCCCATT TTGACATCTC 4860 

ATTTAAATTA GACATATCTT TTTTATTTAA TATAACAACA C6TGGTTTTT GGTTAATAAC 4920 

TTCATCTATC ATAGGGTTTC TTGAACTATA TGGAATTCTT GCATCTACTA GTTCAAACAC 4980 

TACATCTACT TTTTTTAATT 5000 
(2) INFORMATION FOR SEQ ID NO: 242: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1700 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242: 

AACCCGnAGA CGAAACTnCA TAGTTGCCTG TTATGTTGAT TAGTGCGTTT GTAGCTATTT 60 

nAAATCAAAC ATTATTAAAT ACAGCGTraC CTAGTATAAT GAGAGAATTA AATATCAATG 120

AAAGTACATC GCAATGGCTA GTTACTGGGT TTATGCTTGT TAATGGCGTC ATGATACCTC 180 

TGACGGCATA TCTAATGGAT AGAATTAAAA CTAGACCTTT ATACTTAGCG GCGATGGGGA 240 

CATTTTTATT AGGTTCTATT GTTGCAGCCT TAGCTCCGAA TTTTGGA6TT TTAATGTTAG 300 

CTCGTGTAAT TCAAGCGATG GGTGCAGGCG TACTTATGCC CTTAATGCAA TTTACGTTAT 360 

TTACATTCTT CAGTAAAGAA CATCGAGGTy TtGCAATGGG ACTAGCAGGT TTAGTAATTC 420 

AATTTGCACC AGCAATAGGA CCTACAGTTA CAGGATTAAT TATTGATCAA GCGAGTTGGC 480 

GAGTfCCATT TATTATAATT GTAGGAATTG CTATACTTGC CTTIGTTTTC GGTTTGGTTT 540 

CAATCTCGA6 TTACAATGAA GTGAAATATA CGAAATTAGA TAAGCGTTCA GTAATGTATT 600 

CAACTATTGG GTTCGGGTTA ATGCTATACO CATTTAGTAG TGCAGGAGAT TTAGGATTTA 660 

CAAGTCCAAT AGTAATAGGT GCGTTGATAT TAAGTATGGT TATTATCTAT TTATTTATAC 720 

GTAGACAATT TAATATTACT AATGCACTTT TAAATTTAAG GGTTTTTAAA AATAGAACAT 780 

TTGCATTATG TACGATTAGT TCAATGATTA TAATGATGTC AATGGTTGGA CCTGCGCTGC 840 

TTATACCGCT ATATGTTCAA AACAGTTTAT CTTTATCTGC CTTGTTATCA GGACTTGTTA 900 

TCATGCCTGG TGCAATAATA AATGGTATTA TGTCAGTTTT TACAOGTAAA TTTTATGATA 960

AGTATGGTCC TAGACCATTG ATTTATACTG GTTTTACAAT TTTAACAATT ACTACAATTA 1020 
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GAATGTTTTC AGTTTCTTTA CTCATGATGC CGATAAATAC TACA6GAATT AATTCTTTGA 1140 

GAAATGAAGA AATCTCACAT GGCACGGCTA TTATGAACTT TGGTCGTGTA ATGGCTGGTT 1200 

CACTAGGCAC AGCTTTAATG GTTACATTAA TGAGTTTTGG TGCAAAAATA TTTTTATCTA 1260 

CATCGCCATC GCATTTAACT GCAACTGAAA TTAAACAGCA A7CCATTGCT ATAGGGGTGG 1320 

ATATCTCATT TGCTTTTGTA GCTGTGCTTG TTATGGCAGC TTATGTGATA GCACTTTTTA 1380 

TAAGAGAACC TAAAGAAATA GAAAGTAATA GAAGGAAATT TTAAAATAAT TATAGTAGTT 1440

GGTCTATTTA AAATAATAGG CTAACTGCTT TTTTTATTTA ATAAAAAGTT TTATACTTTT 1500 

AGTGATAGAC TAAGCAAAAA TTGTTATTTG CTATGATGTA OATQTCTTAA AATGATTAAG 1560 

GGGGATTTGC TTTGTTAACG GTAGATCAAG TGAAAGAATT GGTAGGAGAA ATTAAAGATC 1620 

CTATTATAGA T6TGCCTTTA AAAGAAACAG AAGGTATTGT TGAnGTTTCT ATTAAGGGAG 1680 

AAnAAGAACA TGTGAGTGTT 1700

(2) INFORMATION FOR SEQ ID NO: 243: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10146 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 243: 

TGCATCAACT TTCAAACAAT AAATCATCAC AATAACCACA CCTAATTCAA CACTTTTCAA 60 

ACATAAGTAT TGACACATTG AGCAAAATGA TTTTTAATTG TAACTAATAC AGTTACAATT 120 

ATGAGGTGAG AAACATTGAA TTTAGAATTT AACATTGCC6 TGCATGTATT AGCTTTTTTA 180 

ACTAAGCATC aTTCAGAAAA ATTCAATAGT A6TTCATTAG CAGAaTTAaC TTGTTTAAAT 240 

CCTGTTCAAT TACGACGCGT GACGACTCAA CTTGTCGATT TAAAAATGAT TGACACAATA 300 

CGAGGTAAAG ATGGCGGTTA TTTAGCAAAT GATCAAAGTG CTGATGTCTC TCTAGCAACA 360 

TTATaTAAAC ATTTTGTCTT AGAGAAAGAA CACCACACAC GTCTATTTAC TGGCGACGAA 420 

GGCAGTCACT GTCAAATTGC TCGTAATATT GCAACTACCA TCTCACATTA TCAGCAAGAC 480

GAACAGAATA TCATTATTAA TTTTTATAAT GGAAAAACAA TCAAAGATGT CATTGAAGAC 540 

ATTCAAAAGG AGGATTTATG TCATGAAAAC ATATGATTTA ATTGTAATAG GATTTGGGAA 600 

AGCTGGTAAA ACTTTAGCGA AATACGCTGC ATCAACAGGT CAACACGTCG CAGTTATCGA 660

ACAATCTCCG AAAATGTATG GAGGCACTTG TATAAACATA GGATGTATAC CTTCGAAGAC 720 

TGTTGTCAAT GCGCTAAACA ATAAAAATTA CCACTTATTA GCAGACGATA ACAACATTGA 840

TGTACTGGAT TTTAAAGCGC AGTTTAAATC TAATACTGAA GTGAATTTAT TAGATCAACA 900

TGACGATATC GTCGATAGTA TTACTGCACC TCATATCATT ATTAATACCG GTGCTACCTC 960

TGTCATTCCT AACATTAAAG GCCTTGATCA AGCTAAACAC GTCTTCX5ATT CGACAGGTTT 1020

ATTAAACATT AGCTATCAAC CTAAGCACCT CGTTATTGTA GGTGGCGGTT ATATCGCCTT 1080

AGAATTTGCT TCAATGTTTG CGAATTTAGG TAGTAAGGTA ACAGTATTAG AACGTGGCGA 1140

ATCATTTATG CCACGCGAAG ATCAAGATGT CGTTGCATAT GGTATTACTG ACTTAGAAAA 1200

TAAAGGCATT 6CATTGCATA CAAATGTTGA AACGACTGAA TTGTCATCTG ACAATCATCA 1260

TACAACAGTC CATACCAACG TTGGTAACTT TGAGGCTGAT GCAGTACTTT TGGCTATCGG 1320

GCGCAAACCG AATAGGGATT TAGCTTTAGA AAATACTGAT ATCGAATTAG GCGACAGAGG 1380

CGAAATTAAA GTCAATGCTC ATCTTCAAAC AACTGTGCCX5 CATATTTATG CTGCAGGTGA 1440

TGTTAAAGGC GGACTTCAAT TTACGTATAT ATCTTTAGAT GATTATCGAA TTATCAAATC 1500

AGCGTTATAT GGTAATCAGT CACGTACGAC TGACAATAGA GGCAGCGTGC CTTATACAGT 1560

ATTTATAGAT CCACCATTAT CACGTGTTGG ATTAACTAGT AAAGAAGCTG CCGCTCAACA 1620

TTATGATTAC ACTGAACATC AACTTTTAGT AAGTGCTATA CCTCGTCATA AAATTAACAA 1680

TGATCCAAGA GGTTTATTTA AAGTAOTCAT TAATAATGAA AATAATATGA TTTTAGGTGC 1740

TACATTATAT GGTAAGCAAT CTGAAGAATT AATTAATATA ATTAAACTTG CGATTGATCA 1800

AAACATTCCA TATACCGTAT TACGAGATAA TATTTATACG CATCCTACX3A TGGCCGAATC 1860

ATTTAATGAT TTATTTAATT TCTAGACAAA ACATAAAAAC CTGGTGGCAC GCATTGAATG 1920

ATGCTGCCAT CAGGCTTTAT TGTTGTGCTT TTCGCTTTTC TAATTTTTCT TTAAGCTTTC 1980

TATCTTGTTC TTCTTTACGA CGTTTACGTT CTTCATGTCG TTTTCTTAAA CGCTCTTCTT 2040

CTTCAGGATC ACGTGGTTTC TTTAATTGTT GAGAAACTTT TTCGATTAAT TCTTCTTCAG 2100

TAAGCGCagc CAGTGGGCGG TTATTAACAA AAGTGAATGT TTTTCGGCGT CCAGGTCCAC 2160

AATAAGATTG ACAACCTATC ACGATTTCAG CATCGGGATC TAATTTTTCC AACTTCTTTT 2220

GTAACGTTCT TATATTGACT GCCTGACATT CATCACAAAT AAGGAATGTA TTTTTCATAT 2280

TGCTACCCAC CTTTCTTTAT CATATCTATA TCGTCGATTT CATTAATTTT TTCGTTAACT 2340

CTATCTATTT TACTCTTTTT AATATTTTTT TCAAGATACG TAACACGGCT GaCAATAAAA 2400

AATGGAGCAT TTATCTTCTA ATTAAATTAG ATGaTTGCTC CCCTATCAAA TCATTTATTG 2460

CCCATGATAA ATATTAAATT TTAATGGTTT AATACCATGT TTTGTCCATT CATCATAAAT 2520

TGATGTTTTA GATGCGCOGT GATACTTTTC AGCAATATCA CACAAATATT TTAGCTTTTC 2640

AGTTTCTATA TCAACTGTAG CTTCTTTATC CATACGTTGA ATAATTGTAC GATTCTGACG 2700

CACCATCTTT TGCACACCTT TAATGTTATT TGTTTTAAAA GCATGAATAA GTTTTTCAAC 2760

ACAACGATGT GAATCTTCTA AGAAGTCACC GTAAAATGAA GGATCTGATT TCAAACGTTT 2820

CACTTCGCTA ACAAAGTGTQ GTGATGACGC CGGTGAGCCA GTCCAACCGA TAAGTACTTC 2880

CATATTTTCA GGTGCTTGTA ATGGTTCGAT GTGCAATCCA GGCCAGTTTT TGATTAAAAC 2940

TTCTTCAACC GTAGTATCTT CAATTTGATG CTTAACCCAT TCATGATCAA AAGTACTATA 3000

CGCTAGCCAT CCACTATATA CACTCACAGC AATATCTCCG CATGAACTTA AACTTTGTAA 3060

CTTCATATTT GCAATCACTG CTAGTTTATA AATGTATAAA TTAGATAACT TCATATCATA 3120

AAATTCATTT AATACTTTTA TAACTGACAC AAGTACTGCT GCACTTGAAC CTAATCCATA 3180

TTTATGACCA TTTGAATCAT CTAAATTACT ATCAATAGTC AGATGAAAAT GCTTCATCGC 3240

TATATCGCAA CTTTTCGCGT ATTGTTCAAA TATTTCAATA GCTGTGaCCA CATAATTTAA 3300

TTGTTTTGCT GcATGTGGAT CTGAAATGAC AATACTATCT TCATCTCTAC TAAATGTAAC 3360

TGG6TTATGA TGTAATGCTT TTGAATGAAT GGTACCTTTA TATTGGTCT6 CTTCTTCAAT 3420

AGTAGCAGTT ACAAAACGAT CTAACGCAAT AAGTACAGAT TTATATCCTG OTTCTGTTAC 3480

AGCATATTCT CCAGCAATAT AAAGTTTTCC GGGTGCTTTG ACCTGAATCA TTTTATCTCT 3540

TCCTTACTCA ATTATTTCAA TTCCTGTGGC AATAATGTCA CTATCAATAA TTTGGTTATT 3600

ATCAAACTGT GTTAATAATT TATCTATAAT CTGTTGCTTG rrrrivrm ' ctacaagtat 3660

TTTCACATTA GGTCCCGCAT CCATTGTAAA ATAACACGGA TACCCCGCTT CTCGGCATTC 3720

GTGAACAAGC GCCATGACAT CATAACTTTC TTGCACAAGA TATGTGAACG GCGGTGTT6A 3780

TCCTOGATTC GTGGCATGCA TACGCAAACC ATTTTCTTCA ATTACTTCAC CAAGGCGTTT 3840

AAAATCTTTG TCTTGAATCG CTGCTTTTGC TTCAGCTAAA TCTTCATCAA TATGATCTAA 3900

CCAATATTGA TAAAACCTTG ATGTGTTTCG TGTCAATGAC ATACCATATC GACTAGGTAC 3960

CTTTTTAGAA TGTTGATTAA TCACAACAAA TATCATGGCA AGGTCATCTT CAAAATGATT 4020

CGATTCAAGT GGAACGGCAT ATGACGTCTC ATCACTATAC CCTTTTTCCC ATTCTGCAAA 4080

TCCACCATAA ATACTACGCG ACGCAGAACC CGAACCAATT CGCGCCAATC TCGATAAATC 4140

CTTATCTGAC AGCTGCATGT CTAGCGCTTG ATTACAAGCT GCTGCTAAAG CTGCATATGC 4200

GCTTGCCGAT GAAGCCAACC CTGCTGCTGT TGGTACAAAA TTGTCGCTTT CAATTTCTGC 4260

ATACCAATCG ATGCCAGCTC TATTTCTGAC AATATCCATA TATTTTGAAA TTTTCTCTAA 4320
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AAAAGTGACT TTCGTTTCAG TGTAAAATTT TTCTAATGTA ACAGATATGC TATTATTCAT 4440 

TGGAATGATT AGTGCTTCAT CTTTTTTACC CCAATATTTT ATAAGTGCAA TATTCGTATG 4500 

TGCACGTGCT TTGCCACTTT TAATCAACGC ATTAACCTCC TAAATTCTCA ATCCAAGTAT 4560

GTGCTGCACC AGCTTTTTCT ACAGCTTTTA CAATATTTTT CGCTGTTGGT AAATCnTGG 4620 

CAAGCAATAA CATACTTCCA CCACGACCAG CGCCAGTAAG TTTTCCAGCA ATCGCACCAT 4680

TTTCTTTACC AATTTTCATT AATTGTTCTA TTTTATCATG ACTAACTGTC AACGCCTTTA 4740 

AATCCGCATG ACATTCATTA AAAATATCCG CTAAGGCTTC AAAGTTATGA TGTTCAATCA 4800 

CATCACTCGC ACGTAAAACT AACTTACCGA TATGTTTTAC ATGTGACATG TACTGAGGGT 4860

CCTCACAAAG TTTATGAACA TCTTCTACTG CTTGTCTTGT TGAACCTTTC ACACCAGTAT 4920 

CTATAACAAC CATATAGCC6 TCTAAACTTA ACGTTTTCAA CGTTTCAGCA TGACCTTTTT 4980 

GGAACCAAAC TGGTTTGCCT GATACAATCG TTTGCGTATC AATACCACTT GGTTTACCAT 5040

GTGCAATTTG CTCTGCCCAA TTAGCCTTTT CAATGAGTTC TTCTTTCGTT AATGATTTCC 5100 

CTAAAAAATC ATAACTTGCA CGAACAAAAG CAACCGCGAC AGCTGCACTC GATCCTAATC 5160 

CACGTGATGG TGGTAAATTC GTTTGGATCG TTACTGCTAG CGGCTCTGTA ATATTATTTA 5220

ATTCTACAAA ACGGTTCACC AAAGACTTAA GATGGTCAGG CGCATCATAT AACATACX^IT 5280 

CGTAAACATC GCTTTTAATA GACGAATAGT TCCCGCTCTC TAAGGCTTCT ATTAAAACTT 5340 

TGATTTTACC TGCGTTAAAC GGTACTGCAA TAGCAOGCTC TCCAAATGTA ACAGCATGTT 5400 

CTCCTATTAA AATAATCTTA CCTGTCGATT CCCCATATCC TTTTCTTGTC ATGTCAATAT 5460 

CACCTTTTAT ATTTATCCTA TACTTGATTC ATTATTTTTA TTTATTAGTA AAAGACATCA 5520 

TATTCTAAGT TGCAtngCAT TCGCGTTAAA TTTCATTGCA GTCTTTATCT CACATTATTC 5580 

ATATTATGTA TAATCTTTAT TTTGAATTTA TATTTGACTT AACTTGATTA GTATAAAACT 5640 

AACTTTCGTT TACTTCAAAG TTTAAATCTT ATCGA6TGAT ATTTCAGATT CTTTATCTTT 5700 

TTATAAAATA GCCCTACAAT TTATAATTTT CCACCCTAAC TATAATACTA CAAATAATAA 5760 

TTGGAATATA TAGATTTACT ACTAAAGTAT TAGAACATTT CAATAGAAGG TCGTTTCTTT 5820 

CATAGTCATA CGCATTATAT ATACCCTATT CTCAATCTAT TTAATACGTA AAACATGAAA 5880 

TTTTCTTATT AAATTTATTA TTTCCATCAT ATCATTACTT TTAATTTAAT GATGTTCAAT 5940 

TTAAATATTA GGTCAATAAC ATATTTATGC TTTTTATGGA TACTTTCAAA AATAACAGCC 6000 

CCAAACGATA ACTTGAAAGG GGCTGTTAAA TATTTAACTA TTGCATTTGA TCTATCATTT 6060

TCTTGTTTCT TTCAATCATT TTATCAAAAT ACGTATCGTA TCTTTGCCAT TCTTCTTGAG 6120 



TTACATCTTG AACAGTAATC GTTTTGTTAA 6CAATGTCTC TAATGAjGGCC ATACAAGATG 6240 

GTTCAATTTC AGGATATTTA AATTTAGTCA CTTCACCTTT TAAAGCATGT TCATAAAATG 6300 

TTTGCATCAT CAATGCACGT TCTGAACCAG AGCCTTCAAC ACAAAGATAA ATTTGTACAG 6360

CAATACCGCC TCTAACTCTT CGTTGCGATA TGCCTGCAAA TTTCTTACCA TCX5ATACTTA 6420 

AGTCAAATTT TCCTGGGCAA TAAGAATGTT CAATTTCCAT CGTATCAATA TCAACATTCT 6480 

CATTTTCGAA CATTTTGCTA ATTAAGAGGT ACATCACAGT AAACGCTTCA TCAATCGTTG 6540 

TTTCTGTTTG TCCTTTGAAC ATCAGCX3ATA TATTTAATAC ACCTTGATCT AGAACGACAC 6600 

CTAAGCCACC AGAATTTCTA ACAATGGCAT TATAACCAAT CTCATTCGTT AAATAATCAA 6660 

TGCCA7CTTT TAAAAACGGC AATCTTGAAT CATGAATACC AAGAATAACA GTATGTTGAT 6720 

GAATCCAAGT ACGCACAACA TTATCTGATA TATCTTTGCC CACACTTTCG CAAAATGTAT 6780 

CATCGAATGC GAAAGATTGC ATAGGTTCTA ATCCAGAAGA ATGATCGATA TATCGCCAGT 6840

TGACGCCATT AAAATATTTA CTCGCTAAAT CCATCGTTAT TGTAAGGCTT GCGCTGCTGT 6900 

AATAATTGAA AGATTGTATA CATCTTCAAT TGAGCAGCCA CGTGATAAGT CATTTACTGG 6960 

AGAATTTAAA CCTTGTAATA CTGGACCAAC TGCATCATAT CCACCTAAAC GTTGTGCAAT 7020

TTTGTAACCA ATATTACCAG CTTCTAAACT TGGGAATACA AAGACATTTG CATCACCTTG 7080 

TAATTTAGCA CCTGGCGCTT TTTTCTCAGC AACACCTGGT ACAATCGCAG CATCAAATTG 7140 

GAATTCGCCA TCAATGATTG CTTCTAATTT TtCTTCTTCA GCTTTTtGTT GTGCTAATTT 7200

GACAGCTTCT TGAACTTTTG TCACGTCGTC TGATTTAGCA QACCCTTTTG TTGAAAAGCT 7260 

TAACATTGCA ACTTTTGGAT CCATGCCAAA GCTTAATGCT GATTTTGCAC TTTCTACTGC 7320 

AATTTCTGCA AGTCCTTGTG AATCAAGTTC TGGAITGATT GCACAATCAC CAAAGATGTA 7380

TTGrrCATCA CCTTTAATCA TAAAGAAGAT ACCTGATGTT CTTGATACAC CTGGTTTCGT 7440 

TTTGATGATT TGTAAAGCTG GACGCACAGT GTCGCCTGTT GAATGTGCTG CACCACTAAC 7500 

TAAACCATCT GCTTTACCAG CATAAACAAG CATTGTACCG AAGTAGTTCA CATTGTTTAA 7560 

TAATTCTTGT GCTTGTTCTT CAGTCGCTTT ACCTTTACGT CGTTCAACAA ATGATTGAAC 7620 

TAATTCAGCT TTCAATTCAC TTGTCGCAGG ATTAATTAAT TCAATATTAG AAATATCAAG 7680 

ATCAAGTTTT TGCGCTAAAG ATTGAACCTT AGTCTCATCA CCTAACACGA TTGGTGTAAC 7740 

ATAATCTGTT GCTTGTAATT OTGTTGCAGC TGTTAGAACA CGTTCGTCCT CTCCTTCAGG 7800 

TAATACGATT TTAACGTTTT TACCAGAAAG TTTGTCTTTT AATACATTTA ATAAATCAGC 7860 

CATAATGTCC TCCTGTAATA TAAATCTTAT TAATCATTCA CGGTATAATT ATACGCCATT 7920 

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TATGATAAAA TTTATAAAGA ACTGATGATT TTTGAAAAGG AGCGATAAAC ATGAGTCAAG 8040 

CAGCCGAAAC ATTAGATGGT TGGTATAGTC TACATTTATT TTATGCAGTT GATTGGGCAT 8100

CATTACGTAT AGTTCCAAAG GACGAACGCG ATGCACTTGT CACTGAATTT CAATCATTTT 8160

TAGAAAATAC AGCAACTGTA AGATCATCAA AATCTGGTGA TCAAGCTATT TATAATATAA 8220 

CTGGTCAAAA AGCAGATTTG TTATTATGGT TCTTACGTCC TGAAATGAAG TCTTTAAATC 8280

ATATTGAAAA TGAATTTAAC AAATTGCGCA TTGCTGACTT CCTAATCCCT ACATATTCAT 8340 

ATGTATCAGT CATTGAATTG AGCAATTATT TAGCTGGTAA ATCTGATGAA GATCCTTATG 8400 

AGAACCCTCA TATCAAAGCA AGATTATACC CAGAATTACC ACATTCTGAT TATATTTGTT 8460 

TCTATCCAAT GAACAAACGT CGTAATGAAA CTTATAACTG GTACATGTTA ACTATGGAAG 8520 

AACGCCAAAA ATTAATGTAT GACCATGGTA TGATTGGTAG AAAATATGCT GGCAAAATCA 8580 

AACAATTTAT TACTGGTTCT GTAGGGTTTG ATGATTTCGA ATGGGGCGTA ACATTGTTCT 8640 

CAGATGACGT ATTACAATTC AAAAAAATTG TATACGAAAT GCGCTTTGAT GAAACAACAG 8700 

CA0GATACX3G TGAATTCGGT AGTTTCTTTG TAGGACATAT TATTAACACA AACGAATTCG 8760 

ATCAATTCTT TGCGATTTCT TAATACATTG GTACGTTTAT AAATTAATAA AAAAATTCCA 8820

AGCTTATCGG TTTAAGCTTG GAATTTTTCG TTTATCTTCA GTATATTCCC GTATACATAA 8880 

GACGTGATTT GGTAAATAGT TGAAATCTGT ATGTTTAAAC TTATATATAT OTGCTAATGT 8940 

ATTATCAATA ACAAAGtACA CTTTGCTCAT AGCAAgTsaC CCGAgTAGTC TTCCTTGGGA 9000

GAACTTTAAC TACTATCACT ACATATAAAC GTTAACCTCA ATAGAAATTA TACAGTCGCT 9060 

ACrCTATACA ATTTTTGTAA TGGTTAACTA ATATTATTTT AACCTATTTG AAATATTTGA 9120 

AACATATTTT T6TCGAATTT TTTTCAATAA TTTTTCCTTT TTATACTTCA AGAGAATTTT 9180 

AACTACTAAA AATTCCGATG ATTATTATTA CAATAGTATC AAATATTAGT TTTTTAAAAT 9240 

CAATAACAAC TTATCAAAAA GCTCATGTGG TTATTTTATA GTGTATAAAC TATAATGAGT 9300 

ATTAAATTCT TATAAACAAT GGTGATGAAA TGGACATAAA TTCAGAAGAA TACAAACAAG 9360 

AGGTACTTAT CAAAGACGTT GTCATGCTTG CTGCTCGCAT ACTATTAGAA TCTGGTGCAG 9420 

AAGGTACGCG TGTAGAAGAT ACCATGACAC GTATTGCAAA AAAACTTGGT TACAGTGAAA 9480 

GTAACAGCTT TGTTACAAAC ACTGTCATCC AGTTTACGTT ACATTCGGAA TCGTTTCCTA 9540 

GAATATTTAG AATTACCTCT CGAGATACAA ACTTAATAAA AATTTCTCAA GCTAATAAAA 9600 

TTTCGCGTCA AATTACAAAC AATGAAATTT CTTTAGCCGA AGCAAAAACG CAACTTGAAA 9660

AAATATATGT TGCTAAGCGT GACAGCAGTC TTCCCTTTAA AGGTTTTGCT GCAGCAATGA 9720
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TAGCAGGTAG TCTAGGATAC CTAGTCACTG AGATTTTAGA TCGTAAGTwA CACGCACAGT 9 840 

TTATCCCAGA ATTCaTTGGT TCaTTAGTTA tTGGGATTAT CGCCGTTATT GGACATACAC 9900 

TTATTCCAAC AGGTGACTTG GCAACTATTA TCATTGCGGC AGTCATGCCT ATTGTTCCTG 9960

GTGTATTAAT AACAAACGCA ATACAAGATT TATTTGGTGG ACACATGTTG ATGTTCACAA 10020 

CGAAATCATT AGAAQCATTG GTTTnGCGTT TGGCATCGGT GCTGGCGTTG GTAGCGTATT 10080 

AATTTTAGTA TAGGAGTATC AGACTATGTT TTGGATCTTA AACTTTATCT TTAGCTTTTT 10140 

AGCTTC 10146 

(2) INFORMATION FOR SEQ ID NO: 244: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2022 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244:




ATTTAATTGG TTGGTGGCGT ATTCTCnTTT CCAAGATTTG CCACGATTGG ATGTTCAGTC 60

GACAAATATG TAACCAGTCC CGATAAATTA CGTTCCTCAA TCATTGATAO TATGTCTGGT 120

GTTTGTAACG ATTTATCGAG TAAGGCATCA AGCAATTGAT AATGTCCCAA TACAACAAAA 180

TGCACGTTGT CTCTCAACTG CTGTTGAATA AACTGAATAA AGAGCTTTAA GCTCTGTTGC 240

ACATTCGCTA ATGATGGTGC ATAGTTTTCC AAACCAACTT GTACAGCCGC TTCATTATTC 300

CGAATGATTA AACCTGTGTA TGCCACTTTT GTTGCTGCAG TTGGATACAT TGAGTAATAA 360

CGCAATAATT GATCTGTAAA ATCATTTCGA AGTGCATAAA TTTGATGCTC ATGTTGCCAA 420

AAATftCGCT CACCCATCTG CTgCAAATCC TCATGGTTCA ATTGTTTCCA GTCCAACTTT 480

TCAACCACAC TAAAATCAAC TAACTCATAA TCCGCTTTAT TAAAATATTT TAAAAATGCT 540

GTTTCCGATT CTTTTAACGC AATTAATTGT TCTGnATTAT TCACTCGACC ACCCTTTACT 600

TTCAATACTG TATTTAAAAT CACTTGGTAT TTTCGTTGTT TGCTTTACTT CTCTACCACG 660

CTAAAGTGTA ATATGATTAA TAACTTATCA TTTTTAGCAA TACATTACAA CCTTTTTCAG 720

AAAATTCGGT GTATTGATTT TAAAATTTTT TAAAATAnAA AAGGCAAGAC ATTTGTGCCT 780

ATAAAAATGC TTAACCAAGA TTTTTATATT GaAGTTGTAC TTCTTGCACA TATTGTCCTT 840

GCCTTATTAT GTAAAGTTAT TTTCTTTCTA TCTTTTTATT AAATTTAACT ATTCTTCATA 900

ATCCCGATTC CCTTTAAAGT AACGTCTATC TT6TTTACTA TATACATTTT CAGGATTAAA 960
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TTGGTAAACG TTCGTTGCTG ATATATCTGT AAAATTGTTT GGACCGACAC CTGCAATAAA 1080 

CTTAAACTCT GCTTCATCTA CCAAATAATC ATACGCTTGT GTATGTCTAT CCTGTGCGCC 1140 

ATGTGGAAAT ACAAACATAT CTGTTTTACC TACAATTGGT TCAACTTCAT CTTTCCATCT 1200 

TTTAGTATCA CGTTTAATAC CTTCTAAAGA TGTTTTTTCA AAATTAATGT GACCATATGA 1260 

ATGACTCGCA AATGACCATC CATCCCGTTT CATTGCGCGA ACAACTTCCT CAGCTGCCTT 1320 

TTTATTCTTT GTATAATCTT TACTCGTTAA TTCATTCGTG CGATAACCTA ATACGCCCTC 1380 

ATAACCGGTT AAAGCAACAA CACCTTTTTC ACCATTTAAA GAAAAATCTG GATGCTCTTT 1440 

TACAAATTTA TTTAAAATTG GCACGATATC ATTGTCATCA GAATAAGTAG CATGGCCTTT 1500 

TTTGTCTGTA GTTTCAGAAA CAACATGTTT ATTTTTATCG AGTACTAAAC GGTCAGCATA 1560

ACCATGGTGT CTCATGTAAC TATAGTAATT CATATCATCA ATT6AGATGA TTAGTGGCTT 1620 

TTTACCTTTC GGCAATTTTA TTTTTTTGGC TTTTACATGA TGAGATGATA AGTCGTATAC 1680

ATCATGTGGA TTAACGATGA TGTAATTATT TTTATATAAT TCGTTCAATG ATTTTTTAAA 1740 

TTCACTTACA GTAATCATCC AATCATTGTT GCCCTTAGCT TGGTGTGTAT CTCCTGTAAA 1800 

CGCAACTTTT GGGTCTGTAA TTAATGGGTG ATAAAACACA TGATAAACTT GGCCGTGATA 1860

TGTTTCCCAA TGTTCATCCA TTTTcGATTT aTGCTTTGCA TACTCATTTG GATTAACAGA 1920 

TTTATTlcTGA GCTTTCTCAT TTTGCTTGGA ACAGCTATAT mACAATGCAA CTGATAATAA 1980 


CAGAAAAAAT AGCAATAAAT ATTTTTTATG CATTAAACAT TC 2022 

(2) INFORMATION FOR SEQ ID NO: 245: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1340 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 245: 



ATAGAGTGAT AAAGGATGGT TGTCATATGA TAAATGCAGT AGTAATAGCA GTAATTTTAA 60 

TGATTGTGCT ATGTTTATGT CGATTAAACG TnTTATTAAG CTTATTTATC AGTGCGCTAG 120

TTGGTGGCTT AATTTCAGGC ATGAGCATTG AAAAAGTTAT AAATGTATTT GGGAAAAATA 180 

TAGTCGATGG TGCTGAGGTA GCATTAAGCT ATGCTTTATT AGGTGGATTT GCAGCATTAA 240 

TTTCATACAG TGGTATCACA GACTATTTAG TAGGAAAAAT TATAAATGCA ATTCACGCTG 300

AAAATAGTCG ATGGTCAAGA GTTAAAGTCA AAGTGACAAT AATCATTGCA TTATTAGCTA 360 
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5 






CACCATTGTT 


AAGTCTGTTT 


AATGACTTAA 


AAATAGATAG 


ACGTTTAATC 


GGTTTGATTA 


480 


TCGGTTTTGG 


TTTATGTTTC 


CCGTATGTGT 


TATTACCATA 


TGQATTCGGT 


CAAATTTTCC 


540 


AGCAAATTAT 


TCAAAGTGGC 


TTTGCAAAGG 


CAAATCACCC 


AATTGAGTTT 


AATATGATTT 


600 


GGAAAGCAAT 


GCTTATTCCT 


TCAATGGGGT 


ATATTGTTGG 


CTTACTTATC 


GGTTTATATG 


660


TATATCGTAA 


ACCACGTGAA 


TATGAAACAC 


GTAAAATTTC 


AGATAGTGAC 


AATGTTACAG 


720 


AGTTAAAACC 


ATATATCTTA 


ATAGTAACAA 


TTGTAGCAAT 


ACTAGCTACA 


TTTTTAGTAC 


780 


AAACATTTAC 


AGATTCAATG 


ATTTTTGGTG 


CACTGGCAGG 


GGTACTCGTA TTCTTTATTT 


840 


CACGTGCATA 


TAATTGGTAT 


GAATTAGATG 


CTAAGTTTGT 


TGAAGGTATT 


AAAATTATGG 


900 


CTTATATTGG 


TGTAGTTATT 


TTAACAGCAA 


ATGGATTTGC 


TGGTGTAATG 


AATGCTACTG 


960 


GTGATATAGA 


TGAATTAGTT 


AAAACTTTAA 


CAAGTATTAC 


TGGTGATAAT 


AAATTATTTA 


1020 


GCATTATCAT 


GATGTATGTG 


ATAGGTTTAA 


TTGTCACTTT 


AGGTATTGGA 


TCATCATTTG 


1080 


CAACAATTCC 


TATTATCGCA 


TCATTATTCA 


TTCCTTTTGG 


AGCGTCAATT 


GGACTAGATA 


1140 


CAATGGCATT 


AATCGCATTG 


ATTGGAACAG 


CGAGTGCATT 


AGGTGACTCA 


GGTTCXjCCTG 


1200 


CAAGTGATTC 


AACATTAGGA 


CCAACTGCGG 


GATTAAATGT 


TGATGGCXrAm 


CATGATCATA 


1260 


TACGTGATAC 


ATGTGTACCA 


AACTTCTTGT 


TTTATAATAT 


TCCTTTAAAT 


GATTTTCGGT 


1320 


ACTATTGCTG 


CTATGGTACT 










1340 



(2) INFORMATION FOR SEQ ID NO: 246:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3365 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 246: 




CAAAATCTAA GAGAATAAnA 


TTTGTTAATT 


TnAAATAGCA 


AGCAATTCAA 


AGTTATATGT 


60 


GTAATAGATA AAATAGATAT 


CCCTATAOTQ 


ATGCGTTACT 


AGCTAAACAT AATAACACAT 


120 


TAGAAGATAA TGAAGTTAAG 


GAGTTACTGG 


ATTGTTTOGA 


CTATGTAATT 


AAGTATAAAA 


180


ATATCCAACG ACAAAACGTA 


ATTATAAAAT 


GGTAAAAGCT 


ATGGTACAGT 


TTCAAATTGC 


240 


TAATGACATG CGTATCGGTG 


AGCTACTTGC 


AATAAAGAGA 


GTAAATATAA 


ACTATGAAGA 


300 


TAAAACGCTA GATATCGACG 


GTAAAGTTAA 


TTGGATAACT 


GAAAAAAGAC 


GGGAGCATTC 


360 


GGAGTAAAGG AGACAACTGA AAGAAGTAAT 


AGCTATAAGG 


CCACAGGGCT 


CACTACCCAA 


420 
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TATTCACAAA TACGTCTGGT AGCCCTATCG ACTCGAACAA AATTAGCCAC ATTATTAAAG 540 

GGGGGCGCTG ATATTAGTTC TATTAAGAAA CCTATAACGA CGCATACATT ACATCATTCG 600 

CATATATCTA CACTTGCTCA ATTAGGAATT AACTTAAAAG CAATGCAAGA GCATGTAGGT 660

CATTCAGATT ATAAAArAAA TCTAOAGATA TACACACATG TTACTAATCA GATGGCGAAA 720 

GATATGATGA ATAAATTTGA ACGATTGGGG AGTTAAAATT GGAAAAAGAT GaTACACTAG 780 


CAGAAATTAA GCCTATGCTC AATTTTGATG AGCAAATAGC AAAATTAAAA CAGATGAATA 840 

TATTTTTTAA TATTATTGAC ACCGAAAAAG CAAATGAAAT TCTTAGAAAA AATAATTACT 900 

TCTTCAAACT vGcTTATTTC CGsJAAAATT TCGrAAAAAA GaATGGCGGC TATTTCATAG 960 


AATTTGCTTA TTTATCAGAT TTAGCAACTA TAGATATGaA ATTAAGATAC ACAATGTTGC 1020 

ATTTAACTTT AGATATTGAA CATAGTTTAA AGTATCTAGT CTTAAAACTA ATAACAGAAA 1080 

ATAACCAAGA AGATGGTTAT AAAATAATAG ATGAGTTCTT ATGTATTGaT AAATCATATA 1140

GCAATTCAAA TTTTGACACA AATTCAAGAA CACCAGAAGA AGTTATGGAA ACCAAAATCA 1200 

AAAATAAAAA CGAAATATTC AAGCATATGA ATAAACGAGG ACAACTACCC GAGAAGTTGa 1260 

25 ATAAATACTA TCmAAATCCA CCCXSCnnGGk TTTGCaTTGr ATTCATGCAA CTAGGTCAAT 1320 

TCGTTTCGTT TCTCAACTTC TATTACAAGA AGTACAATGA CGAAGAATTG AGAGTTGCTA 1380 

ATATTTTAAT GCCTTTAGTT AAAAATATAA GAAaCAAATC AGCTCATAAC CAACCCATCA 1440 

TAGCAAATCT AAATTATGAC AGTAGATTAC CTCAATATTT ATTTGAAAAA GGGAATAATA 1500 

TAGGCATATC TAGAAACATG TTCGGAATAA AAAATTTCAT AGATACTkTC ksTACGCTAG 1560 

AATTACATAA TCAAGTTTGT AGTAATGCAA TTATCCAAGC AAGATATCAC GATTTGGACC 1620 

AACTTCAAAA GCGATATAAA AGrAACGrAA GCTATTATAA TAATGCATTA GCTATCAAAA 1680

GAnTTTTAT AGCTTTAGAT AAAATTATTG ACTTCAACAG ACCAAAAGTA TAAACTATCT 1740 

AGTGAGGAAA GAGACTTATA GGTCTCGCGA GTTATTTTAA TTCX3TATGCA AGAAAAAGAA 1800 


GAGCTATGCA TTTTATTTAA AATGCGTAGT TCTTtTTTTA TGCATCTAAA TTCATATTAT 1860

TTTTGCAATA TAAACATATC TTTGTGCAAA TTCCGAACAC AAAACATTCA CATCATCCTT 1920 

TTTtGCCCTT TTTCTATACC CCAAAACACA AAAAGCCCCG TAAGCCTAT6 CcTACGGGgT 1980 


TTGACAATAA ATTATATATT ATTGTTCTTC TTTAACATAT GGTAATAATG CCATATGAOG 2040 

AGAACGTTTG ATAGCTGTAG TCAaCATACG TTGATATTTA GCTGAAGTAC CAGTTACAOG 2100 

SO ACXSTGGTAAA ATTTTACCGC GTTCTGAGAT AAAACGTTTT AATAATTCAG TGTCTTTGTA 2160 

GTCGATATGT GTAATACCAT TTGCTGTGAA ATAGCATACT TTTTTACGAC GACGTCCGCC 2220 

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CGTTAATTTT TATTAGAATG GTAAGTCATC ATCACTTATA TCAATCGGTC CGTTTGCATT 2340 

TGCAAATGGA TTATCAGATT GTTTCGTGTT TGATGAATTA TTGTACGAAT TGTTTTGTCC 2400 

TGATTGTTGA CCACOGAATC CTTGACCGTA ATCTTGGAAT TCATTTTGTT GACGTTGGCC 2460 

ACCATTTTGT TGCGCATTTT TAGGTTCAAG GAATTGAAOG CTATCACACA CAACTTCAGT 2520 

AACAAACACA CGACGACCTT CTTGATTTTC ATAATTACGG GATTGTAAGC GACCATCTAC 2580 

ACCAGCTAAA CTACCTTTAG ATAAATAGTT ATTTACATTA TCTGCTTGTC TTCTAAAAAC 2640 

AACACAGTTA ATAAAATctG ctTCGCGCTC CCCTTGAGCA TTCGTGAACG TACGATTTAC 2700 

TGCAAGAGTG AATGtCGCTA CACTCACACC T6AGGGAGTG GTTCTGTATT CCGGATCTTT 2760 

CGTTAAACGA CCTACTAATA CAACTCTATT TAGCATTTAA ACGCCCCCTC TAATTATTAC 2820 

TTGTCTTCGT CTTCACGAAT AACCATGTAA CX3AATGATAT CGTCACTGAT TTTAGCTAGA 2880 

CGTTGGAATT CGTCAGTAGC TTTGTTGTTA TCAGATTTAA CACGTACGAT GTTGTAGAAG 2940 

CCATCTTTGA AATCATTGAT TTCATAAGCT AGGCGACGTT TACCCCAGTC TTTTGCTTCT 3000

AAAACTTCTG CACCTTCAGT AGCTAAGATA CCGTTGAAAC GTTCAACTAA CGCTTTTTTA 3060 

25 GCATCXTCCT CAATGTTTGG GCGTACGATG TACATAACTT CATATGTTCT CATTTTATAT 3120 

TTGCACCTCC TTGTGGTCTA TACGGCTTAT CAATCTTAAA ACAGATAAGC AAGGaATAAT 3180 

TTTCATTACT CACAATAAA6 AATTaTATCA TGCGCCATTA CTTTTTACAA TaATAATTcA 3240 

^ AACTACTCTT CATATCATTT TTGATATtAA TTCATTTGaA ACTTTCnATG ATATTTTnAA 3300 

AAATACACTT CACAAAAGCG AACATATGTn CTATAAnAGT TGTGAGGTGG TAAGGAATGA 3360 

ATTTA 3365 
(2) INFORMATION FOR SEQ ID NO: 247: 






(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1032 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 247: 


GTTAAAAGTA ATTGGTGGTA TTGATGATGA TTTTACAGCC AATGTTATGC ATCCAAATCA 60 

ATATCGAATT CGATATTCGT CTCAGAAACA GGACCTTAAT GAAGATATGA CAGTTTTTGA 120 

TGCAGTATTA AGTTCTGATA CAACAACTTT ACGCATCATC AAGCAATATG AGCAGGCAGT 180

ACAAGCTTAT GCGGATGACC AAAGTGATAA ATTGTTCAAG CGAATGATGG ATGCGCAAGA 240 



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ACTAG6TATA CATGATACTA CTAAATACAT TAAAGAATTA TCXIGGCGGAC AACAAAAACG 
TGTTGTACTT GCTAAAACAT TAATAQAACA ACCAGATTTA TTGTTATTAG ATGAACCTAC 
GAACCATTTA GACTTCGAAT CAATCAGCTG GTTGATCAAT TATGTGAAGC AATATCCTCA 
TACTGTTTTA TTCGTAACCC ATGATCGATA TTTTTTAAAT GAAGTTTCCA CTAGAATTAT 
TGAACTAAAC AGAGGTAAGT TAGCGTCATA TCCTGGTAAC TATGAATCTT ATATTGAAAT 
GCGCGCTGAA AGAGAAGTAA CACTTCAAAA GCAACAACAA AAGCAACGAG CITTATATAA 
6GAAGAACTT GCTTGGATGA GGGCTGGgAG CTAaGGCTCG TACTACAAAG CAACAAGCTA 
GAATTAATCG ATTTAATGAC CTAGAmAATG AAGTTaACCA GCAATATAAA GACGATAAAG 
GTGAATTGAA TCTTGCTTAT TCaAGATTAG GTAAGCAAGT GTTC6AATTA GAAGACTTAT 
CAAAGGCTAT TAATGATAAA GTATTATTTG AACATCTGAC GGAAATTATT CAAAAmGGTG 
AGCGTATTGG TGTTGTTGGG CCAAATGGAG CTGGTAAAAC AACACTCTTA AATATTTTGA 
GT6GAGAAGA CCAACAATTC GAAGGTAAAT TGAAGACTGG GCAGACGGTT AAAGTAGCTT 
ATTTTAAGCA AA 

(2) INFORMATION FOR SEQ ID NO: 248: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 852 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 248:
T6TGATTAAC GAAGCTTATT TTCGTACACC TTCAACAACT GATTACAACG GCGTTTATCA 
AGGTPATTAT ATTGATTTTG AAGCAAAGGA AACTAAAAAC AAGACGTCCT TTCCTTTAAA 
TAATATTCAT GACCATCAAG TCGAACATAT GAAAAATGCA TATCAACAAA AAGGTATTGT 
6TTTTTAATG ATTCGTTTTA AAAOGCTAGA TGAAGTTTAT CTTTTACCCT ATTCAAAATT 
CGAAGTATTT TGGAAGAGAT ATAAAGATAA TATTAAAAAG TCTATAACAG TTGATGAAAT 
ACGAAAAAAT GGTTACCATA TTCCTTATCA GTATCAACCA AGATTAGACT ATCTAAAAGC 
AGTTGATAAG TTGATATTAG ATGAAAGTGA GGACCGCGTA TGACGGAAAA CAAAGGATCT 
TCTCAGCCTA AGAAAAACGG TAATAATGGT GGGAAATCCA ACTCAAAAAA GAATAGAAAT 
GTGAAGAGAA CGATTATTAA GATTATTGGC TTCATGATTA TTGCATTTTT CGTTGTTCTT 
TTACTAGGTA TCTTATTGTT TGCTTATTAT GCTTGGAAAG CACCTGCTTT TACCGAAGCT 
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TTAGATAATG GCCAAAGACA TGAGCATGTA AATTTAAAAG ACGTGCCGAA ATCAATGAAA 



720 



GACGCAGTAC TTGCAACTGA AGACAATCGT TTCTACGAAC ATGGCGCACT TGATTATAAA 



780 



CGTTTATTCG GTGCAATTGG TAAGAACTTG ACTGGTGGAT TTGGlcTCtGA AGGtGCCTCA 



840 



ACATTAACAC AA 



852 






(2) INFORMATION FOR SEQ ID NO: 249: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5804 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 249:

CA C TTTTTTC ATTAAAAATC TCATATTTAT ACACTGAACG TAATCTCGAA TATTTTTCAA 60 

CCCAAGTTTT AACTITAACT TTTTCTGGAT AAAAAATAGA CTTTTTATAA TTGACATTGA 120 

GGTCAGTCAC AGGTGAAATG ATTCCTTGTT TITCCATATC AGCATAACTA AAACCTAACT 180 

TCGATATATA ATCCAACCGC GCAACTTCAA ACCAAGTTGC ATAATTCCCG TGATAAATTA 240 

CACCCATCTT ATCAGTTTCA GCATAAOGCG CTTCTATTTC TGTAATACTA TATATCATTT 300 

TAAGCCTTCT TTCAGTTTAA CTTTATATCT CATTCTAACA TAAAATACAA GAAGAGGCCG 360 

GCCAAGAACA CAAAGGIcTTT GAACCGACCT ATTATATCAT AAaGTTTATA GAAGTATTTT 420 

TGAGCACTAT CAAAGTGCCT CAAATACCGA TTAAAATTTT ACTGTGATAT CTATTTTTTA 480 

TTOCGCTAAT TTATTTCTTA AAACCATTT6 TAAAATTCCA CCGTGACGAT AGTAATCCAT 540 

ITCAACAAGT GAGTCAAAAC GAACCATAGC GTCAAATTCT ACCAAATCAC CATCTTGCTT 600 

CTTAJSCAGTA ACTTTGAC6T AGtCATGTGG TTGAACATTT TCATCAATAT TAACAGrAAT 660 

TTCTTCTGTA CCATCTAGAC CAAGAGAATC AGCTGATTCA CCTTTTTTAA ACTCTAATGG 720 

TAATACACCC ATCATAACTA AATTTGAACG ATGGATACGT TCATAACTTT GTGCAATAAC 780 

TGTTTTAACA CCTAATAAGT TTGTACCTTT TGCTGCCCAG TCACGAGATG AACCCATACC 840 

ATAATCGTTA CCAGCTAATA CAACTAAACC TGTACCATCT TCTTTATATT TCATTGCAGC 900 

ATCAAAGATA GGCATTACTT CATTTGTTGG CCAATAAGTT GTAAAACCAC CTTCAGTACC 960 

TGGCGCTAAT TGGTTTTTAA TACGTATATT AGCAAACGTA CCTCGAACCA TTACTTCGTG 1020 

ATTACCACGT CTTGAACCAT ATGAATTAAA TTCACGAATA GGCACTTGAT GATCTTGTAA 1080 

ATATTTACCA GCTGGCGTAT CTTTACCAAT TGCACCTGCT GGAGAGATGT GGTCAGTTGT 1140 
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TTCTTTAGy^T AATCCTTGGA AGAATGATGG ATTTTGAATG TATGITGAAT TAGGATCAAA 1260 

GTCATATAGA GGTTGATCAG TTACATCAAT CTCATTCCAT AATTCGTTGT TATTGTATAC 1320 

GTTATTATAT TCTTCAATAA ATAATTCAGG TGTTACAACA CTATCAACGG TATCTGAAAC 1380

TTCTTTAATT GATGGCCAAA TATCTTTCAA ATATACATCT TCACCGTCAT TACCTTTACC 1440 

AATAGGTTCA TTTTGTAAAT CAATATCAAC CGTTCCAGCT AATGCATAAG CAACAACTAA 1500 


CTGTGGTGAA GCTAGGTAAT TGGCTTTAAC AAGAGGATGG ATACGACCTT CAAAGTTACG 1560 

GTTACCAGAT AATACAGATG TCACTAATAG GTCCTCATCA GCAATCGCTT TTTCAATTTC 1620 

TGGTAATAAA GGACCTGAAT TACCGATACA AGTTGTACAT CCATAACCAA CCAAGTTGAA 1680 


GCCTAAATCA TCTAAATAAG GTTGTAAGCC AGCATCTCTT AAATATCCGG TAACAACTTT 1740 

TGATCCTGGT GCTAGAGAAG TTTTAACX3TA TTCAGGAACT TTCAAGCCTT TTTCAACTGC 1800 

20 TTTTTTAGCA ACTTJACCTG CACCTAACAT TACATAAGGG TTAGATGTAT TTGTACATGA 1860 

TGTAATTGCT GCTATTGCAA TATCACCTGT TTTCATTGTA GCTTTTGATC CATCTTTAAA 1920 

GTTAATTTCA GCTTTCTTAT CAAATTCACT TTTATCTAAA CCGTGTCCTT GGTTGCCTGC 1980 

25 TGGAGCTGTT ACAGAATTTT CAAATGATGA TTTCATATCA CTTAAGAAAA TTAAATCTTG 2040 

AGGACGTTTT GGTCCTGAAA GCGATGCTTC AACTGTTGAT AAATCCAATT CGATAACATC 2100 

TGTATAATTA GGATCTTCTT TCTCAACATC AAAGAACATA TGGTTTTGTT TCAAATATTC 2160 

30 TTTTACTAGC GCGATATGTT CGTCTGATCT ACCAGTTAAC TTCATATATT TAAGAGATTC 2220 

ATCATCAACT GGGAAGAATC CGCAAGTTGC TCCATACTCT QGTGCCATGT TTGCAATTGT 2280 

AGCACGGTCT GCTAGTGGTA AATGTTGTAC ACCTGGACCA AAGAACTCCA CAAATTTACC 2340 


AACAACACCT TTTTTACGTA GCTCTTGAGT TACTCTTAAC GCTAAATCAG TTGCTGTTGC 2400 

GCCTTGTGGT AATGAATTTA CTAGTCGTAC ACCAATAACC TCTGGAATTG GGAAATAAGA 2460 

AGGTTGTCCA AGCATTCCAG CTTCAOCTTC AATACCACCA ACACCCCATC CTAGTACGCC 2520 


AATACCATTT ATCATTGTTG TATGTGAATC AGTACCAACT AATGTATCTG GAAATGCAGT 2580 

TTTTTCACCA TCTACATCAC GAACATGTAC AACACTTGCT AAATATTCTA AGTTAACTTG 2640 

GTGAACTATT CCAGTTGCAG GAGGAACTGC ATTGTAATTA TCAAATGCTT TCGTTGCCCA 2700 


ATTTAAAAAC TGATAACGTT CATAGTTACG TTCAAATTCT AATTTCATAT TACGTTCAAG 2760 

AGCTTCTGGA TTTGCATAGC TATCCACTTG AACTGAGTGG TCAATAACTA AATCCACCGG 2820 

SO TACTTCTGGA TTAATTTTAG TAATATCTCC CCCAACGTCA TCCATTGCTT TACGTAAAGA 2880 

AGCTAAATCA ACTACGGCTG GTACACCTGT GAAATCTTGT AAAATAACAC GAGAAGGTTT 2940 

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GTCTGTAATT ACAAAATCAT CTTCTTGACG AAC3TAAAGAT TCTAACAAAA CACGAATTGA 3060 

ATAAGGTAAA TTGGAAACTT TAGTAATACC TTGCTCTTCT ACAGCTTTTA AATCATAGTA 3120 

AGTATAACTT TGGCCATTCA AGTCAAAATG TTTTTTTGAT TGCTCTTTAA AATTTGCAGC 3180 

CATTTAATGA TCCCCCTTGA TACATTTTTA TATTTATATG CCTTGATTAA ATTGTATTAT 3240 

TATATTTATT GATAAACAAC TCATCATGCT TAGAAAACGC TTAATTTAGG TTTTGACTTT 3300 

TTAATCAGAG TATATAAGCA AAACTTATCA TACAGGTAAG GTGTAATAAG TATTTTTTAT 3360 

TAATTGAGAA TAATTATCAA TTTCGCGAAT GATTCAATTC AATTTTTAAA CGTATTATTT 3420 

CATTGAGCAG AAAGAAAATT ATGGCACCAA ACTTTAATAT TTTTTTCAAT GTCATTCTTT 3480 

TGATGGGAGT GGGACAGAAA TGATATTTTC GCAAAATTTA TTTCGTCGTC CCACCCCAAC 3540 

TTGCATTGTC TGTAGAAATT GGGAATCCAA TTTCTCTTTG TTGGGGCCCA TCCCCAACTT 3600 

GCACATTATT GTAAGCTGAC TTTTCGTCAG CTTCTGTGTT GGGGCCCTCA CCCCAACTCG 3660 

CATTGCCTGT AGAATTTCTT TTCGAAATTC TCTGTGTTGG GGCCCCTGAC TAQAATTGAA 3720 

AAAAGCTTGT TACAAGCGCA TTTTCGTTCA GTCAACTACT GCCAATATAA CTTCX3TAGAG 3780 

25 CATAGAATAT TGATTTATGT CCCAGCCTGA GTTAATTTTC TATAAAAGTA TATTTAATTT 3840 

OCGTTTATAC CGTCAAACTT CACTTTAGCT TTGTCAAACC CCTTTCTATT AAGTTTTCAG 3900 

AAATAAACCT ATCTTAAAAT ATAAAAAAAT CGAGAATTCG TAGITTAATA ACGAAATTCT 3960 

30 CGTTCTTATC CTTTTGAATA TACTCAATTT TCCACAAAAA CtAAACAAGTA GTATATCTGT 4020 

TCTAGCTACT AGAATGACAT ACTACTTGTT ATTAAAATAC TTAACTAAAC TTTATTAGTT 4080 

ATCTTTTTTC TCTATATTTC TACGTGACTG ACGCTTTTCA AGAATGTCAG ATTCATAATC 4140 

TTCTTGTTGA CTCTTGATAT ATTCTTGTAA GCGATGTTTA TTCG6AGTCA ATGTTAAACC 4200 

TAG«ATTTA CGTTCCTGGT TCGCATCCTT GTAGAAACTT ACCATCATGA GTATGACGAC 4260 

AAAGiSAGAAT GGGAATGCAC TTATAATTGC AGCACTTTGA ATCGCATTTA AAGCTTCAGC 4320 

GCCGTTACCX; CCACCAGCTA ATAAAAGTAC AAATGCTATT AAGGCCTGTG AAATTCCCCA 4380 

AACAACTTTT ACCATACTAG ATGGATTTAA TGAACCAAAT GTTGTTTGCA TTCCTAATAC 4440 

AAATGTTGCT GAGTCAGCAG ATGTAATAAA GAATGATGCA ATTAATAATA ATGCAATCAA 4500 

CGATAAAACA ATGCCAAATG GCACATGATT AAACACTCCA AATAGCT6TG TTTCAGGAGT 4560 

CATATCAAAA ATTTCTTTGT GTTTCTTACC TGTCTCGATG CCTAATACAC CAAAGACACT 4620 

AAACCAAACA AAACTAACAA TTGCTGGAAC TAGCAAGACA CCAGAAATGA ACTCTCTAAT 4680

TGAACGTCCT TTTGAAACTC GTGCAATAAA CACTCCAACG AATGGACTCC AACTTAACCA 4740 



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TGCTGTATCA AAACTATTAA ACAA6AATGT GTTTA6TAAA 
CATATTTAAA ATAAGAACAG TTGGTCCAAC AATTAAAGCA 
^ TAAACCAATG TTCAAGTTAC TTAAGTATTG AATACCTTTA 

TGCTATAAAT AAGATAGTAA CAACAATGAT GATAATCGCT 
AACATTGAAC AAGTAATOTA AACCACCATT AATTTGTAGA 


GGCTACCCCA ACGATTGTCG CAAATACAGA TAAAACGTCA 
TTCTACITTA TCACCTAAAA GAGGACGTAA AGTTCTAGAT 
ACGGAATTGC GAATATGCCA ACGTAACGCA ACAACACCAT 


CCCCAATGGA AAAATGTTGA ACGTAGAGCT TCAGTATAAG 
GCTGTAGGTG GCGTAGCAAA GTGCGCCATC GGTTCAGCTG 
ATCCCCATAC CAGCACTAAA CAACATAGCA AACCATGAAA 


TCATTTGGTT TACCTAGTTT AAGTTTTCCA ATAGGACTAA 
AACACGATAA TCGTAGTAA6 AATAAGATAA TACCAACCTA 

2s TTAAXATTAT TGGTAACATA GTTGAATTGT TCAGGTAAAA 

ATAGCAACAA CAATTGCACT ATAGATGAAG ACTGGTGAAT 
TCTGGTGAAG AAGAATTCAT AATTAATTAC TCCCTTCAAT 

30 GTAGAATAAA AATATTATCT AAACATTTTA TTCAATAACT 

(2) INFORMATION FOR SEQ ID NO: 250: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 250: 






ACCCGCGAAT 


ATGGTCCATC 


CTATCGATTT 


ATTTTTAACT 


GGTTTGACAA 


TATTTAATTT 


60 


TTCATAATCA 


TTCTTAGTGA 


TTTTGACATA 


TGTTTTCGGT 


ATGAGCCAGT 


TAATAAATGG 


120 


AAAGAAGAAG 


ACAATCCAAT 


TACTTGCCAA 


ATCaATCATT 


AAATATTCAC 


TATCGTATTT 


180 


GATTATTCGA 


TATTTAGGGT 


TTTTATTAAT 


AACTTTAGAT 


TCGCAAAGCA 


ATGTCTCCAC 


240 


ATCCCTTTAA 


TTTTATGTGT 


AATACATTTT 


TCGATACTTC 


AAAAGACATT 


CAAATACTAT 


300 


CAAGTTACTG 


TCATCAAAGG 


TTTTATTAAC 


TGATATTtTC 


ATATTTTTAa 


TCTGAATTTA 


360 






CTACCCGTAG A6CTAGTTAA 4860 

GCTACCATTA AAATAGTACC 4920 

CTTAATCCAG ACCATGCACT 4980 

TGTACAAACG TATTGTTTGG 5040 

GCACCCATAC CTAACGAAAC 5100 

ATAAAAATCC CAATAGGACC 5160 

AATAAACCTG GTTCACCTTT 5220 

AAACAGCCCA AGCATGGAAT 5280 

CTTCAGTAGT TTTGGGATCT 5340 

CACCATAAAA CACCAAACCT 5400 

TTGTATTAAA CTCAGGTTTG 5460 

AAATAAGGAA TATACAGAAG 5520 

ACTTTTCTGT AATCCACATT 5580 

ATGCACCAAG TAATACGACT 5640 

ACTTCTTTCC ATTTGGATTC 5700 

TCTATATTTA ATTTTATGTA 5760 

CACG 5804 



EP 0 786 519 A2 



(2) INFORMATION FOR SEQ ID NO: 251: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 964 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 251:

CCAGGGTGCG GAAAGCTTTA AAATTTGGAC TAATAAAGAT GCTGATATTA ATTCTATGAA 60 

AACAGCAGTT TTACAACAAT TAAAAGGAGA ATAACATATG CTTACTGGcA AACAAAAAAG 120 


ATACTTAAGA AGTTTAGCAC ACAATATTGA TCCGATTTTT CAAATTGGAA AAGGCGGTAT 180 

CAACGAAAAT ATGATTAAAC AAATAGATGA TACGTTAGAA AACAGAGAAT TGATTAAAGT 240 

ACATGTACTA CAAAATAACT TTGATGATAA AAAAGAATTA GCTGAAACAT TAAGCGAAGC 300 

TACTCATAGT GAATTAGTGC AAGTGATTGG ATCTATGATA GTGATTTATA GAGAATCTAA 360 

AGATAATAAA GAAATTGAAT T6CCATAATA ATGAAAAAGA TAryACTTTA CGGCGGTCAG 420 

TTTAACCCTA TCCATACTGC ACATATGATA GTAGCTAGC6 AAGTATTTCA TGAATTACAG 480

CCAGATGAAT TTTATTTTTT ACCTAGTTTT ATGTCTCCAT TGAAAAAGCA CCATGATTTT 540 

ATAGACGTTC AGCACAGATT AACAATGATA CAGATGATTA TCGACGAGCT TGGTTTTGGA 600 

GATATTTGTG ACGATGAAAT TAAACGTGGT GGTCAAAGTT ATACCTATGA CACGATCAAG 660 

GCATTCAAGG AGCAACACAA AGACAGTGAG TTGTACTTTG TTATTGGGAC GGATCAGTAT 720 

AACCAACTAG AGAAATGGTA TCAAATTGAA TACTTAAAAG AAATGGTTAC TTTTGTAGTT 780 

GTAAATCGAG ACAAAAATAG TCAAAATGTT GAAAATGCTA TGATTGCAAT TCAGATACCT 840 

AGGOTAGATA TAAGTTCGAC AATGATTCGA CAAAGAGTTA GTGAAGGGAA ATCTATCCAA 900 

GTTCTTGTTC CTAAATCCGT TGAAAACTAT ATTAAGGGGG AAGGATTATA TGAACATTGA 960 

AAAA 964 
(2) INFORMATION FOR SEQ ID NO: 252: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1193 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 252: 
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TTGTCATGAA 


ATAAATGGGA 


TGAATATCAC GACTAGAAGT AATGTTACGA ACAGGAGCGT 


120 




ATAAACTAGA 


GACGCTAAAT 


TCGACATAGT ATGTnGCTCA ATTATGGCTG ATGATGAATT 


180 


5 


TAAAGTATGT 


GCGTTGGAAC 


TGTCGGATTT TTGTTCATAA TGTTTTGCAT ATTGCGCCAT 


240 




GATGAATAGT 


GTAAAAATAA 


ATAAAACAAT AAGAGATATA ATGCCCATAA TCAAAAGTAT 


300 


10 


TTGTTTAGAG 


CCTTTCATTA 


TTTCACATCC TTTCTAAAAT ATATTTGTAA CTAAATTTAA 


360 


AATAGTTATT 


TTTGTAATTC 


TAAACCTTTT TCATCGCGAA AACAATTAAA TAGGTCGCGG 


420 




TATTAATTAT 


TATATTATTA 


CCGCTTAATA TGAAAAATAC ATGAaAATTA ATTTTCTAAT 


480 


19 


ATACTTTTGA 


AAAATTATTA 


CAAATTAGCC CCTTCAAAAC GCGAAAACAT AAGGATTCTA 


540 


GTTTCAAAAG 


GGCTGATAAG 


CATAAAATGA AATGTAATAT TTCGATGTAT AAAATTITTA 


600 




ATTAGCTAAA 


AATCATCGCA 


TTAATTTrXT GAGCTACATC ATCAAAATTC GGACATTTTA 


660 


SO 


AOGACACATA 


TAATTTAATT 


TTAGGTTCAG TACCAGAAGG ACGTAAAGCG ATAAATCCTT 


720 




06TCAAATAA 


GACACX3AATA 


ACATTTGATT TAGGAGAATT AATCTGC6AC GTTGTATCTT 


780 




TATCCAAATG 


ATAAACCTCG 


CTAGTTAAAT AATCTTCAAT TGCTTTCACT TTGAGTCCTT 


840 


2S 


GAATCTCTTG 


CGGIGGATTT 


GAACGGAATT TGGTCATTAT TGCATTAATT TTCTTTTTCC 


900 




CTTCAAATCC 


TTCTAGC6TA 


TGCGgAATAA TGTATCCTCA TGTCTACCAA CAGTTTGATA 


960 




AATCTGTTCT 


AATTCATCTT 


TCAATGTTTT GCCATATAAT TTTAACTCAG AAGCGTATTT 


1020 


30 


TATAATGAGT 


GGCACAATTT 


GTACGGCATC TTTATCACGT ACAAAAGGCT CTGATAGAAA 


1080 




ACCGTAACTC 


TCTTCAAATG 


CGAAAATCAT ATTTGATCAT CATCCAGTTG TCTTATTTCC 


1140 




TGAGCAATAA 


ATTTAAGCCC 


GTCAGCACCr CTTTGGTATT CAACATTATT ATA 


1193 



(2) INFORMATION FOR SEQ ID NO: 253: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1098 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 253:



TATCAGGATG ATTATGnCGG GGTTTTATTA AGTCTGAGCG 


TATATnCCTT 


TTATTCTCCT 


60 


AAATGGTTTG GAATTCAAGG AGAAGACGAT GAAATGGTAT 


CCAAAAACCA 


AnGAATATAT 


120 


nGCATTAAGT CTGAGGATGA TAGTGCGGTG GCAATTCGTT 


CATTAATTTT 


GCATAAAGAT 


180 


GAACCTATGT ATTTAAAAAA ACGTACATQT GTACCTACTT 


TGTTAATTAA TGGGGAACAT 


240 
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5 






AAAAAAATCT 


TCGAACATTC 


AGGACATGCA 


CCGCATATTG AAGAACCAGA AGCATTTATG 


360 




TAAAATTTTT 


AAAAAGCGTA 


TCATAATATG 


TGATATATAA 


ACCTAGGGCA 


420 


TAAAGTCCTT 


AGGCAATGTG 


AAAAAGCTGA 


TTACTATTCA 


TTATTTGATA 


GAAATCAGCT 


480 


TTTTTTGAAA 


TGTATTTGAT 


ATATACTGCT 


CGTTATGCGG 


CTATCTTCCT 


TATATTAAGT 


540


GCCATTAGTG 


CAAAACCTCT 


TAACAATTAG 


GTAAAAAGAG 


CATAAAAAAA 


GGAAGTTTAA 


600 


TAGAATGTAT 


CATCTATCAA 


ACTTCACCAA 


ATTGCGCTAA ACAAAATTAT 


AGTTCAATTT 


660 


CGTTGTTTGC 


TTCAGTGATT 


CGTTTATTTA 


CTCGACTCAA 


TAATGATTCG 


Arm-rrrAC 


720 


GTTGTTGTGC 


ATTAACAAGA 


ATTAATACAG 


TTCTTTCATC 


ATGCTCATTA 


CGTTTTTTAT 


780 


CGAAGTAATC 


TTCTTGAGAT 


AAAATTTTAA 


CTGCTTTAAC 


AACTTGTGGT 


TGTTTGTAGT 


840 


TTAAATGATT 


AATAATATCt 


TTAAGATAGT 


ATTCTTTCTC 


TTTGTTTTCG 


CTQATGTATG 


900 


TCAATACAGC 


GAATTCTTCA 


AAGCTAATTG 


AnAATTCCTT 


TTTAATTAAA 


CTTTTTAATT 


960 


TGTCAGCATA 


AGTGACCATT 


GATAACAACT 


CAAAGCAATC 


ATTGATTTTT 


GTAATTGCCA 


1020 


TGTTTAAAAC 


CTCCCTATTT 


GATGCATCTT 


GCTCGATACA 


TTTGCCCCGA 


TAATATAnTG 


1080 


TATCTAATCT 


TTATGnAT 
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(2) INFORMATION FOR SEQ ID NO: 254: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2881 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 254:






CCAGGTAAAA TTGTGCAATC ATTTGACGCA 


TTAATGGACG 


CCTTGGACAA 


TGAAGATTAT 


60 


6AAGGAGAAA AAGTCATTCC 


ATTCTTAGAT 


AAACATTTTA 


AATATCAAGA 


TGGCCGATCA 


120 


AGTGAGCGTT TAGTCAGAAA 


TTTATTTGGT 


AGCTAAGTTT 


ATATAGTAGT 


CAAAGTGGGA 


180 


GAGGTA7AAT GATGAAATTT 


TCAGTAATAG 


TTCCAACATr 


CAATTCAGAA 


AAGTATATAA 


240 


CAGAATTACT TAATAGCCTT 


GCGAAACAAG 


ATTTTCOGAA 


AACTGAATTT 


QAAGTGGTTG 


300 


TAGTTGATGA CTGTTCAACA GATCAAACGT 


TACAAATAGT 


TGAAAAGTAT 


CGCAATAAAT 


360 


TGAACTTGAA AGTAAGTCAA 


CTCGAAACAA 


ATTCTGGTGG 


TCCAGGTAAA 


CCTAGAAATG 


420 


kGGCGTTAAA ACAAGCAGAA 


GGTGAATTTG 


TATTATTTGT 


GGACTCCGAT 


GACTATATAA 


480 


ACAAAGAGAC TTTAAAGGAT 


GCAGCAGCAT 


TTATTGATGA 


ACATCACTCA 


GATGTCTTAT 


540 



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10 






CACCTGAAGT TACTTTGTTA AATTCAAGAA TTATCTATAC TTTAAGCCCG ACTAAAATCT 660 

ATAGAACAGC ATTACTAAAA GATAATGACA TTTATTTTCC AGAAGAATTA AAGAGTGCAG 720 

AAGATCAATT ATTTACAATG AAAGCATATT TAAATGCAAA TCGAATCAGT GTGTTAAGTG 780 

ATAAAGCGTA TTATTATGCT ACAAAGCGTG AAGGTGAACA TATGAGTAGT GCGTATGTTT 840 

CACCTGAAGA CTTTTATGAA GTCATOAGAT TGATTGCTGT AGAAATATTA AATGCAGATT 900 

TAGAAGAAGC CCATAAAAAT CAAATCTTAG CAGAATTTTT AAATCGTCAT TTTAGTTTTT 960 

CTCGTACGAA TGGCTTCTCA CTTAAAGTTA AACTAGAAGA TCAACCACAA TGGATTAATG 1020 

CTCTAGGAGA CTTTATACAA GCAGTTCCAG AACGTGTAGA TGCATTGGTG ATGAGTAAAT 1080 

TACGACCATT GTTGCACTAC GCGAGAGCGA AAGATATAGA CAACTATAGA ACTGTGGAAG 1140 

AAAGTTACCG TCAAGGTCAA TACTACCGTT TTGATAITGT AGATGGTAAA TTAAACATTC 1200 

AATTCAATGA AGGCGAACCA TACTTTAAAG GCATTGATAT CGCTAAGCCA AAAGTGAAAA 1260 

TGACAGCATT TAAATTTGAT AATCATAAAA TTGTTACAGA GCTAACGTTA AATGAATTTA 1320 

TGATTGGCGA AGGACATTAT GATGTCAGAC TTAAATTACA TTCACX3AAAC AAGAAGCACA 1380 

CAATGTATGT ACCTTTAAGT GTCAATGCGA ATAAACAATA TCGTTTTAAC ATTATGTTAG 1440

AAGATATTAA AGCGTATTTA CCTAAAQAAA AAATTTGGGA TGTTTTCTTA GAAGTCCAAA 1500 

TAGGTACGGA AGTATTTGAA GTGCGTGTTG GTAATCAACG TAATAAATAT GCATATACTG 1560 

CAGAAACAAG TGCATTAATT CATTTGAATA ATGATTTTTA TAQATTAACA CCGTATTTCA 1620

CAAAAGACTT TAATAACATT TCGTTATACT TTACAGCTAT TACATTAACG GATTCAATCT 1680 

CATTGAAGTT AAAAGGTAAA AACAAAATCA TTTTAACTGG TCTGGATCGT GGTTATGTAT 1740

TTGAAGAAGG TATGGCTAGT GTCGTACTAA AAGACGACAT GGTGATGGGA ATGTTAAGCC 1800 

AAACSrCAGA AAACGAA6TG CnAAATCTTA CTTAGCAAAG ATATTAAAAA GCGAGACTTC 1860 

AAAAATATTG TTAAGTTAAA CACTGCACAT ATCACTTATC CACTAAATAA ATAATAAATG 1920 

CCCTCAAATC ATTGTGAGCC AACATGATTT GAGGGCTTTA TTTTGCTGTT TATGACATGA 1980 

TTATGACATT TCCCTGATTT TCATTTTCAT ATACATTAAA TTGTATACAC TGGAAATGAG 2040 

GAGGTTATCT ATAATGATAA ATAAAAATGA CATAGTAGCA GATGTAGTAA CTGATTATCC 2100 

GAAAGCAGCG GATATTTTTA GAAGTGTGGG AATAGATTTT TGTTGTGGCG GACAAGTAAG 2160 

TATAGAAGCA GCAGCCTTAG AAAAGAAAAA TGTAGATTTG AACGAATTAT TACAGCGTCT 2220 

CAACGACGTT GAACAAACGA ATACACCAGG TTCGTTAAAT CCTAAATTTT TAAATGTTTC 2280 

ATCACTTATT CAATATATTC AATCAGCATA TCATGAACCT CTAAGAGAAG AATTTAAAAA 2340 



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SO 
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TGATAACCAT AACCTTTATT TAAAAAAACT TTGCCATTTT TTACTAyTAA AATTGATGCT 960 

CCAGGAATGT GTCCCTTTTG TAAATCATGC TCGATAATTG TATCTATTTG TTGTTGCGAA 1020 

tCATTGGTTA ACCGTGTCTT CGTATTGCTA TTTAAT 1056
(2) INFORMATION FOR SEQ ID NO: 256: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1277 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 256:






ATGCCaCACT 


TATTGGTGGC 


AGGATCGACG 


GGTAGTGGTA 


AATCTGTTTG 


TATAAATGGT 


60 


ATTATTACAA 


GTATTTTATT 


AAATGCTAAG 


CCGCATGAAG 


TTAAACTTAT 


GTTAATCGAT 


120 


CCGAAAATGG 


TTGAACTAAA 


TGTTTATAAC 


GGAATTCCAC 


ACTTATTAAT 


TCCGGTTGTT 


180 


ACAAATCCTC 


ATAAAGCTGC 


TCAAGCTTTA 


GAAAAAATTG 


TAGCTGAGAT 


GGAAAGACGT 


240 


TATGATTTAT 


TCCAACATTC 


ATCAACTAGA 


AACATTAAAG 


GTTATAACGA ATTAATCCGT 


300 


AAGCAAAATC 


AAGAATTAGA 


TGAGAAGCAA 


CCAGAATTAC 


CTTATATCGT 


TGTTATTGTA 


360 


GATGAGCTTG 


CAGATTTAAT 


GATGGTAGCT 


G6TAAAGAAG 


TTGAAAATGC 


GATTCAACGT 


420 


ATTACACAAA 


TGGCACGTGC 


AGCAGGTATA 


CATTTAATTG 


TAGCGACACA AAGACCTTCT 


480 


GTGGATGTAA 


TTACAGGTAT 


CATTAAAAAT 


AATATTCCAT 


CTAGAATAGC 


TTTTGCTGTG 


540 


AGTTCTCAAA 


CAGATTCAAG 


AACTATTATT 


G6TACTGG0G 


GCGCAGAAAA GTkACTTGGT 


600 


AAAGGTGACA 


TGTTATACGT 


TG6AAATGGT 


GACTCATCAC 


AAACACGTAT 


TCAAGGG6CG 


660 


TTTTTAAGTG 


ACCAAGAGGT 


GCAAGATGTT 


GTAAATTATG 


TAGTAGAACA ACAACAGGCA 


720 


AATTATGTAA 


AAGAAATGGA 


ACCAGATGCA 


CCAGTGGATA 


AATCGGAAAT 


GAAAAGTGAA 


780 


GATGCTTTAT 


ATGaTGAAGC 


GTATTTGTTT 


GTTGTTGaAC 


AACAAAAGGC 


aAGTACATCA 


840


TTGTTACAAC 


GCCAATTTaG 


AATTGGtTAT 


AATAGAGCAT 


CTAGGTTGAT 


GGATGATTTA 


900 


GAACGCAATC 


AGGTAATCGG 


TCCACAAAAA 


GGAAGCAAGC 


CTAGACAAGT 


TTTAATAGAT 


960 


CTTAATAATG 


ACGAGGTGTA 


AAAAAATGTC 


AGAAATGAAT 


GCGGTATATA ACGTTAAACA 


1020 


ATaCATTTtA 


AATTTgATTA 


AGCAAAATAA 


ATTGGAATAT 


GGTGACCAAC 


TTCCAAGTAA 


1080 


TTTATCAATT 


GCCAGAGAAT 


TAAATGTAAA 


AACCGACGAT 


GTTTATGAAG 


CAATTCAGCA 


1140 


TTGATTACTG 


AACAAGTCAT 


TAAAGATAtT 


TTGAAGAGGG 


CACAAGTGTT 


AAGTCACTGC 


1200 
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GrtTTGAATG CGGAACT 1277 
(2) INFORMATION FOR SEQ ID NO: 257:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3557 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 257: 

TACCGTCGTT TTATGyGTCA AATTTTACAG TAAATTTTGC TTCATCAAAA GAAATAACCT 60 

TTAACAAGTA TAGTAATTTT ACATTTTACA ATGTTACAAA ATATAATTTT TTATAATTAG 120 

TTAAAATCAC TAAAACGCTT TTATACACTA TCAAATCAGC ATTTATAAAA ATATGAACCG 180 

ATATCCTAAA ATGTTAATAA TATTACAAGA TAATAACArA CCACACAAAG CTACTTATTT 240 

TTGATAATAT GGAAATCGTA ATATAAAACn AAAACTTAAT TTACTATATA AATTGTCTTA 300 

ATAATTTTTA AAAGTAGTAA AACATAATTT TAAGGAGGAG TCCCTTTGAA AAAATTAGCA 360 

TTTGCAATAA CAGCAACATC TGGTGCAGCT GCATTTTTAA CGCATCATGA TGCACAAGCT 420 

TCTACACAAC ATACAGTACA ATCTGGTGAA TCATTATGGA GTATTGCTCA AAAATACAAC 480 

ACTTCAGTAG AGAGTATTAA ACAAAATAAC CAATTAGATA ACAACTTGGT ATTCCCTGGT 540 

CAAGTTATCT CAGTAGGTGG AAGTGATGCA CAAAATACGT CAAACACTTC TCCACAAGCT 600

GGTTCAGCAT CATCTCATAC TGTACAAGCT GGTGAATCAT TAAATATCAT TGCTAGCAGA 660 

TATGGTGTTT CAGTTGATCA ATTAATGGCA GCCAATAACT TACGTGGTTA TTTAATTATG 720 

CCTAACCAAA CATTACAAAT TCCTAATGGT GGATCAGGTG GTACAACACC AACAGCTACA 780

ACACSBTAGCA ATGGCAATGC ATCATCTTTT AATCACCAAA ATTTATACAC TGCTGGTCAA 840 

TGTACATGGT ACGTATTTGA CCGTOGTGCT CAAGCTGGTA GTCCAATTAG CACATATTGG 900 

TCAQACGCTA AGTATTGGGC TGGTAACGCA GCTAATGATG GTTACCAAGT AAACAACACA 960 

CCATCAGTTG GTTCAATTAT GCAAAGCACA CCTGGTCCAT ATGGTCATGT TGCTTATGTT 1020 

GAACGTGTCA ATGGTGATGG TAGTATCTTG ATTTCTGAAA TGAATTACAC ATATGGTCCA 1080 

TACAATATGA ACTACCGTAC AATTCCAGCT TCAGAAGTTT CTAGCTATGC ATTCATCCAT 1140 

TAATTAAATA AATTGTACTG ATATATACTA GCAATTCACA TCATGTGAGA TTGCTAGTTT 1200 

TTTATTTTTG AAAAAAATTT TCATTTTGGT ACAAAAAATT ATCTCACCCT TCCCTATCAT 1260 

ACATATTTAT ATTTTOTATG AATGGTAGTT AGGTAAAAAT TAACAACCTA CCTATTTGAT 1320 
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ATTTAATTTG TTATACCAGT ATTTTAOGCT TTTTCGTCTA CATATACAAA TTTATATTAA 1440 

ATAAAGCCCA ATACAATTTA GGTTAATTAA ACAAGTTGAT AACTATTTAA TTATTCCTTC 1500 

ATTGAAGAAT ATAAACTATT AAATCATTAT TTTGCTCTTA CATATATTTT AATGACCTAA 1560

CTGaTTATGT TCCATGGAAT ACATTTATAA TATAGCCTCC TAATTAArAT GCyTTGTCTT 1620 

GGTCATTCTA CGTAAATTCT ATAAAATATG TTATCTACTT ACATAArArn CTGrACTTCA 1680 


ATACCACCAT ATGTTTGTGA TACTGAAGTT CAGTTTAGTT TTATTTTCAA TTAGAAAAAT 1740 

AAGTTAAGTA TATAGAATAG TAAACCTGCT AACAATGCTG AAATAGGTAA TGTAATCACC 1800 

CATGTAATGA TCATTCGTTG CGCAgTGCTC CATTTTACAC CTTTAGCTCG GTTAGAAGCA 1860 


CCAACACCTA AGATTGATGA TGACACAACG TGAGTTGTTG ATAATGGGAA ATGTAGCGAT 1920 

GATGCAACAA AAATTGTTAA TGCAGATGAT AAATCGGCCG CAGCACCATT TGCTGGACGT 1980 

ATTTTCATAA TATTACCACC TACAGTTTTG ATAATTTTCC AGCCACCAAT TGCAGTACCA 2040

AGCCCCATTG cTQTCGCACA GGCAAATTTT ACCCATAACT GTGGTTCAAC ACTGCCATCA 2100 

TTCTGTACAT TAGCGACAAT CAATGCCAAC GTAATAATAC CCATTGATTT TTGOGCATCA 2160 

25 TTCGTACCGT GAGAGAATGA TTGTAACGCT GCTOTGAAAA TTTGGAAAAA TCTAAAGTTA 2220 

CGATTCGCTC TTGTTAAATT TGCATTTTTA AAGATAACTT TAAAAATTOA ATACATCAAG 2280 

AAACCAACAC AAAATGCGAT AATCGGTGAA ACXSATTAATA CAATAATAAT TTTTGTGAAA 2340 

30 CCTTGGTAAT GTAACACTCC AAATGAGCCT TCAGATGCGA TTGCTGCACC CGCAATTGAA 2400 

CCTATAAGTG CATGTGAAGA CGAACTTGGA ATTCCGTAAA ACCAAGTAGC TAAATTCCAA 2460 

ATAATAGCCG CAAGTATTGC AGCTAACACA ACAACTAATC CATTTTCCAA TTTAAATGGA 2520 

TCGACAATGT CTTTAGTAAT GGTGCCTGCA ACGCCCGTAA ATGTTAAAGC ACCTATAAAG 2580

TTCMCACTG CTGCCATTAA AATT6CCGTT TTAGGgTTAA CGCTCTAGTA GATACAGCAG 2640 

TAGCTACTGC ATTGGcTGTA TCATGGAaTC CcATTGATAA AGTCAAATAT CAGCGAGAAA 2700 

ATAACTACAG CTATAGTGAC GATGATTATA TATGACATAA ATATATACTC CCCTTAGCTA 2760 

TTTTTCATAA TAATAGTTTC AAAATTATTT GCTACGATTT GACATTTATC AGCXaATTTCT 2820 

TCCATGCTTT CATAAATATC TTTTATTTTA ATTAAAGTGA TTOGATCTGT TTCGCTATTG 2880 

AAAATATGTT TAATTGACTG TCTTAAAATA CCATCACAGT TTGTTTCAAA TTCTTTAATA 2940 

TTAATTGAAT GAATACGCAT ATGTGATAAT TTTTTATCGA CTAATAAGCC GACAGCAAGT 3000 

TTCATTTCTG CAACTGCTTT TTGAATGTTA TCAACAAACT CAGCCATATA TTCATCTGTG 3060

TATTCGATTG AATACATTTC AAACATrGCT GCCGTTTCTT CAATTGCATC TAAAACATCA 3120 
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TTTAAATCAG TAATTACTTG ATGTACTAAt tCOcACCATG TGACTCATAA GTTTTAATGT 3240 

TGTCTGAGTA TGCTTTTAAA TCTAAATGTG TATTGAAATC CATTTTACCG AATTCAATAG 3300 

CAGCACGATC CAGATTGAAA ACCATCTCTT CTAATTGAAC CATAAACTTA TCTTTTTTCT 3360

TACTAAACAT TTAAAATCCT CCATTTAAGC GATTGTCACC AATCACATTC AGTTATAATT 3420 

TGTTTCAAAT TAAGACAAGT GAATTTACAA ACTAATGATA CAAATTTGTT ATTATCAATC 3480 

GTCAGTATAA TTTTAGTGTA CTGATATTAA TTTCAAAAAT GCCTCACAGT AAACAATTTA 3540 

CTGTATTTGC CCTTATA 3557 



(2) INFORMATION FOR SEQ ID NO: 258: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1631 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 258:



AACTATACAT TTCGAAAAAT TCTTCTAGTG AACCTGCGCC ACCAGGAGCC ATGACAAATG 60

CATCTGCAAG TTCTGCCATT TTATTTTnAC GTTCATGCAT AGAATCAACT AAAATTAATT 120

CAGTTAAACG TTGGCTTGTG ATTTCATGTT CATCTAACAT TTTAGGCATG ACGCCAATAG 180

CTTTGCCGCC ATGATCTAAT ACACCATCTT GaATGGCACC CATAATGCCA ATTGACCCTG 240

CACCAAATAC TAATTCATAA CCTTGTTCAG CAAAATATTT ACCTAAATCG TATGCTTTTT 300

GTACATATGA AGGGTCATGA CCTTTGCTTG CACCACAATA AACTGCGATT CGTTTCATGT 360

TAATCCAGCT CCTTAATTCG ATGAATGACT TTTAATAGTG ATTGTTCAAA CACTTTTTGA 420

TCTTQCTTTG TAAAAGGTGG GGGACCTTTG TGGCGACCAC CTTGTTTTCT AATTTGTGCA 480

TTCATATATC GTTTATCTAA TAGTTGTTGA ATATTTTrGG AATTGTATAT CTTCCCATTA 540

TGATGCATGA CAATTAAGAC TTTGT06ACT AATAAACTTG CGAGTCCATA ATCTTGAGTG 600

ACTACGATAT CATCCTTCGT TGATAATTGA ACAATTTIGT AATCAACTGC ATCTGGTCCA 660

TCATCAACAT ATAATGTTGA TACATGTGGA GGATATAATT GGTTCGAAAA ATGGCTGAAG 720

CTCCGAATAA TTGTCACAAA AATGCCTGTC TCAGTTGTTA AATCTATAAT AGAATCAACA 780

ACAGGACAAG CATCTCCATC AATAATAATA TGTGTCACAA TTATGCCTCT GTATTGTTTT 840

CTTTATTTTG TTGAGAGGCG CTTTTGGCAA CATAATCTTT ATATTTTTTA AATGACTTGA 900

TGCGTGCTTT ATCAGCTTCT TGTTGGCGTT TTTGTTCTTC TTTGTGTCGT TTTTCAATAT 960

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CGCCTTTTTT CTCAGTTTTC TCATCTAATT TATTAGGTGT TAAGCCTGCT TTTTCTTCGT 1080 

ATTTTTGTGA TTTTTTCATA TCTTTAATAC GTTGTATTTC ATTCTTTTCG CGGGCTTTTT 1140 

GCTCTTCTTT ATGACGCTTT TCGATATTTT TTTGAAGTAT TTTATTCATT TTATCAGCGT 1200 

CTTTACGATT TTGTTTAGCT AATTTTTCGC CTTTTTTCTC AATATAGGCA GGATCATGTT 1260 

CTCTAGCAAA CTTTTTAAGT TCACGTTTAT TTTCAAAATC TTGTTTTTTA TCGCCGACAT 1320 

ATTCTTTAAC ATCACTCGCT GTGTTACTGA TTGCTGCAGA TGTTTTTGAA GCAACTTTAC 1380 

TTGTAGCATC TCTAACTTTT TGTACGTCCG GATGTTGTTT GATACGTTTA CGTTCAACAA 1440 

TTAACGGTAC CAATACAATT GGTAATACAT TAATCATAAA TTTGATGACT TTTTTCTTAT 1500 

CCATAGATCT TGcCTCCaTA ATTACTTTAT TAAtTTTACa TACCCtATGa TACATCAATA 1560 

TAAACGATGA TAGTAGTGAA TCACTATTAA GTATTTCAGA TGTTTtTTAA AAgaAGaCCC 1620 

AATTAnAAAA A 1631
(2) INFORMATION FOR SEQ ID NO: 259: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6645 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 259: 

CGAAATCATG ATTTAATGCT TTTTCATATA AGCTTTTCCA ATTAATCTTT CGTCCATGAT 60 

ATTCTTCAAC TGTTGCTAGA TATTGTGCAA TTTTAGTTAC TTTAAAGGAG TGTGCTGCAA 120 

CaTTGTGkTC tnAAATATTTA AATTTTCCaG GtAATCTTAT AAGTCTTTCC aTATCTGATA 180 

ATCTffTTAAA ATATTGATGT ACACCCATTT CAATTACCTC CTCCATTAAT TAATCATAAA 240 

TTAIACTTTC TTTTTACATA TCAATCAATT AAATATCATT TaAATATCTT CTTTaTATAA 300 

cTCTGATTAA ATGATACCAA AAAATCCtCT CAACCTGTTA CTTAAACAGG CTAAGAGGAT 360 

AGTCTTGTCT TGATATATTA CTTAGTGGAT GTAATTATAT TTTCCTGGAT TTAAAATTGT 420 

TCTTGAAGAT TTAACATTAA ATCCAGCATA GTTCATTTCA GAAACAGTAA TTGTTCCATT 480 

AGGGTTTACA GATTCAACAA CACCAACATG TCCATATGGA CCAGCAGCTG TTTGGAAAAT 540 

AGCGCCAACT TCTGGTGTTT TATCTACTTT AAATCCTGCA ACTTTTGCTG CGTAATTCCA 600 

GTTATTTGCA TTGCCCCATA AACTTCCTAT ACTTCTACCT AATTGTGCAC GACGATCGAA 660 

AGCATAATAT GTGCAGTTTC CATAAGCATA TAAGTTTCCT CTGTTAGCAA CTGATTTATT 720 
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TACATTAACT GTCTTAGTTA CTGCTTGCTT AGGTGCTTGC TTAACTACTA CTTTTTTAGA 840 

TGCTTGTTGT ACAGGTTGTT TTACTACCTT TTTAGCTTGG CTTGCTTTTC TTACTGGTGA 900 

TTTAACCGCT TTAGTTTGTT TCACTTTATT TTGAGGCACA AGTGAAATCA CGTCACCAGG 960 

AAAAATTAAA GGTGTTACAC CAGGATTGTA TGAATATAAT TGATTCAACG TTAAaTGATG 1020

CTCTAAAGCA ATCTTATATA ATGAATCGCC AGCAACTACT GTATAAGTTG TCGGTGATTG 1080 

CGTTTGTGCT TGAACATTTG ATACATAATT ATGTTGAACA GGTGTTTTTA CTTGTGTGCC 1140 

ATGTTGTTGT GCATGTGCTG CATTATTTAA AGCTAAAAAA GCTAACACTG ACGAAACCGT 1200 

CACTGTAAGA GATTTTTTCA TCTTGCTGTC ATTCCTTTGC TGTTAGTATT TTAAGTATGC 1260 

AAATACTATA GCACAATACA TTTTGTCAAA AGCTATTGTT ATAACGATGT AATCAAATGG 1320 

TTAACAATAT AAAAAGAATA CAACCTTTTA TCATAGTGTA AAATGTATTC ATACCATGTA 1380 

ATTGAGAACG TTTTCAATAA TTAATTCAAT ACCTTGAAAA TCGCCATAGG TAATATTACT 1440 

AAATGCACAC TGCATATGTT GTTTTAACAA ACACAACTTT TAAAAAATAT ATTCTAACTC 1500 

TATCTACCX5A ATTGTACTTA AATATTCATA AACAAATCAT ATTCCAAAAT CTAATTTACA 1560 

ATTTATTTAG CTACCTTTAA AAAAACCAAA AACCGACGCC CTTTTAGAGC CTCGGTTTTA 1620

AAATATATAT TAATCGTGCG ACATTGTCTG TCTTAAATAT GATTCGATAA ATGGTCCAAT 1680 

GTCTCCATCC ATCACTGCAT CAACCTTACC TGTTTCTTCG TTCGTACGAT GATCTTTCAC 1740 

CATTGAGTAT GGATGGAAAA CATATGATCT AATTTGGCTT CCCCAGCCGA TTTCTTTTTG 1800

TTOGCCACGA ATTTCAGCCA TTTCACGTGC CTGCTCTTCC AATTTTAATT GATATAATTT 1860 

AGACTTTAAC ATTTTCATAG CTGCTTCACG GTTTTTAATT TGAGAACGTT CATTTTGGTT 1920 

ATTAACAACT ATACCTGAGG GGTGGTGGGT AATTCXrTATT GCCGATTCAG TTTTGTTAAT 1980 

ATGCTGACCA CCTGCACCAG AAGCTCTGAA TGTATCAACT GTAATATCAT CCGGATTGAT 2040 

TTCAATCTCT ATTTCATCAT TATTAAAATC TGGAATAACG TCGCATGATG CAAATGATGT 2100 

ATGACGACGT CCTGATGAAT CAAATGGAGA AATTCX3TACT AGTCGGTGTA CACCTTTTTC 2160 

AGCTTTTAAA TAACCATAAG CATTATGCCC TTTGATGAGC AATGTTACAC TTTTAATCCC 2220 

CGCTTCATCC CCAGGTAGAT AATCAACAGT TTCAACTTTA AAGCCTTTCT TCTCACAATA 2280 

ACGTTGATAC ATTCTAAATA GCATATTAGC CCAATCTTGA GACTCCGTGC CACCTGCACC 2340 

AGGATGTAAC TCTAGAATTG CGTTATTGGC ATCGTGAGGC CCATCTAATA ATAATTGCAA 2400 

TTCGTATTCA TCCACTTTAG CCTTAAAATT AATGACCTCT TGCTCTAAGT CTTCTTTCAT 2460 

TTCTTCATCA AATTCTTCTT GTAATAAATC CCAAGTAGCA TCCATGTCAT CTACTTCTOC 2520 
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TTGCGCTTTC GTTTGGTTAT CCXy^AAAATT 
AGGTTCTGCC ATCATTTCTT CATATTCTTG 2640

AATATTAGTT TCTTTGTTCT CTAAGTCAAA GAGACCCCCT AATTTGTGTT AAATCTTGAT 2700

TATACTTATC TATATTTCGT TTGATTTCTG ATAATTCCAT AGCATTCGCT CCTATTTATA 2760

TTTCAATTCA AGTCATTGAT TTGCATCTTT TATAATGCTA AATTTTAACA TAATTTTGTT 2820

AAATAACAAT GTTAAGAAAT ATAAGCACAC TGACAATTAG TTTATGCATT TATTGTTAAA 2880

AATCAGTACA TTTATCATCG ACATATGCCT AAACCGATTT TTTAAAACTA AGTACATAAC 2940

AACGTTTAAC AACTTCTTCA CATTTTTTAA AGTATTTAAC GCTTGTAAAA TAAAAAGACT 3000

CCTCCCATAA CACAAACTAT AGGTGTTTAA TTGGAAGGAG TTATTTTATA TCATTTATTT 3060

TCCATGGCAA TTTTTGAATT TTTTACCACT ACCACATGGA CAATCATCGT TACGACCAAC 3120

TTGATCGCCT TTAACGATTG GTTTCGGTTT CACTTTTTCT TTACCATCTT CAGCTGAAAC 3180

GTGCTTCXSCT TCACCAAACT CTGTTGTTTT TTCACGTTCA ATATTATCTT CAACTTGTAC 3240

TACAGATTTT AAAATGAATT TACAAGTATC TTCTTCAATA TTTTGCATCA TGATATCAAA 3300

TAATTCATGA CCTTCATTTT GATAGTCACG TAATGGATTT TGTTGTGCAT AAGAACGTAA 3360

GTGAATACCT TGACGTAATT GATCCATTGT GTCGATATGA TCAGTCCAAT GGCTATCAAT 3420

AGAACGAaGT AAAATCATAC GCTCAAACTC ATTCATTTGT TCTTCTAAGA TATCTTTTTG 3480

ACTTTQATAT GCTGCTTCAA TCTTAGCCCA AACGACTTCG AAAATATCTT CAGCATCTTT 3540

ACCTTTGATA TCATCCTCTG TAATGTCACC TTCTTGTAAG AAGATGTCAT TAATGTAGTC 3600

GATGAATGGT TGATATTCAG GCTCGTCATC TGCTGTATTA ATATAGTAAT TGATACTACG 3660

TTGTAACGTT GAACGTAGCA TTGCATCTAC AACTTGAGAG CTGTCTTCTT CATCAATAAT 3720

ACTATTTCTT TCGTTATAGA TAATTTCACG TTGTTTAC6T AATACTTCAT CGTATTCTAA 3780

GATAEGTTTA CGCGCGTCGA AGTTATTACC TTCTACAOGT TTTTGTGCTG ATTCTACAGC 3840

TCTTQATACC ATTTTTGATT CAATTGGTGT AGAGTCATCT AAACCTAGTC GGCTCATCAT 3900

TTTCTGTAAA CGTTCAGAAC CAAAACGAAT CATTAATTCA TCTTGTAATG ATAAATAGAA 3960

GCGACTATCC CCTTTATCAC CTTGACGTCC AGAACGACCA CGTAACTGGT CATCAATACG 4020

ACGAGATTCA TGTCGCTCTG TACCTATTAC TGCTAAACCG CCTAATTCCT CTACGCCTTC 4080

ACCTAATTTG ATATCTGTAC CACGACCAGC CATGTTAGTG GCAATAGTAA CGGCACCTTT 4140

TTGTCCAGCG CCTGCAACAA TTTCAGCTTC ACGTTCATGA TTTTTCGCAT TTAACACATC 4200

ATGACGGATA CCACGTTTTT TAAGTAAATT TGAAATATAT TCAGAA6TCT CAACTGCAAC 4260

AGTACCTAAT AGCACTGGTT GCCCTGCCTT GTGTTTTTCA ACAACATCTT CTACTACTGC 4320

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TTTATTTGTC GGAATTTGAG TTACTGTCAT GTTATAAATA TTTCTAAATT CTTCTTCTTC 4440 

AGTTrrAGCT GTACCTGTCA TACCCGCAAO TTTATTGTAC ATTCTGAAAT AGTTTTGGAA 4500 

TGTAATAGAC GCCATAGTTT TAGATTCATT TTGAATTTGA ACGCCTTCCT TCGCTTCAAT 4560 

AGCTTGGTGT AAACCTTCCG AGAAACGACG GCCTGGCATT GTAC6TCCTG TAAATTGATC 4620 

GACAATTAAT ACTTCGCCAT CAACAACCAT ATAGTCTACG TCACGTTGTA AT6TAACGTG 4680 

COCACGTAAA GCTGTGTTGA TATGACTAAT AACATCAACA TTTTGTACAT CATATAAGTT 4740 

TTCAACTTTG AACATACGTT CAGCTTTATC CGCACCTTGT TCTGITAAAT GTACAGCTTT 4800 

CGTTTTTTCA TCGTATTTAT AATCTTCGTC CTGTTTTAAC ATTTTCGCAA AAACATTTGC 4860 

TTGTGTATAA AGTGACGTTG ACTTTTCAGC TTCACCAGAA ATAATTAATG GCGTACGTGC 4920 

CTCGTCGATT AAAATTGAGT CAACCTCATC AATGATTGCA AAATGTAATG GACGCATTAC 4980 

TCTATCTTCA GAATAATTCA CCATGTTATC TCGTAAGTAA TCAAAACCTA GCTCATTATT 5040 

AGTACTGTAA GTAATGTCTT GTGCGTATGC TTCACGTTTT TCTTCTGTCX5 TCTTACTGTT 5100 

TAAGTTTAAT CCGACAGTCA AACCTAAGAA GTTATATAAC TCAGCCATTT CTTCACTTTG 5160 

AACACTTGAT AAGTATTCAT TGACTGTAAT AAOGTGAACA CCTCTACCAG CTAATGCATT 5220 

TAAGTATGTT GGCATTGTCG CTGTTAATGT TTTACCTTCA CCTGTTCTCA TCTCAGCX3AT 5280 

ATCACCTTTA TGAATTGCAA TACCACCCAT AATTTGAACT TTATATGGTG TCATATTGAA 5340 

TACACGTTTA GAGCCTTCTC TAACAAGTGC ATATGCTTCT G6TAAAATTT TATCTAAATA 5400

ATCATTTTGC TTTTTGACAT TATCAATGTC AGCTAATTCT GTTTGGAATT GTTTC6TTTT 5460 

ATTACGAATT TCTTCATCAG TTAAAATTGC CGTTTTTTCT TCTAAAGGGA TTACTTTATC 5520 

AGCAAGTTTA CCTAACTGTT TAATTTCTTT ATTATTGCCA TCAAGAATTT TTGATAAAAA 5580

TCCCATTTCXS TTCGCTCCTT TAGCTAAAAA ACTGTTTGGC CTACAACAAT ATATCTTATC 5640 

ATTTATAGTT AGAAAATTAT ACTTATTTAC TCATTTGTAG AATCAATATA AATATATTTA 5700 

TGACATACTT CATTCACATT CTGTTGTCAA CAAGTTTATC ACTAATAAAT ATATTCTCAA 5760 

TACGCAATTA TACTTCCTAA TAAATTATAT TATAAATATT TTACGATTTT CGACTCX^C 5820 

TATACAATAG ACTGACATAC TATTATTAAC TTAACATTCA AATATATACA TCCATTAACA 5880 

TTAGCATAGT CACTATGTTT CATTCAACAA ATTACATTAT CGAACTATGA AATAGTCATA 5940 

ATTTGCTTTT GGAGTATAAA AAAGCACTT6 TGCAAAAACA CAAGTGCTTT AAACTTAATT 6000 

TATTGTTCAC TAGTTTGAAT CAAGCCATAT TTACCGTCTT TACGGCGGTA AACXaVTACTT 6060 

GTTCCATCAG TTTCTCTGTC TGTGAATACA AAGAAGTCAT GACCTAATAG ATTCATTTGT 6120 
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ATCTCX3TTAT CATCGTAAGC GTCATTATCA ACTTGT6TTT CTTGCATTTC TTGTAATTCG 6240 

GCAACAAACA CTTCTTGATC TCCTCGATCA CGGCTCTTAC GATTAATACXJ TGTTTTATAT 6300 

TTTCGAACTT GTCTTTCAAG TTTATTATTA ATTAAATcAA TACCTGCGTA TAAATCATCG 6360 

TTTCGCTCTT CAGCTCTTAA CGTAACATTT TTCAATGGAA TTGTTACTTC AATTTTAGTA 6420 

GCTGAATTTG AATAAGTTTT AACTTTAACA TGCGCCACTG CATTTQGTAC GTCATTAAAA 6480 

TAACGTTCCA ACTTACCAAT TTTTTCCkCA ATATAGTTGC GAATAGCATC TGTGATAGTG 6540 

AGGTTATCTC CATGAATTTC AAATCTAATC ATAGTAAATC TCTCCrrAAA CCTCTTTATxi 6600 

GGnAACTCnT TATTATATTT AACATTTTTA CGCCAATCGT GCAAA 6645 
(2) INFORMATION FOR SEQ ID NO: 260: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 7430 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 260: 

CAGTTCCAGC ACAtCTATTG GGGATCAACA AACTAGGGAA AATGCTAATT ATCAACGTGA 60 

AAACGGT6TT GACGAACAGC AACATACTGA AAATTTAACT AAGAACTTGC ATAATGATAA 120 

AACAATATCA GAAGAAAATC ATCGTAAAAC AGATGATTTG AATAAAGATC AACTAAAGGA 180 

TGATAAAAAA TCATCGCTTA ATAATAAAAA TATTCAACGT GATACAACAA AAAATAACAA 240 

TGCTAATCCT AGCGATGTAA ATCAAGGGTT AGAACAGGCT ATTAATGATG GTAAACAAAG 300 

TAAAGTGGCG TCACAGCAAC AGTCAAAAGA GGCAGATAAT AGTCAAGATT CAAACGCTAA 360 

TAACaATCTA CCTTCACAAA GTCGAATAAA GGAAGCACCA TCATTAAATA AGTTAGATCA 420 

AACAAGTCAA CGAGAAATTG TTAATGAGAC AGAAATAGAG AAAGTACAAC CACAACAAAA 480 

TAATCAAGCG AATGATAAAA TTACTAACTA CAATTTTAAC AATGAACAAG AAGTGAAaCC 540 

TCAAAAAGAC GAAAAAACAC TATCAGTTTC AGATTTAAAA AACAATCAAA AATCACCAGT 600 

AGAACCAACA AAGGACAAT6 ACAAGAAAAA TGGATTAAAT TTATTAAAAA GTAGTGCAGT 660 

AGCAACGTTA CCAAACAAAG GGACAAAGGA ACTTACTGCA AAAGCQAAAG ATGATCAAAC 720 

GAATAAAGTT GCCAAACAAG GGCAGTATAA AAATCAGGAT CCTATCGTTT TAGTGCATGG 780 

TTTCAATGGG TTTACAGATG ATATTAATCC TTCAGTGTTA GCTCATTATT GGGGCGGTAA 840 

TAAAATGAAC ATTCGCCAAG ATTTAGAAGA AAATGGTTAC AAAGCTTATG AAGCAAGTAT 900
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45 
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TCGT6TAGAT 


TATGGTGCAG CACATGCAGC AAAATAT6GA CATGAACGTT ATGGAAAAAC 1020

ATACGAaGGA ATTTACAAAG ACTGGAAACC AGGACAGAAG GTACACCTAG TTGGACATAG 1080

TATGGGCGGT CAAACGATAC GTCAACTAGA AGAATTACTG CGTAATGGTA ATCGTGAAGA 1140

AATAGAGTAT CAAAAGAAAC ATGGTGGCGA AATTTCTCCA CTATTCAAAG GTAATCATGA 1200

CAATATGATT TCATCAATTA CTACTTTAGG AACACCACAT AATGGTACAC ACGCATCAGA 1260

TTTAGCTGGT AAT6AAGCTT TAGTGAGACA AATCGTATTT GATATCGGTA AAATGTTTGG 1320

TAATAAAAAT TCAAGAGTAG ACTTCGGGTT GGCTCAATGG GGTCTAAAAC AGAAGCCAAA 1380

TGAATCATAT ATTGATTATG TCAAACGCGT TAAACAATCT AATTTATGGA AATCAAAAGA 1440

TAATGGATTT TACGATCTGA CGCGTGAGGG TGCAnCAGAT TTAAATCGTA AAACGTCGTT 1500

GAACCCTAAC ATTGTGTATA AAACATACAC TGGTGAAGCA ACGCACAAAG CATTAAATAG 1560

CGATAGACAA AAAGCAGACT TAAATATGTT TTTCCCATTT GTGATTACTG GTAACTTAAT 1620

CGGTAAAGCT ACTGAAAAAG AATGGCGAGA AAACGATGGT TTAGTATCCG TTATTTCTTC 1680

TCAACATCCA TTTAATCAAG CTTATACAAA AGCGACAGAT AAAATTCAAA AAGGCATTTG 1740

GCAAGTGACG CCTACAAAAC ATGATTGGGA TCATGTTGAC TTTGTAGGAC AAGACAGTTC 1800

TGATACAGTG CGCACAAGAG AAGAATTACA AGATTTTTGG CATCATTTAG CAGACGATTT 1860

AGTGAAAACT GAAAAGCTGA CTGATACTAA GCAAGCATAA TTTATAAAGT AAAGGGAGGA 1920

ATTAATAATG ACTGCAGACT TCTTTCAATT AATCGGATCA TTATTTAGAA TTCTAAAAGA 1980

ATTATTCAAG TAAAACATTG GCGAGGCCCC AACATAAAGA ATTTCGAAAA GAAATTCTAC 2040

AAACAATGCA AGTTGGCGGG GCCCCAACAA AGAAGCTGGC GGAAAGTCAG CTTACAATAA 2100

TGTGCAAGTT GGCX3GGGCCC CAACATAGAA GCTGGOGGAA AGTCAGGTTA CAATAATGTG 2160

CAACTTGGGG TGGGACGACG AAATAAATTT TGCGAAAATA TCATTTCT6T CCCACTCCCA 2220

TTGGCATTTA CGAAGTTTAA ATGTGCAATT AGAATATATG TATAACAATA TTAAACACGC 2280

GGTAAAACGA AGTCAGTCAA TTCAAACTGA TTTCGCCCAC CGCGTGTTTT TAACATAGCT 2340

TAATAATTAA TAAGCATTAA TGTTCAATTT ATATGGTTGT TTTCCAATAA TAAACCTAAA 2400

GATATAGAAT TCACGCAATA TCATGCCGAC ACCTATACAT AATCCTAAAA TGAATAGTAG 2460

TGATATCGCT AGAAAGACCA TTGTATTATC CTCAAATATA TTTGTATATG CAAACAATGA 2520

GTCTAGAATG ATTGGATGTA ATAAATAAAT AAAGAATGAG AAAGCACTAA TCATTTGAAT 2580

CGTATTAAAT AACATTGTTT TAAAATGCGT GCAAATACCC AAGATAACAA TAAACATAAT 2640

ACTATTATAT GGTGTTAATG AATATGAAAA GCTGGTAACG TTCCAATAGT CTCCaTTTGT 2700
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TCTAAGAAAT TTAATACACG TTCGTAGTTA 
TAACCCATAT ATGCACCTAA GAAGAAATAA 2820

AAAATCCATC CGAATATTAT AGTATTTTCA CTTAATGGAT AATAGTGTAG CACGGTATCG 2880

TGAAACGCTG TGTTGTTCGT AAAGTAATAT AAAAATGATT GCTGTAAAAT AAAAGATAAC 2940

AATAATAATA TTTTACT6TT GAATAGGTTA TAGTTAATTT TAAAAATGAT ATAACTCAAA 3000

ATAAAGAATT GCATGATAAC AACGATAAAA TAGCCATACC ATTGACCTAA TAGGACATTT 3060

TCAATGAATT GTTTATTGAA ACTTGAATCT GTTAATAAT6 ATTCACTATA ACTGTAAAAC 3120

AATCCCATTA ATATGTAAGG AATAAGTATA TATTTTACX3C GTGTAGTTAA GTATCTATAG 3180

GTGACTTTTT GGTAATTCAA GGTTGTCAGT AACTGTGACA AGATAATAAA GCAAGGTGTA 3240

CCAAAAATCA CAATATTACG AATGTAAAAT TGTAACACTA AGGATCCACC CTCCATATTT 3300

TCATGTTTTA AAGTAATTTG TGTAAGTAAA TGTGTGATAA TAATAATTGC ACATATAATA 3360

GCACGTAAAT ATACGAGTTC AAGTCTAATC TTTTTCATGG AATCCGTCCC ATCTCTTAAT 3420

TAAATGCTCA AAAGCATCAT CACTAATTAA TATTCTAGGG ATGTAATAAT CATTGGAGTT 3480

CGGAGTGACT GCTTTTTCCT CTAATGAAAA ACCGTATTTT AACCCAGCTT TTTTGATTAC 3540

CGGTAATTTA TCGTCATTCA TCAAGCCATA AGGATAGGCT ATAGTTTTCT GCGACTTTTT 3600

AAAGTTTTTA GTTAGATATT TTTCACTTTT GTTTAAATCT TTTATGATTG TAGCTTCAGA 3660

AGCTTTCATT AATTTTGACT TATTATTTTT AGATAAGTTA TGCAAATCGT GGGTATGTGT 3720

TTCAAATTCC CATAACCCAG TTITATACAT TTCTTTTAGT TCrTTTTTAC TAATCATATC 3780

GAGGTT6TGA AAGTTTTCTT CCCCAACATG ACCTGTGATA ATAAACCCAG TTGCCGGTAT 3840

TTTATATTTT TTTAAGATTG GATAAGCATT TTCATAAATA GTTTCATCCA TATCATCAAA 3900

GTTAATCCAT ACACTTCGTT TTGOaAACTT ACCTTTTTTC TTGtAATATA AAAATTCTTT 3960

CAAQprrAAA AATTTAGCAT CATGTGATTT TAGCCATTTT ATTTGAGATT CAAATTGTGA 4020

TTGRCTAACA CTATAATTTT TAATTTCTTT ACTACTAGAA AAGAAGTAAA TAAAATTATT 4080

CAGAAAATTC GCTTTTCTTA CACGGTGATA ATTTAATGCC AGAGCACTAT TTTCTTTATA 4140

TrrCAGTTTT TTAGGTGAAT CGTCATCTGC ATTTGCAATA TGATGACCAT CCAGTGTGCT 4200

TACAGGCAAT ATGATCAAGA TACTCAACAC TAAAATTATA AATTTTCTAT ACTTCACGAT 4260

TCTCTTCCTC TCTGCCATTT TTGAATCAAT ATGCTAATTG TAAAAAATAC AAAAATGATA 4320

ATCGCGAAAA TGCCCATAGT TTCAAATATA TCTAAAATTT CAGTATTTTC AATGTTTAAA 4380

GCAACACGTA TTGTATTGAT ACTTTCGTCA TGAATTTCAA ATATAGTACC AATATAAACG 4440

AGTAGAACAA CTAAACAATA TATCCAAAAG ACACAAGATA TAGCGATAAS TGCTGTTTCT 4500



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TTGCGTAACC ACCTTTCTTA CGTTTTAATG CTTTTGGAAA TGCGACAAGA ACTACTGCTG 4620 

CGTTAATAAT CCAGTATACT GTCGGATACC AACTTACAAA TATGAGTCCA GCCATATTCT 4680 

TTTTCTCGTA GCGACTATCA ATAAAGAGTG CGACTGTAAA TTGAATAACG TTTATAAAAG 4740 

TCATAGTAAA TGATGATAGT AGAAATATTG AAAAACTATA TGTCATAAAT GTATAGTCTA 4800 

AGAAGTTTGC TGTTATGAAC AAATAGCCTA AATATAGAAG CACTATATAT ACCCATAAAA 4860 

TCGAGATGAT TTGCTCAAAC ATCAAAATAT ATAAAGGAAA CCTTTTCGTT TTCATTGTGC 4920

TAAAAAAGTC TC6TAGTAAT ACTTCGTGTC CCCCTTGAGC CCATCTCACG CGTTGCTTCC 4980 

AAAGACCTCC CAATGTTTCT GCAACCAACA TCCAACACAT GGCAAGCGGT TCATACTTAA 5040 

TACGATATCC ACGTAAATGC AATTTCCAAG AAACTGCAAT ATCTTCGGTA ATCATATCAG 5100 

TATCCCAGTA GCCAACGTCG ACAACTGCAC TTTTTTTAAA TAGAGTGAAG ACACCCGAAA 5160 

TAGTATTGAC TGCGCCAGCA AGTGTCTGAC TtCXSCTTAAT ACAGCCAATT AAACTTGCAT 5220 

ATTCTATCGT TTGAATTTTA CCTAAAATAG AACTCTTATT TCGAATTCTA GGATTACCTG 5280 

TAACTGCACC AAGTTTTGGA TCATGTTTGA AATTCTCAAT CATATAATAT GGTGCATCTT 5340 

GATCAACGAT AGTATCTGCA TCCAAGCACA TTACATAATC ATATGAAGCC TGTTTAATGC 5400

CTTGATTGAG TGCGTTGGCT TTACCTCTGT TTTCTTGTAA ATCGACGAAA ATAAAGTCAT 5460 

TATTTTCTTT GATTTTATAG ATGA6TTCTG CTGTATTATC TGAACTTCCA TCATTAATGA 5520 

TAATAATTTC TTTCTTCTCG TATTTGAGTG CAAGAACATT AGACAACGTA TCTTCAAyCG 5580

TTTCACTTTC GTTATAACAG GCAAGTAAAA ATGTAATGCC TTCTAATTCA TCCACATTTA 5640 

TGTCAGGCTT CTTGTTCAAT GAATATCTAA TTTCTCTOGT AAAATAGAAA TAAATTGAAC 5700 

CGACAATCCA GTAAATAGAC ATAAATACAG GATAAAAAAG CAAAAAGTTA AAAAATTGCA 5760

ATrtCTTTAC CTACCTTTCG TTAGTTAGGT TGTAAGCCAT ATGGTAATTG ATAGTATTTT 5820 

AATTTGCAAT AGATTGTTGT TATAATTAAA CGGAAATATT TGTAATTGCA ACTTAATTTT 5880 

CCTGTAACAT AGTGTGATTA ATTTTCA6TA GGGGGTTATA AAAATTGAAG GAXAAGATTA 5940 

TTGATAACGC AATAACCTTA TTTTCAGACA AGGGGTATGA CGGTACAACA CTTGATGATA 6000 

TAGCTAAAAG TGTAAATATA AAGAAAGCGA GTTTATATTA CCATTTTGAC TCGAAAAAAA 6060 

GTATTTACGA ACAAAGT6TT AAATOTTGTT TTGATTACCT TAATAATATT ATTATGATGA 6120 

ATCAAAATAA ATCX5AACTAT TCAATTGATG CTTTATATCA ATTCTTATTT GAGTTTATTT 6180 

TCGACATCGA AGAAAGGTAT ATTA6AATGT ACGTTCAATT ATCTAATACG CCTGAGGAAT 6240 

TTTCTGGAAA TATTTACG6A CAAATACAAG ATTTAAATCA ATCATTAAGT AAAGAGATAG 6300 
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70 



TGCTGTTTCT TGAAAGTTGG TATTTGAAAG CATCCTTTTC GCAAAAATTT GGAGCAGTGG 6420

AAGAAAGTAA AAGTCAATTC AAAGATGAAG TGTATTCGCT ACTAAATATA TTTTTGAAGA 6480

AATAATTTTT GTTACTAGTT TGTAATAATT AACTTACTTT TGTAAAAAAA GACATGAGAT 6540

TATTTTTTTA AATCTATATA AAGTTGACAA TACAAATCGA TATTGAGAAT ATTAAGATGT 6600

ATATGAATTT TATAAATTAA ATGCAATACA TTAATATAAA TATCAATTGT TGCAAAATAC 6660

GATTTGTTCA ATGATTTGAT AATATTATTC TTTATATTTG TGAATGGTTA AGTTTGTCTT 6720

TGAACATATT ATAAAAGTGT AATGTTCCCC TGAAAAGAAT AAGTTGTCAT CTAATTACAG 6780

GAAATCCGOV TAAATTAGAT GAAATGGAAA GTAATAAGTA ATAATTTATT GATAAGCX3GC 6840

TATGTGATGG TAAATCATGA CATAGGCGCT TTTTTTTATA AGTTAAAAAT GTAAATAAAA 6900

ATTATATAAA TTACCCACAT CTTTTTAAAA GGTGTGGGCT TTATTATCAT TAACCCAACT 6960

CACAGTGACG GGTTACGCAA GGTATTGAAT TACCGAGTAC GGGCAOGCTC GGTGTTGTAA 7020

AGAGCAAATA ATCAAGTAAT GATGATGCTT CTACTOGATT ATAAGAAAGC CATGATAGAG 7080

TACGATGGTA TCTAGTTTTA TTATTAATAG GTTTGGATAT TTAAAGTTGG ACAATATTAT 7140

ATCTTGTGCA AAAATATAAA TAAGTTATAC ATAATGGTAG AGAATCATGA TATAATTTTA 7200

AACGATAAAA TATTTATATA AATAATTAGA GaAAATGTAG TTGTGTATGT yTTGTGGtCG 7260

TTAAACTAGA TATAATTGTC CGATTTATAA AACATACATA ATGAATACaA TGATTGATTA 7320

TGTGGAGGAA ACCATGAmAG AmAAGTTTgA TTTAGTAAAA CTATTAAATA TTCTAAAGAA 7380

GAATATTAAA TTATTGCTTA TTTTACCGGC AATATGTCTT GTAGTAAGTG 7430



(2) INFORMATION FOR SEQ ID NO: 261: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4082 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 261:

ATTGTTACTC ATTATITITT CATATTCACA CAAATGATCT TGTTTATATT TAGCTAATTG 60

ATTTTTATCT AGCATTTTAT CCTCCTGCTG AGTTTOTAAC CTTTAATAAT TTATTTTCTA 120

TAAAAACTTA GTATTCCAGT TGCTTATTAT ATCATTGATG AAAGGCTGAA ATAAAACATA 180

AACTGTTCCG ACCATTAACG CTGTAGCTAA AGATAAGTCT ACAAGTCCAC CTGTTTTAAA 240

TTGAATCGGT GTCTTCACAT TAAACGGTAA gGAknAAAAT AATTTCACGC CTTTTGGTGT 300

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AGGCGTCTGA ATAATTTGCA ATAAAAATGC 
TATGATTGCG ATAAATAATA TTGAATGCGT 420

AAAGGTTCGA TGTCCAAAGA TCAATCTCAC AAAAAAACTA ATTACCTTAA ACCTTCTGCC 480

AATCTTACTT TGAGTGTGAC ATATATCGGG TAATAAGCTA GCTAGAGTTG CTAGAATGAT 540

AACCGTAACC GACGAAAAAA TATCCGTTTG AAAATATTGT GTTGTTAGCG CTCCAACGAG 600

CATGCCGCAT GAAGCATGTG TTTTACCTGT CATATTTGTT CTCCTTTAAT ACTCACATTT 660

TACCACATCC CTAACAAAAA CACGAACATA TTTTCGGGTT AAAATTCATT AGTATGACAC 720

AATTTAAAAA AGTATCACAT AACTCTTGAA AACGATTACA AAATCGTTTA TGATGTATTT 780

ACAAAATATT TAAAGGATGT GTTTGAATAA TGGCAATGAC AGTAAAAAAG GATAATAATG 840

AAGTGCGTAT TCAATGGAGA GTTGCTGATA TCAAAATTCC TACAAGTGAA ATTAAAAATA 900

TTACACAAGA CCAAGATATT CATGCAGTTC CTAAATTAGA CAGCAAAGAT GTATCTAGAA 960

TCGGCTCAAC GTTTGGTAAA ACGAATCGCX3 TTATTATCGA TACTGAAGAC CACGAATACA 1020

TTATTTATAC TCAAAATGAT CAAAAGGTTT ACAATGAATT AACTAAATAA ATTGTATAAA 1080

AaAATCATTC ATGGTGAGGG CTTCATGAAT GATTTTTTTA ATTGATTCAA CACCCAGCAT 1140

AAACAAATAC AAAAGGACAA CTGTTCCCAT AATTTTAACA GTTGTCCTTT TTCACATATA 1200

TTTATAACAA AAGATGTGCC ATCAAAGAAA TAATTGGTAG TGTAATGATT GTTCTAATCA 1260

AGAAAATCAT AAACAATTTG CCGATGCTTA CAGGAATCTT CGAACCAAGT ATGACGCCAC 1320

CTACTTCAGA CAAGTATATT AACTGCGATA TACTAAGTGC CCCAATAACA AAACGAGTTA 1380

TATCATTTTG TACACCTTCA ATTAATATAG AAGGTAAAAA CATATCGGCA AAACCGATAA 1440

TAATCGTTTG AGAAGCCTGT GCCGCTTCAG GTATTTGCAT TAACTCTAAA AATGGAACAA 1500

AAGGTTTACC CAATATGACA AAAAAGGGCG TGTAGTTCGC AATAATGGTA GCAATAGTAC 1560

CAATACTCAT TACTACAGGC AAAATAACAA ACCACATATC AATGACTGTT TTTAATCCTG 1620

ACTTAAAAAA GTCAATAACG CCCGGTGCTT TAATACCTAC TTCTGTTGCA GTATCAAAGC 1680

CATGTCTCAA TGCCGTCTTT CCTTCTGGCA ATGCCTCAGT ACGCGCACTT TCAGGTACCT 1740

CCTTAGCATA CTCATCAGGA ATTTTATTTA AAGGCCAAAT TCTTGGCATA ATGACTGCTC 1800

CAACGAGGCA GGATACTATC ACTGATAAAT AGAAAGCAAA AAATTGATTT TGCATGTGCA 1860

CTGTTTCAGC AACTACAATT GCAAAGGTGA TAGAAACTAC ACTAAATGTC GTTGAAATAA 1920

CTGTTGCCTC ACGACGAGAA TAATATCCTT CACCATATTG TCTACTTGTA ATTAAGACAC 1980

CAACAGTTCC GTCTCCAATA AATGATGCTA AATTATCTAC CGTCGAACGT CCTGGCAATG 2040

TAAATAAAGG TCTCATAACC GGTCTAAAAA TAGGACCCAA CATCTCTAAC AAACCX3TATT 2100
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AACTTGAGAA. CAATAATCCA CCCGTTTCAT CTGAGTAAAT AACCTTTGAA CCAATTCGTA 2220 

AAAATGTCAT CCATGCAAAA ACAACTGCTA ATATTCGTAA AATTAACCAA CCAATTCTAA 2280 

CGTTAAAAGC ATTGTTCATT AGCCCGTCAG GTTTCAATTT ATCTTTTAAA ATAGTTGAAC 2340 

AAATCAGAGT TATGATACCC GATAAAGTAA TTATCGTCAC AATTAAAAAT GGCATTACGC 2400 

CACCTAATAC ATCTTTAAGC ACGCCTGCTA AAAATGCCAC GGGCAACGTT GTTTGCTTCT 2460 

GTCCATCTTG TTCGACTGGA ATTGGTACTA AAAATAATAA GATACCAATT AAAGACATCO 2520 

TAATAAACTT AAGTCTCCCA ATAACTATCT CTTTCCTTGA AAAGCTATCC ATAAAATCAA 2580 

TCCATTTCTC TATGTATTCG TTTTAAGTAT ATACAGAATT CTATTCAGTT AACAAACATA 2640 

TTCCTTATCA TTCTATCTTT CAAAATGTTT ATGTATGCAA AATAATGAAT AATTACAGTT 2700 

ATTAAATATA CGCTATTTCT TGTAATTTTT CAAGATGAAT TCAAAAAAGG TTAAGTACAA 2760 

TTACTGATTT CGTACTTAAC CTTTTTTAAA CTCTAATCAT ATGTTAGTTA TTTCATTCTT 2820 

CGTAATAATA TTAAGAAGTA TGGTGCACCG ATAATTGCAA TGATAACCCC AACAGGAATA 2880 

TCCAGTGGCG GATGAATGCC ACGGGCTAAA CCATCTCCAA ATGTTAACAA TATAGCACCA 2940 

ATTAACCCCG ACATGATAAT AACGTGTAAT GTTTTATTTC CTATTAATTG TCTCGCAATA 3000 

TGAGGTGCAA TTAATCCTAA AAAGCTAATA CCACCGACAA CTGAAATTGC GGATCCTGCT 3060 

AATATTACTG CTAAAATTAA CAATAGCATT TTAATAGTTT TAACTTTTAA ACCGAGTGCG 3120 

GTTGCAACAG CATCACCTAG ATTCAATACA TCTAATTGAT AACTCAATAA AATGATGATT 3180 

GGTATCGTTA TTAAAAACCA AGGTAATATA GTATAAATAT TCGACATATC ATGTCCATAT 3240 

AGACTACCTG TCAACCAAAC AAGCGCTTTG TTTGCTTCCA GTGGATTTCT GATTAATAAG 3300 

AACTGCACAA TCGCCGTACA TATTGCGCCT ATTGCTAAAC CAATTAAGGC AAGCTTTGAA 3360 

C Crig 'AACAT CATATTTTGA AATTAAAAAT GATAAAAATA AACTTACTGC AAAGGCACCT 3420 

AAGAATGAAC CTATAGGTAA TACAAACAAT GGTGCTGTTG GAAAGGTCAT AATAATAATC 3480

ACAGCAGCTA AACTGGCACC TTTAGAAATA CCTATAACAT CAGGT6AGGC TAACGGGTTT 3540 

CTTATTACAG CTTGTATAAT TGCACCTGAA ATAGCCAAGC TACTACCGAT AATAATACCA 3600 

AGTAATGTTC TAOGTATACG ATACTCATTT AAAATAAAAT CATCTTGTGT AAAGATTCCC 3660 

TTAATAGCAT CAATCGGATG AATCATGACA GACCCTACAC ATAAACTTAT OAATATACTC 3720 

ACAATTAAAA GGATTGTGAT TAAACTATAA CXyiCGTATAA TTTTCGTTGT CATCATATTC 3780 

TTTTCACCCC TTTAATCGTT ATAAATAAGA AGTAAAGTGC ACCTACGAAT GATGTAACAA 3840

TCCCTACTGG TGATTCATAA GGATATGTAA TTAAACGACT TAATACATCT GATAGTAGTA 3900 
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AGCGTTTGAC TATATGCGGT ACGATTAAGC CAACAAATCC AATTGGTCCT GCCACTGACA 4020 

CCGACATACC TGTAAGAATA ATGACTAATA GTCCAATGAT AATTCTAACT TTATTTATAT 4080

TT 4082

(2) INFORMATION FOR SEQ ID NO: 262:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1145 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 262: 

TATTAGAAGG TCGTTCGGAT GAACAATTAA AAAATTTAGT TAGCGAAGTA ACTGACGCCG 60 

TAGAAAAAAC AACGGGGGCA AATAGACAAG CAATTCACGT TGTTATAGAA GAAATGAAAC 120

CAAACCATTA TGGTGTGGCT GGCGTAAGAA AGTCAGATCA ATAATTCTTC ATAAGATGCA 180 

TGCCAATTAA TTCTTTGAAA ACGAACAAGG CGACTTCTAT CTGaGTATGA TAGAAATCGC 240 

CTTGTTTATT TTTAATCTTC ATCTAAAAAO TCTTTAATAG CTTGTTTATT TGTGTTTTTA 300 

TTAATCTGTA ATGCACTACC ATCAGTATTT GTATTGACAT CyTCGTATGA GTTCTTGATT 360 

GGCaCAGTCA ATGACTTAAC ATCTTTTTCA CCTCGGATAC CaAAACTCAA ACCTGTTTGG 420 

AAAATCCCTG AATCAGGAAT GTTTGTATTC ACATAGCCTC TTAAAATACC TGCAACTTTT 480 

GGTAATTTAA CAACTGTTCT AAAATTAACC ATTTCTTTTT TCAATGTTTG CATCACTTGT 540 

55 TGCTGACGTC GCACGCGTCC GAAGTCACCT TCAGGGTCGT GACGGAATCT TGCATAACCA 600 

AGTAATTCTT TACCATTCAA CCTATGGTTA CCCTTTTTCA AAGATACACC AATATTTTTC 660 

GACATATCTT TTTCGACATT AATTGGTACA CCTTCAGGCA TTAATTCATC AATCATTTTC 720 

TCAAATCCAG TAAAATCAAC TACTGCATAA TATTCAGGAT TAATTCCTAA ATTTTTATCA 780 

AOTGTTTTTC TAAGTAGCTC TGGACCACCT AAAGCGTATG CTGAATTAAT TTTGTGTTTT 840 

CCATATCCTG GAATATCTGC ATAAATATCA CGCATGACAG ACATCATTTT CATCTTTTTA 900 

TTGATAAAGT CATATTGAAC AACCATGATA GAATCTGTTC TTGATTGTCC ACCTTGTGCT 960 

TTATCTGCAC CGAGTACAAG AATAGAAATT nTACCATCAT TTTTTACTGG TCCATTAAAT 1020 

TGATGTACTT TAACATCTTT CGCATGTTTC TTGGCATATT CTACACOGCT ATTGTAACTA 1080 

TGTACAATAT ATACAACTAA TGCCGATAAG TAAAATTACA ACAATCAGAA GAATGATAGG 1140 

TAATT 1145 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7075 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 263:

TATGGCTCAT CATTAATGCA CGTATCGGGT AGCGTTTACC ATTGATAAGT GCTTCATGTT 60

TAGCACGAGT TCTTAAAATT CCATCGCCAT AACCGATATC AACTACAGCT AATTTTGTAT 120

TGTrTTTAGT CACTTCAAAG GCAAAGCTAT AACCGCAATA ATCACCAGCT TGTACTTCGC 180

GCACTTGAAT AACATGTGCT TTTAAAGTTA ATGACTGAAC TATATCATGT TGATTCAGTG 240

AACTATATGG TCTTGAACCG TATAAOGCAA TACCTACACG CGCATGTGTA TGGTGGGGTA 300

GTAATAyyyg TCCTTCCCGA TAAAAACTCG CACTATTTTG AGCATGGATT AGGTCGAACT 360

GATAACCTTC AGATAAAAGT GCTTCAACAA TTTCCATCCA TTGTGAACGT TCAACATTAT 420

AAtCTGACAC ATCGAATTCA TCAGCATATC CAAAATGGGk CCATAAACCA CTAATAATCA 480

TTTTTGCATT TTGATTATGA TGGTGATCTT TCAATACTTC TTTAATTTCG TTTAAATCTT 540

TAAATCCAGA CCGATGTAAT AAATTTTCAA ATTCTAAGTG AACATGAATA CCAGCTAAAT 600

CATTTTTATG GTTATAGTAA TATGTCAACG ACGGCAAAGT CATGTGTATT TGATGTTCAC 660

GGACTAAATC AAACTCGTAA ACTGCATTCA TTAAAAAGAT TGTTGCATCT GGAGCAAGTT 720

GTCTAATTTG AATTGCTTCT CGTAGTGATG TTGTGCTAAA TGTATCTATA CCTGCATGGA 780

TAAACTGAGT TACAGCAAAT TCTAGGTCAT AGTGATATGC ATTaTTTTTA ACAACTGCCA 840

TTAATGGCTG ATTGTTTTTG ACTGTGATTG CATTTTGTAA AAATATTTTC TTATTTACAG 900

ACCfl3<5TTGC TGTCAATGTA TTACACCTCT TTGTAATTAT TTAATAAATT TTCGTAAAAA 960

TTAACCACGT TTATTAACAC TTTTTCATCA AAATTTAAAT GTGATGTGTG CAAACCAGTT 1020

ACAAAACCTT TATCTTCATT TCGTGTTCCT ATAAAAACAA AGTAA6CTGG AGCTAGTTGT 1080

TGACCATAAA AACTAAAATC TTCCCCAAAT AAGAATGGCG TTGGTTTGTC ATAGACATTT 1140

AAATCAGCTT TTATTAAGGC GTCCTCTATT TGAGTACGTA ATTTCGGACT ATTGATTGTA 1200

GGGGGATAAC CTTCTGCAAA TTTAACTTCA CAATCTACAT TAAACAGAAG CTTGACACTT 1260

TCTGCTATCT TGTGCATTTG ATTTTTAACG ATTGTTAAAT CATCAATATC ATATGTACGA 1320

ATAGTACCTT CTAAATAGCC ATTACTTGGT ACAGTGTTAA TCGCTTCACC AGCTTTAAAA 1380

TGACCAATAT GAACAATATT TCGTTTCAAA CCGTTAAGGT GAAATTGTTG AATTTOTGAT 1440




ACATGACTTG ACAGGCCTGT TAAGAAAAAG CGATACTCTG TTGCGCTGGC CGTAATTTCT 1560 

TCATCTCTTA TCACTGCAAT GCCTTCATCA GCAAATGGGT TAACATGAAT ACCAAATACC 1620 

GCTTCAATTG GATACTTATC AAAGGCACCG GCTTTTATTA ATCGATTTGC ACCGCCACCA 1680 

GTTTCTTCTG CAGGTTGGAA AATGAAAACX5 ACATTTTGCG GTAATTGACC TGCATCTTGC 1740 

ATGTCTTTGC AACGTTGTAC AAAAAGCATT AATGCAGTTG TATGACCATC ATGTCCACAA 1800 


GCATGCATCA CATGATCAGA TTGACTGCGA TAAGGCACAT CATTTTCCTC TAAAATAGGT 1860 

AACGCATCAA TATCAGCTCT ATACGCTATC GTATGTGAGC CATTACCTTC TAAGTATGCA 1920 

ATGACGCCAG TTTCCAATGG GCAATC6TAT TTAATATTTA AACTATCTAA AAACGCTTTA 1980

ATATAAGCAG TTGTTTCAAA TTCATGTAAG CTTAATTCAG GATGTTGATG TAAATGACGG 2040 

CGATGTTTCG TAACAAATTC TAATTCATTC ATAATTATCA ATCCTTTGTG TTAAATTACT 2100 

ATATAAATAG TGTAACGtAT TTCGAAATTT GTGATCATAA GTTTATTCAA TGCTAAACAA 2160

TAAGGTTGAG ACATAATCGT ATCTCAACCT TGAAATTATT ATACGTTGAC GTCACTAGTC 2220 

ATTCAGTTTT CTTAATGCTG CTACAATCTC TTTTTTAGTA TCTTGTACTT CAGAAGCTTG 2280 


CTTAATCACT TTTGCAGGTG TACCAGCAAC AACTGCACCA GCTGGTACAT CTTGTGTCAC 2340 

AATCGCGCCA GCTGcAACAA TAGCACCTTT ACCAACACGT ACACCTTCTA AAATAACTGC 2400 

ATTTGCACCG ATTAATACAT CATCCTCGAT TATAACCGGT GAAGCACTAG GGGGTTCAAT 2460 


CACACCTGCT AATACTGCGC CAGCCCcTAC ATGTACATTT TTACCAGTTG TAGCACGACC 2520 

ACCGAGAGTA GCATTCATAT CAATCATTGT ACCTTCGCCA ACGACTGCGC CAATATTAAT 2580 

TGTTGCGCCC ATCATAACGA CAGCACCATC TTCAATAATG GCTTGTTCTC TAATAAACGC 2640

ACCTGGTTCA ATTCX3TGCAT TCGTATTTGT TAAGTCTTTT AATGGAATAG CAGAATTGCG 2700 

ACGATCCATT TCAATTTCTA TATCTTCGAA TTGACTACCA TATGCTTCGT AAAAAGGTTT 2760 

CCAATCATCC GCTTCACAAA AGATTACTTT AGATTGTTCT GAACCAAATA CTTTAAAACT 2820

TTCTGGATAT GTGATGCCTT CAAAATTACC ATTTAAATAT ACTTTTATTG GTGTAGACTT 2880 

TTTAGCATCA CTTATATATT 6AATAATTTC TTCAGCTGTT AAATGTTGTA CCATAAAATA 2940 


ATCGATCTCC TTTAATATGT TTATAAGTTG TCAAACGTAT AAAAGCCGTT TGGTTTATTA 3000 

ACTAAGCGTT CTGCTGCTTG TATTGCACCA TTCGCAAAAA TATCTTTTGA TTGTGCACGA 3060 

TGCGTGATTT GAATCGTTTC ATCAGTGCCA GCAAATAGAA CTTCATGTTC ACCGACAATC 3120 


GTACCTCCAC GAATAGAATG TATACCAATA TCTTGTGGCT GGCGTTTTTC ATTTAATTCA 3180 

TGTCTATCAT ACACAGGTGT TACATTTTCT TTCAAAGATA CGATCACATC ATACAATTTT 3240 
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TCGAAATCAT CAAGTAGGGG AACAGCAGCT GCTAAAATTT TAGTCAATGC ATGAACGCCA 3360 

TAACTCATGT TCGCGCTGAA AAACACAGGC ATATTTTGAC TCAATTCATC TAACTTATTA 3420 

AGTAGTTTTT CTTTCTCGnC CAGTgTTGCC ACAACTAATG GCAAATGAAA ATCTTCATCT 3480

AATAAAGGGA AAAGCAGATT TGGATTTGAA AAATCTATTG CAACATCGGC ACCTTTAACA 3540 

TCTGCAATAT GTTGATATTG TTGATATGGC GTTGTTGCTT TCGGTGTATT TTCAATGACC 3600 


CCAAOGATTT CATGTCCTTT TTCTTCTGCT AATCTAGCAA CGCGTTGATT CATTGCGCCA 3660 

TAGCCAATTA GTAATATTTT CACTCATTTT CACCCGCTTT AAATGTGTCA TATGTTTCAC 3720 

GAAGCACTTT AGTATCTGTA TCTTCTAGGC TAACCAATGG TAGACGTAAT TCATAATTTC 3780

CAAATCCTAA ATAACTTGTT AGAGCTTTAA TAGGAATTGG GTTAATATCA ACTGATAAAG 3840 

CTGATAACAG TGTGCCGATT GGTTTAAATT GATCTTGAAT ATCTAATCCA CTTTGTTGAG 3900 

CATCGTATAA CGCTTGAAAT TCTTTAGGAA TGACATTGGC AATAACAGAG ATAACCCCTT 3960

GACCGCCACG TTGATAGTAT TCGACGACGT TGTCATCATT GCCACTATAT AATGCAAATG 4020 

AATTTGTATC AATGCGCTTT TTCACTTCTT CTAAATACTC AAAATCATTC GTAGCATCTT 4080 


TTAAAGCAAC TATATAAGGA TGTTGACTTA ATATTTCTAC AGTTTCTGGT TCAATTGTCA 4140 

TGTTCGTTCT TGAAGGAACA TTGTACAGCA CGACTGGTAA TTTCACAGCA TCTGCAATCG 4200 

CTTCAAAGTG TTTGACTAAA CCACGTTGGT TCGTTTTGTT GTAGTAGGGC GTAATTAACA 4260 


TAATTGCATC AGCCCCTAAG GCTTTAGCTT GGATTGAAGC TTGGATTGAC TTTTCAGTAT 4320 

CATTAGTGCC AGTTCCTGCT ATGACAGGAA CACGTTTATC TACAAGATCA ATAACTGTTT 4380

TTAGAATGCG TTCTTTTTCA TCTGTTGTTA AAGTAGGGCT CTCAGCAGTA GTTCCATTAA 4440

CGATGATTGC TTGGGCATTA TTTTCTAGTA AAAAATTAAC GTGTGTTTTC AAAGCTTCAA 4500

TATKACTTT GTTArTTGTA AAAGGGGTTG TAAGTGCaAC SCCAACACCC TCAAATAAAT 4560 

GTGTCATTTT AATTCGCTCC TTTTAAACGC ATAACTTGTT CCAATACTTG TACAGCATTT 4620

AATGCAGCAC CTTTTAATAA ATTGTCTGAT GTACACCATA CATGGAAAGT ATTTTCTAAT 4680 

GAATCATCTC TACGTATACG GCCAACAAAC ACTTCATCTT TATTAGTAGA ATTGATTGCC 4740 


ATTGGATATT CATTGTTCTC TGGATTGTCT ACTAAAACAA CGCGGTCATC TTGATCAAAT 4800 

AACGCTTTAA TATCTTCTGC TGTTGTTTCT TTGTCAAGCG TTACATCAAT TTCAACACTA 4860 

TGACTATCTT GAACAGGCAC ACGT6CGCAT GTTGCTGTTA CTTTTAAGTC TGGCGCATTT 4920 


AAAATTTTTC TCGTCTCATC AATCATTTTT TGTTCTTCTT TTGTATATCC GTTTTCTAAA 4980 

AACACATCAA TATGCGGTAA CACATTATTA TAAATTGGAT GTGGATATGC TTCTGGTGCT 5040 
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TGATATGTTG TATATGCCAC TCGTTTTAAA CCATAAGCAT CTTGCAATAC TTTTAGAGGT 5160 

ACAACAGATT GAATCGTAGA GCAGTTTGGA TTGGCAATGA TACCTCTTGT AAATGTAGGT 5220 


TCATTGACTT CCGGAACGAT TAAATCAATA TCTTCTGCCA TACGCCATTG ACTTGAATTG 5280 

TCTATAACGA TTGCACCAGC TTTTTCAAAA AGTGGGGCAA AGTGTTCGCT TGTACCGCCA 5340 

CCAGCACTCA TTAATACATA ATCGAAATGT TCACTTGCAC GAGCATCAGT TAATTCTTGA 5400 


ACTCTATATQ TTTTTCCTTG AAATTCAACT TCTTGCCCTG CAGAACGTGC TGATGAAAAT 5460 

AATACTAATT CATCGAAAGG AATATTTTTA CGATTTAATG TCTCCAACAT TTTTGTACCT 5520 

ACTAATCCTQ TTGCACCCAC AACTGCTAAC TTTGTCATAA CTTGTCACTC CATTTTATAA 5580

TAATTTCCaA TTTTTAGAAT ATTTTAACAA TCATTTTACC ATTAAATGTT AAATGCX»TCA 5640 

TATAGTTTTT CTACCGCTTG TTGCCCATTA AAATCATCAA TGACGTATGA AATACTTATT 5700 

TCAGATGTTG TTGTTTGGTA GAAAGGTATA TTATTTTCAA TTAATGTCAA AAATGCTTTT 5760 

GATGCCACAC CTGACATATC ACGCATGCCT GAGCCAATTA ATGAAATTTT GACATAATGC 5820 

TCATTGATTT TATAAGCTAA TGCTTCATAT TGATTCTTTA ATGTTTCAAG AATCATAGAA 5880 


ATTTGATGAA AATCACTATC TTTAATCGTG AAGGATAGTT GTAGCCCATC CAAGTTGACG 5940 

ATTTGTGAAA TCATATCAAC ATTTACAGCA CCTTCTTCAA GTTCCGTAAA TAGTTGGGTA 6000 

AGTAGCTGAT TGTCAGGTAG GGGATAACTA ATTGTTACAT GCATCATATG TTTATCCAAA 6060 


GCCACACCAG TAACTGCTTT TTTCTCTAAT ATTTCTTCAT TTGACATAAT CCATGTTCCT 6120 

TTCACGTTCG ATAAAGTTTT TCCTAAATAT AAAGGGATAT TATAGTTTTT AGCTAATTCA 6180 

ACACTTCTTG TTTCAAGTAC ACCAGCACCT AAAGCGCTCA TTTCCATCAT TTCTTCATAT 6240

GAGACGATGT CTAGTCGTTT AGCCTTTGGT AAAAGTCTTG GGTCAGTGGC ATACACACCA 6300 

TCA;K:GTCGG TATAAATTTC ACAAGGTATT TGATTACTAA CAGCAAGTGC CACAGCGGTC 6360 

GTATCAGAAC CACCTCTGCC TAAAGTTGTT AATTCCTGAT GTTCATTGAT GCCTTGAAAT 6420 

CCAGCAACTA CTAAAATATC GTTTTCTTGA AAGGCTTGTT CAAATGTTTG AGGATTAATT 6480 

TGAGCAATTT TACTTTTTAA ATGATGGCCA ATGGTTTTAA TACCCGCTTG ATAGCCAGTC 6540 


ATTGCTTTGG CATTCATACC GATATCATTT AATACCATTG ATAAATAAGA TACAGTTTGT 6600 

TGCTCTCCGG TTGTCAATAA TAATGCCAGT TCTTGTTGTT TTGQTOCTTT AGTCAAGGTT 6660 

GATACATTCG TCATTAATTG ATCTGTTGTG TTACCCATAG CACTTACAAC GACAATTAAa 6720 


TTGTTCATCT TGATTGACTC GCTCCTTTAA CATTTCAGCG ATCCTTTTTA TTTTTGTAAA 6780 

ATCACTGACG GATGATCCGC CAAATTTCAA CACACTTCTT GTTACCATAT AATCCTCCTA 6840 
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TAATCTATAT ACAAGTGATG CACTCCATTA TTTTTAAATA ATGACAAACT CTCAGCTCTT 6960 
AACCAAAAAG TCCAACAAAT TATAACTGCT ATTATAATTG CTTCGGCATC GCACCCTTTC 7020 


AAATTTAGCT GTTAGCAGAC AGTAATCTAa ACTTTACTCA TGATTGATGC GCCTC 7075 

(2) INFORMATION FOR SEQ ID NO: 264: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5171 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 264:
AGACGTACTT TGTGATTmCG AAGyrCGTAC TGmAGCACTT GTCGACGTTG ATGTACTTGT 60 
TGAACCTGAT TGACTAGTAC TTTGTGATAA TGACTTACTA TCAGAATCAG ATGTACTTTG 120 
TGAATCACTT AATGATTCTG ATGTACTACC TGACTGAGAC GTGCTCATTG AACTACTTAC 180 

GGACATTGAT TTACTGTCT6 ATGCAGATAA TGACCCACTT GTACTGATAG AGTCACTTAC 240 


TATCTCTGAA GTACTCATCG AGTCTGATGT ACTTGTTGAG ACACTTTGTG ATGCTGCTAT 300 

GCTTAGTGAT CCAGAAACAG AACCACTTGT GCTCGTCGAA TCGCTCAATG ATTCTGATGT 360 

ACTCATCGAT TTTGAATCAC TTGTACTTAA TGATATTGAT GTACTTTGTG AATCTGATTT 420 


GCTTGTTGAC GCACTTTGAG AGTTGGCTAT GCTATTTGAA ATACTGATAG AGTCCGAGGT 480 

GCTAGCTGAC TCGCTCAATG ATGTTGATGT ACTAATTGCA TTCGATGTAC TGTCACTTAA 540 

TGATGCTGAT GTACTAGACG ACCCTGATAT ACTCGTTGAT AAGCTTTGTG ACTTAGACAA 600

GCTTCCTGAT GTACTCATAC TTAATGAGTC ACTGAGTGAT GTTGATGTAC GCAATGAATC 660 

AGATSTACTT GTTGATAGAC TTTCGGATTT TTCAGTACTG CTAGAGTTTO AAATAQAATC 720 

GCTTAATGAT GTTGATTTAC TAGCTGAATC CGACATGCTT GATGATACAC TTTGTGAATT 780

CACTAAACTT GTGCTTGTTG AGCTTGATAC ACTATTACTT TCAGATGTGC TTAATGACTT 840 

AGATGCACTC ACAGAATCAG ATAGGCTTAC ACTTGTCGAT TTCGAGGTAC TAGCTGATGT 900 


AGATACCACA ATCGATCCTG ATGTACTCGT TGATGCACTT TGTGAGTCAG CTTTACTTGT 960 

TGACACACTT TGAGATTGTT GTGTACTTCC TGATGTTGAT ACGGAATCAC TCATGCTATT 1020 

TCTTGTTACT TCATATTTAA AAGTTGTCGT CGTTTTGTTA CCGCTCGCAT CTGTAGAAAC 1080 


GATTGATATA GTACTTGTAC CAATGTTTGT TGGTGTACCA CTAATAGTAT TATTTGTACT 1140 

ATCAAATGTT AGTCCGGATG GCAATCCAGT CACTGTATTC GTCAC06CAT TTCCACTGTT 1200 
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ATTGGTGTCA CTGTTGGTGC TGTCGTATCC ACAACATTTA TTGTAAAAGT TGTCGTOGAT 1320 

TTGTTATTTG CTTGGTCAGT AGACACAACT GTCACTGTTG ATTGACCAAT TTTTGTTGGT 1380 


GTCCCAATGA TTGAATTCGT TGCACTATCG TAACTTAATC CGCTTGGTAA TCCTGTAACT 1440 

GTATTTGTCA CAGTCCCAGT ACCATTATCC GTTGTAGTCA ATACAATAGG ATTCATTGTT 1500 

TTACCCACTT CTATGGTTTG ATTGCCTACA GTTACAGTTG GTGCTTTTAC ATCAGTAAAA 1560 


TAATATGTCA CTGATTGTCC AGCATTCGTC ATTTTTACAG TTTTATTTGT ATCATTATAA 1620 

GTTGACGCAT ATGAACTATC GACGGACGTG TAGTTATATC CTTTAGCAGT CAATGCAGAT 1680 

TGCTGaTTAT CGATTGTCAC GACTTGATCA ACATTTCCTG AATATGTTTT TGGTGGAATA 1740

ATATCTTTAC CTGTTGTTAC ATCAACXITAT CTCACTT6TG TAnCAGCAGA CTCTGTATAT 1800 

TCGAATGTTC CAAATTGTAC TTGTTGTAAA TTTGTCGCGC CACCTGTTGA GGCTGTCATT 1860 

GATAATGAAA AGTTGGTCGT ACCACTTTTC GCAATCCAAT CTGAAATATT ACGTGTCCAT 1920

GTTTOACCTG CATATTTGAC AGTCATAACC TTTGTATCAC CATTATAGTT AATATCAAAA 1980 

TCTTGGaACG TGTTATTTGT AGGTTGaACA TTTAACTTCG CAGCATTATC AGCTGTTGAA 2040 


CTTGATGTAT ACGTTGTCGC AACACCATAA CTATCTGTTG TTACAAATGC ACCAAACGCA 2100 

CCTCCACCAG CTACATTAGA TGGGTCAGCA TTCGCCTTTG CAGCTGAATT TGGTTTAGAT 2160 

GTATTGTGAT ACGTATCCAA TTTGAAGCCA AATGCGTTAC TTAAGCCACC AATACCTACT 2220 


GCGGCACCGT TTAACCCTGT TTCACCTAAT ACACCTGGTG AAAAGGCAAA ACCGATACCA 2280 

TCTCCACCAT TTCCATGCCC TTCATATTTG TTACCTAAAT TTACTTTTCC AGAAAAATGA 2340 

AAACTCTTAT TAGAGTCAAT ACGTGTTCCT AATGTAATAG CACCTTTTTG GCTGTATGCA 2400

TCCTGTGTTA ACGTCACAAT ACCGGTACTT TGATCATAGG TAGCATTACC TGACGTTGTC 2460 

ATA'AtTGTT TTAAGTTATC TTTATTAACT GTAATTGTAT TAGCAGTTAC TGCGGTTGTC 2520 

GTCgCTGctG ACGCAAATGT TGACATAGCT AAGCGACTGA AAGTTCGAAG TTTTACTGGT 2580

GCGGTGCTAG TTGACGTTGT GCTAGTTTTG TTTAAGTTGA CCGAAGATGG CGTTGTGCTT 2640 

TGTCAAGTGT TATTTGATGC AGTACTTTGA TTTGTTGATG TATTAATTGG TTGTTCTGTA 2700 


CTTGAAGTTG AAGCTACAGA TTTAGTATCA GAACTTGATG TAGTATTCTT TGAGGATGTT 2760 

GATTCTGATG TAGATGTCAA TTTCTCTTGT TGATTGCTTG TACTATTAGT TGTCGAAGTG 2820 

ACCTTTTCAG ACTTTTCACT TGAGACTGTG TCACTATTTG ATGTTTGTAC CGAACTACTA 2880 


TTTtTCGTTA CACTTGTGGA ATCGGCTGTT GATGTTGATG CTTCGATTGT CGTTGAGTTT 2940 

TGATTACCTA CTGTTTCACT TTGTGTGTTT AATTCAGAAG TTAATGGTGC ATCAGAAGCC 3000 
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GTTTTCAGTC CGTATCCCGT CATTTTTTTA CTAATGCTTT GATTATCTTG ACTCACTAAA 3120 

CTATGACTAA TAAATGGTAG CCCCATAATT TTGAACATTT CTATTTCTTT AATTCCGGAT 3180 


TTTACCCAAT TTTTTCCAGA TTTATAAAGT CTTACTCTTG TTTTTTCGTT TGCTAAGCTG 3240 

TCATGAAATG CTTTCTGTCT TTTACTCATG TAATAACTCC TTGTATTATC TTTACATTCA 3300

TTAGATTATA ATATATGCCA CTATTCAATT TAATACAACT CTTTTTTGAT ACAAAAATAC 3360 


TaVTTTTGTT AAAATTTGTA AAAATTCaTT TTTATTCGTC TAAATGTAAT CX3TTTTCATA 3420 

TTTTTAAAAT TACTTTTTCT CGTTTATGCQ TATAATCTTT TTTTATATAA ATTTGGCTAA 3480 

TTGGCTTTAT GTTTAATCAT TATAATTGTT TCGTTTTTAA AATAATTATT GTATTAATAT 3540

ATCTATACCA TCCACCTTTT ATTTATAAAT AGTTAATTTA CAACTAAACG ATAAATATTA 3600 

TATGCAAAAT ACATCTTTAA TATTAAAGTA ATACCAATAT TTTTTCAATA AACCTAGTGT 3660 

AATATATGTG TAATTCTAAA AGATTCTTCT TTAAAAATAT AAATACCACQ ACATATTGCT 3720

TTAACATTTT CATTTATAAA GCGAAAAAAT GCATCGCTAC TAAGTTGAAT GTTTAOTAAG 3780 

GATGCATTGA ATTCACTAAA ATGATTAAAT TACTTATATC TTTTCATCTG ATTGATTATC 3840 


GAAATTTCTT CCTTCTAAAC CTGCTAACTC TTCTTTAGAA GCTGCAGGTG CTTTCATTTC 3900 

AAATATCTCA TTCACTACTG TGTAATCGTA ATATCCTAAT CTGGCAATAG GTTTAATCGA 3960 

CTTAATGTCC AATTTACCAT TATCAAGAAT AACCTTATCG TCAATATGAA CTTGGGCAAC 4020 

TCTTCCTATA ACAATATCTA CGGTAGATAC TGGATCTCCA GTTGGAATAC GAATCGTTTG 4080

AACGTACTCA CATTCAAAAT GAACTGGCGA TTCTTTTACA CGATATCCTG GAGCTTCTAT 4140 

ACATTTTTCC TTTGTTACAC CTGCAAAATT AAATTCATCC TCTTCTGGTG GCAATGCTTT 4200

CGATGATAAA TTAACTGCTT CTCTTAAATC ATACGTTGCC ATATTCCACA CAAACCAACC 4260 

TGTCTCTTCA GCATTTTTCA CTGTATCTTT ACGTTCGTGA TCACCAAGAA CGGATTGATT 4320 

TGCT6CGAAC ATAACCATAO GCGGATCCCA AGTTAAGTTT TGATACTGAC TATAAGGCGC 4380 

TAAATTATCT TTCCCATCTT TCX5ATACAGT AGAGATCCAC CCTATTGGAC GTGGTACTGT 4440 

ACTACTTTTA AATGGGTCGT GCGGTAAACC ATGACTTCTT ACACCTTGTT TTGGCGAATA 4500 


ATTCATACTA TCTTCACCCC TTATAAGTAA TTACATTTAA GGTTACGCCC TCTTTACATA 4560 

AGCGTCTAAT ATAAATAAAC AATTTATTTA TAAGTAGAAA CTATATATGA CGTGGTTOCT 4620 

TATAATTTGC GTTCTTGATT CGAAAAATTC AGATAAGGAT TTATACAATT AATATTTATG 4680 


ATATCTTTTG TAAATTTAAT TAATTATAGT TACTTCAATC ATGATTAGTT TATAATAATA 4740 

AAGTGAAATT GAAAAAGACA GCTATTATGC GATGAGCGAA AAACTTCAAG TAAAACAAGA 4800 
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TAAGTTCAAA AAAGAATTCA AACCTGTTAT GCACTTAAAA GGTGATGCAT TCAATCAACA 4920 

GTTACAATCT TTGATTAACA AATATCCACA AATACAAAAA AATATGAAAT CAGAGTTCAT 4980 

TGCTTATTAT GATAAAGAAA AAAATAGAGA AACAGTAAAA AACTATGCTT GGAACCTTCA 5040 

AAAATCTATA AATGACATTA TGCAATCATA TCCTAGCACA AAATTTGTAC AGTTTTATAA 5100 

AAGATGATGT TTCCCCGTCA ATGGTAGATG GAAATGGCCG TTTAAAATCG GGATACTAAT 5160 

GTATTTCCAT C 5171 
(2) INFORMATION FOR SEQ ID NO: 265: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3589 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 265:

CTACACACTA AACCTATTTC AGTTATGGGT GGTACAGTGA TTCTCTTTTC ATTTTTAATA 60 

GGTATTTGGA TTGGTCATCC TATTGAAACA GAAATCAAAC CACTTATTAT TGGTGCGATT 120 

ATTATGTACG TACTTGGGCT TGTAGATGAT ATCTACGATT TGAAACCGTA TATAAAATTG 180 

OCTGGTCAAA TTGCCGCTGC CTTAGTAGTT GCTTTTTATG GTGTGACTAT TGATTTTATT 240 

TCGTTGCCAA TGGGTACAAC GATTCATTTT GGATTTCTTA GTATTCCAAT TACTGTGATT 300 

TGGATTGTTG CTATTACAAA TGCAATTAAC TTAATTGATG GACTCGATGG TTTGGCGTCG 360 

GGTGTTTCkG CAATCGGACT CATTACAATA GGGTTCATTG CAATTTTACA AGCTAATATT 420 

TTCATAACGA TGATTTGTTG TGTTTTATTA GGCTCTTTAA TTGGGTTTTT ATTTTACAAT 480 

TTCCATCCTG CCAAAATATT TTTAGGTGAT AGTGGGGCTT TAATGATTGG ATTTATCATC 540 

GGATTCCTTT CTTTACTCGG ATTCAAAAAT ATTACAATTA TTGCATTGTT CTTCCCAATT 600

GTTATCTTAG CAGTTCCATT CATTGATACT TTGTTCGCAA TGATTCGACG TGTGAAAAAA 660 

GGGCAGCATA TAATGCAAGC TGATAAATCG CATTTGCATC ATAAACTATT AGCTTTAGGC 720 

TACACACATA GACAAACAGT ATTATTAATC TATTCAATCT CTATTTTATT TAGTCTTTCG 780 

AGCATTATTT TGTATGTATC GCCACCATTA GGTGTTGTAT TAATGTTTGT ATTAATCATA 840 

TTTAGTATTG AATTAATTGT TGAATTTACA GGATTAATAG ATAACAACTA CCGACCAATA 900 

TTAAATTTAA TTAGTCGTAA GTCATCTCAT AAAGAGGAAT AGGGAATGAA AGCATAGCTG 960 

TATGGGATAA TTTGTATTAT ATGGCTTTAC TCTTTACAAT TTTTTTGTAT TAAATTTCAA 1020 
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ATTTACCGTC TTATGATAGT GCTTTTTATT 
TTTATTCAGT TGGTATATCG AAAGGTAACT 1140

GCTTTGGAGT TTCTTCAGTC AAATCGAAAT TTCCTGCAGT CATTTGATTT AAAAAGTTAA 1200

TAAACGCTTC ATAGTCACTT TTAACGACAT CGATATAGTA GCTTACCTTA TCAGTGTAAG 1260

TTTGGTTTCT TAACATAAAA TGAGTTGAAG CTAATTCATA TTCAAATTTA CCAGTTTGAT 1320

CATAATTCAG TGTTACTATA CATGGTACTG CTTCTCGTAG TTCGACACGC CCGATATCAT 1380

AAATGACGTC TCTAACAGCA CCGCTATAGG CGCGAATTAA ACCGCCACCA CCTAATTTAA 1440

TACCACCAAA ATATCTTGTT ACTACGACAC ACGCATTATG aACATCXSrGC TTTTTTaATA 1500

TGTCTAACAT TGGGaCACCG GcAGTTCCtG TCGGTTCACC ATCATCATTC GCnTTTTGAA 1560

TATTCATTtC AGGTCCAATA GTATATQCAG AACAATTATG AGTGGCATCT TTATGTTCTT 1620

TTTTTATTGC AGCAATAAAT GCTTTaGCTT CATCTTCATT TTGAACAGGT TTGATATGAG 1680

CAATGAATCT TGATTTACTA ATCACATTTT CAATAATGTG TTCTTTTTTA ACAGTAATGA 1740

TATTTTGTGT CATAATAACT CCrTAATTCA TAAGCTTAAG ATTATTTAAT CTTCATTATA 1800

CACTGAAAAT GACATGACTA TAAATCGTTT GATTGCCATT TTCTTTTTAA CTGAAATATT 1860

GTATCATTGC TATGAGTATA TTTTAGGAGG ACGACTATGA AAATTGCTGT GATGACCGAT 1920

TCTACAAGTT ATCTGTCGCA GGACTTAATC GATAAATATA ATATTcAAAT AGCGCCATTA 1980

AGTGTGACTT TTGAAGATGG CAAGATTATA CCAGAAGAAA AAGTTCGTAC TAAAAAGCGT 2040

GCCATTCAAA CATTAGAAAA GAAAGTAT7A GATATTGTAA AAGACTTTGA AGAAGTAACT 2100

TTATTTGTCA TAAATGGAGA TCATTTCGAA GATGGTCAAG CGTTATACAA AAAGTTACAA 2160

GATGATTGTC CTTCAGCTTA TCAAGTAGCA TACTCTGAGT TTGGTCCAGT TGTTGCAGCA 2220

CATTTAGGTT CTGGTGGATT AGGTTTAGGC TATGTTGGCA GAAAAATAAG ATTAACATAA 2280

TTATAAAATT TTAATAAAAG AGTCTATATT GTAATTGGAA ATTATCTCTC GTATACATGG 2340

CTTTAAATGT TCATCATTTG AAAGCCAAAA TGCTAAAGAT ATAAGAAAAT CATTATAATA 2400

TTAGGCTCTT TTTTACGTTG AAATGAGGTT TTAAGCATTA AACATTACGG GAAATTAATT 2460

CATCCTCATA CTTCACTTAC TAATGAAAAA ATTAAAAAA6 AAGTAACAGG TGTCATCAAA 2520

CAAAATTCAA ACTATTATTG TGTTCAATGT GAAAGTACAA ATCCAAAGCA TTTTTATCAG 2580

TATGATTCCT CAGTACATTC CAAGAAAATT GTATATTGCA GAAATTOTAT ATCACTGGGT 2640

CGAATGGATA ATGTAACAAG ATATAAAATA ACAGAGAGTT CGCAAAGTTC ATCACAAGCA 2700

TATTATCATC TCTCATTTGA ATTGTCX3GAA CAGCAGTCTT ATGCCTCAGA ACATATTGTT 2760

CGAGCCATTA GAAAGAGACA AACGATTTTG TTATATGCCG TAACAGGTGC AGGTAAGACA 2820
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TCACCACGTG TAGATGTTGT TGTAGAAATT AGTAAACGTA 
TTAAAGACGC ATTTCTTAAT 2940

GAAGATATAG ACATACTACA CCAGCAATCA AGACAACAAT TTGAAGGGCA TTTTGTTGTA 3000

TGCACAGTGC ATCAACTTTA CCGATTCAAA CAGCACTTTG ATACTATTTT TATTGATGAA 3060

GTCGATGCCT TTCCTTTATC AATGGATAAA AATTTACAAC AAGCATTGAA GTCATCTTCT 3120

AAAGTTGAAC ATGCAACAAT TTATATGACA GCAACACCAC CGAAACAACT TCTGTCAGAG 3180

ATTCCCCACG AAAATATAAT TAAATTGCCA GCTCGCTTTC ATAAAAAATC ACTTCCAGTT 3240

CCTAAATATC GTTATTTCAA ACTTAATAAT AAGAAGATTC AGAAAATGTT ATACCGAATT 3300

TTACAAGATC AAATTAATAA TCAACGTTAT ACACTGGTGT TTTTTAACAA TATAGAAACA 3360

ATGATTAAAA CATTTTCGGT TTATAAGCAG AAAATTACTA AATTAACATA CGTCCATAGC 3420

GAGGATGTTT TTCGCTTTGA AAAAGTTGAA CAATTAAGGA ATGGACATTT CGATGTCATT 3480

TTTACTACGA CAATATTAGA ACGTCGATTT ACAATGGCAA ATTTGGATGT TGTTGTTATC 3540

GATGCACATC AATATACTCA AGAGGCTTTA ATACAAATTG CTGGACGTG 3589

(2) INFORMATION FOR SEQ ID NO: 266:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1017 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 266:

TTTCCAAGAA GGcTTgAAAA AtGTTwCaAG TGGTGCGAmC CCAGTTGGTT TACGACAAGG 60

TATCGACAAA GCAGTTAAAG TTGCTGTTGA AGCGTTACAT GAAAATTCTC AAAAAGTTGA 120

AAATAAAAAT GAAATTGCGC AAGTAGGTGC OATTTCAGCA GCAGATGAAG AAATTGGACG 180

TTATATTTCT GAAGCTATGG AAAAAGTAGG TAACGATGGT GTCATTACAA TTGAAGAATC 240

AAATGGACTA AACACTGAAC TAGAAGTGGT TGAAGGTATG CAATTTGATC GTGGTTATCA 300

ATCACCGTAT ATGGTTACTG ATTCAGATAA AATGGTTGCT GAATTAGAAC GCCCATACAT 360

TTTAGTAACA GATAAGAAAA TCTCGTCTTT CCAAGATATC TTACCTTTAT TAGAACAAGT 420

GGTTCAATCT AATCGTCCAA TCTTAATTGT AGCTGATGAA GTTGAAGGC6 ATGCATTAAC 480

AAATATCGTG CTAAACCGTA TGCGTGGCAC ATTTACAGCT GTTGCAGTAA AAGCACCTGG 540

TTTTGGTGAT CGTAGAAAAG CGATGCTTGA AGATTTAGCT ATTTTAACTG GTGCGCAAGT 600

GATTACTGAT GATTTAGGCT TAGATTTAAA A6ATGCATCA ATTGATATGT TAGGTACTGC 660
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CAGCATTGAT GCAOGTGTTA GCCAATTGAA ATCTCAAATT GAAGAAACTG AATCTGACTT 780 

TGATCGTGAA AAATTACAAG AGCGCTTAGC TAAATTAGCA GGTGGTGTTG CAGTTATCAA 840 

AgTAgGTGCA GCAAGTGAAA CAGAGCTTAA AGAACGTAAA TTACGTATTG AAGATGCATT 900 

AAATTCTACA CGTGCAGCAG TTGAAGAAGG TATTGTTGCA GGTGGTGGTA CTGCATTAGT 960 

AAATGTTTAC CAAAAAGTAA GTGAAATTGA AGCTGAAGGT GACATTGAAA CAGGTGT 1017 
(2) INFORMATION FOR SEQ ID NO: 267: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1409 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 267:

TTAATCCAGC GTTAACTGTA TTTGCATTTA TTATGATTAT TTCGATTCTT TTAGCGTATG 60 

TATTTAAATG GCTTGGATTA GTGGATGATG TGTTATTAAT GGTCATTATC ATTTCAACTA 120 

TTTCCTTAGG CGTAGTTGTT CCAACTTTAA AAGAAATGAA TATTATGAGA ACAACTATAG 180 

GGCAATTTAT CCTATTAGTA GCAGTACTTG CGGACTTAGT AACTATGATT TTATTAACGO 240 

TCTATGGCGC AATCAATGGT CAAGGCGGCA GTACAATATG GTTAATAGGT ATATTAGTTG 300 

TTTTCACAGC AATTTCATAT ATTTTAGGTG TTCAATTTAA AAGAATGTCA TTTTTACAAA 360 

AATTGATGGA TGGTACGACG CAAATCGGTA TTCGTGCGGT ATTTGCATTA ATAATATTAT 420 

TAGTAGCCCT AGCAGAGGGA GTTGGCGCAG AAAATATATT AGGTGCATTC TTAGCAGGTG 480 

TCGTTGTTTC ATTATTAAAT CCAGATGAAG AAATGGTTGA AAAGTTAGAC TCATTTGGTT 540 

ATGGGTTCTT TATTCCTATT TTCTTTATAA TGGnTGGTGT AGATTTAAAC ATACCTTCAT 600 

TAATTAAAGA ACCGAAATTA CTAATTATCA TACCGATTTT AATCGTnGCA TTTATCATTT 660 

CAAAATTAAT TCCAGTCATG TTTATTCGAC GTTGGTTTGA TATGAAAACA ACGATTGCAT 720

CAGCATTTTT ATTAACATCA ACATTATCGC TCGTGATAGC TGCAGCCAAA ATTTCAGAAA 780 

GATTAAATGC TATTTCAGCT GAAACGTCAG GTATATTAAT TTTAAGCGCA GTCATTACAT 840

GTGTATTCGT TCCGATTATT TTCAAAAAAC TGTTTCCAGT TCCAGATGAG TTTAACC6TA 900 

AAATTGAAGT TAGTTTAATT GGTAAAAATC AATTAAOGAT TCCTATAGCG CAAAATTTAA 960 

CATCTCAGTT ATATGACGTG ACATTATATT ATCGCAAAGA CTTGAGTGAT CGTCGTCAAT 1020 

TGTCAGATGA TATCACGATO ATAGAAATTG CTGATTATGA ACAAGATGTT TTAGAACGAC 1080 
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AAGTTGCTAA ATTAGCCAAA GCACATCAAG TTGAGCGTGT CATTTGCAGA CTTGAAAGCA 1200 

CAACGGACGA TACAGAGTTA GTTGATTCAG GTATTGAAAT TTTCAGTAGC TACTTAAGTA 1260 

ATAAAATCTT ATTAAAAGGT TTAATTGAAA CACCTAACAT G1TGAATTTA TTAAGTAATG 1320 

TTGAAACGTC ACTATATGAA ATTCAAATGT TAAATTATAA ATATGAAAAT ATTCAATTAC 1380 

GTAATTTCCC ATTCGGAGGA GACATCATC 1409 




(2) INFORMATION FOR SEQ ID NO: 268: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4702 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 268: 





AAAGAGGGTT TTTTACTACT ATAATCCATC CTTTAATGGA ATTCCATTTG TCCCTTTG6C 60

CCGTTCATAA CCATAGGAAA ATATATGCnG nAATCATTGA TAGTAAATAC ATCAATAATA 120

CGAATAATGT CGGTGAAATC AOGTCCGTAT TACCATTATT AGCTAAAACA TTTTCCAAGT 180

TTTCTTTTGA ACOGGATACT CATAAATGCT TTTAATGCnT GGTTTTGTCT GTCGCCATCT 240

TTAGAATTTG TAATACTTTT TTAAATTCTT TAGCGAACAA CTCATTATCT TTATOGTTTT 300

TAGCCATTTG ACGATTCAAA TCGTTAGCTC TTACGCCTAA TGCTCGACCT GTCGCATGTA 360

GCTTATTCAC ATTATTGTCG ACTTGATTTA ATTGGCCACT AACAGAATCT GCAATTGATT 420

TTGATTCTTG TGTATCTGAT AGCAATTGCG TACTCTTTTC AGAAATTCTA CTAATTTCTT 480

TATCTAAATT TGAAGACATC GTATTAAATT CATCATTTTT GCCTTTATCA ATTTTTGGTT 540

CTTGTGGCTC TTCAGCAAAA GTCTTTTTAA CGTTTTCTAA CTGATCAATC AGCTTGGAAA 600

TATCTTCTTk ATTTTTTGTT gTATTCTTTT TGTTATTTAA AATGTCATCA ATCAGTTTGT 660

CTGAGTTTTC TTCCATTGAA TCAATTTGaT GTAACACAGC TACTTTATCG TCTTTGAAAC 720

TTTCCATGTC ATTGATAACT TGGTCAACCA TCATATCAAT TAAACGTTTG TTGTCGAATG 780

GTTTATCTTC tCTGCCTTTT GTATCTGTGT ACATTTTATA ATGCGCATCA AACCTAGATA 840

ATGCACTCAA TTGCTGGCTT AATGCATCTT TCGATAAGCG ACCATCaAGG TTATGATTCA 900

ATGTTACATC CACAACACTC GTTGCTTTCT TATCATTTGG TTCATCTTGA CGATTTGCTT 960

GTCCAAATAA CAATTGTAAA TGCATTGTTT TATCTTTTAA GAAATCTTTC TCAGCATCCT 1020

TTTTCAATTT AGCAACGCCA TTGACTTCAA CTTTATATTC CTTGTTA6AT GTATCGAGTT 1080
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TAAAATGAGG AtCTGTTGCA ACAGTTAATT GATTAATATC ATTACTTTTA ATCGTTTCAG 1200 

TACGTTGCAC TTTGACACCA TCATTAATCA AACTACTT G T GTCTTGCGCA ACTTGGTTAT 1260 

CGTAATCTGT TAAATTAATG TGTTCTGCTA ACGGTTTTTT CAAATTATAT TCATTTTTAT 1320 

AACGTTTTGC TTCTTTGACA ATTGCTTCGT ATTTATTAGC TTCATCCTCA TTTAAACCTG 1380 

CAGCTATAAA GTCTTGTTTA GACATGTTAT AGATAAATGT TGTATCTGTA TCAGGTTCTT 1440 

TGACAATATC ATCATGAAGT TGTTTCTCTA AGTTTTCAGC GAATTGAGCA TTGTTCATTT 1500 

TAATGCTATT TAGCGCATCT TGTAAGTCTT TGTTATTTGC AAGCTCATCT TGCAGTGAIT 1560 

CTGTTAATTG CTTACGATAn TCTTCAATCA TACCTTTTGA AAATGGTGAC TCTTGTGATT 1620 

GAATGATTTC TCTTAATTTA TCTAAGTTTT CTTTAACAGT TTGTTTATAT TCTTCTTTAC 1680 

CTGTATCTTG CATACTTGAT TGTTGATCAA TTTGGCTGTC CATCTGTTTT AATGCATTGA 1740 

TATAGTTATC AAGTTCCACG CTATCTTTTT GCGATTTATA ATCTTGTAAC ATTTTATCCA 1800 

TCGCTGTATT GTGCTCGTCA AATAATGAAT TTTGTTTTTC AATTAAAGTC GAAACATTAT 1860 

AATCTGTGTT CACTCTGAAC GTATCTGAAT TCGCACTCAA TAATGATTTA TTGTATGTTT 1920 

GGAACCATTT TCTAATGTCT TTGTTTGCAG AAATTGAATT TACAAGCGTA TCTGTAAATA 1980

ATTCCGGGAA GTCGTTAATT GGATTTAATA AGTAATTCGA GAATTTACTA TTCACACCAT 2040 

GTTCACGCGT CATAATAGCG CCAACATTTT TTTGTGCATT ATGTAAATTA TCAATGATGC 2100 

TTGTTAAATA AATTTCGACT AAGTTTTTGT TAAAGTCGTT AAGTACATTA CTTACAACTT 2160 

TTTCTGTGTT TTTAGCTACT TCTTCTTTTT GTCCTACAGC TGTTTTATAC TGTAGCGATA 2220 

TTTTCGATGG TGTTTTAGCG TCTAATTGCA TTGCCAATTT TGAAAAGTTT TCTGGGATAA 2280 

CAATCATGAC TTGGTATCCA CCATTTTTCA AACCAGACTC AGCAACGTTT CTTGTTACTG 2340 

TTTCAAATTT ATAGTTTTTC TCATTTGCTA ACCTTTTAAT AAATGCTTGA CCCAGCTCAA 2400 

CTTTnTACC GTTATATGTC GTTGGTTGAT CCTCGTTAAC AATTGCGATA TGTATTTTAT 2460 

TATTTTTATT ACTTACACTT TGGGATCCTT TTTCTGATTG ATCTCCATAT TTTGTTTGAA 2520 

CAAAAAATAT CATACTAACT ATGGCAATTA TAATAATTAA ACTGACAATT AATGCATAAA 2580 

TCCAATTTTT CTTTTTCATG CTTATTTCCT TTCAGTTGTT TTCTTAAAAA AATGATAAGC 2640

AAAGCCACAT TAGAAAATGT GACTTTGCCA ATTTCAGAAT GCTTATTGCA AACCGAAATT 2700 

ATTAGAAAGT TGTTGGTCTT GTTCTTGAAC GGCATCAGCA GTGCtATTcA ATTGTTGTTT 2760 

AATTTCTTCT AATAATTGTG CAAATTTTTC TACTTTAGGA CTAAGTTGTT GGAATTGCTC 2820 

TTCGAAACGG CTGAAAGCTT GACCTTCCCA GTTCGCTGCA ATTTCACCTt GTGCACGkGT 2880 
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TCTGATTTCC 


TCTGGACTCA TCTTAATCAT TGCCATAACT AGAAACCTCC TGAATATTTT 3000

AAGTTTATCa AAACTTTTTA GGGACACTAT TTTTTGAAAA AGTGCTCCTT ACTCAAATAA 3060

TATATAAATT ATTAGTATAT GTATATMTt TTTTAAGTAT TTTTAGCTTT TTTAAAATAA 3120

ATATATTGAA TATAACCATA TATTTTTAAT TAACCATTCA TTTTTGTAAT ATAAATGTGT 3180

ATACTAAAAT TAAATTAAAT ACATAAAGGA TTAAATGGTT ATTATGAAGA AAACAATTTT 3240

ACTGACGATG ACAACTCTTA CTTTATTTAG TATGTCGCCT AACTCGGCTC AAGCATATAC 3300

GAATGATAGC AAAACATTAG AAGAAGCAAA GAAAGCACAC CCAAACGCAC AGTTCAAAGT 3360

GAATAAAGAC ACCGGCGCGT ATACTTATAC ATATGACAAA AACAACACGC CAAACAACAA 3420

TCATCAAAAC CAGTCACGTA CAAACGACAA TCATCAACAC GCAAATCAAC GTGATCTTAA 3480

CAACAATCAG TACCATTCTT CATTAAGTGG TCAGTATACG CACATTAATG ACGCAATTGA 3540

TTCACACACA CCGCCTCAAA CGTCACCAAG CAATCCTTTG ACACCAGCAA TACCX3AATGT 3600

CGAAGACAAT GACGATGAAT TAAATAACGC TTTTTCAAAA GATAACAAAG GGCTTATTAC 3660

AGGCATCC5AT TTAGACGAAT TGTATGACGA ATTACAAATC GCCGAATTTA ATGACAAAGC 3720

AAAGACCGCT GACGGTAAAC CTTTAGCATT AGGTAACGGT AAAATCATTG ATCAGCCTCT 3780

TATCACAAGT AAGAACAACT TATATACTGC TGGACAATGT ACATGGTATG TCTTTGATAA 3840

ACX5TGCCAAA GATGGACACA CGATTAGTAC ATTTTGGGGA GATGCTAAAA ACTGGGCAGG 3900

CCAAGCTTCA AGCAATGGCT TCAAAGTAGA TAGACACCCA ACACGAGGAT CAATTTTACA 3960

AACAGTAAAT GGTCCATTTG GTCATGTAGC CTAOGTTGrA AAAGTTAATA TTGATGGAAG 4020

TATTCTAATT TCAGAAATGA ACTGGATTGG TGAATATATC GTTTCATCAA GAACCATCTC 4080

TGCTTCAGAA GTTTCATCAT ATAATTACAT CCATTAAATT AATCATGACA TCAATAAAAA 4140

GCGAeCAGTT CGCAGTTTAC AATTCGTAAC ACTGCAAAAT TGGTCGCTTT ATTTTGTATG 4200

TTATTCGATT ATAAAATTAC AAAGAAATGT TCTCTACATT CCCCATTAAT CAAAATCGTT 4260

TACXIAAAGTA TAATTGTAGC TATAATAATC CAAGTCGTAA CAACTAGTGG CACTATCGTC 4320

TTGAATAAGA ATATACCGTA TTTTTTCTTG CGATATATAT CCAGTACTAG CCAAATTAAA 4380

ATGATTATAA CACCAACAAA AATAAATACA GGATTCATCG ATATAGCATC TGCCTGTAAC 4440

TCAGGTTGCA TTCTTAATTT AGTGATAATT AACATCACTA CTGAAATAAT GAAAAAGTAG 4500

ATACCTCTTA TCTTTGATGT CTGTAAATCT AATTCCTGCT CTTCAATGAC CTCTTTAGAT 4560

TCACCCAATT CTTTTGCAAT CAAATAATTT ACTACCTTAG GTTTCACCCA TAAACACTTA 4620

ATTGCAAAGT ACATAAAAAT ATATGATCCA GTATCCATAA ACATTAAAAA GTTGCTTAAA 4680






(2) INFORMATION FOR SEQ ID NO: 269: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2004 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 269:

ACAAAAAATT CACCCTCATT AATATTGAAA CTAATGTTAT CGACAGCAAC ATGTTTGCCA 60 

TAACGCTTAG TTACATTTTT AAACTTAATC ACTTTGCCAC CTCTTTTTTT CTCATAGCAT 120

AAAACCGAGA TTATATGTAT GTATTCCCTA TTTAACCACG TTTATTACAA TTTTCAAATT 180 

TAAATGATTT ATCCTTGAAC TTTTTTAATA AAATAATGAA TAAwAGGwAA TCwCCAGTTA 240 

AGAAATAGTG TTATTTTACC TTGAATTCAA AAAAACACCC AGTAAAACAA GGAATGCTTA 300

CTAGGTGTCT TCACTATACT TTGGCTTTAT AATTTTGAAT CGTTtCTAAA AATGCTGGAC 360 

AATAATGTTT TAATTTGTAA CTACCTACGC CATCmATATT AATCATATCT TGTTTCGAAG 420 

CAGGCTTACG TTTAGCAAAT TCCTCCAACG TGTAATCAGA AAATATACTT ACAGGTGCTA 480 

TCGTTAATTT GTCACTTAAC TTTTTACGAA CTTCTACCAA CTGACTGAAT AATACTCGGT 540 

CAACCCCTTC AACCGTATTT ATAAATACTT TTTCAGTCGC TTTTTGCTTA AATGGTGTTG 600 

TGAATACTTC TACTTCATTA CTGAGTAATT TTTTAATTGA AGTATCACAC ATTAATATTT 660 

CGTCATTTTC ATTTAAGAAC CCTTTGAATC TTAATTCATC TATTAAGTGA CTTAATTCTG 720 

ATGTTGTGTA ACCTTTCATT AAACCATGGG TTGAAATTTG GTCATAACCT TTATACTTAA 780 

TATAATCTGJc TGACTCTCCT CTTAACACTT GAATGATAAC ACTATAACTC TCTTGTTGTT 840 

TCATACGAGC GATGCAACTA ATAATCATCT TAGCTTCTTG TGTCATATTA TATGATTTAT 900 

CTTGTTGAAC ACAATTACTA CATTGTTCAC ATTCTTCTAA TTTTTCATTC GGTTCAAAAT 960

AATGGACAAT TGTTGCTTCT AGACATTTTT TTGTTTTTGT ATATTGAATC ATTTTAGTTA 1020 

ACTTTTCGCC CATTTTATCT TTATAGTCAT CATCAGCTTG AGAGACTGTT ATAAAATACT 1080 

CGTGTAAATT GATAtCGCGT TCGCTAAATA ACAAAATACA TTCACTTTTC AACCOGTCAC 1140

GACCTGCACG ACCCGCTTCT TGATAATAAG ATTCTAAATC TCCAGGCATA TTATAATGAA 1200 

TAACAAAGCG TACATTGGAT TTATCAATAC CCATACCAAA AGCATTTGTA GCAACGACTA 1260 


CTTTAACACG ATCAAATAAG AAATCATTCT GCGCTTCTTC TCTTTCTTTA TTGCTCAAAC 1320 

CTGCATGATA TATAACACTT TCAATTTTCT GACTTTCTAA GGCTTCTTGA AGCTCTTCAA 1380 
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CCAATATAAA TTTTTGACGT TGATAAGTAG GATTTACTTT AAAAATTAAG TTTCTACGCT 1500 

TAGTACTCGT TTTAATTTGA TCAGTTTGAG CGATATTTAA CTTTTCTCTA ATATCTTGCT 1560 

GTACTTCAAC CGTGGCAGTT GCTGTCAACG CTATTATTGT AAAATCTTGA GGTAACGTAA 1620 

ATACTTTTGA AATAACATTT TGGTAACTCG GCCTGAAATC ATGACCCCAT TTAGAAATAC 1680 

AATGCGCTTC ATCAAACGCG ACTAAGTGAA TCTTTATACXS CTGAAGCATA TTTAAAAAAT 1740 

ATCGGTTTTC AAATCGTTCT GGTGCAACAT ACAAAAATTG AATTTCTCCA TTTGATAATG 1800 

CTTTTTCAAT ACX5TTGTTGC TCTTTTTGAG TCAAACTACT ATTTAAAAAA GCAGCTTGAA 1860 

TTCCCATCGC TTTTAATTGA TCCACTTGAT CTTTCATTAA TGATATTAGT GGACTTATTA 1920 

CAATTGTTGT ACCACCTAAC mATAAACCTG GTACTTGrTA GCmTATAGAC yTACCTCCAC 1980 

CAGTtGGkAA GrCACCAAGC ACAT 2004 
(2) INFORMATION FOR SEQ ID NO: 270:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2244 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 270:

AAAGATTGCT TGCCTTGAGG GTTTATATAT CTGACTCAAT TGCCACATTT TTATCAAGAG 60 

TAGTTGATAA TACTCATCAT AATTATAGCT AATATTATAT TTTTTTAAAA GATAGTGTAT 120 

GATTTTCTGG TGTTTGTTGT ATACGTCATT AAATTTCAAG TAGTCATTCT CCAAGTTATA 180 

C6TATAACAA ATATTTCCGG ATAAAGTTAG AATAAAATAT TTAGAAAAAT CATTCATTTG 240 

CGTAATCGCT AAATTAAGTG TTAAATATAA GACATAAGTA ATTAATTTAA TGTGATATGA 300 

TGTATTArTA CTTTGCTAAA TAGTAGATAG AACAAAATTT GTAATCGGGA GGTAACAATG 360 

GATTACGCAC ATTTAAATTT AGAACATTTT TTTGCACGAA ACGACGATTT AGATGTTATA 420 

AGAGATCGCG CTGATTTCGT GATGATAAAT AACTTCACTA ATGAAATGAT GTATCGTGAT 480 

GGTCAAATTG AAGGCACGAT TGATTTAAAT CAGTACTATT ATAAAAATAG ATCAAATGCA 540

GCAAGTTTTA TTATGATGGA TTATAAAAAA GAAACTAAGT AAACGAACAA AAGAATTTTT 600 

TGTTTTTTAA TACGTGAATA ATAAGATTAT TGATATAAAG GTTTTCAAAG GTTATACAAA 660 

AAGATAAAAC ATTTATGATT CGTAGATCAA CGTAAAGTAA TGTTGATAAA TGGTTTAAAA 720 

CGTTTCATTT ACATTACTGT TTATTTATGA ATATGTAACA ATGCATAGAT AAAATTGTTA 780 
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ACCTAAGAGG TGTGGATATG AATAAACACA AGAAAGGTTC TATTTTTGGA ATAATAGGAC 900 

TTGTTGTCAT ATTTGCTGTT GTCyCaTTTT TATTTTTCTC AATGATATCC GATCAGATAT 960

TTTTCAAACA TGTTAAATCC GACATTAAGA TTGAAAAGTT AAATGTTACA TTAAACGATG 1020

CAGCAAAGAA ACAAATAAAT AATTATACGA GTCAACAGGT ATCAAATAAA AAGAATGATG 1080 

CATGGAGAGA TGCATCTGCA ACTGAAATTA AAAGTGCAAT GGATAGCGGT ACTTTTATCG 1140 

ATAATGAAAA GCAAAAATAT CAATTTTTAG ATTTATCAAA GTATCAAGGG ATTGATAAAA 1200 

ATAGAATTAA ACGTATGTTA GTAGATAGAC CAACGTTATT GAAACATACG GATGATTTCT 1260 

TAAAAGCTGC TAAAGATAAG CACGTTAACQ AAGTTTATTT AATTTCACAT GCATTATTAG 1320 

AAACTGGCGC AGTTAAAAGT GAATTAGCTA ATGGAGTCGA AATTGATGGC AAAAAGTACT 1380 

ACAATTTCTA TGGAGTAGGA GCCCTTGATA AAGACCCAAT TAAAACAGGT GCAGAATATG 1440 

CTAAAAAGCA TGGTTGGGAT ACACCTGAAA AAGCTATTTC AGGCGGTGCT GATTTCATTC 1500

ATAAGCACTT CTTATCAAGC ACAGATCAAA ATACATTGTA TAGTATGAGA TGGAATCCAA 1560 

AAAATCCAGG AGAACATCAA TATGCTACAG ATATTAAGTG GGCAGAAAGT AATGCAACAA 1620 

TTATPGCTGA CTTTTATAAG AACATGAAGA CTGAAGGAAA ATACTTCAAA TACTTTGTGT 1680

ATAAAGATGA CAGTAAACAT TTGAATAAGT AATTTGATAA GCTACGAGTT GTTTTTATGA 1740 

CTCX3GACATA CTAAAAAGAC GCTTTCTATC TTGTTTTGAT AGAAAGCX^TC TTTTTGCATT 1800 

AGAGAAAACA CATTGATkGA TAAtCCCaCC aATGCAAgTG GGGcAGGACa TCGATAAAGA 1860 

ATTACTTTTT CTTTAGAAAT TAGTATTTCT TATGCATGAG TTTTACTCAT GTATTCCTAT 1920 

TTTTAAGTAC ACATTAGTTA TAGCTAATGA TAAAGAACCA CTACATAATA AATCATTAGT 1980 

GTTTTTTTAT CATTTCTGTC CCaCTCTCAT CGTGATTTGA AATTTTCAAT TGCGATTTTA 2040 

ATTTCATCTC TTACACGTTG GAACTCTGAC CAAGGCTTGC CTGCAGGATC ATCAAATCCC 2100 

CAATGTTCTT TCTTAACATT TGTTGGTAAA GAAGGGCAAT TTACGTCTGC ATCACTACAT 2160 

AATGTAACAA CTAAATTTGA ATTTnTAATA ATATTATTAT CGGATTAAAA TCTGATGGAT 2220 

GATTTGATAT ATCAATGCCT ACTT 2244 
(2) INFORMATION FOR SEQ ID NO: 271:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1371 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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5 



ATAAGCAATT TAATTTTGAG TCTACAATGG AGGAATTATC ATCTTTATCA GAGACTTGCC 60

AACTTGAAGT GTTGGGTCAA ATTACTCAAA ACAGAGATCG TGTAGATCGC AAATATTATG 120

TTGGTAAAGG TAAAATTGAA GAAATTCAAG CATTTATTGA GTTCAAAGAT ATTGATGTAG 180

TCATCACAAA TGATGAATTA ACGACTGCAC AATCCAAATC ACTAAATGAA GCTTTAGGTG 240

TAAAAATTAT TGATAGAACT CAGTTGATTC TTGAAATATT TGCATTAAGA GCAAGAAGTA 300

AAGAAGGTAA ATTGCAAGTA GAGCTAGCAC AACTTGATTA TTTATTACCT AGATTGCAAG 360

GCCATGGTAA AAGCCTTTCT CGTTTAGGTG GCGGTATTGG AACTAGAGGC CCTGGTGAAA 420

CGAAGTTAGA GATGGATCGC AGACATATTC GAACTCGTAT GAATGAAATT AAAGATCAAT 480

TGCGGACGGT AGAAGAACAT CGCGAAAGAT ATCX3AAATAA AAGAAATCAA AATCAGGTGT 540

TTCAAGTAGC TTTAGTTGGT TATACAAATG CTGGTAAATC ATCATGGTTT AATGTTTTAG 600

CAAATGAAGA GACGTATGAA AAAGATCAAT TATTTGCAAC GTTAGATCCT AAAACACGAC 660

AAAITCAAAT AAATGATGGA TTTAATTTAA TTATTTCAGA TACTGTTGGT TTTATACAGA 720

AACTACCTAC GACGTTAATT GCAGCTTTTA AATCAACTTT AGAAGAGGCT AAAGGTGCAG 780

ATTTATTAGT ACATGTCGTA GATAGTAGCC ATCCTGAATA CC6TACGCAG TATGACACAG 840

TTAATGATTT AATCAAACAA TTAGATATGA GTCATATTTC TCAAATAGTT ATTTTTAATA 900

AAAAGGACTT ATGTGATCAT GCATCAAATC GTCCAGCAAG TGATTTGCCT AATGTTTTTG 960

TTTCTTCTAA AAATGATGGT GATAAATTAC TTCTTTAAGAC GTTATTTATT GATGAAATCA 1020

AAAGGCAATT AACTTATTAT GATGAGACAA TTGCGAOGAA TAATGCAGAT CGATTATATT 1080

TTCTAAAACA ACATACATTA GTGACTGAAC TTAAATATGA TGAAATTGAA AATGTTTATC 1140

GTATAAAAGG ATTTAAAAAA TAATAAAAGG ACGAAATTCA AATGAAAGAT ATAAGTAAGA 1200

TAGTAGCTGA CGTCGAATCA ACGTTAGCAC CATATTTTAA AGAAATTGAA GAAACAGCAT 1260

ATATTAATCA AGAAAAAGTA TTAAATGCAT TTCATCATGT CAAAGCAACC GAAAGTGATC 1320

TACAAGGATC AACAGGATAC GGGTATGATG ACTTTGGACG TGATCATTTA G 1371


(2) INFORMATION FOR SEQ ID NO: 272: 





(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6035 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 272: 
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CGTAAACCTA TGCGTTTTAA TATTCTGAAG TTACTTAOTT CATCCTCAGT TTCATCCATT 120 

TGTTTAATAT AAATAATACA TCCAGCTGCT ACTAAAAATG CTAATCCTAA AAATGATGTA 180 

ACAAATATTA GAATACCGTT AGTAGCATcG ACCTCTTTTT TCATTTCATC ATACGTGATG 240

ACTTTGTCTC CAAACTGTTT TGCAATTGCT TGAGCTTTTT CCTTTTGTGA TGTTTGTTTA 300 

ATATCATATC CATAAAAAGT ATGAACGTTA TTTTGTGnT TCAACTGCTG ATACTTTTCA 360 

GGACTTACTT CGATGACAGG TGAGTTGAAG CTTAGATTTA AAGGATAAAC CTTACCTTTG 420 

TCTTCTTGTG TAcACGGAAA GTTTCATTCT TAGTACCTTT TACTACTAAA TCrTTGTTTA 480 

AATGGATATT AATAATGTTA GGCAGCX5ATT TTGTATTTGT AATGATGGCA TTGTTGCCTG 540 

TtAACTTGTT ATTTGCACTT AAAATAGAAT TCGTGCGACC TGAATCACTA CCATTTTCCA 600 

AAGTAATAAC CTGATCATTA ACATTATCTA CAGTAATAGT TTCGTAAGCA TTTTTAGAAA 660 

ATGTAATTTG TTGTTGGCTT AGTTTAGTTT CAAATTGTTT AGCATCTTGA GTAGCGACCA 720

CX3TTAAATTC ATTTGGTGCC ATAGATGTAA GGGTTTGATC TGTATTTGAT TTAGATAATG 780

CCGCAAAACA CAATACAGTT ACTGTAACTG CAGAAATAAT TGCAATGATA GTTAAAGACA 840 

TGQCATTTTT CTTCATTCTG TACATAATAG ACGATGTGAA TACAACATCX3 GTAATAGATA 900

CGCGTCCATT TTTTGATTTT TTCAATGTTT TAAAAATAAG TGACACGGAA CTTCTGAAGA 960 

ATAAATAGGC GCCTACAACC GTTAAAAATA AAATGATAAA CGGTGATGTC ATAGCCATAG 1020 

TTAGTCCTTT GAACGTACCA AACATTTCTG TCGCCATATA ATAGCCTAGT GCAATCATAG 1080 

CAATACCTAA TACGCCTGAA ATAACCTCTG CAGTOGTTAC TTTAGCAGTT GTGGCATCAG 1140 

TTTTAATTGA ATCTTTCATC ATTGATAAGA TACTACGTCT TTTTAGAAAT AAAGCACTTT 1200 

GAAATAAAAT CAGTACATAC GCAATAATTA GCATGAAAAT AGTTAAAACA AGGGCCATAG 1260 

GTTCGAAATG TATCGATAAG TTAATCGATA ACGACATCAA TTTAGATACT ATGGAAAGCA 1320 

ATAaTTGTGC ACCTGCAATG CCACATAATA CACGGACAAC ACCTGTGATT AAAAATACGA 1380 

TCATTtGTTC AAGTGCTAAC ATTTnCAAAA TGTTTTGTCG TGTTAAACCA ATCaACTGAA 1440 

ATAGCGCAAA TTCACGTGTA CGGCGTTTTA CGrmTAAATG ATTGGCATAC ATTAAAAAGA 1500 

TCACAATAAT GATAAATAAA AATATTGATC CGACTAAAGC ACCTTTCTTA ATGATGGCCA 1560

TCGAGTCGTC ATTATTTACA CCTTTAGTAA ACTGTAAGGT TGTAAAACTG AAATATAAGA 1620

CGATGCTAAA AAATAATGAA AATAAATACA TTGCATAATG TTTTAAGTTT TGTCGTAAGT 1680 

TTTTCAAAAC GATATGGTTA AATGTCATTT GAGACACCAC CTAATACTGA TTGAAGATGT 1740

ACAATGTCTT CATAAAAGGC CTGTTTAGAA CGTCCTTCCT GATAAAGTTG TGTATGAATT 1800 
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ACCATGACAA TAGTTGTATC AAACGATTTA TTCATTTCTT 


CCAAACGTTG 


TAATAGGTCA 


1920 




TTTGCACTTT TCOAGTCGAG TGCGCCTGTT GGCTCATCTG CAAATATGAT TTGTGGTTTG 1980

TGAACAAATG CTCTCGCTGC TGCAGTTCTT TGTTGTTGAC CACCAGATAA TTCGCTAGGG 2040

TATTTATTTC CTAGGTCATA AATACCTAAT GCTGTCGTGA TOGCTTTATA ATTTTCTTCC 2100

ATTGTTGCCT TCGACATTTT TTGAACAGAT AAAGGTAACA TAATGTTTTC TTTAACGGTT 2160

AATGTCGGCA GAATACTGTA ATCTTGGAAG ATGAAACCTA ATGATTCnT GCGGAATTTG 2220

GCAAGTGCTT TTTGATTAAG TTTATTAAGC TCTTGTCCGT TAGCAATCcg cTACCGCTAG 2280

AAATTTGGTC AATTGAACTT AGTACATTTA ATAAGGTTGT CTTACCTGAT CCAGAAGGCC 2340

CCATAATCGC APJCGMTTCG CCTTTTTGTA TGTCAAAGTT AATATCTTTA AGTGCTTGAA 2400

ATGTGTGCTT TTTACCGTAT GTTTTTGAAA CATGTgCACT GATAATATCG TCATAGTCTC 2460

ACTCCTTtTG TATTTAATTT CATTTTAAAT AATGTTTGGA GTAGTAGCCT TTATCTAAAC 2520

TTACAATTCA ATGAATGAAC CTTACAGAGT TGAAArcTAT CGCTACTTAG TAGATTTTTG 2580

AGTGAGGATA CAGATTCATC GTACATATTA GACAAAAGCA ATGGTGCTTT CTAAGTGATG 2640

ATGTTTGTGT AAATTGAGAA AAGGGAATTT AATTATTGTA TAATAAATTT TTTGTAAAAA 2700

TTAAAAGAGG GTTTTATTTG AAAGGAATTG ATTGTTATGG AAAAAGGAAA TCAAGGTATT 2760

AAATGGTCTA GTTTAATAAT GGGTGTATTA TTATTAATGT TGGCAGTCGT TATTTTTACA 2820

TTTCCAATTG AAAATTTTTA TGCTATTACC TGGTTGATTG GACTGTTTGT ATTAATTAAC 2880

GGTGTGATTC AAATCGTTTA CCGTAGAAAA GCAAAAGCTT TAGTAGGTGG TAACCAAAAT 2940

TGGATTCTGT TTATGGGGAT TGTAGATATT CTATTTGGTC TATTAGTTAT TTTTAATGTT 3000

GGCGCAAGTT CAGCATTCTT TATTTATATG TTTGCTTTTT GGTTTATTTT TAGTTCTATC 3060

TCTGGATTAT TTACGTTTTC GGGTAGTGGT AGCTTAAAAC TAATTTCAGT GATTTTTAAT 3120

TTATTAGGTA TTGTTTTCGG TGTCATTTTA TTATTTAATC CATTAATGGG TATCGTCTTT 3180

ATTTCGACGA TGATTGCTAT TGCATTTGTA TTCGTAGGTG TCATTTATGT TGTAGATGCA 3240

CTTGCTTAAG TAAAATGAAG CGGTTCAAAA GAAGGGTGTG ACATGAAGTT TGTGTCATAT 3300

CCTTTTTGTT GTGTTTATGA AGCATAAAAA AGGGGCGCTA CCTACAATAA GTAAGATACG 3360

CCCATATTTT TATATTTTAC TATTATTGTT TTTCAATACX5 ATTAATAGTT ACATTTAGTC 3420

CAAAATATTT TTCTAAAAAA TGTTTATAGT TATCTTTAGT GACATCAAAT TTTTCTGAGC 3480

TACCATTCCT TGTTAAAGTT AAATGATTTT CAGACATTGT AGCACGGCCA AATGATTGTG 3540

GCATTGTAAT TAATAAATGC TGTACAAATA TTGAATCTGG ATGOGTTTGA TTATATTCGA 3600
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AATGATCATT 


TTCGAATTTT 


TGAACATAGA AAATATCCTT GTCTTCGTTQ 


TTAAAAATAG 


3720 




CGCGGAATGT ACCACTGATA TCAGTAATTG GTTGTGTATG CTCAGATGAA GTAATAGGAA 3780

TGGCATGTAG AGGTAAGTCT CCAAAGCCAA CATCAGTTAC ATAGAATACA TCATTTATAG 3840

AAACAACAAG TGAAGCATGT GAACCGTTCA GACTACGACC GCCACCX5GGW GTOTGAATAG 3900

TAGCTGACAT TAATTCAGGA TTAAATCCTT TTTGTTGTAA ATAGGCTTTG AAAAATGTAT 3960

TTAATTCATA ACAAAAACCA CCACGTTTAT CATGAACAAT TTTATTAAAA AGTGCATCGA 4020

TATTTATAGA TATCGGCTTA CTATTTTGAA CATCAATATT TTCAAAAGGT ACAGTTAACA 4080

TAAAACGTGT TGCATAATAA TTTAATGCTT CAATACTCGG TCGATTATAA CGAGATGAAT 4140

CAATTTGTAA ATAATTCTCT AACTTCGCAA TATTCATAAG CATAGCGCCT CCTGTATTAA 4200

AGATTATAAT TAAATTTTAA ACAGAAATAC TGAAATTTTA AATTCGAAAG CATTGAATTT 4260

TGGATAAATA CATTTTAAAT AGAAAAATAC GCTCTCAAAA TGAAGTCATC TCTAAAAGAA 4320

ACGATTTAAA GATGACTACT GAGAGCGTAg CATAATGGAA GAAGTGTGCA GGGTGTCTAA 4380

AAATGCAACA ATACAAAGGT AGTTGCAAGA CAAGTTGCCT TATCTAGACC ATTTGTGTTC 4440

TATGCGACCA AACTTCCAAA TTAAACTTGA AATAAGCCAA GTAATTAAAA ATAATGCAAC 4500

TAAAATATAG CCTAAATAAT CAAATTCGAT CGAACCAATG AATGCCCAAA ACGCACCATG 4560

TAAATCTAAC TTATCAGCAA GAATTTGTAG CAATTCAATC ATCCCAATCA CTAATGCTGC 4620

CATGACTGAT ATCGCAGTAA TCGTTATATT GTAATAGATT TTGCGAATAG GATTGAAGAA 4680

TGCCCAATTA TAGGCATACT TCATTACAAC ACCATCTAAT GTATCCAATA AACTCATACC 4740

TGATGCGAAT AAAATTGGTA AAGATAAGAT TCCGATAAAT GAAATGGCTT GTTGTGATGC 4800

GCCTGAAGAA AGAGCGAGTA ACGCAATTTC ACTAGCTGTA TCAAAACCAA GTCCAAATAA 4860

AAAGCCAAGT GGCAATACGT GCCAACTACG CGTGATTAAT TTGAAATAAG GTCCTACAAA 4920

TCGAGAAACC AATCCTCTAG ATTCAAGTAA TGCATCGACT TCAGCTTCTT CAATGTGTTC 4980

ACGACGTAAT TTAGCGAACA AGTTAATTAA AGAGATTAAA ATAATTAGAT TCAACACACC 5040

QATAAGCACT AAAAAGAAAC CTGAAACTAG TGTACCAATC GTTCCACCAA TATCTTGGAA 5100

ATGCGGTAAT TCATCTTTAG CCCATTTTAC AGATACCCCT AAAAAAACAG CCATTAAAAA 5160

TACGACAGAT GAATGTCCAA TTGAAAAATA GAAACCCACA CCAGATGGAT CTTTGCGTTG 5220

CTGTAATAAT TTGCGaACCG TATTATCTAT TGCAGCAATG TGATCTGCAT CAAATGCATG 5280

ACGCAAACCT AATGTATATG CAAGAATCCC CaTACCAAAT AAGATATGAT GGTCTTTTCC 5340

AGCAATCCAT AAAAAACTAA ACCCAATAAC GTGTAACAAA ATGACAATAG CTATGTATGG 5400
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ATATTTAATC ATACTGTATG TTCAATGGGC ACTCTAGTAA TAAGTGTTCA TATAACAAAA 5520 

ATGTTATGCC AAATTATTTG TTATATAAAA ATATACATGT AACCACAAAA GATTTTTTGC 5580 

GATATATATA ATTTGATAAA TTAACCAACA ACAATGTAAG ATGTCACTTT GCTTAACTTG 5640

GCATCCTTTT TATGATTTTC AAATTCAAAA AAATGAGCAA AATGAATCTC TTTAcCAGTT 5700 

TTTAATATTT CaATACCATG CATGGAACCT AAGCACCCAT GTGTGATGCT GGAATGGATA 5760 

TTGAGACTAG CAACCTGATT GTAATGATTA GATAGTTCTT GAATTAATAT TTGAGGTCCG 5820 

TATATGTCAA AGCGGCCAGG GACAGACCAA ATAAATTCTG TTGTAACCAG TGAACGTAAT 5880 

AATTCAATAT CTAATGCTGC TGTAACAACT ATAAAATCTA TCATTTGTTG ACGTTTAGGC 5940 

GCATGATTGC ATGACACATC TCCTGTTAAC TTAAAAGGTA ATGATGACTG AACTTCCGTT 6000 

TTAAAATGTA 6TTGGTGCTG AAATAAAGCT TGTTC 6035 
(2) INFORMATION FOR SEQ ID NO: 273:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1039 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 273: 



TTTTGAACAG CCATATTTAT TCACCCTCAA CATCATTATA ATGGTATTAG TCGCATTACC 60

TTCACTTGTT TTAGCTATAT ATGATTATAT GAGTTTTAGA ATTTCTTCTG CTATTTTACA 120

ATTTCTAGGG GCTATCTCTT GGTTCTTTTT ATCATTGATA TTATCGCTCA CACAATTTAC 180

ACCTTTTACA TTAGCGTCAT TTATAACTTC AATTATTTTG TTCACAAGCA CAATTATCAC 240

ATTMCCATT GGTGGTAAGT CTGTFGAAAA GAATGATTCC CCTTAAATTC CAAATGAAAA 300

AAAGGTTCTG AAGGCCGCTA TAAAACACAG TTTTTCAGAA CCTCTATACT TCTATTCAAT 360

GATATATGGT TTGCAATTTT CTACCTTTAA ATCCACAGCT TCTGCCCTTG AAACTTTGTT 420

AAAATAAACC ATCAAACAAC GAATGACAAC TTGATGTGCA ACAATGACAA TATCATCTTT 480

TTGTGTATCT TCATTGACAA CATGATTCAT AAAATGTTCT ACGCGTTGAT ATACATCTTC 540

ATAACTTTCT CCTTCAGGCG CTTTTTGTGA AAAACTATGA CGAAAGTCTT TAAAGTTTGG 600

ATCATTGAAA TATTTTTCAT ATTTCGGATT CGCACTGATT TCATCTTTAT ATTCACCCTC 660

AAATACGCCA AGTGAACGTT CTCTTAATAG AGGGGTAGTC GTTGATGCAA TGTCATATGG 720

AAAAATATGT TCAAACGTTT GCTGTGTTCT TAATAAGTCT GAAACATATA CATGTTTAAT 780
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CGGCACATCT AATTGTCCAC AAAAATAAGA TCGAAAATGT TTATTATCAT AATTCXJATTT 900 

TGATTCGCCA TGTCTAACTA AATAAATCX3T CATAATATTA CTCCTTACCT TATGTATTTC 960 

ATATCTACCA TAACACTTTG ACTACTAATT CGATATCAAT CTTAATATTC TATTCTAAAA 1020 

AAAGAATTAA TTCATATnT 1039 



(2) INFORMATION FOR SEQ ID NO: 274: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1496 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 274: 



GAGAGAATnT GCAATTAGTT ATTCAATTAG TTGATTTAAG ACATGATCCA ACACAAGATG 60

ATATCTTAAT GTACAATTAT TTGAAACATT TTGATATTCC TACTTTAGTT ATATGCACTA 120

ArGAaGACAA AATTCCaAAA GGTAAGGTyC AAAAGCATAT TAAAAATATT AAGACACAAT 180

TAGATATGGA CCCAGACGAT ACAATTGTAA GTTATTCATC AATTCAAAAT AATAmaCAAC 240

AACAAATATG GAATTTAATT GAACCGTATA TTTCATAGTT TTTGTACGTC AAAACTTATA 300

CAAAAATTTT AAAAATAATG TAAGCACGAA ACTTTTAATT AGTACACAAT TGATAACATT 360

TTTCAACGTT CATCATTTTG TCAAAAACTC AAAAGTAAAT TAGAAAGATT ATAATTTATT 420

TAAGCATCGT ACTTAATTGO ATTTTAAATT ATGTTATAAT ATTTGTATTG TTAGTATATA 480

TGGGGGCTTT TCAAATGCAT TTTATTGCAA TTAGTATAAA TCATCGCACA GCTGATGTgC 540

ACTAAGAGAG CAAGTTACTT TTAGAGATGA TGCCTTACGA ATTGCCCATG AAGATTTATA 600

TGAAACTAAA TCTAyTTTAG AAAATGgTCA TATTaTCAAC ATGTAATCGA ACTGAAGTAT 660

ATGCTGTTGT TGATCAAATT CACACAGGTC GTTACTATAT TCAACGATTT CTAGCTCGTG 720

CATTTGGATT TGAAGTAGAT GATATTAAAG CAATGTCAGA AGTAAAAGTG GGGGACGAAG 780

CaGTAGAACA TTTATTGCGT GTCACTTCTG GTTTAGATTC AATCGTACTT GGAGAAACTC 840

AAATTTTAGG TCAAATAAGA GATGCATTTT TCTTAGCGCA AAGCACAGGT ACGACAGGrA 900

CAATTTTTAA TCATCTATTT AAACAGGCAA TTACTTTTGC AAAAAGAGCA CATAATGAAA 960

CAGATATAGC TGATAATGCT GTAAGTGTGT CTTATGCTGC GGTCGAGTTG GCGAAAAAAG 1020

TATTTGGCAA ATTGAAAAGT AAGCAAGCTA TCATTATTGG TGCAGGGGAA ATGAGTGAAT 1080

TATCACTATT AAATCTTCTT GGTTCTGGAA TTACTGaTAT TACAGTAGTA AATAGAACAA 1140
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TACCAAATTT ACTTGAAAGT GCAGATATTG TGATTAGTTC AACGAGTGCA CAATCTTATA 1260 

TCATTACAAA TGAAATGATA GAAAGAATTG CAGAAAATAG AAAGCAAGAT TCACTAGTAT 1320 

TGATTGATAT TGCAGTTCCT CGAGATATTG AACCTGGTAT TAGTGCCATC ACAAACATCT 1380

TTAATTATGA TGTTGATGAC TTAAAAGGTT TAGTTGATGC AAACTTACGT GAGCGACAAT 1440 

TAGCGGCTGC AACAATTTCG GAACAAATTC CTACAGAAAT ACATGCACAC AATGAG 1496 

(2) INFORMATION FOR SEQ ID NO: 275: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4826 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 275: 





CTTGATTTTT TCCCtTTAGT ATTTTCCaTt TGeOlTGTCGC AGCTTCTAAA TCCTGCTTTG 60

GTTCTCTAGT GAACTTCATA ATTAAAGCAG CTACAACGAA TGATACAAGT GCAGCAAGGA 120

AGACACCGAG TAACATGTGC AAGAATTCAC CTCTAG6TGC ATTTAAACAG TAAACTATAA 180

ATGAACCTGG TGACGCGGGA CTTTTAAATC CAAATCCTGT TGCTTGATAA GTTGCAACAC 240

CAGTCATTCC ACCTAAAATA ACAGCGATAA ATAATAAAGG ACGCATTAAT ACATATGGGA 300

AATAAATTTC ATGAATACCA CCTAAGAAGT GGATAATTCC AGCACCATAT GACGTTGCTT 360

TTGCAGTGCC TTTTCCAAAA ATCATATAAG CAAGTAAGAT ACCTAAACCT GGTCCAGGGT 420

TAGATTCAAT TGTGTATAAA ATTGATTGAC CAGCTTTTGC AGCTTGATCT GCACCAAGCG 480

GTGTGAATAC ACCATGGTTA ATCGCATTGT TTAAAAATAC AATTTTTGCA GGCTCTACTA 540

AAATACTTAC AAGTGGAAGT AGGTGTGCAT GTACTAATGC TTCAACTGCC ACTGATAAAA 600

TATGCATAAT AAATTTCATA AGTGGTGCTA AAATTTTAAA TCCTGCAATC GTCATGATAA 660

ATCCTAAAAT ACCAGCAGAA AAGTTATTAA ATAACATTTC AAAACCTTGC GGCGTTCTAG 720

GTTGAATCAA TTGGTCGGTC TTCTTCATTA ACCAACCAAC AAGTGGACCC ATAATCATTG 780

CACCAAGTAA CATTGGTGTA TCAGGTAATG CAACGATGAC CCCCATAGTT GCTGTTGCTG 840

CGATGATACC ACCACGTAAA TCATAAATTA AACGACCACC ACTAAATGCG ATCAATAATG 900

GGATTAAATA AGTAATCATT GGTCCTGCTA AAGTAGCTAA ATCTTTGTTA GGTAACCATC 960

CATTATCTAT AAAAATGGCC GCGATAAAAC CCCAAGCGAT GAAAGCGCCA ATGTTTGGCA 1020

TGATCATACT ACTTAAGAAT GATCCAAATG CTTGAACACG ACGACCAATT CCTTTTTTCT 1080
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GAGAGGTTAC TTGTTACTCA ATATAAACAA AAATCAACTT TGTCAAAATA AATGTGACAA 1200 

AATTAAATAA AGTGTCATCA ATGTGACAGT ATAGATATTT TGAAAAAGTA AAACAAAAAA 1260 

ATTGTTTTAG GATTTTTAAA ATTTTATTGT GAAAATATTT GCAAAACAAA ACAACACCGT 1320

9TACAATAAT GATTAATGGA AAGGGGGAAA GTTCGGCAGT ACAGTTAAAG CGCCTGTGCA 1380 

AATAAATATT TGTATTTGAA GATTAAAGGT TAATATATGA GTGGCCTTTA TAGAGTGCAA 1440 

TATATGTATT TGTAGACGAG GAGGATAGTG ATCGAATAGA TCGGCGGATG CTATCCCGGA 1500 

TGTGGCTCAT TCGTTAGCTT ATTAAGTAAA ACATTAGGGT GACTTAATGG ACAAAGTTAA 1560 

TAAGATCGCC AGAAATTGAA TATAAAAAAT ATTAATATGG AAAGTACAGT GTGAGCAATT 1620 

TGTATAGTTG TAAAAATAAC TATGCTTAAT TTGTTATGGA TGAATGCGAT GATAGCATGT 1680 

TCCTAnTAT ATTATGAAAG CAGATTGTCA ATCTAAATTA TCGGCAATAA ATCATAATTT 1740 

ACGCX5TACTA TTCCAATATG GAGGAAAATG TC6TTATGTG TGGAATTGTT GGTTATATTG 1800

GCTATGaTAA TGCCAAAGAA TTATTATTAA AAGGTTTAGA AAAATTAGAA TACAGAGGIT 1860 

ATGACTCTGC AGGTATCGCA GTAGTAAATG ATGATAATAC AACTGTATTT AAAGAAAAAG 1920 

GTCGTATTGC AGAATTACGT AAAGTTGCTG ATAGTAGCGA TTTTGATGGA CCTGTTGGAA 1980

TCGGTCACAC ACGTTGGGCA ACACACGGTG TACCGAATCA TGAAAACTCT CATCCACATC 2040 

AATCATCAAA TGGCCGTTTT ACTCTAGTTC ATAACX3GTGT TATTGAAAAC TATGAAGAGT 2100 

TAAAAGGTGA ATACTTACAA GGTGTATCAT TCATTTCAGA AACAGATACA GAAGTTATCX5 2160 

TTCAATTAGT TGAATACTIT TCAAATCAAG GACTTTCAAC TGAAGAAGCA TTTACAAAAG 2220 

TTGTGTCATT ATTACATGGT TCATATGCAT TAGGTTTATT AGATGCTGAA GACAAAGACA 2280 

CAATCTATGT TGCTAAAAAT AAATCACCAT TATTATTAGG TGTTGGTGAA GGTTTCAATG 2340 

TTATCGCATC AGACGCACTT GCAATGTTAC AAGTGACAAG CGAATATAAA GAAATCCATG 2400 

ACCATGAAAT CX^TTATTGTT AAAAAAGATG AAGTTATTAT TAAAGATGCA GATGGAAACG 2460 

TTGTAGAACG TGATTCATAT ATTGCTGAAA TTGATGCATC AGATGCTGAA AAAGGTGTTT 2520 

ATGCACACTA CATGTTAAAA GAAATTCATG AACAACCAGC AGTAATGCGT CGTATTATTC 2580 

AAGAATATCA AGATGCAGAA GGTAACTTGA AAATTGATCA AGACATCATC AATGATGTTA 2640

AAGAAGCAGA CCGCATTTAC GTTATTGCAG CAGGTACAAG CTACCATGCA GGTTTAGTAG 2700 

GTAAAGAATT TTTAGAAAAA TGGGCTGGCG TACCAACTGA AGTACACGTT GCATCAGAGT 2760 

TTGTCTACAA CATGCCATTA TTATCTGAAA AACCATTGTT CGTTTATATT TCTCAATCAG 2820

GTGAAACTGC AGATAGCCGC GCCGTATTAG TTGAAACTAA TAAATTAGGT CATAAATCAT 2880 
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TACAC6CGG6 TCCTGAAATC GCAGTT6CAT CTACAAAAGC ATATACTGCA CAAATTGCAG 3000 

TATTATCAAT CTTGTCTCAA ATCGTTGCAA AAGAGCaTGG TCGTGAAGCA GATArTGATT 3060 

TATTGAGAGA ATTAGCAAAA GTAACAACAG CAATAGAAGC AATTGTTGAC GATGCACCAA 3120 

TTATGGAACA AATTGCTACA GATTTCTTAG AAACAACACG CAATGCATTC TTTATCGGAC 3180 

GTACTATTGA CTATAACGTA AGTTTAGAAG GTGCGTTAAA ACTTAAAGAA ATTTCTTACA 3240 

TTCaAGCAGA AGGTTTTGCT GGTGGAGAAC TTAAACATGG TACAATT6CC TTAATCGAAG 3300 

AAGGTACACC AGTTGTAGGT TTAGCAACAC AAGAGAAAGT TAATTTATCA ATTCGTGGTA 3360 

ACGTTAAAGA GGTAGTAGCA CGTGGTGCAC ATCCATGTAT TATTTCTATG GAGGGTCTTG 3420 

AAAAAGAAGG CGACACTTAT GTCATTCCTC ATGTACATGA ATTGTTAACG CCATTAGTAT 3480 

CAGTGGTTGC ATTACAATTA ATTTCATACT ATGCAGCATT ACACAGAGAT TTAGATGTTG 3540 

ATAAACCACG TAACCTTGCT AAATCAGTTA CTGTGGAATA ATTCACTTTT TTAGAATCAA 3600 

TCATGTATTA AAATTAAAGT ATATGGCACC CTTTTAGATT AATCGACTAG AAGGGTGCTT 3660 

TTTTAGGTCG ACTTaGCTTT TACTTCATCT TAATTTGGCA GAAATGCGTa AAAATGAAGT 3720 

GTTTTATTTA TTTAAATAGT CTGACAATTA AGGGTGTTAT GTTAATATGA TTTTATGAGA 3780

AGTATGGAGT AGCAATAAAG GGGTGACCTC GCATGTTAAT TCAATTAGAT CAAATTGGGC 3840 

GAATGAAGCA AGGAAAAACA ATTTTAAAAA AGATTTCTTG GCAAATTGCT AAAGGTGATA 3900

AATGGATATT ATATGGGTTG AATGGTGCTG GCAAGACAAC ACTTCTAAAT ATTTTAAATG 3960 

CGTATGAGCC TGCAACATCT GGAACTGTTA ACCTTTTCQQ TAAAATGCCA GGCAAGGTAG 4020 

GGTATTCTGC AGAGACTGTA CGACAACATA TAGGTTTTGT ATCTCATAGT TTACTGGAAA 4080 

AGTTTCAAGA GGGTGAAAGA GTAATCGATG TGGTGATAAG CGGTGCCTTT AAATCAATTG 4140 

GTGTCTATCA AGATATTGAT GATGAGATAC GTAATGAAGC ACATCAATTA CTTAAATTAG 4200 

TTGGAATGTC TGCTAAAGCX5 CAACAATATA TTGGTTATTT ATCTACCGGT GAAAAACAAC 4260 

GAGTGATGAT TGCACGAGCT TTAATGGGGC AACCCCAGGT TTTAATTTTA GATGAGCCAG 4320 

CAGCTGGTTT AGACTTTATT GCACGAGAAT CGTTGTTAAG TATACTTGAC TCATTGTCAG 4380 

ATTCATATCC AACGCTTGCG ATGATTTATG TGACGCACTT TATTGAAGAA ATAACTGCTA 4440 

ACTTTTCCAA AATTTTACTG CTAAAAGATG GCCAAAGTAT TCAACAAGGC GCTGTAGAAG 4500 

ACATATTAAC TTCTGAAAAC ATGTCACGAT TTTTCCAGAA AAATGTAGCA GTTCAAAGAT 4560 

GGAATAATCG ATTTTCTATG GCAATGTTAG AGTAAATATT TTGCAAATAA TAAGTAATAA 4620

TGACAAAATT TAATTAAGAT AAAATGGACA GTGGAGGGCA ATATGGATAA CGTAAAAGCA 4680 
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AAAGATGTCA TTAATCAATT GAGAGAGAAA GGATATAAAG TATTTTTGGC AACAGGACGT 4800 
TCGCATTCTG AAAATACATC AACTTG 4826 
(2) INFORMATION FOR SEQ ID NO: 276:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4846 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 276: 

GAATAAAAAG TAAAATTACT CGCCTTTGTT ACCTTTTACT TTATCAATaA AATCAGTTGC 60 

TTTTTCTTTT GCATTTTCAA CGAATTCTTT CGCTTTACCA GAAGCTTTAT CTTCTTTACC 120 

TTCGTTTTCT AAATTTTTAT TATCAGTAAC ATTACCTACT GTTTCTTTAA CATTACCTTT 180

TGCTTGTTCA AATTTACTTT CGTCTGCCAT AATAAATGCC TCCTCGGAAT AATTAAATGT 240 

TATATATAAT ACTTACCCAC TGAAAAATTA TCTAAACATT TTACTTAAAT AATTTTTGAT 300 

ATTGATTTGA CGTCATTTTA TAACTAGCGA AATAGATTCA TCATTAACTT GAGGGAGTGG 360

GACTGAAATA ATAAAGAATC ACTAATGATT TATGATGTAT TAGTCACTAG CCATGTGAAA 420 

TTAAAAATAA GAATAAATGA GTAGCACGCA TGCATATAGG ATTTTACTTT ATCCGTAATA 480 

GCATCTCATT CCTAAATATC ATATAAATAC CTGTTTAAAT TAAAAAGCCC AGCAACATCA 540

CGTTACTGAG CCATTAATAT GATTTATTTA GCAGGAATAA CTGCACCATT GTATTTTTCA 600 

TTAATGAAGT CTTGAATATC TTTAGATTGT AATACTTCAA TTAATGCTTT GATTTTCTTA 660 

TCATCTTGAT GTCCTTCTTT AACAGCAATT AAGTTTGCAT AAGGATTATC TTTCGCACTT 720 

TCTACAGCAA TAGAATCTTT TTTAGGATTT AGTTTTTGTT CGATTGCAAA GTTCGAATTA 780 

ATGATAACAG CATCAGCGTC TTCATTTTGA TAAATTTTAG GTAAGAATTC TGCTGATTGT 840 

TTATTATTAA ACTTAATATC TTTTTTATTC TCTGTAATAT CACTAAACTT AGCATCTTCA 900

ATTTTTACGC CTTTTTTGAT TTTAATTAAA CCTGCATCAA CGAAGAATTT TAAGAAACGT 960 

CCTTGTTCAG CTGGATTATT AGACACATAG ACTGTTGCAC CTTTTGGTAA TTCTTTTAAA 1020

CTTTTATACT TTTTAGAGTA TACAGCCATA GGTTCTAAGT GAACATCACC GGCACTTACG 1080 

ATTTTGTAAC CTTTATCCTT TTTCTCTGTG TTTAAATATG GTGTATGTTG GAAATAGTTT 1140

GCGTCAATTT CACCTTTGTC TAGTAATTTA TTAGGTGTAG TGTAATCQTT AATTGTTTTA 1200

ATATCTAGTT CATAACCTTT TTTCTCTAAT AATGGTTTTG CTTTTTCTAA AATTTCAGCA 1260 
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10 



TTACCGCAAG CTGCTAATAC AACTGCAAAT GTTAATACTA AAATAAGACC AAATAATTTT 1380 

TTCATAAAAT GAAACCCCCA ATTTATCC5TT TATCAAGTTT ATTTGTAAGC CAATCCCCAA 1440 

TGAATTGGAT TATAAATACA ATAATTAAAA TAAAAACTGT TGATACTAAA ATGACATCAT 1500 

TTTGATTTCG AGTGAAACCT GTTAAGTATG CTAAATTTCC TAAACCACCO GCACCAATTA 1560 

CACCTGCAAC TGCTGTTGAA CCAACTAAAG CGATTGCTGT AACTGTAATG CCAGACACTA 1620 

GCGCTGGCAT AGCTTCAGGT AAAAGGACTT TACGAATTAC T6TCCAAGTA TTAGCGCCCA 1680 

TTGACCAAGC CGCTTCGATG ACACCTTTAT CAATTTCTTT AAAAGCAATT TCTACGAGCC 1740 

TTGCATAAAA CGGTGctGCG CCAATGATCA AGGCTGGTAA CXK:ACCTGTC GGACCACTTA 1800 

TCGTTCCAAG TATCAAACTT GTAAATGGAA TTAATAATAA AATTAAAATA ATAAATGGTA 1860 

TCGCTCTAAA TAAGTTAACA ATGAAAGAAA CGATAGAATA AAATAACCTT GCACCGATAG 1920 

ACTTACCTTT AGCAGACAAG AATAATAACA CACCTAAAAT AAGACCAAGT ATAAATGCAA 1980 

ATATAGTTGA GACGACTGTC ATGTATAGTG TTTCGACTAT TGCAGTCCAA ACTTCTGGCC 2040 

ACTGAATATT AGGCATTGTA ATCATTTCAT TTATAATTTC ACTAAATGAT TTACCCATGT 2100 

CTTAACACCT CCATTTTAAC TTGTCGCTCA ATTAACTCTT TTTCGAATTT TCCGAAATCT 2160

ACACTTGAAA TATATGGAAT ATGCAGAACT AAAAAGCCGA CTGTTCCATT TTTTGTATTT 2220 

TTAATATTTG CTTCTAAAAT ATTAATTTTA ATATCATAGG CAGTTGATAG ACTCGATACA 2280 

ATAGGCTCGG TTGTTGTTGA ACCAGCGAAA ACTAATCTAA CGATATATGC ATCTTTTTCT 2340

AATGOCTCTA ATTCTGTTAA AGATGTTTCQ AAATCATCAT TTAAATCGTC TTTCACAAAT 2400 

CGTTTTGTCA CAGTGTGTTG CGGATTTTCA AAAACCTGTG TCACCGGTCC TTGTTCTATC 2460 

ACTTTACCAC TTTCCATAAC TGCAACTTCA TCACAAATAC GACGAATGAC ATGCATTTCA 2520 

TGCGTAATTA GTACAATTGT TAAATTTTGT TGTTCTCTAA TTTTTAGTAG TAGATCTAAA 2580 

ATTT^TCTG TTGTTTGCGG ATCAAGTGCA CTTGTTGCCT CATCACAAAG CAAGACCGTT 2640 

GGATCATTaG TAACGCTCGT GCAATCCCAA CACGTTGCTT TTGTCCACCT GATAACTCTG 2700 

ATGGATAAGC CTTTTCTCTA CCTTTTAAAC CGAOGAGTTC GACAAGTTCT AATGCTTTTT 2760 

GCTTAGCTCT CCTTCTAGGG ACACCTGCAA TTTCAAGCGG AAACATAATA TTTTTTAACA 2820 

CAGTCCTTGA CCATAACAAA TTAAAATGTT GGAAGATCAT ACTTACTTTT TGTCTTTTTG 2880 

CTCTTAATCC ATTTTTGGAC AATTGACCTA TATGGTCTCC ATCTATAATA ACTTCACCTG 2940 

ATGTAGGCGC TTCTAAATGA TTAAACATTC GAATCAAAGT ACTTTTTCCT GCTCCAGAAA 3000

AACCAATGAC GCCATAAATC GATCCTGCTC GAATCGATAA ATTAACGTGA TCTACAGCAA 3060 
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TTCCTCCCTG TGTTGCTTAA TAAAATAAAA AATGCTTTCT CAATATCGAT AGAAAAATTG 3180 

AGAAAGCAAT AGTAGTATTG TTTCTCTCAT CTTCAAAAGT TAAAACTTTA TGTQAATTGG 3240 

CACCATTTCT ATATAAGACG GTTGCCGGGC TTCGTAGGGC ACATCCCTCC ACCACTCTCG 3300

ATAAGAGTTT ACGCATCATT TAATTTGTAT TAATCCTAAC ACCTTAGTAA AATTTCGTCA 3360 

ATAACTATTT TAAATTTTCT AACAAATCAG TCACCGATTT AAATGCATAA ATTCGTTTTA 3420 

CTTCTTTATC TTTATTCATC AACAATAAAA TCGGCGTAGA CATGATTTGC ATATCTTTAC 3480 

AAAACTGAGG ATAAAAGTTT AAATCTATTT TCAATAATGG TAACTGCAAT ATTTCATTAG 3540 

CAATGTCTAA CATTCTTTCT GaAACCTTAC AAGTACCACA CGTTGGTGTA TAACCAAAGA 3600 

TTAAATGTTT GTCTTCCTCA TAAAATGTAG TTACATCTTT GATGTCTAAT GAATTATTCA 3660 

TTTACTAAAA CTAACCTTTC ATTATTTATA TTCGGTAAAA GAGGTGTTTC TTTCTTACAA 3720 

GTAAAGCCAT GTTTTGAAAG TACATGCGCC AAATATTGTT TGGGQCAATT COCAACTTGA 3780

CAGTAAGTTT TATCAATAAA TATATGTTCA CTTTCACTCA AATAACGTTT AAACCAATTT 3840 

CTAATTCGAT CTCCTTCGTC ATCAGAATCG GCTAATACAA AAACTTGTTT ATCATACAGT 3900 

GATTCTATCA TATCATCAAG CTTATCTATA CTCATTGTTC CATGAGTACA AATAATATTG 3960

ACTGGTTCTG CAATAACCTG TTGCACCCTT TTTTTATCAG ATTTTCCTTC AACAATTATC 4020 

ACTTTATTTA CAATAGCCAT CATCATCACC CTTTAAAATC AATAAACATC TGTCACTGTA 4080 

TCATTTCACA AAATTGGTAT GAATAAAACA TAAATCACAA AAAATTTAAA CTAGCTTAAT 4140 

ATAATAATTA CAAACTCAAT GTTTGACTAG CTGGAACATT TAACATAAGC AGACAAAOGC 4200 

TAAGTCAAAA ATCAACATCC TAAAATCTAC AATGTTATAT TAACAATAGT TAACCAAAAG 4260 

AAAATACACC TATAACAAAC TTTTCAATTA TAGCGGGGCC CCAACACAGA AGCTGATGQT 4320 

AAGTGAGCTT ACAATAATGT GCAAGTTGGC GGGGCCCCAA CATAAAGAAA TACTTTTTCT 4380 

TTAGAAATTA GTATTTCTTA TGCATGAGTT TTAGTCATGT ATTCCTATTT TTAAATACAC 4440 

ATTAGCTGTG GCTTATGAAA ACAGGCTGGG ACATAAATCA ATGTTCTATG CTCTACGAAg 4500 

TTATATTGGC AGTAGTTGAC TGAACGAAAA TGCGCTTGTA ACAAGCTTTT TTCAATTCTA 4560 

GTCAGGGGCC CCAACACAGA GAATTTCGAA AAGAAATTCT ACAGGCAATG CAAGTTGGGG 4620 

ATGGGCCCCA ACAAAGAGAA ATTGGATTCC CAATTTCTAC AGACAATGCA AGTTGGGGTG 4680 

GGACGACGAA ATAAATTTTG CGAAAATATC ATTTATGTCC CACTCCCTAG ATTGATCTAT 4740

AGATACTACA CTTATTAAAG TAATATATTT TTATGATTCT CTTAGCTGCA ATCCCATOAA 4800

TACATGTAAT CATCAAACTT CATAGCCTCA AGGTCAGTAG ATTTCA 4846 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1843 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 277: 

AACAAAGACA CAATCGAACA TGAACCATCA GTAAAAGCTG AAGATATATC AAAAAAGGAG 60 

GATACACCAA AAGAAGTAGC TGATGTTGCT GAAGTTCAGC CGAAATCGTC AGTCACTCAT 120 

AACGCAGAGA CACCTAAGGT TAGAAAAGCT CGTTCTGTTG ATGAAGGCTC TTTTGATATT 180

ACAAGAGATT CTAAAAATGT AGTTGAATCT ACCCCAATTA CAATTCAAGG TAAA6AACAT 240 

TTTGAAGGTT ACGGAAGTGT TGATATACAA AAAAAACCAA CAGATTTAGG GGTATCAGAG 300 

GTAACCAGGT TTAATGTTGG TAATGAAAGT AATGGTTTGA TAGGAGCTTT ACAATTAAAA 360 

AATAAAATAG ATTTTAGTAA GGATTTCAAT TTTAAAGTTA GAGTGGCAAA TAACCATCAA 420 

TCAAATACCA CAGGTGCTGA TGGTTGGGGG TTCTTATTTA GTAAAGGAAA TGCAGAAGAA 480 

TATTTAACTA ATGGTGGAAT CCTTGGGGAT AAAGGTCTGG TAAATTCAGG CGGATTTAAA 540 

ATTGATACTG GATACATTTA TACAAGTTCC ATGGACAAAA CTGAAAAGCA AGCTGGACAA 600 

GGTTATAGAG GATACGGAGC TTTTGTGAAA AATGACAGTT CTGGTAATTC ACAAATGGTT 660 

GGAGAAAATA TTGATAAATC AAAAACTAAT TTTTTAAACT ATGCGGACAA TTCAACTAAT 720 

ACATCAGATG GAAAGTTTCA TGGGCAACGT TTAAATGATG TCATCTTAAC TTATGTTGCT 780 

TCAACTGGTA AAATGAGAGC AGAATATGCT GGTAAAACTT GGGAGACTTC AATAACAGAT 840

TTAGGTTTAT CTAAAAATCA GGCATATAAT TTCTTAATTA CATCTAGTCA AAGATGGGGC 900

CTTJ^TCAAG GGATAAATGC AAATGGCTGG ATGA6AACTG ACTTGAAAGG TTCAGAGTTT 960 

ACTTTTACAC CAGAAGCGCC AAAAACAATA ACAGAATTAG AAAAAAAAGT TGAAGAGATT 1020 

CCATTCAAGA AAGAACGTAA ATTTAATCCG GATTTAGCAC CAGGGACAGA AAAAGTAACA 1080 

AGAGAAGGAC AAAAAGGTGA GAAGACAATA ACGACrCCAA CACTAAAAAA TCCATTAACT 1140 

GGAGwAATTA TTAGTAAAGG TGAAyCgAAA GAAGAAATCA CAAAAGATCC GATTAATGAA 1200 

TTAACAGAAT ACGGACCAGA AACGATAACA CCAGGTCATC GAGACGAATT TGATCCGAAG 1260 

TTACCAACAG GAGAGAAAGA GGAAGTTCCA GGTAAACCAG GAATTAAGAA TCCAGAAACA 1320

GGAGAyGTAG TTAGACCACC GGTCGATAGC GTAACAAAAT ATGGACCTGT AAAAGGAGAC 1380 

TCGATTGTAG AAAAAGAAGA rATTCCATTC rAGAAAGAAC GTAAATTTAA TCCTGATTTA 1440 
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AAGTGCTACG ATAACTAACC CAATCAATCT 
CACTATATAT CATTAATGTC TACGTATGAA 
CACTTGGACA ATAATGACTA ATACAATAAC

TTGATAACCA TACACTAAAG CTAAGTCTCC 
CGCCGTACTT CCAATAAGTC CAATAATCGC 
AGCTTCAGGA ATTAAAAAAT ATCTAATGAT 
CGCTTCAATA ATCCCCTCGT CTACTTCCAA 
AGGTGCCACA TATACTGTTA AAGGCACGAT 

TACTAATTTT GTOAATGGCA CAATCGCAAT 
AATATTGATT AAAGGATTTA AAACTTGATG 

TCGAGTAATT ACCAATAAGA CACCTAATGG
AATAGATACC ATATATAGCG TTTCGTACAA 
AGAACCAAAC ATATGTTAAT GcACCTCCTC 

ATTTATTGCC GTGTCTTCAA ATTGTTGATC
GGTGTTACCT TGTATTTCTG ACATAGATGA 

TTGAATCAAG TCATTTATAA TCGGTTGTGT 

ATCTTTAAAA TCGCCAACTT GTTCGTCATT 

AGTCTGTATA ACTGTAGACA CAAAATTTTG 

CTCTTTAAcA GTTCCTGTTT CAACCACTTT 

AATGTCTTTA ATAACGCGCA TTTCATGTGT 
ATTG&CATTC TTTAATAACG TCAATATCGA 
TGCTTCATCG CATAGGAGTA TTTTCGGATT 
TTGCTTCTGC CCACCAGATA ATTCATCAGG 
AAATTCAAGC ATTTCCGTTA CTCGTTGCTT 
^ GAGTGGCATT GCTACATTTT TAAATACGGT 
TATCATACCG ATATCTTTCT TAATATCCCT 
ATGTCCATCT ACAATCACTT GTCCATTCGA 

so 

CAACGTACTT TTTCCTGCAC CACTATATCC 
CGTAAATGAT ACGTCCTTCA AAGCATCTAT 

55 
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TTTCATTCTA TCAATTCCTT TCAAAATCTT 2940 

TCTAGCTAGA ACATTCCCTA GCX3TTTGAAT 3000 

GGTAATAATA ATGACCGTCG TATCAAATCT 3060 

TATACCACCA CCGCCAACAG CTCCTGCCAT 3120 

AGTGGTAATT GCTAATACTA ACGAACCTAA 3180 

TTGTAGTGGT GAAGCGCCCA TCXSnTTTCGC 3240 

TAATGAGTTT TCAACAAGTC TTGCAATGTA 3300 

GGCAGCAGTC GTACCAATTG AAGTACCTAC 3360 

TAACAAAATA ATAAATGGTA GTGACCTTAA 3420 

TATCACTATA TTGGGCCATA TGCCTTGTTT 3480 

AATACCAATC ACTGCTCCTA AAAATAAAGC 3540 

TGCTTGTAAT AACTGTGCAC TGTCTAAATC 3600 

AAATTGAATA TTTTTCTCTT TGAAATATTG 3660 

CATATTAAAT CGAAGCCACA TATAACATAC 3720 

AAATAAAATT TTAACCTCTC TGCCACAAAT 3780 

CACCTGAGTT TCCTCGACGA AGATTTTATA 3840 

CAATCGACGA ATCAATGATG TACTTGGCTC 3900 

AGCAATCGTC GTTTTAGGAT GACTAAACAC 3960 

CCCCTTTTCC ATTACAGCAA CACGATTACA 4020 

AATCATCATA ATTGTAATGC CAAAGGTTTG 4080 

AGCAGTCGTT GCTGGATCCA ATGCGCTTGT 4140 

AGTAACAAGC GCTCTTGCAA TAGCCACCCT 4200 

AAATTGGTCT TTTTTATCAC TCAATCCTAC 4260 

AATTTCTGTT TTGCTTTTCT TACTTAAAAT 4320 

AGCTGAATTT AATAAATTGA AATGCTGAAA 4380 

CATCATTTTA TCX5CTATAAT TCGTAATATC 4440 

GGCAGCTTCA AGATGATTCA CGAGTCTTAC 4500 

AATCACACCA AAAATATCAT TGCGATTGAC 4560 

TTTTTGCTTC TTTTTATTAA AGGTCTTACT 4620 
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TAAGAGCATT ATATGTAAAA TTGCATATAT CGTcAATACA ATTTGCCGAA TTTTCTAAAA 4740 

AATTAAAAAA TAAGTAATTC ATGTGACAAT GACGAATTGT GAGACTACTA TGACATTTAT 4800 

CAAATTAAAT CCATAAAAAT GTCCACCAAT CCTCCACAAC 6CAATTACTA AATATTAACA 4860 

TCGCACAAAA AAGCACTAGC ATATTCAAGA ACAACAAACG TTGAACTCAA AATATATGCC 4920 

AGTGCTGCTA TTATTTATAA AGTATCTAGT GCTTGTTTTA AATCATCGAC TAAATCTTCA 4980 

GTATCTTCAA TACCTACAGA AATTCTTACA AGTCCGTCTG TAATACCTTC TTTAGCTCGA 5040 

ATATCTGCTG GAATGGATGC ATGTGTCATC AATGCAGGTA CTGAAATTAA ACTTTCCACT 5100 

GCACCTAAAC TTTCAGCTAA TGTGTAATAC GATGTTGCTT TAATCAATTG TTTGGCACTT 5160 

TCTGTATTTT TCACTTCAAA TGCAATCACA CCTGTATGGC CATCCGCTTG AGCCATATGG 5220 

ACATCATGAT TTAAATGACT TTCAATACTT GGATGGAACA CTTGTTGCAC AGCTGGATGT 5280 

GCTTGTAACA TTTTAATAAT TTCAATAACG CTGCGATTAA TTTGTTCCAT ACGTAAACCT 5340

AATGTTTTAA TACCCCTCAC AAGTAAATAG CTATCTTGAG GTCCTAAAAT GCCACCTGTT 5400 

GAATTTGAAA TAAATGCTAA ACGTTCTGCA AGCTTGTCAT CCGATGTTGC AACTAAACCA 5460 

GCAACGACAT CACTATGTCC ACCTAAATAT TTCGTTGCAG AATGTAAGAC AATATCGATA 5520 

CCTAAATCTA ATGGATTCTG ATAATAAGGT GTCATAAATG TGTTATCAAC AACTGAAATC 5580 

AAACCGTGTT CTTTCGCAAT TTCAGCAGAC TTTTTAATGT CAGTAACACG TAATAATGGA 5640 

TTAGAAGGTG TTTCAATAAA CAACATCTTT GTTGTTGGGC GTATCGCTTG TACAATTGAA 5700 

TCTGTATGCG TTGTATCTAC AAAATCCACT TCAATGCCAA ATCGTGTAAA TACTTTTGTC 5760 

AATGCGCGAT AAGTACCGCC GTATACATCT GAATTTAAAA TAATATGATC TCCTTTGTCC 5820 

AACAGCATAA CAACTGCACT GATTGCTGCA ACACCTGAAC TAAATGCAAA GCCATGTTTG 5880 

CCATTTTCTA ATGTCGCAAT AACGCTTTCT ACAGAACTTC TTGTTGGATT CGCAGTACGA 5940 

GAATATTCAT ATCCTTGACG TAAATCACCA ATATCATCTT GTAAATATGT ACTTGTTTGA 6000

TAAATTGGTG TTGTAACGGC ACCTGTATAA TCGTCTGTTG TGTGCCCACC ATGAATTAAT 6060

TTAGTTTTCT TGTTCATTAT TATTCTCCTC ATAATTAAAT ATTTGCTTAG ACATATATCG 6120 

ATCACTACCA TCTGGAAATA CGACAACAAT CGTACCTTCA GATAATTGCG CTTTTAAATT 6180

CAATGCACCT TGTAATGCTG CACCTGaAGA ACTGCCTACT AACAACCCTT CATTTATAGC 6240 

CAAACTTTTG ACATTTCGAA AGGCATCTTG ATCTTTAATC GTAAATATCC CATCTACAAG 6300 

ACGTCTCTCT AAAAATATCG GCCATTTCTC AGAACCX5ATA CCTTCAGTGT CATGTGCATG 6360 

AGCTGGCCCT CCATTTAACA CGGACCCTTC TGGCTCAACG GCATAACATT GCACGTGATG 6420 

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3S 






ATAATCAATT TGCTGTAATG CTGAAGTCAA TTCGGGTCCC AATGTATGAA AATATGTATC 6540

CX3GATTATGT TCGGATTCAA ATTGATTCAT ATAAACGGCA CCATATTTTT CAGCATAGGA 6600

ACX5TGCAGCT AATTGTGCCC CATGCATACC TTCAGACTGA CTCGTCCTTG AAACTTCTGC 6660

ACCAAGCGCT ATCATAATAT TAATCTTTTC TTCTGAAAAA CCATACGGCG CAAAGATCTT 6720

ACATTTCAAA TGATGTCTAT TCGCTGCAAT AGCTAACCCT ATGCCTGTAT TACCAGCAGT 6780

CGCTTCAACA ATAGTTTGAC CTGCACGCAC ACGCCCTTCT TGAATTGCCT TCTCTACTAA 6840

ATArrrcccG agtctgtctt taacactgcc TCCAGGATTC CATTGTTCAA GCTTGGCATA 6900

AATTTTAACT TTATCATCAC TATAATGTTC TAACAGTACT AATGGTGTAT TGCCaATTAA 6960

atcataagta atcatagatg caccctcatc TGACATGCCG ATCAAATGAA TGAAACCTTT 7020

CTTCATGTCT CAATTTTAAT TCTTACTTTT CAGATAAGAA TTATAAACGA CATTTTGTTA 7080

TTTTGCAATT ATCTAAGTTT CGATTAATTC AGAACCAGTA CTAAATTTTC AATTCCAAAC 7140

AAAAAAACAC CTGAGCAACA CAAATACTTG TGTGTCAGAT GCTTCTATAT ATTAACTAAA 7200

TAATTGCACX3 ATAAAGACTA AAATAATAAC GACAGGCATC GCATACTTAA TTAAGTAATA 7260

CCAACCACTG AATAATCTAA ATCGATCTTT ACCAAAATAT TGTTGTAATA ATTTTTTATC 7320

TAATAATTGT CCTACGACAA GCGTAGTACC TAATGCGCCT AATGGCATCA ATACATTCGA 7380

AACGATGAAA TCCATATTAT CAAAAATCGT TCCCGCACCG AATCTTACAT CTTTTAAGAT 7440

ACCAAAAGAT AAGGTTGCTG GAATACTAAT GATAAATACT AAAATACTAC CGATCACTGC 7500

GACTTTTTTA CGTTTTGTAT TGTCATTCTT CX5TGAAGTTA GAAACATTTA ATTCTAATAA 7560

AGAAATAGAT GACGTTAAAG CCGCAAATAA GAACAGCACT AAGAATCCCA AATAGAATAA 7620

TGTGCCTAGA TGCATTTGAC TAAAGACCAT TGGCAGTACT TTAAATAATA ATCCAGGCCC 7680

TTCT^GTGGT TCATAGCCAA AACTATGTAA AGCCGGAAAT ATAGCTAGAC CTGCCAATAC 7740

AGATACAAAG ATATTCATAA CAACGATAGA AATAGCTGAT GACTTAATCG TCATGTCTTT 7800

AGAGGCATAA CTCGCATAAG TAATCATACC TGTAGTTCCT AATGATAACG TAAAGAATGA 7860

TTGACCTAGC GCAAACAAGA TGCCATCAGC AGTAATCTCT GATACTCTTG GTTGTAAAAT 7920

AAATTTCACA CCTTCTAAGA CGCCATCTAA TGTTAAAGAC TTAATCACAA TGACGATTAA 7980

AAAGACAAAC AGCAATGGCA TCATAACTTT CGATGCCTTT TCTAATCCTT TTTCAACACC 8040

TAACATGACA ATAATCATCG TAGCGAATAT GAATATACCT TGCCCTAGAA CGGTTAACCA 8100

AGGATTTGAT ATTACCGCTT CAAAATTCAT TTCTTGGAGA TGATTGATGC GTTGAAATAT 8160

AACTAATTGC CATAATACTT GTCCGATGTA AATGACAATC CAACCACCXSA TAACACTATA 8220
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TTTACCAGTT AATTTACTAT ATATTTGTGT TGTATATGTC CX5TCCCATTT TCCCAACAGT 



8340 



GAATTCCATA ATGAGTAATG GCAACCCAAC AAAAATGGTG AATATTAAGA ACATAGCTAG 8400

AAAGGCACCG CCGCCATAAA TCCCTGCCAT ATATGGGAAT TTCCACATGG CACCAAGACC 8460

GATTGCAGAA CcCGCACTAG CTAAAATAAA TCCAGTTGAT GACTTCCATT GTGATTGTTG 8520

TCTTTTCATC ATTCAC 8536




(2) INFORMATION FOR SEQ ID NO: 279: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4328 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 279:

GCTTTGGCCA TTTTATGTGG CGATTGAGAC AATCtGTkGT TGTCTTATTT GATGTTGTAT 60 

TTCAACTGGT AATTCTAGTT GCGATTGAAA TAATGGCAAC TTTTCCCAAT CATTAACAAA 120 

TAATTCAATA CCTGCTATGT CTAATACTTT AGCACGTGCA TCATCAACAA GACGGCGTTC 180 

CAATTGATTT GCTTCTTCTT TAATACCTGG TGACGTACTT TCTAATATCA AATTAGATAT 240 

AGGGATGTGA CCATTAATTG CATAATATAA TGCAACACGC CCACCCATTG AATATCCAAA 300 

CAATGTTATT GATTTATCTT TATATTTATC TAAAATTCGG TCTAACAACG TCGTAATATA 360 

ATCAAAATTC CACGTTTCAT CCATTGAAGA CTGATCTTCG CCATGGCCTG GTAAGTCTAT 420 

AGTGATGACA TGATAGTTAT CAGTAAATTT TTCGATGTGA TTATGATAAG TACGGCTGTC 480 

GCTAAGAAAT CCATGCAGAA ATACTAAAAC TTQATTGGTC TCAACGTTTG CTTCATAAAA 540 

TTTJETAATGT GTCATGAATC ATTTCACTCA ATTTCTGGTA TAAAATTTGA TGCTGTTTAA 600 

AGTtATCTTC GCGATTCGTT ATCAATTCAT AAATCGTCGA AGTTTCAGAT AACAATGTGG 660 

CATTTTTAAA TTCTGAAACA CTGTTAAAAC GTTTAAAATC GAATTGATAT AACTTAGCTG 720 

TATACTCGAA ATCCAATCCC GTCGGTGT6C CAAACAACCG TTCAAAATAG TCAGTTGCAC 780 

TTTCTTTTTG TGGTAAATAT GAAAAAATAC CGCCACCATC GTTGTTCAAT AATACAATAT 840 

TCATCTGAAT ATTATTTAAT TTTGACATTA ATAGTCCATT CATATCATGA TAAAATGATA 900 

AATCACCTAT CAATAATGTT ATTCGTTTAT GCACAGCCAT ACCCAGTGCA GTTGAAACGA 960 

TACCATCAAT ACCATTCGCA CCACGATTCG CATAGACATC TATATTTTTA TTCAATAACA 1020 

AGTTATCTAC ATCTCTGATA GGCATACTAT TACTAATAAA TAATGCATCT TTTTCAGATG 1080 
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TAATTTCTTT ACGCCCTTTT TTCTCTAAGC ATTGCCATTT TTCTAACCAA CTTACGCGAT 1200 

TAACTGTCGT GTCTTCCATT AATGACCTAA AGAAATCATT CGCAGAAATC TCATATGAAA 1260 

TATCTGGCGC TATCGGAAAG ACATCAATCT TATCATTGTT TTGCACTAAT ATTTGAAATG 1320

CATCAGTTTT CTTTAACCAT TGATTTAACT TTTTAGAAAT CACTGGTTTC CCAACACGAA 1380 

TTACGAAATC CACATTTAAG TCTAAGCCGC TTCTAAACAG CAAATCATAT GTACAGATAA 1440 

CATTCX3GATG ATCAAATTTT CTTAAATGAC TTAAAGGATC AGCTAAAATA GGCAAATCAT 1500 

ATATCGTTGA ATACGTTAGT ATTTGaTCAA CTTCTTGGTG CTGCATATCC CCTACAATAA 1560 

TTAAACCTTT TTTCTTATTT AAAATQTGTC TTAATGCCGA TGCATCTATA CTTTTTTGAT 1620

AGTGCGGTAA AATCTTCATC TCAGAAGTTA ACAATTCTGT TGCATTCAAA TCAGGTGTTA 1680 

ACXK3ATCTCT AAATGGCAAG TTAAAATGAA TTGGCCCTTT ATGTGGTCCA TATAAATATT 1740 

GACTAGCAAT TTGCATTTGA TAGTAAATTG CATCAATGGT CTCTTTACTA TCATCCGCAA 1800

TAGGCATATC GAACTCATAA CTTACATAAT TATTAAACAT ATTTACTTGA TTAATCGCTT 1860 

GTGGTGCGCC TACACTTCTT AATTCATGCG GACGGTCACT TGTTAAAACG ATTAAAGGAA 1920 

TTCTACTAAT TTGGCTTTCA GCAATTGCAG GCGTATAATT CGCTGCTGCT OTACCTGACG 1980 

TACATAATAT AGCGACAGGT CTTTCACTGC CTTTAATTAA CCCAACTGCA AAAAACGCTG 2040 

CACTTCGCTC ATCGGGGTGT ATCCATGTTT TAATATTTGG ATGTGCTTCA AATGCAAGTG 2100 

CAAGTGGCGT TGAGCGTGAT CCCGGACTGA TAACTACTTC CCTTACGCCG TACGCATATA 2160 

ACTCAGATGC AAATGTAAAA ACTTGCTTCG TTAAAGCTGC TTTATGATTT CCCATTCATA 2220 

TCGACTCCTA ATGCATTCAT CATAGGTGTG AACTTAAGGT TCGTTTCTGC CAATTCACTA 2280

TCTGGATCAG AATCTTTAAC AATGCCACAC CCAGCAAATA AAGTTGCTTG TGCTTTCTTA 2340 

ATAAGCATC6 AACGAATTGC AACAATAAAT TCACAATCAT CGTATATATC TATATAGCCA 2400 

ACCGGTGCAC CATATAATCC TCGCGTACCA AATTCTTTCT GCTCAATAAA ATCCATTGCA 2460 

AATTCTTTTG GATAGCCACC TAAAGCAGGT GTTGGATGTA AATTATCAAT TAAACTAATA 2520 

TACGAATCAT CCTTCAGTGG CGCCTTTATT TCAGTGTACA AGTGATATAA ATGATCATTT 2580 

TTTAGAATTT TAGGCGTCTT ATCATAATGT AATTCAGTGA TATAAGGTTT AATATCATGT 2640 

AAAATACTGT CAACAACAAA TCGATGTTCG ATTAAGTTTT TATTATCTTT TAAAAATGCT 2700 

TCAACATTTT TTGTATCTTC GTCCTCATCT TGTGAACGTT TAATTGTACC TGCTACAGCT 2760 

TTAGTCGATA GTATTTTATT ATTGACCTTT ATTAATTGTT CAGGTGTTTG TGAAAAGAAT 2820 

ATAGAATCTT GTGATTCTAA CAAGAATATA TAACTGTTTT TTTCTTTAGA ATATGCTTGC 2880 
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ACAATTTTTT CTTCATTATT AATAGATTCT ATAGCTTCTA CTACAAGTTG ACGCCAGTCA 3000 

TCTTTATAAA TATCTTCATT TCTAGTAATT TCCCCAATTT GCTCGTCCAC ATCTATGTCC 3060 

GATATATTGT TGAACAAATC CATTAAATCG TTCAATGCCT CAACAGTAAA ACTTTCCCTT 3120

TTAACTGTAT AAGTTAAAAA TGTCCCATTA TTATCAGTTG AAATTAAAAC TTCAGGTAAT 3180 

ACAAAATGAT TTAGTCCAAA CTCTCGCCAT TCATCATCTG ATTTATGACT TGAAAATTGG 3240 

AACCCTCCAA CAACTCGAAG ATGATGTTTC TCAGATTGCX3 GATGTATAAA TGTGATGTTA 3300 

TGTTTTAATT TTTCCCAGTC TTTAAAAATA GATTGTTTAT TTTTAGAATT ATTTTTGAAT 3360 

AATTGAATTG CTTTGTAGCC AAAATATGAC GTTCGATTAT CATTCAAACG CATATAAAAG 3420 

CGATCTCCTG CCTCATTGTC AGTGAGATGA AATAATGTGC TCGGGTCTAG TGACTGTGAT 3480 

AATTTCACTT CAACTGAAAC CCATTCCTTT GAGCTGCCAT ATATCTCTTT GACAATATCG 3540 

TCCTCTAATA CGCCCGTAGC CATCCATTTC ACTTCTTTCT TCGTCTTTTT TCACTCATTA 3600

TTATATTGTA TCATTTTTGG ATAATTGTGT TACAAGAATT GCTTAAACTT ATCTTGCAAT 3660 

TTTTCACGTC AATTGACCTT TATGCTACTT TCTATTAAAA TATCTTTGTT ATAAAAAATA 3720 

TGATTTAAAG AGGTTTTGTA TTCAATGAGT AATCAATATC AGCAATATTC TACAGTTAAG 3780 

AAATATTGGC ATTTAATGCG TCCTCATACA TTAACTGCTT CCGTAGTACC CGTTTTAGTT 3840 

GGTACAGCAG CATCTAAAAT ATATTTTCTT GGTAGCGAAG ATCATATTAA AATCAGCCTA 3900 

TTCATTGCCA TGTTACTAGC ATGCTTACTT ATTCAAGCAG CAACTAATAT GTTTAATGAA 3960 

TACTATGATT ATAAAAAAGG CCTCGATGAT CATGAATCTG TAGGCATTGG TGGTGCCATT 4020 

GTTCGCAACG GTATGAGCCC AGAGCTTGTG CTACGATTAG CCATTGCATT TTACATCTTA 4080 

GCAGCAATAT TAGGTTTGTT TTTAGCTGCT AACTCTTCAT TTTGGTTATT ACCAGTTGGA 4140 

TTAGfATGTA TGGCTGTTGG TTACCTATAT ACAGGTGGCC CTTTCCCTAT TTCATGGACG 4200 

CCTTTCGGTG AATTATTCTC AGGCGTATTT ATGGGTATGT TTATTATCGT TATTGCATTC 4260

TTTATTCAAA CTGGCAATAT TCAAAGTTAT GTAATTTGGT TAAGTGTACC TATAGTAATC 4320 

ACTATCGG 4328 

(2) INFORMATION FOR SEQ ID NO: 280:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1450 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear
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GTTCAATACA GAAAAAATAA 


ATTTAGATGT 


TGAAGCATCC 


TACAATTAAT 


ACAGATCCAT 


60 




TTCAATATAT TTAAACTAAA ATCTCGGGAT TTCTAAATTT TGAAATTTCG AGGTTTTnAT 120

ATTTTTATTT AAAATAGCAC ATTTATACTT TATAATAGTA AAGATGAACA TATAAGGAGG 180

CCAAATCATG GCAAAACATC CATTCGAACA ATTTAATCTA GAATCTAGTT TAATTGACGC 240

TGTGAAAGAC CTTAATTTTG AAAAACCAAC TGAAATTCAG AATCGAATTA TTCCAAGAAT 300

ACTAAAGAGA ACAAATTTAA TTGGTCAATC TCAAACGGGT ACAGGGAAAT CTCATGCATT 360

TTTATTACCA TTAATGCAGT TAATTGATAG TGAAATAAAA GAACCACAAG CAATCGTAGT 420

TGCACCAACA AGA6AACTTG CACAACAACT ATACGATGCA GCGAACCATT TAAGCCAATT 480

TAAAGCTGGT GTTTCAGTTA AAGTTTTTAT TGGTGGTACA GATATAGAGA AAGATAGACA 540

ACXSTTGTAAT GCACAACCAC AATTGATTAT AGGCACCCCT ACTAGAATTA ATGACTTAGC 600

TAAAACGGGA CATTTACATG TGCACTTAGC ATCATATTTA GTTATTGATG AAGCGGATCT 660

TATGATTGAC TTAGGATTAA TTGAAGATGT AGATTACATT GCTGCAAGAT TGGAAGATAA 720

TGCAAATATT GCGGTGTTTA GTGCTACAAT CCCACAACAG TTACAACCAT TTTTAAATAA 780

ATATTTAAGT CATCCAGAAT ATGTAGCTGT C6ACAGTAAA AAACAAAATA AAAAGAACAT 840

CGAATTCTAT TTAATACCTA CTAAAGGTGC AGCTAAAGTT GAAAAGACTT TAAATTTAAT 900

TGATATACTA AATCCATACT TATGTATTAT TTTCTGTAAT AGTAGAGATA ATGCAAATGA 960

TTTAGCACGT TCACTAAATG AAGCTGGTAT TAAAGTTGGT ATCATTCATG GTGGCTTAAC 1020

GCCaCGTgAA CGTAAACAAC AAATGarACG TATACGTAAT TTaGAATTCC aATACGTTAT 1080

TGCCaGCGAT TTAGCATCTC GTGGTATTGA TATTGAAGGT GTTAGTCrTG TCATCaATTT 1140

TGATGTGCCA AATGATATTG ACTTCTTTAC GCATAGAGTC GGACGAACTG GTCGTGGGAA 1200

TTATrrAGGT GTAGCAATTA CGCTTTATAG TCCTGATGAA GAACACAATA TTTCATTAAT 1260

AGAASATCGC GGTTTTGTAT TCAATACTGT TGATATTAAA GATGGTGAGT TAAAAGAAGT 1320

TAAAGCGCAC AATCAGCGTC AAGCAAGAAT GCGCAAAGAT GACCATTTAA CTAATCAAGT 1380

GAAGAACAAA GTTCX3AAGTA AAATTAAAAA CAAAGTTAAA CCAGGTTATA AGAAGAAATT 1440

TAAACAAGAA 1450




(2) INFORMATION FOR SEQ ID NO: 281: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1139 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 281: 



AGTCAGGTAT ATCATGCCaT yCTGAATTGG TCGATATTAA TATCAGTGGT GTTAAAGAAC 60

GAATTGTATA CCAATAGACG CTTTATATTG TAAAATAGTA TTAAATGCaG AATAGAGAGG 120

AGATTTAATG CGATATGACA AATTATAAAG TTGTCGTTTT AGATATGGAT GACACATTGC 180

TAAATTCAGA TAATGTGATA TCAGAAGAAA CTGCAAATTA TTTAACAGCA ATTCAAGATG 240

AAGGTTATTA TGTTGTTCTA GCATCTGGTA GACCTACTGA AGGTATGATT CCAACTGCTA 300

GAGATTTAAA ATTACCTGAA CATCATAGCT ATATTATTAG TTATAACGGT AGTAAAACGA 360

TTAACATGAC TAATGAAGAA GTAGAAGTAA GTAAATCGAT TGGTAAGCAA GATTTCGATG 420

AAATTGTAGA TTATTGTCGA GATAGAGGCT TTTTCGTTCT TACATATCAT GATGGTCAAA 480

TTATTTaCGA CAGOGAACAT GAGTATATGA ATATTGAAGC AGAATTAACA GGTTTACCGA 540

TGAAACGTGT TGATGATATC AAAGCGTATA TTCAAGGCGA TGTACCCAAG GTCATGGGTG 600

TAGATTATGT AGCGAATATT ACAGAAGCTA GAATTGATTT GAATGGT6TG TTCAATGATA 660

ATGTAGATGC TACGACAAGT AAGCCATTCT TCTTAGAATT TATGGCCAAA GACGTTTCAA 720

AAGGTAATGC AATTAAAGCG TTATGTCACA AATTGGGATA TTCGGTGGAT CAAGTCATTG 780

CTTTTGGTGA TAGTATGAAT GATAAATCAA TGTTTGAAGT CGCAGGTCTA GCTATTGCTA 840

TGGGGaATGC ATCAGATGAA CTTAAGCAAT ATGCAAATGA AGTTACGTT6 GATCATAATG 900

AAAATGGTAT TCCACATGCG CTCAAAAAAT TGTTATAAAT TTTAAAATAA GCCTTAACAC 960

ATGATATTTG AATAAGATAT CTTGTGGTTA AGGCTTTTTA TTTTTGTGAA AATGACTTCA 1020

GTTATACTAT GGAGGATTTG AAATACATAT TTTAGATTAG TAATGATATC AAACGAATAG 1080

AGTAAATGTA TATTTttTGA ATAAATCAAG TATTAACTAG TCACGGAAGG nAGATAAAT 1139



(2) INFORMATION FOR SEQ ID NO: 282: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2931 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 282: 

TCTAAAAATG CTGTGAAATT CTTTTATAAA TATCTAAAAG GAATTAATGT TGATAACATT 60 

GCTGTGATAG GAAGTAAGAC AGCGCAATAT TGTGAATCAC TTGGCATTCG AGTTGATTTT 120 

AtGCCAAACG ACnTTCTCA AGAAGGATTT TTAAAATCAT TTAATCAAAC TAACCAAAAA 180 
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AATGAAGTTG TTAAAATAGA TTTATATACT 
TCAGTGCCTA ACAAACAAAA TATACAAGAT 300

GTTAAAGAAA TGATAGAACA TCAACAAATC GATGCATTAA CATTTTCAAG TTCGTCGGCA 360

GTACGTTATT ATTTTAATGA AGGATTTGTA CCAAAATTCA AGTCGTATTT TGCTATTGGA 420

GAACAAACAG CACGGACCAT TAAATCATAT CAACAACCAG TAACAATTGC AGAAATTCAA 480

ACACTCGAAT CACTAATTGA AAAGATTTTA GAAAGTAGGG GCTAAAAATG AAATTTGATA 540

GACATAGAAG ATTGAGATCA TCAGCGACAA TGAGAGATAT GGTTAGAGAG AATCATGTAA 600

GAAAAGAAGA TTTAATATAT CCAATTTTTG TAGTTGAAAA AGACGATGTG AAAAAAGAAA 660

TTAAGTCATT GCCAGGTGTA TACCAAATCA GTTTGAATTr ACTTGAAAGT GAATTAAAAG 720

AAGCTTATGA CTTAGGCATA CGTGCCATTA TGTTTTTCGG TGTTCCAAAC TCAAAaGATG 780

ATATAGGTAC TGGTGCATAC ATTCACGATG GTGTTATTCA ACAGGCAACA CGTATTGCTA 840

AAAAAATGTA TGATGACTTA TTAATTGTTG CAGACACTTG TTTATGTGAA TATACTGATC 900

ATGGTCATTG TX;GCGTGATT GATGACCATA CACATGACGT TGACAATGAT AAATCATTGC 960

CACTACTTGT TAAAACAGCA ATTTCTCAAG TGGAAGCTGG TGCTGATATT ATTGCGCCAA 1020

6TAATATGAT GGATGGTTTT GTTGCTQAAA TTCGTCGTGG ATTAGATGAA GCCGGCTATT 1080

ACAATATTCC TATAATGAGT TATGGTGTCA AGTAT6CATC AAGTTTCTTT GGACCTTTTA 1140

GAGAT6CAGC AGATTCAGCG CCATCATTTG GGGATAGAAA AACGTATCAG ATGGACCCTG 1200

CTAACCGTTT GGAAGCACTT CXJTGAATTAG AAAGTGATCT TAAAGAAGGG TGCGACATCA 1260

TGATTGTTAA ACCTGCTCTA AGTTATTTAG ATATAGTTCG AGATGTTAAA AATCATAOGA 1320

ATGTTCCAGT TGTTGCATAT AATGTGAGTG GAGAATATAG TATGACTAAA GCAGCGGCAC 1380

AAAATGGTTG GATAQATGAA GAACGTGTCG TTATGGAACA AATGGTTTCA ATGAAACGTG 1440

CAGQT6CTGA TATGATTATT ACGTATTTTG CAAAGGACAT TTGTCGCTAT TTAGATAAAT 1500

AAGGTTTTAT ATTTATGATT TTCCATAAAC TGTAGGAGGA ATTTACTTTA TGAGATATAC 1560

GAAATCAGAA GAAGCAATGa AGGTTGCTGA AACTTTAATG CCTGGTGGTG TAAATAGTCC 1620

AGTACGCGCA TTTAAATCAG TAGATACACC AGCAATTTTT ATGGATCACG GTAAAGGTTC 1680

AAAAATTTAT GATATCX3ATG GTAACGAGTA TATCGACTAT GTACTAAGTT GGGGACCACT 1740

TATTTTAGGA CATAGAGACC CTCAAGTTAT TAGTCATTTA CATGAAGCAA TTGATAAAGG 1800

TACAAGTTTT GGTGCATCAA CATTACTTGA AAATAAATTG GCGCAgcTCG TTATTGACCG 1860

AGTACCTTCA ATAGAAAAAG TGCGTATGGT GTCATCTGGT ACAGAAGCTA CATTGGATAC 1920

TTTAAGATTA GCACGTGGTT ATACTGGCAG AAATAAAATT GTGAAATTTG AAGGTTGCTA 1980
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5 



GCCGGATTCT CCTGGTGTGC CTGAAGGTAT TGCTAAAAAT ACAATTACAG TTCCATACAA 2100

TGATTTAGAT GCACTTAAAA TCGCTTTCGA AAAATTTGGa AACGATATT6 CTGGTGTAAT 2160

CGTAGAACCT GTTGCTGGTA ATATGGGTGT CGTACCGCCG ATTGAAGGTT TTTTACAGGG 2220

ATTAAGAGAT ATTACGACTG AATACGGCGC ATTGCTAATT TTCGATGAAG TAATGACTGG 2280

TTTCAGAGTC GGTTATCATT GTGCACAAGG TTACTTTGGT GTGACACCAG ATTTAACTTG 2340

CTTAGGAAAA GTTATCGGTG GAGGACTACC TGTAGGTGCA TTTGGTGGTA AAAAAGAAAT 2400

CATGGATCAT ATAGCACCAT TAGGAAATAT TTATCAA6CG GGTACGTTAT CAOGAAATCC 2460

TCTTGCAATG ACAAGTGGTT ATGAAACGTT AAGCCAATTA ACGCCAQAGA CATATGAGTA 2520

TTTTAATATG TTAGGCGATA TACTTGAAGA CGGTTTAAAA CGTGTATTTG CTAAACACAA 2580

TGTACCAATA ACTGTAAATA GAGCAGGTTC AATGATTGGT TATTTCTTAA ATGAAGGACC 2640

TGTAACTAAT TTTGAACAAG CGAATAAAAG TGATTTGAAA TTATTTGCAG AAATGTATCG 2700

AGAAATGGCA AAAGAAGGTG TGTTTTTACC ACCATCTCAA TTTGAAGGTA CATTCTTATC 2760

TACGGCACAC ACGAAAGAAG ATATTGAAAA AACGATTCAA GCATTTGATA CGGCTTTAAG 2820

TCGTATTGTA AAATAAATAT ACGGACAAAT TGAGAGCCTG AACTTTGTTC AGGCTCnTTT 2880

TAAATGTATA TAAGGCATGG GCGGCGACTT GATAGTGAAA GTCCACTACT A 2931


(2) INFORMATION FOR SEQ ID NO: 283: 








(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1421 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 283:






AATTATGAAT GCATTACCAG TATTATTACA AAAGAACAAT TAAAAATGTT TGTTTATGAT 60

TATGATACGC ATCTCATTAA AAATGTAaTG GTTGCAGCAG ACGTGTTAAA GGCAAATGAT 120

ATTCAAGGAC ATGAACCATT AATCGTTAAC CTTCAAACGA TTGATGAAAC ATTACATCGT 180

TTACCTATGC ATAATAGAAA AGACATGATG GTTAATGGC6 GTGTACTTAT GGCACATTTA 240

AATGCCAAAA GTGGTCCGTG GTTAAAAGAT GTGCTAAGAC AAATTGAGAT AGCGATTGTA 300

ACAGGTAAAG TAAGCAACGA AGAAACTGAA ATTTTGAAAT GGGTGGATAA TCATGTCAAA 360

ATATAGTCAA GATGTACTTC AATTACTCTA TAAAAATAAA CCGAATTATA TATCTGGACA 420

AAGCATTGCG GAATCACTTA ATATTTCACG CACTGCAGTA AAAAAAGTGA TTGACCAATT 480
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CCCAGATATT 


TGGTATCAAG 


GTATAATAGA 


CCAATATACA 


AAAAGTTCTG 


CTTTGTTTGA 


600 




TTTTAGTGAA GTATACGATT CAATAGATTC TACACAACTT GCTGCGAAAA AGTCACTTGT 660

TGGAAATCAA TCTTCATTTT TTATCTTGAG TGATGAACAA ACGAAAGGTC GTGGGCGATT 720

TAATAGACAT TGGAGTTCTT CAAAAGGGCA AGGACTTTGG ATGTCTGTCG TGTTAAGACC 780

TAACGTTGCA TTCTCAATGA TATCTAAATT TAATTTATTT ATTGCATTAG GGATAAGAGA 840

TGC6ATTCAA CATTTTAGTC AAGATGAA6T CAAAGTGAAA TGGCCGAATG ATATATTTAT 900

TGATAATGGT AAAGTGTGTG GTTTCTTAAC TGAAATGGTT GCTAATAATG ATGGTATAGA 960

AGCAATAATA TGTQGTATAG GTATTAATTT GACGCAACAA CTAGAAAACT TTGATGAAAG 1020

TATTAGACAT AGAGCAACAA GTATACAATT ACATGATAAA AATAAATTA6 ATAGATATCA 1080

ATTTTTAGAG ATATTACTTC AAGAAATTGA AAAAAGATAT AATCAATTTT TAACGTTACC 1140

TTTTTCTGAA ATTCGTGAAG AATATACTGC AGCTTCTAAT ATTTGGAATA GAACGTTGCt 1200

ATTTACAGAA AATGATAAAC AGTTTAAAgG ACAAGCAATT GATTTAGATT ACGATGGCTA 1260

TCTAATTGTT AGAGATGAAG CGGGTGAATC ACACCGTTTA ATTAGTGCAG ATATAGAITT 1320

TTAACACTAA AGCAAGGAGA GATAGCTATG GGTATGGCAA CCTATGCCGT TGTGGATTTG 1380

GnAACAACAG GCAACCAATT AGATTTTGAC GATATCATTC A 1421


(2) INFORMATION FOR SEQ ID NO: 284:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2202 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 284:






CCAAGTTGCC TAAAATGATT AAGCAAGGTT TATACCCTAT GCnAACGATT GAACAAGAAT 60

CTGGAGCCAT CCGACTGCCA ACGATTTCTA GAGTGAnGCG TTCATTACAA TGGGGTAATG 120

ATGCTTATAC AATGATTTTA GATCGTATGA ATATTGAAAC AAATGAATAA TAAATGAACG 180

ATAAACAATG GTTATCTATC TGCACTAATA AGGTAGATAA TCATTGTTTT TTCACGAAAA 240

AATTTACAGA GTAAAAGAAC TTAAATTTCA TATTAAGTCT TTAGAACTCG ACACTTAAAA 300

ATGCTATAAT CATATGTATG TTAAAAAAGG AGTTTCGGAA AATGTATGAC ATTAAAAAAT 360

GGCGCCATAT TTTTAAATTA GACCCAGCTA AACATATTTC AGATGATGAT TTAGATGCGA 420

TTTGTATGTC TCAAACAQAT GCAATTATGA TTGGTGGAaC TGATGACGTT ACTGAAGATA 480



















EP 0 786 519 A2 



CAAACATCGA AAGTGTAATG CCTGGTTTTG ATTtTTATTT TGTACCTACA GTACTGAACA 600 

GTACAGATGT TGTATTTCAC AATGGTACAT TATTAGAAGC GCTTAAAACA TATGGACATA 660 

GTATAGATTT TGAGGAAGTA ATATTTGAAG GGTATGTCGT GTGCAATGCT GATAGCAAAG 720 

TGGCAAAACA TACCAAAGCA AATACAGATT TAACAACAGA AGATTTAGAA GCATATGCCC 780 

AAATGGTCAA TCATATGTAT CGATTACCGG TTATGTATAT AGAGTATAGT GGCATTTATG 840 

GCXSACGTATC AAAGGTTCAA GCTGTCTCAG AACATCTAAC AGAAACGCAA CTTTTTTATG 900 

GTGGCGGTAT TTCCTCAGAA CAACAAGCGA CAGAGATGGC AGCTATTGCA GATACAATTA 960 

TCGTCGGTGA TATTATTTAT AAAGATATTA AAAAAGCTTT AAAAACAGTA AAAATAAAGG 1020

AGTCTAGTAA ATGAATGCGT TATTAAATCA TATGAATACA GAGCAAAGTG AAGCTGTAAA 1080 

GACAACAGAA GGACCATTGT TAATTATGGC AGGTGCTGGT TCAGGGAAGA CACGTGTTTT 1140 

AACACATAGA ATTGCTTATT TATTAGACGA AAAAGATGTC TCACCATACA ATGTTTTGGC 1200 

TATTACTTTT ACAAATAAAG CTGCAAGAGA AATGAAAGAA CGTGTTCAAA AATTAGTAGG 1260 

TGATCAAGCA GAAGTTATTT GGATGTCAAC ATTCCACTCA ATGTGT6TTC GTATTTTACG 1320 

TCGTGATGCA GATCGAATTG GTATAGAACG CAATTTTACG ATAATTGATC CTACAGACCA 1380 

AAAATCTGTT ATTAAAGACG TCTTAAAAAA TGAAAATATT GATAGTAAAA AGTTTGAACC 1440 

TCGTATGTTT ATCGGTGCGA TCAGTAATTT GAAAAATGAA CTTAAAACAC CTGCAGATGC 1500 

TCAAAAAGAA GCCACAGATT aTCACTCgcA AwTGGTaGCA ACgGTTTaTA GTgGATATCA 1560 

ACGCCAATTG TCACX3TAATG AAGC6TTAGA TTTTGATGAC CTTATTATGA CAACGATTAA 1620 

CTTATTTGAG CGTGTACCAG AAGTTCTAGA ATATTATCAG AACAAATTCC AATATATTCA 1680

TGTAGATGAG TATCAAGATA CTAATAAAGC ACAATACACA TTAGTTAAAT TATTAGCAAG 1740 

TAAGTTTAAA AACTTATGTG TTGTAGGTGA CTCAGATCAG TCAATTTATG GTTGGCGTGG 1800 

TGCTGATATT CAAAATATCT TATCATTTGA AAAAGACTAT CCAGAAGCGA ATACAATCTT 1860 

TTTAGAGCAA AATTATCGTT CGACXSAAAAC GATTTTAAAT GCGGCTAACG AAGTGrTTAA 1920 

AAATAATTCT GAACGTAAGC CAAAAGGACT GTGGACTGCA AATACGAATG GTGAGAAAAT 1980 

TCATTACTAT GAAGCAATGA CGGAACtGAT GAAGCGGAAT TTGTAATACG AGAAATTATG 2040 

AAGCATCAAC GTAATGGTAA GAAATATCAA GATATGGCAA TTTTATATAG AACGAATGCA 2100 

CAATCACGTG TACTTGAGGA AACATTCATG AAATCTAATA TGCCATACAC AATGGTTGGT 2160

GGCCAAAAGT TCTATGACCG TAAAGnAATC CAAAGATTTA TT 2202 
(2) INFORMATION FOR SEQ ID NO: 285: 

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10 



(A) LENGTH: 785 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 285: 

AGTGGTGCAA AGATAGGCAT TGATaATACC GCTAAGCCAG CAAGATGATG GCACGATAAA 60 

ACCTAAACAG AAGAAmATAA ATAGTAATAC GATGATAAAT AATGGTCCAC TCATATGTTG 120 

AACTAAAGAT GATGAAAAGT GTAAGATTGT ATCTGAAATC ATACCTTCAT TCAACACTAA 180 

ATTAATACCT CGAGCTAAAC CAATAATTAA AGATACACCT ACTAAACTTG ATGCACCATT 240 

GACAAATGCA TCTACAGTTC CTTTTTCTCC CAATCCAGAT TTACCTGTCC CAGCAATAAA 300 

CATTATTATA ATTGTAAATA TTAAAAATGC TGAAGCCATA ACTGGGAACC ACCAACCTTG 360 

CGTCATAACT CCCCATACCA TAATTGGAAA TGGTAGTACA AATAATGTAA GGATTATCTT 420 

CTTACGCAAA GTAAAATGGG CACTATCGTC ATCTTTTAAT ACAGACCATT GCTGTTCAAA 480 

AGCATCTTTG TCTTCATAAG AATATGACGC TTTAGGATCG TTTTTAATTT TTTTACAGTA 540 

CCAATATAAA TAACTAATAA CAAAAATCGC ACCGACAATA CAAGCACCTA TTCTCCAATA 600 

CAAGCCATCC GTAAAAGTTG TACCAGCGGC ATTAGAGGCA ATTACAACCG AGAACGGGTT 660 

AATAGTTGAA AATGTACTAC CGACAGAGCT GGcAAGGAAT AT6GCACCAA CTGGAAACGA 720

TAGAATCGTA TCCTAACGCT AATAAATATA GGGACTAAAA TCGGATAAAA TGCTACAGCT 780 

TCTTC 785 
(2) INFORMATION FOR SEQ ID NO: 286: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 812 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 286: 

CTAACGnGAT AAGGTTGCAA nTTTATCTGA ACATCTGATG ACTGTAATTT TGTtAATGAT 60 

AAAATATTTG TCACTAATAG ATATAAATAC TGACTTTCTT GAAAACTATG TACAAGTAAT 120 

TGTTCCTTTT CTATGATAGA CATATCTTTA CTATGTGATA CTAAAATATC TAAATkTCCC 180

ATAATTGTTG TTAACGGT6T ACGTATGTCA TGCGAAATTQ ATCTTAAAAA ATTTGAATGT 240 

GTCAGTTGAC GTTCAGCCTG TAACATGGAT TCTCTCGTTT GTTTAAGTAA CGTCACATTT 300 

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ATCACTTGAG AACTTTGGTA ATCAATGGCT AGAATGCCTT TAATCGGAGA TGTGCCAATT 420 

GGTATCAACC ATTTATTAAT GCCTGGAAAT GTATCTGTTG TTGCACCAGC TTGTCTTTCA 480 

TTTTTAATTA CCCAGCTTAA TGCTTGTTCA TGCTGTTGAG TCGTATTATC GATATGGTTT 540

TGCAATGGTA TTGTTTTAAT TACTTTCGAT TGATTGATAA CGTATATAGT AATTGATTGT 600 

TGCAATAATT GATTAATTTG GTATCCAGCA TTTATTAGTA AGTTTTCAAC TGTATAAGTT 660 

TGTTTAATCG AATCATTAAA TTGAAATAAT AAATCTGTAC GATAAAGTTG CTTTTTAGTA 720 

ATGGaGTAwT GGAATTTAAT TTGTnTTAAT AAAGCACTCG TTAAAATACT TGTnAAAATG 780 

CTAACGATAA ATGTAATAGG ATAGTCAAAG CG 812 

(2) INFORMATION FOR SEQ ID NO: 287: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1732 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 287:



ATnnATTATT ATTACTGCTA TTTTAAATTT TAAAAAATGC TTTTGATTAT ATTCAACAnT 60

TTGTATAAAA TTAAATTTGC TTTTGATTAA AGCATGAAAA TTGTAATCAA ACCATAAATT 120

GTCGTATGAT GTAGTTAGAA TTTTAAAATG CAGGAGGTCa AGTATATGAC TGAAATaACA 180

TTCAAAGGTG GACCAATCCA CTTAAAAGGT CAACAAATTA ATGAAGGTGA TTTTGCACCT 240

GATTTTACAG TGTTAGATAA TGACTTAAAT CAAGTAACAT TAGCAGATTA TGCTGGTAAA 300

AAGAAATTAA TTAgTGTGGT ACCATCAATT GATACAGGTG TTTGTGATCA GCAGACTCGC 360

AAAT7CAACT CTGATGCTTC TAAAGAAGAG GGGATTGTGC TTACAATTTC AGCAGACTTA 420

CCATTCGCAC AAAAAAGATG GTGCGCTTCA GCAGGTTTAG ACAATGTCAT TACATTAAGT 480

GACCACCGTG ACTTATCATT TGGTGAAAAC TATGGCGTTG TTATGGAAGA ACTTCGCTTA 540

TTAGCTCGTG CAGTATTTGT ATTAGATGCA GATAATAAAG TTGTTTATAA AGAAATCGTT 600

AGTGAAGGTA CTGATTTCCC AGATTTTGAT GCTGCTTTAG CTGCATACAA AAATATTTAA 660

TCATTAAAGA GATAAATCTT AAAATGTATA CATCGTGTCC ATCGTTGTCA ACAGCATTAA 720

AATAGAATTG TTTTCTATGA TTGCTAAGAC CTATGGGCAC TTTTTATTGG AGAGGGACGA 780

ATATGGCAGA ACAACAAACA ATTATGGAAC GCTTGTTTCA TACATTAGAT GAAAAAGCTA 840

AAACATTAAA TAATGAAAAT GGCCaAAGTT TTATTGAAAA TCTTGGGCTA GCAATGGAAC 900



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CATTCCAATT TGCATATTTA AGTTTAATGC aGGAAGAAAA GATAcAAGCA AATCATCAAA 
TTACACCAGA TTCAATTGGA TTGATACTAG GATTTTTAGT TGAGCGTTTT ATGAACAACC 
AAGAAGAATT ACATATTGTT GATATTGCAA GTGGTGCCGG TCATTTAAGT GCTACTGTAA 
AAGAAGTGTT ACCTGraAtT GcGGTTATGc ATcATTTaAT TGaAGTTGAt CCAGTTTTAT 
CACGTGTTAG TGTACATTTA GCAAACTTCT TAGAAATTCC TTTCGATGTG TATCCTCAAG 
ATGCCATCAT GCCACTACCA TTAGAAGAAG CAGATATCGT TATTGGTGAT TTTCCAGTAG 
GCTATTATCC AATTGATGAA AGAAGTAAGG AGTTTAAGCT AGGTTTTGAA GAAGGACATA 
GTTATTCACA TTATTTATTA ATAGAACAAG CAATAAATGC ATTAAAAGAT GCTGGATATG 
CCTTTCTAGT GGTACCAAGT AATATTTTTA CAGGTGAACA TGTAAAACAG CTTGAAAAAT 
ATATTGCAAC AGAGACAGAG ATGCAAGCAT TTTTAAATTT ACCACCT^CT TTATTTAAAA 
ATGAAAAAGC GCGAAAATCT ATATTAATTT TACAAAAGAA AAAATCGGGT GaAACAAAGC 
CAGTTGAAGT ATTATTGGCA AATATTCCTg ATTTCcAAAA TTCCTTCACC AATTTCCAAG 
GATTTATGGA CAGAGTTAAA ATCCAGTGGG ATGGGACCAC CAAATCGTCC TA 
(2) INFORMATION FOR SEQ ID NO: 288: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2779 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 288:
AAAAGAACTA GCTAAACGCA AGCAAGAAGC TATTAGTAGA ATTAAAGACT TTTCAAATGA 
AAAAATAAAT AGTATTCGAA ATAGTGAAAT TGGCACAGCT GATGAAAAAC AAGCAGCAAT 
GAATCAAATT AACGAAATTG TOCTTGAAAC AATTAGAGAT ATTAATAATG CGCATACATT 
ACAGCAAGTT GAGGCTGCAT TGAACAATGG TATTGCTCGA ATTTCAGCAG TACAAATTGT 
AACATCTGAT CGTGCTAAAC AATCGTCAAG TACTGGAAAT GAATCTAATA GCCATTTAAC 
AATTGGTTAT GGAACTGCAA ATCATCCATT TAACAGTTCG ACTATTGGAC ATAAAAAGAA 
ACTTGATGAA GATGATGACA TTGATCCACT TCATATGCGT CACTTTAGTA ATAATTTCGG 
TAATGTTATT AAAAACGCTA TTGGTGTGGT GGGTATCTCT GGTTTACTAO CTAGTTTCTG 
GTTCTTCATT GCCAAACGTC GTCGTAAAGA AGATGAAGAG GAAGAATTAG AAATAAGAGA 
TAATAATAAA GATTCAATAA AAGAGACTTT AGACGATACA AAACATTTAC CACTTTTATT 






AAATAATGGC GAGTCACTCG ATAAAGTTAA ACATACGCCG TTCTTCTTAC CAAAACGTCG 720

TCGTAAAGAA GATGAAGAAG ATGTGGAA6T TACAAATGAA AACACAGATG AAAAAGTGTT 780

GAAAGATAAC GAACATTCAC CACTCTTATT CGCAAAACGA CGCAAAGATA AAGAGGAAGA 840

TGTTGAAACA ACAACTAGTA TTGAATCTAA AGATGAGGAC GTTCCTTTAT TATTGGCTAA 900

AAAGAAAAAT CAAAAAGATA ACCAATCCAA AGACAAAAAG TCAGCATCAA AAAATACTTC 960

TAAAAAGGTA GCAGCTAAAA AGAAGAAAAA GAAAGCTAAG AAAAATAAAA AATAATTTGT 1020

TTCTTTGATA AATAGaGGAG CACCGATTGA CATCACATCA GTCGGTGCTC CTTTTATTTA 1080

TTCTTTTTAA TTAATTTATA CAATGCCTGT TGAGCGTGTT GATTCGCTTC TTTGTTTTGT 1140

TCTCTCGGTA TCCATTTAAC AAATAATAAA TCAAAATCTT TTTCAAATAT TTCTATTTGA 1200

TCAAAATAAG GTTTGAAATT TGCGTTTTTC ACATAACCAG CTTCAATGCT ATCTGCAATT 1260

AGCTTTGAGT CTGTATATAA TAGTGCGTTT TGAACATTTA ATTCACGTGC ATGTTCTAGT 1320

GCATAAATAC ATGCAGCCCA TTCTGCAGTG TGGTTATCCA TTTCGCCTAA CTCATGTGTA 1380

TATGTATAAT GCTGCTCATC TTCTTTGATT ACAATGGCAC ATGTACTTAT GCCTGGATTT 1440

CCTTTCGTCG CAGCATCAAA ATTTATTTTC GCCATAATAA ACCTACTTTC TATTCAATAC 1500

TTAGTTAAAG TTACTATTAC TGTAATACAA AATATGTTGG GTAATCCATT AAAAAACACG 1560

CATCACTTAA ATAAGTAACA CGTGTTTAAA ATACTCGCTG ATTCAAAGAT GATTTTCTAA 1620

TACGTAtACT GTaATATACT TCCTAAAAAA ATCATCTTCA GGCTGGGACA TAAATCAATG 1680

TTCTATGCTC TACGATGTTA TATTGGCAGT AGTTGACTGA ACGAAAATGC GCTTGTAACA 1740

AGCTTTTTTC AATTCTAGTC AGGGGCCCCA ACACAGAGAA TTTCGAAAAG AAATTCTACA 1800

GGCAATGCGA GTTGGGGTGT GGGTCCCAAC ACAGAAGATG ACGAAAAGTC AGCTTACAAT 1860

AATCTQCaAG TTTGGGATGG GCCCCAACAA AGAGAAATTG GATTCCCAAT TTCTACAGAC 1920

AATGCAAGTT GGGGTGGGAC GACGAAATAA ATTTTGCGAA AATATTATTT CTGTCCCACT 1980

CCCTTAAAAC TTATTCTTTT GTGTAGTAAG TGCGITAATA GCCTTGATCT AACTTATCAA 2040

TCTTACCTTT ACGATAAAAT GATTTAGCAA TATATCCAAA TGGTACATTG AAAACTGTTG 2100

AAGCTAATTT TAATACGTAC GTTGTAATAA ATATTTCAAa TACAAmTGTa CCAGGTAAAC 2160

TTCCX3ATAAA TGCGATAGCT ACAAATAAAG

TACCATATGg CACGGATGAA AAACGTTTTa

AAGATAAATA CATCAATATG TTGACCAATA

CGTGGCACAA CATCAAAGAT TGCGTGTAAT

CTGTATCAAT TATTGrGCTT AAAAATGTAC 2220 

TCtnGaACTAA ATACTTTTTT AATTAGTGAA 2280 

ATATATGCGA CGATTGAGCC TAAAGCAATG 2340 

GCTTTTIGTG CCATATCTTC TGGTGCAGGA 2400 
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10 



IS 



20 



30 



35 



45 



SO 
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CAAACTGCTC TTTTTGCAAC TCTACGCCCA TAAATATCGT TTAATATATC TGTTGCTAAA 2S20 

TAAATAGAAG CAAACATGAC ATTACCTAAA GTTGCTGAAA TACCAAAGAT TTCTACAGTT 2580 

TTAATCACTT GTATGTTGGC AATGATTGTG CCAATTGCAA CCCATGCAAT TAAACCTTGT 2640 

TTACCAAAAA AGCGATACAT AAGTACCATA AGCACGAACG TTGCAATAAA CGTAACTAGT 2700 

CCTAAAATTT CATTATACAT ATTAAAATGT CCTCCTAAAT TTTGATCATG CGGGTGTTTA 2760 

GAAACCGCTC AATAAATAA 2779 
(2) INFORMATION FOR SEQ ID NO: 289: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1999 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 289: 

ACTGATGTGC GTTCATCAAA AACAATATAA TCAAATTCAT TTTCATCAAA TTGCTTAAAA 60 

TTATCATCTC TAGATAATGT TTGAATGGTT GCAAATAAAT ATTTGGCATC GACATCTCTA 120 

TGTTTTCCTG TCAACAATCC AAAATCACTA TCATTTTTTA TAGGTAATAC TTTTTTAAAT 180 

TCTTCCTTAG CTCTATTTAA AATCCCCTCA TTATGAACAA TAAATAAAAA TTTATTAGGG 240 

TTTACTTCTC TAACATCTAA TGCACATAAA ATCGTTTTAC CTGTACCAGT TGCAGATATT 300 

ATTAACGCCT TATCTTTGGC TTTATCCCTA ATAGCTTTTA ATGACCTTAA TGCTTCTGCT 360 

TGCATTAAAT TGGGTACAAT TTCCACTGAT TTTTTCACCT TATCAGCTAG CAGCATTTGA 420 

GTTTGTTCAA CCTCOGCTAA TTTTTCTAAG GAACGGTACT CAAATGATTC TTTATATGAA 480 

TTAMCCATT GCTCAGTCAG TGGGGTACTC TTTTGCCATA ACAAGTCAAA TTCACTTTTT 540 

ACACTATCAA CTAAATCGCC ATTTTTCATA GTAGACAGTA AAACATTATG CTCATAATTA 600

ACCTTTAACG CATTAGATGT TAAATTAGAG CTTCCTATTA CCATAGAACT ATAATCCTTA 660 

TGCTCAAAAA TATATCCTTT GGCATGGAAT CCAGCAATAT CAGTTAATCT TACCTCTACA 720 

TTTTTTAATT TAAGTAATTC TCCATACATT TTAGGACTAT TAAACCCTAA GTAATTAGAT 780 

GTTAATATTT TCCCTTTAAC ACCCTTATTG CTTAAATCTA ATAGTTGAGC CTTTAAGCTG 840 

GcTAAACCGC TTTCTGTTAT AAAAGCCACA GAAAAATAAA ACGTTTCACA TTTTTGAAGT 900 

TCATCTATAA TTGTTGAAAG AACTTTTTCA TTTTTATTAT TTACTAAAAG CTTCGGTGTA 960 

TAATTCCCTT TATGAGAAAT ATGTTTGTCT ATAAACCCTT TATGTAAAGA TTGATTGAAA 1020 

CGCTGGGGCC CAATTTAATT TATCAAGTTC GTTTATTGAC AACCATTCAA TACTCTTATG 1140

TTCAGTTAGA GTTGGTAACT CTTTGTTTAA AGTACATTTG TATGTTGTTA ACCTAACAAT 1200

TCCAAAATCA TATTCATGTT CTGTAGTTAT AACTTTGTCT CCAACAATTA AATCACATTT 1260

CATTTCTTCT CTAATTTCTC TAATCAAAGC GTCTTTTTCA GTTTCATTCT TTTCAACCTT 1320

ACnGCCAGGA AATTCCCACA TTAAAGGCAG ACTCATTTTT TCACTTCTCT GTGCACAAAG 1380

AATTTTGTTA TCAGAAAAAA TAATAGCTCC TACTACATTG ATTACTTTTT TCATAAGACT 1440

CACCCTTCAA TTTAAAATCA TCTTAATTGT TATTCTATCA AAAATTACAA AACTATATAT 1500

AAATCAATAT TAAAAATTAA TATTTTACAT TCACATGAAC GCTCTACTCC ATGCATTTTC 1560

ATACACATCT ATTATATAAT ACTTGTGAAA AGTATTGTCT TGGGGCTGTG TTTTTTTACT 1620

TTTGGGGCGT ATTTCTTTAT AATTCATTAC ATAAATGTAA GQGCTTTAGT TTTCATGTTT 1680

TATTAAGTCT AACTGAGATT TTGAAAGGAT GTTTAGCAAC AATGGATAAA GAATTATGGA 1740

TAGAACGAGC TAATGATAGT TTAGTTAAAC ATTTTTATGa GCAGCAATCT GACATTGAAC 1800

AGCGAGAAGG TTTTGAAAGT AAATTAACAT TTGGTACTGC GGGTATACGC GGAAAATTCG 1860

GTCTTGGTGA AGGTCGACTT AATAAGTTTA CTATTGAAAA ATTGGCATTA GGTTTAGCGC 1920

GTTATTTAAA TGCCCAAACA AACAGTCCAA CAATAGTCAT TCATTATGAT ATTAGACATC 1980

TTTCcAACTG AATTCGCCC 1999

(2) INFORMATION FOR SEQ ID NO: 290:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1933 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 290:

GATGACTTTT CCCCCTCATA ATCTTCATGG TCCAGGCGTC CATTAATGCG TCAAAGGATG 60

GCACATTTTA CCIGGAACAA ATGATTCATA TGGTTCATAA AAATCACGCG TCGTAATATA 120

ATCTTCTAAA TCAAATGCAT AGAAAATCAT TGGCTTTTTA AATACTGCAT ATTCATATAT 180

TAAAGATGAA TAGTCACTAA TTAATAAATC TGTTATGAAC AGTATATCAT TAACTTCTCT 240

AAAGTCAGAA ACGTCAACAA AATATTGTTT ATGTTTGTCT GCAATATTAA GTCTATTTTT 300

CACAAATGGA TGCATTTTAA ATAATACAAC OGOGTTATTT TTTTCGCAAT ATCTTGCTAA 360

ACGTTCAAAA TCAATTTTGA AAAATGGGTA ATQTGCTGTA CCATGACCAC TACCTCTAAA 420

TTGTTTGATC TGTGTCGCAT AAGCTTCATC AAATAGTACA TCAGTACX5TG GAACACCTGT 540

AGGCACTACA TTTTTCTCTT TAATACCAAA TGCTTCAGCG TAGAATGGAA TATCGGTTTC 600 

AGATGATACA TAAGCTTTTG TATAGCTACG ATGATTTAAT GAATCAATAA ATGGTCCACC 660 

CTTTTTACCA GTACGACTAA AGCCAACTGT TTTAAAGGCA CCAACGGCAT GCCATACTTG 720 

AATAACTTCT TGAGAACGTC TAAAACGCAC TGTATAAATC AATGGGTGAA AGTCATCAAC 780 

AAAGATGTAG TCTGCCTTCC CAAGTAAATA TGGCAATCTA AACTTGTCGA TGATGCCACG 840 

TCTATCTGTA ATATTCGCTT TAAAAACAGT GTGAATATCA TACTTTTTAT CTAAATTTTG 900 

ACX5TAACATT TCGTTATAGA TGTATTCAAA GTTTCCAGAC ATCGTTGGTC TAGAGTCTGA 960 

TGTGAACAAC ACCGTATTCC CTTTTTTCAA GTGGAAAAAT TTCGTCGTAT TAAATATCGC 1020 

TTTAAAAATA AATTOTCTTG TATTAAATGA TTGTTTGCGG AAATACTTAC GTAATTCTTT 1080 

ATATTTACGA ACGATATAAA TACTTTTAAC TTCCGGAGTC GTTACAACAA CATCAAGGAC 1140

AAATTCATTA ACATCGCTAG AAATTTCAGG TGTAACAGTA TAAACCGTTT TCTTCX5AAAT 1200 

GCCGCCTTTT CTAAATTCTT TTAGGTAAGT CTGCAATAAG AAATTGATTT TACCATTTTG 1260 

TGTTTCTAAT TCGTTGTATT CTTCTTCTTG TTCTGGCTTT AGATTTTGAT ATGCATCATT 1320

AATCACATCT GGGTTTAACT GTGCAATATA ATCAAGTTCT TGCTCATTCA CTAATAAGTA 1380 

CTTATCTTCA GGTAAGTAAT AACCATTATC TAAGATAGCT ACATTGAAAC GACAAACGAA 1440 

TTGATTCCCA TCTATTTTGA CATCATTCGC CTTCATTGTA CGTGTCTCAG TTAAATTTCT 1500 

TAATACAAAA TTACTATCTT CTAAATCTAG GTTTTCACTA TGTCCTTCAA CGAATAACTG 1560 

AACACGTTCC CAATAGATTT TATCTATATA TATCTTACTT TTAACCAACG TTAATTCATC 1620 

CTTTTCTATT TACATAATCC ATTTTAATAC TGTTTTACCC CAAGATGTAG ACAGGTCTGC 1680 

TTCAAAAGCT TCTGTAAGAT CATTAATTGT TGcAATTTCA AATTCTTGAC CTTTTAACAA 1740 

CGCTAATTTA nCTACAATAT CTGGGTATTG AATGTATAAG TCTACAACAT CTTGGAAATC 1800

TTTTGAACCA CTTCGACTAC TACCAATCAA CGTTAGTCCT TTTTCCAATA CTAGACGTGT 1860

ATTAACTTCT ACTGGGAACT CACTTACACC TAACAGTnCA ATGCTTCCTT CTGGTGAAAT 1920 

GTAATCGATC ATT 1933

(2) INFORMATION FOR SEQ ID NO: 291: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2049 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 291:

nGTnCGGnCA GATATATTGG TGGTCTTTAG TAAGTGTATC AAATTCATCA GATGTCAAGG 60

GCATGTTATC ACCTCCTTAG GTTGATAACA aCATTATACa CGaAAGGAGC ATAAaCAAaT 120

GAACACAaGA TCAGAAGGAT TGCGTATAGG CGTCCCACAA GTTTCTAGCA AAGCTGATGC 180

TTCTTCATCC TATTTAACGG AAAAGGAACG TAACTTAGGA GCGGAAATAT TAGAACTTAT 240

TAAAAAAAGT GATTACAGCT ACTTAGAAAT AAACAAAGTT TTCTATGCAT TAGATAGAGA 300

ACTTCAATAC AGGGCGAATA ATAACAAACT TTAaCATTTA TCTAAAGGAG TGATAGAGAT 360

GCCAAAAATC ATAATACCAC CAACACCAGA AAACACATAT CGAGGCGAAG AAAAATTTGT 420

GAAAAAGTTA TACGCAACAC CTACACAAAT CCATCAATTG TTTGGAGTAT GTAGAAGTAC 480

AGTATACAAC TGGTTGAAAT ATTACCGTGA AGATAATTTA GGTGTAGAAA ATTTATACAT 540

TGATTATTCA GCAACGGGAA CATTGATTAA TATTTCTAAA TTAGAAGAGT ATTTGATCAG 600

AAAGCATAAA AAATGGTATT AGGAGGATTA TCAAATGAGC GACACATATA AAAGCTACCT 660

ATTAGCAGTG TTGTGCTTCA CGGTCTTAGC GATTGTACTC ATGCCATTGC TGTACTTCAC 720

TACAGCATGG TCAATTGCAG GATTCGCAAG TATAGTGACA TTCATATTTT ATAAGGAATA 780

CTTTTATGAA GAATAAAAAA ACTGCTACTT GTTGGAGCAA GTAACAGTGC AAGATGAGCA 840

ATTGTCTTAA ATAATTATAT AAGGAGTTAT TAATATGACC TTACAACAAA AAATACTATC 900

ACATTTTGCA ACATATGACA ATTTCAATTC TGATGATGTT GTTGAAACTT TTGGGATATC 960

TAAAACACAT GCAAAATCCA CACTTTCAAA ACTTAAGAAA AAAGGAAAGA TTGCAATGGA 1020

AAGTTGGGGT GTCTGGCGTG TTATTGAATC GCAATTGCAT TTAAGTGTAG TCGAACGTAA 1080

AAAAGAAATT TTAGAAGAAC AATTTGAATT GTTAGCAAGA TTAAATGAAC AAAGTGATGA 1140

CCCT^GAGAA ATAQAAGAAC GTATCAAGTT AATGATTCGT CTAGCTAACC AATTITAAGG 1200

AGGAGTTAAT CAATGGCAGT ATTAGAAGGT ATTTTTGAAG AATTAAAACT ATTAAATAAG 1260

AACTTACGTQ TGTTAAACAC TGAACTATCA ACTGTAGATT CATCAATTGT ACAAGAGAAA 1320

GTTAAAGAAG CACCAATGCC AAAAGAAGAA ACAGCTCAAC TGGAATCAAT TGAAGAAGTT 1380

AAGGAAACTT CTGCTGATTT GACTAAAGAT TATGTTTTAT CAGTAGGAAA AGAGTTCCTT 1440

AAAAAAGCAG ACACTTCTGA TAAGAAAGAA TTTAGAAATA AACTTAACGA ACTTGGTGCG 1500

GATAAGCTAT CTACTATCAA AGAAGAGCAT TATGAAAAAA TTGTTGATTT TATGAATGCG 1560

AGAATAAATG CATGAAGCTA GATCACTCAA ATAGAGCTCA TGCAAAGCTT AGTGCAAGTG 1620

GAGCAAAACA ATGGCTAAAC TGTCCACCGA GTATTAAGGC AAGTGAAGGT ATTGCAGATA 1680



GTCTTAAATA TGAAGGCCTA ACACAGTTTG AGTTTAATAA AGCTTTTCAA AATTATAAGC 1800 

GAAATCAATA TTACAGTGAA GAGTTGCGCG AATATGTTGa AGAGTACGTA GCTAATGTAG 1860 

AAGAAAAGTA TAACGAGGCT TTGaGTAGAG ATGACGATGT AATAGCTTTA TTTGAAACAA 1920 

AATTGGATyT AGGTAAATAC GTCCCTGAAT CTTTTGGTAy TGGTGATGTC AtTATATTTT 1980 

CAGGTGGTGT ACTTGAAATT ATTGACCTTA AATACGGTAA AGGCATTGAA GTTTCAGCTA 2040 

TAGATAATC 2049 
(2) INFORMATION FOR SEQ ID NO: 292: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 942 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 292:

ATGATGTTTC TATATTCGTA TTAGGAAAAC CTGTTGGTAT TACAACAAAC GCCCTAAAAT 60 

TACAATACCG CTGCTCCTAT ACCAATTGCA ACAACAGTTT TAACTGAAAT ATCTTGTTTT 120 

TTCATCTTCA TTACTCCTtt ACATAAAAAA TTCATTATAT TGATQGTGCT TTAGATAAAT 180 

GAATCGTCCA ATCATTTCCA GTACCAATAT GATATAAATC TGAAAATGAG TCTTGATTGA 240 

CTGCTACACC AATATTTACT AGCGAGTTAA CATACACAAG AGGTTCACCC ACATTAACAT 300 

CTGCAAACGA TCGCGCAAAT TTAATAATAT TTTGATAGAC TTTCTTATCT TGATGATAAA 360 

TTGTTACCAC CAAATTATTA CCATGAACAA TTTCCAAGGA TTTTAAGAAT GCCAATGQAA 420 

TATTTGTCCA TAATGACCCA AATCTGATAT CTAAAATATC AATGCTTCCT GTAACAGAAT 480 

CCTCATTTTT TGTCACTTCT CTTATTTCTA ATGCCTCAAT ACTATCAACA TTAATTGCCT 540 

GACCGAGACG TTCAAACGCT ATCTTATTTG CAGCTAATCT CGCACCATTG TATGCATAAA 600

CATCTCTACC ATGAAAAGTA TGACTTTCTT CCGAATGAGG CAATCGGCTT TTCACTTCAT 660 

CAATTTCGAT AACTTTTTTA ATACCTTCGT AATGTTTAAT ATGACTTAAA GAACCATTAT 720 

CAGGTGTAAT AATGTAATGA CCTGAATATG TTAAGCAAGC AATGTCCGCC TATCACTACC 780 

TACACCCGGG TCTACCACTG ATACAAAraC TGTGCCTTTA GGCCAOTATT TTACAGTTTO 840 

ATATAAACGA TATGACGCTA CCCAAATGTC ATACGGTGGT ATATCATGCG TTAAGTTTTC 900 

AACACGTATA TCATCATTAC AGTATATGCA ACTCCATACA TT 942 
(2) INFORMATION FOR SEQ ID NO: 293: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1268 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 293:

TGTAATAAAA TTTTATGnAA CATGCTGnGA TGCTACCATG AACCTTCTTC ATTTCTTTAT 60

GTGAGATTGT GaAATTAATC AAATAATAAT AOGGTGGATA CTTTCCTAAT TTACGATATT 120 

CCATTTCCTG ACGATAAAAT GTTAAATAAT CATTTTTTTG AACATCCAAT ATTGAATAAT 180 

GATCTGGATT ATACGTTTGA ATGATGACTT GACCTGCCTT TTCATGACGA CCAGCTCTAC 240 

CAGCCACTTG CGTTAATAGT TGATAAGTAC GTTCGCTCGC CCGAAAATCA GGTAAATTTA 300 

ACATTGTATC TGCATTCAGC ACACCAACTA AAGTAATATT TGGATAATCT AATCCTTTCG 360 

CAATCATCTG AGTACCTAGT AAAATGTCAC CGTTACCTTT TTOGAATTCA GTCAATAACT 420 

TTTCATGTGC ACCTTTCTTT GAGGTTGTAT CTACATCCAT CCTAATTATG CGCGCATCTT 480 

CAAATTCTTG TTGCAATAGT TCTTCAACTT TCTGAGTACC AGTACCTAGT TGTCGAATGT 540 

GTTCACTCTC ACAATTTGGA CATTGATTCG GTGGCGTCTC TTGGTAACCA CAATAGTGAC 600 

ATTTTAATAA GTCTGTCGTT TTATGATACG TTAATGAAAT ATCACAGTTT GGACATTGCG 660 

GTACATATCC ACAATCCCGA CATAACATAA ACGATGCATA ACCACGTCGA TTTAAAAATA 720

AAACAACTTG TTCCTGTCGA TCTAATCTTA ATTGTATGGC TTCACGTAAA TCTTTTGAAA 780 

ACATTGACCG ATTACCTTCA CTCAATTCTT CAOGCATGTC TACTATATCA ATTTCAGGTA 840 

AAGCTTGTTG GTTCACTCTG TTTGGTAATG ATAGCAAATG ATAAACGCCT TTTTCAGCTC 900

GTGCATAACT TTCAAGACAT GGTGTTGCAC TTCCTAAAAT GACTGGACAG TGATGATATT 960 

CAC'H'CGCCA TTGGGCAATT TCTCTAGCGT GATATCTCGO ATAATCTTCT TGTTTATATG 1020 

TAGATTCATG TTCTTCATCA ATGATGATTA ACCCTAAATT TTTGAAAGGT GCGAACACAC 1080 

TTGACCTTGC ACCAACACTT ACTCTCGCAC GACCATCCCT AATTTTTTGC CATTCATCAT 1140 

AACGTTCCCC ATTAGATAAG CCAGAATGTA ATACAGCAAC GTCATCACCA AATCGACGTT 1200 

TGAAGCGTAA AACCATTTGC GGTGTTAGAG CGATTTCAGG AACTAACATC ATCGCCTGTT 1260 

TTCCTTGO 1268 
(2) INFORMATION FOR SEQ ID NO: 294: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 629 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 294:

TACCACCAAA TAATATATTA GCTGGCATTT TAATAACATT TAziATTTGTC ATGATATCAT 60

CAATAAAATG TTGAAACTTC GTAATTTCAC CTTCATAATC ATCAATTGCT GCTAATTGCG 120

CACTCGATGC TTGCTGATCT AAATTTAAAA TATTCGACAT GCGTTGACTA TAATAAACTA 180

AATGTTCTAT TAAGCCATCG TCACTCTTTT CCTTTGGTGC TGACATGACA GCGATACGTT 240

TCAAAGGATA GTGTTGCGCC AATTTTAATG TCATTAATCC ACCTAAAGAC ACACCCGTTG 300

CACTGATAGA TTCATAACCT TCATTGACTA AAAATTGGTA AGCTTTCTCA ACTTCTTCCC 360

ACCAATCATC TACATTATAT GTCATGAAAT CTTTCAACAA TAAACCATGA CCTGGATAAT 420

TCGGTGCATA ACAACTAAAT CCTTGGTCAT TTAACTCAGC TGCAAGATGC TTCACATCCC 480

GATTTGTACC TGTAAATGAA TGTAATAATA ATATCGCATG TCCATTTGTG CCTTTTAAAT 540

ACGtGGACTC GGTGTTTTAA TTCTCATTTT TCtaTATAcC TCCACTAtGT CTAAAGakGT 600

TkGCTAAACG CGTTGtCGTC GATGATTAA 629



(2) INFORMATION FOR SEQ ID NO: 295:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2817 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 295: 



TATGAAAGTA ATGAATGGTA ATATTATTAA ATTTGATGGA AAAGTAGATA TTGATAATGC 60

AGA'^ATATC GGTTTTTTAA TTGAGCATCC TAAATTATAT GATAATAAAT CAGGATTGTA 120

TAACTTGAAA TTATTTGCAC AAGTATTAGG TAAGGGTTTT GATAAAGCAT ACACAGACAA 180

AATTATAGAT GCATTTGGTA TGAGACCTTA TATTAAAAAG AAAGTTAAGA AATATTCAAT 240

GGGGATGAAG CAAAAGTTAG CAATTGCAGT ATCTTTArTG AATAAACCTA AATTTTTAAT 300

CTTGGATGAG CCTACAAATG GkATGGATCC AGATGGCTCa ATTGATGTGC TGACTACAAT 360

TAAGTCTTTA GkAAATGaAC TTGATATGAG AATTCtAATA TCAAGTCATA AGTTAGAAGA 420

TATTGAATTA ATTTGTGATA GAGCTGTATT TTTAAGAGAC GGnCATTTTG TTCAAGATGT 480

AAACATGGAG GAAGGTGTTG CATCTGACAC AACGATAGTT ACTGTTGATC ATAAAGACTT 540

TGATAGAACT GAAAAATATC TTGCAGAGCA TTTCCAATTA CAAAATGTCG ACAAAGCA6A 600

ATTAGATATT TATCCGAAAT ATATTGAAAC ACGTAAAAGT TCATTGCGTG ATACGTACTT 720

CAATATAAAT CAAAGAGGT6 ATAAATAATG AGAATTTTAA ATTTAGTTAA GTATGATTTT 780

TATAGTATAT TTAAaArTCC TTTAACATAT TTAGCGaTAC TAGTCGTATC TAGTTTGATT 840

GCAACTCAAA GTATACTTAT GGCAAATTCG ATGGATAACC CGAAACATAT TATTGTCTAT 900

GGATCTGTAT TTGCTGCAGC AAAATGGTTA TTGTTAATAA TTGGATTAAT GTTTGTTGTT 960

AAGACAATTA CGCGTGATTT TTCACAAGGT ACAaTTCAAC TATATATGAG TAAAGTTAAA 1020

ACACGCXSTTG GAtACATTAT TTCGAAAACA ATTTCAATTa TTTTAATTTC AATATTATTT 1080

GCATTAATTC ATTATGTGAT TTTGATTGTT GTGCAGGCAT CTAGTAATGG AAAAAATTTG 1140

QCGTTTTCTA AATATGTAGA TAATTTATGG TTCTTCCTAA TCTTTTTACT ATTCTTTGGC 1200

TTGTTTTTAT TCTTAATCAC ACTT6CATCA CAAAAAACAG CAATGATATT TTCATTAGGT 1260

GTATTTTTAG TACTCATTGT ACC6TTTATT AAACCTTTTA TTACATTTAT CCCAAGATAC 1320

GGTGAAAAAG TTTTAGATGC TTTTGATTAT ATCCCTTTTG CTTACTTAAC TGATAAAATG 1380

ATTAGCTCTA ACTTTGATTT TAGCAATTGG CAATGGGTAA TTTCATTAGG TTCTATAGTG 1440

ATATTCTTCA TTTTGAATAT CTTATATGTC GCTAAAAAAG ACATyTAATA AAAATAATTT 1500

TGAGGTTGGG AATTTTAAAT TTTCCCAACC TCAAAGTTTG TCTTATTGTA AATTTATTTA 1560

TTTTCTAATT TATTTAGGAT GGAATTATAA ACTGCTTTCC AAAATGAAGC GTCAGTTTTA 1620

TAGCGGTTTG ATATAACTAA GTGTGTTTCT TTTTCTAAAT CTGCATAGTC TGGATGATCT 1680

TTGCTCGGTA ATTTATCAGC ACGAACATCA GTTACAAATT TTTGGACTTC ATTTGCTCTT 1740

GGTCCCCAAA CTGTTTCTTG TTCGAATTGA TCATTCAAGA ATACGAAGAT AGGAATTGCA 1800

CGTGATTTAC CATTTGTTAA ATATTGATCG ATCAGTTTTG TATCATCATC TCTATGGAAC 1860

ACGCGTACTT CTAAATTTAA TGCTTCACTG ATGTGTTTTA GAATTGGGAG ATTCATCATT 1920

GCATCTCCAC ACCAGTCTTC AGTAATTACT AATACTTTAG AATAATTCAT CTCTTTTATT 1980

TTTTTGATGC GTGAATCATC TTCTGGTAAC TCAAATGATT GATAGATACT GAGAACGGTA 2040

TCTTGATTTG TCTTCATTCC ATCAATGTAT TCATTTAAGG GTTGGCTATT TTTGAAATAA 2100

GTTTCTAAAT TTGTCATTGT AAAAACCTCC TTTAGCATTT ACAACATTAT ACCAATTTAT 2160

AGTAATAAAA GGTAATGAAA TAAATTAATT GGAAATTCTT TGTTAATTTT TGTTAAGGAT 2220

GAAACGGGAA GCACCTTATG CTATATTTAA ATAAGTACAA AGAAAGGGTG ACATCAGTGC 2280

GTATTCAAAA TCX5CTGGGTT GTQTTTATAT TATTTTTAAT CTGTTCTTTT GGTGTATTAA 2340

TTGGTTTATA TCAATATCGT CATACGAAAA CTGTQGATTT GTCTAATCTT GAAATAAACG 2400

TTGATCGTTT TAAATTTTAT AACAGTAAAG CTCACCCTGA TCITACCGTT AAAGTGAGAG 2520

AAAAGGATAA CATCXSTTAAG GGGATAATAT TAGTAAGAGA TGAAAAGATA CATACTAATT 2580 

TTGAtGGGGG AATTGGTTCG CCGATAAATA ACGCgATTGA AAATCTTGGa TTCgGATATA 2640

AAAGrACaAA AGTTGGcAAT GrtTkCtCAT CgGTAAAGTA TATTGATAGA GATAACCATT 2700 

TAAAATTAAA CTTACTTTAT CAAGATTTAG AAATTAAACG TATTGAATTT TTTAGTAAAT 2760 

AGCTTTAGGT CTTAAAGTTw TAAAAAACGA ATGAaTAATT TTATTGGGAT GAGTGAC 2817 
(2) INFORMATION FOR SEQ ID NO: 296: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1607 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 296:

TCTGTTAAAA TGATTTTTCT TTTAnAAAGG CCGnAAATCA ATGTTOGATT nTTATTTGCA 60

TTATGGTCTC GATATTGGTA GAATATCAAA T6GTTAAATG AGAAAAACTT GGAGGTOCTC 120

ACATGTCATC AATCGTAGTA GTTGGGACAC AATGGGGAGA CGAAGGAAAA GGAAAAATAA 180

CGGATTTCTT GGCAGAACAG TCAGATGTTA TCGCGCGTTT TTCAGGTGGT AATAAT6CAG 240

GCCATACCAT TCAATTTGGC GGAGAAACAT ATAAAtTACA TTTAGTACCA TCTGGTATCT 300

TTTACAAAGA CAAATTAGCG GTAATCGGTA ACGGAGTC6T TGTTGATCCA GTTGCACTAT 360

TGAAAGAATT AGACGGATTA AATGAACGTG GCATTCCTAC AAGTAATTTA CGTATATCTA 420

ATCGTGCGCA AGTGATTTTA CCATATCACT TAGCACAAGA TGAATATGAA GAACGTTTAC 480

gTGGtGACAA TAAGATTGGT ACAACTAAAA AAGGTATCGG TCCAGCATAT GTAGACAAAG 540

TTCAACGTAT CGGTATTCGT ATGGCAGATT TACTTGAAAA AGAAACATTC GAAAGATTAT 600

TAAAATCAAA CATTGAATAT AAACAAGCAT ATTTCAAAOO TATGTTTAAC GAAACATGTC 660

CATCATTTGA TGATATCTTT GAAGAATATT ATGCAGCAGG TCAACGTCTA AAAGAATTTG 720

TAACAGACAC ATCAAAAATC TTAGACGATG CATTTGTAGC AGATGAAAAG GTACTTTTCG 780

AAGGTGCGCA AGGTGTAATG TTAGATATCG ACCATGGTAC ATATCCATTC GTTACATCAA 840

GTAATCCAAT TGCAGGTAAC GTTACTGTTG GTACAGGTGT AGGTCCTACA TTCGTTTCAA 900

AGGTAATTGG TGTATGTAAA GCTTATACAT CACGTGTTGG TGATGGTCCA TTCCCTACTG 960

AATTATTCGA TGAAGATGGA CATCATATTA GAGAG6TTGG TCGTGAATAC GGTACAACAA 1020

TAAGTGGTAT TACAGATTTA TCTATTAACT CAATCGATGT TTTAACAGGC CTAGACACAG 1140

TGAAAATCT6 TACAGCTTAT GAATTAGACG GTAAAGAAAT TACTGAGTAC CCAGCAAACT 1200 

TAGATCAATT AAAACGTTGT AAACCAATCT TTGAAGAGTT ACCAGGTTGG ACAGAAGACG 1260

TAACAAATGT GCGTACTTTA GAAGAATTAC CTGAAAATGC ACGTAAATAT TTAGAGCGTA 1320 

TTTCAGAATT ATGTAATGTA CAAATTTCTA TCTTCTCAGT TGGTCCAGAT AGAGAACAAA 1380 

CAAACCTATT AAAAGAATTG TGGTAGAACT TTATATAAGT CATACACAAT GATTATAAAT 1440 

ACATGAGCCT TCTATCTTTA TTGGTAGGAG GCTTTTGTTA TGCTTGCTTC TGTATCGATT 1500 

CGATTATTTA GATAAAAAAT ACTAACGTAA AGGCGATATT TGCTAGTCAT AATTTAGAAG 1560 

rTTAgATGAt AtTtAACXSAA AAtTAAgATG anATACtTGA ATGGTAA 1607

(2) INFORMATION FOR SEQ ID NO: 297:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3055 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 297:

TTAGAAGTAA GCACTTTAAT ATCTTTACCT ACCCATGTAC CAACACCTTC TTTAGGCTTT 60

GGATTTTCAG CATGGTTATT TGATTTATTC ACCTGTTCAG ATCTATTTAC TTTATTACCT 120

TTATTCGGGT TCTCTAATAC ATCAAATTTT AATCTCGGTG AATAAAAAAG ATATATTAAA 180

AATCCAACTA AAAATAATAC ACCGACGACT CTTATAATTA ACTTTTTCAT CAATCAACCA 240

CCTAAAAAGT ATTAATACTA TTGTAAAAAA CAACACATTA ATTAGCAAAT TTTCAACACT 300

GACA^AACTG TGTCGTTTCG ATAAAACAAA ATCATCTTCA GGCATAATTT TAACGTCATA 360

AAGATTTTCC CTACACTCTA TATCATAACC TATCTTTATG TTTTCAGGTT GAATTTCATT 420

ATTAAGATTA AAATATGTAT AAAAAAATGG ACAGTTAAGG TATCAAATGA ATACCATCAA 480

CTGTCCGACT ATTCTTCATC AAAAAACCTG ATAAAACAAA TTGCCTTATC AGATTAGTAT 540

CATTTGTATA AGCATATTAA TGACCAAATG TTGCTTTAAT CAGTGATGTC GTTTCTCCAC 600

CTGGATATAA TACATATAAT AATAAGTAAA CTGCTACACC TGTAATTGCA GTACAGAACC 660

ATATAACTGA AGCGAATGGC CCGAATTTGC GGTGTACATT ATATTTATCT TTAAATGCAG 720

TAATAATTTG AACTAGGCCT AGAATACCAC CAATTGTTGC TAAATTAATG TGGAAAAATA 780

AGAAAATCGT ATAATATTTC TTAATTGATG CTGGTCCGCC AAATGCTGTA TTACCGATAA 840

TAACATTTTT GTGTTTATTT ATTTCCCTTT TCCAAATAAG TCTCCAACCA ATGGCAATTA 960

AAATTGCACT AATGACAATA CATGTCGTAC TAATOGTTGG TAAAATTGGA ACGCCCATAT 1020 

TTTTCATCCT AACTTAATTA ATCTAGATCA AAGTAAGTAA TGAAACAATC ACAGCTAACA 1080

CGAAAAAGAT CACTAAATAA TTTAGTGAAT ATATAAACAT TTGTGTTGCC CATTTTGTTT 1140 

GATCTGAATT TTTCTTAAAT GTTGTTAAAC CTAATGCAAT CCATCCTAAA TTTAATAAGG 1200 

TrGCTAACAC TaCGAATACG ACACCTAAAT TTATTAGTAA TAAAGGTACT GGCAATAAAA 1260 

TAATCAACCA GATAAACATA CTGACACGTG TACGTTTAAA GCCCTTAACT GATGGTAACA 1320 

TTGGAATATT TGCAAGTGCA TATTCATCTT TACGTTTAAT AGCTAAGGcA TAAAAATGAA 1380 

TTGGtTQCCA AQnAAATACA ACTAAAAACA GCGCAATCGC TGTTAAACTA ATTTGTCCTT 1440 

CAATTGCAAC CCATCCAATT AGTGGTGGTA CTGCTCCAGG AAAACTCCCA ATCACTGT6T 1500 

TCCATGTTGT ATGTCTTTTA GACCATATTG AGTAATAAGA CACATAACCT ACAATCCCCA 1560

TAAGACCAAG TACGCCTGAT GGTATATTCA ATAAAAACAA ACAAATTTCT CCAACTAACA 1620 

TCATACCAAA ACTTAATAGT AATAAATTTT GATCTGTAAT TCTATTATTT ACAGTTGGTC 1680 

TATTTTGTTT ACTAGGCATA ATACGATCAA TATCTTGGTC GTAATAATTA TTTAACGCAC 1740

ATGCGCCACC CATAATTAAA GTAGATCCAA ATAGCATTAA TAAAATTTGA GGTATTGATG 1800 

ATAAGAAGGA ATGATTTGTC ATTACAACTG CTAGCCATGC GCcCGCAAAA GCCGGAATTA 1860 

AGTTACCTTG AACAAGTCCC ATTTTAATTA TCTGTTGCAA TTCTTTGAAG TTAACTCTGC 1920 

TAATATTTTG TGACmAAGTA TGCTCTTTGC TCATAATCCC CCTCCTTAAA TTTGTTCATA 1980 

TAAGATTATG ATATCTTAGA TTGCATAAAA AGACTAGGTT TAATAAAATT AAATTGTGAC 2040 

AAATTAACXSA CAAGAGAAAA TGTGAATTrX GTGACACAAA TAACATTTAA TTTATTGCTA 2100 

TAATCTATAT GTTAGAAAAT TTTAATAAGT AGAATCATGC ATCTAAAAGA GATTAATATT 2160 

TAAGCrrCAA ATTTGAGTAA ACGTGGATTA CATAATTATC CCAATAAAAA AATCATTACG 2220 

ATTAAGTTCT TTTTATGTCG TCCACATACA ATACTTGTAA AATTAAATCA TATTTCCTGC 2280 

GTTGGATCCC ATCTTTTCAT ATCCTACAAT CAGGTCTATT TATAGTATCA TCTCAAATCC 2340 

GGCTATTAAT TCTAATTCTC AGTGATGCGT TTTTTTATGA TGGGGTGTAT AAATTGTTTG 2400

GCAAAAAGAA TTTAAAATGG TTAGGTGTCX3 TAGCAACGTT AATGATGACA TTTGTACAAC 2460 

TTGGTGGAGC CTTAGTTACC AAAACCGGAT CAGCTGATGG TTGTGGTTCT TCTTGGCCAC 2520 

TATGTCATGG TGCX3TTGATT CCAGAATTCT TTCCTATTGA TACGATTATT GAGTTAAGTC 2580 

ATAGAGCCGT TTCAGCTTTG TCTTTATTAA TGGTCTTATG GTTAGTTATC ACTGCATGGA 2640 

TATTGCAAGC ATTAATCGGA GCTGCTGCTG TTATTTGGCA ACAAAACGAT TACGTTTTAG 2760

CATTGCACTT TGGTATATCA TTAATCAGTT TCTCATCTGT ATTTTTAATA ACATTGATTA 2820 

TTTTCTCTAT AGATCAAAAA TATGAAGCTG ACGAATTATA TATCAAAAAG CCATTAAGAC 2880 

GTTTAACATG GTTAATGGCA ATCATCATTT ATTGTGGTGT TTATACTGGT GCACTAGTGa 2940 

GACATGCGGA TGCAAGTTTA GCATATGGTG GTTGGCCATT GCCATTACCA CGATCTTGTA 3000 

CCACATTCAG GAACAAGATT GGGTTCAACT CACGCATCGT ATCAnGGTCn nTTAA 3055
(2) INFORMATION FOR SEQ ID NO: 298: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 748 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 298:

TTCTTATTTA AAGAAGTCAT TTTTAGAAAT TGTTGAGACT TTAAAAAATG ATCCGTATAA 60

AATAACACAA TCTTTTGAAA AATTAGAGCC TAAATATTTA QAGCGATATT CAAGAAGAAT 120

TAACCATCAG CACAGGGTCG TCTATACCGT AGATGATCGA AATAAAGAAG TATTAATACT 180

ATCGGCATGG TCACATTATG ATTAAT6AAT ATTCAATATC TGAATAACTT TAATGATAAG 240

TTAATTAAAG AAACTAGTAT TTAAGTGTAG G6AAAATAGC GACGTTAATG CGTTGTTATC 300

TCTACACTTT TTAATTTTAT AATAGCGCAA GACTAAACAG ATTGAAATTA GTAACAATAA 360

AAGAATAACG TATTATAATA AGGAATTTTA AATTGTGACT TTTTCGGAAT ATTAAATTTT 420

AGAAATATGA GGTTTTTAAG CGGATPCCTC ACAAAATTTT AAAAATATTT AAGCCTGAAA 480

ATGATAAAGC GGTAGGGAAC GTTTTTCTGA AaGTTAGTGA TACAATAGTT TTAAGTTGAA 540

ATACAGGAGG ATGAATAACA TGAATCAGTC AGTCAAATTA CTTAAACATT TAACAGATGT 600

AAACGGCATT GCTGGTTATG AAATGCAAGT TAAAGAAGCA ATGCGTaACT ATATAGAGCC 660

TGTCAGTGAT CaAATTATTG AAGATAACTT GGGTGGCATT TTTGGAAAGA AAAATGCTGA 720

GAATGGTCAA TACTCAATTA TGGAnTTC 748



(2) INFORMATION FOR SEQ ID NO: 299: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4718 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 299:

ACCTCCGAAT AATTGATTCC ATTAACTTTT TTTGTGaAAA AtTTAAACAG GCGAAGTCTT 60

CAATAAGTGA AGATGTCCAA ATTATAAAAA ATACATTCCA AAAAGAAAAG TTAGGTACAG 120 

TAATTACtAC ycTGGCGCAA GTGGTGGTGT TACGTATAAA CCAATGATGA GTAAAGAAGA 180 

GGCGACTGAA GTTGTTAATG AGGTCATTAC TCTATTAGAA GAGAAAGAAC GTTTGTTAcC 240

TGGCGGATAT TTATTTTTAT CAGATTTGGT AGGTAATCCA TCGCTACTAA ACAAAGTTGG 300 

TAAGTTAATT GCCAGTATTT ACATGGAAGA AAAATTAGAT GCTGTTGTTA CCATTGCGAC 360 

AAAAGGTATT TCATTGGCAA ATGCGGTTGC TAATATTTTA AATTTACCAG TAGTAGTGAT 420 

TAGAAAAGAC AACAAGGTGA CTGAAGGTTC TACAGTTTCA ATTAATTACG TTTCAG6ATC 480 

TTCAAGAAAA ATAGAAACAA TGGTACTTTC GAAGAGAACT TTAGCAGAAA ATTCAAATGT 540 

TTTAGTTGTC GATGATTTTA TGAGGGCTGG TGGCTCTATT AATGGTGTTA TGAATTTAAT 600

GAATGAGTTT AAAGCCCATG TAAAAGGGGT ATCAGTACTT GTAGAATCAA AAGAAGTTAA 660 

ACAAAGATTG ATTGAAGATT ATACTTCCTT AGTGAAATTA TCTGATGTAG ATGAATATAA 720 

TCaAGAGTTT AA06TAGAAC CTGGCAACAG TTTATCTAAG TTTTCATAAA AGGAGTTTTA 780 

GTATTATGAA AATCATTAAC ACAACAAGAT TACCGGAAGC ACTTGGACCA TATTCGCATG 840 

CAACAGTTGT GAATGGTATG GTTTATACTT CTGGTCAGAT TCCATTGAAT ATTGATGGAC 900 

ATATCGTAAG CGCTGATGTT CAAGCACAGA CAAAACAAGT TTTAGAAAAT TTAAAGGTTG 960 

TTTTGGAAGA AGCAGGATCT GATTTGAATT CTGTTGCOAA AGCGACCATT TTCATTAAAG 1020 

ATATGAATGA TTTCCAAAAA ATAAATGAAG TGTATGGTCA ATATTTTAAT GAACACAAGC 1080

CAGCGCGTAG TTGTGTAGAG OTTGCGCOTT TGCCAAAAGA TGTGAAAGTA GAAATTGAAT 1140 

TAGWAGTAA AATTAAGGAA TTATAATTTT CGATTAATAT GTTTAATCAA GCTTCTAAAT 1200 

AAAACAGAGA GATATATACT ATAGGGGGGC TCACTACATG AAAGTGACAG ATGTAAGACT 1260 

TAGAAAAATA CAAACAGATG GACGAATGAA AGCACTCGTT TCCATTACAT TAGATGAAGC 1320 

TTTCGTAATT CATGATTTAC GTGTAATTGA AGGAAACTCT GGCTTGTTCG TTGCAATGCC 1380 

AAGTAAACGT ACACCAGATG GTGAATTCCG CGACATCGcg CATCCTATTA ATTCAGATAT 1440

GAGACAAGAA ATTCAAGATG CAGTGATGAA AGTATATGAT GAAACAGATG AAGTAGTACC 1500 

AGATAAAAAC GCTACATCAG AAGATTCAGA AGAAGCTTAA TCAATTTTAT ATTTAGCGAT 1560 

GTAATACATT TGCAATAAGT TGATTTGATA CTGTCGATAA AGCATAAAGC TTTGTCGGCA 1620

GTTTTTTTAG TTTGTATTAA TGTTTTTTTA TTTTTAATGA AAGGCTAATA AATATATACG 1680 

TGaTGCTCGT ATTTTTGAAG TAAGAAAAAA GTTGTTTTTA AAATTACAAC GAATTAAAAA 1800 

CAATGCCTTT TATATGTTGA AAGAGTATTG CaGATTAAAT TaTAATAATG ACGAaGgTAA 1860 

AATTTAATGG GGGTTAATGT TCATGCGAAG ACACGCGATA ATTTTGGCAG CAGGTAAAGG 1920 

CACAAGAATG AAATCTAAAA AGTATAAAGT GCTACACGAG GTTGCTGGGA AACCTATGGT 1980 

CGAACATGTA TTGGAAAGTG TGAAAGGCTC TGGTGTCGAT CAAGTTGTAA CCATCGTAGG 2040 

ACATGGTGCT GAAAGTGTAA AAGGACATTT AGGCGAGCGT TCTTTATACA GTTTTCAAGA 2100 

GGAACAACTC GGTACTGCGC ATGCatGCAA ATGGCGAAAT CACACTTAGA AGACAAGGAA 2160 

GGTACGACAA TCGTTGTATG TGGTGACACA CCGCTCATCA CAAAGGAAAC ATTAGTAACA 2220 

TTGATTGCGC ATCACGAGGA TGCTAATGCT CAAGCAACTG TATTATCTGC ATCGATTCAA 2280 

CAACCATATG GATACGGAAG AATCGTTCGA AATGCGTCAG GTCGTTTAGA ACGCATAGTT 2340 

GAAGAGAAAG ATGCAACGCA AGCTGAAAAG GATATTAATG AAATTAGTrC AGGTATTTTT 2400 

GCX3TTTAATA ATAAAACGTT GTTTGAAAAA TTAACACAAG TGAAAAATGA TAATGCGCAA 2460 

GGTGAATATT ACCTCCCTGa TGTATTGTOG TTAATTTTAA ATGATGGCGG CATCGTAGAA 2520 

GTCTATCGTA CCAATGATGT TGAAGAAATC ATX;GGTGTAA ATGATCGTGT AATGCTTAGT 2580

CAGGCTGAGA AGGCGATGCA ACGTCGTACG AATCATTATC ACATGCTAAA TGGTGTGACA 2640 

ATCATCGATC CTGACAGCAC TTATATTGGT CCAGACGTTA CAATTGGTAG TGATACAGTC 2700 

ATTGAACCAG GCGTACGAAT TAATGGTCGT ACAGAAATTG GCGAAGATGT TGTTATTGGT 2760 

CAGTACTCTG AAATTAACAA TAGTACGATT GAAAATGGTG CATGTATTCA ACAGTCTGTT 2820 

GTTAATGATG CTAGCXJTAGG AGCX5AATACT AAGGTCGQAC CGTTTGCGCA ATTGAGACCA 2880 

GGCGCGCAAT TAGGTGCAGA TGTTAAGGTT GGAAATTTTG TAGAAATTAA AAAAGCAQAT 2940 

CTTAAAGATG GTGCCAAGGT TTCACATTTA AGTTATATTG GCGATGCTGT AATTGGCGAA 3000 

CGTACTAATA TTGGTTGCGG AACGATTACA GTTAACTATG ATGGTGAAAA TAAATTTAAA 3060 

AcTATCGTCG GCAAAGATTC ATTTGTAGGT TGCAATGTTA ATTTAGTAGC ACCTGTAACA 3120 

ATTGGTGATG ATGTATTGGT GGCAGCTGGT TCCACAATCA CAGATGACGT ACCAAATGAC 3180 

AGTTTAGCTG TGGCAAGAGC AAGACAAACA ACAAAAGAAG GATATAGGAA ATAATCATTT 3240

ACGTATTTAA AATGGCTAGG ATAAAAGGAT AATCCTATGT AATATTAATG TAATCTTTAT 3300 

GATTTAATGA TTCGCATAGT AATGGAGTTA CATyTTATAT ATAATAGTAA TTGCGTAAGT 3360 

AAATAATTGG AGGACTATAA ATGTTAAATA ATGAATATAA GAATTCGTCA TTAAAGATTT 3420 

TTTCATTGAA AGGAAACGAA GCATTAGGGC AAGAAGTTGC TGACCAAGTA GGAATTGAAC 3480 
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GTATTCGTGG TTGTGACGTA TTThTTATVC AACXJ^ACATC ATATCCTGTG AATCTACATT 3600 

TAATOGAATT ATTAATTATG ATTGATGCTT GTAAACGTGC TTCTGCAGCA ACAATCAATA 3660 

TTGTAGTGCC ATATTATGGA TATGCAAGAC AAGATAGAAA AGCCCGTAGC CGTGAGCCAA 3720 

TCACTGCTAA ATTAGTTGCA AACTTAATCG AAAcAGCtGG CGCAACTCGT ATGATTGCGT 3780 

TAGACTTACA TGCACCACAA ATTCAAGGAT TCTTTGATAT TCCAATTGAC CACTTAATGG 3840 

GTGTGCCAAT TCTTGCTAAA CATTTCAAAG ATGATCCGAA TATTAACCCA GAAGAATGTG 3900 

TCGTTGTTcA CCAGACCATG GCGGsnTTAC ACGTGCACGT AAATTAGCTG ACATTTTAAA 3960 

AACTCCAATT GCAATTATAG ATAAACX3TCG TCCTAGACCA AATGTTGCTG AAGTGATGAA 4020 

CATTGTTGGT GAGATTGAAG GACGTACGGC AATTATTATT GACGATATTA TTGATACAGC 4080 

AGGTACAATC ACTTTAGCTG CACAAGCATT AAAAGATAAA GGTGCTAAAG AAGTATATGC 4140 

TTGTTGTACA CACCCTGTTT TATCAGGACC GGCTAAAGAA CXSTATCGAAA ATTCTGCTAT 4200 

AAAAGAATTA ATCGTAACAA ACTCAATTCA TTTAGATGAA GATCGCAAAC CATCTAACAC 4260 

TAAAGAATTA TCTGTTGCT6 GTTTAATOGC ACAAGCTATC ATTCGTGTAT ACGAAAGAGA 4320 

ATCAGTTAGC GTATTATTTG ACTAATATTT AAAAGGCGTT TGACGAACAT ATTCCAAACG 4380 

TGTATAATAG TTTCGTTCGT GATTATACGA ATAAATAAAC ACTTGCAAGC AACGATGATG 4440 

X\vV7X X X X.\3tJnaji Afimry^AAAT nAnAATYTfif^r 

TCATTAAAGT CAATCATCCG TCAAGGTAAA CAAACACGTT CAGATCTTAA ACAATTAAGA 4560 

AAATCTGGTA AAGTACCAGC AGTAGTATAC GGTTACX3GTA CTAAAAACGT GTCAGTTAAA 4620 

GTTGATGAAG TAGAATTCAT CAAAGTTATC CGTGAAGTAG GTCGTAACX3G TGTTATCGAA 4680 

TTAGGCGTTG GTTCTAAAAC TATCAAAGTT ATGGTTGC 4718 



(2) INFORMATION FOR SEQ ID NO: 300: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3181 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 300: 
AAAATGCATA TAAATACATA TTAAGGAGGA TTTTATGAAT TTTCTTAAAC CTGCAAAGCA 60 
TATTAAGCCT TTGCCAGAAA ATCAGATAGA TGATACCTAT AAACGATTAC GTCTCCAAGT 120 
ATTTCTTGGT ATTTTCATCG GTTACGCTGG GTACTATTTA TTACGTAAAA ACTTTTCGTT 180 

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TGCTGTTTCC ATCGCATATG GATTTAGTAA GTTCTTTATG GGTACTGTAA gTGATCGGAG 300 

CAATGCTCGG ATATTCTTAG TTCTTGGATT AGCACTCACT GCTATCGTCA ATTTGTTAAT 360 

GGGATTTGTA CCGTTCTTTA CATCAGGTAT CGGTATTATG TTTGTCCTAT TATTCTTAAA 420 

TGGATGGTTT CAAGGTATGG GCTGGCCACC TTCAGGCCGT GTTCTCGTTC ACTGGTTTAG 480 

TGTAAGTGAA CGCGGAAGTA AGACTGCCCT TTGGAACGTT GCGCATAATG TTGGTGGAGG 540 

TATTATGGCA CCTATTGCTQ CTTGGGGTAT TACAACAACT GCATTTATCA ACTTTGGTTA 600 

TTTAAAAGGT TTCGAAGGTG TATTCATTTA CCCTGCACTC TTAGCACTTA TCATTGCCGC 660 

AATTTCATAC GTATTGATTA GAGACACACC TCAATCTCAA GGTTTACCTC CAATCGAAAT 720 

TTATAAAAAT GACTTTGCTA CAAGCGATAA GAAAACATTA GAAACAGAAT TAACTACAAA 780 

AGAAATTTTA TTTAAATATG TACTGAACAA TAAATGGGTA TGGGCAATTG CCTTTGCAAA 840 

TATATTTGTT TATTTCGTGC GTTATGGTGT ACTTGATTGG GCGCCAGTCT ACTTAAGTGA 900 

AGAAAAACAT TTCGACTTAA AAGCATCAGG TTGGGCATAC TTCTTATACG AATGGGCTGG 960 

AATTCCTGGT ACATTATTAT GTGGTTACAT TTCTGATAAA TTATTCAAAG GTCGTCGTGG 1020 

ACCTGCAGGT TTCTTCTTTA TGTTAGGTGT CACAGTATTT GTATTAATTT ATTGGTTAAA 1080 

TCCTCCAGGC AATGCTTGGT TAGACAATGT CTCATTAATT GCCATTGGTT TCTTAATATA 1140 

TQGACCAGTT ATGTTAATTG GTTTACAAGC ATTAGATTAT GTACCTAAAA AAGCAGCTGG 1200 

CACAgcAGCT GGATTAACAG GATTATTTGG TTATCTGTTT GGTGCTGTAA TGGCCAACAT 1260 

CGTCTTAGGT GCTGTAGTTG ATAAATTCX3G ATGGGATGTC GGTTTTATTT TATTAACAGC 1320 

AATTA6TGTG TTTGCAATGT TGAGCTTTAT CCTCACTTGG AATAAAGTAG GACAAGAAAC 1380 

CGTTCATCAT TAAATGATAA AAAATAAAGT CATATGGTTA TCTTATCGAA AGATQATATA 1440 

TTOrrCTCTT ATAAGTTCAA CCATATGACT TTTTATTAGT ATTCAAAAAA ATATTTACAT 1500 

TGCCACTTTT GTGTTTGCCC TGCTGTTTTA TTCAATTGAT TACACCACTT AGGATAAACT 1560 

CTAAAAGCCA TTTTCCCTTG ATACTTAGAT GAAGCTAAGA TACCTTTTTC TACCAATAAT 1620 

TCCCTAGGAA ATAGAAAGTA GCCATTTAAT TCATCATCGA TAACAGCAAC AATCAGGTAA 1680 

TCAGCAAACT CTTCAATTTT GTATGGCCGA TTATAATTAT CtTCGTCTTT TGTCCAACAT 1740 

GTCACGAAAT ATCCCGATTT AGTCGGTGTT TTCTTAGCTA ACCTACTTTG ATATGTTTCC 1800 

TCTTTAAAGC TAAATGTTAA TGCyTCGTAA TCTTGATTAT ATTTTTCTTC AGTTAAATCT 1860 

TTAACTTCTG ATTCTTCACT AAAAATATTT TTCAACAGTA TTTTAGATTT ACACATATGC 1920 

CAATCTCACT TTATTTTTTC ATAATCGTAT CATATATTTA TTTTTTTCGA AAAATACACT 1980 
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10 



TGTATCAACT GTCAAACGAT CACOTAATAA ATAGACGATT AACATTGcTG CTAAAGCACA 2100 

AAGTGACTCA GCAATTAATA ATGACCAAAT GACACCTGTT AGTCCAAACA AAGCATTCAT 2160 

AATAAATAAT ACTGGAATGA TAATTGCACC TTGTAAAATG GCXIATAATTG TAGCACCACG 2220 

ACCTTGCCCA GTCGCTTGAA GCATACCAGT AAACAAGAAA CCTATACCAT TTAATAATAA 2280 

TGATGCCATT GTTACTTTCA AAATAAATGt CGCCATCTCA ACAATGGCTT GATCAGTAGT 2340 

AAATAGTCCG ACCATATGAT GTCCAATTGT AAATACAGCA CTCATACATA CAACAAAGAT 2400 

AACGCCGATA GACATGATAA CTGCTTTGAT AACGTCTTTC ATACGGCCTT TATTTGCCAT 2460 

AAAGITATAT GCAATTAGTG GTACAACACC TTCACATAAT CCCATGATAA TAAGTTCTGG 2520 

aAATTGCACA AGTCTAAATG AGATACCATA ACTTGCaATC GCGAAGTTTC CATAATGTGC 2580 

TAAAAATAAA TTTAAAACTA ATCCTGTGAA TCCCATTAAG ATACTCATTA AAAATGCAGG 2640 

AATACCGATT TTAAAC3ATTT CAGAAAGCAT TTCTTTATTA GGTTTCGCAA GTTTAATATT 2700 

AACTGACACA ACGTCACTAT TTTTCATAAA ATAAATGATA AAGAACAGAG CAGCAGCAAC 2760 

ATTACTGATT GCAGTACCCA AAGCTGCACC AACAACGTTT aAATCAAAAC CAAAAATTAA 2820 

AATTGGATCT AAAATAATAT TTAAGCCTAC ACTAGCTAAC ATACCAATCA TAGAAACCAT 2880 

TGGTGCCCCA ATTGCACGTG CAAATTGTTC TAATATGAAG AACAAAATTA CAAAAGGTGC 2940 

ACTTAAAAAC ATTACTTTCA AATAATTACT TGTTAAAGCT AACX5TTTCAC CTCTCGCCCC 3000 

TAAAATTGCT GCGATTTGAT CACTGAATGG TAAAGTAACT AAAATCACGA TAAGTCCTAG 3060 

TGCAATACCA CCATAAATAG AGAAACTACT TACAAATTTA CTCTTAmTAT AGTCTTTCX3C 3120 

ACCTAATAAA CGTGAAATAT AAGTTCCTGC ACCAACGCCA AATAAATTAC CTAACCCCAT 3180 

T 3181 

(2) INFORMATION FOR SEQ ID NO: 301: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4029 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 301: 

TCATCACCTT CAAATAAATA GGCATGCGAT AATTTATTTG AATGATATGC ATTCGTCAAT 60 

TGTTGCTGTT CATCCATTCA TTACAAACTC CCTTTTGCTT TTATATAAAA AGGTTGCCAA 120 

AGAGCAACTA ATTACTGCAA CAATAGCGAC CATGATTTGT GATTATTTTT AGAACCACAA 180 



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TTATTAATAC ACTATTTTAA GTATCTCTAA TTGATGATTG TATTATAGAA TTAAAATTGA 300 

TGGAATGCAT CAACTGGCAT AACAAATACA GTAGCACCGC CAACTTCAAC TTCAACTGGA 360 

TATGGAATGT ACGAATCCGC ACTGCCTCCC ATAGGTGTAA TAGGTGAAAC CAACTGTTCT 420 

CTATTACCAC ACGTTTGATT AATCACAGAC AATATTTCAT CTACACGGTC ATCATTGACA 480 

CCACATAAGA ATGTTGTATT ACCCGCTCTT AAAAACCCAC CTGTTGTTGC CAATTTTGTT 540 

GCTCTAAAGT TATTTTTAAC AAGTTGATCT GCAAGTTCCT GACTATCTTG ATCTTGTACG 600 

ATCX3CTATAA TCATTTTCAT TTTATAACAC CTCTTCTAAC AATTATATCA TATCTTTTCT 660 

AAATATTTGA TGATAGTTTG ATACGTGTCT TCAACAACAT TTTCAAGAGG TTGATCTGCA 720 

TTAACGCTTT TGAACCGTTG TGATTCATTA TGAATGATTT CTTGGTAACC TTCAATTACT 780 

TTTTCGTGAA CTTTAAATCT TCTTGATCTA ATCTATTTTG ATCTCTTGAA TTTTTAATAA 840 

TACGTTCGCG ACCTACTTCA GCACTAACAT TTAAATAAAT CGTCAAGTCT GGaTATAATC 900 

CATTTATTGC AAATTCGTTT AATGCTCTTA CTTCTTCAAC GCCAATCCCT CTAGCATAAC 960 

CTTGATAAGC TAATGAACTA TCGATATAGC GATCACACAA CACAACCTTA CCTTCTTTTA 1020 

AAGCTGGTAT GACCTTTAAT ACAAGATGTT CTCTTCTAGA TGCAGCAAAT AACATTGcTT 1080 

CaGTtCTAAT GTCCATATCA TTGCCcTTCT AATACAATTT TACGTATTTC TTCACCAGTA 1140 

GGAACACCAC CTGGTTCTCT AGTCATAATG ACATCATAAT CTTTTACTAA TCTATGGTAA 1200 

ACTTCATTAA TTACAGTTGT TTTTCCAGAG CCTTCTGGGC CCTCAAAAGT TATAAAAGCT 1260 

GACATTTTAT TCATCCTCAA CTAAAATTTT ATTATTTTTA ATTCCTTCAA CTATCATTCC 1320 

AGTTTCCAGA TATTCATTTA CCAATTCTAT CATATTTTCA GTTATTGTTT CTCCTTTGAA 1380 

AATAATAGGA ATcCCTGGcG GATACGGGAC AATATGTCGC GCCAGAACTT TACCTTTgCC 1440 

TTrtcAAATC ACACCAAGTA ACATATTCAA AGCGTTTTGG TTTATAGTTA CCTTCAGTyG 1500 

TTAAAAGTT6 TGTTTGTTTA ACTTTAGAAA CTGATTTTTT CGGTAAAATC ATATCTTCAA 1560 

TTTTACX5TAA AAGCGAATCA AATAAATACG TATCATCATG ATGCCATAAC GGCAATATCG 1620 

CTAATGCTTG ATAGTCGTCC GCTAATTCTA AATAQATATG TGCATTCATA AACCAATTTT 1680 

GAATATCATG ACCTGTAAAA CCTTCATATT TTATCAGCAA CTTCAACGGA TCATCAACTT 1740 

GAAGCATTTC AAAACCCTTC TTCTCCAAAC ATTCGATTAA TTGCGCTCTC TTATCAAAAA 1800 

ACACGGTACT ATCATATGTT TTATAGAACT CGGCAGCTGA CTCTAAACTA GCCATAATCA 1860 

AATACGAAGG ACTAGATGTT TGGAAGTAGC TTAGATATTC TATAATAGTT TCTCTATAAG 1920 

GTGCATTTTT ATGAATATAA AGTACCGAGC CCATCGTTAA AGCTGGTAAC GTTTTATGAA 1980 
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CAAAGTGCGC GCCGTGTGCT TCGTCAATGA GTACAGGAAT ATTTAATTGG TGCAAAGATT 2100 

TGATAACCTC TTCTACATTA AATGTTTCAC CGTAATAGTT AGGATAAGTC AACACAGCAA 2160 

GTTTGTGACC GTCATTATTC AAACGGCTTA AATTAACTTT ATTATAATGA TTCGTTAACG 2220 

GACTTTGATG CGTTTCAATA AAATGCCCTT CTTGTTGGCT AATATCGAGC GCATGTAACA 2280 

CAGATTTATG TACATTTCTT GCCATTAAGA TATCGCCTTT TTTCTGTGAA AAAGACTGGA 2340 

TGACAGATAA TATTCCTGAA GTGGTGCCAT TCACTAAGAA ATAAGCATCA TAATCTGAAT 2400 

GTTTCTCCAC CTGCTTCATA CTTTCCAAAA TGACTTCTTC AGGATGATGT AAATCAnCTn 2460 

AATCCAGCTA TTTCAGTTTT ATCCATTGTC ATTGATAATT GAGATAAATG ACCGATAGTC 2520 

ATATTTTTAT GACCCGGAAC ATGCAAAGAA ATCGCTTCTT CTTGATTTAA ACTTTCTAAT 2580 

TTATTTAAAA TAGGTTGCTT CATGATATAC GCTTCCTTTA TTTACACTGT TTTGGAATTA 2640 

GTTACTTTCA AAAGTATTAA TTATATAGTA ACACTTCTTT GACAAAAGTT AGTGTTACTT 2700 

ATGCAATAGC TTGTCTATTG TATAATAATT AATTTCCTTT TTGTACTTCG ATTTAAAAGA 2760 

TATTAGACAT AAAATCTAAA AACAGCAGTA AGATGATTTA TGATTAAAAA CTATCTTACT 2820 

GCTGTTCACT TTTTATAATA CTTCTGAATG TCTTCACTTA TACTTCTAGT CACAGATTTA 2880 

AATAATCAAA AGTGCACATT ATTAAAATAT CAATTTCACA CTCAATGCGG CTCATCGCAT 2940 

TCATTTCTTG TCTAGCAACG TTCTACTCTA GCGGAACGTA AGTTAGCTAC CATCCTCGCT 3000 

AAGAACCTTT CTTGACTTGT GACAATCGCT TGCTTCTTTC CTCTCCTTCG GCTCTCGCTT 3060 

ACTCATTTAG CTCTACTAAA CTCGTTGCGC TCTTTTCTCG TTTCGTCAGA TTCAAACGTT 3120 

TTCACTTCGC CAAGCCATTT TTCTTTGTGT TTACTTTTTA TTTTGACGTT TTAGACATAA 3180 

AAAAAGAGAC cTCACGGTCT CAACTTGCCT GGCAACGTTC TACTCTAGCG GAACGTAAGT 3240 

TGGCTACCAT CGTCGCTAAA GACCTTTCTT GACTTGTGAC AATOGCTTGC TTCTTTCCTC 3300 

TCCTTCGGCT CTCGCTTACT CATTTAGCTC TACTAAACTC GTTGCGCTCT TTTCTCGTTT 3360 

CGTCAGATTC AAACGTTTTC ACTTCGCCAA GCCATTTTTC TTTGTGTTTA CTTTTTATTT 3420 

TGACGTTTTA GrCATAAAAA AAAGAGACcT TGCGGTCTCA ATGCGGCTCA TCGCATCCAT 3480 

TTTTTGCCTG GCAACGTTCT ACTCTAGCXK5 AACGTAAGTT GGCTACCATC GwCGCTAAgA 3540 

aCCTTTCTTG ACTTGTGACA ATCGCTTGCT TCTTTCCTCT yCTTCGGCTC TCGCTTACTC 3600 

ATTTAGCTCT ACTAAACTCG TTGCGCTCTT TTCTCGTTTC GTCAGATTCA AACGTTTTCA 3660 

CTTCGCCAAG CCATTTTTCT TTGTGTTTGC TTTTTATTTT GACGTTTTAG ACATAAAAAA 3720 

AAGAGACCTT GCGGTCTCAA TGCGGCTCAT CGCATCCATT TTTTGCCTGG CAACXSTTCTA 3780 
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CATGGGAACA GGTGTGACCT CCTTGCTATA GTCACCAGAC ATATGAATGT AATTTATACA 3900 

TTCAAAACTA GATAGTAAGT gAAAAGTGGA TTTTGCTTCG CAAAACATTT ATTTTGGATT 3960 

AAGTCTTCGA TCGATTAGTG ATTCGTGCAG CTCCACATGT GCACCATGCT TGCCACCTCG 4020 

GAACCTATT 4029 



(2) INFORMATION FOR SEQ ID NO: 302: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7159 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 302: 



GATGCAATAG TTGAGTAGTT ATAAGTAGCT ACATACAACC GCTCAAATAT AGGTTCAAGA 60 

ACATTTTTAA TGAAGAAAGC QATTTTCCCm CAGA6TGCAA ACGCTTGACT CAATAACGAA 120 

ATGTCACCTA TGTATGGCGT GACTTATTAT ACAGGaGGTG CAAAGTATGT TTGCTATTAT 180 

TGAAACAGGT GGAAAACAAA TCAAAGTAGA AGAAGGTCAA GAAATCTTCG TTGAAAAATT 240 

AGACGTAAAC GAAGGAGATA ^ A A ^ n>^m^% A A TGATAAAGTA TTATTTGTAG GTGGAGATTC 300 

AGTTAAAGTT GGAGCGCCAA CAGTTGAAGG TGCAACAGTT ACTGCTACTG TTAATAAACA 360 

AGGTCGCGGT AAAAAAATCA CTGTATTCAC ATACAAACGT CGTAAAAATT CAAAACGTAA 420 

AAAAGGCCAT CGTCAACCAT ACACTAAATT AACAATCGAT AAAATCAACG CGTAATTATT 480 

ATGATTACTG TTGATATTAC AGTTAATGAT GAAGGCAAAG TAACAGACGT TATTATGGAT 540 

GGCCATGCTG aCCATGGTGa ATATGGTCAT GATATCGTTT GTGCTGGAGC TTCAgCTGTA 600 

TTG'&TGGTA GTGTTAATGC GATTATAGGA TTGACATCTG AGAGACCAGA TATCAATTAT 660 

GACGACAATG GTGGTCATTT TCATATAAGA AGCGTTGATA CAAACAACGA TGAAGCGCAA 720 

CTAATTCTTC AAACAATGCT TGTGTCTTTA CAAACTATTG AAGAAGAATA TAATGAGAAT 780 

ATTAGATTAA ATTATAAGTG AGGTGCATTC CGATGTTAAA ATTAAACTTA CAATTCTTCG 840 

CATCTAAAAA AGGGGTAAGT TCTACAAAAA ACGGACGTGA CTCTGAATCA AAACGCTTAG 900 

GTGCTAAACG TGCTGACGGT CAATTCGTAA CAGGTC3GTTC AATTTTATAT CGCCAACGTG 960 

GTACTAAAAT TTACCCTGGT GAAAATGTAG GTCGTGGTGG CGATGATACA TTATTCGCTA 1020 

AAATCGACGG CGTTGTTAAA TTCGAACGTA AAGGTCGCGA CAAAAAACAA GTTTCTGTAT 1080 

ATGCAGTAGC TGAATAATTT TGTCTAGTTA ACACCAGAAG TGAATCTTCT GGTGTTtTTT 1140 
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AGACGTTATA CTAAATGTGC AcTGTATAAG AGCCCCTAAT CACTAAACTA TAAGGGGGAC 1260 

AAAGGAATAC AGTTGCAGCG TTTAAAGAAT AAACTGTACC ACAATTGGTG CTGAGAAATA 1320 

TAAGTATTTT AAAGCAAAGA TTTATAAAAG TAACTGCATA AGAGCCCCTA ATTATTTACA 1380 

ATATATAAGG GGCTCTAATA TGCTATAATT ATTGGGAAAA TGAAAATTAT ATGTAAAAGA 1440 

GGTGAGATAT ATGTTTGTCG ATCAAGTCAA AATATCTCTT AAAGCCGGTG ATGGTGGTAA 1500 

TGGTATTACC GCATACAGAA GAGAAAAATA TGTACCATTT GGTGGACCAG CTGGCGGTGA 1560 

CGGTGGTAAA GGTGCTTCAG TCGTATTTGA AGTGGATGAA GGTTTAAGAA CGTTATTAGA 1620 

TTTTAGATAT CAACGTCATT TTAAAGCAAG CAAAGGTGAA AATGGCCAAA GTAGTAATAT 1680 

GCATGGTAAA AATGCGGAAG ATTTAGTATT AAAAGTTCCA CCTGGTACAA TTATTAAAAA 1740 

TGTTGAAACA GACGAAgTQT TAGCAGATCT TGTTGAAGAT GGTCAAAGAG CTGTAGTAGC 1800 

GAAGGGCGGT CGAGGTGGCC GAGGTAATTC ACGTTTTGCA ACACCTAGAA ACCCTGCACC 1860 

TGACTTCAGT GAAAAAGGTG AACCAGGTGA GGAATTAGAT GTATCTTTAG AATTOAAATT 1920 

ATTAGCTGAT GTAGGATTAG TAGGTTTCCC TAGTGTGGGT AAATCGACTT TATTATCTAT 1980 

CGTTTCAAAA GCTAAGCCTA AAATTGGGGC ATATCATTTT ACAACGATTA AACCAAATCT 2040 

AGGTGTTGTT TCAACGCCTG ATCAACGTAG TTTTGTTATG GCAGATTTAC CAGGTTTAAT 2100 

TGAAGGTGCA TCTGATGGCG TTGGATTAGG ACATCAATTT TTAAGACATG TAGAGAGAAC 2160 

AAAAGTTATT GTTCACATGA TTGATATQAG CGGTTCTGAA GGTAGAGAAC CTATTGAAGA 2220 

TTATAAAGTC ATTAATCAAG AATTAGCTGC GTACGAGCAA CGTTTAGAAG ATAGACCTCA 2280 

AATCGTAGTA GCTAACAAGA TGGATTTACC TGAATCACAA GATAATTTAA ACTTGTTTAA 2340 

AGAAGAAATT GGCGAAGATG TGCCAGTTAT TCCAGTTTCA ACAATAACGC GTGATAATAT 2400 

TGATCAATTA TTATATGCAA TAGCAGATAA ATTAGAAGAA TATAAAGATG TTGACTTCAC 2460 

AGTTGAAGAA GAGGAGTCAG TTGGCATTAA CCGAGTATTA TATAAACATA CACCGTCACA 2520 

AGATAAATTT ACAATTTCAA GAGATGATGA TGGTGCTTAT GTGGTAAGTG GTAATGCTAT 2580 

TGAAAGAATG TTTAAAATGA CTGACTTTAA CAGTGATCCA GCAGTACGTC GATTTGCTCG 2640 

TCAAATGCGT TCGATGGGTA TTGATGATGC GCTTAGAGAA CGTGGTTGTA AAAATQGTGA 2700 

TATOGTTAGA ATTCTTGGCQ GAGAATTTGA ATTCGTTGAA TAGGAGCGAA ACATGATGGA 2760 

CAATAAAGAT TATAAAAAGT TTTATTTAAT TAGAGAAGAT GTCTTGCCTG AATCCGTGGT 2820 

TAAAACATTQ AAGATTAAAG ATGCCTTAAA AAGTGATCCG ACATTGTCCA TTTATGATGC 2880 

CGTTAAACAG TTTGATCTAT CTAGAAGTGC TTTTTATAAA TATAGAGAAA CGATATTTCC 2940 

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10 



TGTTGGTATG TTGGCACGTG TACTAGATGT TATATCAAAG TTAGAACTAT CTGTATTAAC 3060 

GATTCATCAA AGTATTCCAA TGGAAGAAAA AGCAACAATA ACATTATCAC TGAATGCTAA 3120 

ATCTAAAGAA ACTTCAGTAG AAGATGTTAT TGGCGCTTTG AGAAATTTAG ATTATGTATC 3180 

AAAAGTAGAA TTAATTAGTA TGAGTATGTA AGGATGTGCC TATAATGTAC GCGTATGTCA 3240 

AAGGTAAGTT AACACATTTA TATCCTACAC ACGTAGTTGT TGAAACTGCT GGTGTTGGTT 3300 

ATGAAATTCA AACACCAAAT TCTTATCGTT TTCAAAAGCA TCTAGATCAT GAAGTTTTAA 3360 

TTCATACATC TTTAATTGTT CGTGAAGATG CACAATTATT GTATGGATTT AGTAGTGAAG 3420 

AAGAGAAAGA TATGTTCTTG AGTTTAATrA AAGTTACTGG TATTGGTCCG AAATCAGCTT 3480 

TAGCTATTTT AGCGACAAGT ACGCCTAATG AAGTAAAACG TGCCATTGAA AATGAAAATG 3540 

ATACGTATTT AACTAAATTC CCAGGAATTG GTAAGAAAAC GGCAAGACAG ATTGTCTTAG 3600 

ATTTAAAAGG TAAAGTGAAA ATTACTQAAG AAGATAGCGA TTCATTATTA CAAGTAGACG 3660 

CTACTTCGAC GGTGCAAGAT CAATTCGTGC AAGAAGCAAT GTTAGCGTTA GAAGCATTAG 3720 

GTTATTCTAA ACGAGAGCTT GCAAAAGTTG AGAAAACGTT AAATAAAAAT AAATATGACT 3780 

CAGTTGATGA AGCTGTTAAG GCAGGTCTTC AATTAGTTGT ATCTTAATTT TAAATAGATT 3840 

AATAGGGGAA GTGTTGTCAT GAATGAGCGT ATGGTTGATC AATCAATGCA TAGTGAAGAA 3900 

ACTGATTTCG AATTGTCGCT TAGACCTACG AGATTACGAC AATATATTGG TCAAAATTCA 3960 

ATAAAAAGTA ATTTAGAAGT ATTTATTAAA GCGGCTAAAC TTCGTCATGA ACCATTAGAT 4020 

CATGTATTGC TTTTTGGCCC CCCTGGATTA GGTAAGACAA CATTATCTAA TATCATTGCC 4080 

AATGAAATGG AAGTTAATAT ACGTACAGTA TCAGGGCCTT CATTAGAAAG ACCTGGTGAT 4140 

TTGGCTGCAA TTTTATCAGG ACTTCAACCT GGAGATGTTT TGTTTATTGA TGAAATACAC 4200 

AGACTGAGTA GTGTTGTTGA AGAAGTGTTA TACCCTGCAA TGGAAGATTT CTTTTTAGAT 4260 

ATTATCATTG GTAAAGGCGA TGAGGCTAGA AGTATCCGTA TCGACTTACC TCCATTCACT 4320 

TTGGTAGGTG CAACAACGCXj AGCTGGCAGC TTAACAGGTC CACTAAGGGA TCGATTTGGT 4380 

GTGCACTTAA GATTAGAATA TTATAACGAA TCAGATTTAA AAGAAATCAT TATTAGAACA 4440 

GCTGAGGTTT TAGGCACAGG TATTGATGAA GAAAGTGCCA TTGAACTTGC TAAACGTTCT 4500 

AGAGGGACTC CAAGAGTAGC AAATCGACTA TTGAAGCGGG TAAGAGACTT CCAGCAAGTG 4560 

AATGAAGATG AACAAATATA CATTGAAACA ACGAAGCACG CATTAGGTTT ACTTCAAGTT 4620 

GATCAACACG GACTAGATTA CATTOATCAT AAAATGATGA ACTGTATTAT TAAGCAGTAT 4680 

AATGGCGGAC CTGTTGGTrr AGATACGATT GCCGTAACAA TTGGTGAAGA ACGTATTACA 4740 
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GGCAGAAAAG CAACACCATT AGCTTATGAA CATTTTGCAA AGTCGAATGA GGAGAGAGAA 4860 

TAACGTGAAT ATTGAAGAAT TTGACTATGA CTTACCAGAA TCATTAATTG CTCAAACGCC 4920 

TTTAAAAGAT CGTGATCATA GTCGTTTATT AGTCATGGAT AGAGAAACTG GTGAAATGAA 4980 

ACATTTACAT TTCAAAGATA TCATTGAGTA TTTTAGACCT GGTGATACAT TAGTGCTTAA 5040 

CGATACGCGA GTAATGCCAG CTAGACTTTT TGGTTTAAAA GAAGAAACTG GTGCAAAAGT 5100 

TGAAATGTTA ATGTtAACTC AAATTGAAGO TAATGATTGG GAAGTCTTAC TGAAACCAGC 5160 

TAAGCGTATT AAAGTTGGTA ATAAATTGAA TTTTGGTAAT GGCAAAATTA TAGCTGAATG 5220 

CATAAAAGAA ATGGATCAAG GTGGACGCAT CATGCGTTTA CATTATGAAG GTATTTTACA 5280 

AGAAAGATTA GA7GAATTAG GGGAAATGCC ACTGCCACCA TACATCAAAG AACGTTTAGA 5340 

TGATCCAGAT CGTTATCAAA CAGTTTACGC TAAAGAAAGT GGTTCAGCGG CAGCACCAAC 5400 

AGCAGGATTA CATTTTACTG ATGAGTTATT AATTGAAATT AAAAATAAAG GTGTTAATAT 5460 

CX5CATTTGTT ACATTACATG TTGGGTTAGG TACGTTTAGA CCGGTGAGCG TAGACGATGT 5520 

GAATGACCAC GAAAIGCATA GTGAATATTA TCAAATGACm CAAGAAACAG CTGATTTATT 5580 

AAATGATACT AAGcCAAAGG ACATCGCATT ATATCAGTTG GTACAACTTC AACACGTACA 5640 

CTTGAAACAA TTCX3ACGCGA TCATGATAAA TTTGTTGAAA GGAGTGGCTG GACTAATATA 5700 

TTTATTTATC CAGGATTTGA TTTTAAAGCA ATTGATGGCC AGATTACTAA TTTTCATTTA 5760 

CCAAAATCAA CATTAGTTAT GCTAGTATCA GCGTTTAGTA GTCGTGAAAA TGTTCTGAAT 5820 

GCTTATAAAA CGGCAGTAAA TTTAGAATAT AGATTCTTTA GTTTTGGCGA TGCAAT6TTA 5880 

ATTATATAAA AAGAATGTGA GGATTTTGAA TATGCCTGCA GTAACATACG AACACATTAA 5940 

AACTTGTAAA CAATCAGGTG CGCGTTTAGG TATCGTGCAC ACACCACACG GTTCATTTGA 6000 

AACXCCTATG TTTATGCCAG TTGGTACTAA AGCAACCGTT AAAACAATGA GTCCAGAAGA 6060 

GTTAAGACAA ATTGAAGCAA AAATCATTTT GGGCAACACA TATCATTTGT GGTTACAACC 6120 

CGGAAATGAT ATTATCAAAC ACGCTGGGGG ATTACATAAA TTCATGAATT GGGATGGTCC 6180 

GATTCTTACA GATTCAGGCG GTTTCCAAGT GTTTAGTTTA AGTAATTTAC GTAAAATTAC 6240 

AGAAGAAGGC GTGGAATTTA GACATCATAC TAATGGGTCT AAATTATTTT TGAGTCCTGA 6300 

GAAATCAATG CAAATTCAAA ATGATTTAGG ATCTGATATT ATGATGGCAT TTGATGAATG 6360 

TCCACCGATG CCTGCTGAAT ATGATTATGT AAAAAAATCT ATTGAACGTA CAACACGTTG 6420 

GGCGAAAAGA TGTCTAGATG CACACCAAAG ACCTGAAGAT CAAGCATTGT TCGGCATTAT 6480 

ACAAGGTGGC GAATATGAAG ATTTAAGAGA ACAAAGTGCA AAGGATTTAG TAGAATTAGA 6540 
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AATGGTTGAA CATACAGAGC AGTTTATGCC TAAAGATAAA CCAAGATATT TAATGGGTGT 6660 

AGGATCTCCa GATGCGTTAA TCGAATGTAG TATTCGCGGC ATGGATATGT TTGATTGTGT 6720 

CTTACCGACA CGTATTGCCA GAAATGGTAC TTGTATGACA TCGCAAGGTC GTTTAGTTAT 6780 

TAAAAATGCA AAATTTGCAG ATGATTTAAG ACCGTTAGAT GAGAATTGTG ACTGTTATAC 6840 

ATGTCAAAAC TATTCAAGAG CGTATATACG TCATTTAATC AAGGCAGAGG AAACrTTTGG 6900 

TATTCGTCTT ACTACTATTC ATAATTTACA TTTTCTGCTA AAATTAATGG AAGATATAAG 6960 

ACAAGCCATT CGAGAAGATC GTCTTTTAGA TTTCAAAGAA GAATTCTTCG AGCAATATGG 7020 

ATTAAATGTT GAGAACCCAA AAAACTTTTA AGCAAGAGGA GCGTATAAAA TGCAATTTTC 7080 

ATTACTAATA TATATAGTCG TAATTTTTGC GGTTATGTAT TTCTTGATGA TCAGACCACA 7140 

ACAAAAACTG TGCGAAACA 7159 


(2) INFORMATION FOR SEQ ID NO: 303: 




(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3159 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 303: 






TCCATTTATA GAAATTTCTA AAGCAGAnAA TAAGATAGAA GATATCGGCC AAGGTGCAGA 60 

AATCATCAAA AGAACACAAG ACATTACTAG CAAACGATTA GCTATAACTC AAAACATTCA 120 

ATTTGATTTT GTAAAAGATA AAAAATATAA CAAAGATGCC CTAGTTGTTA AGATGCAAGG 180 

CTTCATTAGC TCTAGAaCAA CATATTCAGA CTTAAAAAAA TATCCATATA TTAAAAGAAT 240 

GATATGGCCA TTTCAATATA ATATCAGTTT GArAACGAAA GACTCTAATG TTGATTTAAT 300 

TAATTATCTT CCTAAAAATA rAATTGATTC AGCAGATGTT AGTCAGAAAT TAGGCTATAA 360 

TATCGGOGGA AACTTCCAAT CAGCGCCATC AATCGGAGGC AGTGGCTCAT TCAACTACTC 420 

TAAAACAATT AGTTATAATC AAAAAAACTA T6TTACTGAA GTAGAAAGTC AGAACTCTAA 480 

AGGTQTTAAA TGGGGAGTGA AAGCAAATTC ATTCGTTACA CCGAATGGTC AAGTATCTGC 540 

ATATGATCAA TACTTATTTG CACAAGACCC AACTGQTCCA GCAGCACGAG ACTATTTCGT 600 

CCCAGATAAT CAACTACCTC CTTTAATTCA AAGTGGCTTT AATCCATCAT TTATTACAAC 660 

ATTGTCACAC GAAAGAGGTA AAGGTGATAA AAGCGAGTTT GAAATcACTT ACGGCAGAAA 720 

CATGGATGCT ACATATGCTT ACGTGACAAG ACATCGTTTA GCCGTTGATA GAAAACATGA 780 
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AAAAATTAAA AGCATCACAC CTAAGTAAAC AGTTCaATCA TCTTAAAAAA TCCTGGGACA 900 

CTTCATACTT GTCTCAGGAT TTTTTmACAA ATTGAATCAG CCTCATAACA TTAAATTATT 960 

TTATCGTACA TTAAATTTAA TAATAACAAC TGATTTTTAT AAGAATAAAG TATCGAmCCA 1020 

TAGTAGATAC ACAAATAATA CAAATGAAAC AATTTAACTT GAAAGCTTAa ATAAATATTA 1080 

TCAAGTTAAT AAACAATTAA TTTTTAGATG GATTCATCAA AAATCGTAAA AAAGCACAAT 1140 

TTGTATTTTA CAAACATTAA TTAAAAAAGA AAGCAAGACA TTCGTGCAAT CGGTTACCTT 1200 

AAATTGTTTA CAACTGTCAA CAATACCAAG GTTTTATTAA CTATATTTCT CACAAAATTA 1260 

GCTTTTAGCA TTCCAAACAA AAAAGGTTAA ATCGAACGGA ATTATGGCAT TTTTAACTTA 1320 

ATTGTAAAAA AAGTTGATAA TGGTCAATTG TTAATGAACA GTTAATTATA ATAACGCCCA 1380 

AAATATATTA TTATTTAATT AAGTTAAATA AAATTATAGA AAGAAAGTGA AACTTATGCT 1440 

TAAAAATAAA ATATTAACTA CAACTTTATC TGTGAGCTTA CTTGCCCCTC TTGCCAATCC 1500 

GTTATTAGAA AATGctAAAG CT6CTAACGA TACTGAAGAC ATCGGTAAAG GAAOCGATAT 1560 

AGAAATTATC AAAAGGACAG AA6ATAAAAC AAGTAATAAA TGGGGCGTGA CTCAAAATAT 1620 

TCAATTTGAT TTTGTAAAGG ATAAAAAATA TAACAAAGAT GCTTTGATAT TAAAGATGCA 1680 

AGGATTCATT AGCTCTAGAA CAACATATTA CAACTATAAA AAAACTAATC ATGTTAAAGC 1740 

TATGCGATGG CCATTCCAAT aTaATATTGG TTTAAAAACA AATGATAAAT ATGTTTCTTT 1800 

AATTAATTAT TTACCTAAAA ATAAAATTGA ATCTACAAAC GTGAGTCAGA CATTAGGATA 1860 

CAATATCGGT GGTAATTTCC AATCAGCCCC ATCACTCGGT GGTAATGGAT CATTTAACTA 1920 

TTCTAAATCG ATTAGCTATA CACAACAAAA TTATGTAAGT GAAGTAGAAC AACAAAACTC 1980 

AAAAAGTGTT TTATGGGGCG TCAAAGCGAA TTCATTCGCC ACTGAATCAG GTCAAAAATC 2040 

AGCCTTTGAT AGCGATTTAT TTGTAGGCTA CAAACCTCAT AGTAAAGATC CTAGAGATTA 2100 

TTTCGTTCCA GACAGTGAGT TACCACCTCT TGTACAAAGT GGATTTAACC CTTCATTTAT 2160 

CGCCACAGTA TCTCATGAAA AAGGTTCAAG CGATACAAGC GAATTTGAAA TTACTTACGG 2220 

AAGAAACATG GATGTCACTC ATGCCATTAA AAGATCAACG CATTATGGCA ACAGTTATTT 2280 

AGACGGACAT AGAGTCCATA ATGCATTTGT AAATAGAAAC TATACTGTGA AATACGAGGT 2340 

CAATTGGAAG ACTCATGAAA TCAAGGTGAA AGGACAGAAT TGATATGAAA ATGAATAAAT 2400 

TAGTCAAATC ATCCGTTGCT ACATCTATGG CATTATTATT ACTTTCTGGT ACTGCTAATG 2460 

CTGAAGGTAA AATAACACCA GTCAGCGTAA AAAAAGTCXSA TGACAAAGTT ACTTTATACA 2520 

AAACAACAGC CACAGCAGAT TCTGATAAAT TTAAAATTTC ACAGATTTTA ACATTTAATT 2580 

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10 



ACTCAGGCTT TGTGnAACCT AATCCTAATG ACTATGACTT TTCAAAATTA TATTGGGGAG 2700 

CTAAATACAA TGTATCTATA AGCTCACAAT CTAATGATTC AGTAAACGTC GTTGATTATG 2760 

CACCAAAAAA TCAAAATGAA GAGTTTCAAG TTCAAAATAC TTTAGGCTAT ACATTTGGTG 2820 

GTGACATTAG TATCTCTAAT GGTTTATCTG GTGGACTTAA TGGAAATACA GCTTTTTCTG 2880 

AAACAATTAA TTATAAACAA GAAAGTTACA GAACAACATT AAGTCGCAAC ACAAATTATA 2940 

AAAATGTTGG CTGGGGAGTT GAAGCACATA AAATTATGAA TAATGGTTGG GGACCTTATG 3000 

GAAGAGATAG CTTCCACCCA ACATATGGTA ATGAACTCTT CTTAGCTGGC AGACAAAGCA 3060 

GTGCATACGC TGGCCAAAAC TTCATAGCGC AACACCAAAT GCCATTATTA TCTAGAAGTA 3120 

ACTTCAATCC AGAATTTTTA AGCGTACTAT CACACAGAC 3159 
(2) INFORMATION FOR SEQ ID NO: 304: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3821 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 304: 

GCAAAAACTT TTCTCCAACT ATGTCAGACT ATAATGGCAT GAACAAAATT GATATGATGA 60 

ATCAAATTAA AGTTGATACG ATGTTGCATO GTTATCACGC TGGATTTTTA TTTGCATTAT 120 

TGATTACAGT TGTTAGTTTC TTTTGTTCAT TTATGTTACA AGGTAAGAAA AAAGAAGTTG 180 

ATTCTCGTCA ATAAATATAA GTTGCTAGCT ATATAAAGCT TTTTAGCAAA AGTTCAACAT 240 

ATTGACITAT CCGGCATTTC AGATTAAAAT ATTTTTATTC CGATTAGAAT AATAAGAATA 300 

AGGAC3ATATA TTCTATGAAA AGACTTTTAT TTGTGATGAT AGCTTTCGTT TTCATATTGG 360 

CTGCATGCGG AAACAATTCG TCGAAAGACA AGGAAGCTAG TAAAGATAGC AAGACAATTA 420 

ATGTTGGGAC TGAGGGGACT TATGCACCAT TTAGTTTCCA CGATAAAGAT GGTAAATTAA 480 

CTGGTTAOGA TATTGATGTT ATTAAAGCAG TGGCTAAAGA AGAAGGTTTA AAACTTAAAT 540 

TTAATGAAAC TTCTTGGGAT TCTATGTTTG CAGGTTTAGA CGCAGOOCGT TTTGATGTAA 600 

TCGCGAACCA AGTAGGTATT AATCCTGATA GAGAAAAGAA ATATAAATTT TCTAAGCCTT 660 

ACACATTCTC AAGTGCTGTT TTAGTTATTC GTGAAAATGA AAAAGATATT AAAGATTTTG 720 

ATGATGTTAA AGGTAAGAAG TTAGCACAAA CATTCACATC TAATTATGGT AAATTAGCTA 780 

AGGATAAAGG TGCTGATATT ACAAAAGTTG ATGGCTTTAA CCAATCAATG GATTTATTAT 840 

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AAAAACCTAA TGcTAAGATC AAAGCAATCA AAGGTAATGC TGAACAAAGT AGATCTGCAT 960 

TTGCATTTTC TAAAAAAGCA GATGATGAAA CAGTTCAAAA ATTCAATGAT GGCTTGAAAA 1020 

AAATCGAGGA AAACGGTGAA TTAGCTAAAA TAGGTAAGAA ATGGTTTGGT CAAGATGTTT 1080 

CTAAATCTAA ATAGCGAACA ACAACATGCG CTAGATGCTG CAAAACAAGC TTTCGGACCT 1140 

ATGCTAGAAG GTTTGGTCAA ATATTCAATT CCTATTACAT TAGTTACATT TGTTTTAGGA 1200 

TTGATTATTG CATTATTTAC AGCATTAATG CGAATTTCAA CGAGTAAAAT TTTAAGAAGT 1260 

ATTTCACGTG TCTATGTATC TATTATTCGA GGAACACCAA TGATAGTACA ACTATTTATC 1320 

ATATTTTATG GTATTCCAGA ATTAGGTAGA TTATTAACAA ATGACX3CTGA CAACCAATGG 1380 

ACATTGGCAC CTGTAGTGGC TGCTATTATT GGTTTATCAT TAAATGTAGG T6CGTATGCT 1440 

TCGGAAATTA TTCGTGGCXSG TATTATTTCT ATACCGAAAG GACAAACAGA AGCTGCaTAT 1500 

TcCaTCGGTA TGACGTATGG TCAAACGATA CAACGTATCA TTTTACCGCA GGCAATTCGA 1560 

GTGTCGATTC CTGCACTAGG TAATACATTT TTAAGTTTAA TCAAAGATAC ATCATTATTA 1620 

GGATTTATTT TAGTGGCTOA AATGTTTAGA AAAGCTCAAG AAGTTGCGTC TACAACATAT 1680 

GAATATTTAA CAATTTATGT GTTAGTTGCG CTAATGTACT GGGTGGTATG CTTTATTATT 1740 

TCAATTATCC AAGGTATCTA TGAATCTTAT ATTGAAAGAG GGTATCGCTC ATGATTCAAT 1800 

TGAACAATAT CCATAAATCA TTTAATGATG TTGAAGTCAT CAAAGGTATT GATTTATCTG 1860 

TTGAACAAGG TGAGGTTGTA ACCTTAATCG GTCGATCTGG TTCAGGTAAA ACAACATTGT 1920 

TACGTATGAT TAATGCATTA GAAATTCCAA CTGAAGGTAC AGTTTATGTT AACGGCAAAA 1980 

CATATACATC TAAAGATAAA AAATCACAAA TAGAAGTTCG TAAACAGTCT GGTATGGTAT 2040 

TTCAAAGTTA TAACCTTTTT CCGCATAAGA CGGCATTAGA AAATGTAATG GAAGGTCTTA 2100 

TCACAGTTAA AAAGTTGAAA AAGQATGAGG CACGTGGGAA ATCACTTGAG TTACTTGAGA 2160 

AAGTTGGTTT AACACATGTC AAAGATCAAC GTCCACATGC ATTATCAGGT GGTCAACAAC 2220 

AACGTGTTGC TATTGTCAAG AGCACTAGCA ATGAACCCTA AAGT6ATGTT GTTTGATGAA 2280 

CCAACATCTG CACTTGATCC TGAACTTGTG AATGATGTTT TAAAGGTTAT TAAAGATTTG 2340 

GCTAATGAAG GCATGACAAT GGTCATTGTG ACACATGAAA TGCGTTTTGC TAAAGAAGTA 2400 

TCTAATAACA TTGTATTTAT TCmTGAAGGC ATGATCGGAG AACAAGGGGC TCCAGAAGAG 2460 

ATGTTCAATC GTCCGAAAAC AGAAGAATTA AGACGTTTCT TAAATGTTAT AAATGAAGAA 2520 

TAATCAAATA GAACCACGTA TCATGTTTTA GTATGGCGAT GAAGCCATAT ACATGATGCG 2580 

TGGTTCTTTG TTATGTTGTC ATAATCTTGG AGCGATATTT TAACGACGTT TATGATTTAA 2640 
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TTCTACATGT GCGTTAAAAC CTTTTTTGAA TTGTTGGACG CCATAGTCTT CTGATGACTC 2760 

TGAAAAGTCA CCGGTAATAC CATAAAAATT ATAGCGATCA ATATGATGCG CTTTAGCAAA 2820 

CTTAATCATT TCCCaCTGCA AATGGTAGGC ACCCATATAA GCATTATATT TAGGGTTTGA 2880 

ACCACTAGAT AAGTAATAAA CTTCATGCTC ATTGTAGATA AATAAAGCAG AAGCTAAGTT 2940 

TAAGACTGCA CCATCTTGTT CAATTTGTTC TATTGTATTG TCGATTTTAC GCTTATTGCT 3000 

ATTTAGCTGT TGTTCTAGCT GTGTGCGTTT CGTTTTATTT TTCTTTGAAT TAGGACTTTC 3060 

TTCCAATGCT TCTTCAACAC CTGAGAGTTC AGCTGTTAAT TGTTGTTGCT TTAATTGTAA 3120 

CGTTTTTAAA TACTCXnTTA AATCAATATA CGCCAACTTT AACATGGCGT GGTCATCGTA 3180 

TAACTTTTGC ATTTCTTCAA AGTATGGTAA CTCACX3GAAT TTGAAACCGT GCTTTTCCTC 3240 

AGCCATATGG AATAAGTCX3A AAAAAGTTTG CGTTTCATCA ATCGTTAAcG TTTTaGTTTT 3300 

GACACCAAtA TCaTATGTtT tTtTAATATT ACGTCTCGtT TGATAATCCA TTTCTTTTAA 3360 

AAGTTGGTCT TCAGTCTTAT CTTTTAAATC TAACACTGAC AGCCAACGGA TTTGGCTCAT 3420 

TGAATCATAA CCTACAGGGA AACCTTGGTG TTTATAACCT AATTTATCCA TTGTTCTaAC 3480 

AAATGCTCGG TTATCATAAG ATTTAACAAT TTCACCGTCT GCATTGCGTA AATTTTCAAT 3540 

TAAATATGGA TCTACAAGGA CATATAAACA ATTGTGTTTC TTTAAATATG ACGTTAATGC 3600 

TTTAAAGAAA AATGCTACTA ATGATTGATT TGTATAATCC ATCACTGGCC CGCGATGTGT 3660 

ATAAAAATAT TTGAAAAATT TAAGTGTGCG TGCTTCTGTC AATAAGCATC CTGCAATCAC 3720 

TTGACCATTG TCATCTTTAA CCCCTACrAG ATGCACATCG CCTTTTAAAT CAACTCTATG 3780 

ATTGTAATGA ATAGCTGATT GTGTGTAATG TGAAAAATGC T 3821 

(2) INFORMATION FOR SEQ ID NO: 305: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1422 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 305: 

GCGACACCAT TGTTTGCAGC AACTGTTAAT TTACCAGTTG ATGGTTTCGA CGGTGTTGTT 60 

GGTTTTGATG GTGTAGTAGG GGTAGTTGTA GATTGCGTAC CCCATGGCGC CACTTTACCC 120 

ATTTTTATTA AATATTTTTC ATTAATTAAG TCATATAATT GATCATAACT ATAATTATGA 180 

CTTCTTAAAT ATCCATGTGG ATcGGCATGG TCAGTACCAC CTAAATATTT ACTTACAGCG 240 
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IS 



TATTGTAATT GTGTAGCTGC ATAGTCAGCA TAGTTATTCA TTGAACGTGC AAATGAAGCA 360 

TAGTCGTGTG TGTGTACGAT TTCAACATTG ATGAATCTAG GGTTACCGAC TGCACCX3ACA 420 

CCCCAAGATA AGTAATCCGT TGGTGCTGTT TCGATTATAC GATCCCCATC AACAAATGCA 480 

TGTACXSAATG CGTTTTGATA GTTATTTTTC ATATAACTAA TTTCACCATT TATCGTCGAA 540 

CGATCATTAG CTGTATCATG AACTACGATA CCTTCAGGAC GACCTACGCC QTTACGGTAT 600 

GCGTATTTAG GGAAGTAAGA TGTATAATCT TCTTCAATTT TAGGTGCTTT TAAGTTATTT 660 

TTACAAATGT AATCGTTAAT TGAAGAGTTT ACTTGTGGTT TATATTTTGG CAAACTCGTT 720 

TTTGGTGTTG CAGCAACTGA TCTTGGTTGT GCTGAAGCGC TAAAAGTAGT TACTTTAGGT 780 

GTCGCTTCAQ TTTTAGCTTT AGGTGCTGAT GTAGTTGCAG CTTTAGGTGC TGCGGTTTTA 840 

TATTGCGTyT CAAGAGCTGC AGGTTTAOCA GCTGATTTAA TTAATTCTGG ATTAATTTGA 900 

TTTTCTGAAT TATCATCTTC ATCATCAACT AAACTATAAC CAGCATTTGT AACATTAGTG 960 

TTAGTTTTAO GTGCTGTAGT GCTTGTTGAC TTTGCAACAG GCTGCGTATT ATTTGTAGTC 1020 

GCTGATTGAT TAGCACGAGT GTCACCATTT ACTTGTGCAG TATCAACTTT TTGACTTACT 1080 

TGAGCATTGC CTGTTTTGTT ATTTGCTGTT TTTGGTTGGA CAATAGCAGG GTCTTGATAT 1140 

ACTTGAGTGC CAGAAATGTT TTGCGTTGGA TTTTTTACCT CAGCTTTTGC TTGTTCAGTA 1200 

GTTGCTTTAA CTTTATTACT ATCTAAAACG TTTTTATTAG TAGTTTGATC TTGTGTCGTC 1260 

TCAGCTGCTT GAACTTGATG TGCAGTGACT GCTGAACCTA CAAGCGTTAA TGCAACCATT 1320 

GAgGTAGTTT GTAATTGAAT TTTTTCGCCA TTCTATTTAT TACTCCyAAC ATTTATTAAT 1380 

TATtACTAAC ATTATAGTAC CTGTnTTATA TACCTGTGCG TA 1422 

(2) INFORMATION FOR SEQ ID NO: 306: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6076 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 306:

ACGCGTTGGT TATTTCAATC ATAATATTAC TCTGCAAATA CACAGCCCTG TAACCGAGTA 60 

AATGGATAGA GATTTGAACA AATGAAAACA ATCAACTAAT GGAAAGGATA AAATATTATG 120 

CACAACAATA ATGAATTCAA CAAAAAACTG AAAGATTTTA TAGGTAGCGA TAAACGGATG 180 

GCTTTAGTAA AGGGTTATGT TAACGAGTAT AAATTAACAA CTGTTTTAAG AGCTTTAAAT 240

GTAGATAATC GAATTATTCC TAAAAATATA ACTCAAAACA CAATCTTCAA ATTGAGTAAT 360

TTAACCTTAG AGGTTAACCT CTATGAAAGA AACACAATTT ATCACGGAGA CTTTGCTATT 420 

TACTATCCTG TAGAATCTGC TCTAATAATA GAAACAGATA CTCCTAGGCT AATTAATCAT 480

ATAAATAACA ACCCAGTCCA AAAAATATTT ATAATAACAA CAAATGACTG GAGCTTTAAT 540 

ACACTTGaAT TAGAAAAAAT AGTAGACGAA ACTATAATTT ATGATTTAAA ACAAGAAGAT 600 

CCTAAACAAT ATGAAATTTT ATATAAAAAT AAACAOGGAA AATTGCCTTA TTGATGAATT 660 

AAAACGCAAA CTTAGAACAC CTTACCCCTT TTCOGTCCTT ATTGTAATTT TCCAACTACT 720 

CAAATTAAAA ACGACCGGTG CAAACAGGTG GTTTTATTAT ATATAATTAT TCCCACTCAA 780

TTGTGCTTGG TGGTTTrGAT GTAATGTCAT AGACTACGCG GTTGACGTGA TCTACTTCGT 840 

TTACGATACG ACTAGAAATC TTTTGTAAGA CTTCCCAATC GATGOGTGCG AAGTCACTTG 900 

TCATACCGTC GATAGATGTT ACTGCACGAA TACCrACTGT GTGATCATAC GTACGGTAGT 960

CTCCCATAAC ACCTACTGAT TGAATGTTTG GTAACACTGT GAAGTATTGC CAAATTTCTC 1020 

TTTCAAGACC TTCTTCTCTA ATCACTTGGC GTAAAATCGC GTCTGATTCT CTAACGATTT 1080 

CTAGTTTATC TTCAGTAATT TCTCCAAGTA CACGAATACC AAGACCAGGT CCTGGGAATG 1140 

GTTOTCTCCA TACTAAATGT TCTGGAATAC CTAACTCAAT ACCTAATTTA CGTACTTCAT 1200 

CTTTAAACAA TGTATTGATT GGTTCGATTA ATTCGAATTC CATGTCTTCT GGTAATCCAC 1260 

CAACATTGTG GTGTGATTTG ATTGTTTGTG CTGTTTTAGT ACCTGATTCG ATGACGTCTG 1320 

TATATAGTGT TCCTTGCGCA AGGAAGTCTA CACCTTTCAG TTTTGATGCT TCATCATCAA 1380 

ATACGTATAC AAATTCATTA CCAATGATTT TACX3TTTTTG TTCAGGATCT GAAACACCTT 1440 

TTAATTTATT CATAAAGCGA TCTTTCGCAT TAACACGAAT AATATTCATG TTGAAACCTT 1500 

CACC?5AATTG CTCCATAACC ATGTCGCCTT CACCTTTACG AAGtAAGCCA TGGTCTACAA 1560 

AGATACATGT TAGTTGATCA CCTATTGCTT TATGCAATAG TACAGCTACA ACAGATGAAT 1620

CTACX3CCGCC ACTCATCGCA CATAATACAC GACGGTCTCC TACGCGTTGA CGAATCTTTT 1680 

CAATTTCGAT TTCGATAAAG TTTTCCATTG TCCATTGACC TCTACAATCA CAAACACGAC 1740 

GGACAAAATT ATTTAATAAA TCATTACCAT ATTCTGTATG ACGTACTTCT GGATGGAATT 1800

GAACACCATA AATGCGACGT TTCTTATCTT CX5ATTGCTGC ATAGTCTGTO CTTGGGCTAT 1860 

CAGCGATAAC TTCAAAGCCT TCTGGAATTT CAATAACTTT ATCAGAATGA CTCATCCAAA 1920 

CAGTTTGTTC TGCTGGTAAG CCAGCGAATA ACTCATCTGA CTTCGCATTA ATGATTGCTT 1980 

TACCGTATTC ACXSTTCATTG GCAOGTTCAA CTTTACCACC TAATAATTTA GTAGTTAATT 2040 

TAAATGAACC TTCTTCATAA ACTGAATTTG GACCACCTGA TAAGATAATA CCTTTTGGAT 2160

TCATTTTCTT AATTTCTTCA ATTGAAATTT CATGATCGTG TAATTCACTA TAAACGCCCA 2220

TTTCACGAAT TCGGCGTGTA ATTAATTGGT TGTATTGGCT ACCAAAGTCT AAGACAAGGA 2280

TTAACTCTTG TTCTTTTGCC ATTTCCATAT TTGTCGTTCT CCTTTATCTT AATTAGAATG 2340

AGTAGTTCGG TGATTCTTTC GTAATTTGAA TATTATGTGG ATGGCTTTCT GCTAAACCAG 2400

CAGGACCCAT ACGTGTAAAT TGTGCTTCTT CGCGTAATTC TCTTAAATCG TGTGAACCAG 2460

TATAACCCAT ACCAGCACGC ACACCGCCCA TTAATTGGTA AATTGTATCT TGTAACGCAC 2520

CTTTATAAGC CGTACGTCCT TCGATACCTT CAGGAACAAA TTTCTTAGGC GCTTTGTCCT 2580

CTTGGAAGTA ACGGTCGTTT GAACCTTTTT CCATCGCACC TAAAGAGCCX: ATACCACX3GT 2640

ATACTTTATA TTGTCTACCT TGGAAAATTT CTGTTGCGCC TGGGCTTTCT TCAGTACCTG 2700

CTAATAAGCT ACCTAACATA ACCGCATGTC CACCAGCAGC TAATGCTTTA ATGATATCTC 2760

CTGAGAATTT AATACCACCA TCAGCAATGA TAGCTTTACC ATGTTTGCGT GCTTCAGTtG 2820

CACAATCATA AATTGCTGTA ATTTGTGGTA CACCAACACC TGCTACAACA CGCGTCGTAC 2880

AAATTGAACC TGGGCCAATA CCAACTTTAA CAATATCTGC ACCCX3CTTCA AATAAATCTT 2940

TTGTTGCTTC TGCACTTTGCT ACGTTACCTG CTACTAATGT GATTTCTGGG TAAGTCTTCT 3000

TAATATGTTT CACTTGATCG ATAACACCTT TAGAGTGACC ATGTGCTGTA TCX3ATAACTA 3060

AGACATCCAC ACCTGCTTCG ACTAATTTTT GAGCACGAAT ATCAGTATCT TTTGAAATAC 3120

CAATTGCTGC GGCTACAAGT AGACGACCAT GTTCATCTTT TGCTGCATTA GGGAATTCGA 3180

TAACTTTTTC AATATCTTTA ATAGTAATAA GACCTTCTAG ACGTCCGTCT TTAACTAATG 3240

GTAACTTTTC AATCTTATGT TTTTGGAGAA TTTTTTCTGC TTCTTCAAGT GTTGTATTCA 3300

CTGGAGCTGT AATTAAATTt TCTTGCGTCA TTACATCTAC AATTTTAATC GAGAAGTCTT 3360

CAATAAAACG TAAGTCACGG TTTGTTAAAA TACCTACTAA GTTGCGATCT TCTTTATTAT 3420

CAACAATTGO TACACCTGAA ATACGGTATT TACCCATTAA TOCTTCTGCT TCATAAACGC 3480

TTTCTTCTGG CGTTAAGAAA AATGGGTTTG AAATGACACC ATTTTCTGAG CGTTTTACTT 3540

TTTGAACTTC GTCCGCTTGT TCTTCAACGC CCATATTTTT ATGAATAACA CCTAAACCAC 3600

CTTGACX5AGC CATAGCAATC GCCATTTTAG ATTCAGTTAC AGTATCCATA CCAGCAGAAA 3660

TAACTGGAAT ATTTAATTTA ACTTTGTCTG ATAATTGTAC GCTTAAATCA ACGTCTTTCG 3720

GTAAAATATC AGATTGTGCT GGAATTAATA ACACATCATC AAACGTTAAT GATTCTTTTG 3780

CAAATTTACT TTCCCACATT AAAAACAGCC TCCATTTTTC AAATTAATTA GTTATATTAT 3840



GCAGAGATTG CGCCTAAAAC AATTCOSTTT TGAGTCAACC ATGCAAATTG TTCACCTAAA 3960 

CCTTTAAATG CTTGTGGTAC AGCGCTTATA CCAGTACCTA AtCCTACTGA TACAGCGATA 4020 

ATTAATAAAT TGnTTGATT TTTAAAATCG ATATGTCCTA ATATACTAAC ACCATATGCC 4080

ATTACCATGC CAAACATAGC TATCATCX3CA CCGCCTAACA CAGGTAGCGG TATGATATTT 4140 

GCTAATGCGC CAAGCTTAGG TATACAACCA CATATAAGTA ATAACACGAC CATGCCGTAT 4200 

ATAACATTGT TTTTCTTAGC GCCGGATAAA GAAACAAGTC CTACATTTTG CGAATAGGCT 4260 

GTATACGGAA ATGAATTGAA TATAGAACCT AAcACTATCXS CTAGACCTTC CGCAGTATAA 4320 

CCTTTACGAA AATCTTTTCT TTCTAACTTC TTACCGGTAA TTTCACTTAA CGCATGATAG 4380

ACACCTGTCG ACTCAATTAA ACTAACGATA GCTAGAATAA AGAACACTAA CGTCGATGTC 4440 

ACATCAAAGC TAAATCCAGA GAATCTAAAC GGCACTGGGA TGCCTAACCA ACCGGCATGA 4500 

TTGACTTGAT TAATATCGAC CATCCCAAGT AAGCCAGCAC CTATCGTTCC TAAAACGAGT 4560

CCAATTAATA TGGCAATACT CTTAATAAAT CCAGTTGTGA ATCTTTGTAA AAGAAGAATA 4620 

ATGATTAATG TCATTAAACC TAACAAAATG TTCTTAACAT CTCCATAGTC CTTTGCACCT 4680 

TGACCTCCAG CTAAGTAATT CATTGCTACT GGCATTAAAT TGATACCAAT GATAGTAACA 4740 

ACACTACCCG TTACTACTGG TGGGAAGAAT TTTACAAGAT GTGAAAAGAA AGGCGCGATG 4800 

ATAATAACTA ATATCCCTGA TAAAAATAGC GAACCATAAA GTACATCTAT TCCTTTCGTT 4860 

TGACCAATTA AAATCATGGG CGCAACAGCC GTGAATGTAC ATCCAAGAAC GATTGGTAAT 4920 

CCTGTTCCTG TTACTTTATT GGCTTGTAAA AATGTGGCAA CCCCACACAT AAATATATCT 4980 

ACTGTAACTA AGTAAGCGAT TTGTTCAGGT GTAAACTTCA AACTTGTACC AACAATGATT 5040 

GGAACTAAGA TAGCACCTGC GTACATAGCT AAAAGATGTT GAACACTTAG GATTAAATTT 5100 

TTCATTATTC TTCTCCCACC AATGTCACTT TGTTTCCTTC TAGTGAAGCA ACCTTGCAGA 5160 

GAGAAGAAAC TGTTAAACCT GCTTCTTCTA AACGTTGATG CCCATTTTGO AAACTCTTTT 5220

CAACAACAAT ACCAATACCA GCTGTCTTAG CATTCGCTTG CTGTGCX5ATA TCGTATAATC 5280 

CTAATGAAGC ATCACCATTT GCTAAAAAGT CATCGATGAT AAGTACAGTA TCnCTTCTG 5340 

ATAAAAACTC TTTTGAAACA ATGACCGTAC TTGTTTTATT TTTAGTAAAT GAATGAATAG 5400 

ATGTTTCATA ATAACCATCC GTCAAAGTGC TAGGTTTTGC TTTTTTCGCA AATAAACATG 5460 

GCACATCAAA ATGCAGTGCA GCCATGATTG CAGGTGCGAT ACCGGAAGCT TCAATGGTTA 5520 

AGATTTTAGT AATCCCTTTA TCTTTAAATT GCTCGTAAAA AGTGCGACCA ACTTCATTCA 5580 

TTAACTTTGC ATCAATTTGA TGATTTAAAA ATCCATCGAC TTTTAAAATC TTCTCATCAA 5640 

ATTTGTGTGA AACATTTTGC TCTTAAATTG GTGCTAGATA CAAAAAAATC CCCAAACTAA 5760

ATAATAGTTT CAGGGTTTAT GAGTGAACGa ACATGCATAA CX3AATTTGTC ATGCAATCAA 5820 

TGTAAAAGAA GTTTCATCAA ATAAACTGTG ACCATCATAT AAAATGATAT AAATCACCCA 5880 

CCATGGTTAC AATTTAATGG CTGAAGCTAC TCCTAGTATT GTGTTGTTAC TCATAGTCAT 5940 

GTCGTTCAAG GCAACATGGT AGAAACTTCT AAAGCCATAT TCTTTAGATT ATATGAGTTT 6000 

ATGTAAATTA TTTAACGATA ATAGCAAATT TTCGGCATTT TTTCAATAAC TGCTTAGGTA 6060 

ATCTTTTAAT AGTTTT 6076 
(2) INFORMATION FOR SEQ ID NO: 307: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6136 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 307:

ACCGCCGATT GATAGCTTTA CTGCTGCGAT AAAACTGCCT TCAACAATTG GTGCATCAAC 60

TTTTAACACA CGATGATTAC CATCATACAT TTCAATTGCC ATATCTACAT TCATTTCTGA 120

AGATCCAATA TCGTAAAAAC ACAATGCATC ATCCTCTAAT TTAGTCAAAA CTTCTTGGAT 180

GATATCAAAT GAAGTTCCAA TTGAACCATC TGGTAATCCC CCGATTGGTA TAATATCAAC 240

GTCACCTGCC ATTTGCTTTA ACAAAGATTT TGTACCACTT GcAATTTCTT TACTGTGGCT 300

AACAAGTATA ATTTTAGGCA TTATCTTCAT CTCCAATCAA AGCGTTTAAA ATATAAACCA 360

TACTTTGAGC ACCTGGATCA ATATAACCTT TTGATTCTTC TCCAAAATAT GCAGCTCTAC 420

CTTTCGTTGC TACCATATCT TTTGTATTAT CTGCTAATTG CTGTAAATCA TTGAATGTTA 480

AAGTTTCACC ATTTTTAAGC TTCTCTGCTG CTCGCGCTAC TACATCATAC ATTGTCTTTT 540

CATTTAAAGT AACTTTACCA OGTGATTCAA CCGCTTCGGC AAATGCCTGA ATTAGTGTAA 600

TGAAATCTTG ATTATCCATA TCATCTTTGG TGACTGCAGA CATTTTAACA AAGCTAAAGC 660

CATACAGTCG TCCTGATGCA CCCCCAACAT TTGACATCAA TGCCATACCA GTTGATTTGA 720

ACAATGATTO CATTGAGCTA TCATCAAGTT TGTCTTTAAa ACTACTAAAC CCACGAACCA 780

TGTTAACCCC GTGGTCACCA TCACCAATTG CTCGATCTAA TTCAGTTAAT TCAGATTCAT 840

GTTTTTTAAA CGTTTCTTCT AAATTTAATA AACGTGCTTT CATATCATTC ACTTTCATTT 900

GTGCCACCTC ATAAATGTAT ATTTATTCAT ATTCACTTCT TATTTAAAGT ATTGACTTGT 960

ACCTTGCATG TCTAAAGATG TCATATAATC ACCAACAAAC CATTTAGCAA CATTAACCGT 1080

TCTTGCAGCT AAATTTTGTT GAATATATTT AGTTACGATA TTTAATTCAG ATAACGGCGT 1140 

ACCACCCATA CCATTTACCA TTAATATGAC ATCATTGGCA GTAACTTCTT TATACAATTC 1200 

GTCTAACAAT GTTCCAACAA TATGATCAAT ATCCTTTACT TCTTCCCTAT GAATACCTTT 1260 

TTCACCATGT ATACCAATAC CGATTTCCAT TTTGTCGTCT TCAATATCAA AGCCATATTT 1320 

TCCAGTAGTT GGAACAAGCG GTGGCTCAAT TGCCATACCA ATACTTTtAA TTTCAGGTAA 1380 

CAACGCTTCT ACAOGCGATT TTATCTCTGT TAATGAATAA CCTTTTTCAO CAAGATAACC 1440 

GGCAAGCTTA TGAACAAACA CTGTTCCTGC AACACCACGA CGTTGTACTT CXSTTTGTCAC 1500

AGCAATGTCG TCACX5AACAA TAACAGTTTG AACATTTATA CCTTCCATTT CTGCAAGCTC 1560 

TTGTGCCATT TCGAAATTCA TCACGTCACC TGCATAGTTT TTTACAACTA GTAATACACC 1620 

ATCACCAGTA TCTACTGCTT TAATAGCTTC TAATATTTTA TCAGGTGTAG GTGATGTAAA 1680

TACTTCGCCA CAAACCGCTG CATCTAGCAT ACCTTCTGCA ACAAAACCGG CATGCGCAGG 1740 

TTCATGTCCG CTTCCACCTC CAGAGACTAT TGCTACACCA TGTTCTTTCT TAGCTTTTTT 1800 

TACAATAACT GTATTAGCAA TCAGATCTAA CTCTGQGTGC GCAATTAACA ATCCTTCAAG 1860 

CATATCAGTT AAAAATGTTT CTTTTTTATT GATTAACTTT TTCATCATGT TGTACCTCCT 1920 

TGGTATTATC AGTCATAGTA TAAAACAGAA TAAATGAATG CGCTATCATA AAGAATTAAC 1980 

CTGATACCGT TATCAAATAT ATTTTTGATA AGATCCTCTT GATAACTTAA TGGTTCATTA 2040 

TTGAAAAAAT AAATAGTCTT GAGTGGCTCA TTAAATAAAA AATGCTATGA AAAGCCTTAT 2100 

AATAAAGTGC CTTTCATAGC aATAAGTTGT GTCCATTGAC ACTATACATT TTTCGTTTTG 2160 

TACATTAAAT ATAAGAAATA CGGTGCACCA ATAATTGCTA CGACAATACC TGCTGGAACC 2220 

CCAQCTGGTT GTAATACAAT TTTGCCAATT GTATCAGCTA TAACAAGTAA ACATGCCCCT 2280 

ACTAAAATGG CAATTGGTAA AAACAACTGG TGACGTGGTC CAACGATACX3 TTTGGGAATA 2340

TGCGGACCCA TTAATCCGAT AAACGAAATT GAACCTGCTA cTGCTACAGC AGCAGATGAT 2400 

AACATCACTG CGATAAAGAA TAATATTAAA CGTTCTCTGC TTAACCTTAC ACCTAGACCT 2460 

CGTGCAATAT TATCACCCGT ATGAATAATA TTTAGTGTAT TCGATTTAAA TAGTAAGTAA 2520 

GGAATAATAA TCAACACCCA CGGTAAAAAT GCAATGACAA ATGGCCATTC GTCACCCCAA 2580 

ATATTACCTG CAAACCAAGC AGCGATGAAA TCAGATTGCT TATCATCAAA TTTTGACATA 2640 

ATTGTAATTG AGCCACCATA TAATGCTGTT TGTAAACCTA CACCTATTAA TACCATACTC 2700 

QCAGGTGTAA CACCTTCATT TTTATTAAAA CTGaAAATAA AAATAATCaA TGCAGTGgTG 2760 




EP 0 786 519 A2 



CCAATTGCAA TAAATAATGC GATTGCAAAT CCGCCACtGC GTTAATACCT AATATACCTG 2880

GTTCAGCTAT TGGATTTTTT GTGACACTTT GCACTATTGC ACCACTAATA CTAAgCtGCG 2940

CCAGCCAAAA TAGTAATCAT CATCCGAGGT AACCTGAAAT CTAATAAGAT TAACTCATCA 3000

ATGGCATCAC CTTGTCCAAT TAAAGTTTTG AAAAATCTTT CAACAGGTAT GTTGTATTCA 3060

CCTGAGGTAA TACTCCAAGT ACAACCTAGA AATAGTAGAA TGCTAAAAAC AGCCAGTGCT 3120

ATCAATTGTC TGCGTTTATT ATTTGAACTA ATCATATTGA GCGTCCTCCT TTTTTAACTA 3180

AATATAAAAA 6TAAGGAACA CCGATAAATG AAATGATTGC ACCAACAGGC GCTTCTCCTA 3240

AATATCGTGC TATCACATCG GCAACAAGCA CGAGTATCCC ACCTAACAAG GCTGTTAATG 3300

GTAGAATTTT AGCATAATCA GTTCCAATTA AAAATCTTGC TATATGAGGT ACCATCAAAC 3360

CTACAAATGC AACTTGTCCA GCGATAGCAA CTGCAATACC TGCTAGAATC ATAGCAATAA 3420

TTAAACATAT 6CCTCTGATC ATTGTTACAT TTTGACCTAA ACCTTTAGCT AATGATTCAC 3480

CAAGATTTAA AATGGTAAGT TGTTTACTAA TTGTTAATAT AATGAATAAC GCAATACCAA 3540

TTAATGGAAT TGCCCACTTA AGGTX?rGACC ATGTTGTGCC TGAAACGCCT CCAGCAGTCC 3600

AAAATGTTAC TGTTTGATTT AGTCTAAAAG CTAATGCAAT ACCTTGACTT AGCGCTGTTA 3660

ACATAGCACT TACFGCTGCA CCCGCTAAAA TAATACGCAT CGGATTAAAT CCATCACGTC 3720

TAGATCGGCC TATCATTAAT ACAATAGCAC CTCCTAGAAT AGCACCTAAA AATCCAGCAA 3780

ACATCAATAT TAAAAATGAA GTGTTTGGTA AAACTGCATA TGTTAATGCT AAAGCAAATG 3840

AAGCACCTGA ATTTAAACCT ATGAGCGCCG GATCAGCAAG ACCATTACGA GTAACACCTT 3900

GTATAATCGC ACCAGAAACT GCAAGCGCCA TACCTACAAT TACTGCTGCT ATATTTCTGG 3960

GAATCCTAAT CTCATTGATG ATGTTTTGCT GTTGATTGCT AGGATTATAA TTAAAAATAG 4020

CCTCTATAAT TGTAGAGGCT TGAATTTTGG CGTCACCTAT TAATGTAGAA ATAAATAGTG 4080

T6ATTAGTAG TATCATACTT AAACCTATAA TATAGGATAA AAACTTCAAT GGCGTTGGGT 4140

TCTCTCTATT TGTCATGTTA ATTGTCCTTT TTATCATATT AACTTACTTA ATTAAGAATA 4200

AGCTCTGCGA CATAAGTCAT AAGTTACCAG TAAAGGTTTT CCAGTTTTAG GATCTTTACT 4260

TAAAACAACA TCAATATTAA AAACTTTTTC TAATATTTCC TGTGTTAATA CGTCTTCTGT 4320

TGAACCTGTA GCGATGATAT CCCCTTCTTT CATCGCAATA AGATGATCTG AGAAACGAAT 4380

CGCTTGGTTG ATATCATGAA GAACCATGAC AATTGTACAA CCTTGTTCCT GATTTAGCTT 4440

CTGAACTAAT TCTAGTATTT CTAATTGATG ACAGATATCT AAATATGTTG TTGGTTCGTC 4500

TAAAAAGATA ATATCAGTTC TTTGTGCTAA TGCCATTGCA ATCCAAACAC GTTGTCTTTG 4560

TGCCCAATCA ATtTCTTTCT TAtCCTCAGC AgTTAATCTA CCAAATCCTT TTTGATGTGG 4680

AAAACGACCA TATGAAACTA ATTCCCCAAC AGTTAAGCCA TCTGCTACTT CaGGTGaTTG 4740 

aGGTAAAaTG GCtATTTTTt TGcAATCyCy TTCGTAGATt GTGtATGAAT ATTTTCACcA 4800

TCTAAAAATA CTTCGCCTTC TTTAACTGCC AATAAACGTG ACAATGCCTT TAGCAAAGTA 4860

GATTTCCCGC AGCCGTTAGG ACCAATGATT GACGTCACTT TGCCATCTGG TATTTCAACA 4920 

TCTAATTTAT TTATAATCGT GTTATCCCCG TAACCAATTT TAACTTGTTG TCCATGCAAA 4980

CGATTCATAA TTTCCCTACT TTCAATAAAA TTCTTTCTGT TTATAAAAAA TAATTTCTAT 5040 

TTTTAAATTA TCAATTTTCA AAGACATCCC AATTGATAAT GATTATCATG AACATCATTA 5100 

TAACATTTTT CAATCTTATT GACTAACATT ACTTTTTAAA TTGGATAGCT CGATTTGTCA 5160 

TGTCTTGTAT ATTACTTTfA TAAAATAAAA AACGCCCACA GATAAGTCTT CATAGTTCAA 5220

AAACTTGTCC GTGGACTTCT ATTTAAGTAT GTQTGCTCAT ACCATTTATT TATTCATCTG 5280

CAAGAAAGCC ATTACCATAG ACATCTCTTA CATCATGAAT TACGAGGAAT GCATCTTTAT 5340 

CGATTTGTTT AATTAATCGC TTTGCTTTTG AAACTTGTGT TTTAGAAATA ACAACGTATA 5400 

AGACATCTTT TTCTTCACGC GTATAATAGC CATGTCCGTT TAAAATGGTT AAACCTCTTC 5460 

CAATTTGCTC GTCTATTGCT TTGGCAAGTT TGTCGGGATT AGTTGAAATA ATCGTCATAG 5520 

CTTTTTTAGT GTTTAAACCT TCTATGACAT ATTCCATCAC TTTTGTTCCT ATATAAAGTG 5580 

ATATTACTGT TACTAATACT TTATCAAGTG GAATAACTGT AAGTGAAATT GCAACAACGA 5640 

TCATATCGAA GAAAAGCAAA GCATATGGCG TGCTTACATC GAGGTATTTC GTTGCAATTC 5700 

TCGCCAAAAT TGTTGTACCT GCTGTTGTAC CGCCTGCAAG GATAATTACT CCGATTCCTA 5760 

GTCCAACGCT TACACCACCA AAAATGGCAT TCACAATGCT GTTTCCAGTT TCTACTTGCC 5820 

ATGATTCTGT TAAACTCAAA AATATTGAAA TAAGAATTGT TACAAGAATA GTTAAGTACA 5880 

TACTTCTCTT ACTCAAAAAt TTATAAcCTA TGGCAATCAA TACTGCGTTG ACCAAGAAGT 5940

TAGTGATGGC TGGTGAAATA TGAAACGCAT AATATAAAAT AATTGCTAAA CCTGTAACCC 6000 

CGCCTTCACC TAAGTTACCA GAAAT^UITAA ATGCATTTAC ACCTGCAGCA AAGATAAATG 6060 

AACCTAAGAC AACTAGTATT AAATCTTTAA CCX5TTTTATT CACGAAACCA TCCCCTTTAT 6120 

ATATTTATTA GACTAT 6136 
(2) INFORMATION FOR SEQ ID NO: 308: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2576 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 308:

GATATCGTAs CTACTGAAAA GTCATCACCA CCATGGAATG ATTTCTTTAA ACGTTTTAGC 60 

TTCTATGCAA TTGCAATTCA ATACTTTGTT GTACAATTTA TCATTACATT ATTCTTAATT 120 

TGGTTACCGA CGTATTTAAC AGAAGTATTC CACGTTAACT TTAAAGAAAT GAGCATTAGT 180 

TCATTACCTT GGTTATTAAT GTTCTTCTTA ATCTTATCAG CAGGTGCAAT TTCTGACCGT 240 

GTATTAGGAT TAGGTCGTTC AAAATTCGTA GCTAGAGGTG TAATTGCAAT TGCAGGATTT 300 

ATTGTGTTTG CAGTTTCAAT TATCTTTGCT GTACGCACAG GAAATTTATA TGTAAGTATT 360

TTCTGGTTAT CACTAGGTCT TGGTGGTATC GGTATTTCAA TGGGTATGAG TTGGGCTGCA 420 

GCAACTGACT TAGGACGTAA CTTCTCTGGT ACAGTATCAG GGTGGATGAA CTTATGGGGT 480 

AATATAGGTG CATTAATCAG TCCGCTATTA GCAGGTCTAT TCGTAGAACA TTTGGGTTGG 540 

ACAATGACAT TCCAATTCTT AATCGTTCCA GCAGTAATCX3 CTGTGATTAT GTGGTTCTAT 600 

GTGAAACCAG ATCAACCTTT AATTGTTAGT GATGATAAAG CAATAGAAAA ATAATTTAAA 660 

CAAGCAGTAA GCTTTCACAT AGTTGGGGCT TATTGCTTTT TTTGCGTTGA AATTGAAACT 720 

TTTTAAAACA GATATGGTTT AAGATGAAAA TGAAGTTATT GAAATGATAT ATGTAAAGAA 780 

ATAAGGTTTT AAAACATTAG TCAGGTAACG CTTGTaAAAG TACATATAAA TTTTAACTAG 840 

CGCAAAGGTG GGCGACCAAA GtTcAACGAT GTTAAATAAC aTTAGrAATT AATTTTAATT 900 

GGACTTTAAA AGTTTTTAAA TTTAGATAAT TGAGCATAAG GTGTTATAAT GACATATGTT 960

GCGTAATTAA AATTTATAGC AACAAATTCA TTTTAACTAT GCTAATAAAA AGATTATGGA 1020 

AATATTTTGA CAAGGAAAGG AGAAGTCGAA ATGACATCTT TTTGACATCA CTCATAAAAA 1080 

TCAAtCGACT TAACTTAGAC TTTTATAAAG GTGTAAGACA GGGACTGTTA ATGATTATTC 1140 

CTGCAATAAT CGGTTACTTA TGTGGTAATT TCCAATTTGG ATTATTAGTT GCAACCGGAA 1200

CACTAGCCCA TATTTATGTT TTTAAAGGTC CXSTCGCX3ATC TAAGCTGCGA ACTGTAATAA 1260 

TTTGTAATTT AGCGTTTGCA ATATGTATGA TGCTTOGTAC GCTAACAGCC AAAACGCCAC 1320 

TC6TTTTTGG AATGACATTA TTAATTGTTA CGGTTATACC ATTTTATATA TTTACTGCCT 1380 

TAAAAATAGC TGGACCGTCA TCGACATTCT TCATTGTGAC ATTCAGTCTA CCCATTAACT 1440 

TACCTATAGC TCCCGAAGAA GCATTATATA GAGGCTTTGC GATTTTAGTA GGCGGTATAC 1500 

TTGCCACTAT GATGGTGTTA ATCACGATCG TATTTTCTAA AAACAAAGCT GAAGAACAAG 1560 

CAATTCAAAA TGATTTTAAA CTCATATCTA AGTTGTTACA CACTTATAAT GATAAATCTG 1620 


TCACTTCTAC TTCAAGTAAC GATAAATTAA GTAGACGTTT CCAAAAATTA TTATTATTAC 1740 

ACACATCTGC CCAAGGGATT TATTCTGAAC TGTTAGAGTT GAACGCTAAA CAAATTCGAC 1800 

CATTGCCAGA TGAGTTAATT GAAATGATGG ATCATATCAT TGCACAACTA GATAATAGTG 1860 

AGGAAAATGT AAGATATTGG CGAAAAGAAG TGACAGTAAC AGAGGAATTT CAAAATTTAT 1920 

TCAACCATAT ATTGAAAATT GATGAAATGG TGCATGCAAA TGAAGCGCGT ATTGCGTATG 1980 

AAGCAGACAT GCGAAAACCT TTATATAGTA AACGCATTTA TCaAAATTTA ACaTTAGACT 2040 

CtATkGTTTT TAGAAATACA TTGAGATATA CAGCGATTAT GATGATAGCG ATATTTATTG 2100 

CGTTAATGTT TGATTTTGAA AAAGCATACT GGATACCGTT ATCTGCACAT ACAATATTAC 2160 

TAGGAACATC AACTATACAT GCAATCGAGA GAGGTATGGC ACGAGGTTTA GGTACTATTT 2220 

TAGGTGTGTT AGTACTTTCA GTCATATTGT TGTTTTCAAT ACCAACACCT GTTGCAGTAA 2280 

TTTTAATOGG CATTGCAOCA TTGTTTACTG AAGCATTGGT GGGAGCAAAT TATGCGATTG 2340

CAGTAGTTTT TATTACAATA CAAGTTATTT TAATGAACGG ATTAGCATCA CAGAATTTAA 2400 

CAATTAACAT TGCGTTTCCA AGAGTTATTG ACGTTGCAAT GGGTATTGTG ATTGCAATCA 2460 

TAGGTTTATT TGTCCTTGGA CAACGTACCG CATCCGCATT GCTTCCTAAT GTAATGGCTG 2520

AAGTTGTTCG TAAA6AAGCA ACGCTCTTTC ATTATTTATT TTCTGAAAAT CAATAT 2576 
(2) INFORMATION FOR SEQ ID NO: 309: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 668 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 309:

CAAGCACATT ACGATGGTCT AATACTTTGC TAATAATTTC TTCTTGTCCT GGTCGAAACG 60 

TTTCATACCC AAAGTAATGC GATAATGTTT GTTGCATCAT AAATTGACCC CTTATTGTTG 120 

TTCTTTAATT TCTTCTAACT CACTCCATCT TGTGATGTCT AAATCATATT GAATTTCAAG 180 

TTGTTCTTTT TCTTCGTTTA ATTCTTTAAT TTTCCCATAA TCTGCACTTG CCTCAATCAT 240

GAGCACATCA ATTTCTTCCA TTCTTACTTC CGCTTGTTCT ATGCGTTTCA TCAATTGTTC 300 

ATATTCTAAT TTTTCTTTAT ATGATAAACC ATTTTTCTTA CGTACAGTTG TAGAAGATTT 360 

AGATTGTTGC TTCAATGTGG ATTTATTTTT ATCTAATQAT TTTTTATAAC TTTCATAATC 420 

TTCAAAAGTT CCGATAATCT TTTCCATCTG ACCATCATGA ATAAACCAAT ATGACTGTGC 480 

AATATAATCT TCAAGTATTG TTAAAGTCTC AGTATCTAAA TCATTTGTCG GTTCATCTAA 600
CAACAGAACA TTTGGCTGGT GTACGAGTAG ACGTAATAAA TACAAACGCT TTTGCTCTCC 660 

ACCAGATA 668 
(2) INFORMATION FOR SEQ ID NO: 310: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12173 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 310:

CATCAGTTTA TTTTGAAAGG CAATGCOATC ATTTCATGTA TTTATGTTGT TTGAAACATC 60

GTTAGATAAC AATAGTGATA TTGCACATTT TAAGCTGAAG ATGGAAAAAT ATTCGATAAT 120

CAAATAAACA ATGAATTTTA GAAGGTACAA TGACGTTCAC TAATTTAAAT ATAGCTGAAT 180

GTGTTGGTGA GTGATGTTCA CTATAGATTT ATATTAATAT ACAAAGACAA AGGTTGTTAA 240

TTTTTATTAA GCGTTAGGTT GAATGTATGA GAATTTTAGA TTTATAATAG AAGATAGAAA 300

OGAAAATTTT TCTTAAAAGC AGTAATGTTG ACTCAAAATA AGCTATAATA ATGACACTTA 360

TAATAAATAT CAATATAGAA TATAACTTTC CAATAATGAC 420

TGAGAAAATC GAAATGTCAG TCTCGAATCA TATAATTAGA AAATTGATTA TTTTCTGTCA 480

ATTTAGGGTT GAACTATACA TATGATATTG TTAGAATATT TTTTAACATT ATATTTTATT 540

GCTTTAAAGT GGAATATACT TGAAATAATT AGTAGAGGTG AGTAAGGATG AGTAATAAAT 600

TAGAATCATA CAGAAGTGAG ATTGTATCAC TGAATCATCA AATTTTAGAC TTATTATCTA 660

AACQTGGTGA ACTAGCACAA AAAATTGGGG AAGAAAAATT AAAACAAGGT ACACGTATTT 720

ATGATCCACA ACGTGAAAAA GAAAT6CTTA ACGACTTAAT CGATAGTAAC AAAGGACCAT 780

TCAACGATAA TACTATTAAO CAATTATTTA AAGAAATTTT CAAAGCCTCT ACAGATTTAC 840

AAAAATCTGA AAATGAAAAA CATTTATATG TATCACGTAA GTTGAAACCT GAAGATACGA 900

TTGTAACATT TGATAATGGG GGCATTATAG GAGACGGCAA TAAATCATTT GTATTTGGGC 960

CATGTTCAGT TGAATCATTT GAACAAGTTG AAGCTGTTGC TAAAAACTTA CATGCTAAAG 1020

GTGAAAAATT TATTCGTGGC GGTGCATTTA AACCACGTAC ATCACCATAT GATTTCCAAO 1080

GCCTAGGT6T TGAAGGACTT AAAATACTTA AACAGATTAA A6ATAAATAT GATTTAAATG 1140


TTGTCAGCGA AATCGTAAAT CCAAATGATT TTGAAGTGGC TGATGAGTAT TTAQACGTAT 1200

AAAAGCCTAT TCTATTAAAA CGTGGTTTAT CTGCTACAAT CGAAGAGTTT GTTTATGCAG 1320

CTGAATACAT TGCTTCACAA GGTAATCAAA ACATTATTTT ATGTGAACGT GGAATCCGAA 1380

CTTATGAAAA GGCGACACGT AACACTTTAG ATATTTCAGC AGTACCAATT TTAAAACAAG 1440

GTACACACTT ACCAGTCATG GTAGATGTTA CGCATAGTAC AGGTCGTAAA GATATCATGT 1500

TACCAACTGC GAArsAgCAT TAGCAGTTGG TGCTGATGGA GTTATGGCTG AGGTGCATCC 1560

AGATCCATCT GTTGCACTTA GTGATGCGGG TCAACAAATG GATTTAGATG AATTCCAAGC 1620

ATTTTATGAT GAATTAAAGC CTTTAGCTGA TTTATATAAC GCTAAAAAGT TAAAATAATA 1680

TTCCAAGGAA ACTATAGACT ACTTAACTAA TATGTCATGT TGAAGTAGAA TATTATCTTT 1740

GAATCGACAA TTTTAAACTT ACAGCCATTC TAAGAGTATA TTACTTTTAG AGTGGCTATT 1800

ATTTTTTGTA TAGAAATAAA QGTATACTGC ACTTAACGAT TGTTATAATA CTTCGACACT 1860

TGTTCAATTT CACAATTATT AAAGATTATG ACTGATAGCA GTAATTAAAA TTATAACTAT 1920

GAATTATCTG TAAAATATAA TAGATTCACA CATTTGTTGC TGAAATGTGA ACATTTTTCA 1980

ACAAATGCAA TT6ATATTTG AAAAGGCTTT CTCAAAACAT TACAATTAAA AATGAAAAAA 2040

GTTTATATAA AATTAAAATA TATCX5TTCGT TATCATTTAG CGTTTGTTTT TATTTCAAGC 2100

TTTTCGCTAA ATTTTTCCAA ACAAAAATAT GTTACTGTAA ATTAAAATAT GGTAAACTAT 2160

GAAAATGAAA TGAAAACATG TTATTATAAT GAATAAAACG TTTACAAGGA GGAAATTATG 2220

ACAGTTACTA TATATGATGT AGCAAGAGAA GCGGGTGTCT CTATGGCCAC AGTGTCGCGT 2280

GTTGTTAATG GGAACCAAAA TGTTAAAGCA GAAACTAAAA ATAAAGTTAA CGAAGTCATT 2340

AAGCGTTTGA ATTATCGTCC AAATGCTGTT GCTAGAGGTT TAGCTAGTAA AAAGACAACA 2400

ACAGTAGGTG TGATCATTCC AGATATATCT AATATCTATT ATTCACAACT TGCTCGTGGA 2460

CTTC^GATA TTGCAACAAT GTATAAATAT CACTCAATTA TTTCAAATTC AGATAACGAT 2520

CCTGAAAAGG AAAAAGAAAT TTTTAATAAC TTATTAAGTA AACAGGTTGA TGGTATTATT 2580

TTCCTTGGTG GTACAATTAC TQAAGAAATG AAAGAATTGA TAAATCAATC ATCTGTACCT 2640

GTAGTAGTAT CAGGAACAAA TGGTAAGGAT GCACATATAG CATCAGTTAA TATTGATTTT 2700

ACTGAAGCTG CGAAAGAAAT TACGGGAGAA TTAATTGAAA AAGGCGCTAA ATCATTTGCT 2760

TTAGTAGGTG GAGAACATTC TAAAAAAGCT CAAGAAGATG TTTTAGAAGG TTTAACTGAA 2820

GTGTTAAATA AAAATGGCCT TCAATTAGGT GATACATTGA ATTGTTCTGG TGCTGAAAGT 2880

TATAAAGAAG GCGTAAAAGC TTTTGCCAAA ATGAAAGGCA ATTTGCCAGA TGCCATTTTA 2940

TGTATCAGCG ACGAAGAAGC AATTGGTATT ATGGATAGTG CAATGGATGC TGGTATTAAA 3000

CCACAACTTT CTAGTGTTAT TCAACCATTA TATGATATCG GTGCAGTAGG GATGCGCTTA 3120

TTAACAAAAT ATATGAACX3A TGAAAAGATA GAAGAACCAA ATGTAGTTTT ACCTCACAGA 3180

ATTGAATACC GAGGAACTAC AAAATAAATT CACAAAATTA GGCATTCATC TAACGACCCA 3240

AATTATATGG GTGTTGGAAG AATGCCTTTT ATTTATCTTT TAAAATCGTT GCAGATTAGG 3300

TTACTTATTG ACGAGTAGAT TCGTACCAAC TCGCTATATG TAAAGCTAAT TTTTTATTTT 3360

TTTCACTAAT TTCTTTTGTG CGGGGGACAT AGGTATAATC ATTTAAACGA TCTTCCCATC 3420

TTTTAGGTAA TAATTCAGAT GAATAATGTT TCCATTTATT AATCCATTCT AACGGTAAAT 3480

AACCACTTTG AATTGGTTGA TCAATTAAAC TTAAGAATAC ATGACTCCAT GCACX3TGGTA 3540

CGACTCTCCA AATATTGTAG CCTCCGCCAC CAAACATAAT TACCTTTCCA TTCGTATAAG 3600

AATCAGCTAA ATATTTTACA AAATATGGAA TTTCATATAA TGAATGTAAC GTACAATTTA 3660

GATGAGTTAG TGGATCACGA TAATGTATAT CGACACCATT TACGCTTAGA ATAATATCAG 3720

GTTTAAAACT CTTTACGACA GGCTCAACTG TTAATTTAAA ACACTCCAAA AATGATGCAT 3780

CTTCTGTATA CGGTTCAAGT GGGACATTTA CAGTGTGTCC ATAGCCXSATA TCTTCACCGC 3840

GCTCAGTATA GTGACCAGAG CCTGGGAAAA GAAATTTTCC GGTTTCATGG ATAGAATAAG 3900

TAGTAACATG GTTATCGGCA TAGAAACTCC ATTGTGTACC ATCTCCATGA TGTGCATCGG 3960

TATCTATGAT TAAAACGCGT TGATTGTATT CTTTAGCTAA GTATTGTGCG GTAATTGCAA 4020

TATCATTGTA TATACAAAAA CCACTTGCTC GACCAGGTTG AGCGTGATGC AAACCACCAC 4080

CTAAGTGACA ACCATTTAAT ACTTTGCCTG ACATAATAAG ATCTGCTAAA GTTAAAGCGC 4140

CTCCAACAAT TGTGGCACTA TGGCGGTGCA TATGCTTAAA TTGACCATTC TCTTCATCAT 4200

TTAATCCATA TTTCTTAGCC TCATCTTCAC TGATAATGCC ATGTGAAGCA TGCTTAATAG 4260

CTTOGACGTA ATCATATTTA TGAATTAACA TTAATTCGTC ATCTGTTGCA ATTCTAGGTT 4320

GTACTATTTG TTCTGGAGAC AATAAATTTG CATTCAAAAG TAGCTCTGTT GTTAATTTTA 4380

AACGCATTTG ATTGAAGGGA TGTTGGTCAT GAAATCGATA TTGTAATAAC TTATCTGAAT 4440

AAACATATGC AGTTTTTGAT GAATGTTGTT GCATATAATC CCTCCGATAT TCCAAAAATT 4500

AAAAGAAAAA CCGATTCATA TAACGAATAT CATCAAACGC TTGTTGCTGT TCTAATGTAA 4560

TGTTTTTGCC AATTCTTGCC ATTAAACAAT TAGCTGGATG ACTTGTTATT TCTCGATCAT 4620

CTGTAGCXUU^ TATTTCAAGT CCACCAGTTG CCATTAACCG CTGCATTAAT TTTTTATAGT 4680

CAAATACATC TAACTTTGAA TTTTTTAAAT CCCAATGCCA GTAATATTCT GTAGTTATAA 4740

CGATATAATT CTCGAATTCT GGTGTAGAAA GGCTAAGTTG TATCAGCTTT TCTGCAAGTT 4800

TACCTGTAGA CCAGCGCTCA ATTTCATCAG GATAGTGGAA AGTGACATAA CCCACAATGA 4920

GTTGATCTTG GCGAGCAACA TAAATTCTAC CTTCTGGTAA TGTTGTAATT TCTAACAAAG 4980

CTTTATACTG ATCTTCAGCA TCTCTAAATG CX3GTTAAATG CGCATCGAAA GTAAGCGCTT 5040

TCAAATCTtC GTGTGTTAAA GGACCTTCAA TAACAAATTG CTTGTCATGA ATGTAATAAT 5100

CTTCGGATTG ATACGTCTTT AAATGATTCA TATTTTCAAC TCCTCAATCG ACGTCGTGAT 5160

TGTATTAAAT TCATTATATA GAAaATTTAC AATAATTAAT ACTAGAAAAA GGATAAAAGT 5220

AAAAATTTTG AATAATTAGA AATGTTATGT ATAATATTGA GAAAGAAAGC GTTTTCACAT 5280

AACAAAGGGG 6AGTTTCAAA TGAAAGTCGA AGTTTATAAA GGAGCGCAAG GTAAACATAA 5340

CCTTAAAGAT TATGAAGAAA CATATAATAC TTITGATTGG AAAGACGTAO AACAAGCATT 5400

TTCTTGGAGT GAAACTGGAA AAATGAACAT GGCATATGAA TGCATAGATC GCCATGTAGA 5460

TCAAGGATTA GGGGATAAAA TAGCGTTAAA TTACAAAGAT GAGCACAGAA AAGAATCGTA 5520

TACTTATAAA GATATGCAAC GGTTATCTAA TAAAGCAGCG AATGTTTTGT CTGAACATGC 5580

AGAAGTTGAC AAAGGTGACA GA6TATTTAT ATTTATGTCG CGTACACCTG AACTATATTT 5640

TGCGTTGTTA GGTGTTTTAA AAATTGGTGC AATTGTTGGG CCGTTATTTG AAGCATTTAT 5700

GGAAAAGGCA GTTGCGGATA GATTAGAGAA CAGTGAAGCT AAAGTGTTAA TTACTAATAA 5760

GGCATTGTTA CCTCGAGTAC CTGTAGATAA ATTACCAAAC TTGAAAAAAA TTGTTGTCGT 5820

AGATGAGGAT GTAGAAGACA ATTACATAGA CTTCATTAGT TTGATGGAAA CTGCTAGCX3A 5880

TGAATTTGAC ATTGAATGGT TAAAGTCGGA TGATGGTTTG ATTTTACATT ATACATCAGG 5940

TTCTACTGGG CAACCTAAAG GtGTATTGCA TGTTCAACAA GCAATGTTAG TGCACTATAT 6000

TTCTGGAAAA TATGTATTAG ATTTACAAGA AGATGATGTT TATTGGTGTA CAGCAGATCC 6060

AGGrrGGGTT aCAGGAACAT CTTATGGTAT TTTTGCACCA TGGTTAAATG GCGCTACAAA 6120

TTGTATAGCT GGTGGTCGCT TTTCGCCAGA ACAGTGGTAT AGTATGATTG AAGATTTTAA 6180

AGTGACOATT TGGTATACGG CACCAACAGC TTTAAGAATG TTAATGAGTG CTGGTGACX^A 6240

TATTGTTGA6 AAATATGACT TGTCATCGTT ACGTTCGATT CTATCAGTAG GTGAGCCTTT 6300

AAATCCTGAA GTTATAAAAT GGGCGAAAAA AGTATACGGT TTAACGGTGT TAGATACTTG 6360

GTGGATGACA GAAACAGGTG GACATATGAT TGTTAACTAT CCAACQATGG ACGTcAAGCT 6420

TGGCTCAATG GGCAAACCAT TACCTGGTAT TCAAGCTGCA ATTATCGATG ATGCAGGGAA 6480

TGAATTACCA CCAAATCGAA TGGGCAACCT TGCTATAAAA AAAGGCTGGC CATCAATGAT 6540

GTATCGTATC TG6AAGAATC CAGAAAAATA TAAATCATAT TTTATTGGAG ACTGGTATGT 6600

TGATGTAATT ATGACAGCTG GTGAACGAGT TGGACCATTT GAGGTTGAGT CTAAATTGGT 6720

TGAACACGAA GCAGTTGCCG AAGCAGGAAT TATTGGTAAA CCTGATCCGG TTCGCGGTGA 6780 

AATAATTAAG GCGTTTGTTG CACTGAGAAA AGGATATGAA CCAACAGACG AATTAAAAGA 6840

AGAAATTCGT ATATTTGTTA AAGAAGGTTT GTCGGCACAT GCAGCACCAC GTGAAATCGA 6900 

ATTTAAAGAT AAATTACCTA AAACACGGTC AGGTAAAATT ATGAGACGTG TATTAAAAGC 6960 

TTGGGAATTA AATTTAGATG CTGGGGATTT AAGTACAATG GAATAATGAC ATGAATGTTA 7020 

TTGAAGATTT TTTTCGAAGA ATAAAGQGTG ACAACATATT TCATGTCAAT GTTTAAATAA 7080 

TCGTTTACTT TACGATAAGC AATATAAAGA ACTGTTAACT TGTGTCATAT CATTTCGTAG 7140

AAAGCATTTG AAAATGATGA CATAACAATA ATGGCATATC TTTATATTGC TTTTTATTTT 7200 

TAATATGATC TTTGGAAGAT GATTATTTTA AATAATAGAA AAATATAGTT ATCAATAGTA 7260 

TCAAGCGCTA AAAGTTGTAT AATACAAAAC TTTAATAAGT GAATTTATTG CAAAAATGAA 7320

AGCGCTAACC CGATTTAGTC GACAAGTTTT TAACAGTTCG TTATTATATG AATGTAAGTA 7380 

AAAATTTCTT AGCTACAACT TACATATTAT AAATGCATAA ATTAAACAAA AAGGGGCGAA 7440 

AAAAGTTGAC TCATTTATCA GATTTAGATA TTGCGAATCA ATCAACACTA CAACCAATTA 7500 

AGGATATTGC TGCATCAGTA GGTATTTCAG AGGATGCATT AGAACCTTAT GGTCATTACA 7560 

AAGCTAAAAT CGACATTAAT AAAATTACGC CAAGAGAAAA CAAAGGGAAA GTTGTTTTAG 7620 

TAACTGCGAT GAGCCCAACA CCAGCTGGTG AAGGTAAATC AACGGTTACA GTTGGTTTAG 7680 

CTGATGCATT CCATGAGTTA AATAAAAACG TTATGGTTGC ATTAAGAGAG CCTGCTTTAG 7740 

GACCAACATT TGGTATCAAA GGTGGTGCGA CTGGTGGTGG TTATGCGCAA GTCTTACCTA 7800

TGGAAGATAT CAACTTACAT TTCAACGGAG ATTTCCATGC GATTACAACT GCAAATAATG 7860

CATt&TCTGC GTTTATCGAT AATCATATTC ACCAAGGTAA CGAATTAGGA ATCGATCAAA 7920 

GACGTATTGA GTGGAAACGT GTATTAGATA TGAATGATC6 TOCACTTAGA CATGTAAACO 7980

TTGGGTTAGG TGGACCTACA AATGGTGTAC CACGTGAAGA TGGCTTTAAT ATTACAGTAG 8040 

CGTCTGAAAT TATGGCGATT TTATGTTTAA GTAGAAGTAT TAAAGACTTA AAAGATAAAA 8100 

TTAGTCOTAT TACTATTGGT TACACTAGAG ATCGCAAGCC AGTTACAGTT GCAGATTTAA 8160

AAGTGGAAGG TGCACTTGCA ATGATTTTAA AAGATGCAAT AAAACCAAAC TTAGTACAAT 8220 

CAATTGAAGG GACACCTGCA TTAGTTCATG GTGGACCATT TGCGAATATC GCACACGGTT 8280 

GTAACTCAAT TTTAGCAACT GAAACAGCAC GTGATTTAGC TGATATCGTT GTAACGGAAG 8340 

CTGGATTTGG TTCAGACTTA GGCGCTGAAA AATTCATGGA CATTAAAGCG CGTGAAGCAO 8400



GTGTAGCGAA AGATAATTTA AAAGAAGAAA ATGTAGAAGC AGTAAAAGCA GGAATTGTTA 8520 

ATTTAGAGCG TCATGTTAAT AATATTAAAA AATTCGGTGT AGAACCGGTT GTTGCAATTA 8580 

ATGCATTTAT ACATGATACC GATGCAGAAG TAGAATATGT AAAATCTTGG GCTAAAGAAA 8640

ATAACGTACG AATTGCCTTA ACTGAAGTTT GGGAAAAAGO TGGTAAAGGT GGCGTTGACT 8700 

TAGCAAATGA AGTATTAGAA GTCATTGATC AACCTAATTC ATTTAAACCT TTATATGAAT 8760

TAGAATTACC ATTAGAGCAA AAGATTGAAA AGATTGTGAC TGAAATCTAT GGCGGTTCAA 8820 

AAGTAACGTT TAGCAGTAAA GCGCAAAAAC AATTAAAACA ATTTAAAGAA AATGGTTGGG 8880

ATAATTACCC AGTATGTATG GCGAAAACAC AATATTCATT CTCAGATGAT CAAACGTT6T 8940 

TAGGTGCACC ATCAOGATTT GAAATTACAA TTCXSTGAATT AGAAGCGAAA ACAGGTGCAG 9000 

GATTTATCGT AGCGTTGACA GGTGCAATCA TGACTATGCC TGGTTTACCT AAAAAACCAG 9060 

CAGCATTAAA CATGGATGTT ACTGATGATG GTCATGCAAT TGGGTTATTC TAATAAATCA 9120 

TGTCAATTGT TTAATAAAGA TAAGTAAATA GTTTAATAGA CCGGACTGTT GGAGATGCAT 9180 

TATTTCAGGA GTTCGGTTTT TTGCTGTGCT AAAAATAGAT TCAATTTGGC GAATCTAACG 9240 

ACAATGTTTG AAGGTGGTTA ATTAATGTAT ATGAAGATAA AAAGTQGGCT TGAAGAATAG 9300 

GAAAGCGATG CAATGAATAT TCCATATTAA AAAAAATTAA TAAAATAGGT TGCAATATTT 9360 

AATTGGGATG CGCTACAATT AACACTAATA ATTGATATTG ATAATTATTA TCAATTAAAT 9420 

ATAATCTTAT AGGAGTTGTT AACAACATGA ACAAACATCA CCCAAAATTA AGGTCTTTCT 9480 

ATTCTATTAG AAAATCAACT CTAGGCGTTG CATCGGTCAT TGTCAGTACA CTATTTTTAA 9540 

TTACTTCTCA ACATCAAGCA CAAGCAGCAG AAAATACAAA TACTTCAGAT AAAATCTCGG 9600 

AAAATCAAAA TAATAATGCA ACTACAACTC AGCCACCTAA GGATACAAAT CAAACACAAC 9660 

CTGctACGCA ACCAGCAAAC ACTGCGAAAA ACTATCCTGC AGCGGATGAA TCACTTAAAG 9720 

ATGCAATTAA AGATCCTGCA TTAGAAAATA AAGAACATGA TATAGGTCCA AGAGAACAAG 9780 

TCAATTTCCA GTTATTAGAT AAAAACAATG AAACGCAGTA CTATCACTTT TTCAGCATCA 9840 

AAGATCCAGC AGATGTGTAT TACACTAAAA AGAAAGCAGA AGTTGAATTA GACATCAATA 9900 

CTGCTTCAAC ATGGAAGAAG TTTGAAGTCT ATGAAAACAA TCAAAAATTG CCAGTGAGAC 9960 

TTGTATCATA TAGTCCTGTA CCAGAAGACC ATGCCTATAT TCGATTCCCA GTTTCAGATG 10020 

GCACACAAGA ATTGAAAATT GTTTCTTCGA CTCAAATTGA TGATG6AGAA GAAACAAATT 10080

ATGATTATAC TAAATTAGTA TTTGCTAAAC CTATTTATAA CGATCCTTCA CTTGTAAAAT 10140 

CAGATACAAA TGATGCAGTA GTAACX3AATG ATCAATCAAG TTCAGTCGCA AGTAATCAAA 10200 
AGGCAACGAC CAATATGAGT CAACCTCCAC AACCAAAATC GTCAACGAAT GCAGATCAAG 10320

CGTCAAGCCA ACCAGCTCAT GAAACAAATT CTAATGGTAA TACTAACGAT AAAACGAATG 10380

AGTCAAGTAA TCAGTCGGAT GTTAATCAAC AGTATCCACC AGCAGATGAA TCACTACAAG 10440

ATGCAATTAA AAACCCGGCT ATCATCGATA AaGAACATAC AGCTGATAAT TGGCGACCAA 10500

TTGATTTTCA AATGAAAAAT GATAAAGGTG AAAGACAGTT CTATCATTAT GCTAGTACTG 10560

TTGAACCAGC AACTGTCATT TTTACAAAAA CAGGACCAAT AATTGAATTA GGTTTAAAGA 10620

CAGCTTCAAC ATGGAAGAAA TTTGAAGTTT ATGAAGGTGA CAAAAAGTTA CCAGTCGAAT 10680

TAGTATCATA TGATTCTGAT AAAGATTATG CCTATATTCG TTTCCCAGTA TCTAATGGTA 10740

CGAGAGAAGT TAAAATTGTG TCATCTATTG AATATGGTGA GAACATCCAT GAAGACTATG 10800

ATTATACGCT AATGGTCTTT GCACAGCCTA TTACTAATAA CCCAGACGAC TATGTGGATG 10860

AAGAAACATA CAATTTACAA AAATTATTAG CTCCGTATCA CAAAGCTAAA ACGTTAGAAA 10920

GACAAGTTTA TGAATTAGAA AAATTACAAG AGAAATTGCC AGAAAAATAT AAGGCGGAAT 10980

ATAAAAAGAA ATTAGATCAA ACTAGAGTAG AGTTAGCTGA TCAAGTTAAA TCAGCAGTGA 11040

CGGAATTTGA AAATGTtACA CCTACAAATG ATCAATTAAC AGATTTACAA GAAGCGCATT 11100

TTGTTGTTTT TGAAAGTGAA GAAAATAGTG AGTCAGTTAT GGACGGCTTT GTTGAACATC 11160

CATTCTATAC AGCAACTTTA AATGGTCAAA AATATGTAGT GATGAAAACA AAGGATGACA 11220

GTTACTGGAA AGATTTAATT GTAGAAGGTA AACGTGTCAC TACTGTTTCT AAAGATCCTA 11280

AAAATAATTC TAGAACGCTG ATTTTCCCAT ATATACCTGA CAAAGCAGTT TACAATGCGA 11340

TTGTTAAAGT CGTTGTGGCA AACATTGGTT ATGAAGGTCA ATATCATGTC AGAATTATAA 11400

ATCAGGATAT CAATACAAAA GATGATGATA CATCACAAAA TAACACGAGT GAACCGCTAA 11460

ATGTACAAAC AGGACAAGAA GGTAAGGTTG CTGATACAGA TGTAGCTGAA AATAGCAGCA 11520

CTGCAACAAA TCCTAAAGAT GCGTCTGATA AAGCAGATGT GATAGAACCA GAGTCTGACG 11580

TGGTTAAAGA TGCTGATAAT AATATTGATA AAGATGTGCA ACATGATGTT GATCATTTAT 11640

CCGATATGTC GGATAATAAT CACTTCGATA AATATGATTT AAAAGAAATG GATACTCAAA 11700

TTGCCAAAGA TACTGATAGA AATGTGGATA AAGATGCCGA TAATAGCGTT GGTATGTCAT 11760

CTAATGTCGA TACTGATAAA GACTCTAATA AAAATAAAGA CAAAGTGATA CAGCTGAATC 11820

ATATTGCCGA TAAAAATAAT CATACTGGAA AAGCAGCAAA GCTTGACGTA GTGAAACAAA 11880

ATTATAATAA TACAGACAAA GTTACTGACA AAAAAACAAC TGAACATCTG CCGAGTGATA 11940

TTCATAAAAC TGTAGATAAA ACAGTGAAAA CAAAAGAAAA AGCCGGCACA CCATCGAAAG 12000
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CATGGTGGGG CTTATATGCG TTATTAGGTA TGTTAGCTTT ATTCATTCCT AAATTCAGAA 12120 
AAGAATCTAA ATAATTAnCT AAATATAGCA TATGTATGAT TAACTTTGTA GAC 12173 

(2) INFORMATION FOR SEQ ID NO: 311:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1316 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 311: 



CAACATTAAT ATTGATATTA AATCTTCCTG GATAACGTGC TTGTTOGAGT GATAAGTATG 60

CACGCACTTG ACTTAACTCT TTATCTAAAG TAATCGTATG TTGCTTAGAG CCTTGTAAAT 120

TCGCTCTGAA AAAATAACTC AATTCTAATA GTAACTCTCG TGCCTTTTCG CTATTTATTC 180

TAACTAAAGC TGAGATCGTG TTAATTGAAT TGAAGAAAAA ATGTGGACTC ACTTGTGCCT 240

GTAATGACTT AATCTCAGCA TCTTTCAATA ACTTACTTTG CGTTTCGGCT TCACCAAGTT 300

CAATTTGGCT ACTAAAAATA TTTGCCAATC CTTCTGCAAG TTGACGTTCC ACAAAAGTTA 360

AATCATTAGG GTTTGTAAAA TACATCTTCA ATGTACCGAC GATAGAACCA TGCATCTCAA 420

GTGGTATCAC GATAGCTGCT CTAAGCGGGC AATTCGGATG ACTACAACCA ATCTCTTCTT 480

TAGTATGAAC TTCTTTCAAC TTTCCTGATT TCAATACATC TTTAGACAGA CTTGTTAATA 540

TTTCATTTGT TGGTATGTGA TGATCACTAC CTGCACCTAC ATGCGATAAG ATTTCATTTT 600

TGCTTGTAAT TGCTACGGCA GATACTTTCA TTAAATTTTT AATAATCATC GCAATTTGCT 660

GTGCCGATTC TCTATTCAAT CCTTCTTTAA AATACGGCAA TGTCTGGTTC ATCAATTGCA 720

GTACATCATG TGTTTGAACA GCCTTCATTT GCTCCTCTTG CTTTAATGTT GAAATGATAA 780

TAGACATAAA AATCGCCGTA CCAACGCTAT TAACAATAAT CATTGGTAGT GCAATTAATG 840

ATATGAGGTC AACC6CATAT GCTTTGTCGT GGGAAAATGT TAAAATGCTC AACATTTGAA 900

TCATTTCCAT AACAATTCCA ATCATGGCAC TTTTCGCAAT ACTCGGGTAA CGCTTGCGTC 960

TTTGAGCTTG TAAGCCAAAA TAACCAGCAA TTATACCAAT AAATATAGAT GAGATAAGAT 1020

AAACTTGTGC ATCCQCCCCA CCCATATACA CTCTGAAAAT ACCTGAAATA ACGCCAACAA 1080

ATAGACCTAC AAAAGGGCCA CCAACTAATC CTGCGACACC TATOGTTAAT ACACGTGTGT 1140

TAGCTAAAGA TACATCATCA TCTAAACGGA AGTACACACT TCCTGACAAA CTATGTTGAT 1200

GATCGATGAC GATACCAGTT AAATTAGACA TTAAGGCAAA CAAACTGAAA ATAATACATA 1260
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(2) INFORMATION FOR SEQ ID NO: 312: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 7972 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 312:

TATAAATATT ATTTTATTAT CGTTTATCGC TATTTGTGTT ACATAATCAA AACCATAAAT 60 

TCTTACTCAT TCAGATTTAC CCAATATTTT TACTTTTATA ATGTAATGCG TTTTATCCAA 120

GTTATTTTTT AAAAATAAAT ATTGAATTnG GGGCTGnTTT CATGTCATTA AGAGATGAAG 180 

CATTGGAAAT GCACAAACGT AATCAAGGTA AATTAGAAGT TAAACCAAAT GTAAAAGTTA 240 

CTAATAAAGA GGAATTAAGT TTAGCATACT CACCTGGCGT TGCTGAACCQ TGTAAAGATA 300

TTTATGAAGA TAAAAGAAAA GTATATGATT ACACAATTAA AGGAAATACA GTTGCAGTTA 360 

TTACTGATGG AACAGCGGTA TTAGGTTTAG GTAACATTGG ACCTGAAGCA AGTATTCCTG 420 

TAATGGAAGG TAAAGCAGTA TTATTCAAAA GCTTCGCTGG TATCAATGGG GTGCCTATTG 480 

CGTTAAATAC AACTGATACC GAAGAAaTCA TTAAAACAGT TAAGTTGTTA GAACCTAATT 540 

ATGGTGGTAT TAATTTAGAG GATATTTCGG CACCACGTtG TTTTGAAATT GAAGAACGAT 600 

TGAAAAAAGA AACTAATATT CCGGTATTCC ATGACGATCA ACATGGTACA GCAATTGTAA 660 

CATTGGCAGG TTTGGTAAAT GCATTGAGAG TTGTTAACAA AGATATTGCT AAAATAAAAG 720 

TTGTACTAAA TGGTGCTGGT GCAGCAGGAA TAGCCATTGT TAAATTACTA TACGCGTATG 780

GTGTAAGAAA TATGGTTATG TGTGACTCAA GAGGCGCAAT TTTTGAAGGA CGTTCATATG 840 

GTATGAATCC TACGAAAGAT GTTGTAGCAA AATGGACAAA TAAAGATAAG ATTGAAGGGT 900 

CTTTAGAAGA AGTCGTAAAA GACGCAGATG TATTTATCGG GGTTTCTGTA GCTAATGCGC 960

TGTCACAAGA TATGGTTAAG AGTATGGCAG ATAATCCAAT TATATTTGCA ATGGCTAATC 1020 

CAAATCCTGA AATAATACCT GATGATQCCA AAGCGGCAGG TGCACGAGTT GTTGGTACAG 1080 

GACGTTCAGA CTATCCTAAC CAAATTAATA ATGTATTAGC TTTCCCTGGT ATTTTTAGAG 1140 

GTGCATTAGA GGTTGAAGCT ACACATATAA ATGAAGAAAT GAAAAAGGCA GCTGTAGAAG 1200 

CGATTGCTGA TTTAATCGAT AGTTCTGAAT TAAATGAAGA CTACTGTATC CCAGGACCGT 1260 

TTGATAAACG TGTAGCGCCA TCAGTTGCTC GTAATGTTGC TAAAGCGGCA ATGGAATCTG 1320 

GAGTAGCTAG GATTGAAGTT GATCCGCAAG ATGTGTATGA TAAAACAATG AAACTTACAG 1380 

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ATTAAAATGA TGAAAATGAA AGTTTATGAT AAACATTCAA CAGTCAAACG AATATAAATC 1500 

AAATAAATTT AAACCCGTTT TTAACTGGTC AAGTTCAGTT TAAGGCTCTA AATGGTTAGA 1560 

ACAGAGGTTA TTTGGAGGTT TTCCTATGTT TAAAGATTTT TTTAATCGAA CAAAGAAAAA 1620 

GAAATATCTT ACAGTACAAG ACTCTAAAAA TAATGATGTG CCTGCAGGTA TTATGACTAA 1680 

GTGTCCAAAG TGTAAGAAAA TTATGTACAC AAAAGAATTA GCTGAAAATT TAAATGTGTG 1740 

CTTTAATTGT GATCATCATA TTGCTTTAAC TGCGTATAAA CGTATAGAAG CAATTTCTGA 1800 

TGAAGGATCA TTTACAGAAT TCGATAAGGG AATGACCTCT GCGAATCCAT TAGATTTTCC 1860 

aAGTTATTTA GAAAAAATTG AAAAGGACCA ACAAAAGACA GGTCTTAAAG AAGCAGTTGT 1920

GACTGGTACA GCACAACTAG ATGGTATGAA ATTTGGCGTT GCTGTCATGG ATTCACGTTT 1980 

TAGAATGGGA AGTATGGGAT CCGTTATCGG TGAAAAGATA TGTCGCATCA TTGATTACTG 2040 

CACTGAGAAC CGTTTACCAT TTATTCTTTT CTCTGCAAGT GGTGGTGCAC GTATGCAAGA 2100 

AGGTATTATT TCCTTGATGC AAATGGGTAA AACCAGTGTA TCTTTAAAAC GTCATTCTGA 2160 

CGCTGGACTA TTATATATAT CATATTTAAC ACATCCAACT ACTGGTGGTG TATCTGCAAG 2220 

TTTTGCATCA GTTGGTGATA TAAATTTAAG TGAGCCAAAA GCGTTGATAG GTTTTGCAGG 2280 

TCGTCGAGTT ATTGAACAQA CAATAAACX3A AAAATTGCCA GATGATTTCC AAACTGCAGA 2340 

ATTTTTATTA GAGCATGGAC AATTGGATAA AGTTGTACAT CGTAATGATA TGCGTCAAAC 2400 

ATTGTCTGAA ATTCTAAAAA TCCATCAAGA GGTGACTAAA TAATGTTAGA TTTTGAAAAA 2460 

CCACTTTTTG AAATTCGAAA TAAAATTGAA TCTTTAAAAG AATCTCAAGA TAAAAATGAT 2520 

GTGGATTTAC AAGAAGAAAT TGACATGCTT GAAGCXTTcAT TGGAACGAGA AACTAAAAAA 2580

ATATATACAA ATCTAAAACC ATGGGATCGT GTGCAAATTG CGCGTTTGCA AGAAAGACCT 2640 

ACGACCCTAG ATTaTATTCC ATATATCTTT GATTCGTTTA TGGAACTACA TGGTGATCGT 2700 

AATTTTAGAG ATGATCCAGC AATGATTGGT GGTATTGGCT TTTTAAATGG TOGTGCTGTT 2760

ACAGTTaTTG GACAACAACG TGGAAAAGAT ACAAAAGATA ATATTTATCG AAATTTTGGT 2820 

ATGGCGCATC CAGAAGGTTA TCGAAAAGCA TTACGTTTAA TGAAACAAGC TGAAAAATTC 2880 

AATCGTCCTA TCTTTACATT TATAOATACA AAAGGTGCAT ATCCTGGTAA AGCTGCTGAA 2940 

GAACGTGGAC AAAGTGAATC TATCGCAACA AATTTGATTG AGATGGCTTC ATTAAAAGTA 3000 

CCAGTTATTG CGATTGTCAT TGGTGAAGGT GGCAGTGGAG GTGCTCTAGG TATTGGTATT 3060 

GCCAATAAAG TATTGATGTT AGAGAATAGT ACTTACTCTG TTATATCTCC TGAAGGTGCA 3120 

GCGGCATTAT TATGGAAAGA CAGTAATTTG GCTAAAATTG CAGCTGAAAC AATGAAAATT 3180 

GGTGCACATA AAGATATTGA ACAGCAAGCT TTAGCTATTA AATCAGCGTT TGTTGCACAG 3300

TTAGATTCAC TTGAGTCATT ATCACGTGAT GAAATTGCTA ATGATCGCTT TGAAAAATTC 3360

AGAAATATCG GTTCTTATAT AGAATAATCT ACTTGAGCAT TTTTATGTTA AATCGATACT 3420

GGGTTTTACC ATAAATTGAA GTACATTAAA ACAATAATTT AATATTTAGA TACTGAATTT 3480

TAACTAAGAT TAGTAGTCAA AATTGTGGCT ACTAATCTTT TTTTAATTAA GTTAAAATAA 3540

AATTCAATAT TTAAAACGTT TACATCAATT CAATACATTA GTTTTGATGG AATGACATAT 3600

CAATTTGTGG TAATTTAGAG TTAAAGATAA ATCAGTTATA GAAAGGTATG TCGTCATGAA 3660

GAAAATTGCA 6TTTTAACTA GTGGTGGAGA TTCACCTGGA ATGAATGCTG CCGTAAGAGC 3720

AGTTGTTCGT ACAGCAATTT ACAATGAAAT TGAAGTTTAT GGTGTGTATC ATGGTTACCA 3780

AGGATTGTTA AATGATGATA TTCATAAACT TGAATTAGGA TCAGTTGGGG ATACGATTCA 3840

GCGTGGAGGT ACATTCTTGT ATTCAGCAAG ATGTCCAGAG TTTAAGGAGC AAGAAGTACG 3900

TAAAGTTGCA ATCGAAAACT TACTGTAAAAG AGGGATTGAG GGCCTTGTAO TTATTGGTGG 3960

TGACGGTAGT TATCGCGGTG CACAACX3CAT CAGTGAGGAA TGTAAAGAAA TTCAAACTAT 4020

CGGTATTCCT GGTACXSATTG ACAATGATAT CAATGGTACT GATTTTACAA TTGGATTTGA 4080

CACAGCATTA AATAGGATTA TTGGCTTAGT CGACAAAATT AGAGATACTG CGTCAAGTCA 4140

CGCACGAACA TTTATCATTG AAGCAATGGG CCGTGATTGT GGAGATCTAG CATTATGGGC 4200

TGGATTATCA GTTGGTGCTG AGACAATTGT AGTTCCAGAA GTGAAAACAG ATATTAAAGA 4260

AATAGCTGAT AAAATTGAAC AAGGTATTAA ACGTGGTAAG AAACACTCAA TCGTTCTTGT 4320

AGCAGAAGGT TGTATGACTQ CGCAAGATTG TCAAAAAGAA TTATCACAAT ACATCAATGT 4380

TGATAATAGA GTGTCTGTGT TAGGTCACGT TCAACGTGGT GGTAGCCCAA CAGGTGCGGA 4440

TAGAGTTTTA GCATCACGTT TAGGTGGATA TGCGGTAGAC TTATTAATGC aAGGTGAAAC 4500

AGCTAAGGGT GTTGGAATTA AGAACAATAA AATTGTAGCA ACATCTTTTG ATGAAATTTT 4560

TGATGGTAAA GATCATAAAT TTGATTATAG TCTATATGAA CTTGCTAACA AGTTATCTAT 4620

ATAAGATTTC AGGAGGAATT ATAAAATGAG AAAAACTAAA ATTGTATGTA CAATTGGACC 4680

AGCTTCAGAA TCAGAAGAAA TGATTGAGAA ATTAATCAAT GCTGGTATGA ACGTTGCACG 4740

ATTAAACTTT TCACATGGTA GTCATGAAGA GCATAAAGGT AGAATTGATA CAATTCGTAA 4800

AGTAGCTAAA AGATTAGACA AAATTGtAGC AATTTTATTA GATACAAAAG GTCCAGAAAT 4860

TCGTACGCAT AATATGAAAG ACGGTATCAT TGAACTTGAA CGTGGCAACG AAGTTATTGT 4920

TAGCATGAAT GAAGTTGAAG GAACACCTGA AAAGTTCTCA GTAACATATG aAAACTTAAT 4980



EP 0 786 519 A2 



TAAAGATATT GACCATGCTA AAAAAGAAGT TAAATGTGAT ATTTTAAACT CTGGTGAGCT 5100

TAAAAACAAA AAAGGTGTTA ACTTACCTGG CGTAAGAGTA AGTTTACCTG GTATTAGAGA 5160

AAAAGATGCT GAAGATATCC GTTTCGGTAT TAAAGAAAAT GTTGACTTCA TTGCAGCAAG 5220

TTTCGTACGT CGTCCTAGTG ATGTTTTAGA AATTCGTGAA ATTTTAGAAG AACAAAAAGC 5280

TAACATTTCA GTATTCCCTA AAATTGAAAA CCAAGAAGGT ATTGATAATA TTGCGGAAAT 5340

TCTTGAAGTG TCTGATGGTT TAATGGTTGC ACGTGGTGAC ATGGGTGTTG AAATTCCACC 5400

TGAAAAAGTA CCAATGGTTC AAAAAGATTT AATCAGACAA TGTAACAAAT TAGGTAAACC 5460

AGTTATTACA GCTACACAAA TGTTAGATTC TATGCAACGT AACCCACX3TQ CTACACGTGC 5520

AGAAGCTA6T GACGTTGCCA ACX3CAATCTA TGATGGTACA GATGCAGTAA TGTTATCTGG 5580

TGAAACTGCT GCTGGTTTAT ATCCTGAAGA AGCTGTTAAA ACAATGAGAA ATATTGCTGT 5640

ATCAGCTGAA GCAGCCCAA6 ATTACAAAAA GTTATTGTCA GATCGTACTA AATTAGTTGA 5700

AACTTCATTA GTGAATGCTA TCGGTATTTC GGTTGCACAT ACAGCTTTAA ACTTAAATGT 5760

TAAAGCAATT GTAGCTGCTA CTGAAAGTGG TTCAACGGCA CGTACTATCT CCAAATATCG 5820

TCCACATTCA GACATTATTG CGGTGACTCC AAGTGAAGAA ACTGCACGTC AATGTTCAAT 5880

TGTTTGGGGA GTTCAACCTG TAGTTAAAAA AGGACGTAAG AGTACAGATG CATTGTTAAA 5940

CAATGCAGTT GCAACAGCTG TTGAAACTGG TAGAGTATCT AATGGTGATT TAATCATTAT 6000

TACTGCTGGT GTACCAACTG GTGAAACTGG AACTACTAAT ATGATGAAAA TCCACCTAGT 6060

TGGTGACGAA ATTGCTAATG GTCAAGGTAT TGGACGTGGA TCAGTTGTTG GTACTACGTT 6120

AGTTGCTGAA ACTGTTAAAG ATTTAGAAGG TAAAGATTTA TCTGACAAAG TTATCGTTAC 6180

TAACTCAATC GATGAAACGT TTGTACCTTA TGTAGAAAAA GCTTTAGGCT TAATTACAGA 6240

AGAAAATGGT ATTACATCAC CAAGTGCAAT TGirGGnTA GAAAAAGGTA TTCCAACAGT 6300

TGTAGGTGTA GAAAAAGCTG TTAAAAACAT AAGCAATAAC ATGTTAGTTA CGATTGATGC 6360

TGCTCAAGGT AAAATCTTTG AAGGATATGC AAACGTACTA TAATTTATAA AAAAACGTCT 6420

TTCCATTTAT CAACAATGGA AAGGC6TTTT TTGGTTcATC TGGTATTTTA TGACGTAATT 6480

AATAGGTTAT TIGATAATGA TAGTGTATGA ATGGCAATCT ATATAAATGT TTATATCTTT 6540

TATACATGTA CATTATCACC TTCAAACTTT CACTCATATT ACTTTGGAAA TTTATTATAA 6600

AATAGAAGTA TGGATGTATT TCTGAAATGA TACATTATTA AATAGATGAG AAAGTAAAAG 6660

TTTTGAGCCA AGTACGCAAT TTAATATTAT AAGTTGCATA TAAAACAGGA TGGGACATAA 6720

ATCCCTAAAA AAACAGCAGT AAGATAATTT TCAATTAGAA AATATCTTAC TGCTGTTCTC 6780
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tCTTCGACTG GCACTGCTCC CTCAGGAGTC TCGCCATTAA TACTACGTAT TAACATGTAA 6900 

TTTTACTTTT ACATACTTTA AAAAAATAAA ACACTTTGCC CAACTTGCAC ATAAATGTAA 6960 

AATTCAATAA AATGAATTTT CTGTGTTGGG TCCCTTCGTA TAATTTAATA AATACCACTA 7020 

AACTAAATTA ACGAGGTGCC TTATGTATAA AATTTATAAC ATGACCCAAC TTACACTACC 7080 

AATAGAAACC TCTGTTAGAA TTCCTCAAAA TGATATTTCG CGATATGTTA ATGAAATTGT 7140 

TGAAACGATA CCTGATAQCG AATTCGATGA ATTCAGACAT CATCGTGGCG CAACATCCTA 7200 

TCATCCAAAA ATGATGTTAA AAATCATCTr ATATGCATAT ACTCAATCTG TATTTTCTGG 7260 

TCGTAGAATA GArAAATTAC TTCATGACAO TATTCGAATG ATGTGGTTAG CTCAAGATCA 7320

AACACCTTCT TATAAAACTA TTAATCGTTT TAGAGTGAAT CCTAATACTG ATGCGTTAAT 7380 

TGAATCTTTA TTTATTCAGT TCCATAGTCA ATGTTTAAAG CAAAATCTTA TTGATAATAA 7440 

TTCAATTTTT ATTGATGGTA CAAAAGTAGA AGCTAATGCC AATAGATATA CATTTGTGTG 7500

GAAGAAAAGT ATTCAAAATC ACGAATCGAA ATTGAACGAA AATTCAAAAA CATTATATCG 7560 

TGACTTAGTT GAAGAAAAAA TAATACCAGA GATAAAAGAA GATGGAGATA GCGATTTAAC 7620 

AATAGAAGAA ATAGATTTAA TTGGTAGTCA TTTAGATAAA GAAATCGAAG ATTTAAATCA 7680 

TTCTATTGAG AACGAAGATT GTGCTCAAAT TAGAAAACAG ACCCGTAAAA AAATAACTGA 7740 

GATTAAGAAG TTCAAAAAGA AATTTGATGA TTATTCCGAA AGAAAAAATA AATATGAAGA 7800 

ACAAAAATCG ATTCTTAAAG ATAGAAATAG TTTTTCTAAA ACTGATCTGA TCATGATGCA 7860 

ACTTTTATGA GAATGAAGGA AGACCATATG AAAAATGGCC AACTTAAGCC AGGATACAAT 7920 

TTACAAATAG CGACAAATTC TCAAAAATGT TTTATCCTAT GACCTATTTC AA 7972

(2) INFORMATION FOR SEQ ID NO: 313: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3175 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 313:
ATTTTTTAGT TAATTGTCTT TCTTAAAATA ATTTTAGCTT TCATTAAATT AAACAATTTT 60 
ACAAGCTTGG AACACCAATC AAAATCCTAA GTTCTAAAAT GCAATATTAG TAGTCGTTGA 120 

CTGAATGAAC ATATGCTTAT AATATTTTTT TOCAATGCTA GTCAAGTTGA TTTATGCTCA 180 
CAAGGATATG CGATTTATAT TTTCTTACAA CAATGAAAAT GCCTGATACA ATGCGATCCT 240 

AATTAAATTA TTGTTGTTTT AACCATTGTA CTATCXjTCGG AATCATAAAT CCTGTTGGCC 360

CTTTTGGACC ATTATATGAA GCTTTATTAG TCGTTGCTGG ACCTGCAATA TCGAAATGAA 420

TGTGAGGTGT TTGACCACTA AAATGTGTTA CAAAACTTGC CGCAAATAGC GCTTTACCTT 480

GTCCATTCGT ATGGTTAACT AAATCAGCGA TATCACTGTG TTTAATACTT GCACGTTCGG 540

TTGCAGTAAT CGGTAATTCA AATACCATTT CATCGACTTC AGAACTTATT TGTAATATAT 600

CGTTTAATAT CACTTTACTA TTCGATTCAA ATGCAGCAGC TTTATCATCG CCTAGTGCAa 660

CAATTGCTGC ACCCGTTAAT GTAGCAAAGT CCATAATCAC ACTAGGCTGA TATTGATTTG 720

CATAAAACAC AGCATCTGCA AGGACTAATC TACCTTCAGC GTCTGTATTC ATTACTTCTA 780

CAGTTTCACC ACTTAATGCT GTAAATACAT CATCTGGcTT CATTGATGCT TCATTTATCA 840

TATTTTCAGC ACACGCAAGC ACTCCGACAA TATTTACAGG CAGTTGTAAA CGAcTAGCCG 900

CTTCAATGAT ACCAACGACA TTCGCAGCGC CACACATGTC AAACTTCATT GTAGCCATGC 960

CATTCTTCGT TTTAATACTA TAACCACCA6 AATCATACGT TATACCTTTA CCAACTAAGG 1020

CAATTGGTGC TTCATCTTTG TCTTTGCCAT TATATGTGAT GGTTACTAAt CTCGGTTTAT 1080

GCTTACTACC TTTACCTACT GCTTGTAAAA GTCCGAATCC TTCAGAAACT AAAGTGTCAT 1140

AATCTTTAAC ATCTACTTTG ACCTTTGTAT TTTTAAAATG ATTAACAATA TCTTCTGcAA 1200

ATOTTTGTGG TGtTAATACA TTCGGTGGcA TATTACTAAA GTCTCyTGcC AAATTAATGG 1260

aTTGGcCAAT ACTGATACCC TCATQAATAA AATCTAATTC AATTAATGAT TCACTAATAA 1320

GGkTTAAATT CGTCTTAAAC GGtGCCTTCT kACTTgrTTT ATAATGATCG aATyCATATG 1380

kTGcACGCTC ACTTTGGAAT ACCGCATGCC ATTAATACAT CTGgATAACT GgATCcATAT 1440

TTTGgAnATA AATGaATCCa TAAGTAAATA CGTATCTTCa ATGTGTTCTG ACTTTATGTA 1500

TTGGAAAAGA TGTCCCCaTA TTTTCAACAT ATCTTGATAA tACGTGTCTT TAAGTTTCCT 1560

AAACCAACTG TAATTAAACG ATATGTTTGA TCTTGTACAT CAAATGCAGT TGTATAAATC 1620

TTCCCAACTT TACTACCAAT AATATGTTGA TGTTTTAGTC TTTCAAGTGA TTCTGTAATA 1680

TCGATATGAT TAAAACTAAT GCGCTCTAAC TGATTTAAAT GTTCTGGTAT ACCAATAATC 1740

AATGTATTTA TTTCGTTGCT TAGTGTGTTA TTTAATTTAA AATTCATAAT GTACCTCCTT 1800

AGATTTGATA TGTTACTCAC TTCAAATTGT ACAACAATAA AGCCCTcAGT GACACTGAAG 1860

GACTTTATAA ATGAATAATT TAATTTTATG TGTTTAGCTG AAACTTTATT TAAAGCCTAG 1920

AATTTACCTT TTTTGAATGC TAAACCGATA CCACCGATTT TGAATACCGC ACGTGTATCA 1980

ATCACTTTTT TCATGAATGC TGCTTTTTTA CCAGCGATAG GTTTACCAAA TACCATACCT 2040
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GTTGATTCAC CGTTTAAGAT GCGTTTAATG TTTTTAGCAA CACTTTCACC TTGTTGCATT 2160 
GCAATTTGTG CTGTAGTTGG TAATGGACGT TCTTCTCCAG CTGGGATAAA CGCTGAACAG 2220 
TCACCAATAA CAAAAATGTT GTCX3TAACCA TTGATTGTTA AATCTTGCTT TGTAACGATA 2280

CGTCCACGTT TAACGCCTTC AAATGATTCT TCCATTAATT TACTACCACG TACACCAGCT 2340 
GCCCATACTG AAGTACCTGC ATTTAATTGT TGTTTTTCAC CATCTACTTC AACTACAAAA 2400 

CCTTTTTCGT TACAAGCAAC GATTGGTGTA GCAATTTTAA ATTCAACACC GCGGTCTTCT 2460 
AAGTAGCTAA cTGCGTGGTT AACTAATTCT TCTGAGAACA TTGGTAACAT TTTAGGTGCT 2520 
GCTTCAACAC AAGTGATTTT AACTTTATTT TGATCCACAC CATATTTGCT ACATAATTCA 2580 

GGAATTCTGT CTGTTAATTC ACCTAAGAAT TCAACACCAG TGAATCCAGC ACCACCAACT 2640 
AAGATAGATA AATCGTTATC ATCTTTTTCT TTTGATGCTG CATAGTTAGC AAATTTGTCT 2700 

TCGATATGAC GTGATAATTC ACGTGCTGTG ATAACATTTT CAATTTGGAA AGCATGATCT 2760

TTCATACCTT CGATGCCGAA TGTTTCACTA ACGAAACCTA ATGCTACTAC TAAAATATCA 2820 
AAGTCATAAA TACCTTGATT TGTTTCTACC TTTTTAGCAT CACGGTCAAT TTTTGTTACT 2880 

TCTGCtTGAA CAAAGTTCAC TTTGTCTTTC TTCAAGACAC TTTCCACAGG ATATAATACA 2940

TCtTCATAGT TTAGTGTACC TGCTQATGCT TCATGTAACC ATGTTGCTTC ATAGTGATAT 3000 
TCATTTTTAT TAATAAGCGT AATTTCTGCT TCTTCTGTTG ATATCGCTTT TTGCAATTTA 3060 

GTTACAGTTT GTAAACCTGC ATAACCAGCA CCAAGTACAA GTACTTTTTT ACXaATCTTGA 3120 
GCCATTTAAT TnCACCTAAG CnTTCATATT TTTTTAACCA AATGCTGATA ATTAC 3175 
(2) INFORMATION FOR SEQ ID NO: 314: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 702 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 314:



CATCATTATT AAAGATTTTC AATCAATACA gAATCACAAT ACGTACGCAT TGTGCACQAT 60

AAAAATACAG ATGTGTATAT TAACTATGAA CTACAAGAGC AACTAACGAA CAAAGCTTAC 120

ATTGGTGATC ATATTTATGT TGAAGGGATA TGGCTCGAAG TACAAGCTGA TGGTTTAAAT 180

GTATTGAGTC AGAATACAGT GGCATCGTCA TTAATTCGCT TAACACAAGA GATGCCACAT 240

GCACAGGCAG ATGATTACAA TACGTACCAT CGTTCGCCAA GGATTATTCA CCGTGAACCG 300
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TGGCGTTCCA TTATACCGCC ATTAGTAATG ATTGCTTTAA CTGTTGTCAT CTTTTTAGTG 420 

AGACCAATTG GTATTTATAT TTTAATGATG ATTGGTATGA GTACAGTAAC GATAGTATTT 480 

GGTATTACAA CGTATTTCTC TGAAAAGAAA AAGTATAACA AAGATGTTGA AAAACGAGAG 540 

AAAGATTACA AAGCTTATTT GGATAATAAA TCTAAAGAAA TTAATAAAGC GATTAAAGCA 600 

CAACGTTTTA GTTTGAATTA CCATTATCCA ACGGtTGCTG AAATTAAAGA TATCGTTGAA 660 

ACGAAAGCAC CAAGAATATA TGAnAAAACC ATCGGCATCA TC 702 
(2) INFORMATION FOR SEQ ID NO: 315: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4121 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 315: 

TATGTTCCGA CAACGAAACA AAGTGTAATT ACAAGAGCAA AGATAACTTT GAATGTTTGT 60 

AAACGTCCAT CTTTACCTTC AGTTAAATGC ATGAACATTA ATAATTGArG TCCTGCTTGG 120 

ACGAATGCAA AGCCAAAGAT AATTGTCAAC TTCGCGTGGA ATGTTAATGA CGTGTATAGT 180 

GTTACGTAAA CTGCTAAAAG CGTTAATACG ATAGATGCGA TAAATCCTAC AGTATGTTTC 240 

ATTATTGTAC TCATCCGCTA TACACCATCC CTATCATATA TACGGCAGTA AAGATGAAAA 300 

CCCAAACAAC ATCTAAGAAG TGCCAGTATA AACTTACTAT AAATAATTTT GGCGCATTAT 360 

ATTTGTCTAA TCCGCGTCGT TGGATTTGGA TTAATAAACA AATGGCCCAA ACGATACCTA 420 

GCGATACGTG ACAACCATGC GTTCCTAATA GGATAAAGAA ACTAGACCAG TAAGAACCAA 480 

TTGTTGGGTT AACGCCTTCT GATGCATAGT GTGCGAATTC ATAAATTTCG AATCCAACAA 540 

AGACTAAACC TAAAAGTAAC GTAATGATCA TCCAAAACAT CATTAACTTT TGTTTTTCTT 600

GGCGCATGTA GTAAATAGCA ATACCACATG TGTAAGAACT GAATAATAAT GCAAACGTCA 660 

TTATTAAAAC AAGAGGCAAT TCAAATAACT CAGTAGTCAT TTTACCTGCA TAATCGCCAC 720 

CATGTTGCAA AGTTAATAGT GTCGCAAATA GGGTACCGAA TAACGCAAAT TCGGCTGTAA 780

TGAAAATCCA AAAGCCAAGC TTATTTAATT CGCCTTCATG TGTGCGTGAA TCAATAGTGT 840 

TTGTATCATG ACfCATGACT TACAGCCTCC CTTTCTTTAA TACGCGCTTC TCTTAATCTA 900 

GCTTCAGTTT CTGCAACTTC AGCAGCAGGG ATATGATATC CGTGATCGAT TTGGAAACTG 960 

CGATAAATCA TAGTACCAAA AATACCGAAT AAACAAATTA ATGCTGGAAT TACA6TTTCG 1020 

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TTGTTTGGCA TATGAATGTC TTTGTAATTA TGGTTGTCTA AGTAATGACG ACCATGTTCT 1140 

TTCATATCAA CAAATGTGTC GTAGTCATTC CAATCTGGTG TAATGGCAAA GTTGTATTTA 1200 

GGTGGAATTG CTGATGCAGT AGTCCACTCT AGAGTACGAC CAAGGCCATC CCAGTTATCT 1260 

CCAGTTGCTT CACGTGGAGA TTTGAAGTGA CTGTATACGA TACTAACAAC AAGGAATAAG 1320 

AATCCGATTG CCATTAATAA TGCACCGATA GTTGAAATTA AGTTTAATAA GAACCAACCA 1380 

TCTGATGGCA TATAAGTGTA TAAACGACGT GGCATACCAT CTAATCCAAG AATGAATTGT 1440 

GGTAAGAAAC AAACGTTAAA TCCGATCATG AAGAACCAGA AGCACCATTT GTTTAATGTT 1500 

TCGTTTAATT TGTAACCCAT CATCTTTGGA TACCAGAAGA TTAAACCAGC TAAGCAGGCA 1560

AATACAACAC CAGTAACCAA TGTATAGTGG AAGTGAGCTA CTAAGAAGTA CGTATTGTGA 1620 

TATTGATAGT CAGCTGATGC CATTGCTAAC ATTACACCCG TAACACCACC TAAAAGGAAG 1680 

TTAGGGATAA ATGCTAATGA GAATAGCATT GGTGACTCAA ATGTAATACG TCCTTTATAT 1740

AATGTTAATA ACCAGTTAAA CAATTTCACA CCAGTTGGAA TACCAATCAG CATTGTTGAA 1800 

ATTGAGAAGA ATGAGTTGAT TAACGCACCA TTACCCATTG TGAAGAAATG GTGAACCCAA 1860 

ACTAAGAAAC TAAGGAACGC GATACCGGCA GTTGCCCATA CCATACTTTG ATGTCCGAAT 1920 

AAACGCTTAC GAGCGAATGT CGGGATAATT TCTGAGTAAA TACCAAATGC TGGAAGGATA 1980 

ACGATATAAA CTTCAGGGTG CCCCCATACC CAGAAGAAGT TAGCCCAAAG CATTGGCATA 2040 

CCGCCATGTG CAACTGTGAA GAATGCTGTG TCAAATATTC TATCAGTTGT CATTAATGCT 2100 

AACGCTACTG TTAAAGGAGG GAAAGCAAGA ATAACAATTA ATGTAGTAAT AAATGTTGTT 2160 

ACTGTAAACA TTGGCATTTG CATAAACTTC ATAGTTGGTG TTTTACATCT TAAAATTGTT 2220 

ACAAAGAAGT TGATACCTGT AGCTAAGGTA CCAAGCCCTG AAATTTGTAT AGCTATTAAG 2280 

TAATAGTTAA CACCCGGACC AGGACTGAAT TCACCTGCTA GTGGCGCATA GTTTGTCCAA 2340 

CCAGCTGCTG GTGAACCACC AATAATAAAT GACAGGTTGA ATAAAATCAT ACCTGCAAAG 2400

AATAGCCAQA AACTTACX;TT GTTTAATACT GGGAATGCAA CATCACX3TGC TCCAATTTGT 2460 

AATGGAACAA CGATATTCCA TAAACCAAAG ATAAATGGCA TTGCCATGAA GATAATCATG 2520 

ATTACACCAT GTGTACTAAA AATTTCGTTA TAGTG6TTAG ATTCTAAAAA TTTGTTATCA 2580

GGTACTGTTA ATTGCGCACG AATAAGTAAC GCATCAATAC CACCACGGAC GAACATTAAT 2640 

ACGGCACAGA TTAAATACAT AATACCGATT TTCTTATGGT CTACAGATGT GAACCATTCT 2700 

TTGTAAAGAT ATTTCCATAA TTTAAAGTAA GTAATTACTO CGATTAAACC AATAACTAAG 2760 

AATGGGGCAC CAATTTGTGC CATTGTAATC ATcCAGTTAC CTTTAACTAG TAATTGATCC 2820 

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TTGAAATTTT CTTCATTTCT TTCGCATTTT TCGATTCATC TTTCTTGAAC TCATTGTTAT 2940 

ATGGTTCGTC ATTTCCAAGA ATCATCAACT TCATACCATG TCGTTTATAG TTCGCATTTG 3000 

TAATTTGAGC TTTACXSAGCA GGTATTAATG GTTTGTCTGA TACATCTTTA AACATATTTT 3060 

CTTCACTAGT GAAGTTTGGA TCTTTCAATT CGAAATTGAA ACGTTTATAT GCATAGAAGA 3120 

TGTATTCTGG ATCGGCTGCT GGATCAACAA ACGCCATATG TGTACCATTA AATTCTAAAG 3180 

CTTTATTAGG TGTGCTTGGT AATAATTGTT TATCAAATGT ATCTTGATCT AACGTTTTCT 3240 

TACCTTTAAC TTCTTTCACC CATTTGTCGT AGTCTTTTTG ACTAACGGCA TTTACTTTAA 3300 

ATGTTTGACG TGAGAATCCT TCACCATTGA AGTTAGAGTT ACXJACCTCTG AACGTACCAG 3360 

TTTGAGATGC TTCTAACGTC CAATTCATTG TCATGCCAGT CATGGCATAT TTTTGACCAC 3420 

CTAATTGTGG AATCCAGAAA CTTGTCATTG TATCCATAGC TTGAAGCTTA AATACAACAG 3480 

GACGATCTTT AGGGATTGTT AATGTATTAA CAGTCTCTAT ATGTTCATCT GGATAAGCAA 3540 

AGAACCATTT GTATCCTGCA CTTACTGCAT ATACAACCAT TGGATCTTTC TCACTCTTCG 3600 

GTGGTTTTTC GTAATCGTAT AAAGTTTTAA CTGTAGGAAT AGctAAAGCA GCAACGATTA 3660 

TGATAGGTAT TACAAACCAT ATTGTTTCAA TGATGGCATT ATGGTGCATC TTACCAGATT 3720

CGGCATTCTT ATTATAACTA TACTTGTAAA TAAAAATGGC GAACATGCCA AGTACAACGA 3780 

AACAAATAAC AAGCATGAAG ACGATTGAAT AAAGAATCAA GAACTTCTGA CTACTTGCTA 3840 

CTGGCCCTTT TGCX5TTGAAA ATTTCTATAT TTGAACAACC ACTAAGTAAA ATTAGTGTGC 3900

CAAATAATAG AAGCAAAGAC TTAAATTTTG ACACTTTTTT GACCTCCTAA TACTACAAAT 3960

GTAGGGCTTA ACATTAATTT TAAGTTATTA CACAATATTT ACAAGGGCTT ATGGGAAAAA 4020 

AATTAATAAA ATTGTATCAA AAATGTTGAT AAATCAAGGT GTQACX3TGGG TTCACACATT 4080 

TGTTAAAATT ATGTQTACAT TTTGTGACTA ATAGCGTTTT T 4121 
(2) INFORMATION FOR SEQ ID NO: 316: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 9310 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 316: 
CGAGTGAGTA CAmACATATT TTTATTTGCA AGGGGTAAAT GGCATATAAC TATCTTTTTT 60

ATGTAAGCTG GTATAAAATT TATACTAATA GGAGGGATAG TATGAATATA GTAGGGCATC 120 



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TTGGATTACG ATTAGTTGAA AAGTCGGTTA ATCAAGACAA TCCTTCAATG TATCATTTGT 240 

TTTATGGGGA CGAAgTAGGT ACAGCCGGAA CAATTTTAAG CTTTTTTGAA ATTCCCAATG 300 

CGGGTCATAA GCAGCCAGGT ACTGAAACGA TTTATCGATT TTCATTATTA GTACCAAATC 360

AAGCGGCACT TCATTATTTT GAAAAACGTC TTGAGAATAA TGGTATTAAG TCTGAACGTT 420 

TGTACTATCT TGGACAAGAA GGTGTTGTCT TTAAAGATGA AGACGACTTA GAAATCATAT 480 

TGCTTGTTAA TGATAGTTTT GAAGTACCAC ATCAATGGCA ACATAACGCT TATAGTGAAA 540 

TACCTCAAGC ATATCAAATT TTAGGAATAG GGCCAGTCGA ATTAAGAGTT AGAAATGCAG 600 

CGCGTACGGT AGAATTTTTG GAAAATGTCT TAGGTTATCG CAAAAGAGAT AATAAATCAT 660 

TCGATGTGCT GACATTAGCA CCACAAGGTT TATATTCGGA TTTTGTAGTT ATTGAGCAAC 720 

AGGGACAACG TGAAAGACCT GGACGAGGTT ATATCCATCA TATTGCAGTT AATACACCAC 780 

AAATGAGTGA CTTAGATGCA ATTTACAAGA AATTACAACA ACAACCACAA AGTAATTCAG 840

GTATAATTGA TCGCTATTTC TTTAAATCAT TATACTATCG CCATAATTCA ATTATGTATG 900 

AATTTGCGAC TGAAGCGCCT GGATTTACTA TTGATACACC TGTTGAACAA TTAGGAAGTC 960 

AATTGAACTT GCCTGACTTT TTAGAAGGAG AACGTGAACA AATTGAAAGT AAGTTACAGG 1020

AAATATAAAG GAGAATGTTT AATGGCCAAA TTAGAAATGA ATAAAAATAC GCCTCTTGAG 1080 

TTTGGTTTGT ATTCCTTAGG TGATCATTTA TTGAATCCAT TGAAAGGTGA AAAAGTTAGT 1140 

TATGAGCAAC GTATTAATGA AATTATTGAA GCAAGTAAAT TAGCAGATGA AGCAGGTATT 1200

GATGTTTTTG CAGTTGGTGA AAGTCATCAG GAGCATTTTA CAACACAGGC ACATACGGTT 1260 

GTGTTAGGTG CAATTGCCCA AGCGACAAAG CATATTAAAG TTTCAAGTrC TTCAACGATT 1320 

ATTAGTGCAC AGATCCTGTA AGAGTATTTG AAGACTTCX5C GACATTAGAT TTGATTTCTC 1380

ATGGTAGAOC CGAAATTGTA GCTGGCAGAG CATCAAGAAC AGGTATTTTT GACTTGTTTG 1440 

GCTATGATTT AAAAGACTAT GATGAATTGT TTGAAGAAAA ATTAGGTTTA CTTTTAGAGT 1500 

TAAATAAAAC TGAGCGTATT ACTTGGTCTG GAAAATATCG TCCAGAACTT AGAAATATGA 1560 

AAATATTCCC AAGACCAATC GATAATATAT TGCCAATATG GCGTGCTGTT GGTGGTCCAC 1620 

CTGCAAGTGC TATTAAAGCG GGAAAACAAG GTGTGCCAAT GATGATTACA ACCCTTGGTG 1680 

GCCCAGCAAT GAACTTTAAA GGTTCTATAG ATGCTTATCG TCAAGCGGCA ACTGAAGCAG 1740 

GTTTCGATgc TTCGCCTAAG TCTTTACCAG TAAGTACAGC GAGTCTGTTT TATACAGCTG 1800 

AAACAACTCA GGATGCTATG AGAGAATTTT ATCCACATTT GAATACAGGG ATGTCATTTA 1860

TTCGTGGTGT TGGTTATCCG AAACAGCAAT TTGCTAATTC GTCAGATTAT CGAGAAGCGC 1920 
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GTCATCAACG 


TTTTATGGCA 


CAGCTTGATT TTGGCXX5TGT GCCATTTGAA AATGTTATGA 


2040 




AGAATATTGA GTTAATTGGC 


AACGACATTA 


TACCGGCGAT 


TAAAAAGCAT 


TTATCAAAAT 


2100 


5 


AGGAGGGGCG 


TCATCATGAA 


TATTGTATTA 


TTGTCAGGTT 


CCACAGTAGG 


TTCTAAAACG 


2160 




AGAATTGCTA 


TGGATGATTT AAAAAATGAA CTAGAAGTCA TCAATGAGGG ACATCAAATA 2220

GAGTTGATGG ATTTACGAGA ACTTGAATTA GAATTTAGCG TTGGAAAGAA TTATCTAGAT 2280

ACTACAGGAG ATGTATATAA ATTAACGACG TCGTTAATGC AGGCTGATGT GATTTTTATT 2340

GGTTTTCCAA l-mTCAAGC TTCCATCCCT GGTGCTTTGA AAAATGTGTT TGATCTACTT 2400

CCAGTCAATG CGTTTCGTGA CAAGGTAATA GGACTTGTAG CGACAGCAGG TTCTAGTAAA 2460

CATTATTTAA TTCCTGAAAT GCATTTAAAA CCAATATTGA GTTACATGAA AGCACATACG 2520

ATGCAAACGT ATGTATTTAT TGAAGAGAAA GATTTTTCAA ATCAACAAAT TGTCAATGAT 2580

GATGTTGTAT TTCGGTTAAA AGCGTTGGCA CAATCCACAA TGCGAACTGC CAAAGTACAA 2640

CAACAAGTGT TTGAAGAAGA AAACAACCAA TACGACTTTT AAAGTATAAA AATAAGACGC 2700

TCGGCACACT AAATTTGTAA GTGTTTGAGC GTCTTTTCAT ATTAACTATA TAGCCAATGA 2760

acx;acgataa aggcaagtga TGACAAGCAT ATTGAGGTAA TAATGATTGT CATAAGCGGT 2820

TTAAGTGCGC GATTTTTAAG ATCTTTAAAT GCAACATTTA ACCCTAAAGC AACCATGGCC 2880

ATTAATAAGC AAATTGTTGA TACAGTATTT AAAATATTTA GCAATGCTGA CGGAATAGTT 2940

ACATATGTAT TCACTAAGGC CATAATGACA AATCCAATTA AAAAGTATGG AATGCTTATT 3000

CGACCCTTGC TAGATGATTC TGATGAACGG AAACGCATAA TTAAAATAAG TACGATGGTT 3060

AATGGAATCA GTAAGAATAC TCTACCAAGT TTACCAAGAA GTGCAATTTT AAGTGCATCA 3120

CTACCACCAA AGCCACCAGC TAAGACAACG TGTGCAATTT CATOAAGACT AACACCAGAC 3180

CAAGCGCCAT AAACATTTGT CGTCATTGAA AAGATAGCGT AGATAGCTGT ATATATAAGT 3240

GAAAATATGG TACCAATCAA TGCGATGATA CCGATACTAA TAGCTGTATC CTTTTCACGT 3300

GATTTGAATA TTGGAGCGAC TGCGGCAATA GCAGCAGCAC CACAAACGCC TGTGCCXSACA 3360

CCTAGTAATA ATGCGATGTT TTTGTCACCA TGCAACAGTT TGTTGACAAA GAGCATCATT 3420

ACAATACTGA AAATAACGAC ACCTACATCG ATGGCTAATA GTTTACTACC TTGACCGATA 3480

ATATCGAATA TATTGAGTTT AAGTCCATAT AGGATGATTG CAAATCTTAA TAAATATTTA 3540

GATGAAAACG TAATACCTGA GCTATATTGT TCAGGATATC CTCTAAAGTG ACGATATAGA 3600

ATAGCGATTA ATATCGCGAT AGTTAATGCG CCAACCTTAT CTAGGATTGG CAATTTAGCT 3660




GCTAAAAAGC TAAATAATGC GACTATAAAT GTTAGTGATA GCCCAATCAT AAAATGCTTA 3720

ATTTTTAAAT ATAAATTTGG AATGAATAAT AAAGTAGTGA TTAAATTAAG TTGTGTGATA 3840

GGAAACTTGG ACATCAATCA AAGTAATAGG CACTACAACXJ CTTATTGGCG GGGCCCCAAC 3900

AAAGAAGCTG ACGAAAAGTC agCTTGcAAT AATGTGCAAG TTGGGGATGG GCCCCAACAT 3960

AGAGAAATTG GGTCCGTAAT TTCTACAGAC AATGCAAGTT GGCGGGGCCC CAACATAGAG 4020

AATTTCX3AAA AGAAATTCTA CAAGCAATGC AAGTTGGGGA AGGACAACAA ATTTAAGATA 4080

CAATGCGTAA CATTAATATG TTATTATAAT GATAATTTAC AGAATTATAT GAAAAATGAA 4140

TGAGGATGTG ATGGTATGTT TGGAATGAAA GTGAATGAAC AAATAACATT AAAAATTTTA 4200

GAAGCTCATG ACACAGAAGC GCTTTTCAAT TTAGTCAATC GTTCAAGAAA TTCACTTAGG 4260

GAATGGTTAC CTTGGGTAGA TGCAACTGAG CAACCATCAG ATACGCGTGC ATTTATTAAA 4320

AGAGGACTTT TGCAATTTGC TGATGGTAAT GGATTTCAGT GTGGCATTTG GTATGAAGGA 4380

ACGCTAGTTG GTGTCATCGG TTTACATGAA ATTAATCACA TGCACAGAAA AACTTCATTA 4440

GGGTACTATT TAGATAAAGA ATTTGAGGGT CATGGGATTA TGACACAAGC AGTTGAGGCA 4500

TTGATAAAGT ATTGTTTCGA AGAGCTTGAC TTAAACCGAA TTGAGATTAG TGCCGCAGTT 4560

AATAATGAAA AAAGCCXK3GC TATTCCTGAA AGGCTGGGAT TTACTAGAGA AGGTATGTTA 4620

CGTGACAATG AATTACTAAA TGGTATTTAT TCATCGAGTT ACATCTATAG TTTATTAAAA 4680

TCAGAATACG ACCAAAAATG ACAAATTAGA CTTACAAAAG AGTGATGACA TTTAAAATGG 4740

CAGCGCTCTT TTATTTAATT TTTGAAAATA AAAGGTTGTT GACAGTATTA TTTTATAACA 4800

ATATAATGAT TTTGATAATT ATTATCAACT AGATGATGTT TATGGGAGGA TGCTTTAAAA 4860

CAGCCGTTTT AAGTGTAATG TATTATTTTA GCX3TGTAGGG AATGCGAAAA TAATATTTAT 4920

AAGAACACAT CTATGGGGAT AATAGAATTT CTATAATGAG GTGTCAAAAT GAAAAAGTTA 4980

ACAACGCTAT TATTAGCATC AACGTTATTA ATTGCTGCAT GTGGGAACGA C3GATAGTAAG 5040

AAGGATGATT CAAAGACATC GAAAAAAGAT GATGGTGTTA AAGCAGAATT AAAACAA6CA 5100

ACAAAAGCAT ATGATAAATA TACTGATGAA CAGTTAAAtG AATTTTTAAA AGGTACAGAA 5160

AAATTTGTTA AAGCGATTGA AAATAATGAT ATGGCCCAAG CAAAAGCGTT ATATCCAAAA 5220

GTTCGTATGT ATTATGAACG CTCTGAACCA GTTGCAGAAG CATTTGGAGA TTTAGATCCT 5280

AAAATTGATG CACGTCTTGC AGATATGAAA GAAGAGAAAA AGGAAAAAGA ATGGTCAGGA 5340

TATCATAAGA TTGAAAAAGC ATTATACGAA GATAAGAAAA TTGATGATGT GACTAAAAAA 5400

GATGCACAAC AATTATTGAA AGATGCAAAA GAATTGCATG CCAAAGCTGA TACATTAGAT 5460

ATCACACCAA AATTAATGTT ACAAGGTTCT GTTGACCTAT TAAATGAAGT TGCAACTTCT 5520

GTTGAAGGCG CACAAAAAAT TTATGACTTA TTTAAACCTA TTTTAGAGAA AAAAGATAAA 5640

AAATTAAGTG ATGATATCCA AATGAACTTC GATAAAGTGA ATCAATTATT GGATAAATAT 5700

AAAGATAACA ACGGCGGTTA TGAGTCATTT GAAAAAGTAT CGAAGAAAGA CCGTAAAGCA 5760

TTTGCGGATG CTGTTAATGC ATTAGGAGAG CCACTAAGTA AAATGGCTGT GATTACTGAA 5820

TGACAAATTA TX3AACAAGTT AACGATAGTA CGCAATTTTC AAGACGTACA TTTTTGAAAA 5880

TGTTAGGTAT TGGCGGTGCC GGT6TTGCAA TTGGCGCAAG TGGTGTTGGT AGCATGTGGT 5940

CTTTCAAATC AATGTTCAAT ACACCAGAAG ATCCGGAAAA AGATGCGTAT GAATTTTATG 6000

GTAAAGTGCA ACCAGGCATT ACCACACCCA CGCAAAAAAC ATGCAATTTC GTTGCGTTAG 6060

ATTTGAAGTC AAAAGATAGA GATGCAATTA AGGCAATGTT TAAAAAGTGG ACGGTTATGG 6120

CTGATCGTAT GATGGATGGT GATACAGTTG GCAAGCCGAG TAACAATCCT TTAATGCCAC 6180

CAGTAGATAC CGGTGAATCG ATAGGATTAG GTGCAAGCAA GTTAACGATT ACCTTTGGGA 6240

TTAGTAAGTC TTTGATGAAG AAAATTGGGT TATCTAGTAA AATTCCCGAT GCCTTTAAAG 6300

ATTTACCGCA TTTTCCGAAT GATCAGTTAA TAGACGATTA CAGCGATGGT GATATTATGA 6360

TTCAAGCATG CTCAAATGAT TCGCAAGTAT CCTTTCATGC GGTTCATAAT TTAGTTCGTC 6420

CATTTCGAGA TATTGTTAAG GTACGTTGGG CGCAATCTGG TTTTATCTCT GCTAAAGGTA 6480

AGGAAACACC TAGAAATTTA ATGGCATTTA AAGATGGAAC AATTAATCCT AGGAAGAATA 6540

ATCAACTTAA AGATTATGTG TTTATTGATG ACGGATGGGC GAAACATGGA ACTTATIGCG 6600

TTGTCAGACG TATTCAAATA CACATTQAAA CX3TGGGATCG TACTGCGCTG GAAGAACAAG 6660

AGGCTACATT TGGTCGGAAA CGACATAGTG GTGCACCGTT AACAGGTGGG AAAGAGTTTG 6720

ATGAAATTGA CTTAAAAGCG AAAGATAGTC ATGGCGAGTA TATTATTGAT AAAGATGCCC 6780

ATACGAGGCT AGCGAAAGAA GCAAATACGT CAATTTTACG TAGAGCCTTT AATTATGTGG 6840

ATGGTACGGA TGACCGCACA GGTAACTTCG AAACAGGCTT ACTTTTTATT G CmT CAAA 6900

AAGCGACAAA ACAATTTATC GATATACAAA ATAATTTAGG TAGTAATGAT AAATTAAATG 6960

AATATATTAC ACATAGAGGT TCTGCTTCAT TTTTAGTATT ACCAGGTGTT AGTAAGGGAG 7020

GATACCTTGG TGAAACATTA TTTGACTAAA TTTGTAGCAA TGCTAATAAC TGCTGCTATG 7080

GTGTGTAGCT TTGGGTTACT GAAAAGTCAG GCAGCAGAAC AACAAAGTAT TAGTGATGTA 7140

TATAGTGTGA TAACGGATGC GAAATCTGCA CTTTCTAATA ATTCGATATC GAATGACAAT 7200

AAGCAGAAAG CAATTGAGCA AGTGGTAAGT GCAGTTAAGA AATTATCGCT TGAAGATAAT 7260

AGTGAAAGTA ATGCTGTCAA ATCAGATGTG AGAAAGCTTG AAGATGCAAA AGCGAATGAT 7320





EP 0 786 519 A2 

GCTAGTAAAG ATGCGGGTTC TAAAATTAAA CTATTGCAAC AGCAAGTCGA TGCTAAAGAT 7440 

GCTGCGATGA CAAAAGCGAT TAAAGATAAA AATAAAGCGG AATTAGAATC TTTGAACAAT 7500 

AGTTTGAATC AGATTTGGAC AAGTAATGAA ACAGTGATTC GCAATTATGA CGCAAATCAA 7560

TATGGACAAA TTGAAGTCGC ATTATTACAA CTTAGAATTG CAATTCATAA GTCACCATTA 7620 

GATACGGCAA AAGTGTCACA TGCTTGGACA ACTTTTAAAT CAAATATTGA TCATGTCGAT 7680 

AAAAAAAGTA ATACGTCTGC AAATGATCAA TACCATGTAT CACAATTAAA TGATGCX^TTA 7740 

GAGAAGGCGA TTAAAGCTAT CGACGACAAT CAATTGTCGG ATGCTGcaTg TGCGCTTACA 7800 

CATTTTATAG AAACTTGGCC GTATGTTGAA GGTCAAATTC AAACTAAAGA CGGTGCTTTG 7860 

TATACGAAAA TTGAAGATAA AATACCATAT TATCAAAGTG TATTAGACGA ACATAATAAA 7920 

GCACATGTGA AAGATGGTTT AGTAGATTTA AATAACCAAA TTAAAGAGGT TGTTGGCCAT 7980 

AGTTATAGCT TCGTCGATGT GATGATTATC TTTTTACGTG AAGGGCTAGA AGTGTTGTTA 8040 

ATTGTAATGA CATTGACTAC CATGACGCGT AATGTAAAAG ATAAGAAAGG GACTGCAAGT 8100 

GTGATTGGTG GTGCAATTGC CGGACTTGTA CTGAGTATTA TCTTAGCAAT TACGTTTGTA 8160 

GAAACTTTAG GGAATAGTG6 CATTCTTCGT GAAAGTATGG AAGCGGGATT AGGTATCGTT 8220 

GCGGTCATAT TAATGTTTAT CGTTGGTGTT TGGATGCACA AACGTTCAAA TGCAAAACGT 8280 

TGGAATGACA TGATTAAAAA TATGTATGCT AATGCGATTA GTAATGGTAA TTTGGTATTG 8340 

TTAGCGACGA TTGGTTTAAT ATCTOTGTTG CGTGAAGGTG TCGAGGTTAT CATTTTCTAT 8400

ATGGGGATGA TAGGTGAGCT AGCGACCAAA GATTTTATTA TTGGTATTGC TTTAGCTATC 8460 

GTTATTTTAA TCATCTTTGC ATTATTATTT AGATTTATAG TTAAATTAAT ACCTATTTTC 8520 

TATATATTTA GAGTGTTGTC GATCTTTATT TTTATTATGG GATTCAAAAT GCTTGGCGTA 8580 

AGTATTCAAA AGTTACAATT ATTAGGTGCG ATGCCAAGAC ATGTTATTGA AGGATTCCCA 8640 

ACQATTAACT GGTTGGGCTT TTATCCAAGT TATGAACCAT TGATAGCACA AGGTGCTTAT 8700 

ATTATGGTAG TTGCTATCTT AATCTTTAAA TTTAAAAAAT AAAAAACAGG CCGAGTGCCT 8760 

GTTTTTTTTG TTGCTATATT GGAAATATTC GGTATTGCAG TATAACGATA ATCACAGCAT 8820 

TGATTCGTAT AAGGTTAATG TGTTGGCGGT TTGCCTCGGC ATGTGAACTT AACGATGAAC 8880 

ATACTGAACT CAAAGAGCAA TATGAGTGGC AATGTGAGTA ATATATTTAA TGTTAAATCG 8940 

GGTGGTGCAA TGATACTTGC TAATACAAAG CAAGCGAAAT AAATATATTT ACGrTAATGT 9000 

TTCAATGATG TGGTATCTAT AAGACCGAAT TTTGCAAGAC CCATAAATAA TATTGGTAAT 9060

TGAAATAGAA GACCAAATGT GAATAACCAA CGTATGAGTT CAATCAAATA TGCTTTAAAG 9120 
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GGAAAGCCAA CATAAAATGC AAAAGCGACG CCAGCACAGA ATAATAACAC GCTGAAAAAA 9240 

CTATATTTAT AAATAAATTG ACGTTCATTA TTATGTAATC CAGGTGCAAT GAATGCCCAC 9300 

AATTGATAAA 9310



(2) INFORMATION FOR SEQ ID NO: 317: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3458 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 317: 
ATTAGTATTA CACACTAATG TAAATAGATT GGTTGGAGAA GAGATATTTG CTAATAAGTG 60 
CCTTGCTAAT AATGATGTTC AAATTATGAA TTCAATAAAA AAATTAATTG AAGCTGAGTT 120 

ATTAACAACG ACTAATGATT TTGAAGTTAG TATATATAAA AAGACAAGAC CTGAATTACA 180 
AAGTATTTTA AAAAGTTTTG GTATAAAAAC AACAGGTAAT AAACCTGACT TAATTAAACG 240 

TATTGACGAC AATTTTCATA TTATTAATAA CTTAGATTTA CCATATGTAT ATATACCAAT 300

TAAGAAAGGA GAAGAAATTT TAAAGAAAAC AGAGTACTTA ACCTCTTTTA TACAAAGTTA 360 
TGGTGAAATT TCTCTTGAGC GTGCTTATTA TTTGGTTGAA AACTATATAG ATGAAAATTG 420 

CGATGATAAA GTTGCAGAAA TATACAAGTT TGAATTTCAA AGAAAATATG ACAATGGCGA 480
GTTTGATTTT AATCATGGAT ATAATTTCGA ATTGAATATG TTGATAGATC ACTATAAAAG 540 
AGATGTAAAA GACTACGATA ATGCCAOAAA GTATTCAAAT ATTTATCTTT ACTTTGGTTT 600 

GAGAGATTTC TTAAAAAAAT TAAT6AGTAA TTATTCATAT TACGATAgTA AAGGGAATAT 660
AGATTTGAAC GAAATACAAA ACGATCTGAA TAGATTTATC AACTCTAGCG CTTCTGGTAT 720 
GTACGAGCGA TTAATATATA ATGAAAATTT ATCCAATAAT ATTATGTTTG AATTATTTAA 780 

AAAGGACACA CAAGATTATA GTGATTTGGA AGAACAATTG ATTGAAAAGT TCATAAACTA 840 
TGTAGTGTCT AATGTAAAAA AAGAAAGTAG GAGTAATACT CTTATAGAGT TATCAAAAAT 900 
TTTAGAGAAC GGATATACAA TTGATAAAGA AGAATTTAAA AAAGAAGATG ATTATCTTTC 960 

TAAGTACATA TTTACTGACA TAGATTATTT GAAAAAGTTA GAATCAAAAA TAAACGTTGC 1020 
TATTGATATT CGAAGTGGAG AAATTCATTT GGTATTAGAT' (SATGATAGCC TTGATATATT 1080 
AATACAAAAT CAAAAATACG GCAATGAGTT TTGAGTCATA ACTAAATATA ATATGTTGAA 1140 
GAGAGGTTTT ATTTAAATGG CTAAAATTGG TTATGATCGT GTATCAACGA AAGATTAATA 1200

AACX3TACAGA GCTTGTTAAG TGTTTAGATT ATTTACX3AGA GAGCGACACA TTAGTTGTCT 1320

ATCAACTTGA TCGGTTAGGT AGAACGACAA AACTATTaAT TGAATTaTCA CAATGATTCG 1380

ATGATAACGG AATTGACTTA CAAATTAGTA ACATGAACAT TTCAACX3AAA GACACAATGG 1440

GCAAAATGTT TTTTACGATG ATGAGTGCAT TTTTCaGGAT TAGAAGTTAA TTTACTATGT 1500

GAGTGTATAA AATAGACTTA GCAGCAACAA GAGCGAGAGG CCGAAAAAGC GGGCGCCCCT 1560

CTTTACCAGA GAATAAAAAA TGAGAAATTA AATTTTTATA TGATGAACAA ACGATAACAG 1620

GGGAAGAAAT AGCTAGTTAG ACAAGGGTAT GTCACTCAAC TGTTTATCGA GTTGATTAAG 1680

AAATGAAAAA ACTTATACTA TGAATTACTG TTTAAAAGTG TGCATGTTAT AATATTTATT 1740

GAGCAAGTTG GATAGATGGT GGCTAATCTC TTAATAAAGG GGTGATGCCT ATGGTTATAG 1800

TTGTTACTCC TAGnAAAGGA CTAGCATGTC TGATTTTGAA ATGCTTATGG TTGTATTAAC 1860

AATCATTGGT TTAGTATTGA TTAGTACTCA AGACCATAAA AAATAaCCTT CTATTCGCTT 1920

TGACCGGCAT TTTTGAAGGC TATTTTTAAA TAAAATATAA GGTCACCGTC TTTTTAACGG 1980

GCTCATTAGG GTAACATGTT TCCGAGTQTT GCCCTTTTTG TGTTTCAAGA GTTAATGATT 2040

TTCATCTTTT GCTTCTACTT GCTACAAATA TATTTTAACA CATTTTTCTT ATGAATTGTA 2100

GTTCTGAACA TAATCAGAAT TAATAAAACC AACTTTCCAT ACAGCAGAAA ATACAATTAA 2160

AAGTATAGAA TGTAATCACG AATTATATAA AAAGTATTGG AGACCTCTTC ATATATAGAT 2220

AATTCACTTA GTTATTTTAG AAAGAAGCCC CTAaCAACTA AAGTTGAAAA ATAGAGGAAC 2280

ACAGTTgGAT TACGCATCAA CTGCATaAGg CCCCTAAmAA CTAAAGTTGT AAGGGGCyCT 2340

AAAATTTATT TTGGTTGATT GTCTTCTGGT TTATCTGAAG TCATTGTTTT TGTTGTATTA 2400

TCATTTAATG AATCTTGTGC TTTTTCTGTT TTTGTTTCAA CAGATGTTGT CACTTTATCA 2460

TTTTCTACTT TTGTATTTTT TGGTTCAGTG ACAACTTTTT TATTCTTGTC AGCTGTTTTT 2520

ACTTTATCTT TAGTAATTTT TTGACCACGT TTTAAGTAAT ATTGAACGAT ACCCATTAAA 2580

ATGATTGCAT GAATAACAAC GAATAAAATA ATTGTCATTA CTGTATAAAC TCTTATGATA 2640

TTTTCTGCAA CACTTTGAGA GAGTGATTGC GTATTAAATG AAATCAAGTA ACCAACAGGT 2700

GTTTTAAATA AAATAATAAT TAAGTTTAAT AATAGTATTC CAATGAAAAA TTTGAAAAAT 2760

GTTTTTTGAC CATTTTTCAT TGCTTTAAAT CCATTAGCTA AGTGTGTTTT TACTTTATCG 2820

TTAGTTGATT CaACGAAACT AGTCaTAAAG TTGAAAATAG GTATTAACAA TAACCAAGTA 2880

ATAATTGCTT TAATCAATAA AACAACAATG ATTATGATGC TTTGAGTTGT AATAGCAATA 2940




CCAATCAGGT GTGTACTATC TGCATATGAT GATTGTACTG AATTCATAAT CATTTCTGAT 3000

ATAGATACAA GACCGATCAG AACGCTCTTT AAATAATTAC CTTTTTTAAA TGCAATAAAT 3120

AAATCGGTAA ATTTTACTTT TTCATGACTC ATTGCTCGTT TCATAACATT TGTAATTCCG 3180

ATAAAAATTT GAACTAAAAC AAACAGTGAA ACTACAGCTG CTATAAGTAA TACAGCGATA 3240

ACTTTTAAAT ATGCATCTAC TGGTGGTTTT TGCCCAAATT GTGAATAAAT TGCAATAGTT 3300

TGTGCATACT TAGCAAGCTG GAAATTAGCA AGTAAATATA CAACTGCAAT TACAGCAAAT 3360

GCTATAAGTG CATATATCAA TGTTATTAAT AATTGTGGTT TAGCATTTTT AAATGCTGAT 3420

TTAAAGTAAG TAAACAAAGT GGTGCCTCCT TTTCTTCA 3458

(2) INFORMATION FOR SEQ ID NO: 318:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 695 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 318:


GCTTGCAAGA TGCTTTCATT AAAGACAAAG TCATCGATAT TATGAATATG TTCCAAAATC 60

ACACTGATAT CACTTATACG TTGAATAAGT CTCATGCACA TATTATATGT ACGCCAGAaA 120

TATTTGCTAA ATTQTTACAT ACQATTGCAA CTAGAAATAT CGACATTCTA TCTGCCAATT 180

ATAGATCGTC ATCTATGAGC AAAGCTCGTA TATCATAATA AAAGACACAT CTATACGATG 240

ATCATTTACA ATGATTGCGT ATAGTGATGT GTCTTTTTTT GTATTATTTA TCTTCGGAcG 300

ATTGATCCTC AACCATTTGT TCAAATGCTT CACGAACTTG TGGTACTGTC ATACCTACTA 360

CAACTTGGAT ACTCTTACCA CTCTTTACTA ATCCGTGTGC CATTTGTTGA TGCGTGAAAT 420

ATTCCGTATC TGCAACTTTA CTTTCATCAT AAACAGTTAA GCGTAATCTT GTTGTACAGT 480

TAGTAACATC TTTGATATTT TCTTTGCCGC CTAAACCGTC GAGGTAATAT GCAGCTTTGT 540

CTTCGTATTC ATTACCAGAT GAAGCACGTT TAGAATCTAC GCTATCACCT TTTTTATTTT 600

TGTAATCTTG CTTAGAAAAT AATTTAACTT CCTCTTCTGT TTyCTTACGT CCAGGTAATG 660

GAATATCAAA TTyCAAAATT AAGAATCTGA ATAAG 695

(2) INFORMATION FOR SEQ ID NO: 319:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 875 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 319:

AATATTGaTa TTATATATAG TCGAATCaAT GATGACCCaC ATCGACCtAA TGCAAATAAT 60 

AAGACAATCA AGCAGTTAAA TGACTTGTAT TGCTCGCGGA ATTTAAGATA TAATGAAATC 120

GCATTCAAGA AATTTGATAG TCATTTGCTA TCAATTTCAG AAATATATTA TGAATTGCTA 180 

AATTTAATAA AAGCGAGTGA TCAGTATTAG AGAGAATAGA GCGTTAAGAC TCTATCGCCG 240 

AAGtGCAAGT AATTTATTAC GAAACTCTCA GGcAAAAGGr TAATACTGTA ACGCGTTCCT 300 

GAATTGGTGA TTTATAAACA GGGTAGCGAT TGCTATCCTG TTTTTATAAT TTTAAGGGGG 360 

TATTTCAATG TCAAGTGATT TAAAACAAAC ACCTTTATAT CAAAATTATG TTGATAGAGG 420 

TGCAAAAATT GTGGAATTCG GAGGATGGGC GATGCCTGTT CAATTTTCAA GTATTAAAGA 480 

GGAGCATAAT GCTGTTCGAT ACGAAATTGG CCTGTTTGAT GTTAGTCATA TGGGTGAAAT 540 

TGAAGTAACA GGTAAAGATG CTAGTCAGTT TGTGCAATAT TTATTATCAA ATGATACTGA 600

TAATTTAACT ACTTCAAAAG CATTATATAC TGCTTTATGT AATGAAGAAG GCGGTATTAT 660 

TGATGATTTA GTAATATATA AATTAGCTGA CGACAATTAT TTATTAGTTG TTAATGCTOC 720 

TAATACTGAA AAAGATTTTA ATTGGATTTT AAAACACAAA GAGAAATTTG ATGTTGAAGT 780

ACAAAATGTA TCAAACCAAT ATGGTCAATT AGCAATACAA GGACCAAAAG CtAGAGATTT 840 

AATTAATCAA TTAGTTGATG AAGAkGTAAC TGAAA 875 

(2) INFORMATION FOR SEQ ID NO: 320: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5897 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 320: 

TCTTTTTACG AAGGCTACCG TTTAGGATCT GATAACTTAA CTGTAGGAGA AATTGTATTT 60

AATACAGCGA TGACAGGTTA TCAAGAAACT ATTTCAGATC CATCATATAC AGOTCAGATC 120

ATTACTTTTA CGTATCCATT AATCGGTAAT TATGGTATCA ATAGAGACGA TTTTGAATCA 180

TTAGTACCTA CATTAAACGG TATTGTAGTG AAAGAAGCGA GTGCGCATCC AAGTAATTTT 240

AGACAGCAAA AGACACTTCA TGACGTTTTA GAATTGCATC AAATTCCAGG GATTGCAGGT 300

GTTGATACAA GAAGTATTAC GCGTAAAATT CGACAACACG GTGTGTTAAA AGCTGGTTTT 360

ACTGATCGAA AAGAAGATAT TGATCAACTT GTCAAACATT TACAACAAGT AGAATTACCT 420



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AGTGTCGTAC TTGTAGACTT TGGTAAGAAG CAAAATATTG TTCGAGAATT AAACGTCAGA 540 

GGTTGTAACG TCACAGTTGT ACCATATACA ACTACTGCCX3 AAGAAATTTT AGCAATGGCT 600 

CCAGATGGCG TTATGCTATC AAACX^GACCA GGTAATCCTG AAGTTGTAGA ATGTGCGATT 660 

CCAATGATTC AAGGAATTTT AGGGAAAATT CCGTTCTTTG GTATCTGTCT AGGACATCAA 720 

CTTTTTGCAT TATCTCAAGG AGCAAGCTCA TTTAAAATGA AGTTTGGTCA TCGTGGTGCG 780 

AACCATCCAG TTAAAAATTT AGAGACTGGA AAAGTTGATA TTACGAGTCA AAACCATGGA 840 

TATGCAATAG ATATAGATTC GTTAAAAAGT ACTGATTTAG AAGTTACTCA TCTTGCATTA 900 

AATGATGGTA CTGTAGAAGG TTTAAAACAT AAAACATTAC CAGCATTTTC TGTTCAATAC 960 

CATCCTGAAG CAAATCCAGG ACCGTCAGAT TCAAACTATC TATTTGATGA TTTTGTAGCA 1020 

ATGATGACTA ATTTTAAGGA AAAGGAGCX^T CATATCAATG CCTAAAGGTA ATGATATCAA 1080

AACAATTTTA GTAATAGGGT CTGGGCCAAT TATCATAGGT CAAGCAGCTG AATTTGATTA 1140 

TGCTGGAACA CAAGCATGTC TAGCTTTAAA AGAAGAGGGA TATCGAGTTA TTCTTGTAAA 1200 

TTCAAATCCA GCGACAATCA TGACTGATAA GGAAATTGCG GATAAAGTAT ATATCGAACC 1260

GTTAACTCAT GATTTTATAG CGCXjAATTAT ACGTAAAGAG CAACCTGACG CTTTACTTCC 1320

AACTTTAGGT GGTCAAACAG GTTTAAACAT GGCGATTCAA CTACACGAAA GTGGTGTGCT 1380 

TCAAGATAAT AACGTCCAAT TATTAGGAAC TGAGCTAACA TCAATTCAAC AAGCAGAAGA 1440 

CCGTGAAATG TTTAGAACAT TAATGAATGA TTTAAACGTT CCTGTACCAG AGAGTGACAT 1500 

TGTAAATACA GTAGAGCAAG CCTTTAAATT CAAAGAGCAA GTGGGATACC CGCTAATTGT 1560 

TAGACCGGCA TTTACGATGG GTGGTACCGG AGgCGGTATT TCTCATAATG ATGAAGAATT 1620 

ACATGAAATC GTCTCAAATG GTCTTCATTA TAGTCCAGCA ACGCAATGTT TATTAGAAAA 1680 

ATCTATCGCA GGTTTTAAAG AAATCGAATA CGAgTAATgC GTGaTAAAAA CX3ATAATGCC 1740 

ATCGTTGTAT GTAACATGGA AAATATTGAT CCAGTTGGTA TTCATACAGG CGATTCAATT 1800 

GTTGTGGCTC CTAGTCAAAC ATTATCAGAT GTTGAGTATC AAATGTTACG TGATGTTTCA 1860 

TTAAAAGTTA TTCGAGCTTT AGGTATCGAA GGTGGTTGTA ATGTTCAATT AGCATTAGAT 1920 

CCCCATTCAT TCGATTATTA TATTATAGAA GTAAATCCGC GTGTATCACX3 TTCATCAGCG 1980

TTAgCTTCAA AAGCAACAGG ATATCCTATT GCAAAATTAG CTGCTAAAAT CGCGGTTGGT 2040 

CTAACATTAG ATGAAATGTT AAATCCAATT ACAGGAACAT CTTATGCAGC GTTTGAACCA 2100 

ACTTTAGACT ATGTGATTTC AAAAATACCA AGATTPCCTT TTGATAAATT TGAAAAAGGA 2160 

GAACGAGAGC TTGGCACACA AATGAAAGCA ACAGGTGAAG TTATGGCCAT TGGTCGAACT 2220 
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TTACCAAATG GTGAAAGCTT CGATCTTGAT TATATTAAAG AACGTATTTC ACACCAAGAT 2340 

GATGAACGAT TATTTTTCAT CGGCGAACAA TTAGAAGAGG CACAACATTA 6AAGAAATTC 2400

ATAATATGAC TCAGATTGAT TACTTCTTCT TACACAAGTT CCAAAACATT ATTGATATTG 2460

AGCATCAATT AAAAGAGCAT CAAGGTGATT TAGAATATCT TAAATATGCA AAAGATTATG 2520 

GATTTAGTGA TAAAACAATA GCGCATCGCT TTAATATGAC GGAAGAAGAA GTATATCAAT 2580 

TGCGTATGGA AAATGATATT AAACCTGTTT ACAAGATGGT TGATACTTGC GCAgCTGAAT 2640 

TTGAATCTTC AACACCATAT TATTATGGTA CATACGAAAC TGAAAATGAA TCCATAGTTA 2700 

CTGACAAAGA AAAAATCTTA GTATTAGGCT CTGGACCAAT TCGAATCGGC CAAGGTGTAG 2760 

AATTTGACTA TGCGACAGTT CACGCCGTTT GGGCAATTCA AAAAGCAGGG TACGAAGCGA 2820 

TAATTGTGAA TAACAATCCA GAAACAGTTT CAACAGACTT CTCAATTTCT GACAAATTAT 2880 

ACTTTGAACC TTTAACTGAA GAAGATGTGA TGAATATCAT TAATTTAGAA AAACCTAAAG 2940

GTGTCGTTGT ACAATTTGGA GGACAAACAG CXSATTAATTr AGCAGACAAA TTGGCTAAAC 3000 

ATGGTGTTAA AATACTTGGT ACTTCACTAG AAAATCTAAA TCGTGCTGAA GATAGAAAAG 3060 

AATTTGAAGC ACTATTAAGA AAAATTAAOG TGCCACAGCC ACAAGGGAAA ACAGCTACAT 3120

CACCTGAGGA AGCATTAGCG AATGCTGCAG AAATCGGATA TCCGGTTGTA GTAAGACCTT 3180

CTTATGTATT AGGTGGTCGC GCAATGGAAA TTGTAGACAA TGACAAAGAG TTAGAAAACT 3240 

ATATGACCCA GGCTGTAAAA GCGAGTCCGG AACATCCGGT ACTAGTCGAT AGATATTTAA 3300 

CTGGTAAAGA AATTGAAGTT GATGCGATTT GTGATGGAGA AACGGTCATT ATTCCAGGAA 3360 

TCATGGAACA TATTGAACGT GCTGGTGTGC ATAGTGGTGA CTCAATCGCT GTATATCCAC 3420 

CACAAACTTT GACAGAAGAC GAGTTAGCAA CACTIGAGGA CTATACTATA AAATTAGCTA 3480 

AAGGTTTAAA CATCATTGGC TTAATCAACA TTCAATTCGT TATAGCTCAC GATGQTGTQT 3540 

ATGTTTTAGA AGTAAATCCA CGTTCTAGTA GAACGGTACC ATTCTTAAGT AAAATTACTG 3600 

ATATTCCAAT GGCACAATTA GCTATGCGAG CAATCATTGG GGAAAAACTA ACAGATATGG 3660 

GTTATCAAGA AGGGGTTCAA CCATATGCTG AGGGTGTCTT TGTGAAAGCA CCAGTATTTA 3720 

GTTTTAATAA ATTGAAAAAT GTTGATATTA CTTTAGGACC TGAAATGAAG TCAACAGGTG 3780

AAGTGATGGG GAAAGATACT ACATTAGAAA AGGCGTTATT CAAAGGGTTA ACAGGTAGTG 3840 

GCGTTGAAGT TAAAGATCAC GGTACAGTAT TAATGACCGT CAGTGACAAA GATAAAGAGG 3900 

AAGTTGTTAA ATTGGCACAA CGCTTAAATG AAGTTGGCTA TAAAATTTTA GCAACGTCTG 3960

GAACAGCTAA TAAATTAGCT GAGTATGACA TACCTGCAGA AGTAGTAGGC AAAATTGGTG 4020 
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TGACTAAAGG TAAAGAAGTA GAAAGGGATG GCTTCCAAAT TAGACGTACT ACAGTTGAAA 4140 

ATGGTATTCC ATGTTTGACA TCTTTAGATA CAGCTAATGC CTTAACGAAT GTAATTGAAA 4200 

GTATGACATT TACAATGCGT CAAATGTAAA TCAATCAAAC TGTATCGGTG GGGCTGTAAT 4260 

TAACCATTTA CTTAAAGAAG TTTATATTAC AGCCTCATTA TTTTAATGAA TTTCTTAATA 4320 

TAAAGGGAGA CaTATATGAT GAAAGATTTA CCAATTATTG CATTAGATTT TGAATCAAAA 4380 

GAAAAAGTAA ATCAATTTTT AGATTTATTT GATGAATCAT TATTCGTAAA AGTAGGTATG 4440 

GAACTTTTTT ATCAAGAAGG TCCTCAATTA ATTAATGAGA TAAAAGAAAG AGGCCATGAT 4500 

GTATTTTTAG ATTTAAAACT GCATGATATT CCTAATACAG TTGGTAAGGC GATGGAAGGA 4560 

CTAGCTAAAT TGAATGTTGA TCTGGTAAAT GTTCATGCTG CTGGTGGCGT AAAAATGATG 4620 

TCTGAGGCCA TTAAAGGATT AAGAAAACAT AATCAAGATA CAAAAATTAT TGCAGTAACA 4680 

CAGCTTACGT CAACAACAGA AGACATGTTA CGACACX3AAC AAAATATACA AACATCGATT 4740

GAAGAGGCCG TTTTAAATTA TGCCAAGTTA GCAAATGCAG CTGGTTTAGA TGGCGTTGTT 4800 

TGTTCACCTC TTGAAAGTCG TATGTTGACT GAAAAGTTAG GTACATCATT TTTAAAAGTA 4860 

ACACCAGGTA TTAGACCTAA AGGTGCATCT CAAAATGACC AACACCGTAT TACGACACCG 4920 

GAAGAAGCAA GACAGCTTGG TTCGACGCAT ATTGTAGTCG GTAGACCX3AT TACACAAAGT 4980 

GACAATCCAG TCGAAAGTTA TCATAAAATT AAAGAAAGTT GGTTAGTATA ATGGCTAAAG 5040 

AAATTGCAAA ATCATTATTA GATATTGAAG CTGTAACATT ATCACCAAAT GATTTATATA 5100 

CATGGAGTTC AGGTATTAAA TCACCGATTT ACTGTGATAA CCGTGTTACG TTAGGTTATC 5160 

CTTTAGTTCG AGGCXSCAATC CGCGATGGTT TAATTAACTT AATTAAAGAA CACTTTCCTG 5220 

AAGTAGAAGT TATTTCPGGT ACTGCAACAG CTGnTATTCC ACATGCAGCT TTTATTGCTG 5280 

AAAAATTAAA ATTACCAATG AATTATGTTC GTTCATCAAA TAAGAGTCAT GGTAAGCAAA 5340 

ATCAAATCGA AGGTGCTAAA AGTGAAGGTA AAAAAGTAGT TGTGATAGAA GATTTAATTT 5400 

CGACAGGGGG ATCTTCAGTC ACAGCAGTTG AAGCCTTAAA ACTAGCAGGT GCAGAAGTAT 5460 

TAGGTGTTGT AGCTATCTTT ACTTACGGTT TGAAAAAAGC AGATGATACA TTTAGCAATA 5520 

TTCAACTACC TTTTTACACT TTAAGTGATT ACAATGAATT AATTGAAGTA GCTGAmAmTG 5580

AAGGTAAAAT TTCTAGTGAA GATATCCAAA CATTAGTTGA ATGGAGAGAC AACTTAGCAT 5640 

AATATAGACA CTAGAAGGAG GAATTCAACA AATGAATGAC AAAACATCTA ATGATTTATA 5700 

TGGGAAGATA AAACATTGTA ACGAATTTAT CAATCATTCA AATGATTCCA ATCTATCTAG 5760 

TAGTCACGAT GTCGACGAAA GTTCAACGAA GCAAAAACAT ATAAAAAATA AAACAACTAT 5820 

TTAAAAACAA AAAAGCT 5897

(2) INFORMATION FOR SEQ ID NO: 321:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 7965 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 321:

TCATTCTAAA TCAACTTATT TTCCATTGCA TAAATTGCTG CTTGTGTACG ATCGCTAACT 60 

TGTAATTTAC TAAATATATG ACTGACATGT GTTTTAATTG TTTTTTCAGA TACAAATAAA 120 

GTTTCTGCAA TCTCTTTATT TGTTTTACCT TTAACCATTT CACGTAACAC TTCAATTTCT 180 

CTCTTTGACA ACTTATTCGT GTAGTGTGGT TTTTGGCTAA CTGTTTCGAA TACATCTTGT 240 

GCCTTAGOAT GTATCATTTT TTCACCGTTC ATAACTCGTC TAATAGTTTC AATTAATTGC 300 

TGAGGCTCAA CGTCTTTCAT TTCATAACCA TCAGCACCTT TATTGATTGC TGAAATTACA 360 

TGTTCATCAT CAACATAACT TGTTAATACC AAAACTTTAA TATCCGGATA ATGTGCCTTA 420 

ATATATTCCG TAATTTCAAT ACCATTCATG CCAGGCATCA CTAAATCTAA TAGCACAATA 480 

TCAGGGTGCT CATGCTCTTT TAAATATTCT AAAAATGTTT CTCCATCTGC AAAGTCTTGT 540 

AAAACTTCTA TGTTTTCAaT CGTGGaTAAT AAAAATCGCA ATCCTTGtCG CACAATATAA 600 

kGGwCATCTA CTAATATnAC TTTGTTCATG GGTTATCTCC TTAAAtCAAG CTATTTTATA 660 

GGAATTGTGA ATTGTATTTG TGTACCCTTT GTTGGCTGAG AATGAAAGGT CACTTTACCT 720 

CTTAATAATT TAACTCTTTG TTTTATGTTA TTAATACCGT GTGATQAAGC TATCTGAACA 780 

TTATOGATCT CAAATCCTTG ACCATAATCA ATCACGTCAA TATATAGTAT ATCGTTCATT 840 

TGTTTTAATG TAAGATCCAT TTTATTCGTA TCAGCATGTT TCTTAACATT ATTAATACAC 900 

TCTTGTAATG CTCTGTATAT GTTTTCTTCG ATTTCATTAG ATAAATCGAT TAAACCTTCT 960 

ACATTTACAT TTAATTGTAT ATGCATTAAT TTACTATATG CTGTCAAAGC ATGAATTAAA 1020 

CCTTGCTCAA GTCCAACTGG CTTAAGTnGC CAAATCAATG CACGCATTTC ATTAACGGCA 1080 

TTTTGACTCG TTtCCTCAAT CGTCTTGAAT GCTTGTTTAG CGATGGATTC GTTTGACATG 1140 

CCATACGCAG CATGTGCtGT TAGTTTTACA GAAAATAACA TTTGATTTAC TGAATCATGT 1200 

AAATCTCTAG CTAAACGATT ACGTTCATTA ATTTTTGCCG CTTCTTTTTC ACGGTCTGTT 1260 

AAATAAATAC GTTTGATGGC TGACCCTAAT TGAAATGCGA CAGACTCTAG CAACTCTAAA 1320 
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TGACCCGATT TAAGTGGTAC CGTTGCATGA TGTGTAATAT TGTCATTTTG GCTAGGAAAT 1440 

GCTTTAGAGG CTAAGTTAAT ACGAGAACAA TTGACGATAT TCGACGCTTT CATTAGCCTA 1500 

CGTTGATTAA ATGCTTTCAC ACACCAACAA GACCCATCTT TAATATAGTG ACAGTGGTCT 1560 

GCTGTCAAAG ATTGTGGTAG AGCCACATGT GATACAAGTT CATGTTCACC TACGCTATTG 1620 

ATGAAAAATA TCCAGCCTGT CXJTGAAATTA CTGCCCTCAA TTAAATATTT TAACGCACCT 1680 

TGGGTCATGC TATACATTTC rGTTTCTTCG TTTAAAAATT CGGCAATCTC TTTTAATAAA 1740 

GCTAGTCGCG TCCTTTGTTC CATCAAATCG CTCCAATTCA TTTTTACGTG TATTAACTAT 1800 

TATACATTGA GTTATTATAT TTTTAAATCT TAGACGTAAA CATGATAAAA TGGCCTTGAT 1860 

TACTCAATAG TTATATTTCG GAGAACTGAT TTGTGATATG ATATTAAAGA CTATAGGAGG 1920 

ATTTTATGAA ATTTAAAATA CCAGAAAACT TTAATGACTT AAGTTTACX3A GATATTTTCC 1980 

AACAACTTAA GGTACCTAAA AAAGATTTAC ATCATTTAAA TATGTCTAAA GATATTACTA 2040

TTAATGATAA ACCTGCGCGA TTAATGGATA AAGTGCATAC TGGCGACGAT GTATTTGTTC 2100 

CTU^CCATCGA TGAAAAAAGT AATTATGTTC CAAGTTATCX5 TTATGCACAA ATTAAATACG 2160 

AAGACGATGA TATGGCAATC GTAATGAAAC CTAAAGGTGT TAAGACTCAC CCTAATGATT 2220

TAAAAGAAAG CAATACTTTA ATGAATCATG TGATTTACAC TATTGATAGT GACTATGTCG 2280 

AACCAATTCA TCGACTGGAC CAGGAAACAG TAGGATTATT AATTGTTGCT AAAAATCCTT 2340 

TAATGAAAAA AATTCTTGAT CGCATGTTAG AAGACAATGA TATTACX3CGG ATATACAAAG 2400

CAAATGTTAA GGCACTTTTA CCTTTAAAAC CACAAACGAT TGATATGCCA ATTGGTAAAG 2460 

ATAAATTCCA TTCGAATAAA CGACGTGTGT CTCCTACTGG ACAGCGTGCA ATTACACACA 2520 

TTTTAACTTC AAAAATGATA AAAGAAGCTG TGTGCCmACT TGAAATCAAG TTGGATACTG 2580 

GACGTACTCA TCAAATwCGT GTGCATTTAG CTGAAATTGG TCACCCTGTT ATTGGTGATC 2640 

CTTTATATGG TGATTCAACG TTAAGACAAT TAGAACTTGA AAGTTACAAA ATAGAGTTTG 2700 

TGcATCCCTT GACTAAGGAA GTCATTTCCG TTTCTTTGGA TGACTAATTT GATTAGTTTT 2760 

GCATGATATT tAAACATGCA ATACCGCATT GTAACTAAAT CAAGTATCAA CtTAAAcGGA 2820 

TAGATGGAAA ATTATTAATT TTTTCAGATG TTCGGTTTTT TTGTTTTTTA CGATGCTTAG 2880

GATTTTATAT TTTGATATTT TAGTAATTAT TCATTTTATA ACATCCTTGG ATAATGACTT 2940 

GTAGTCTTTT TCAACTGCGT TACGTGTATC TATGGACAAT ACATGACATC ATAAGATTTT 3000 

TATCACAGGT TGTTTGGCCA ATACATGTAC AACAATTCAT CATATAAAAA ATAGGTTCTA 3060 

TAATAAAACG GACTCCATGA AAAGTTTTTC CTTTTCATQG CTCTATATCA AATCAGACTT 3120

CCAAACTTTA TTTTATATTA ATATTTAATT AATGAGGATC TACCATATCT TCTGGTTTAA 3240

TCCATGCTTC AAATTGTTCT TCTGTAACAT ATCCAGTTTG AATTGCAGAT TCTTTTAAAG 3300 

TTAAACCTTC TTTATGGGCT TTCTTAGCAA TTTGAGCTGC TTTTTCATAA CCAATATGTG 3360 

GATTTAATGC AGTAACTAAC ATTAATGATT GATTTAAATA ATTATCAATA TTCTCTTCGA 3420 

TTGGTTCAAT GCCCACTGCA CAATTGTTAT TAAATGTTTC CATACCATCA GCTAAAAGAT 3480 

AAATTGATTG TAGTGTATTA TGCATAATAA CTGGTTTATA AACATTCAAT TCAAAGTTAC 3540 

CTTGTGAACT TGCGAACCAA CAACTGTATC ATTACCCATT ACTTGGACTG CnACCATTGT 3600 

TAACATTTCA CATTGTGTAG GATTAACTTT ACCAGGCATA ATTGATGAAC CTGGTTCATT 3660

TTCAGGGATA 6AAATTTCTG CCAAACCAGC TCX3TGGCCCT GAAGCCAACC ATCTCACATC 3720 

ATTAGCAATT TTCATTAAGT CTCCTGCTAA TGCCTTCAAT GTTCCATGCA ATTGAACAAC 3780 

TTCATCATGC GCTGTAAGTG CGTGGAATTT ATTTTCAGAA GATACAAATG GATAACCCGT 3840

ATTTTCTGAA ATATAATGTG CCACTTTATC ACCAAATTCA GGATGCGCAT TAATACCAGT 3900 

ACCAACAGCC GTACCACCX3A TGGCAAGATT TAAAATGTGC TTCTTAGATT CAGATAACAT 3960 

TGTTTCACAA CGGTCAAGCA TATAACGCCA GCCACTAATC TCTTGTCCTA GTTTGATCGG 4020

CGTTGCATCT TGTAAATGTG TACGACCAAT TTTAATAATT GAATCAAATT TATCTTCTkT 4080 

TTCTTTCAAA GTATTTCTTA AAAGTTTTAA TGCAGGTTCT AATTTTGTTT CAACCTCTTG 4140 

ATATAATGCA ACGTGCATAG CAGTTGGGaA TGTATCaTTC GaACTTTGAG ATTyATTTAC 4200 

ATCATCATTk GGGTGGATAC TTTCATCACT TTGATGATCT TTTAAATACA TATTAGCAAC 4260 

ATAACTTACT ACTTCGTTCA CATTCATATT ACTTTGTGTA CCGCTTCCTG TTTGCCATAC 4320 

AACTAGTGGG AAGTGTTCAT CTAATTCACC TGATAAAATT TGATCACATG CGTATACAAT 4380 

GGCATCTTTC TTTGCCTOGC TTAATTTTCC TAAATCAAAA TTAGCTATTG CTGCTGCACG 4440 

CTTTAGTTGT GCAAAACCAT AAACTACTTC GATTGGCATA CGCTCTTTAC CAACTGGGAA 4500 

ATTACGTTTA CTTCTTTCTG TTTGAGCACC CCAATATTTA TCTGCAGGTA CTTCTATTTC 4560 

TCCAAAAGTA TCATGTTCAA TTCTTACTGA CATTCAATTT CTCCCCTTAT CACTGTTTAT 4620 

TTAACTGTAG TATATCATTA AATAATTTAA TTQAGCAATT TATGATTAAA ACGTTTTCAT 4680

AATTTGAAAT AAAAATACAC TAATCGCACG TGTTCACCCT TTATTACAGT GATACGGTCA 4740 

TACGATTAGT GTGTTATCTA TCATTATTTA GTTATTATTG AACTAAGTTT AATTACGATA 4800 

CTTTCTTTTA GTAGCTTCAA CCGtAGCAAT AGCTGTAAGT ATATATAATA CAGCACTAAC 4860 

AATTGTCGTA TATGGATTTA GAGCAACAAG CGTaCCTAAA ACTCCTGTTA AACTCX3CATA 4920

TACAATACCT GATTGATTAC TTTTAATGAA TGTTTGCGCA TTAACATCAT CAATTAATCC 5040

TTTTGATAAA TTGAGTTGTA ATTTTATTAC TTTGAAAATA ACAGGTAAAT ATAATGCCCC 5100

AATTGCCAAT GGAAAAGCTT TAATTGATAT TAAACTTATA ATAACTGTTG CTATCAATAA 5160

TTGAATCCAG TATTTTCCTA ACATAAATAT ATAAATCTCC TCTAATTTCA TTCTTCAATA 5220

GCATATCATA ATCTTGGCAT ATTAAGAAAC GCGGTTTAAT GATTTCATTA AAAATATTAC 5280

TGATAGATGA CTTCTTTCAA TTATGTCTGG AGTAATTAAT TATCAATTCC GTTTAAATCG 5340

TGTTTTAATA TTTAAAATTG AACTTTTGAT ATATTACTAT GTCTGGTACA CAAATCAATG 5400

TTTTATGCTT TACAAAGTTA TATTGGCAGT AGTTGACTGC AGTCCACAAC ATAGAGGCTT 5460

CGGAATGTCA GCTTCTATTT CATGCAAGTT GGTGGAGCTC CAACATAGTG GAATTGGATT 5520

CCCAATTTCT ACAGACATTG CAAATTGGOG AAACGGGCCA CAAACTCAGA AACTGGTGGA 5580

AAGTCAGCTT AAAATAACAT GCAAGTTGGC GAGGCCCCAA AATAGTGAGA TCGGATTTCT 5640

AATTTCTACA GACATT6CAA ATTAGGGAAA CGGGCCACAA ACTCAGAAGT TGGTGGAAAG 5700

TCAGCTTAAA ATAACATCCA AGTTGGCGGG GCCACAACAT AGAAAAATTG GATCCTCAAT 5760

TTCTACAAAC AATGTAAGTT GGGGAAACA6 CCCCAACACT GAAACTAGCA GAAAGTCAGC 5820

TTCTATGAAT ATAATAAAAA AGCTAGGTAA CAAAATGCTA CCTAACTTCA TATTCAAGAT 5880

AATCAATCCT ATTTGATATA TGTTCTATAC TATACATTAT TTACATGATA AATAACTGAA 5940

TATTACACAA TTATAATACT TTACTGACTG TCTTCTTCAG AATTCTTTTC TTGATCATTT 6000

TGATCAGAGA TTTGTTCCaT TTCTTTACCT AATTCTTTTA AATCTTCAAA ATCCGTTACC 6060

ATACTGTTTT CTTCTTCATG ATAATTTAAT TTTGGATCTT TGTCTTTAGA CATAATCAAT 6120

ACCTCACAGA TTTTTAATTA AGCAAAGCGT GACGTAAAGT AAGCTTTAAC ATCTTCAGGT 6180

AAACCTGCAG COGCTTCTTT ATCAAGAATA ACATTTACCA TTCTATGTGC TTTTAAATCG 6240

GCTGGTTCGA AGCTTGTTTT ACCATTTTCT TGATATAATT TTTCAACTAC ATCTCGTTTA 6300

TTAGCACCTG TCACTACTAA GAAAATTTCT CTTGCTTCCA TTAGTCCTTG ACGAATACTA 6360

ACATTTAACT TACCTTGCTC ATCGATAGAA ACAACTTGTA ATGTTAATTT CCCTTTATTT 6420

TCTTTAGTTT TAATCTTATC AGCGATTAAT TCXfATTGCAT CTTTTTCATA AGCAATTGGA 6480

TAAACTTGAC CTGCTGGTAC ACCTAACGCT TCX3AAATATG ATTTTTTATC GTCATAATCT 6540

AAAATATTTA TTTGGCTAAA ATCAACAGCA TGTTTTTCAA CATTTTTCTT TAATTCATCT 6600

AGAACTGGCX3 CTTGATCTGT ATCTAAATGA AAACCTGCAA TTGTAGTAGG ATTATTGTTA 6660

AATTGCTTTC TAATAATATC AGCAGCATAT TCTGCTACAA GTTGACTATT GTCAAAGACT 6720
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GATATTAATA ATTATTATAC CCTAACTTTC AATATATCAA ACCATTTAAC TTTAACATGC 6840 

TTATACTCTA AATATAGCAC TTAAGCATCA TTTTTATAAT GAAAATGAGT AAATTTTAAT 6900 

TCAATCCCGG rAAATCTTGT TGACGTAACG CTTCATAAAT TAACAACGCA GCAGTATTTG 6960

ATAAATTTAA TGAACGAATA TGTTCACTCA TAGGAATTCT TAACGCTGTG TCTTGATATT 7020 

TCTCTTTCAC CCAGTCTGGT AATCCTGTCG TTTCTTTTCC AAAAATGAAG TAAAAATCTT 7080 

TGTCATGATT TGAAAAATCA AAATCACTAT AAGTCTTTTT ACCAAATTTT GTTAATAAGT 7140 

AATACTCGCC ATTTGTGACT TCAAAAAATG CTTCAATACT ATCATGATAC GTAATATTCA 7200 

CAAATTCCCA ATAATCTAAA CCGGCTCTTT TTAACATTTT ATCATCAGTT CTAAATCCAA 7260 

GAGGTTTAAT TAAATGTAAA TGTGTGTTTG TACCTGCACA CGTACGgCAA TGTTACCAGT 7320 

ATTAGCTGGG ATTTCTGGTT GATATAAAAC GATATGATTT GTCATATTAC TATTCTCTCC 7380 

rrGTGTCTAA TCCTTTTATC ATTTCATTCT GAACTTCTGC ATCCTCTTGA TCATAATTAG 7440

CATTGATAAA ATCTCTTGCT TCTTCCCCAA GAATTT6ACC AATGGCCCAA TAAGCAGTTG 7500 

CTCGAATCAA CGGTCTTTCA TCTGTTGTTG CAACTTTTTT CAATTCTGGA ATTGCATCCA 7560 

CTTCATTAAA ATGCGCCAAT GCTAAAATAG CATTTCGTTG TATCGGCTTT TTACCACGCC 7620

AAGCACCTGC AAGgTGACCA TATGTTTGTT TGAATTCTTT ATTAGACATA CGTAGtAAAG 7680

GTACTAATCT TGGCTTTAAA ATTTCTGGTT CCAAAATGAT GTCATCTTGT TCGGTATTAA 7740 

TACCTCTATT TTTCGGACAA ACTTGTTGAC ACGTATCGCA ACCATATAAT CTATTCCCAA 7800 

TTTTATAACG ATATTGGTCA GGCATATAGC CTTTTGTTTG CGTTAAAAAA CTAATGCATT 7860 

TCTGACTATT TAATTGGCCA TTTCCAACTA ATGCACTTGT TGGACAACGA TCAACACAAA 7920 

TTGTACAATG CACCACAGCT ATCTAATAAT GGATCATCAG GTTCC 7965 

(2) INFORMATION FOR SEQ ID NO: 322: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1302 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 322: 
GCCCTGTTGG AGAAATCACC TTTATACGAT GGTTGAAAAA GCATCATTAG GTACAATCGT 60 
GGCAATTATA GTTATTTTGC TGTTTTTAAG AAACATTCGT ACGACGGCAA TTTCTATTAT 120
ATCGATTCCG TTATCACTTC TTATGGCGCT TATTGCTCTG AAATTGAGTG ATGTTTCATT 180 
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TGTAGTTGTT GAAAATATTT ATCGACGCTT AACA6ATTCA GAAGAACAAC TAAAAGGTGA 300 

AAATTTAATT ATCAGTGCGA CAACTGAAGT ATTTAAACCA ATAATGTCAT CGACACTAGT 360 

TACTATTATC GTCTTCTTAC CACTTGTGTT TGTATCAGGT TCAGTAGGCG AAATGTTTAG 420

ACCTTTTGCA TTGGCTATTG CATTTAGTTT ATTAGCATCG TTATTAGTGT CAATTACACT 480 

CGTTCCAGCG TTGGCAGCTA CACTATTTAA AAAAGGCGTT AAACGTCGTA ATAAACAACA 540 

TCAAGAAGGA TTAGGTGTTG TTAGTACAAC TTATAAAAAA GTATTACATT GGTCATTAAA 600 

TCATAAGTGG ATTGTAATTA TATTAAGTAC ATTAATTTTG GTTGCAACTA TTGTATTTGG 660 

AGGACCGAGA CTAGGCACTA GCTTTATTTC AGCAGGTGAC GATAAATTTT TAGCTATTAC 720 

TTATACACCG AAGCCTGGTG AAACGGAGCA AGCAGTGTTG AATCATGCGA AAGATGTTGA 780 

AAAATATTTA AAACAGAAAA AGCATGTAAA AACAATTCAA TACTCAGTTG GCGGTAGTAG 840 

TCCAGTAGAT CCAACGGGTA GTACAAATAG TATGGCAATC ATGGTTGAAT ATGATAATGA 900 

CACGCCTAAT TTTGATGTAG AAGCGGATAA GGTTATTAAA CATGCAGATG GCTTTAAACA 960 

TCCTGGAGAG TGGAAAAATC AAGATTTAGG AACAGGTGCA GGTAATAAAT CTGTAGAGGT 1020 

TACTGTAAAA GGTCCATCAA TGGATGCCAT AAAATCAACT GTAAAAGATA TTGAACAGAA 1080

AATGAAACAG GTTAAAGGAC TAGCCAATGT CAAATCTGAT TTATCGCAAA CATATGATCA 1140 

GTATGAAATT AAAGTCGATC AAAATAAAGC GGCAGAAAAT GGTATTTCTG CAAGTCAACT 1200 

TGCAATGCAC TTGAATGAAA ACTTACCAGA AAAAACAGTT ACGACTGTTA AAGAAAATGG 1260

TAAAACTGTT GATGTTAAAG TCAAACAAAA TAAGCAAACA 6C 1302 
(2) INFORMATION FOR SEQ ID NO: 323:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1003 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 323:
ATATATATTA ATTTAAACGT GTTTCACATG TACCAGTGTT AATGACAGAT AATGCTGCGT 60 

TTAAACCACC TTCAACAAGG TTTTGTACTG CTTCATCAGA GAAGAAAGCA ATATGTGGTG 120 
TTACTAAAAT TCTTTCATGT TCGATTAACT CTAATAATGT TTTATCGTCA ATGTCTTTAT 180
TAGTCCAGTC ATTTGTGAAG TATGCTGCTT CATTTTCATA AGTATCAATC GCAGCACCTA 240 

ACAAAGTACC ATCGTTCACT GCAGCGATTA AATCAGGTGT ATTGATGACT GCACCACGTG 300 
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GATAGCTTTC TTTGTTCGCT GGAACATGTA AAGAAATAAT ATCGGCATCT TTAATAGCTT 
CTTTAACACT ATCTTTATAA GTTAAAAAGT CTAAATCTTT ATTAGGATAG GCGTCATAAG 
CTGTAATTGT AGCACCAAAT CCTGCATATA TtTTAGCTGT AGCAGCACCG ATACGACCCG 
TACCAATAAT TGCAACAGTC ATATTTTTAA CTGGTTTAGA CATGATTTCT GCTTGCCAAG 
TAAAATCATG TGCTTGTACA CGGCGTTCAA TATCTGGGAA GCGACGCACT AATTGTAGGG 
CGATAGArAC AGAATACTCT GCAATtGTTT CAGGTGAATA ACTAGGAACG TTAGATATCA 
CAATATTGTG TTTTTTAGCT AAATCTAAAT CATACATATC AAATCCA6CA 6TACGTTGTG 
CAATTTGTTT AATACCGTAA GATTCTAATT TAGGATAAAC GTCATTTTCT AACTTACCAA 
ATTGCATTGT AGTTACGCCA TCGTAATCTT TTAATTGAtC GACTGTAGCA CTTGATAATA 
GCTCTTTAGA AGTAGTTACT TCGACATTAT TCTtTtTCCC CCAATTTAAT GCCATCTCTT 
TCTCATAATC ACGCGTaCCA AAGAACATAA TTCTCGTCAT TAT 
(2) INFORMATION FOR SEQ ID NO: 324: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5030 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 324:
GGCTTTTATA GTGTTTACGA TTATTATGAT TACTTATATT CAOGCTGTTA AAAATTATCC 
AAATAACCGT ACTGTTCATT ATGGTTATAC AGCTGCATTT ATACTTGTTA TATTACAAGT 
TATCACAGGT GCATTATCTA TTATGACAAA TGTTAACCTG ATAATCGCAC TTTTCCATGC 
ATTATTTATC ACTTATTTAT TTGGTATGAC AACATACTTT ATCATGCTAA TGTTACGATC 
AGTAAGAAGT GACArGCAAT AACaAAAAAG cmAACCGTAA TTTTAATGGC ACGCCCATTA 
AAATTACGGT nTTTTATATC AATATTTAAA AtTAAACCtA AGCCATGTAA AAACGAGATT 
ACACGTCAAT TGTTGTGTAA TCTCGTTTTA TnTTAATCAT TTTAGTCAGT TGCTTTTTCA 
ATTTCGATTA ATAA?^TCGCC TGTCGCTATT GTGTCACCAT TATTTACAGT TACTTGTTTA 
ATCACACCGT CAAATGGTGC TTGAATTGTT GTTTCCATTT TCATAGCTTC AGTAATTAGC 
AACGGCTGAT TAGCTTTCAC AGTTTCACCT ACACTAACCT TGACTTCAGT TACTGAACCT 
GGCATTTGAG CACCGATATG ACTTGGATTA CTCTTATCTG CTTTTGGCTT AACGTTCGCA 
TTTGTATGCA CATTTTCATC TTTAATGTAA ATACGTCTCG CTTGACCATT CATCGCATAG 



CGTTTACCTT TATCGATTTC GATTTCTACT 

GTAGGCGTAT CAAGTAACGA TAAGTTTCCG 

TATACTTTTG GATATAGTAC ATAACTAATA

TCTTCTTCAA GCAACTCACG GACTTTTTCA 

CGAGCTGTTA GTGCTTCTTG GCCTTTTAAA 

TTTACAGGTT GTCCTATTTC ACCTTTGAAG 

TTATAGCCAT CTGTAATCAC GGATTGTTCA 

ATATCACCAA CTACTTTAGA CGATGGTGTT 

ACTCTGCGAT ACATATCTTT GACTTCATCA 

TGTTGACTTA AATTCGAATA CTGTCCACCA 

GGTGATTTGA TATCACTTTc AAAGTCTGAA

CTAAGTGACT CCATACCTTC AATATCAGTT 

TAATATAACG AATTGGCGCT TGGCTGACTT 

TCAATGATAT CGACACCAGC ATCTATTGCT 
GTATCATGAG TGTGAAGATG AATTGGTAAA 
TCGTAAGCGG CTTTAGGTTT TAATAAGCCT 

CCTTCACGTT CTAACTCTTT AGCTAGTTTG 
CX5CTCAGGAT TTAAAATGTC ACCTGTATAA 
TCTTGTACTG CTTCATTGGC AACTTTCATT

CTAAAGACAT CTATGCCTGC TTTAGCACTT 
GGATAGTTTT TATAACCAAC TGCGTTTGAA 
GGAATAGCTT TACGTAGACG TTCAAGTCGT 
GCCACATCAA ATGTAGCACC GCCCCACATT 
GCTGTTTTGG ATGCGATATT AATCATATCC 

TGGTGTGCAT CTCTAAAGGT TGTATCTGTT 
TCAGCTACAC CTTTTGGACC TACTTCATCA 
ATTTTACTTG AAGATACAGT TGGAATTGAT

TCAACATTTG GGAAACCATT AATTGTTACA 
CTATCTAGAG ACGGCTGAAT GTCGAAAAGT 

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GTTTCACCAT TACGCATTCC AAAGAAGAAT 840 

TATTGATTTC TAGTTTGAAT ATATTGTTCA 900 

ATATCTTGCT CCGTAACAGG ACCTTGTTGC 960 

AAATCAACTG GCTCTAGATA TTCACCTGGA 1020 

ATAACCGCTT GTAAATCTTT ATTAAAACCA 1080 

AACGACACTA CTGATTCTGG GAAATCTAAT 1140 

TCAAGATCAT TTTGTACCAT ATAAAGTGCC 1200 

ACTTTTACGA TATCACCAAA TAAQAAATTC 1260 

AATCTTTCGC CTAAACCTAA ACTTTTAGCT 1320 

GGCATTTCAT GTTGATAAAT TTCAGTATTC 1380 

TAATAAGTAC GTACAGTTGA CCAATAATGA 1440 

CTAAGGTGGC GTGGGAAGCC ATTTAATGCA 1500 

GTTAAACCAC TCATTGAAGC AACAGCAGTA 1560 

TGTTTGTATG TTAATAAACC ATTACCACTT 1620 

TCTACAGCTG ATTTTAACTC ACCAATCAAT 1680

GCCATATCTT TAATCGCTAA AATATGGAAA 1740 

ACATAATACT CTAAAGTATA AATGTTTGAT 1800 

CAAATAGTAC CTTCTGAGAT TTTGCCCGCT 1860 

TGATCTACCC AGTTTAATGA ATCX3AAAATT 1920 

TCTTGTACGA ATTTATGAAT AACATTATCA 1980 

GCACX5TAACA ACATTTGGAA TAATACATTT 2040 

TCCCATGGGT TTTCCTTCAA GAAATTATAT 2100 

TCTAGTGAGA AACCATCTTT AAATACGTCC 2160 

TTAGTTCTAA CTCGT6TAGC TAATAATGAT 2220 

AGTAAGACAT CATCCTGCTT TTTAACCCAT 2280 

AGCAATTGTT TCGTACCACT AAATGAAGCG 2340 

GCTAATTCAT AGTCTGGTTT CGGACGTTTC 2400 

TTACCTATAT ATTCTAATGT TTTAGTACCT 2460 

TCTGGTGTTT CTTCAATAAA TTTAGTTGTG 2520 
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TTAACACCAC GAATACGCAT TTCTCGTAAT GAGCGTACCA TTTTTTCTTC TGCTTGTTTA 


2640 




AATGATATCG CGTGTGTAGA TAATTTTACG AGTAATGAAT CATAATAAGG 2700

GCACCTTGGA AACCATCTCC AGCATCAAGA CGTACACCAA AGCCCCCGCT TGAAuvjA 1 AA 2760

GCAATGATTG TTCCAGTATC CGGCATGAAA TCATTTAACG GATCTTCTGT TGTAA 1 AUuA 2820

CATTGGATGG CATAGCCTAA TGTTGTAATA TCTTTTTGTT GCGGCATATT AATCTCTTCA 2880

CCAAATAAAT CGGCACCTGC TGCAACTAAA ATTTGTGTCT TAACAATATC AATTCCTGTT 2940

ACCATCTCTG TAATTGTATG CTCTACTTGT ACACGAGGGT TAACTTCTAT AAAGAAGAAT 3000

TCGTCACCAG ATACTAGAAA TTCAACAGTA CCTGCATTGA CATATTTAAT ATTTTCCATC 3060

AATTGAATTG CAGCATCACA AATACGTTGA CGTAATGTTG ATGATAATCC AACTGATGGT 3120

GCAACTTCTA CAACTTTTTG ATGACGACGT TGTACTGAAC AATCACGTTC AAATAAGTGT 3180

ACGATATTTC CATGTTCGTC ACCTATGACT TGTACTTCAA TATGCTTTGG ATTATCAATG 3240

TATCTTTCTA TGTAAACTTC ACTATTACCA AATGATTTTT CAGCTTCTGA TTTTGCTCTA 3300

TGGAAAGCAT CTTCTAATTC ACTTTCTTCA CGAACGATTC TCATACCTTT ACCGCCGCCA 3360

CCACTTGTGG CTTTAATCAT TAGCGGGAAA CCAGCTTCTT CTGCAAATTC TTTTGCTAAT 3420

TCATATGATT TAATTGGACC GTCTGTACCA GGAATAACTG GTAAATCTGC CTTGATAGCC 3480

GTTGTACGAG CTTTAACTTT ATCTCCAAAC ATATCTAAAT GTTCTAAATG AGGACCAATA 3540

AATTTAATTC CTTCTTCTGC ACAACGACX3C GCAAATTGTT CATTTTCACT TAAAAATCCA 3600

TAGCCAGGAT GAATCGCATC CACATTCGCT TGTTTTGCTA CATCAATGAT ACGCTCAATA 3660

TTTAAATAAC TTTCAGCAGG ACCTAAATCA CTTCCAACTA AATAGGATTC ATCTGCTTTA 3720

TATCTATGTA ATGAACTTTT GTCTTCATTC GAATAAATTG CAACTGTGCT GATGTCTAAT 3780

TCTGCCGCCG CTCTGAATAT ACGAATTGCA ATTTCTCCAC GGTTAGCAAC AAGTAACTTT 3840

TTTATTTGTT TCAATAGCGA TACACTCCTC AAACTATTAG AATTTTCTAA CTAATTAGAT 3900

AATAAAATTT TATCTTAAAG CGCTCTGTTT TGCTATAGTt mTGTTTCtnAA TTTTCAAAaT 3960

TTaACATyCT tGAGACAATT AAAaCCyCCXS CTTCmGaAAT AATAATTTCA AAAATGACTA 4020

TGCAACAACA GGTAGTTCCA CGTTTTTGTT GTGAAACATT TTCGATTTCT ACAACTCTAA 4080

AAAATTAAAA ATAAAATTGC AAAACATCAA CATTTATTAT CAATAGCGAT AACTTTATCT 4140

TATCATCATG ATTCTAATTT CX5CCACCACA TTTAGTAATT TTTAGTCATA AAATTTAGTT 4200

ATAATTATAC GTTGTTTTGT TTATAAAATT TGATAATCak GAGTAATCtC GTAATATCAA 4260

AACaAAAAGG AAGTTAAGCG TTGTTTGGTT GcCTAACTTC CGTTATTGAA CTCATCcAGT 4320
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TCTCGTACTA 


AATATTGGCT AGTATTTTTT TAATTAAATT GTCTTCTTAT ATCAACTTTT 4440

TGTTGTTGTT TCTTTCGTTG CTGGTCTACT TTGATTTGTT TACCTACAAT CAGAAGTAAA 4500

CCCATAGCAA TACTTAAACT AATCATTGAT GATCCACCAA AGCTGATAAA TGGCAATGGC 4560

ACACCAGTTA ATGGAATTGT TGCCGAAATA CCGCCAATGT TTACAAACGT TTGACTTCCA 4620

AAGTATGTGG CAATCCCAAC ACACACAAGT TTATAAAAAT ATGATGATGT TTTATTTGCA 4680

AACTGGAAGG CACGATATAC AATAAAGAAC TCTAAAGTAA TAACTAGCAA TCCTCCGATT 4740

AAACCTAATT CTTCGCAAAT AATTGCAAAA ATAAAATCTG TATGTGGTTC TGGTAAATAG 4800

CCCAATTTCA TTGCACTATT TCCTAATCCT TTTCCAAATA CGCCACCGTT ACCTATCGCA 4860

AGCAAT6AAT TGGAAATATG GTATCCAGTT CCTGATTCGA ATTGGAATGG ATCTGTTAGC 4920

GTACTAAATC TGGCAGTTAA ATAACTTGGT AACCAACCAG CCATTAATGC AATGACAAAT 4980

ACTACTAAGA ATCCTAGCAC TGCTGGTATA CCAAATCTTA GGACTTTGTT 5030




(2) INFORMATION FOR SEQ ID NO: 325:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1389 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 325:








CTTGTTAATC CGAAAATTAT TAGTCAATCA AATGAAACAA TAACAGACTT nGAAGGTTCA 60

ATTACATTGC CAGATGTTTA CGGCGAAGTG ACAAGAAGTA AAATGATAGT TGTCGAAAGT 120

TATGACGTCA ATGGGAACAA AGTTGaACTA ACTGCACATG aAGATGTAGC AAGAATGATT 180

TTGCATATTA TAGATCAAAT GAACGGTaTC CCTTTTACAG AACGTGCGGA CCGTATTTTA 240

ACAGATAAAG AAGTGGAGGC ATATTTTATA AATGACTAAA ATAATATTTA TGGGTACACC 300

AGACTTTTCA ACAACTGTTT TAGAAATGCT TATTGCAGAA CATGATGTCA TTGCAGTCGT 360

AACGCAACCA GATCGACCTG TTGGACGTAA ACGTGTTATG ACACCACCAC CAGTTAAAAA 420

AGTTGCAATG AAATATGATT TACCTGTATA TCAACCTGAA AAATTAAGTG GATCAGAAGA 480

ATTAGAACAA TTGCTTCAAT TAGATGTAGA TTTAATTGTA ACTGCTGCTT TTGGACAATT 540

ATTACCTGAA TCATTGTTGG CATTACCAAA TCTTGGGGCA ATTAATGTAC ATGCATCATT 600

GTTACCGAAG TATAGAGGTG GTGCACCAAT TCATCAGGCA ATTATCGATG GTGAACAAGA 660

AACCGGCATA ACAATTATGT ATATGGTTAA AAAATTAGAT GCGGGTAATA TTATTTCGCA 720
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ATTAGGGGCA GATTTATTAA AAGAAACTTT ACCATCTATT ATAGAGGGCA CAAATGAAAG 940 

CGTACCTCAA GATGATACGC AAGCAACATT TGCTTCCAAT ATTCGACX5CG AAGATGAGCG 900 

AATTAGCTGG AATAAACCAG GAAGACAAGT GTTTAATCAA ATTCX5TGGAT TATCACCATG 960 

GCCAGTTGCT TATACAACTA TGGATGACAC TAACTTGAAA ATATACGATG CTGAACTCGT 1020 

TGAGACTAAT AAGATAAACG AGCCTGGAAC CATTATAGAA ACGACTAAAA AAGCCATTAT 1080 

TGTTGCTACA AATGATAATG AAGCTGTTGC AATTAAAGAT ATGCAATTAG CTGGGAAAAA 1140 

GAGAATGTTA GCTGCCAATT ATTTAAGTGG TGCGCAAAAC ACACTAGTAG GGAAGAAACT 1200 

TATATGATAG AAAACGTGAG AAGTCTTGCT TTTGACACGA TTCAAGATAT ATTAAATGAA 1260 

GGTGCGTATA GTAACTTGCG TATCAATGAA GTGTTGTCAG AAAATGAATT AAATGCAATG 1320 

GATAAGGCTT TATTTACAGA AATTGTCTAC GGAACCGTTA AAA6AAAATA TACGTTAGAT 1380 

TTTTATTTA 1389
(2) INFORMATION FOR SEQ ID NO: 326:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2746 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 326:

TTTGCTAATA ACAATAAAGC CAAAGCCGAT TCACACTCTA AACAGCTAGA AATTAATGTT 60 

AAGAGTGACA AAGTACCTCA AAAAGTAAAA GATCTAGCAC AACAACAATT TGCTGGTTAT 120

GCTAAAGCAT TAGATAAACA AAGTAATGCA AAAACTGGTA AATATGAACT TGGCGAaCTT 180 

TTaAAATTTA TAAATTTAAT GGTGAAGAAG ATAATAGTTA CTATTATCCA GTTATAAAAG 240 

ACGGTAAAAT TGTTTATACT TTAACACTTA GTCCTAAAAA TAAAGATGAT TTAAACAAAT 300 

CAAAAGAAGA CATGAATTAC AGTGTTAAAA TTTCAAACTT CATCGCTAAA GATTTAGACC 360 

AAATTAAAGA TAAAnATTCA AATATCACTG TTCTTACTGA TGrAAAAGGG kTTTATTTTG 420 

aAGAAGATGG ctnAAGTTAGA TTAGTAAAAG CTACGCCTCT ACCTGGTAAT GTAAAAGAnA 480 

AAGAAAGTGC TAAAACAGTT TCAGCAAAAT TGAaACAAGA GTTaAAAAAT ACAGTAACAC 540 

CTACTAAAGT TGAAGAAAAC GAaGCGATrC AAGAAGATCA AGTTCAATAT GAAAATACAT 600 

TAAAAAACTT CAAAATTwGA GaACAACAAT TCGATAACTC ATGGTGTGCA GGATTCAGTA 660 

TGGCAGCATT ATTAAATGCa ACTAAAAATa CAGACACTTA TAATGCACAT GATATTATGC 720 
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AAATGATTGA ATACGGTAAA TCACAAGGCA 
CATATGAACA AGTTGATCAA CTTACAAAAG 

GTGTATCTCA AAACCCTAAT GACCCACATT

CTAAAATTAA TGACCAAGAA AAACTTATTT 
TCCAAGATGC AGATTCAAGC CTATTACATT 
GTTCAATGAT AGGTTACkAA AAAGTAATAT 
TATCAACTAC AATTTATAAA nTAGTTTAC 
ACCAATATCA ATTTATTCAT TGGTAATTGG 
ATACGTCATG GCGATGATAC AAGTCACAAT 
CATCAACGCA TTAAATTTTC ATCTATTGAT 

GATGATACAC AGCATATCCT CATGCAAACG

CCAATCGTTT ATGAGCGCTT AGTATAACTA 

TGGCGAGAAT TCAATTTCTC ACCAGCTCGT 

TATTCTTTCT CTATTAATTT TTCTCAAACT 

CCCTTTTAAA TTCTTATAAA ATAAAAAAGC 

GACTTAATAT CATATTCAAA ATAACTTATG 

CACGTTTTTC TTTAAACGCA TCACGGCCTT 
ATGTTGCATC CCCAGCCATT TGTTGTAAAC 
CTGCTTTAAG GAATCGTAAC GCTGTTGGTG
CAGTTTCATC TTCAACTTTC TCTAAAGGTA 
CTTCTTGTGC ATTGTATTGA CGACATAAGT 

CGATACGTGC TAAATATCCT GAACCATAAC 
TTTGTCCAAA AATAGCATTA TCA6CAGCAA 
TACCGCCACC TACAGCATAA CCTTTTACCA 

TTAAACGCTG TAAATCTAAT ACATTTAAGC 
CATGTCCACG TTTCTTCTGG TCACCACCAG 
ATACGATAAC TGAAACGTTT TGATCATCAC
CAACTGTTTT AGGTGTAAAC GCATTGCGTA 
TCCCTTCGTA AAATTCATAT TTGATTTCAT 

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GAGATATTCA TTATCAAGAA GGCGTACCAT 840 

ATAATGTAGG AATTATGATC CTTGCAQnAA 900 

TAGGACATGC GCTAGCAGTT GTTGGTAATG 960 

ACTGGAATCC TTGGGATACA GAaTTATCAA 1020 

TATCATTCAA TCGTGATTAT AACTGGTATG 1080 

AGATATTGAT TAAAGGCAGG TAAAACTATG 1140 

GACACAACCA AACTCACACA TCTAGAACAA 1200 

AGTAATCATC AATTACAAAA ATCAATTTGT 1260 

CAATATCATA TTCTTTTTAT AGATACGGCA 1320 

AATGAAGAAA TCATTTATAT TCTTGATTAT 1380 

TCATCCAAAC AAGGTATTGG CACTTCGCGA 1440 

ATTTAAATGA TTTCACTTCA TAAAGCGGGT 1500 

TTTTTCATTG TAATAATAAT CTTTAACATT 1560 

ATCTTATCTT TATGATAATT AATTAAAATG 1620 

CACCTATCGT CGCTAATAAA CGACGCAAGT 1680 

GGAATTTAGG GAATTGATCG AAGTCAGGAT 1740 

CTTTCGCTTC ATCAGTTGTG TAATAAAGCA 1800 

CAGCTAAACC ATCTGTGTCA GCATTCATAG 1860 

AGTGTTTCAT AATCTCTTTA CACCATTGCA 1920 

CCACTGTATT TACTAGACCC ATATCTAAAG 1980

ACCAAATTTC ACGTGCTTTC TTATGTCCAA 2040 

COGCATCAAA TGAACCTACT TTAGGACCAG 2100 

TOGTTAAGTC ACAAACAACA TTTAGTACAT 2160 

TCX5CGATAAC CGGTTTTGGA ATAATACGAA 2220 

GAGGGATTTG GTCTTCACCT ACATAACCAC 2280 

AACAGAATGC TAAATCACCT TCACCAGTTA 2340

GTGCACGTGA AAATGCGTCA ATCATTTCAG 2400 

CTTCAGGGCG ATTTATTGTT ACCTTAGCAA 2460 

CATATTCTCT AAGTGTTTCC CATTGTCTGT 2520 
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TCCACATGAA TTGTATGACC TGTAGCAGAA ATTAATTTAC ATTTACTATT AGGAATTAAA 2640 

TTTGCCATTT TTTTCXKrAAT CTGTACAAAT TTTTCATCAT ATTCTCCAGC TAATATTAAT 2700 

GTTGGTACTT TAATTTCTnT CAGCGCGGCC ATAAGTTTGG CATTTG 2746 



(2) INFORMATION FOR SEQ ID NO: 327: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 900 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 327:

TTTATTAACA GTAAGTTCGT ATATCAATGT TTAGTGCTCC CCAAAATTGA 60

AGTTTGAATT TTAAAA6CAT CTTGTAGAAT TTAGTTGTAT TTTTTTCAAA GAAATTCATT 120

TTGATTATTT TTGATAATGA GCATTTTAAT AGTAATACAT GTTTATAGTG TGTAGTATAT 180

GTCTATACTA GTAGTAACTA TATAGAGAAA GTAGGAATAA ACTATGTCAC AAGATGTAAA 240

TGAATTAAGT AAGCAACCAA CGCCAGATAA AGCAGAAGAT AACGCATTTT TCCCATCACC 300

ATATTCCCTT AGTCAATATA CAGCACCTAA AACAGATTTT GATGGTGTTG AACACAAAGG 360

TGCCTATAAA GATGGTAAAT GGAAAGTATT GATGATTGCT GCTGAAGAGC GATATGTATT 420

ATTGGAAAAT GGAAAAATGT TCTCTACGGG TAATCATCCT GTTGAAATGT TATTACCTTT 480

ACATCATTTA ATGGAAGCAG GTTTTGACGT TGATGTTGCG ACATTATCTG GTTATCCAGT 540

TAAATTAGAA TTATGGGCTA TGCCAACTGA AGACGAGGCA GTTATAAGTA CTTATAATAA 600

ATTGAAAGAA AAATTAAAAC AGCCAAAAAA ATTAGCAGAT GTGATTAAAA ATGAATTAGG 660

ACCTGATTCA GACTATTTAT CTGTCTTTAT CCCAGGCGGA CATGCTGCAG TTGTTGGTAT 720

TTCTGAAAGT GAGGACGTTC AACAAACATT AGATTGGGCA TTAGACAATG ACCGCTTTAT 780

AGTTACATTA TGTCATGGAC CAGCAGCACT ACTTTCAGCA GGGCTTAACA GAGAAAAATC 840

TCCATTA6AA GGATACTCTG TTTGTGTCTt CCCTGaCTCA TTAGATGAAG GTGCAAATAT 900



(2) INFORMATION FOR SEQ ID NO: 328: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3642 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



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ATCTGAnTnG AGATAGTGAT AATGTGTCAC CcAATTTTaA AcCtTTATTT 


TTaAGcGtTT 60

CaTCaGCTAA CaCtXCaTTa TCtTTAGtCG CTTTATGcCC CTCtATTAAA CtCXXSaACtA 120

AAAATGaTGa CtTTTCAACa CCGAACaCTA AAACaTTGtC tTTTTGATGG CCaTTAGACA 180

CAATTTCCCC TGTTTGCTTC AAAGTAGCTT GCTTCTTGTA TTTATTTTCA ATATCTTTCT 240

TGTTAAAAAC AGATTGTTGC ACAGTTTGAT TGGCATCTTT ATTTAGAACA ATGGCATCTG 300

CTTGCCACTT ATCAATGCCT TCTTTATTCA TATTGATAAG ACCATTCGCC AATCCAGATA 360

ATAAAAATAG CAAGTAACTA ATCATCGTTA ACACACCAAT AATTAGTCCA AACTTCAATT 420

TGTTGCGCCG TATTTCATTC CAAGCTAAAA ACATGCATTT CTCTCCCTAC TACTATGATT 480

TAAACATTGT TTATATTCTT AGATGCACGT ACGTCGTGTT GCGCTCTCTA ATGTTATACA 540

TACACTTATC CTTCATTATA CCCGAACTTT TTATATTAAA AGCAAATTTA TGGAAAATGC 600

AATTAATTGT CTATTATTTT TGTACGGTAC ATTTAAAATT AAGGATCAAT TTAAAAACGC 660

CTACATATAC CTTTAAGTAC ATGTAGACGT CCAATTCATA TATTATTTAA CTTCGCCTGT 720

TTTAGGATCG AATTGCTTAA TAGCATTTTT ACGTAATTTA TCTTTTGCTT TrTCACTTGC 780

TTTATAGTTA TTGTTGTAAA TCGTAGCTTC CCAACTACCA TACATTGGGT TAGGGAAAAT 840

GATATATTTC TTACCGAAAT CGTCTTTATG TTTTTCAATT AATGCTTCAC GAGATTCAGC 900

TGTAGCTTCT TTTGGATCTG TAAAGTCTAA TAAATTATCT CCAAATAGCA TGACAAGTTT 960

ATGATCCTTT TGAACCATTT GTCTGCGTGA TTCTTTACTC TTATCATCTT TACCTTTTAG 1020

TAAAATATGA CTCTTCTTAG CTTGAGGGAT ACCTTGTTGT TTTAAGTTCT TTTGTGTTGC 1080

CTTTAAATCT TTTTCTTTAT CTCTATCAGA AATATAGTAG ATATCGACAC CTTTTTTGTC 1140

AGCATATTTC AAGAATTCTT TTGCGCCATA GACAGGTTTA GCTTTAGCAG CTTGTACCCA 1200

TTCATGCCAA CCTTCTGGGA AAGGTTTATT ATGTATTGAT GCATAGCCTT GATATGGAGA 1260

ATTATCTAAA ACTGTTTCAT CTAAATCCAA AGCAATAGCT AACTTATGTT TACCTTTATT 1320

CTTTTTAATC TCTTTATCTA ACTGTGTCTT TGCACTGTTA TAACCTTGTA AATATAATGC 1380

TTTTGCTTCA GCTGAATTTT GATACCAAGC CACTGCCATA ATATTTTGAT TACCAAGATT 1440

CGCCTTTTGT GATGCTGGTA TAGAAGCTTG TTGCGTTTGT TGAACTTCAG CAGAACTTTT 1500

GGCAAACGCT GTAGAATTTG TCGTTTGTGG TGCTGAAACT GTAACCGCTA CCGATAATGA 1560

TGCTATTGCA ATATACTTTG AAATTTTATT CATCTTATCA CCTCATGATT AATATTTAAA 1620

ATACAGTTAA AATTATAAAT GCATTTATTT AATATTGCTA TACTATGAAA AGATATTTAA 1680

TATTATTTCT TGgAAAAGCT AACAAATATG TGAACATTTC ATATAAGCAT GATTTAATGG 1740
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GTGACTAGGC 


CTTCCTATCA GACATATTCA CTCATCCACG TATCATTATG TGTACAGTGT 1860

GCTATCTCTT ATTTACCTAT TGGAACAACC ATAAACTCAT CCATAGTTTA CCTTTTATAA 1920

ATAGCAGTCC TCACTCATAC AATTTCTCAT AAAAATCACA ACGCTCCAAC GTATTTCCAA 1980

CTTACTTTCA CCTATTTTAA TTCATAAAAA CGACACTTTA ATTGTCATTA TCCAATAATA 2040

GCAAGACGTT ATTATTGCAA TCTTTTTTAT AAAATAATAG AATCATAGTA TTGTCATTTA 2100

AAGATAAAGT AAGAACGTTT TTATTTTTCA GATTTTTTAA ATTATTATGA ATATCTAGTT 2160

TTAGGAAGGA AATTACATTG AAAAAACAAG TTATTATTTC GGGCCTCATG TTATTTTCAC 2220

TATTTTTTGG AGCCGGAAAT TTAATATTCC CGCCCATGCT TGGCCATACA GCGGGTCAAA 2280

ATATGTGGAT TGGTATGCTA GGCTTTGCCC TTACAGGCAT ATnACTCCCC TTTATTACTG 2340

TTATTGTTGT TGCATTTTAT GATGAAGGTG TTGAAAGTGT AGGCAATCGT ATACATCCAT 2400

GGTTCGGGTT TATTTTTGCT GTCGTGATTT ACATGTCTAT GGGAGCATTT TACGGTATTC 2460

CACGTGCTGC AAATGTCGCG TACGAAATTG GTACAAGACA CATTTTACCT GTGCATAACC 2520

AATGGACTTT AATTATATTC GCAGCAATCT TTTTTGCCAT CGTTTACTGG ATTAGTTTAA 2580

ATCCATCGAA AATCGTTGAT AATTTAGGTA AATTATTAAC ACCGTTATTA CTATTAATGG 2640

TCGCTCTATT AAGTATTGCT GTCATTTTCA ACCCTGAATC TGCACTAAGT GCACCTAAGG 2700

ATAAATATAT AACACATCCT TTCATTTCAG GAAGTTTGGA AGGCTATTTT ACAATGGATC 2760

TTGTTGCTGC GTTAGCTTTT TCCGTAGTCA TTGTCAATGG CTATAAGTTT AAAGGCCTCA 2820

CAGATCGCAT GAAAATTTTA AAATATGTCT GCTTTTCAGG TCTTATTGCA GCCATATTAC 2880

TTGGAATGAT TTACTTTGCA CTTGCATACX3 TTGGGGCATC AACAGCTCCA GGAAACTTTA 2940

AAGAT6GTAC AGATATATTG ACGTACAACT CATTACGATT ATTTGGTT06 TTCXK5TAACC 3000

TCGTATTTGG AATGACGGTT ATCCTTGCAT GCCTAACAAC ATGTATAGGA CTCGTcAATG 3060

tTGCGCCACA TTTACTAAGA AACAGGTACC TAAGTTTTCT TATAAAATAT TCGCACTTAT 3120

TTTCtCTATC ATAGGGTTCT TATTTACAAC ACTTGGTTTA GAAATGATTT TAAAAATTGC 3180

TGTCCCATTA TTGACTTTAA TATATCCCGT GTCGATTGCA CTTGTACTCA TATCATTTGC 3240

TAACATGTTT AGCACATTCA GATTCAGTTG GGCCTATCGA CTCGCAACTG TTATTACATT 3300

GATTATTTCA ATTTTACAAA TACTAAATAG TTTCAACTTA TTACACGGTG TTATTTTGAA 3360

ATCGTTTATG ATGTTACCTT TAGCAGATAT CGATTTAGCT TGGCTTGTAC CATTCATGCT 3420

CTTTGCTATT ATCGGTTTCA TAATOSATGT ATTTATACGC CGTCCGAAAC AAGCGACAAC 3480

TTAATAAATG CTCACTGCCT AGTAATGATT GACCCATCGT TACTAgGCTT TTttATATGA 3540
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TAAAAAATCC TAGCTGTTAT TCAAAAATAT TAGTTTTTAA AA 3642 
(2) INFORMATION FOR SEQ ID NO: 329:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2187 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 329:

TTATTGATAT TGAAAATTCA AAAACTGCAA CACATCTTAT CACTAAAAAC CAACACTATC 60

ATCAAACTGA CACTCATTTT GAACAGTATA AGAAATTTAT TTTAGATTCA GGTATATCAT 120

CAACACAATT TGTATATAAT AACCTGTCTG TAAGCGGATT TAAATATACT AATGATGGTA 180

AGAATCCAAT TCAATTATCT GACATAGTGT ATCACTTAAT CGCATTATTA CGATATGGCG 240

GTGGCATTAG CTATCAATTA TTAGATGACC ATTCAAATTA TATTTCCTTG TACAACAAAT 300

ATGGTAGCCC CCTACCATTA ATGCATTTAT ATAAAATGTT TAGACCTTTT GTTAATGAAO 360

ATATTGAAAT TACAAATAAT TATGTATTGA GTCGTAAAGA TAATAACTAC CATTTCTTAT 420

TATTCAATAA AATTAATGAT CGATATATGT CAGACGTAAA ACAAQATTTC ATTTTCCATA 480

ATGAATTACC TCAAGACTCT TTGATGATTA TTAAAACATT GAATCATGAA CATGGTTCAA 540

TTCAACATTT GCTTCCAATA AGCGATCAAC TTGTTTATAT AGAAAAAGAA ATTTTAGATG 600

AATTAGACAA AAGGAATTAC CCTAAAACGG AGCTTGCAGT TCAAGAAGAA ACTGGTCGTA 660

CATTTGAACT CAAGTTAAAT CACGACGAGG TTAAATATAT TTGCTTTAAA CCAAGCTAAA 720

TACTAACAGT CCTCTTGTGT TTAGTTTCTT ACGTTAAAGG CTATTTATAT CATAAGGAGA 780

TGATTTGTAA TGAGTAACTC ACAAGCAATT CAAGCAATTG AAAACGTGTT AGTAACGTCA 840

AAAGTTGGTG TATTATCAAC TGCATATAAT AATAAnCCTA ATAGTAGATA TATGGTCTTT 900

TATAATGATG GTCTTACTTT ATATACTAAA ACGAATATCC ATTCTGCTAA GGTCAAAGAA 960

ATTAAAGATA ATCCAGCAGC ATATGTTTTG TTAGGCTATA ACGACACAAC TAATCGTAGT 1020

TTCGTTGAGA TGGAAGCGAC AATCGAAATC GTTACAGAAC AAGAAGTGAT TGATTGGCTA 1080

TGGGAAACAC AAGACAAAAG crrrrrcAGT TCAAAAGAAG ACCCAGAGTT ATGTGTTTTA 1140

AGAGTAGTTC CGCAATCCAT TAAGCTAATG AATGATAAAT CATTAGATAC ACCTATCAAA 1200

ATCGATTTAT AACACAAAGT GTATATAGGA AATAACTTTT ATGAATTCTA GATATAACAA 1260

TGTTAAATAC TTAAAATAAC TCGCTATAAT TAAAGTGTTT AATATGTTTA CAATTCAATT 1320
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TTTAAATTAA TTTTATGTAA TATAAATACT GCATTTGCAA ACTGTTGCAC TTTTAGGTAT 


1440 




AACAGAATTA ACTACATTTA AGGAGATTGA TGAACCATGA AAAAGAAAAA AGGTTTTGGT 1500

CTTGGTATTA GTTTAATCGC CATCATGTTA ATTGTATGTA TTGTATTAGT AATCATGATG 1560

ATGACTGGCG GAAAGAAAGA TACATACTAT GGAATTATGA AAGATAATAC TACTATTGAA 1620

AAAATGATTA GTGAAAAAGA TGAAAGTATT GAAAAAAATG TTAAATTACC TTCAGATTCA 1680

GATGTTAAAG TTAAAAAAGG TGATTTTGTA ATTGTTTATA AATTAGCAGA TTCAGATAAA 1740

ATTGTTAAAG TTAAAAAAGT TGACCATGAC GATGTACCAC ATGGTTTAAT GATGAAAATT 1800

CATGACATGG GCAAAATGCA CATGAAACAC TAATTGTAAT TTAAATTACA AATTTTAGTT 1860

GCCATCAAGG TATATACGAG TAAAAGCAGC GGTAAGTTGA TTTCCAATTT GGAATCATTT 1920

TACTGCTGCT TTTTATATTT GAAATACTTT CATATTGAAT AGCTCCACTT GCCGTTCGCC 1980

TgcgCTTTGC GCATGCATAA AAGCCCCTAA CAACCTGAGG TCACTGOGCT CCGGTTCGCC 2040

TGcgCTTTAG CGCATGCATA AAAGCCCCTA ACAACCTGAG GTCACTACGC TTCGGTTCGC 2100

CTGCGCTTTA GCGCATGCAT AAAAGTCCCT AACAACCTGA GGTCACTACG CTTCCGTTCG 2160

CCTGCGCTTT AnCATGGCCA TAAAAGC 2187




(2) INFORMATION FOR SEQ ID NO: 330:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1788 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 330:

CCnCCTTTTT AAACCTGGnG AAATGtmCAG tTTTGATGGt ATTGGGtTCT TTAGTATTAT 60

GCTTAAGATA GAGTGTAATA CGCTCCTGTT GTTCTTTATA GTATATTGCT TTTTGTTTTT 120

CTTTTTCGTC ATATTTCACT TTTAAATAAA TGACTGATGC AACTATACAT ATACATAAAA 180

TGACACCAAT AATTATAAAA ACATGTTTTT TGTTCATTAA AGTAACTCCT AAAATGTGGT 240

GGAAAATGAA AAAATTTTTA TAATCTATAA TTATGAACAT TACAGATTAT AAACCATAAC 300

ACTAACATCG TCGCCTTCAT TAACTTGATT GTTAAAATCA GCAATTACTG AGAATTGTAC 360

AAGGAATAAT TGCCTATTAT GCCCTCATGT AATTATTGCC TTACTAACAT TAACAAAATC 420

GATAGCTATT ACATTAAATG CCTATACCCC AGACCTCAAA CACCTTTTTA TACAGGACGC 480

ACTGTAATTT CATyyACGTT AACGTGCTTT GGTTGTGTTA ATGCATATAA TACTGCTTCT 540
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GTATCTACCA TTCCTGGAGA AATGCTTGTT ACTTTAACGC CTGTCTTTGC CAACTCTTTT 660 

TCTAATCCTT GAGTAATAGT GTGAACTGCT GCTTTCGTCG CACTATAAAT CGTACTACTT 720 

TTCGTTACTT CAAAGCCAGA AATAGATGcA ATGTTAATAA GATGGCCACT TGATTGTTCT 780 

AACATAGTTG GTAATGCAGC CTGTGCCGTA TATAAAGTGC CTTTGATATT CACATCAATC 840 

ATACTATCCC ACTCATCTAC TTGATAATCA GTAATCTTAG ACGACAACAT TTGCCCCGCA 900 

CTATTGATAA CAATATCCAA ACCACCGAAT GTTTGTTGTG CAATTTTTAT CAATTCATCG 960 

ACTTCTTCTT TATTCGTTAC ATCTGTTGGC ACTACCTTCA CACTATCTTG TGACAATTGA 1020 

TTCGCTACGT TTTGTAATTT TTCTTTATTT CTACCTGCTA AGACAACTTT TGCCCCTTCT 1080

TCATGTAGTA ATGTTGCAAT TGCTTCTCCA ATACCACTAC CTGCACCTGT AACTACTGCT 1140 

ACTTTATCTG TTAATACTGT CATAATGATC GACTCCTTTG ATTCTTTTTA TTTTTTCAGG 1200 

GTAAATCATA AATACATATT ACTTTTAAAA AGCGTATCAC AATTCATATA ACXSGTCATAA 1260 

TAACTCGCTT CATTTTCATA GATAAATTAC ATTACAAGCC ATTCGAAACA TACAATTAAT 1320 

CGTTGCTTAT ATTTTTTATT TTTAAAAATG TTGAAAAATC GTCACTTCTT TATTGTAAAA 1380 

ACATTATATT AGTAATAAAG TTAATACTGT GnATTTaTCA TTCGATTGAA TGATTAGAGG 1440 
GAGGAATAAA ACgTGACATA TCATGAGCGT GTTTTAGCAT TAAGAGCAGA AAGTAAAAGA 1500 

ACCGCATTTG ATTTTCGATT CGAAGATTTA TTTAGCAAAG AAGAATGGcT AAGTATGTCT 1560

CTTGCAGAAA GACAAAAAGC TGAAAAAGCA TTTCGACACG AgTTAAAAAT ATGGACGATG 1620 
TAAGAATGCC CTTCTCAAGT GTCCATGACG CCCAAGTAAA ATTATATAAT GTTGTATATT 1680 

CTTATAACGG CaTTAAACGT AATTTTAAAC AAGTTGAAAA TGGAAGGATT CTAATATCAT 1740

TTCGTTTATA TATnGCAGAC CATGGATAGA ATTTTtlTATG GTnAATCC 1788 
(2) INFORMATION FOR SEQ ID NO: 331: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1341 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 331: 
TAAGCCAAAT CCATTGTTAC CAGAAGAAGT TCGCTTCATC TTAAAACAAA TGGGTCTTAA 60

AGAAAAGACT ATCGATGTTG AACTCGAAGT TGGCGAGCAA GTTCGTATTA AATCAGGTCC 120 
ATTTGCGAAT CAAGTTGGTG AAGTTCAAGA AATTGAAACA GaTAAGTTTA AGCTAACAGT 180 

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10 



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GCTTTAATTA ACAATTAAAG TTATTAAACT AACCAAAAGA TAAAAAAGAG TATTGATTTT 300 

TTAATTAGAA AAGTGTTAAA ATTATGTGGT cGcGCTTTTA GAGCGCCCAT TTCGTcACGA 360 

AATGTTAAGA GTGGGAGGGC AAAACTGAGC CCTGTGACCA CATCACGATA TCAAGGAGGT 420 

GCACATCGTG GCTAAAAAAG TAGATAAAGT TGTTAAATTA CAAATTCCTG CAGGTAAAGC 480 

GAATCCAGCA CCACCAGTTG GTCCAGCATT AGGTCAAGCA GGTGTGAACA TCATGGGATT 540 

CTGTAAAGAG TTCAATGCAC GTACTCAAGA TCAAGGAGGT TTAATTATTC CGGTAGAAAT 600 

CAGTGTTTAT GAAGATCGTT CATTTACATT TATTACaAAA ACTCCACCGG CTCCaGTATT 660 

ACTTAAAAAA GCAGCTGGTA TTGAAAAAGG TTCAGGCGAA CCAAACAAAA CTAAAGTTGC 720 

TACAGTAACT AAAGATCAAG TACGCGAAAT TGCTAACAGC AAAATGCAAG ACTTAAACGC 780 

TGCTGACGAA GAAGCAGCTA TGCGTATTAT CGAAGGTACT GCACGTAGTA TGGGTATCGT 840 

TGTAGAATAA TTTTACGAAT ATTAAATTTG ATTACATGAT TTAAACGATG AAGCAGATAA 900

CAGAGATAAT AATGATGAAT TATAAATATA ATCTGAATGA CTAGATTAAT GATTGATTTA 960 

TTCATAAGAT TAATTCTTCT GTTGTCTGCy CTTAACTTGC ATATAGCAAG TAATGTGGGA 1020 

GGAAATTCCG CTAAAACCAC TAAAGGAGGA ACTATAAATG GCTAAAAAAG GTAAAAAGTA 1080 

TCAAGAAGCA GCTAGTAAAG TTGACCGTAC TCAGCACTAC AGTGTTGAAG AAGCAATTAA 1140 

ATTAGCTAAA GAAACAAGCA TTGCTAACTT TGACGCTTCT GTTGAAGTTG CATTCCGTTT 1200 

AGGAATTGAT ACACGTAAAA ATGACCAACA AATCCGTGGT GCAGTTGTAT TACCAAACGG 1260 

AACTGGTAAA TCACAAAGTG TATTAGTATT CGCTAAAGGT GACAAAATT6 CTGAAGCTGA 1320 

AGCAGCAGGT GCTGACTATG T 1341
(2) INFORMATION FOR SEQ ID NO: 332:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5136 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 332:

CTCTAAATCT TCAATAGGTA ACTTCATTTT AATAATTCCC ATTTGAACAA TTGCTTCTTC 60 

ATCATTGACA ATTTCTAAAA CTTCACCCTT TTGACCGTAA GAkAATACTT TkACTTCATC 120

ACCAGCAACA ATTTTATCGT ATTTTTGCTT TTGTACATTT TGCTTTATAG ATTTCGCTTC 180 

ATAATGATCA TCTAATCGTT TCTTCTTATC AATCAATTCA TGTTCTTTAA CATCTGCACC 240 
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CTTAATCTTC TGATTTGCTT TkTCTTTCGC TTCCTCTATT AGAGACTTTT CATAATTTTG 360 
GAATTGTTGA TACTGCTTAG ATAAATCATC ATGCACTTGt CCGCkTskTT TACAAGACGG 420 
TCCAGkTCTA ACCTCTGTGT CTCTACACGT TTGTAATTAC GCTCTAATGA TTCAATCATT 480 
TCATTTATTT CTTTTTCATC AGTACCAATC ATCGTCTTAG CCTTATTAAT AATATTCAAA 540 
CTAAGACCTA ACTTTTTAGA AATGTCAAAA GCATTTGAAC GACCCGGCAC ACCCATTAAT 600 
AACTTATACG TTGGACTCAA AGTATCTACA TCAAATTCTA CACTCGCATT CATAACGCCT 660 
TCTCGATTAT AACTATATGC TTTAAGTTCA GGATAGTGCG TCGTTGCCAT TACTAGAGAA 720 
CCAATTTTTC TAACATGATC TAAAATGCTC ATTQCTAATG CAGCACCTTC ACTCGGATCT 780

GTACCTGCAC CTAATTCATC AAATAAAACT AAACTATGTT TGTCTGCATG CTTTAAAATT 840 
TCAACTATAT TCGTCATATG AGATGAAAAA GTTGATAATG ATTGTTCTAT TGATTGTTCA 900 
TCTCCGATAT CGCAATATAC ATTTTTAAAT ACACTTAACT GACTACCATC AAGTGTGGGA 960

ATCAACAATC CTGATTGAGC CATAACAATA ATTAAACCTA ATGTTTTTAA TGTTACAGTT 1020 

TTACCACCTG TATTCGGTCC TGTAATAATT ACCGTTTCAA TATCTTCCAT AAATTCGATG 1080 

GTATTAGCTA CAACAGTCTC ACGATTTAAT AATGGATGGT ATGCTTTAGG TAAATATACA 1140 

GTACGGTCCT CTTTAAATAT CGGCTTTGTT CCTTTAATAC TTCTACTATA TCTCGCTTTT 1200 

GCGATTAAAA AATCTAACTG ACCCATGACT TGTTCTGCCA CAAGTAGTGC ATCTTTGTCC 1260 

GCAGCCACAT AACCAGTTAG TTGCGTTAAA ATGCGTTCTT TTTCAATTGC TTCGTCATGA 1320 

CGTAATCGAC TAATTTGATT ATTCATTTCA ACAACTGATG ATGGCTCAAT ATACAATGTT 1380 

TGTCCTGAAG CAGATTGATC ATGTACAATC CCATTAAAAT CTTGTCGATA TTCAGCTTTG 1440

ACAGGTATAA CGTTTCTTrC ATTCCTAACT GTTACAATAG CATCTGATAA TTTTTTCTGA 1500 

TTTGCTTGGC TTTTAACAAT ACGGTCCAAA TTTTGTCTAA TACX3TTGATT CGTGCTAGAA 1560 

ATTTTACTTC TAATCCCTTG CAATTCATAA CTCGCATTAT CATATAAATC ATACGTATCG 1620 

CATGTTTCAT TTATTTGTTG AAAAAGATCA GTTAACACAG GTAATTGATT CATCTTGTCA 1680 

TCTAATATTG GGTATTTAAC ACCTTCATCT TCTTCAACCA ATTGATTATA AAATGTCTTG 1740 

AATTGATTTT GTACTTGAAT TAATCTTTTT ATCAAGTTAA GCTCTGATAC ATTTAAAACG 1800 

CCGCCAATAT CAGCGCGATG AATGAATGCT GATACTTTAG ATAAGCCACT CAAGCTTGGT 1860 

AAACGATGCT TATTATAGAT TTGAGCAATC TCATCCGTTT CTTCCATTTG AAAAACAACC 1920

GTTTCAAAAT TAGTAGCTGG CATCATTTGA TTGACCTTTT CCAAGCCTAA GTCACTAATA 1980 

GTTTCATTGG CAACGAGTGA TTTTATTTTT TCAAATTCTA AGACGTCTAA TGTTTTTTGT 2040 
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TCACGCGATA ATGCGTTAAT CACTCTATCT TTTGTtACAA ATCCTTTTTG CGCAGTTGtA 2160 

CGCCATAATT CATAAAATCT AAATGATTTG TATGATGCGC ATCAGTGTTA ATAGTTAATT 2220 


TCACATTTGG ATATTTACGA ACGATATCAG CGCTCAGATC CAGTCGATGT GGATTGGCAT 2280 

TAATTTCTAA TACTGTATTC GTTTCTTO^G CTAATGCCAT TAATTGTTCA ATATTCGGTT 2340 

TATAACCATC TCTTCTACCT ATAATACGCC CTGTTGGATG CGCTATATGT CXX:ACGTATG 2400 


GATTGCGACA TGCATTAGCT AATCGTTCCA TAATTTGTTC TTCTGATTGG TTAAAGCTTT 2460 

GATGAATAGC TCCAATTACA TAATCAAGTT GTGCTAAAAT TTCATCATCA TAATCCAGCG 2520 

AGCCATCAGG TAATATATCC ATTTCTGTAC CTGAATAAAT ATCAATTTCA CTATATTCTT 2580

TATCTAAAGC CTTAATTTCT TCGTTTTGTC TTAAAAGTCT TTCTACTTGT AAGCCATTAG 2640 

CAACACGTAA ACTTTGTGAA TGATCAGTAA TTACCATGAA TTTATAACCT TTTGCQATAT 2700 

TTGCTTCTAC CATGTCTCGA ATAGAAAACG CACCATCACT ATACGTTGTA TGCATATGAA 2760 

TATCACCATT AATATCATCT ATTGTAATGA TATTACTTAG ATCTTTATCA AATTCGCTAC 2820 

CATCTTCTCG CATAGCAGGT GGTATAAAAT TCACATTAAA ATGTTCATAT ATCTTGGCTT 2880

CACTATCATA TTGAATTAAT GTACCATCAG CTTGTTCAAT TCCATATTCA CITACTTTTT 2940 

CATCACGTGC TTTAGCAAGT TGTCGAATTC TTATATTATG TTCTTTTGAC CCAGTAAAAT 3000 

GCTGCAATGT ATGATAAAAA GCACTTGGTT CAATTAATCG AAAATCGACA CCAATCGTTT 3060

CATCATCATA CGCTAATTCT AATGAAACTT TTGTGTTCCC CACTGCAACT TCTTTTACTT 3120 

TATTGGGAAT ATTTAATAAT TGCTGCTGCA CTGCTTTTGG GTTATCGGTA CTTATTATGA 3180 

AATCTAAATC TTTGCTCATT TCTTTAAAAC GACGGAAGCT TCCTGCAGAT GAATATTGAT 3240

COATATAATT TAATGTATCT ATATAATCAA TGATTTCTTG ATTAAGTCTT CTCATTTGAT 3300 

CAATTGGATA TCTATCTTTC TTAGCACCAA GTTGTTTCAC AGCTTCTAAT ATGTTTTGTT 3360 


CCGTTTTCTT AGCAAATCCX5 CTTAATTCAC TAACTTTTCC ATTTTCACAA GCAACTTGAA 3420 

GTGACGCTTT ATCAACAATA TTCAACTCTT TATATAGCTT AGCAATTTTC TTGCTTCCAA 3480 

GTCCTTGAAT TTTCAAAAGT GGAATAAGAC CTTCCGGAAC TTCTTCCTGT AATTGCTGTA 3540 


AATACTGAGA TTCACCGGTC TCACGGTAAT CATTGATTAC TTCTGCAACA CCTTTACCAA 3600 

TGCCTTTtAA CTCCGTtACA TCAGATATTT CATCTAATGG TCGTTCATCT AATTCAAGAC 3660 

TTTGAGCTGC TTTTCXSaTAC GCTGaTATTT TAAAAGTATT TTCCCCTTTT AATTCCATAT 3720

AAGTAGCAAT TTGTTCTAAT AGTTTGATAA CATCTTTTTT TGTCATAATA ACACTCCATA 3780 

AAAAGAAGAC CAGGACGTAT CATTAATATA TACCTTTGTC CTGACCTCTT ATGATAATTT 3840 
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2S 



TAGATATTTT AAGCTGATGT TGTAATGCTT CGTTAGGATA TAATGCCAAT AGATATAACG 3960

TAAAGTGTAA GACAATTATC GTCATAAACA CACCAACTAT CATTCCCATT GCACGACTGA 4020

AAATATGAAT GTTTTGATAC GCTATTATTT TATCAAAAGT TACGATAATT AGATATAAAA 4080

TGAACTTACA AAACAATGTA ATCATTAAAA AAGCTACAAT CGCTTCAAAT CGATTTTGTA 4140

GATGATTAAA ATGAAACGCA AAAGTTGTAT TAAATGCTGT TGTTTTAGGA TATGGAATAA 4200

ATACAATTAA TCTTTCTACA ATAGATTTGT AAAATTGACT GGCAATCCAC AATGATACAA 4260

nCGTTGCACT CAAATGTATC ATAGATAACC AAAAACCTCG TCTGAATCCA ACGATGACAA 4320

AATACACAAA GAAAATGATT ATGATAAAAT CAATGACCAT TTATTGCTCA CQCTGCTGCA 4380

ATTTOTQAAT TTGTTGTTTC AAAC6TCGAT TTTTTTTCTT CTAGTAGTAC TTTTTCATGC 4440

ATAATATTCA CAGCAGTTAG TATTGCTTTT CTTGAAGTAT CTAAACCTGC TGCTTTATAC 4500

CCTAATTCTT TTATTTTATC ATCAACTAAA TGTGCTACAT ATCGTATGTG CTCTGGGTTA 4560

TCTTCCCCAA CAATTGTAAA AAGCTGATCA TTAATTGATA CATTTACCTT GTTTTTAAAC 4620

TGTGTCATTT ATAATTTCTC CTGATCCTTT TTTTAAAATC TAAATTCACG TTATAAAATA 4680

TGACTGGATA GTTTGTCTGA ATTTGATACT AATATTGTTA TATTGCAATT ATGATAAAAC 4740

AACAACACAA TCTCTATAGA TGACTTAATG TTCTTTTTAT AATGAAATAA TGTAAAGAAT 4800

TTTCTATTCA ATACTTTATC ATGTTTAAAT TGTGTCACTA TAACATTTTC ATAAACATTA 4860

TACATGACCA CTATGTATTT TQTAAGTATC CGCAATTAAT TCTTTACAAC ATACATAAAT 4920

GTTTCTGACG TTATTATCAT TTATGATATG ATTATTTTTG CTAAAGACAA TGAAATTTTA 4980

TGAAAGGATT TACACAAATG GCGAATATCG TTTTTAAATT GTCGGATAAA GACATAACGA 5040

CATTAATGTC ACGCATTTCT TTTGATACTG AGAATTTACC TCAAGGnATG AnAGCACTGTG 5100

CAAAGTATCC AAAATACAAC TGTAAATATT TACCAT 5136

(2) INFORMATION FOR SEQ ID NO: 333: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4239 base pairs 

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 333:

GGCCAAAATT GCACCTCCAC TTTCCTTTGA ACAAATCTAT TTTAAACGCA CATTCCCATT 60 
ATGACAAATT AAATGTGAGT nACATTTGTT TGTATTTTAA CATGACTACT AACGCAAACT 120 
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GGTATCTTTC AAAGATAAAA TCTTAATAAT TTCTTAGTAA ACTCTTTTCT CTAGATTTAT 240 

CACAATATTA TATAGACCTA TTTTATTTTG ACXJTAAGTTG CTAGTATCTT CAAACAAAAA 300 

ACCTTTATAA AAATTCATAC CTTTATGCTA TCGCTGTAGG CTCATTAACT TGTTACATAT 360 

AATTCTTAAC TATCCTTTGA TGATTGTTTT ATTAGATTGT TTCGTTGATG GATACTTTCA 420 

CGAATTTCTA TAGTTCAATG CTACTAAAAA AACAGCCCAA AACTTTAATT TGTTTTGGAC 480 

TGTTTTATAA TTATGCTTGC GATGGTGTTT TAGTTTCTGA AGTTTGTTCA GCAATGTCAT 540 

ATTTAAACTC TTTACCATCA TGATCTACTG TAACTTTCTT ACCTTCAATT TGATTACCAT 600 

CTAATATTAA TTCACTTAAA TTATCTTCGA TAGTTTTTTG TATCGCTCTA ATTAATGGTC 660 

TTGCACCATA TTCTGGATCA TATCCTTCTT CTGCGATTTT GTCTTTCGCT TTATCAGTTA 720 

CAATAATATT TATOTTTTGT TCAGATAATC GATTTGTTAA TTTATTAACC ATCATTGTTA 780 

CAATTTCTTT TAATTCTTCT TTTGTTAGTT TATGGAATAC AATGATATCA TCTACACGGT 840

TTAAAAATTC TGGACGGAAT GAATTTTTTA ATTCTTTTAA CATCGTTTTT CGAATTGTTT 900 

CATAATCTTG TCCATCACTT GAACCACCGA ATCCAGCAAA TCGTTGATCT TGTAATTCTT 960 

GTGCCCCAAC GTTTGATGTC ATTATGATAA TTGTATTTCT GAAATCAACT GTACGTCCTT 1020 

TTGTATCTGT CAAATGTCCA TCATCTAAAA CTTGTAATAG AATATTAAAT ACATCTGGAT 1080 

GAGCTTTTTC AATTTCATCA AATAAAATTA CAGAATATGG TTTACGTCTA ACTTTTTCAG 1140 

TTAATTGTCC ACCATCATCA TGACCAACAT ATCCTGGAGG AGCACCAACT AATCGGCTCA 1200 

CTGCGTGTTT TTCCATAAAT TCACTCATGT CTACACGGAT CATCGCATCA TCATCGCCAA 1260 

ACATTGATTC AGCTAAAGCT CTAGCTAATT CAGTTTTACC AACACCAGTT GGTCCAAGGA 1320 

AGATAAAGCT ACCAATTGGT CGTTTAGGAT CTTTTAACCC TGCACGGGCA CGTCTAACCG 1380 

CTTTACTGAT TGAATTAACA GCATCTTTTT GCCCAATAAC TCTCTCATGT AATGTATCTT 1440 

CTAGACTAAG AAGTTTTTCA GATTCTGTTT CATTGATTTT AGTTAATGGG ATACCTGTCC 1500

ATCCTGCAAT AACTTCAGCA ATATCTTCTT CTGACAATGA AGTTGaCATG CCATTTTGTG 1560 

CATTCTTCCA TTCATTTTTA GCTTCTTCAT ATTGCTTTTC AAGTTTTGTT TGTTTATCAC 1620 

GCAGgTTAGC AGCATTTTCA AACTCTTGAG CATOTACTGC GGCATCTTTT TCATTTTTAA 1680 

CTTTTTCAAT TTCTTGTTCA ATTTCTTTTA AATTATTAGG TGTCGTATGA CTCTTAAGTC 1740 

TTACTTTAGA ACTTGCTTCA TCAATTAAAT CAATTGCTTT ATCTGGTAAG AAACGATCTG 1800 

AAACGTATCT GTTACTTAAT TTAACAGCTG CTTCAATAGC TTCGTCTGAA ATATTAATAC 1860 

GATGGTGTGC TTCGTAACGA TCTCTTAATC CTITTAAAAT AGCAACTGTA TCTACTACTG 1920 
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5 
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IS 



TTTTGCGATA TTCATCTAAT GTAGTAGCAC CAATACATTG TAATTCACCA CGTGCTAATG 2040

CCGGCTTCAA AATATTCGAA GCATCXSATAG CACCTTCAGC ACCACCAGCA CCAACTAAAG 2100

TATGCAACTC ATCAATAAAT AGGATGACAT TACCTGCTTG TTGGATTTCT TCCATAACCT 2160

TTTTCAGACG CTCTTCAAAT TCACCACGAT ATTTAGTACC TGCAACTACT GTTCCCATAT 2220

CTAAAGACAT AACACGCTTA TCTTTTAATG TCTCTGGTAC CTCATTATTC ACTAtGGCtT 2280

GCGCTAAACC TTCAGCAATA GCAGTTTTAC CAACACCTGG CTCTCCAATA AGCACAGGAT 2340

TGTTTTTCGT ACGTCTACTT AATACTTCAA TTACACGTGT AATTTCTTTA TCACGTCCTA 2400

TAACAGGATC TAATGTACCG TCTTTGGCAA TGACT6TTAA GTCACX3AGCT AAACTATCTA 2460

AAGTTGGAGT ATTATTTGAC TTACTAGCTT GTGCATTTTT ATTACTCATT TCAGGGTTTC 2520

CTAAAGCTTT CACAACTTGT GCACXjTGCTT TAGTAATATT TAAATCTAGA TTTGCAAAAA 2580

CTCTTGCTGC AACACCTTCA TTTTCACGTA TCAAGCCTAA TAAAATATGT TCCGTTCCAA 2640

CAAAATTGTG ATGTAATTTT CTAGCTTCAT CCATCGATAA TTCAATGACT TTTTTAGCTC 2700

TAGGTGTATA ATGCAATGTA CCAACATGAT CTTGACCATG TCCGATTAAT TTTTCAACTT 2760

CTTCAATTAC TTTATCTTCA GTGATATTAA AACTTTCTAA TACTTTTGCA GCAATTCCTT 2820

CAGGTTCTTT CATTAACCCC AATAATAGGT GTTCTGTTCC TATATTTGAA TGATTTAAAC 2880

GAATTGCTTC TTCTTGGGCA TGTGCTAATA CGCGCTGTGC ACGCTCAGTT AATCTACCAA 2940

ATAACATAAA TAATGACCTC CTACTTTATA TGTTCTCTTA GTATATCTGC TCGTTTTTCT 3000

TTTACAGATT TGTCATCTTC TTCATCTAAT AAAAATGGTG ACTGTATAGC TACCATCAAT 3060

TCATTAAATT TAAAGTTTTG TAATTCAATG TAATTTAAAT CTATACCAAG TTTAACTcGC 3120

TTAATCTATA AGAAGCCTCT TCCATAGTTA TCATTCTACA GTTTTGTAAA ATACCTAGCG 3180

AGCGAAAAAC ACGGTCTTGT GTTTCTAATT GATTATAAGT GTCTAACTTT TGTCGTATTT 3240

GTTTTTCTTC ATGAATGATT TGATTAACAA CTTCTGTTAA TGTTTCTATG ATTTCTAACT 3300

CAGATTTACC AAGTGTAAGT TGGTTGGATA CTTGATAAGT ATGTCCATAA ACTTGCGAAC 3360

CTTCACCGTA AATACCTCTG ATTGTATATC CAAAACGATT AATGGTTTGA GCAATCCGTO 3420

TCATTCTTTT CATAATAGAT AGACCTGGCA AATGTAGCAT CACGCTTGCT CTCATACCAG 3480

TACCTATATT GGTAGGACAT GTAGTTAAAT AACCAAGTTG TTCATCATAA CTTATATCAA 3540

GGCTTCGATC TAATTCATCA TCAATTGATG AAGCTTGATT ATATAAAGCC TGTAATGTCG 3600

TGTCAGTTCC CATAGCTTGA ATACGAATAT GGTCCTCTTC ATTTATCATG ACACTTAAAG 3660

ATTCATCGTC ATTCACTAAT ACTGCGGCTG CTGGTTGTTT TATTAGTTCA GGACTAATCA 3720
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CAAAGTTTGG CAAGGCATCT TGTACCTCAT TTATAACTCT AAATCCcATC ATTTTCAGTA 3840 

GCATACATTA GTGGATGCAC ATQATTTTCT AAAtTACGCG CTAACCGAAT TCTAGAAGAC 3900 

ATAACAATTG GTGTTTCTTC ATTACTTTTC ATCCATTGGC TGATATTATC ATGAATATTA 3960

TGCGTCATCA TGTTGCACCT CACTCTCAGC TTTTAGTGCT TTAATTTCAT CTCTAACAAT 4020

GGCTGCTTCC TCAAAATCTT GGATTTCAAT AAGTTTTTTC AAATATTCAT TCTTTTCTTC 4080 


GATTTTTCGC TTTAAAGCTA TCTTTTTATG TGAAGAATGT GGTGTCTTTC CAACGTGCTC 4140 

AAATTGTCCA CCTTGAACTC TGCGGACGAT ATCAATGATG TCATCTTTAA ATGTTGcATA 4200 

ACAATTAGCA CACCCAAATT TACCAACATG TGCAATATC 4239

(2) INFORMATION FOR SEQ ID NO: 334: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1245 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 334: 

CAAAACTTTG ATTATGTCGT GTCAGTTATT AATGCATACA GTAATGCATC GTTGTrATTT 60 

TTTATTTATT TACTATTTAT TATGTACCAA ATCAGCCTTt CAGTAAAATA GGTCAAATTA 120 


CTGATTTTCT AAATATAAAA TGCCTCCTAA TAACATACTA TTAGTACATC ATTAAGAGGC 180 

TCTTGTGTTA TTTGCATACT AAGCGCTCAA ATTTAAATTT AAGATGAAGA TTCTTGCAAT 240 

AATATTTCTA TATTTGTCGA TATATCCAAT GGATCTTCCA TTGGCAAGAA TCGATTTACA 300

ACATTTCCTT GTCGATCGAT TATAAATTTT GTGAAATTCC ATTTTATTGG GGaCCCAAAG 360 

ATTCCTGGTT GTTCaTTCTT TAAATGCGtA AATAACGGAT GTTCATCGTT CCCGTTCACA 420 

GATATTTTAG CTAACACTGG AAATGTTACA CCAAATTTCT CACGACTAAT TTTCAAGATT 480

TCTTCATTAG AACCTGGTTG TCGATTGTCA AAATTATTAT TCGGAAAACT CAACACTACA 540 

AACCCACGAT CCTTATATTT TTGAAATAAA GTCTCTAGTT TTTTCAATTG TTCGCTATAT 600 


ATACATTCTG TTGCAGTATT AACAACTAAA ATCACTTTAC CTCTAAATGC TTCTAATTTA 660 

TAAGTTAAGC CTTTATAATC ACTTACTTCG ATATCATACA CATTTCTATT ATTCATAAGA 720 

CACCCCTACA CAGCCTTTTT TATATTGAAT ATGTTCTTTT TAGAATGTTC TGATAAAATA 780 


AGTGCGCGTT TACACCGTGA ACACACATTA TATAGCGTGA TACATTTTTC GAGCACACGA 840 

TAAATAATGT TCGAGTTTAT GTTGTTGCTC AACCTATCCG ATTTACCGTC TTTTTTCACC 900 
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CCGTTATAAC CCCTCATTAT AATCATCCTT ATTTTCTATT TTTAAAAAGA CAATTAGACC 1020 

GCTCTTTAAA CTATAGaTTA ATACTTAAGk TAAACTCATA CATACTGATA CCATACGTTA 1080

GATCTAACAA TTTAAAATTC GTTATAACTA TGGATTAAAG AGCTGCCCAA CTCATATAAT 1140 

CCTTAAAAAC TTCACATGTG ATTGTkTATT AAGCCCTCCT TTATCnTATT AAATATCCTT 1200 

ATAACCCTTT TAAAATTAAA CTGACACACT CATACATTGT TACAC 1245 




(2) INFORMATION FOR SEQ ID NO: 335: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1399 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 335: 



CTTGTAAAGG TAGTTGTTCA TTTAAATTAA AACAGTATGC TAAGTTnGTG CTTATATTTT 60

GCAACTTCAA TTCGACAGGC TCATCTTGAT ACATGAATGC CTCAATTTCA TCATGTGATA 120

ACTTTTTCAA AATATCAATT AAATGrATAC TAAAAACGAT AAATAACATA TCCCAATTTG 180

AATTCAGCCC TAGCGATTTT AATTTGTTTA AAATCTTATC TTTTTGAAAA ATTCGATTCC 240

TAATGTCTTG TATATCGTTA TCAGTCAAAG TTTCCCAATC TATATGTGAA TGAAGACCTA 300

AATAACACTT ATCCATTAAT AATTCATATA CCGTTAATGC AGAGACATCG AAACAACGTT 360

CTTCACTTAr AAAAACGCCA TCAACATCAA ATAAAATTTT CTTCACAATC CCCACTCCAT 420

TTCTGaAAAT TCAGaTATAA ATCATTCTAC TATTTGACTA AAAAAAGCGC AAACCCTATT 480

GAAGTaGATT TGCGCTTTAG CTGTTnAAAT TTTATAAATG TnTTTCAATT TCATCAGCAA 540

CCTGCTGTAC GTGTGTACCG ACAATAACTT GAGTTGAATG TTTGCCATTA ACAGTAACAC 600

CAACTGCACC GGCGTTTTTA ATCTTCTGTT TATCAATAAT AGATGTGTCT TTTAACTCTA 660

GACGCAACCT TGTTGCACAA TTGGTTAAAT TAACAATATT CTCTTGACCG CCTAAACCTT 720

CTAATATTTG TATAGCATGT TGATGATATT TACTTTGTTT AATATCATTT TCACCAGGAG 780

CAATATTATC TTTTACAACT GTTGGGTCAA CTAATTCATT TTCACCTCTA CCAATCGTAT 840

TCAAGTTAAA TACTTGGATT ACTACACGGA AAATCACATA GTATAAGATG AAAAATACAA 900

CACCTTGAAC AAGCAACA7C AATGGATGAT TTGATACTGG ATTAATTAGT GATAACACAT 960

AATCTATCAA ACCTGCACTA AATGAAAATC CAGCTGTCCA ATGGAATGTA GCTGCGATAA 1020

ATAAAGATAA TCCTGTTAAT AACGCATGAA CAACATATAA GATTGGCACA ACAAACATAA 1080
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ACCAACCGTA AACTTGTTTT TTCTGAGTAG TTTTAGCTGT ATGATACATT GcTAACGCAG 1200 

CCGCTGGAAT ACCGAACATC ATGATTGGGA AGAATCCCGC TTGATAGCGT CCTGTAATAC 1260 

CTTTTATAGC ATCTTTGCCA CTTTGGAATT TACCAATATC ATTAATACCA ATCGTATCAA 1320 

ACCAGAACAC ACTATTCAGT GCATGATGTA ATCCTGTAGG AATTAATAAT CTATTGGCAA 1380 

CACCATATAT GAAAGCTCC 1399 



(2) INFORMATION FOR SEQ ID NO: 336:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1329 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 336: 



TATAGTTnTA TTATTTAGCG AAGCATTAAT ACTACCACCG GTTATAACAA ATGTATTTTG 60

CGTATTAAAT TGAATGGTAG GACCAATCAA AGTATATTCA ATCGCTGGAC CATCATTGTT 120

AATTAATGAC TGCGCAACCT TAAAACTAAA TTGATCCATG GCACCTGCGC CTGAAAATCC 180

AATATGTTCA TAACCTATTC TTCCTAGATC TTGTACCGTT GAAAAGAGAC CTGGTTGTAA 240

AATCTTAATT GACATTTTCA ATCACCACCC AGTCATCAAC ATTAAAGTTG CCATCTGATA 300

TATCTCTTTC GATTTGTATA AATTTCTGTT CATCTATTGC ATAAAATTGT ATCCATTCTC 360

CTGCTTCGTA CATTGACATT GGTTCACGCT CGCTGCTAAA TACTTTTAAc GGTGTGCGTC 420

CAATAATTTG CCATCCGCCA GGAGAATCTG ATGGATATAG TCCTGTTTGA TTATTCGCAA 480

TACCTACAGA ACCTGCATGA ATTTTTAACC TTGGCTGATT ACGTCTAGGT GTATGTAGTT 540

GTTCATCAAG TcCGCCTAAG TATGGAAATC CTGGCATAAA TCCTAGCATA TATATTAAAT 600

AAGGTTTACT TGTATGTTTT TCAATAACTT GCTCAACAGT TATTCGATTA T6CTTTGCTA 660

CTTCTTCAAT ATCTGGTCCA TATGTACCAC CATATTGAAC AGGTATTTTA ATAATACGAT 720

TGGTTTGATT CACAGCATGA ACAT'rnTlT CATTAAATTT GTTAAGTTCT AAATTTTCAA 780

TTAATTTAGA AGATGTTATA GCTTGTTCAT CAAAATATAT TAGAACTGCT CGATACGAAG 840

GGACAATATC TTGAATTTCT AATATTTCTT TTTCTCGTAT CCACCGTACC ATTGCTGTGA 900

CATTACGATA TGTCTCTTCG GATATTTTAT TTTCAAAATA AATCATAATT GTCTGCTCGT 960

TAATAAATCT TACATCCACT TTAAATCCCC CTTTGTATTG CAATAAACCA GTATTGAATA 1020

CCTTTTCATT GTATCATTGA GAAGCACAAG TTGTTTAATA AGTAATTCAA ATCGCATATA 1080
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TTAATATTGT AACTCTTACA CTAATTTAGG TTCTGCTATC ATTCGGTCTG ATGGAAAATT 1200 

TTTACTTTTC ATCTGTCCGA TTTTTTGATT TTGAATATAA AAAAGCACGA CCGAAGTATC 1260 

ATTAACACAC TTCAATCGCG CAATTAAATA ATCTATTTGA TCATTTATTG GATATTAACA 1320

ATTTTTACG 1329 



(2) INFORMATION FOR SEQ ID NO: 337: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3421 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 337: 

6AATAGTGCC ATTTGGAAAG ACATAAAGAA TCCAGAAGCA CCTTTAGCGA ATATGCCGTG 60

TTCATATAGA TTTGTGAAAC TGGCATGCCC GAATTGAGTT TTAAATGCAA ATAGAATCAT 120 

GACGAAACCA ACTACTATTA AACCAATAAT TGT06CTATT TTAATGATAG AGAACCAAAA 180 

TTCTAATTCT CCGAAAAGTC TTGCGCTAAG TAGGTTGAAT GACATTAATA ACAGTACACA 240

AAATAGTGCA CTTATCCAGT TTGGAATTTC TGGGAACCAA AAGCTAACAT ATTTTGCCAC 300 

AGCCGTTACT TCAGCCATAC CTGTAATAAT CCAACAGAAC CAGTATGTCC ATCCGGTAAC 360 


AAATCCTGCA AAAGGCCCAA TATATGTATT GGTTACATCT GCGAAAGATT TAAATTCAGT 420 

ATTCTGTATA ATGATTTCTC CTAAACCTCG CATAAACATA AATAACATAA ATCCTATAAT 480 

GATGTATGTT AATAGAATTG AAGGGCCGGT TAATGCAATC GTTTGACCAG CACCTAAGAA 540 


TAAGCCTGTA CCAATTGCAC CGCCAATTGC AATTAATTGT ATGTGGCGAT TGCTCAGTTC 600 

CCTTTGTAAT TTTTCAGCCA TAATACATCT CCCTTAAATA TAGATATGTT TATTATGCAC 660 

TTATATTGAG ATATATACAA TTATTTTCGG TAAAAATGTG TAAAATTCCA TGTTAATATA 720

CTTTGGTTTT TATAATCATA TATAATAACC AATTGAAAAT TTAATTCTAT TGTAAAATTC 780 

ATGGATTATT CACATCTTGA AAAAGCTTTA ATGGTGCTAT TTGTGGCTAT TCTGTGACAT 840 

TTACATAGAT TTACAAAAAA ATTGrTGCAC ATATAATGCC AGTtTTTATA TTTCACAAAC 900

GAAATGCGTT TAcTATAATA TTAGTTGAAA GCCATTTCAT AAAGAAACAG TAAAGGGGAA 960 

ATTtATCaTA GCmGaATTAC AAAGAGGTTT AGAAGGGGTT ATCGCmGCGG AGACTAAAAT 1020 


AAGTTCAATT ATTGAAAGTC AATTGACTTA TGCCGGCTAT GATATTGATG ATCTAGCIGA 1080 

AAATGCGCAA TTTGAAGAAG TTATTTTCCT ATTATGGAAC TATAGATTGC CAAACGAAGA 1140 

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TTACAACAAA TTTTAACTCX3 TCCAGCTGAG CAIGATGTTG TAGC^ACTAT GAACTTGAAT 3060 

GGTGACTATA TTTCAGATGC TTTAGCTGCA CAAGTTGGTG GTATTGGTAT TGCGCCAGGT 3120 

GCAAACATTA ATTATGAAAC AGGTCATGCT ATTTTTGAAG CAACACATGG TACAGCTCCA 3180

AAATATGCAG GTTTAAATAA AGTGAATCCA TCTTCAGTAA TTTTAAGTTC TGTATTAATG 3240 

TTAGAACATT TAGGATGGCA AGAAGCGGCA GATAAGATTA CAGATTCAAT TGAAGATACA 3300 


ATTGCTTCAA AAGTTGTTAC TTATGACTTT GCCCGTTTAA TGGaTGGtGC TGAAGAAGTT 3360 

TCTACATCAG CATTTGCAGA TGAATTGATT GnAAATTTAA AATAAGCAGA ATAGAATTAG 3420 

G 3421 


(2) INFORMATION FOR SEQ ID NO: 338: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3173 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 338:



CCCTnGATAC CCAAACTGCC CTAATACCAC TGGCTAAACC TAATGGATAG TACCATTTAT 60

TTTCCAATAA ATAAnCCAAC TGCAATnGCT ATaACTCTAA ATATAATAGA GATAATCmcA 120

TTAATTGGAT TAAtACGCCA AATACTAGTA ATAATAGGCT AGATAATAAT CCACCTAAAA 180

AGTACTTTTT AATTCCAAAG AAAGCTAATA TCAATAATGC TGCCGGTGCA GATAATTGAA 240

AATCTAATCC TGGTATAATG GACGGTATTT TCAAAACTGC CAAAATGGTT AAAATCGCAG 300

CAATGACACT AATTTGAGTA ATATCTTTTG ATGTCATACT AAAACCCCTA TACCGTTTCA 360

TAAACAACTT GCTTCGGTGT GCTTTCTAAA AATGATATGT AATGATTTAA ATCAATACAA 420

TCGTCCACAA ATATTATTCT GCCTCCATAT CTCGTATTAA CTGGTTTAAT ATCAAATAAT 480

CGATGGTAAC CAATTTTAGC AGCGGCAAAA TAACCTGTCG TATACGTTAA GTCATCGGAC 540

ACGCAAAGTT CTCCTTTGAC ATACGGATGC GCATTGATAC AACTAGCAAT TGCTAAGGCA 600

TCAGTCACTC TTTCATTAAG ATCACCTTTA TTATTTATAT CTTCAAACGA AAAATGTGTT 660

GCCCTAATCC CCCTTTGTCC AAATGAATCT AAACGTTTAC CAGATATAGC AGATAGAATA 720

ATAGCTCCTG TATAAACCGT TTCATTTTTA ATATATGTCA TCCCTTGATT TAGCGCTTGT 780

TCAGTGACAC CACATTCTTG TGTTAAATGT TGGAGATTTG CTTTATCATC CTCAATAATT 840

TGTAATGCTT TTATTTGTTG AATCGGTTCC ATGATTTTTT GTATTTTTAT ATTGAGAAAA 900
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ATTTTGTCAA ACTCACAAAT CGTTTCAGCA 
CCACTAATAT GAACATCTTG ATTGCTAGAA 1020

CGCATTTTTA TACTATACAT GACGATCACC TCAATCTTCT TGaTGCAAAA TTTCAAACAA 1080

CCTATCTATA TCTT G ' X TCAG TATGAAAATA CGACAATGAT ATTCTTAACA TTGGCTTAGT 1140

CACAGTtGGA TACCTTAAAT AACTTGTAAA CACATGATGC TTTAATAATG TTTGATGAAT 1200

GTTCTCAGCC GCTTCTATGT CATC/^AACTC AATAAACTTA ATCGGCGAGT TTGaACTATT 1260

ATAATkAACA TTGAGTGCTT TTAACTTTTG GTTAAAATAT TTACTCAAAC TATTTAATTT 1320

AGTGCGTCTA TCATCAGCAT TTATTAACTT TTCAATGTTT CTTTTTATAA AATACAAATT 1380

ATAAATTGGC AAACTACTTG AGTAGATGAG TGGTCTACCG TGATTAATTA aCATATCCtT 1440

CaCATCaTTT GaACTkaAAA TcACACCCCC GTATGCACCA CATGCTTTAG ATAAACTAGA 1500

AGTGAGTATA TCTACACCTT GATAATTCGA GTAAtTCTCT ATtCCAAAAC TATGTGAAAC 1560

ATCGAGTATC AGTGTTGCGT TAnATTTATG CTTTAATGAG ACTAATTGAC CAATATCCAC 1620

AACGTCGCCA TTCX3TTGAAA ATACACTATC AGATATGATT ATTrrrGGTA TATTTTGATT 1680

AGGGTATTTC TCTAACCTTT TTTCTAAATC AGCAATATCT AAATGCTTAT ATATCACTTT 1740

TTCTAAACCA CTTAACTTAA TACCGTCAAT AATACTCGCA TGATTTTCTT GATCTGAAAA 1800

CACGACACAA TTTGTATTTT TGAAAATATT AAATAACGCC AAATTAGCAT CATAACCACT 1860

ATTTAAGATA GTACATGcAC TATATCCGAG CCAACCTGCT AACATTGTTT CAATTTCTTC 1920

ATAAGCTGTC GAACTTCCAC TAATTAATCT TGAACTTGAT AAGTGATAAC TATACTTCCG 1980

CATAAATCTT TCGAAATCAT CCTTATCAAA CGCTATTTGA CCTAATCCTA AATAATCATT 2040

AGATGTATAG TTCGTACATC TCTTATTTTC TACTTCAATA TACTGTCTAT CTATATACCC 2100

TACCGATTTA AGCGACCGAT ATAACCCTTT CTGTTGTAAT AAATCAATTT GCTCTTGAAA 2160

CTTCATTCTT GTTTTCCTTA TTTTCACAAG TGTCATAATC AATTTCAAAG CCTAAATCAT 2220

TAATCATATC GTAGTCTAAT TGGTTCGGTT GCCCACCAGT AATTAGATAA TnCACCGACA 2280

AATATTGAAT TCGCCGCTTT TAATGCTAAT GGCTGTAACG AACGTAAGTT GACCTCTCTT 2340

CCTCCAGCAA TACGAATTTC TTTCGTAGGA TTGATTAATC GGAATAATGC TACGATTCTT 2400

AAACATTTCA TTGGTGTTAA ATCATCCATG CTTCCAAACT TTGTGCCTTT GATTGGATGC 2460

AAAAAATTAA TCGGAATACT GTCGGCATCC ATTTCTTTTA AAGCAAATGC CATATCAACA 2520

ATATCTTGAT TAGATTCTCC CATACCACAA ATCACGCCAG AACATGGTGA TATATTATTC 2580

GCTTTCATTA GTTCTATCGT ATCTGTTCTA TCTTTATAAC TATGCGTTGT CACGACGTTA 2640

TCATGGTAAT TTTCACTTGT ATTAATATTG TGGTTATATC TGTCTACACC AGCTGACTTA 2700






TGTTGAGATT TAATCGTTCT TACAGTATTA CTAATATGAT CAACTTCTTT ATCGCTCGGT 
CCTCTACCAC TCATAACAAT ACAATATGTT CCAATATGAT TATCATGTGC CACCTTTGCT 
CCATCGATAA TTTGTTCCTC TGGAATTAAA GCATATCGCT GTTTTTGTTT AATATCTCGT 
GATTGTCCAC AGTACCCACA ATTTTCAGGA CATATACCAC TTTTAGCATT TAAAATCATG 
TTTAATTTTA CTnTTTACC AAAATAATGT TTTCTTAAAA TGTACGCCTC ATTTAATAAA 
TCTAAGGTAT CAATATTAGT ATCCTCATAA ATtTTCAATA CAGTCTCTTT TGTtAATTGT 
tCCCCTTGTA ATATGCGTTT AGCCAAATTC ATATTAACAC TTCCTATCTA AAA 
(2) INFORMATION FOR SEQ ID NO: 339:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1694 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 339: 



CGATTATCCA 


TTAATACAAC 


CCTAAGTAAA 


TGTATAAAAA 


TTATCTTCCA 


CAAACTTCAA 


CAAAAGCCTA 


AATAAATTAC 


AGCAATTTAT 


CAAATATTGC 


TTACTTTGAT 


TTTATGAAAT 


nACTTAATTC 


TAACACATAC 


TAAATCATCA 


TATACTAATT 


CGAAATCAAA 


TGCATTTAGA 


GATAATCaAA 


ATGCGGAAAC 


ATCTCCaATA 


ATCAATAATC 


TATTCCCAAT 


AAATATGAAT 


GTTCTCAACA 


ATACATTATT 


TATATCTCTT 


TACACTGTCA 


TCGACAAAAA 


CTAAATCTTT 


CACTTTCAAT 


TTCGAACGTG 


GTTCTACGAC 


ATTTGCTGCT 


ATATCATTTA 


ATGGGATTAA 


AACAAATGCA 


CGTTCATTCA 


TTCTCGGATG 


TGGCACCGAC 


AGTTTTGGTA 


AATCTATCAT 


TTCTTCTCCA 


TACAACAAAA 


TATCCACATC 


TAAAGTTCTA 


GGACCCCATC 


GTTCCTTTCT 


AATACGGTGT 


AAACATTCTT 


CTGTOcTCAA 


ACAACATTCC 


AACAGTTGTA 


ATACTGTGAG 


TGTTGTTTsA 


ATTTCAACAC 


ACAAATTTAA 


AAAGTTAGGT 


TGCTCAGTAT 


ACCCAACTGG 


TGCTGTTTCA 


TAAATCGGAG 


AAATAtTAGA 


TACGTTAATA 


CCATCATATT 


CATTCaAAAT 


CTTyATAGCA 


TCGTTTAACt 


GGCTTTCTCT 


ATCACCAaTa 


TTACTACCTA 


AyCCTAAGTA 


TGCTTGAATC 


ATyTATTCTC 


CCTCACTATT 


TCGATACCTA 


CTCCATCATA 


ATGACCCGGA 


ATCGGTGGGT 


TTTCTTTAGT 


GATTCTCACT 


TTCGTTTCCA 


TTACACGATT 


ATATTGTGAA 


TTTATACGAT 


TTGCAATACG 


TTCAGCTAGA 


TGCTCAAGTA 


AATTAACGGC 


CTTACCTTCC 


ATAATTGATT 


TAACCTCTTC 


GAACACTTCA 


CCATAATGAA 


CTGTATCAAT 


AACATTATCA 
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ATTTCATTTT CAGCTGATAA AGCACCATGA TATCCATAAA AGCGCATACC TTTAAGAAAG 1080 

ATTGTGTCTT GCATTTTCAT TCTCCTTTAA AAAATCTATA CCTTTAGCTA ATTTAGCATT 1140 

CAACTCGACA TTATGAACGC GTACTGCTCT AACGCCTTTC ATAATACCAT ATGCAGTCGT 1200

AGCTGCAGTT ACTTCATCTC TTTCAACCGG TGTTGTATCA TAACCCATCA TCTCTTTAGT 1260 

GAAACGTTTC CGGCTTGTCG CTAATAAAAC TGGATATTCT GTTGCAACAA GTTCATCCAG 1320 

TCTTGCCATA ACTTCGGCTT CTTCATTTCT AGTTTTAGCG AAACCTATAC CTGGATCTAG 1380 

CCAAATTTTA TTTGAAGGTA TACCAGCTAT TTTAGCTTGA TGTGCTTGTG CTAACAAAGA 1440 

TGTTAACATT TCTTCGACAA CCGGTTCATC ACGATTACCA TTTCCATTAT GCATTAAAAT 1500 

AATTTCCGCG TCATAtTTAG CTACAATTTG GAACATACGA TGATCATACA GACCgCCcAT 1560 

tGATCATTAA TCATATCAAC GGCTAATTTT AAACATGCTT CAGCAACCTC ACTTCGAAAT 1620 

GTATCGACTG AAATTTTTTA CATCAnAACC GACAATAGCT TCAnCAACAG TAATACTCTG 1680 

TTCCATCTCT TCTG 1694 
(2) INFORMATION FOR SEQ ID NO: 340: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1358 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 340: 

AGCATTTCTT TTCTATAAAC ATTTAATTGA ACATTATTAA GTACACTATT ACTATAGTCA 60 

CTATATTGAA CaCATACCTC ATTTAATTCT AATAGCGGTT CAGATTTGTA CTTATTATCA 120 

TTATTTGCAG ATGTTTCATC TATCCATTTT TTCACTTTAA ATTTAACATG TTCACTCATA 180 

CAAACGTCAC GTAAATTCGC TAAGTTATCA ATGGATTCGA CATCTACTTC TGCATATTTA 240 

AGCGCTGTAC AGTATAATGG TTCACGTATG CCTGCTTCTT TAAGCTTAGA TGATTTTAGC 300 

AAATCACTAG GCGTTGTATT AGCGATGATT TTTCCATCTT TAAAAAGAAG AACTCTATCA 360 

AACGTATCAT CTAATGATTC TTCTAATCGA TGTTCQACAA TAATCATCGT TGACTTTGTT 420

TCTTCATGAA TATTGTTTAA CAATCTCAGC GTTTCATGTC CTGTCGCAGG ATCTAAATTG 480 

GCCAGTGGCT CATCCAATAT TAAAATAGGC GTACGATGGA TTAATATACC ACCTAATGAA 540 


ACGCGTTGTT TTTGACCTCC AGATAAATCT TGCGGTCGGT GATTTAAATG TTCTATCATG 600 

CCAACTTTTT CAGCCCAATA ACTTACATTT TTCTTCATAT CATCTTGTTC AACACAATTA 660 

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TCTTGTAAAA CTGTACCAAC AACATTAGAT CTATCATGTA AACAACTAAC GGTTGCATCT 780 

TGATTATTTA TATATAGTTC CCCAGTTATG TTACCTTTAG TTTTAAATGG AATTAATCCX3 840 

TTTATGCAAT TTGCAAAAGT CGATTTACCA CTACCCXy^G CACCAACTAC TAATACTTTT 900 

TCTCCTGGAT AAATATCAAC ATTTATATTC TGTAATGTAG GTGTTGCTTG ACTATGATAT 960 

TGAAAACTAA AGTCTTTGAA CGAGATAATT GGTTCAGTCA TGATATATCA TTACCTTTCT 1020 

ATATTCATTT ACATATCTGA TTCAACAAAA TAACTATTCC TTACGTAAAC TACCTTTTTT 1080 

AATTTGAGAT GAaGCATATG CTTTTAATAA TATTGTCCCA ATAATGCCAA CTGAAATAAT 1140 

ATTTAATACT GCAGAGATAA CACCTTGTGT ATAAACCTTG TTAGCTGGTT CGTTATAAAT 1200 

CAAAATATCT AATGTTGGTG CAATAAGTGC CCAGCAAATA ATATTCGCAA TAATTTGACC 1260 

GATATTAAAA TAAACCATCG ATTTCCTAGA AAATCGGCCT GAAGAAAGAT TTAATTTTAG 1320 

TCCAATCCAG CCATATAAAC A6CCTATAAT TCCCGAGC 1358
(2) INFORMATION FOR SEQ ID NO: 341: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4557 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 341: 

TAGAAGAATT GGAGAAAATG CTAATTCAAT TGTCAACATT CCATAGTTAT CATGATTTAG 60 

AGTTTCTATT TGTGACACGT GAAGATGAAG TTGAAACATT GAAATGGGCA CGTTGGTTGC 120 

CACATATGAC ATTGAGAnGG CAAAACATTA GAGGATTTGT TTACAATCAA CGAACGCGTG 180 

ACCAAATTTT AACGTCAATT TATAGCATGA TTAAAGAACG TATCCAAGCT GTGCtGaACG 240 

CAGCAGAAGT AATGAGCAAA TTATTTTCAC ACCGCAATTA GTGTITGTCA TTACAGATAT 300 

GTCATTAATT ATTGATCATG TCATTTTAGA ATATGTAAAC CAAGATTTAT CAGAATATGG 360 

TATTTCATTA ATCTTTGTTG AAGATGtGAT TGaAAGTTTG CCAGAGCATG TAGATACCAT 420 

TATTGATATC AAGTCTCGTA CTGAAGGCGA ACTGATTACG AAAGAAAAAG AATTAGTTCA 480

ATTGAAATTT ACACCTGAAA ATATTGrTAA CGTCGATAAA GAATATATCG CGCGACGTTT 540 

GGCGAATTTG ATACACGTCG AACATTTGAA AAATGCAATT CCTGATAGTA TTACATTTTT 600 


AGA6ATGTAT AACGTGAAAG AAGTAGATCA GCTTGATGTG GTTAATCGAT GGAGACAAAA 660 

CGAAACATAC AAAACGATGG CAGTACCTTT AGGTGTAAGA GGTAAAGATG ATATTTTATC 720 
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AGGGAAATCT GAGATTATCC AATCATACAT TTTATCTTTA GCTATTAATT TTCACCCTCA 840

TGAAGTTGCA TTCCTATTGA TTGACTATAA AGGTGGGGGT ATGGCGAACT TATTTAAAGA 900

TTTAGTCCAT TTAGTTGGTA CGATTACAAA CTTAGATGGC GATGAAGCGA TGCGTGCCTT 960

AACATCAATC AAAGCCGAAT TGAGAAAACG TCAACGTTTA TTCGGAGAGC ATGATGTTAA 1020

CCATATTAAT CAATACCATA AGTTATTTAA AGAAGGTATT GCGACAGAAC CAATGCCACA 1080

TTTATTCATT ATTTCCGATG AGTTTGCCGA ATTAAAATCA GAACAACCTG ATTTTATGAA 1140

AGAACTTGTA TCAACGGCAC GTATTGGACG TTCGTTAGGT ATTCATTTAA TACTTGCGAC 1200

ACAAAAACCA TCGGGTGTTG TTGaTGACCA AATTTGGTCT AACTCTAAAT TTT^GTTGGC 1260

ATTAAAAGTA CAAGATAGAC AAGACAGTAA TGAAATTTTA AAAACACCAG ATGCAGCAGA 1320

CATTACmTTA CCaGgTCGTG CGTATTTACA AGTTGGTAaT AATGAmATTT ATGAATTATt 1380

CCAATCTGCA TGGAGTGGTG CAACATATGA CATCGAAGGC GATAAATTAG AAGTTGAAGA 1440

TAAGACGATT TACATGATTA ATGACTATGG TCAACTTCAA GCAATCAACA AAGACTTGAG 1500

TGGACTTGAA GATGAAGAAA CGAAAGAAAA TCAAACTGAG TTAGAAGCGG TCATAGATCA 1560

TATQSAATCT ATTACAACAC GATTAGAAAT CGAAGAAGTT AAGCGTCCAT GGCTACCACC 1620

ATTGCCAGAA AATGTATATC ArGAAGATTT AGTAGAAACa GATTTCAGAA AATTATGGTC 1680

AGATGATGCA AAAGAAGTGG AATTAACATT AGGACTTAAA GACGTACCAG AAGAACAATA 1740

TCAAGGACCG ATGGTATTGC AATTGAAAAA AGCTGGGCAC ATCGCGITAA TCGGAAGTCC 1800

AGGATATGGT AGAACAACGT TCTTACACAA CATTATTTTC GATGTTGCAA GACACCATCG 1860

TCCTGATCAA GCACACATGT ACTTGTTCGA TTTCGGTACC AATGGTTTGA TGCCAGTTAC 1920

AGACATACCA CATGTCGCTG ATTACTTTAC AGTAGATCAA GAAGACAAGA TTGCTAAGGC 1980

GATACGTATA TTTAATGATG AAATTGATCG TCGTAAGAAG ATTTTAAGTC AGTATCGTGT 2040

CACTAGTATT TCTGAATATC GAAAATTAAC TGGTGAAACA ATTCCGCATG TCTTTATTCT 2100

TATTGATAAC TTTGACGCAG TAAAAGATTC ACCTTTCCAA GAAGTTTTTG AAAATATGAT 2160

GATTAAAATG ACGCGTGAAG GGCTAGCATT AGACATGCAA GTAACCTTAA CTGCTTCAAG 2220

AGCTAACGCT ATGAAAACAC CAATGTACAT TAATATGAAA ACGCGTATCG CAATGTTTTT 2280

ATATGATAAA TCAGAGGTGT CGAACGTAGT AGGACAGCAA AAATTTGCGG TTAAAGATGT 2340

TGTGGGTCGA GCATTGTTAA GTAGTGATGA CAACGTATCA TTCCATATTG GCCAACCATT 2400

TAAACATGAT GAGACCAAAT CATATAATGA TCAAATTAAT GATGAAGTAT CGGCGATGAC 2460

AGAATTTTAT AAAGGTGAAc ACCAAATGAT ATtCCTATGA TGCCAGATGA AATTAAATAT 2520



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GGATTAGATT ATGAAGGTGT TACACTACAA AAAATTAAAT TAACTGAACC AGCAATGATT 2640 

TCATCAGAAA ATCCGAGAGA AATTGCGCAT ATTGCTGAAA TTATGATGAA AGAAATTGAC 2700 

ATATTAAATG AAAAATATGC GATTTGTATC GCAGACTCAA GTGGAGAGTT T7UUU3CTTAT 2760

AGGCATCAAG TGGCTAACTT TGCCGAAGAA AGAGAAGACA TTAAAGCGAT TCATCAACTA 2820 

ATGATTGAAG ACTTAAAGCA AAGAGAAATG GACGGCCCAT TTGAAAAAGA TTCACTTTAT 2880 

ATTATCAATG ATTTTAAAAC ATTTATTGAT TGCACGTATA TTCCGGAAGA TGATGTTAAA 2940

AAGCTTATTA CAAAAGGACC AGAACTTGGC TTGAACATTT TATTTGTCGG CATTCATAAA 3000 

GAATTAAtAG ATGCTTATGA TAAACAGATT GATGTTGCAC GTAAAATGAT TAACCAATTT 3060 

AGTATAGGTA TTCGTATTTC AGACCAACAA TTCTTTAAAT TTAGATTTAT TCAACX3AGAA 3120 

CCTGTTATTA AAGAAAATGA AGCATATATG GTCGCAAACC AAGCTTATCA AAAGATTAGA 3180 

TGGTTTAAAT AGCAATGAAT TAAATAGGAG GGAGGTATGT TATGAATTTT AATGATATTG 3240

AAACAATGGT TAAGTCGAAA TTTAAAGATA TTAAAAAGCA TGCTGAAGAG ATTGCGCATG 3300 

AAATTGAAGT TCGTTCTGGA TATTTAAGAA AAGCTGAACA ATATAAGCGA TTAGAATTTA 3360 

ATTTOAGTTT TGCACTAGAT GATATTGAAA GCACAGCAAA GGACGTACAA ACTGCAAAAT 3420

CTAGTGCTAA TAAGGACAGT GTAACTGTTA AGGGAAAGGC GCCCAATACG TTATATATTG 3480 

AAAAAAGAAA TTTGATGAAA CAAAAGCTTG AAATGTTGGG TGAAGATATC GATAAAAATA 3540 

AAGAATCCCT CCAAAAAGCT AAGGAAATTG CTGGCGAAAA GGCAAGTGAA TATTTTAATA 3600 

AAGCAATGAA TTAATA7TGA GGTGAAGATA TGGGTGGATA TAAAGGTATT AAAGCAGATG 3660 

GTGGCAAGGT TGATCAAGCG AAACAATTAG CGGCAAAAAC AGCTAAAGAT ATTGAAGCAT 3720 

GTCAAAAQCA AACGCAACAG CTCGCTGAGT ATATCGAAGG TAGTGATTGG GAAGGACAGT 3780 

TCGCCAATAA GGTGAAAGAT GTGTTACTCA TTATGGCAAA GTTTCAAGAA GAATTAGTAC 3840 

AACCGATGGC TGACCATCAA AAAGCAATTG ATAACTTAAG TCAAAATCTA GCGAAATACG 3900 

ATACATTATC AATTAAGCAA GGGCTTGATA GGGTGAACCC ATGATGAAAG ATGTTAAGCG 3960 

AATAGATTAT TTTTCTTACG AAGAATTAAC AATTTTAGGT GGTAGTAAAT TGCCTCTCGT 4020 

AAATTTTGAA TTGTTTGATC CATCAAATTT TGAAGAAGCT AAAGCTGCTT TAATTGAAAA 4080

GGAATTAGTA ACAGAGAATG ACAAGTTAAC TGATGCAGGT TTTAAAGTGG CTACATTAGT 4140

CAGAGAGTAT ATTAGCGCCA TTGTAAATAT TCGAATTAAT GATATGTATT TTGCACCATT 4200 

TAGCTATGAA AAAGATGAAT ATATTTTGTT AAGCCGGTTT AAAAATAATG GGTTTCAAAT 4260

ACGAATTATC AATAAAGACA TTGCATGGTG GTCGATTGTA CAATCATATC CTTTATTGAT 4320 
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
CTTAAATAAT GAAAGTATCG ATACGATTGG GCGTGTTTTA GAAATTGAAA TATACAATCA 4440 

TCAAGGTGAC CCTCAACAAA GTTTATATAA CATTTATGAA CAAAATGATT TOTTATTCAT 4500 

TCGATACCCA TTAAAAGATA AAGTGCTGAA TGTTCATATT GGTGTCATTA ATACATT 4557 
(2) INFORMATION FOR SEQ ID NO: 342:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3931 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 342: 

TTGAGTGACT TTATTGAAGC GCGTGTAGAA GAAATATTCT TCGAAGTATT TGATGTTTTA 60 

CAAGATTTAG GATTAACAAA AGTAAATGGT GGGTTTATTG TAACTGGTGG ATCTGCAAAC 120 

TTACTTGGCG TAAAAGAATT ATTATCAGAT ATGGTAAGTG AAAAAGTTAG AATTCACACG 180 

CCATCACAAA TQGGAATTAG AAAACCTGAA TTTTCTTCAG CAATTTCTAC AATTTCTAGT 240 

AGTATCGCTT TTGATGAGTT ATTAGATTAT GTTACAATTA ATTATCATGa TAATGAAGAA 300 

ACTGAAGAAG ATGTTATTGA TGTGAAAGAC AAAGATAACG AATCTAAATT AGGCGGaTTT 360 

GaTTGGTTTA AACGTAAAAC AAACAAAAAA GATACTCATG aAAATGAAGT AGAGTCAACA 420 

GATGAAGAAA TTTATCAATC AGAAGATAAT CATCAGGAAC ATAAACAGAA TCATGaACAT 480 

GTTCAAGACA AAGATAAAGA TAAAGAAGAA AGTAAATTCA AAAAACTAAT GAAATCTCTA 540 

TTTGAATGAT TATTGGCCAA TAAAACTAGG AGGAAATTTA AATGTTAGAA TTTGAACAAG 600 

GATTTAATCA TTTAGCGACT TTAAAGGTCA TTGGTGTAGG TGGTGGCGGT AACAACGCCG 660 

TAAACCGAAT GATTGACCAC GGAATGAATA ATGTTGAATT TATCGCTATC AACACAGACG 720 

GTCAAGCTTT AAACTTATCT AAAGCTGAAT CTAAAATCCA AATCGGTGAA AAATTAACAC 780 

GTGGTTTAGG AGCAGGAGCT AATCCTGAAA TCGGTAAAAA AGCTGCAGAG GAATCTCGTG 840 

AACAAATTGA AGATGCAATC CAAGGTGCAG ACATGGTATT TGTTACTTCT GGTATGGGTG 900 

GCGGAACTGG TACTGGTGCA GCACCAGTCG TTGCTAAAAT TGCAAAAGAA ATGGGCGCAT 960 

TAACTGTTGG TGTTGTAACT CGTCCATTTA GTTTTGAAGG ACGTAAACGT CAAACTCAAG 1020 

CTGCTGCTGG AGTAGAAGCT ATGAAAGCTG CAGTAGATAC ATTAATCGTT ATACCAAATG 1080 

ACCGTTTATT AGATATCGTT GACAAATCTA CGCCAATGAT GGAAGCATTT AAAGAAGCTG 1140 

ACAACGTGTT ACGCCAAGGT GTACAAGGTA TCTCAGACTT AATCGCTGTT TCTGGTGAAG 1200 
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GTATTGGTGT TTCTTCTCGT GAAAATAGAG CGGTAGAAGC TGCTAAAAAA GCAATCTCTT 1320 

CTCCATTACT TGAAACATCT ATCGTTGGTG CACAAGGTGT GCTTATGAAT ATTACTGGTG 1380 

GCGAGTCATT GTCATTATTT GAAGCACAAG AGGCTGCTGA TATTGTCCAA GATGCTGCAG 1440 

ATGAAGACGT TAATATGATT TTCGGTACAG TTATTAATCC TGAATTACAA GATGAGATTG 1500 

TTGTAACAGT TATTGCAACT GGTTTTGATG ACAAACCAAC ATCACATGGT CGTAAATCTG 1560 

GTAGCACTGG ATTCGGAACA AGCGTAAATA CTTCTAGCAA TGCAACTTCT AAAGATGAAT 1620 

CATTCACTTC AAATTCATCA AATGCACAAG CAACTGATAG TGTAAGTGAA AGAACACATA 1680 

CAACTAAAGA AGATGATATT CCTAGCTTCA TTAGAAATAG AGAAGAAAGA OGTTCAAGAA 1740 

GAACAAGACG TTAATCGGTT AATATATATA CACAAATAAT TCAACACAAA TCATCAGATA 1800 

ACATATCTGA TGATTTTTTT ACTAATTTTT AGaACATGTA GAAGGACATT TAAGTTTTTC 1860 

aAAGTTATTA AAAGTGTTTA AGTATCGTGT GAAAATTAAG TCaAAAATTA TTTGCGCAAC 1920

ATTTTAACTT TAAACATAAA TGTTATATTA TATAATTATT AACTTTGTAC AGTTAGACGA 1980 

AGATAATTTA AATGAAATGA TGGTGACGAT CGAGTGAATG ATAATTTTAA AAAGCAACCG 2040 

CATaTTTAA TATATGAAGA GTTATTACAA CAAGGTATTA CTCTAGGTAT TACAACTAGA 2100

GGAGATGGTT TAAGTGACTA TCCTAAAAAT GCTTTTAATA TGGCGAGATA TATTGATGAT 2160 

CGCCCATATA ATATTACTCA ACATCAATTG CAATTAGCTG AAGAAATTGC GTTTGATAGA 2220 

AAAAATTGGG TGTTTCCCAT TCAAACACAT GAAAATAAAG TCGCTTGTAT TACAAAGGAT 2280 

GATATAGGCA CAAATATAGA CACTTTAACT GATGCGCTTC ATGGTATTGA TGCGATGTAC 2340 

ACATATGATA GTAATGTCTT ATTAACGATG TGTTATGCAG ACTGTGTACC AGTATATTTT 2400 

TATAGTACAA AACATCATTT TATTGCATTG GCGCATGCAG GTTGGCGTGG TACCTATACT 2460 

GAAATTGTAA AAGAAGTGCT AAAACATGTG AACTTTGATT TGAAAGACTT ACATGTCGTT 2520 

ATTGGACCAT CTACATCATC AAGTTATGAA ATTAATGATG ATATTAAAAA TAAATTTGAA 2580 

ACATTGCCAA TTGATAGTGC CAACTATATT GAAACTAGAG GACGAGATCG TCATGGTATT 2640 

GATTTGAAAA AAGCCAATGC TGCATTATTA ATTTATTATG GTGTTCCTAA AGAAAATATT 2700 

TATACGACAG CX5TATGCTAC ATCTGAACAT TTAGAATTAT TTTTCTCTTA TCGATTAGAA 2760

AAAGGTCAAA CAGGACGCAT GTTAGCATTC ATTGGTCAAC AGTAAACAAG GAGGAGATAT 2820 

GTTTGCGTGT GAAAGATAAT TTACAACAAA TCTCAACACA AATTAATGAC AAAAGTGAAA 2880 

AAAATAATTT TTCAACAAAA CCAAACGTGA TTGCAGTTAC AAAATATGTT ACAATAGAGC 2940 

GAGCTAAAGA AGCGTATGAG GCTGGAATAA GACATTTTGG TGAGAATAGA TTGGAAGGCT 3000 
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AATCTCGAAA AGTTAAGGAC GTTATAAACG ACGTAGATTA TTTCCATGCT TTAGATCGAT 3120

TGAGCTTA6C CAAAGAAATT AACAAACGTG CAGAACATAA AATTAAATGT TTCTTGCAAG 3180

TGAACGTTTC GGGAGAAGGT TCTAAACATG GTATTGCTTT AGAAGAT6TT GATCAGTTTA 3240

TAGATGATCT TAAAAAATAT GACAAAATCG AAATTGTAGG TTTAATGACG ATGGCACCAT 3300

TGACAGATGA TGAAGCATAT ATTAGATCGT TATTTAAACA GTTACGTTTG AAAAAAGAAG 3360

AAATACAACG ACTCAATTTA GAATATGCGC CTTGTGATGA ATTATCAATG GGAATGAGTA 3420

ATGACTATCT TATTGCAGTT GAAGAAGGTG CGACGTTTGT TAGAATTGGG ACTAAACTTG 3480

TAGGAGAAGA GGAGTGAGCC ACTTGGCTTT AAAAGATTTA TTTAGTGGAT TTTTTGTAAT 3540

AGATGATGAA GAGGAAGTAG AAGTACCTGA CAAACAACAA CAGGTAAATG AAGCGCCAGC 3600

AAAAGAGCAG TCACAACAAA CAACAAAACA AAACGCAATC AAATCAGTCC CTCAAAAATC 3660

TGCATCAAGA TATACAACAA CGTCAGAAGA AAGQAATAAC CGTATGTCTA ATTATTCAAA 3720

AAATAATTCA CGTAATGTTG TAACTATGAA CAATGCTACA CCAAACAATG CATCACAAGA 3780

AAGTTCAAAA ATGTGTTTAT TCGAACGACQ TGTTTTTTCA GATACACAAG ATATTGCTGA 3840

TGAGCTTAAA AACCGCCGTG CGACACTTGT CAATTTACAA CGTaTTGATA AAGTATCAGC 3900

GAAAAGAATT ATTGATTTTT TAAGCGGTAC T 3931



(2) INFORMATION FOR SEQ ID NO: 343:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3150 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 343:
AATTGTCGGG GGACTCTTAG GTTTTGTCAT GCAAAGAACA AGATTTTGTT TAACAGGTGG 60 

CTTTCGAGAT ATGTATGTGC AAAAGAATAA TAAGATGTTC TATGCATTAT TAATCGCTAT 120 
TACTATTCAA AGTATAGGAT TATTGATTTT GACGGCAACA GATATTTTAC AAATTCCTGC 180 

ACATAGTTTT CCAATATTGG GAACAATTAT AGGTTCTTTT ATTTTTGGAA TTGGAATAGT 240

ATTGGCTGGA GGATGTGCAA CAGGtACTTG GTATCGCGCT GGTGAAGGGC TAATTGGTAG 300 
TTGGATTGCA TTAGTATTAT ATGCTGTTAC TGCAGCAATC ACTAAAACAG GGATTTTAAA 360 

GCCAGTAATG GATAAAATTA ATCAACCAAC GAATGTAAAT AGTGATATGT CTCAAACAAC 420

TGGCATTCCG TTTTGGGGAT TAGTCGTTAT ATTAACTATA ATCACCATTT TTCTAGTTGT 480 
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

AG6TATTAGA TATTACCTTT TCGAAAAACG ATACCATCCA TTTATTGCA6 CAATTGTAAT 600

TjSGACTTATC GCACTCTTAG CTTGGCCAAT GAGTGCATCA ACTGGAAGAA ATGACGGTTT 660

AGGTATAACA ACGCCTTCAG CAAATTTAGT ACACTTTTTG ATTACAGGTG AAACTAAATT 720

TATTGATTGG GGTGTCTTTT TAGTTCTAGG AATTTTCATT GGTTCATATA TTGCAGCTAG 780

AGGATCAAGA GAATTTAAAT GGCGATTGCC AGACAAGATT ACAATACGAA ACAGTGCCAT 840

TGGTGGCATA TGTATGGGAT TTGGTGCGTC AGTTGCTGGT GGTTGTTCTA TCGGTAACGG 900

TTTGGTTGAA ACGGCAACGA TGACTTGGCA AGGATGGATT GCGCTAgCAT gCGATGATAG 960

TTGGTGTATG GACAATGAGT CATTTTATCT TTGTTCGTCC AATGAAAAAA GTACACCAAC 1020

AATCTGCAAA GGTTAAACAG CAAACGCAAA TAGTATAGAA GATTATTATG CAAATGATGT 1080

TGATCAAATA AAAGTGATTG GAAAAGGAGA AATAATTATG ATACACGAAT TAGGTACAGT 1140

AGGAATGGTA TGTCCATTTC CGTTAATTGA AGCGCAAAAG AAAATGGCAA CATTGCAATC 1200

TGGAGATGAA TTAAAAATTG ATTTTGATTG CACGCAAGCG ACGGAAGCCA TTCCAAATTG 1260

GGCTGCAGAA AATGGTTATC CTGTAACAAA CTATGAACAA ATTGATAATG CTTCATGGAC 1320

AATTACAATT CAAAAAGTTT AACGTTATCA TTTTAACAAT AAAATAGATA TTAGATTCTA 1380

TGGCTACTTC CGCTAATTTA AAAGTGAGTA AGTAGTCTTT TTTTTTTTAG TTCATGAAAT 1440

CATTTTTATA TAGTGTGGCA CATTTTATTC CAAAAGATGT AATAAAACTT AACGCATTTT 1500

TGCTTTTTAT AAATTGTCAG ATTATTATGA AAAAAAGGGA GTGGTAAGTA TGAATCTTAA 1560

CGATACGATA TTTATGTTTT TGTGTACATT ATTAGTTTGG TTAATGACAC CAGGATTAAG 1620

TTTATTTTAT GGTGGGTTAG TTCAATCTAA AAATGCGCTT AATACTGTCA TGCAAAGTAT 1680

GGCAGCAATT GTGCTTGTTA CATTTGTATG GATAACAGTT GGTTTTACAA TTAGTTTTGG 1740

GAATGGGAAT TTATGGTTCG GAAATTGGGA ATATACTTTT CTTAATCATG TAGGTTTTGC 1800

GACTCAAGAA GATATTAGCC CACATATTCC TTTCGCTTTG TTTATGTTAT TTCAAATGAT 1860

GTTTTGTACG ATTGCAATTT CTATTTTATC TG6TTCAATC GCTGAGAAAA TGAAGTTTAT 1920

TCCTTATTTA TTATTCGTAG TAATATGGAC TGCTCTTGTA TACAGTCCAG TAGCACATTG 1980

GGTTTGGGGC GGCGGTTGGA TTAACAAACT CGGTGTATTA GATTTCGCTG GAGGTACGGT 2040

TGTTCATATT ACATCAGGTG TTTCTGGTTT AGTATTAGCT ATTATGATTG GAAAAGGAAA 2100

CAAACATTCT GAATCAACAC CACATAATCT TATCAriACG TTGAITGGCG GTATATTCGT 2160

GTGGATTGGT TGGTATGGAT TTAATGTAGG TAGTGCTTTT ACATTTGATA ATATTGCGAT 2220

GCTTGCATTT ACAAATACTG TCATTTCAGC CAGTGCAGGT GCTATAGGTT GGTTAATTTT 2280
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ATTAGTTGTC ATTACTCCTG CAGCAGGATA TGTAACATAT CTTAGTGCAA CAATAATGGC 2400 

TTTAATAGGA GGTATCTGTT GTTATATTGT CATTAATTAC ATCAAGGTAA AACTAAAATA 2460

TCATGATGCA TTAGATGCAT TTGGTATTCA TGGTGTTGGT GGTATTATTG GTGCTGTTTT 2520 

AACAGCAGTT TTCCAAAGTA AAAAAGCCAA TCCTGACATT GAGAATGGCT TTATTTATAC 2580 

TGGTGACATA CATATTATAC TTGTACAAAT ATTATGTGTA ACAGCAGTTG TAATTTTTAG 2640 

TATAGTCATG ACGTTTATTA TTGCGAAAGT AATTAAATTa ATTACACCAT TATCTGTTAC 2700 

GGAACAAGAA ACGAATATAG GATTAGACAA GATTGTTCaC GGTGAACATG CTTACTTTGA 2760 

AGGTGAGCTA AATAGATTCA ATAAACATAT TCGATATTAG AATATATTTA CATAGAATAT 2820

TCATTGTCCT GACATTTAAC TAAAGGTTGA TGTTGGGACA TTTTGTTATA CAAAAGTTTT 2880 

ATTTTGAAAT CTTTTTATGA AAGAAGCAGA AATATTATTT AAAGCGGTTA CACATATGCT 2940 

AAAATAAGGC TAAGTGTCAC AAATAATGAT AGGTQAATAA GTATGAAAAA TATATCTQAT 3000 

ATTGCCAAAT TGGCAGGCGT TTCAAAAAGT ACAGTATCTA GATTTTTAAA TAATGGATCT 3060 

GTCAGTAAAA AAACAAGTGA AAAATTAACA AGAATTATAG CAGAACATGA CTATCAACCG 3120 

AATCAATTTG CTCAAAGTTT AAGAGCGAGA 3150 
(2) INFORMATION FOR SEQ ID NO: 344:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3719 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 344:

GTTATAGTGA AATTGACTCA TCACATTTCA CAGACCGTGA CAAACGCGTT ATTAGACGTG 60 

ATCATGTTAA AGAAGCACAA AGCTTAGTAG AGAACTATAA AGATACACAA AGTQCTGATG 120 

CTAGGATGAA AGCCAAACAA AAAGTTAACA CATTAAGCAA ACCGCATCAA AACTATTTCA 180 

ATAAACAAAT TGATAAGGTT TATAATGGAT TACAACGCTA ATCCAAAGTA AATTATAAGT 240 

TATACATCTC GTTTTTAAAT GACAATTTAT CCCCGTAAAT ATTATAAATA ATCTTTTCAA 300 

ATTCCACATA GATATAGAGA CACTAATAAA CCTCTTTGTC TCGATATGAT AGTCTGCAAC 360 

GATTCATGTT GTAGGCTTTT TAATTTTACA AATAAGGCTA AATATATAAG TTCTGGCACC 420 

TAAAATATAG AAAATACATA AAAGTAAGTA TAGTTA1TTT ATTATAATTA TTAAATTTTT 480 

ATTAATTAAT TGTAAAAATG TATAATTATA ATTAATTAAC GTTTAATATT AAAATTAACT 540 
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ATCGTTTCAA TATTACTTAT AGGGATGGCT ATCAGTAATG TTTCGAAAGG GCAATACGCA 660 

AAGAGGTTTT TCTATTTCGC TACTAGTTGT TTAGTGTTAa CTTTAGTTGT AGTTTCAAGT 720 

CTAAGTAGCT CAGCAAATGC ATCACAAACA GATAATGGCG TAAATAGAAG TGGTTCTGAA 780 

GATCCAACAG TATATAGTGC AcTTnCAACT AAAAAATTAC ATAAAGAACC TGCGACATTA 840 

ATTAAAGCGA TTGATGGTGA TACGGTTAAA TTAATGTACA AAGGTCAACC AATGACATTC 900 

AGACTATTAT TGGTTGATAC ACCTGAAACA AAGCATCCTA AAAAAGGTGT AGAGAAATAT 960 

GGTCCTGAAG CAAGTGCATT TACGAAAAAA ATGGTAGAAA ATGCAAAGAA AATTGAAGTC 1020 

GAGTTTGACA AAGGTCAAAG AACTGATAAA TATGGACX3TG GCTTAGCGTA TATTTATGCT 1080

GATGGAAAAA TGGTAAACGA AGCTTTAGTT CGTCAAGGCT TGGCTAAAGT TGCTTATGTT 1140 

TATAAACCTA ACAATACACA TGAACAACTT TTAAGAAAAA GTGAAGCACA AGCAAAAAAA 1200 

GAGAAATTAA ATATTTGGAG CGAAGACAAC GCTGATTCAG GTCAATAATG CTCATTGTAA 1260

AAGTGTCACT GCTGCTAGTG GCACTTTTAT AATTTTTAGA TCACGATATG ATTTATTATC 1320 

AATTCAGAAT TAAAAAAGTA AATAGTATCA AAAGTAAGTG TATTTAATAT TAGAAAATAA 1380 

AAATTTTAAA TTTAGTATTA AAATGGAATG TTACTATATA GTTCAATGTG TATTATCACA 1440 

GAAAATAAAA TAATGCTTTA CTTCTATATT TAAAAGTGTA TAATGAAAGT TAAGTAATAA 1500 

AGAGCGTGAA GAAAAATGTG AGTTATTTAT ATAGAATATT CTCCTTTTCA TTTATGAATT 1560 

TGTTACAAAA TATTTAGTGC AAAAGCACGA CGGAGGTATT CAATATGaAT AACGGTACAG 1620 

TTAAATGGTT TAATGCAGAA AAAGGTTTTG GTTTCATCGa AAGAGAAGAT GGTAGCGACG 1680 

TATTCgTACA CTTCtCAGCA ATCGCTGAAG ATGGATACAA ATCATTAGAA GAAGGCCAAA 1740

AAGTTGAATT CGACATCGTT GAAGGCGACC GTGGCGAGCA AGCTGCAAAC GTAGTTAAAA 1800 

TGTAATTTTA ACTTATTCAA ACAGTCCTTA CTATAGGGCT GTTITTTTAT GCTTTAAATC 1860 

GATAACAGTT GGTGTGGTAA AAGCACTAGC CGTTATTTTT TTGTCCAATA AATTTAGTTG 1920

GAGATTTAAC AATATATAAT GGTTCTAAAA TAAATCGAAC TGATGGAAAA GTTTTTTACT 1980 

TTTCATCTGT CCGACTTTTG ATTTTGAATA TAAAAAAGCG CCAATACAGA ACTTTAATAA 2040 

TGACGAGAAT TAAAGTCTGT ATATGGCGAT AACAAGAAGT AATGTTAAAC ACTCAAAATG 2100

TTTAACAATA ATAGGATACC ACATCGCATA ATATCTTACT ACTTAATTAA TAATTTAACT 2160 

AATCAACTTT TTGTTAATTT TTTATTAAGA CTGATTAATT ATTGAGAATA TTTATTGTTT 2220 

TTAAAATCTC ATAATAATTC AGTAATCTTG TTTTCATTTA AAAGGCGAAA CATTAAAATA 2280 

ATTAAATAAA AATATTGCGT TTAATTTACA GCGTCAAATA TACTTATTTC TAATGCTTTG 2340 
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
ATTTCTGTGA GTATTTGGAA GCTACCATTA GGCAACGGTT TAACAATAGA CAATTGCTTT 2460 

TCCGCTTGTT GTATTAAAAA AGGTTTTGTA GATTGATTAT TAATATGCCA TTCACTCATG 2520 

TATGTTTTTC ACTCCTGCTT TAAAATAGGG TTAGAAAGTT TATAGTTGAG ACATTCATGT 2580 

TCAACCAAAA TTTTGTTCGA ATTCAATAAA TGTCTTGTTT AAAATAGAAA TATTGTAAAT 2640 

GTTATCGTCC AAAACTTCAC CAGTTAAGTA TTTGTTTTGA ATTAAAATTT GGCAGTTAGT 2700 

TAAGAAGTCT TGATAATCAC GATCGCAAAA ATAGTTTTCA CGTGCATCTT TAGCATCGCC 2760 

AAAAAAGTTA GCGACTGTTT CTGTTTCTCC TTTATTCGAA CGTTCAATAT ATAATTTGTA 2820 

AAATTTAGCT ATTGTATACT TTTGTTCTTT AGTTAGTTCA TTCAAAATAT TGGGCCTCCT 2880 

GAAATATCAT TTGTAATCTA TACCCAATTT ATTGCAAAAC AAAAACTAAT TTAACTATTT 2940 

GATGAAACTG TGTTAATAAg CTTTAACAAG CCTTAGTTTG TATGGATCTA TAAAATTATC 3000 

TTTAATTGCA TAGGGTQAAA TAATATGTAG TCCATAACTT TTAACTGATT TTTCACTTAC 3060 

ACCAAATTTA TAAGCTTGGT AGATAATTTT AGTACAATAC GTAAATTTTT TGCTGTTCAA 3120 

ATTTAATGTA ACTAGATAAC GATGATTTGT ATTCTCATAG TTTTTCTTAA CCCATTCAGC 3180 

CGCTTTTTTA CCTGCACCAG GATAGCTGCA ACGATAAACT TTCATCCAAT CATTTTTGCC 3240 

ACTTGCATAA TTATATTTAA AAGATTCGAA GGATTGTGTA GTTGGTTTGT CGCCAGGCCC 3300 

CTCAATTTGT AAAATCGTTT TATCATCAAT OGCGATACTA CAATGACCAA AAAATCscCA 3360 

CATGACAGGG CCTTTTGTAA CAATAATATC ACCAGGTTGT AATTGGAATT TGTCATCTTG 3420 

AATTTCTGAA TACTTATTAT CTGCAATTGT TTTTGGTGAG TTTATTGGGG ATACGACAAC 3480 

GAATAATATA AGTAAAATTA TCGTTCGTTT AATATAGTTC ACTTAAAAGC TCCTTGTTGA 3540 

AGAAATATAT GTAAATAGTC TTAAATTAGA ATTGTAATCT TTAATAAGCT TGtAAGACTA 3600 

AAACATATCT TAAATATTAA AGTATGAGAQ TGTGAAATGT CTATTAAGAA TnAAAAACAG 3660 

TCTGAAACAT CATTGAGACG TTCCAGACTG GATATAAAAT GAATTTCATT TATAGCACA 3719 

(2) INFORMATION FOR SEQ ID NO: 345:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1676 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 345: 
TTGCGTTGCC GCACCAAGAT ATTGAATGCC TAGCGATTCC GAGTATGCAA ACTGAACGCA 
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CTCTTTAATA CGCGTTATCG CTTTTTGTAA ATCTGCATCA TGATAACAAA TCATAACGCC 180 

ATAGCCACCT GCTGTCGGAA GATCATCTCC CGGCTTAATT ACTAACGGGA ATTCCCAATT 240 

CTTAATCTCG TTTTCGAATT GCTCAATTTT TACAACTTTT CTTTTTGGTA AAAACTTCCC 300 

ATTTGTCCAT TCAGGTATTC TTGCTTTATT ATTTAAAGCA ACAAATAACG TTTTATCTAA 360 

TGCATAATAT TGCTGATTCA AGATTGTTTC ATCATGAATA TATTGAAAAT AAATCTTTTT 420 

ATTTTCCTTA TGTGCCAATT GTTTGATCAA GTTTTCGTAA GATTGCTGAT TGTTAAATGT 480 

ATAAATTGAG TTCGGtACTT CCTTACCAAT AACTTGAAAT AGCTGATGCA ATTTGTCTGT 540 

CGCACTAGCT TCGTGAACAA TAACAGGTAA TTGATTTGCT ATTAATAACT CCCTACCAGT 600 

TAAAAAATTA GATTGATGTT CGTCCGGTTT CAACCATGGA TTCGATATAT ACGAAGGTCT 660 

TGACGTATAG ACAACATCTT TGTCATATAA ATCACTTAAC GTTAAGTTCG GCTCATTACC 720 

ATTATTTGTC ATTACTTCCC ATTCCCTTTC AAATGCGCAT GCTCTTCAAT AATGTCTTGA 780

TAAACGTCTT OATTTGTAAT TAACTCTAAC CCCATCAACG CCATTATTTT AGCGCCTTTA 840 

ATTAATGCTT CATCACCATG TACACTCGCA GCCGCTTCTC TAAATCTATG CGTATGTCCT 900 

ACTAAATTAC GTGATCCTAT TTTAATATGA GGATGTATTG TTGGCACAaC ATgaCTTACG 960 

TTCCCTGTAT CCGTAGAGCC ATAACCAAAA TCATCATCAA TAACTGCTTC ACCAACTTCT 1020 

TCAGCATATT TAGCAAATAA ATCATCTAAT TTCGGCGTTT TAATGAATTC ATTCACACCG 1080 

TTTTGAATTC GACCAAATTC ATAATCACAA CCAGTCTGTA TCGCAGCTCC ACGTGCGATT 1140

TGATTTACTT TTTCTGTTAA TATATCCAAT TCTTTACGCG TCATTGCTCT AGTATAAAAA 1200 

CGAGCATGTG TATAGTCTGG AATAATATTA GCTGCTTTCC CGCCATCTAA AATCACACCA 1260

TGCACACGTT GATCTTTTTT AATATGTT6T CGTAGTTGTG CTACACCATT AAAATAACTA 1320 

ATCATAGCGT CTAATGCATT TAACGCTTCA TCTGCATTTT CAGAGGCATG AGCACTTTTT 1380 

C06TAAAATT TAACATCTAA AACATCCACT GCCAAAGTAT CAATCXSTTTT ATAAGTTTCA 1440 

TTTCCCGGAT GAATCATTAA GGCAATGTCT ATTTGATCAA TCACACCAGC CTTGACATAA 1500 

GAAGCTTTAG CGCTACCATT TTCCCCACCT TCTTCAGCTG GACATCCAAG AACGACTACT 1560 

TTACCACCAA TTTGGTCAAT CACTTGCTTC AAACCAATTG CACCAA6AAC ACTTGCAGTT 1620 

CCAATGATAT TATGACCACA AGCATGACCC AATCCTGGCA AAGCATCGTA TTCTGC 1676 
(2) INFORMATION FOR SEQ ID NO: 346:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1294 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 346:

TACAGTAGGA ATCATAAAAC CTAATACAAC AAATACAAAA CCATTTAAGG CATAACTAAA 60 

TGTGTTCCAA ATTTGATGGT AATTCATTTG TAGTTCCGTT TGTGCTCTAA TTAAACGGTC 120 

GCGTTCTAAA CCATGGATTA GACCTGCGAT TACAACTGCA ATGATACCTG AAGCATGAAC 180 

TTCTTCTGCT AAAAAGTATA CGACAAAAGG AGTTAATAAT TGAATAAAAG TTAAGGTATT 240 

GTTATCTTTT AAACCTTTAT TAGCGGTTAA GTCTATACGT ATTCTAACGA CAACGAATCC 300 

AATAATTGCA CCAATAAGTA CACCTAGTAT TGTTGAAATG ATAAATTGTT CAACAGCTT6 360

GAATAATGAA AAGGTACCAG TTACTAATGC AGTAACAGCA ATTTTAAATG AAATGATACC 420 

TOCTGCATCA TTGAGTAAAG ATTCACCTTC TAAAATCGTC ATAGAACCTT TAGGTAATAA 480 

TTTTCCGCGT GTAATAGCAG ATACTGCTAC TGCATCAGTA GGACATAAAA TTGCTCCTAT 540 

TGCAAAAGCG GCTGGCATTG GTAAGGCAGG CCAAATCCAA TGTATAAAAT AGCCAACACC 600 

GACTACAGTT GCAAACACTA GTGCCATTGA CATTAATAGT ATAGGTTTAC GATATTCTAA 660 

TAATTTTGTT CGAGAGACGT GGGTAGCTTC CACAAAAAGT AGTGGCGCGA TAACGGCAAA 720 

CATAAATACT TCAGAATTGA ATTGGAAATC AACTTGTATT GGAATAATGA AAATAACGAC 780 

ACCTAATGCA ATTTGAATAA AGGCAGTAGG AATTTGTGGG AATCGATTAT TGATAACCGA 840

ACTAATAATC ACAGCAAAAA TAAAAATTAA AAATGCTTCT AATAGTGCCA TACAATACTC 900 

CTCAAAATTT TAATAGTTAA TATTTTATCA CTTTTAAGGC ATAATGACAT AGATATATTG 960 

ATAAAATGAA GTTATTTTCA AAAAAACTCT AGTATCGGTT GAACTGATAC TAGAGCGAGA 1020 

TGTTTAAATT ATTGATTGTC ATATCTGAAA TGACCGCTGT CATTrTGTCG TTGTTCATAC 1080 

GCGAGCTTTT CAGCATTCGT TTTGTATTTT TtATAAAAGa AAAATAaAAA TATnAACCaG 1140 

AATCGCQAAA TATAAATAGC TGCTCTTGtT TCGTCACTAA AGAATAATAA AATGAATACA 1200 

AAGAAGAAGA ACGCTAGAAT AATGTAAGCA ATAGGCTTAC CACCAATCAA CTTAAATTTA 1260 

CTGTTTTTAT GTGCCTCAGG ATGCTTTTTC AAAT 1294 
(2) INFORMATION FOR SEQ ID NO: 347: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1935 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
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ACATGATAAT GATGACGCTA TTAAAACACG TTTTTTATTT TTCATTGTTA TAACCTTCTT 60 

TCGTATGATT GATATTTGTT GATATGTATC GACATGTGAA TAATATCACA AAAACAGAGA 120 

ATATATATTT AACTATTTAT TAAATGATTT TGTTAATATT ATTAAATACT TTATCCTCTT 180 

TAAAAATAAT GTGTGTACAA AGTCATTAAT TTAGCAAATA TTTTTATTTA GTAGTTAATA 240 

ACCATCGATT TGAAATTTAT ATATAATTAT TAGCTAAATA ATATCCTGCA TCTTTCTCAT 300 

ACAATTTACT ATAAAtTAgC ATATCCGATA TCAGCGTTAA TAAGATCGTT GATACTAGmC 360 

AGTTAATTTC ATAGAACGAA ATCAAATAAC ACACTACTTT CTGCATTTTA AATTATGTTT 420 

AAGAATCAnA ATTATGTTTA nATAAATATA TATACTACTT TGAAAGGTGT GAGCTTAATG 480 

ACAACTTTTA GTGAAAAAGA AAAAATTCAA TTACTAGCAG ATATTGTTGA ACTACAAACT 540 

GAAAATAATA ATGAAATAGA CGTTTGTAAT TATTTAACAG ATTTATTCGA CAAGTACGAT 600 

ATTAAATCTG AAATTTTGAA AGTTAATGAA CACCGCGCCA ATATCGTTGC AGAAATCGGT 660 

AACGGCTCAC CTATACTCGC ATTGAGTGGT CATATGGATG TTGTTGATGC AGGAAATCAA 720 

GATAATOXSGT CATATCCCCC TTTTCAACTG ACAGAAAAAG ATGGCAAATT ATATGGCCGA 780 

GGCACTACAG ATATGAAAGG CGGTTTAATG GCTTTGGTCG TATCTCTAAT CGAATTAAAA 840 

GAACAAAATG AATTGCCTCA TGGAACGATT AGATTACTGG CTACTGCTGG CGAAGAGAAA 900 

GAACAAGAAG GTGCCAAATT ATTAGCTGAT AAAGGCTATT TAGACGATGT CGATGGCTTA 960 

ATTATTGCTG AACCAACTGG ATCTGGAATT TATTATGCAC ATAAGGGGTC TATGTCATGT 1020 

AAAGTAACTG CAACTGGTAA AGCTGTCCAT AGCTCAGTTC CATTTATTGG TGACAATGCA 1080 

ATTGATACAC TGCTTGAATT TTATAATCTA TTTAAAGAAA AATATTCAGA GCTTAAACAA 1140

CAAGATACTA AACATGAATT AGATGTTGCG CCTATGTTCA AATCATTGAT TGGAAAAGAA 1200 

ATTTCTGAAG AGGATGCAAA TTATGCATCT GGTCTTACAG CTGTATGTTC GATTATAAAT 1260 

GGCGGCAAAC AATTTAACTC TGTACCAGAT GAAGCTTCAC TTGAATTTAA CGTAAGACCA 1320 

GTTCCTGAGT ATGATAACGA CTTTATAGAA TCGTTTTTCC AAAATATCAT TAATGATGTG 1380 

GATAGCAATA AGCTTTCACT CGATATTCCA AGCAATCACC GACCTGTAAC AAGCGATAAA 1440 

AATAGCAAAT TAATTACTAC GATTAAAGAT GTAGCTTCTA GTTATGTAGA ACAAGACGAA 1500 

ATATTTGTTT CAGCGCTTGT AGGCGCAACA GATGCCTCTA GTTTCTTAGG AGATAATAAG 1560 

GACAATGTTG ATTTAGCCAT TTTTGGACCA GGTAATCCAT TAATGGCACA TCAAATCGAT 1620

GAATATATTG AAAAAGATAT GTATCTGAAA TATATTGATA TTTTTAAAGA GGCTTCCATT 1680 

CAATATTTAA AAGAAAAATA AGAACGATGC TGTCAGCTGC CCTATTCGCG TGCTGGCAGT 1740 
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TCAAATATCA ACAAGCACAT TTTCATTGAT TAAGTGATGT AAAACTGAAA TTATTGTGCT 1860 

GATTTOTCAT ACATATATTG ACTAATGGGC ATATAAAAAQ ATAGCCTCTA ATAGTnACAT 1920 

AAACTCGTAA AAnCC 1935 
(2) INFORMATION FOR SEQ ID NO: 348: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1351 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 348: 

CCTTTnCCTA AACAATTTTT AGATTTAGAC AACAAACCGA TTTTAATCCA TACATTAGAA 60 

AnATTTATTT TAATTAATGA TTTTGAAAAA ATTATTATCG COACOCCACn ACAATGGATG 120 

ACGCATACGA AAGATACACT TAGAAAATTC AAAATTTCTG ATGAAAGAAT TGAAGTCATT 180 

CAAGGTGGTA GCGATCGTAA CGATACAATT ATGAATATCG TTAAACATAT TGAATCAACA 240 

AATGGTATTA ACGATGACGA TGTCATTGTG ACACATGATG CAGTTAGACC ATTTTTAACG 300 

CATCGTATTA TTAAAGAAAA TATTCAAGCT GCTTTAGAGT ACGGTGCAGT AGATACAGTG 360 

ATTGATGCTA TAGATACGAT TGTTACATCT AAAGATAATC AAACGATTGA TGCAATTCCA 420 

GtGCGTAATG AAATGTACCA AGGTCAAACA CCTCAATCGT TTAATATTaA TTTATTAAAA 480 

GAaAGCTATG CACAGTTGAG TGATGAGCAA AAGAGTATTT TATCTGATGC TTGTAAGATT 540 

ATTGTAGAAA CAAACAAACC GGTTCGACTT GTAAAAGGTG AGTTATATAA CATTAAAGTA 600 

ACAACACCTT ACGATTTAAA AGTAGCGAAT GCTATTATTC GAGGTGGTAT TGCCGATGAT 660 

TAATCAAGTA TATCAATTAG TTGCACCTAG ACAATTTGAA GTTACGTATA ACAACGTAGA 720 

TATTTACAGT GACTATGTCA TTGTACGTCC TTTATATATG TCAATTTGTG CTGCCGATCA 780 

AAGATATTAT ACTGGTAGCC GTGATGAGAA TGTCTTATCT CAGAAATTGC CAATGTCTTT 840 

AATTCATGAA GGTGTTGGTG AGGTCGTATT TGACAGTAAA GGTGTGTTTA ATAAAGGTAC 900 

AAAAGTAGTT ATGGTACCGA ATACGCCGAC AGAAAAAGAC 6ATGTCATTG CTGAAAACTA 960 

TTTAAAATCG AGCTACTTCA GATCAAGTGG ACATGATGGG TTTATGCAAG ATTTTGTGTT 1020 

GCTAAATCAT GATAGAGCTG TACCACTACC TGATGATATT GATTTAAGTA TTATTTCATA 1080 

TACAGAGCTT GTAACAGTAA GTTTGCATGC TATTCGTCGT TTTGAAAAGA AATCTATTTC 1140 

AAATAaAAAT ACATTTGGTA TTTGGGGTGA TGGTAACTTA ggTTACATTA CAGCCATTTT 1200 
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GAGTCACTTC TCATTTGTTG ATGATGTCTT CTTTATTAAT AAAATACCTG AAGGCTTAAC 1320 
ATTTGATCAT GCATTTGAGT GTGTGGGTGG T 1351 

(2) INFORMATION FOR SEQ ID NO: 349: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 411 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 349:

TCATCAAGTC TACGATAAAT TAAGTCCATA TCTAAAGGCT CXSGGGTCGAC AGTTTGTAAA 60 
GTATAACCAA CTGCACAGTG GCTACAAC6C ATATTACAAA GATTTGTAGT TGTAAATTCG 120 

ATGTTACTTA AAGTTAATTG GCCATGTTCT TTAACATCGT TATATGCTTC CCATGGGTCG 180 
TTTTGAATAC TTATTTTAGG CTTGTTATTA CGCATTTTAT AAACTCCTTA ATTGTTATTT 240 
GATACCAATT TGATACCGTT TAATCAAATA TGCTCATAGC TTGATGTTTT TTATCAGTAT 300 

ATAAATGAGA GTACGTTTGA ATTGTTTCTG TAATGTTAGA ATGCCTCATT AATTCCATTA 360 
ATAAATACAT ATCTACACCA TTATTAATTA AaTAGCTAGC GTACGAGTGT G 411 
(2) INFORMATION FOR SEQ ID NO: 350:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1639 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 350: 
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TCATTTTCAT AGGTTATTAC GCAGATCAGC ATAATAATCC ATTCCATATG AGTcCTTATT 60

TTGGTTATGC AGCACGTCTA TTGGCAACAA GTGGCATTGA CTATACQTAT GTAAGAATGG 120

CAATGTACAT GGATCCACTT AAACCATATT TACCAGAATT GATGnATATG CATAAACTGA 180

TTTATCCnGC TGGCGATGGT CGTATTAATT ATATTACTAG AAATGATATT GCTAGAGGTG 240

TCATTGCTAT TATTAAAAAT CCAGATACTT GGGGCAAACG CTACTTATTA TCAGGCTACA 300

GTTATGATAT GAAAGAACTT GCTGCAATTT TATCTGAGGC ATCaGGCACA GAAATTT^T 360

ATGAGCCCGT TTCATTAGAG ACATTTGCAG AAATGTATGA T6AACCTAAA GGCTTTGGTG 420

CATTATTGGC ATCAATGTAC GACGCAGGAG CAAGAGGACT ATTAGACCAA GAaTCCAATlc 480
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TTAATAATAA AGGAGCGTTA TAGTGAATAT CATCTCAACA ATTtTAATCA TATTTGTC5GC 600 

ATTAGAGTTT TTCTATATTA TGTACCTTGA AACGATTGCT ACAACTTCCA AAAAGACTAG 660 

CGAGACATTT AATATAAGCG TCGATAAATT GAAAGACAAA AATATTAACC TACTTTTGAA 720 

GAACCAAGGC GTATATAACG GTTTAATCGG AGTTTTGCTA ATATACGGTT TGTTTATCAG 780 

CAGTAATCCA AAAGAAATAT GCGCAGCTAT TTTAGTGTAT ATCATTGGCG TTGCTATTTA 840 

TGGTGGCCTT TCAAGCAATA TTAGTATCTT TTTCAAACAA GGCACATTGC CAGTATTGGC 900 

ACTCATATCA ATGCTTTGGT AAGTATTGGT GTTTGGGGGG GTGGAGATGT AGTCGGAGGT 960 

TTGGAGGATT TQAGCGAATT GTGTGTGGAC TTTAGACTCA GAGTATTTCA TCCTAATTAT 1020 

TTCAAGCAGA GGTGACAGTA GCGTTGCCTC TGTTTCCTTA TAAAAAAATT ATTTAATGAA 1080 

GAAAACCCAT ATCTGATTTA ATTTTCAGCT GATAAATACT CCATATATTA GAATGGCTAC 1140 

TTTATCTATT GCATCAATCC TTTAAAACAA AAAACCCATG ATTTCX5AAAT TCCCGTATGA 1200

TGGGGTTCCT ACTCTCATGG ATCAGTTAAA TAAATATTAT CACTATCAGT TTATTATTTC 1260 

AATATTATTA ACAATATATG TAGTCGTAAA AGGAAAGAGG ACATGAGAAC TTCGGTGTTG 1320 

ATTGGCATTA CATAACGCTT CCAAACATAT TATTTGGTAA CAATAAGAAA CTATTTACAC 1380 

AATATATTTT GTATAGTAAA ATTATTTTAT AATATTTAAA TCCAATTGCA CAAGGAGTGA 1440 

TTATCaTGGT ACCAGAAGAA AAAGGTTCTA TTACTTTGTC AAAAGAAGCA GCTATCATAT 1500 

TTGCAATCGC AAAATTCAAA CCATTTAAGA ACAGAATTAA AAATAACCCA CAAAAAACAA 1560 

ATCCATTTCT TAAATTACAT GAAAACaAAA AATCTTAATC ACTTTtATTT ATAGcATTTC 1620 

TAATCTCAGA AATGCTATA 1639
(2) INFORMATION FOR SEQ ID NO: 351: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1816 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 351: 

AAAATCGCAT ATAGTAATAT GAATAACCAG ATTGTATCTA CAAAAAAGTA TATTGAAAAA 60 

CCAAGCGCAC CCATTAATAA TGCGAGAATA ATAATAATTT TTCTATTAAA GTGATGCGTA 120

TCCGAAAATC TAGCAATAAT TGAATTTACT GTAAACTGGC TAATCGCTGC AGATGCTAGA 180 

AGTAATCCAT ACTGATTTGT TGTCATACCT AAATCTTTAG TTGCAAAAAG AACAAGATAT 240 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 352:

GCGTTGTCGT CGATGATTAA TAAAGTATGG GTATACCATT AAGAATAAOG CTACCCAAAT 60

gAkTGCTAGT GACGTQCCGC CAATGACATC TGAAAAGTAA TGTGCATGAA AATAAAGGCG 120 

ACAAAATAAT ATGCTAAGCC ATAATATTCC CATAACCAAC GCACTCAACA CTTTTGTTAT 180 

TGTCTTAGCA GCAAGTGAAA TAATAATGAT CATTAAGGCG AAATATAATA ATGTGCTGGC 240

GTTGGAATGT CCACTCGGAA ATGAAAAGCC TGTATCAACG GCTAAATGAT TATATGGTCT 300 

TGGACGTAAT ACAGTATCTT TAATTAATTT GTTCATGATG ACACCTGAAA CCAAATATGT 360 

CACAAACCAA ACCGCTAAAT GCCTCTGTTT AATAAACAGT ATGATTGTGA CGATAATGGA 420 

AATCAACACG ACACCTTTGA CATCTCCAAT TTCCGCACTA AACGTCATAT AGTAATTAAA 480 

CAAATTGTTA ACATACTGAC GTTGTGGCTC ACCGAAATAA TCTGTAAACC ATGTTAATGA 540 

TCCCATATCT ATATTTTTAA GCCATTCTTG ATTTGTCACT ACACTGTAAA ACATACCTAT 600 

AAATACAATC AGCGCGATTA AAAATAAAGG CACTGTcATT TTCGGTGATG TTAATTTTTT 660 

ATCTATCATC TTACAATCTC CTCGTATCAT CATTTTCATT TTACAAATGT TATCCATAAT 720

ATCAATGTGC CACAAATTTC ACTTTACCGA CAATATCAAA ATTATAAAGT TCATATTGTT 780 

ATGTATATTG CAAATAAAAC ATTGTATAAT TGAAATAACA ATATTTTGCT ATTTTCAATT 840 

TAGTACGATT TATATTTATT ATACAGAGGG GGTAAGGCGT ATCAATAGAG TTATTTTTGT 900 

CTATATAGCG TTAATCATTA CATTAGTTAG ATTCTTTACC CCTATTCATC CATCATTTTC 960 

AAATTTGATT TACTGGATAT TTGTATTATA TTTTATTCCT ATTATACTAT GCGTTATCGG 1020 

TTTCAAGGCC GAAAAACTTA TTGCAACAAT GGTCATTATA CCTAATTTTT TAGGAATACT 1080 

TTATCGATTA TATGCCTACG TCACACATAT ACTCTTTATG TAAAAGGATA GTGGGCATGT 1140 

CTCGCAACAA ATATGCTTGC GTCGACTTGT CACCATTCGC AAATTTTATG ACATACGCCT 1200

TTCACGGGCT ATATTCAGAC CCACGCATTC ATCCACGTAA TAAACACATC ATGTAATAGA 1260 

AAAACAGCAC ACCCAAATAT ATGGCGTTGC GCTGTTTAAC CAAGCATACT TCTATAGCTT 1320 

TAATAAGCCA GCAGAAGCAT ACCTAACCTT CTTAAATATG CTTTTCCAAA TTATCCTCAA 1380

GTTTGAATAC GATAATACGT TCACCTGTAA CTGTACTTAA ATCACTATGG AAGCTCATCA 1440 

CTTTGATACC TGTAATTTTA AAAATGATAT CATTCAAATC TTGCTCACCG GATTCAACTA 1500 

ATTCAGAACG TGTTCGTTTA ATATTTAATA ATCCTTCATT CGTACTACAT ACACGATATT 1560 

CAGCTGGC6T TAAGATACCT TCTAAACTAA TAATCACCAT ATCTCTTAAA ATGTCTGATT 1620

TGCTTTAATT TTTTGTTTTA ACTGAGATTC GTATGTCATT TCCCAGGCAA TTAGCCATAA 3540

ATTTTTAAAT ACATTTTTAT TCATAGTTGC TGCAAAATGA ATAAACGTTT GAATTTCATT 3600

GACX3TCATGT TGTAGTAATA CATCGCTAGG CATATCACTG TAGTAACACC ATGATGCAAC 3660

AGTTTGCTTA AACCAMTTT CCGATCTACT TTCACAATCT TTAGCGACTG ACTTAAACTC 3720

ATCACCAACT AGCAATTGTT CGACAACTAA CCGAATTGCC AAATAATCCG TTAACAAATG 3780

TTGTTCAAAG TGATGCTGTT GTGAACGGTA ATACAACATA CCTGCCCAAC CCGGTAACGC 3840

CAAAAGAT6T CCTTCAACAT AAGCTTGGTA GTCTTCCTGA TCTATTGAAA AATGAGTTAA 3900

TACTGACTCT ATCGTCATTT CAGGATCATT GGGTAAGCCT TTAATCACTT GGCGCTGTGC 3960

TTTAGTAAAA CTATGGTCAT GTTGCGCTAA ATGCAACCAT GCATGGTAAA AACTTTGcTC 4020

ACGCTTCGGC ATTGTCCAAC TCGATAGAAA TTGATCGATA TAAAGTTTCG TCCATTTAAT 4080

CATTTGAOGA TTCACTTGTT CX3CTAAGTGG CTCACCTTGT TCATCTATTA TTGCATCACT 4140

CATCGGACGT ACATCATAGT GATGATATGA TTCAGCCATA TCAOGTTTTG ATTTTTCTAA 4200

TAGTAGATCA GCAACAACAT CAACATTTGA ATGATTCATA TATGATGCAG GTACGTCTTT 4260

TAATGTTTTA ATGTTATCAA TATAAAGATT GATGTAGTGT TGCGGGATAT TGTAGTGATG 4320

TTCAAGTAAC ATATCAGTAA CAAGTTGATT AAAGACACTT TCATCTAATT CACCACGTGC 4380

CACAGCGCTT TCTATTAATG CTTTATTTGG GAAAATATCC ACATCTCGAA CATCACGTAA 4440

CCATTTTGCG ACATCTTCAA ACX5TATCCGC TTCTAATCCT TCCCATGGAT TTCGTGCTGC 4500

AAAAATCGAA ATTGGTGATA ATGGTGTAAT AACACGTTTC GCATTTTCAA TGACTGAATT 4560

GATATTTAAC TGTGTTGTCA TACCTTTCAC CTCCTATAAA TACTTCTTCA AATAATTCGG 4620

ATGACTTTCT ATCGCTTTCG AGCGTGCTTC ACCTAGATTA ACTAACCACA CGTACAATAC 4680

CGCAAAAGCC TTAGAGTATC GATGCCGCGC CACCCAAATA CTTAATAAAC TGCCAAAGAT 4740

TAAAATAACA ACACTAATGA TGACACTCAC TGTAGGCX^GC GTTGTCGCAT GTGTTGTTAT 4800

ATTTTGTAAT ACAGCGTAAA AATAATTATG TGTGATGACG TAGATAAATG TCACGATTGC 4860

AATCAAAATC ATACCAACAA GACGTGCCAT GCGTCCTTTA CTAAAGGCTA CCATTTGATT 4920

CCAAGATACA AGTAATGACC ATCCTAGAAT GAGTGCACTT AACACTTCAT ATGCACTTCT 4980

GTCACTACTC ATCCAAAATA GAAATGCCAC GATAATAGCT AATACACGTC CCATGACAAT 5040

CCAGCCATAA GCGTCTTTAG CAGATGCTTG TTTTGGAATA TTGAATCX3CT TCACGATAGA 5100

ACCTGATTGT AAAAATAATG TTGCTTTAAA AATACCGTGC AATATTAAAT GAATAATCGC 5160

TGCTGAATAT ACACCCAATG CACATTGAAC TAACATAAAG CCCATTTGAC TCATCGTAGA 5220
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AGAAATACTA GAAAGGATAA GTAATAATGA TAACGCAAAT CCATTATCAA ATATCGGOGC 5340 

AAAACGAGTT AGAATAACAC CACCTGCATT CACAATTCCT GCATGCATAA TTGCCGATAC 5400 

TGGCGTTGGT GCCGTTACAG ATTCAATCAA CCATCGATGA AAAGGAAATT GTGCTGCCGG 5460 

TATCATGACA GCTAATACAA GTAGTACATT CGTCAACAAT GACCATGTCG GATGAACTAT 5520 

ATGTTGTGGT ACCCGCCACT CGCCAGTCGC AATATAAATA GTTACAATTG CTCCAACGAA 5580 

TGCAAGCCAA CCACATAAAA ATGTCATGCT TGATAATTTC GCAGACTCAC GTGGCACTTT 5640 

CCAAAAACX3A TTAACGTTCA TCAGCAATGT TAAACATAAT AATGTAATAC CCCAGCAGAG 5700 

TGCCATCAGT CTTAAGTCTT CAGACATCCA TGCTAAAGAT GCAAACGACG TAATCGCAGT 5760 

GAACAATGGA AAGTAATGTC TATAATGATG ATCACCTAGT AAATATCGCA TTGAAAACTT 5820 

TTGAATAATA AAGCCAAGCG CCATTACAAA GCCAGCTAAT AACCAAGATA AACGATCTAT 5880 

TTTAAATGGA CCTAAGACAT GTTGACCATG AATACCGAAA AAGCCAATGA CTGCAAATAA 5940 

TACTGGCATG ACTAGTATGT ATAAATGTAA TTTAATATAT CTCATTGGCA TAACTGGTGC 6000 

TAAAAACAAC AAGCCACTTA TCAATGCAAT GATAAGCGCA ATAACAAACA GTGAAAATAG 6060 

CAATTGAAAA CTTAACACTG CATAACCTCC TTATTTCTAA TCTCTCGCAT AATTGCTTAT 6120 

GTATAAAAAT AAAAACCTAC AATAGTAGAT TCTGTACATA ATGGCAGAAA ATTTACTATT 6180 

GCAGGITTCA GTTTAACTAG ACACTGCATC ACGGTACGTT GATATACCTT GTTGCAGTGT 6240 

TCTCTTTAAG CGTGCTCCCA TGCACATATG TATATAAAAT GTTACTTCTG TCTGTTCAAT 6300 

TCATCTTCAT AAATATGCTT TGCCTAGACG AGACCTAACG TGTTATTCGT TTTAAACTTA 6360 

TAACATAAAA TATAATTAAA TTTCTGCTTC ATGTCAAATT CATGAGCTTA ACCTCTATTA 6420 

AACCAATGAT TGTAAAGATT TTGTAAATGC ACCTGTACAG TTAGGCAGTA TTTCCCX3TCC 6480 

TTTTAAAATA AAAAATTCGC AGTTATGATC ATAACAATTC AAGTTAGGAA AAAAATCAAT 6540 

TACGCACAAG ATAACTATGT ACAATGAAGT TAACTCATAA GCAAAGGAGG TAATCTTAAT 6600 

GGGTATCATC GCTGGCATCA TTAAAGTTAT CAAAAGCTTA ATCGAACAAT TCACTGGTAA 6660 

ATAAGATTTC ATAACAAACA AAGGAGOTCT TTCACATGGG TATCATTGCA GGAATCATTA 6720 

AATTCATTAA AGGATTAATT GAGAAATTCA CTGGTAAGTA AGTTATAAAA ATCTCATAGA 6780 

TATGAACATC TTATTTGAAG GGGGCCATTC ACATGGAATT CGTAGCAAAA TTATTCAAAT 6840 

TCTTTAAAGA TTTACTTGGT AAATTTTTAG GTAACAACTA ATCTCAAACA TTAACGATCA 6900 

ACAACTCATC ACTATGTTAA ATCJ\ACATAC AGGAGGACAA AACGATGGCT ATTGTAGGTA 6960 

CTATCATTAA AATCATCAAA GCAATTATCG ACATTTTCGC AAAATAATTT AAGCGAATTG 7020 
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TTATTGATGT GAGGTGAGTC rrGTTAGTTT GTTGCAAATA AATGGTCTTG GTGTTTTTTG 7140 

TATAGGACGT TCTTAGTGGG ACATACGGAA TATTCGTGAT CTTTGTAGTC TGACGCGTTA 7200 

TATTTTTGTG GCGTGTTTTA TGTTTGATAC TCGAGTTCTG AGACATTCAT GATTTGGCAT 7260 

GCGAAATCTT AATGATTTTC ATGATCTAGC GCAAGATATA TTGGCCACGT GCGGAATTGC 7320 

GTTGCACGTT TAGACTGAAA CACTCGTGTG ACCGTAAGTG TTAATAGTAC ATTGATAGCT 7380 

GCATTTACTT CACTCATTTT TATGACTGTT AAACAATGAT TGTACCTTCA ATTAACAGTT 7440 

GGTACGATGG TTTTGCCATT TTTCATCAAC GTAAATATAA AAAGGACTAA GACACATACA 7500 

TGTCCTAGCC CTATGGATAA AATGCAAATT TCTGCTTTAT CAAAACTATC ACACTTTAGA 7560 

TAGATTGAAA ACAAAAAGAT CCTAAGAAGA CCTTAACTTT TTATTAATTG TCATAAATTG 7620 

CAAACAATTA AGCCACAATT CAAAAATGAT TATACTTCAT TCAACTTATC GTGCTGGTCT 7680 

AATTTGCCAT TGATATGGAT CTTCAAATTG TTGCCAATCT GCATCAATTT CTTGCGCATT 7740 

GACTAAGCAT GCGTCX3AGTT CnTTTGTTAA TTTTTCTTCA TCTAATTCTG TACCAATAAT 7800 

GACAAATTGT GTATGACGAT CGCCATATTC TGGATCCCAT TCAGCTGCGA CATCTTGACX3 7860 

TTCTGCTAAT ATTTGTGTTT GTTGCGCTTC AGACATACTA GCCACCCAAT ATGTAACTGG 7920 

ATGAATATTG CAAGATGACC CTGCTTGAGA TAATAAACAT GCTACGTGAT TGTATTGTGC 7980

TAGCCATACG ATACCTTTTG ATCGAACGAC ATTATTTGGC ATGCTTTCTA ACCAATCATT 8040 

GAACCTTTTA GCATGGAAAG GTAGACGACG TTTATATACA AACGATGATA TACCATATTC 8100 

TTCTGTTTCA GGTGTATGCG ATGCATGCCC ACCAGACTCA AGTTCTTTGA TCCATCCTGC 8160 

TGACTCGCTC GCTTTTTCAA AATCAAAACG CTGCGTATTC AAGACTTCTT TTAAATCTAC 8220 

TTCAGAATTT GTTGTCTTAA TAATTTTAGC AGTCGGTTGC AATGCGCTTA ACATTTTTTC 8280 

TAACTTCGCT AGTTCTTCTT CACTAATTAA ATCAATTTTA TTAATAATCA ATACATCACA 8340 

AAATTCAACT TGGTCAATTA ATAAATCAGC AATCX3AACGC TCATCTGTTT CGTCAACGCT 8400 

TTGATCACGA TCCATCAATA AATCTTCTGA GTTGATGTCA TGTACX^AAGC GGTTAGCATC 8460 

CACAACTGTA ACCATTGTAT CTAAACGGCA AATCGCTGTA AGATCAATGC CAAGTTCATC 8520 

ATCAATATAT GAGAAAGTTT GTGCAACAGG TACTGGCTCT GAAATCCCTG TTGACTCAAT 8580 

AACAATTTGA TCGATGCCAC CTTTTTTCAC TAAACGCTCA ACTTCTTTTA ATAAATCGTC 8640 

TCTAAGTGTA CAACAGATAC AACCATTAGA AAGTTCGACT AATTTTTCAT CTGTACGCGA 8700 

TAGTCCCCCA CCATCTGCGA CAAGATCTTT ATCGATATTT ACTTCACTCA TATCATTTAC 8760 

AATTACCGCG ATACGTCGAC CTTCTCX3ATT TTGTAAAATA TGATTTAACA AOGTTGTCTT 8820 
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ACTTCAATTT ATTTGTAAAT AGGAATAATT CTGTTTTACA TTATATAGGA GCGTTTCCTC 8940 

TTTCGCAATC TTCGATAATA AAAAAATAGT ATACTTAATT AAATTATTGA GCGCTTTACT 9000 

TTATAATGGA GACAAAGATA TATCTCACGA AAGAGAATCG AGGTGTATAA ACATGTTATT 9060

TGTCATTTTA GTTTTATATG TTACTGGTAT TGCATTTATT CTACTCAGTG TTTTTGGTTC 9120 

AAAGACTGAA GGATTATCTA CGAAACATAC TTTATATACC ATTGGCAGTG CTATTATAAC 9180 

GATTGCTATT TTCATTTCAA TTGGCTATGC CATTCAATAC TTAACTGCAG CGCTTTATGG 9240 

TTTGTAAGGT GAAGGTGATG AGTAACGGGT AGTTCX3GGAG AGGTTAACTT GCGTTGATTT 9300 

TGATAAAGTG ATCATAGCTT TTAGTACTTO AGGATTTTTA TTGTTGCTGT TACGAATGTG 9360

GTCATGTTTA ATGCGGGACA GTAATTTAAG TTGTTTTTTT ACAATTGAGA GTGTGATATT 9420 

TCGATTCGGT TCGAATTACT TTACATGGGA ATAATATAAA TTAAAAAGAA GCGGCCTAGT 9480 

GTCAGTTGTG AATATACTGA ACATTGGTCG CTTTATTTAG TAGTATGATA TGTAGTTTAG 9540

CTATTAATTT TTTTCAGGTC ATCCTTAATG CTGTCTATCT CAGACATGGC ACTTTTAACC 9600 

CAATCTCCTT GAGCTGCACC TTTAAAATTA GCTTTAAAAg ctTCGCAATG TTGCGCCATT 9660 

TGTTCAATTA ATACTTTTTC TTCACCTTTT AATCCGTTTT CAATATCTTT GTATTTATGC 9720 

TTATGTTCAG GTGCAATAAC TGTGCGAATA TTTTCTTTTT GCGCTTCCAT TTTAGATATG 9780 

AGATTAAGTG TTTCTACTGT AGTACTTATA TCTGGCATTC TTAAGGTCAT ATCTGGTTCT 9840

ATTAGAGTCA TTTAATCTCC TCCAAATTAT CAGTCACTTA GCTTATCTAA CTGCTTTTCA 9900 

TAAGACTTTT TTAAGTCTTC TTTATATTCT TCTAATTTCC CATTCTTGCT TTCTGA 9956 
(2) INFORMATION FOR SEQ ID NO: 353:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2411 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 353: 

TTTCTTTTAA CAGaTTTATC cCATTAATTG TTCTACAAGC CCACGaTGAG CAATATCATT 60 

TTTAGCAACC ATTAATAAAC CAGAAGTATC CATATCTATA CGGTGAACAA TACCTGGACG 120 

AATTTCTCCA TTAATACCTG ACAAATTTTT AATTTGATAC ATTAAACCAT TAACTAATGT 180 

ATTGGTATAA TGCCCTGGTG ATGGATGAAC TACCATGCCT TTCGGTTTAT ATACAACTGC 240 

AACATCGTCA TCTTCATAAT AAATATCTAA ATTTAAATTT TCAGGTAGAA TATCAGCTTC 300

AACAACTTTA TCGTTTGCAA CGACTAAACC TGCTTTAATC CAATCTTGTA TCTGGTTACG 420

AGACCAATCA TTATTTAATT CAGGCAGCAA CTTATCTACA CGCATACCTG TTTGTTCTTT 480

ATCTGTAATG TTAAATTCAT AAGTCTCCAT TACTTAACCT CCTTCTCCTT TTTATTGGAA 540

GTATCCTTTA ATAAGGCAAT AATAATTAAT ATTACACCAA TTGTTAAACT TGAATCTGCG 600

ATATTAAATA TTGGAAAATC ATAACCAAAA ATATTTGTAT CAATAAAGTC AACAACTTCT 660

CCTGTTAAAA TTCTATCAAT AAAGTTTCCA AGTGCACCTG CAAAAAGTAA ACTAATAGCA 720

ACTTGCATAA ACAAATTATA TTGAGCATCT TTAATAAAGA AATATACTAA GGCTATTAAT 780

ATAATAATGG TAATAATAAA GAAAAAT6TC ATTTTTCCAC TCAATATTCC CCATGcAGCA 840

CCATZKVTTC GATGTGaTGT TATGTTTAAA AAGTGCGGTA TCACTTCAAA TGAATCTCCA 900

ATTTTCATTG TAGTAGCTAT AATATATTTA GTAACTTGGT CAAATATAAC GACAAATACT 960

GCTATTAAAA TGGAAGTGCC AATAAAATAT TTTTTGTGCA TTTTCGTTCC TCCAATCAAT 1020

CGTCCATGAG ACAACTCTTT ATATTATAGC TTACACCTGC TAATAAAAAA AGTAAGCATA 1080

TTACATTAAA TCTAATGTTA CTAACTCAAT ACTTGATAAA CTACTATGTT TTGACATTAA 1140

ATATGAACTT AATTATTCAT TTATCATATT TAAGATGACA TTAAAAATTA GGAAAGCAGG 1200

CTGGAACATA AATCCCTAAA AAGACAGTAG TAAGATATTT TCTAATTAAA AATTATCTTA 1260

CTGCTGTTCT CTATTTATAC AATACTTCGT ATTGAATGGC TTCGCTATGC CCATCTGGCA 1320

CATTACTGTA AAATTCTATA AATAGAATTT TTGATGATGG GTCCCTTCCT AGGGTGCCGT 1380

CTCAGCCTCG GtCTTCGACT GGCACTGCTC CCTCAGGAGT CTCGCCATTA ATACTACGTA 1440

TTAACATGTA ATTTTACTTT TAAATACTTT AAAAAAATAA GACATGAATC GTCTACACTT 1500

AATTGGACAA ATTCTATGAG AATAGATATT GTTAATTTAA GAAAGTAGGC TATTTTGAGT 1560

TtCACTCGAA TGTCAGTTCG AGGAATAAAT AAAGTTAAAC GAGAGCTAGG TTTTGTATTA 1620

ATGGCAATTA ATATAAGGAA AATAGCAGCT CAACGAGCTG TACATTATAA AATACATATC 1680

AAAAAAGCTG ATTTCTATCA AATAATTAAT AGAAATCAGC TTTTTTACAT TGCCTAAGAA 1740

CTTAATGTCC CAAGCCCTAA AACTTGTTGT TATTTATTTG ATTTAGCAGC GATACGTTTA 1800

TATCTTAAGT ACATAAATGC TAAAAGTATA AACCAAATCG GAATAAAATA AATTGCACGT 1860

CTTGTATCAA CATTAATAAA TAATAACCCG AACACAAAAA TGAAGAATAC AAATATTAAG 1920

TAGCCCATAT ATTTGCCACC TAATAGTTTG TACGTAGCAT TTTTATGTAG ATCTGGGTTT 1980

TTACGACTAT AATTGATATA TGCAATGATA ATCAGACCCC ATACAACTAA AAATAACACT 2040

GTAGAGATGG TAGTCACATA CGTAAATACT TTTGTCGCAT CTGGGAAAAT ATAGTTTAGT 2100
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TTATTCGTCT TAGAAAAGTT CGGAGGTGCT TGTyGTTGAC TTGATAAACC GaAAAGCATA 2220 

CGGCTATTTG AGAATATACC ACTGTTACAT GATGAAGCAG CAGCX3GTTAA TACTACAAAA 2280 

TTAATCAAGC CCGCAGCAAA CGGAATTCCG ATCAATGCGA ATrATTTTnC GAATGGACTG 2340 

TTATCAGGAT CAACTTGCTG CCAAGGGGTA ATAGACATGA TAACCGCTAA CGCCCCAACG 2400 

TTnnATATTA A 2411 
(2) INFORMATION FOR SEQ ID NO: 354: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 605 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 354:

GnGAATTATT TTTAATAATG AAAGGATTAC TTnCATGGGT TTTTACTAGG AnTACCCAGA 60 

AGGTCAAAAT ATTTTTGATG CGCTAAGTCA ATATGAAGTT AAGCGAOGCG GCGATATGGA 120

AGAGGATCCA TCATATAAAC AACTCATTTC TTATTGTTTA CTTGAAAATG AGCATGGCGA 180 

GATATTAGTG TATGAACGAT TATCTGGCGG TGGAGAAGCT CGATTGCATG GACAATCTTC 240 

AATAGGTGTA GGCGGTCATA TGAATGATGT TCCAGGAGCA GAATCTATTA ACGAAGTATT 300 

GAGAGTTAAT GCACAGAGAG AATTAGAAGA AGAAGTAGGT TTAAGTGAGC AAGATTCACA 360 

AAATATGGAA TATATCGGTT TTATTAATGA CGATAATAAT GAAGTGGGCA AGGTACATAT 420 

TGGTGTTGTA TTTAAAATCA CTGTAAGTAC GAATGATGTA GAAGCTAAAG AAACAGATAC 480 

TTTACGAATA AAATGGGTTG AAAAAGGCAA CATAGAGTCA TATGATGATT TCGAAACGTG 540 

GAGTGCATTA ATCCTTCAAG ATTTATAATC AAACGAGGTG ACATATATGT CAGATATTAT 600 

TCCAG 605
(2) INFORMATION FOR SEQ ID NO: 355: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 668 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 355: 
TTTATTAGCA CATCCAAACT ATTCATATGT TGGACAATTT TTAAACGAAC TAGGATTTAA 60 
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ACAATTAGAC ACTGAACATT TAQCTGATTT AAATCCAGAG CGTATGATCA TTATGACAGA 180 

TCATGCTAAA AAAGATTCTO CTGAATTCAA GAAGTTACAA GAAGATGCAA CATGGAAAAA 240 

GTTGAATGCA GTTAAAAATA ATCGCGTGGA TATTGTTGAC CGTGATGTTT GGGCAAGATC 300 

TCGTGGCTTA ATTTCTTCTG AAGAAATGGC TAAAGAACTT GTTGAATTAT CAAAAAAAGA 360 

ACAAAAGTAA GGTGGAAGTA AATGGCTATA AAAGAAATAA GTAGCCAATC TGCCATAGAT 420 

CATAAAAGAA AAAGACGCAC AACACTCACG TATATAGTGA GTTTGTGCTT TCTTTTTATT 480 

TGTATATATT TAAATATGGC GATTGGTTCT TCXSAAAATTA ATTTTAGCGA TATCATTCAC 540 

TATGTTACTG GTCATACAGA TACGAAAGCA ACGTTTTTAT TGCATAATGT ACGTATGCCA 600 

AGGATGATTG CAGGGTTATT TATTGGCGGT GCATTAGCGG TATCTGGTTT GTTAATGCAA 660 
GCAATGAC 668

(2) INFORMATION FOR SEQ ID NO: 356:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 787 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 356:

ATACAAAAAA ACATATCGAA AATAAAGCTA AAAGAAACTA TCAAGTTCCA TATTCAATTA 60 

ATTTAAATGG TACATCTACA AACATTTTAT CGAATCTTTC ATTTTCAAAT AAACCTTGGA 120 

CAAATTACAA AAATTTAACT AGTCAAATAA AATCAGTACT GAAGCATGAT AGAGGTATTA 180 

GTGAACAAGA TTTAAAATAT GCTAAGAAAG CTTATTATAC TGTTTATTTT AAAAATGGTG 240 

GTAAAAGAAT CTTACAGTTG AATTCAAAAA ATTACACAGC AAACTTAGTT CATGCGAAAG 300 

ATGTTAAGAG AATTGAAATT ACTGTTAAAA CAGGAACTAA AGCGAAAGCA GACAGATATG 360

TACCATACAC AATTGCAGTA AATGGCACAT CAACACCAAT TTTATCAAAA CTTAAAATTT 420 

CGAATAAACA ATTAATTAGT TACAAATATT TAAATGACAA AGTGAAATCT GTATTAAAAA 480 

GTGAAAGAGG CATCAGTGAT CTTGACTTAA AATTTGCGAA ACAAGCAAAA TATACAGTAT 540 

ATTTCAAAAA TGGAAAGAAA CAAGTAGTGA ATTTAAAATC AGACATCTTT ACACCTAATT 600 

TATTTAGTGC CAAAGATATT AAAAAGATTG ATATTGATGT AAAACAATAC ACTAAATCAA 660 

AAAAAAATAA ATAAATCTAA TAATGTGAAA TTCCCAGTAA CAATAAATAA ATTTGAAAAC 720 

ATAGTTTCAA ATGAATTTGT GTTCTATAAT GCAAGCAAAA TTACmATTAA TGaTTTAAGT 780 
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(2) INFORMATION FOR SEQ ID NO: 357: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 534 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 357: 
AAAGTAAAAA TAAATCTCCC TTTTTAACTT TCGTTTCTGC CATAGCCATT GCTTCTTCTG 60 
TGATAGTTGC TACAATATCT TTTCTTTCAC GGTTAAAATG TTCAACTTGT TCTGCTAAAA 120 

ATGCAGCTTC TTCTTCGACG TCAGTCATCA ACAATTCGCa AGCTAATGAT GCGTCATCTA 180 
AACGACCTAC AGCATTAAGT CTAGGTCCAA TAATAAAACC AATTGTTTCT TCATCAATAT 240 
TGTCATTGTA TCCCGCTTCT TTTAGCAATG CTTTAACAGA GGTCGGACAT TGATCATTTA 300

AGACTTTTAA TCCTTGTtTC ACTAATGATC GATTTTCATC AGTTAAGGAT ACTAAATCCG 360 
CAATGGTACC TATCGCAACT AATGCTTTAA AATAATCAGG TACATTTtCA ATCAATGCTT 420 
GTGCTAATTT GTATGCAACA CCTGCACCAC ACAATTGTTG GAACGGATAA TTAAACGATG 480 
GATGCATTGG ATGTACGATT GCATATGCTT CTGGTAATGT ACTACCAATT TCAT 534 
(2) INFORMATION FOR SEQ ID NO: 358:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3621 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 358: 



GGTGAGTCAA ATTAAATGAA TCTAATAAGT CATAACTATC TATTTGTAAT GTGCAACGCT 60

TAACGCATAT ACAAAATGAA TGTGCTGATA ATGATTTACT CAAATTAAAA GGTGATTTTT 120

ATTCAATGAT GAATGAAAGT TGCCTTTTTA TTTTTGGTAA AAGTTAATGC GTCAGTGAAT 180

TGTGTAAGTT TTTCAAAAAG TAAAAAGAAA TAATAAAGGT GAATTATTAG AATTCCArAA 240

ATAATTCATT ACATTCATAA AGCATTTTAC AAATGGTAAG AAAATGAGTG TTACAAATCT 300

AAATATTGCA AAAGAAGCTG ATTTAGTCAC AAAAAATGTC CTATGTAATA ATTCGAGAAA 360

GATGCACTAT ATACGGTCTT CTTACTATTC AAATGTAAAA GTTCCTTATT TGCGTGGCTT 420

TTTGTTTTAT AAAAGTATAA AATTTTACTA TAATATATCT TGTAGAGAAC AATGAAATGA 480
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ATAGCAAACT GTATTACTTT GATACAAAAA TGGTTGTAAT AAATATTTAT CGATATGACG 600 

ACTTGAATAT GATAAAGTGA CATATTTATG TATATGACTA TTTCGCAAAA TGTAATCGAG 660 

GTAGAATTTC TTGACAATTC TGTCAGTTTA TAAGATGTTA TAAATATGTA GTGTATAAGG 720 

AGGCAAACAA GATGACTGAA GAATTCAATG AATCAATGAT TAACGATATT AAAGAAGGTG 780 

ACAAAGTCAC TGGCGAGGTA CAACAAGTTG AAGACAAGCA AGTTGTTGTT CATATCAACG 840 

GTGGTAAATT TAATGGGATT ATTCCTATTA GTCAACTATC TACGCATCAT ATTGATAGCC 900 

CAAGTGAAGT TGTAAAAGAG GGCGACGAAG TTGAAGCATA TGTCACTAAA GTTGAGTTTG 960 

ATGAA6AAAA T6AAACTGGA GCTTACATCT TATCTAGAAG ACAACTTGAA ACTGAGAAGT 1020

CTTATAGTTA TTTACAAGAA AAATTAGATA ATAATGAAAT CATCGAAGCG AAAGTAACAG 1080 

AAGTAGTTAA AGGTGGTTTG GTTGTTGATG TAGGACAAAG AGGTTTTGTT CCGGCTTCAC 1140 

TAATTTCAAC AGACTTCATT GAGGATTTCT CTGTGTTTGA TGGACAAACA ATTCGTATTA 1200

AAGTTGAAGA ATTGGATCCT GAAAATAATA GAGTCATTTT AAGCCGTAAA GCAGTTGAAC 1260 

AAGAAGAAAA CGATGCTAAA AAAGATCAAT TATTACAATC TTTAAATGAA GGCGATGTTA 1320 

TTGATGGTAA AGTAGCGCGT TTAACTCAAT TTGGTGCATT TATAGACATT GGCX3GTGTTG 1380 

ATGGTTTAGT GCATGTATCT GAACTTTCTC ACGAACATGT TCAAACACCA GAAGAAGTAG 1440 

TTTCAATTGG TCAAGATGTT AAAGTTAAAA TTAAATCTAT TGATAGAGAT ACAGAACGTA 1500 

TTTCATTATC AATCAAAGAT ACGTTACCAA CACCTTTCGA AAATATTAAA GGTCAATTCC 1560 

ACGAAAATGA TGTCATTGAA GGTGTCGTAG TAAGATTGGC AAACTTTGGT GCATTTGTTG 1620 

AAATTGCACC AGGTGTACAA GGACTTGTAC ATATTTCTGA AATTGCACAC AAACACATTG 1680 

GTACGCCAGG TGAAGTGTTA GAACCTGGTC AACAAGTAAA TGTTAAAATA TTAGGTATTG 1740 

ATGAAGAGAA TGAAAGAGTA TCACTATCTA TTAAAGCAAC ATTACCAAAC GAAGATGTTG 1800

TTGAAAGTGA TCCTTCTACG ACTAAGGCGT ACTTAGAAAA GGAAGAAGAA GATAATCCAA 1860

CAATTGGCGA TATGATTGGT GATAAACTTA AAAATCTTAA ACTATAATTT AATATTTAAT 1920 

AGTCAACTCC ACATGTTTAT GATTGcATGT GGAGTATTTT TATGTAACAA AATATACTCG 1980 

QAATGATAAC GTGGgACAAA TTTAACTAAG TGTTTAAAAA GATArAGTTT TAAGTGCtGa 2040 

tTTTTATCAT TACAGTAATA AACTCATTTT GAATACACAG TCTCATGTGA TATTATTAAA 2100 

AAGATATaAG AAAGAGAGGA AGTTAGCTTA TGACTAAACC TATAGTAGCT ATTGTAGGTA 2160 

GGCCTAATGT AGGTAAATCT ACAATTTTTA ATAGAATAGT TGGAGAACGT GTTTCGATTG 2220 

TGGAAGACAC GCCAGGTGTA ACACGAGATC GTATTTATTC TTCAGGTGAA TGGTTAACAC 2280

AAATTAGAGC GCAGGCAGAA ATCGCCATAG ATGAAGCGGA TGTTATTATT TTTATGGTTA 2400

ACGTGCGTGA AGGATTGACA CAAAGCGATG AAATGGTCGC TCAAATTTTA TACAAATCTA 2460

AAAAACCGGT CGTATTAGCG GTTAACAAAG TAGATAATAT GGAAATGCGT ACAGACGTGT 2520

ATGATTTCTA TTCATTAGGA TTTGGTGAAC CGTATCCGAT ATCAGGGTCA CATGGTTTAG 2580

GTCTTGGTGA CTTGTTAGAT GCAGTTGTTT CTCATTTTGG TGAAGAGGAA GAAGATCCTT 2640

ATGATGAAGA TACAATTCGA CTATCCATTA TTGGACGACC AAACGTAGGT AAATCAAGTT 2700

TAGTAAATGC TATTTTAGGT GAAGATCGCG TTATCGTTTC TAATGTTGCA GGGACAACGA 2760

QAGACGCTAT TGATACAGAG TATAGTTATG ATGGACAAGA TTATGTTTTA ATCGATACTG 2820

CTGGTATGOG TAAAAAAGGA AAAGTATATG AATCAACTGA GAAATATTCA GTATTAAGAG 2880

CTTTAAAAOC GATTGAACGT TCAAATGTTG TTTTAGTGGT CATAGATGCA GAACAAGGCA 2940

TCATTGAACA AGATAAACGT GTTGCAGGAT ATGCACATGA ACAAGGTAAA GCAGTCX3TGA 3000

TTGTCX3TAAA TAAATGGGAT ACTGTGGAAA AAGATAGTAA AACGATGAAG AAATTTGAAG 3060

ATGAAGTACG TAAAGAATTC CaATTTTTAG ATTATGCACA AATTGCTTTT GTGTCTGCTA 3120

AAGAACGCAC AAGATTACX3T ACATTATTCC CTTACATCAA TGAAGCAAGT GAAAACCATA 3180

AAAAACGTGT TCAAAGTTCA ACTTTAAATG AAGTTGTTAC TGATGCAATT TCCATGAACC 3240

CTACACCAAC AGACAAAGGT AGACGTTTGA ATGTCTTTTA TGCAACACAA GTTGCTATAG 3300

AACCACCGAC ATTTGTTGTA TTTGTTAATG ATGTAGAATT AATGcATTTT TCTTATAAAC 3360

GCTATTTAGA GAATCAAATC CGTGCCGCTT TTGGTTTTGA AGkTACACCA ATTCATATTA 3420

TAGCTCGAAA GAGAAATTAA CGATTGGGGG ATAACAATGA CTAAAATTAC CGTTTTTGGT 3480

ATGGGAAGTT TTGGGACAGC CCTTGCCAAT GTTCTTGCAG AAAATGGACA TGATGTTTTG 3540

ATGTGGGGTA AAAATCAAGA TGCTGTTGAT GAATTAAATA CATGTCATAC AAATAAAAAG 3600

TATTTAAAAT ACGCGAAATT A 3621
(2) INFORMATION FOR SEQ ID NO: 359:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 643 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 359:

CTTTCGGAAA TTAGGATCnG nGCTATCTTG GCCCCAATTA CCAAGGGAAC TAnTGGCACC 60
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10 



AATGCtTCTT TCATCTCCAT GCCCTGTTGC TCATTATTAA TAACACGGTC TATTAACACA 180 

ATGGCATTTG TTACTACGAT TCCAATTAAC ATTAGCATAC CAATTAAACT TGGTACTGAT 240 

ATTGTTTCTC CTGTGATTAA TAGTGCAATA ATTACACCGA TAACTGTAAA TGGTAAAGAG 300 

AATAAAATTG TAAATGGTGC TAGGCCACCT TTAAATGTAA TAACTAGGAT TAAATATAC6 360 

ATAATGATTG CAGCTAACAT TGCAAAGGCT AATTGTGTCA TTGCATTGTT AATATCATCT 420 

GATGCACCAC CGATATTAAC CTTTACATTA TTCGGTTTAT CCAAATTATT TATTTTAGAC 480 

ATCACTTGTC GTGTTGTGCC ACCCACATCT TTATTTGTTA CTTTAGCAGA TACCGTCGTT 540 

GCATAATCTC CTTGTTCTTG CGTCAATTTA CTTGGTGTCG TTGTTTTAAC TAACGTAGCG 600 

ATATCTCCCA ATTTAATCGT ACCACCAGTC GOCTTTTTCA AAG 643 
(2) INFORMATION FOR SEQ ID NO: 360: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2524 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 360: 

TTTCAGGCyG TGCAAGGGCT TTTTCTTTTG CTTTTAAGAT TATGATTTAT CGTGCAAAGT 60 

TAAGTGGTCG TATATAGTTT TAGTTTTAAA AAGGTAATTA AATAAAATAG TTTGCCGAGG 120 

GAGATGTCAA AATGATTAAA ATACCTAGAG GGACGCAGGA TATTTTACCT GAAGATTCAA 180 

AGAAATGGCG TTACATTGAA AATCAATTAG ATGAATTAAT GACATTTTAT AATTATAAAG 240

AAATAAGAAC ACCAATTTTT GAAAGTACAG ATCTTTTTGC AAGAGGTGTT GGTGATTCAA 300 

CCGATGTCGT ACAAAAAGAA ATGTATACAT TTAAAGATAA AGGCGATAGA AGTATTACAT 360 

TAAGACCTGA GGGAACAGCT GCAGTTGTGC GTTCATATAT TGAACATAAA ATGCAAGGTA 420

ATCCAAACCA ACCAATTAAA CTTTATTACA ATGGACCGAT GTTTAGATAT GAACGTAAGC 480 

AAAAAGGACG CTATCGTCAA TTTAATCAAT TTGGTGTAGA AGCTATTGGT GCTGAAAATC 540 

CTAGCGTAGA TGCAGAAGTA TTAGCTATGG TTATGCATAT TTATCAATCA TTTGGATTAA 600 

AACATTTAAA GCTTGTTATT AATAGTGTAG GGGATATGGC GTCTCGAAAA GAATATAACG 660 

AAGCGTTAGT GAaACACTTT GAACCAGTAA TTCATGAATT TTGTTCAGAT TGTCAATCAC 720 

GTTTGCATAC AAATCCGATG CGAATTTTGG ATTGTAAAGT AGACOGTGAT AAAGAAGCGA 780 

TTAAGACTGC ACCTAGAATC ACTGATTTCT TAAATGAGGA ATCTAAGGCA TATTAT6AAC 840 

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IS 



(A) LENGTH: 1507 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 361:

TCGTTGAGTA AAAGTCCAGA AAATTGGATG AGTAAACTTG ATGATGGAAA ACATTTAACT 60

GAGATTAATA TACCGGGTTC ACATGATAGT GGCTCATTCA CTTTAAAGGA TCCAGTAAAA 120 

TCAGTTTGGG CAAAGACTCA AGATAAAGAT TACCTTACCC AAATGAAGTC GGGAGTCAGG 180 

TTTTTTGATA TTAGAGGTAG AGCAAGTGCT GATAATATGA TTTCAGTTCA TCACGGCATG 240 

GTTTATTTGC ATCATGAATT AGGAAAATTT CTCGATGATG CTAAATATTA CTTGAGTGCT 300 

TATCCAAACG AAACAATTGT GATGTCTATG AAAAAGGACT ACGATAGCGA TTCTAAAGTT 360 

ACGAAGACAT TTGAAGAAaT TTTTAGAGAA TATTATTATA ATAACCCGCA ATATCAGAAT 420 

CTTTTTtACA CAGGAAGTAA TGCGAATCCT ACTTTAAAAG AAACGAAAGG TAAAATTGTC 480 

CTATTCAATA GAATGGGGGG TACGTACATA AAAAGTGGTT ATGGTGCTGA CACGTCAGGT 540 

ATTCAATGGG CAGACAATGC GACATTTGAA ACGAAAATTA ATAATGGTAG CTTAAATTTA 600 

AAAGTACAAG ATGAGTATAA AGATTACTAT GATAAAAAAG TTGAAGCTGT TAAAAATTTA 660 

TTGGCTAAAG CTAAAACGGA TAGTAACAAA GACAATGTAT ATGTGAATTT CTTGAGTGTA 720

GCGTCTGGAG GCAGCGCATT TAATAGTACT TATAACTATG CATCACATAT AAATCCTGAA 780 

ATTGCAAAAA CGATTAAAGC AAATGGOAAA GCTAGAACGG GTTGGCTGAT TGTTGACTAT 840 

GCAGGATATA CX3TGGCCTGG ATATGATGaT ATCGTAAGTG AAATTATAGA TAGTAATAAA 900 

TAAGGATTCA ATAATGATAT TAAGACGAGT ATGAAAATAG TTAGATTCTA ATTATTTTCA 960 

CTACTCGTTT TTATTTTGAA AATAAGTAAT AATTCAACAA TATTATAAAT TGAACAGATT 1020 

GTTTGTGAAA TTTTTGATAA TATTAAAGTG AAAAAGTGTT ATAAATTGAT AAATATATGT 1080 

AATTAACAAA AACAAATCAT TTTAAAAAGA AGAGAGTTGT AAGATGATGa AACGATTAAA 1140 

CAAATTMTG TTAGGCATTA TTTTTCTGTT TTTAGTCATT AGTATCACTG CTGGTTGTGG 1200 

CATAGGTAAA GAAGCGGAAG TTAAGAAAAG CTTTGAAAAA ACATTGAGTA TGTACCCTAT 1260 

TAAAAATCTA GAGGATTTAT ACGATAAGGA AGGCTATCGT GATGATCAGT TTGATAAAAA 1320 

TGATAAAGGT ACATGGATTA TAAATTCTGA AATGGTTATT CAACCTAATA ATGAAGATAT 1380

GGTAGCTAAA GGCATGGTTC TATATATGAA TAGAAATACC AAAACAACAA ATGGTTACTA 1440 

mTATGTCGAT GTGACTAAGG ACGAGGATGA AGGAAAACCG CACGACAATG AAAAAAGATA 1500 

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(2) INFORMATION FOR SEQ ID NO: 362: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1216 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 362: 
TACATGTTTC GGATGCTACT TTATTTAGTT TGAAGGGTGC ATTATGGACG TTAGCGCAAG 60
AAGTTTATCA AGAATGGTAT TTAGGATCGA AGTTGTATGA AGATGTTGAA AAGAAAATAG 120
CACGAACTAC TTTTAAGACA GGTTATATTT ATCAAGAAAT TATTTTGAGA CCAGTAGATG 180
AAGTTAAGGT ACTTCTGAAT GATTTAAAAG GTGCTGGTTT CGAATTAGGT ATTGCAACAG 240
GTCGTCCTTA TACTGAGACT GTTGTGCCAT TTGAAAATTT AGGATTGTTA CCATATTTTG 300
AAGCK3ATTT TATTGCAACA GCAAGTGATG TTTTAGAAGC AGAGAATATG TATCCGCAAG 360
CACGACCATT AGGAAAGCCG AATCCTTTTA GTTATATCGC AGCTTTATAT GGTAATAATC 420
GCGATAAATA TGAATCTTAT ATCAATAAGC AAGATAACAT TGTAAATAAA GATGACGTAT 480
TTATAGTAGG CGATTOGTTA GCTGACTTAT TAAGTGCTCA AAAAATAGGT GCAACGTTTA 540
TTGGAACATT AACAGGTTTA AAAGGTAAGG ATGCTGCAGG TGAGTTAGAA GCGCATCATG 600
CCGACTATGT TATTAATCAT TTAGGTGAAC TTAGAGGTGT ACTAGATAAT TTGTAATTTG 660
ATTGTTGTTT GACAGCATAA CTTGTAGTGA ATGATTGAAC CAAAGGTTTC ATATTGAGTT 720
ACAATGAAAT TAATAATGAA AAAATGCCAA GAAGCAATGG AAGTAATCCA ATGTCTTCTT 780
GGCATTTTGA ATTTACATAA ATTGTTTATG ACTGTACCGT CAATTCAGTT GTGAAAATTT 840
GAriGTATTC ACCAACTTGT TTAAGTTCAT CAATTATATT GTTTGAAACA GGTTGATCAA 900
CGGATAAAAT CATTAGCGCA TCTCCGCCCG CTTCAGTTCT ACCTAAAGTC ATAGATGCAA 960
TGTTGATAIT GTATTTACCT AACAATGCGC CAGTTTTTCC TACCATACCT GGAGTATCAT 1020
TATGATATGA CACAATTTGA TATTGATTTG GCTTTAAGTC TACAGAAAAA TTATTAATTC 1080
TAACAATTCT TGGACCGAAA CCTGTAAAGA CAGAAGCGCC AACTTTAACG GAATCGCnAT 1140
CGnTTGATAG TTCTACCTCT AAGTAGTTAC TAAAACCTGT CTCTGCTTTA TTATTTTCAA 1200
TATTTAATGT CACTTG 1216

(2) INFORMATION FOR SEQ ID NO: 363: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 867 base pairs 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 363: 

ATCAAAATAT TTTAAATAAT ATTGATGGTC ACATTGTAAA TrmAATAGAA AATAAATTTG 60 

ATCAAATATT ACAAGAACCA TTAAATCCAT TAAATTATGA TACTGTCAGT GGATTAGCTG 120 

GGATAGGGAG ATATTTGCTA AATAGAGTAG ATGAGAATGA ATTTAATGTT AAAGCATTAA 180 

AAAGCATATT AGTATACTTT AAAGATATTC AATATTCTAA AAATAGCTGG GTAGTCCCAC 240 

AAGArAGTCA ATTTTTAGAG TCTGATAAAA ATTATITTAC TGAAGGTAAT ATCAATCTTG 300 

GCCTTGCACA TGGaGTGCTA GGACCGATGT CTTTATTTGC ATTATGCGTG ATTAAAGGAA 360 

TTACGATTGA AAATCATCAG CACATATTAA AAGACATGTA CAAATTTATC ATGGACGAAA 420 

AATTTTCTAA CCACGAAAGA TGGTTGCAGC GTTACGATTT AATTTCTGAA CGTAATCATT 480 

TCAATTTTAT TCGGAATGGT TGGTGTTATG GCAATACGGG TGTAATGACG ACGTTGTTTT 540 

TAATCGGCCA AGCATTACAA GATGATGAAA TAATTAAAAT GTCTAAAAAA GTGATGCTAC 600 

AAGTAGTAAA TGATAAAGAT GAAAATTTAA TAAGTCCAAC TATTTGTCAT GGATTGTCAT 660 

CACAAATATT AATGTTAACA ATTATGAATT TGAATTTTGA ATTAAATGAA GTGTCTGATT 720 

ATATCACTGT ATTAATAAAT AAACTGATTT CTCATTATAA GGAAGATTAT CTGGTGAATT 780

TTATAGACAT TAATGAAAAT AAGCAAGATG TATTTAAAAG TAGGAAAGTT GGCCTTTTA6 840 

AAGGTGAATT AGGGGTCATC TAACATT 867 
(2) INFORMATION FOR SEQ ID NO: 364: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10813 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 364:

TACCTTTTCT TtTAAATCAT TTTATATTTT CCCaCTAATA TCCGcTGtTA ATCaATCCtG 60 

ACATCCtTGT ATCaCTATGA CAATTAATTG TTAAATACAT GAATTTCTAC ATTTTATGAA 120 

AAAATCCATT TTTATTACAA TTCAACACTT TATATGACAA CTTCATTACA GTTACnTTA 180

TTGTTGATTG CTTACATTGT TTTCTAAAAA AAATTTGTTA TCATAATTAA CGTTGAATAA 240

AGAAAAAAAT TAACTTGGGA GATAAAAATG GAATATAAAA AGATACTAAT TCG1TTATTA 300 

CAATCGCATG CCGCAGTTAA TTATTATAGT AAAAACCAAT GTACATGGTG GGCATTTAAA 420

CGTCGCGCAC AAGTCGGTAA ACCTGTTTCT AATAGATGGG GCAATGCTAA AAATTGGTAT 480

TACAATGCAC GTAAATCAAA ATATGCGACT GGTCGTACAC CAAGAAAATT TGCTGTCATG 540

CAATCAACTG CAGGATATTA TGGACATGTC GCAGTTGTTG AACAAGTATA TAAAAACGGT 600

AGTATTAAAG TTTCAGAATA CAACTTTTAT CGCCCATTAA AATACAATAC ACGTGTACTA 660

AGCAAAAAGG CAGCACGTAA CTTTAACTAT ATTTACTAAT CAAAAAACTT CTATCACGAA 720

CGCTTCAATT TCCTGTATQC GTGTGATAGA AGTTTTTATT TTATGAAATT ATATTATTAC 780

TTCTACAAAT TTCAAATTGC CGTAATTGAA CGTATATTTC TTCTTCAACT ATTATTTCAT 840

CTTTAGCATA ATCTATATAT AAAATTTTAT GCTATTATTT AAATAATTCX5 CTATAACTTA 900

ACATACGTTT TCGATATAAA CCTTGTTCTA AATCTCAATA ATTTTTTGCT GTTTTCATCX5 960

TCATTAGTTA AAAAAATAAT TTAACTGAGT TTTTQAGACC T6TTTAAATT GATGACTCAA 1020

AACCTTTATC CCCTTTTCAC TCGGTTTAAT TGCTTTAATA TTTAACACAG TCTCATTGTA 1080

ATTTTGCTAC TAAGTTTGAA ATATTTCGAT TCXSAATGTGA AcATCATTTA TTATTACTTT 1140

CGACAGCTAG AAAAATATT6 TTAAACCAAA ACTATAATTA CCACTTTTAT ATAGAACTAT 1200

ATATAAATTT TAACTTTGAG ACAGAACTGC TAGTCAGGTT TATGAATATA TTTCTTTAGT 1260

TTACTTGaTA TACTTATTGG TAAATCATTA TTTATTTGAA GAGCATTTAT AACAAAAAGT 1320

ATAATCCCAG TTATAGCGAT TCCATGACCA TTTTTATCAG TCAAAGATTC ATCAACAAAA 1380

ACTTTTTGAT AAATTTATAT ATTTGTATAA TTTATTATGG TAGATATCTA CACCCGTATC 1440

TATAACAGCT ACAGTTATCA TTGCGTCTTA TCCAAATAGT TTTTAAGAAA TAAATATAAT 1500

TCATTAATAT CATGGCTATT TGTAACGTCA GtGCTTaAGA CCTTGTCAAT TATTGTTTTA 1560

TCTGTTTCTA AGAGCCCCTG ATTTTCTGCA TACATATTTT GTATATCGCT GTTAATCTCA 1620

TTTAATTTTT TAAATTTTTT TCTAACCTTT TTCAAAATAA AATGCTCTTT AATTCCATCT 1680

CTTAAATACA TATAAATCTC TATTATAATC TCCATAAATG TCATATCTCC TCTCTCAAAA 1740

AACATTATAC TCAATATATT AAAAATAAGA AAGGTTTTTT CAGAATTTTT AGTATTTTCA 1800

GATATTTATT CTTAAGTATC TTATTATTAC TTCTTTTTAA CGCTAGTGGa ATAATTAATA 1860

ATAATTGTAT TTAAATGCAA TGCTGTTGTT CTTTCATACT TACAAGCAAG TCATAAGAAA 1920

TGAGAAATTA AATTCTTATA TGATGAATAA ATGATGACTG CAGAAGAAAT TGCTAAAAAA 1980

GGAGTGGAGT GAGGCATTCT ACTGrrCATA TTGTTAACAA ATATCAAAAT GATAGGAACT 2040

TGAAGAATCT AAGACTGTTA AAATCGAGAC AAGAAGAAAC TCGTTCCAAA TTTGAATCCT 2100
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AAAACTTCTA TCACGAACGC TTCAATTTCC TGTATGCGTG TGATAGAAGT TTTTATTTTT 2220 

GTTCATATTA ATTTATCTAA GCGCTACGAT GGaCTGACTT AAACTTTTCT TTAAAATCGC 2280 

TATTCGCCAT TTACTATTGT TGTCTAATTT CTTGTAAAAT ATGTTCCXSCT GCTTGTGTAT 2340 

TTGCACGGGG TTCTTTTTTC AAAGCTTCAG CTACTTTAGC AATTTCATCA CCTTTTGCCC 2400 

CTACAACGAT AGCTAATGAT TTATATTGTA AACTCATATG ACCTTGTTGA ATACCTTCTG 2460 

ACACAAGCGC GCQACATGCT GCAAAGTTTT GCX5CTAAACC AACGGCAGCA ACTACATGAC 2520 

CTAATTCTTG TGCTGACTCT ACATTTAGTA GCTCTAATGA AGCTTTAGCA ATTGGTAATA 2580 

CTTTTGTACC ACCGCCAACX3 ATTGCCAATG TCATAGGCAC TTCAATTOTA CCAATCAATC 2640

GTTGACGATC TTGATCGTAA CGCCATGTAG CAATACCACG ATACTGTCCG TCACGACTCG 2700 

CX3TATGCATG CGCACTTGCT TCTGCACCAC GCGTATCATT TCCTGTTGCT AAAACAACAG 2760 

CATGTATGCC ATTCATAACA CCTTTATTAT GTGTTGCTGC ACGATGAATA TCTACTTGGG 2820

CCAATACAGA AGCACGTTCC ATTCX3TTTGG CAACCTCTTC TCCAGTTCTC TCGCCCCTTG 2880 

CTAAATCTTT AACATCAATT TCGCCTTGAA CTTTAACAAC GQACGCTGTT GCATGATTGG 2940 

ATAAAATACT CATTAAAATG TCGCTTTGCG GAAATTCATT TTTTAAAAAT GCAGTTATGG 3000 

CCTCTAAAAT CGTATTAAGC ATATTAGCGC CCATAGCATC TTTCGTATCA ACAAATACTT 3060 

TTAAAGATAG TAACTGTTGC TCAGGAAATG TATCAATCGC TATACGTTGG TAACCACCAC 3120 

CACGCGCTTT AATAGAAGGA TATGCCTCAT CCGCAATTTT ATGAATTTGC TTTTCTAAAG 3180 

CTTTAATGTC TGCTGATAAT TTTTCAGTAT CGTCAACGCC ATCAAAGACG ATTTGACCTA 3240 

TCATAATACG TTCAGAAGAT ACCGTTTTAA ATCCGCCAGT CTGATTCACT AGCTTTGCAC 3300 

CATAACTAGC TGCAGCGACA ACTGAAGGCT CTTCCACCAT CATAGGTACA ACATATGCCT 3360 

TATCGTCCAC AATGATATTC GGTAATAATC CAACX5GGTAA TGCACCTTGC GCGATGACAT 3420 

TTTCAATTAA ACTATTGGCT ACTTCTTCAT CGATTAATGG ATGATTCAGT AAAATGTCGA 3480 

ATTGTTCTTC TGATAACCAT TGCTTATCAA CCAATTGTTG TAACTTTTCT TTACGAGATA 3540 

AATGTCGAAA ATTCTTATCT AAATTTTGCA TGGACGTACT CCTTTTACTT CACATAATTT 3600 

TTAACATTTC AATCACTACT ATTTTTACCA CMAATAACG TCATTCGTCT TAAAATTCAA 3660 

TTGAATAATT GTCGTTTTGA CTTTAAAATA AAACAAGGTA AATTAAAACX5 CTTACAAGAA 3720 

ACGACAAATc ATTTTTAAAT TTAGTATATT TCTTTGTATA AAATTAGCAT ATTCTGATAT 3780 

GATACAAGTG TTGCTTTTAT AAATTTGAAA GGATGTAAAA CCTTATGACA ATAGGTATCG 3840 

ATAAAATAAA CnTTACGTT CCAAAGTACT ATGTAGACAT GGCTAAATTA 6CAGAAGCAC 3900 

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CTGTAAACCA AGACATCGTT TCAATGGGCG CTAACGCTGC TAAGGACATT ATAACAGACG 4020 

AAGACAAAAA GAAAATTGGT ATGGTAATTG TGGCAACTGA ATCAGCAGTT GATGCTGCTA 4080 

AAGCAGCCGC TGTTCAAATT CACAACTTAT TAGGTATTCA ACCTTTTGCA CGCTGCTTTG 4140 

AAATGAAAGA AGCTTGTTAT GCTGCAACAC CAGCAATTCA ATTAGCTAAA GATTATTTAG 4200 

CAACTAGACC GAATGAAAAA GTATTAGTTA TTGCTACAGA TACAGCACGT TATGGATTGA 4260 

ATTCAGGCGG CGAGCCAACA CAAGGTGcTG GCGCAGTTGC GATGGTTATT GCACATAATC 4320 

CAAGCATTTT GGCATTAAAT GAAGATGCTG TTGCTTACAC TGAAGACGTT TATGATTTCT 4380 

GGCGTCCAAC TGGACATAAA TATCCATTAG TTGATGGTGC ATTATCTAAA GATGCTTATA 4440 

TCCX3CTCATT CCAACAAAGC TGGAATGAAT ACGCAAAACG TCAAGGTAAG TCGCTAGCTG 4500 

ACTTCX3CATC TCTATGCTTC CATGTTCCAT TTACAAAAAT GGGTAAAAA6 GCATTAGAGT 4560 

CAATCATTGA TAAOGCTGAT GAAACAACTC AAGAGC6TTT ACGTTCAGGA TATGAAGATG 4620 

CTGTAGATTA TAACCGTTAT GTCGGTAATA TTTATACTGG ATCATTATAT TTAAGCCTAA 4680 

TATCATTACT TGAAAATCGA GATTTACAAG CTGGTGAAAC AATCGGTTTA TTCAGITATG 4740 

GCTCAGGTTC AGTTGGTGAA TTTTATAGTG CX3ACATTAGT TGAAGGCTAC AAAGATCATT 4800 

TAGATCAAGC TGGACATAAA GCATTATTAA ATAACCGTAC TGAAGTATCT GTTGATGCAT 4860 

ATGAAACATT CTTCAAACGT TTTGATGACG TTGAATTTGA CGAAGAACAA GATGCTGTTC 4920 

ATGAAGATCG TCATATTTTC TACTTATCAA ATATTGAAAA TAACGTTCGT GAATATCACA 4980 

GACCAGAGTA GTCGGTGTAT TTAAAACACA TATAATAAAA CCTAAAAGCA GCAGTAAGAC 5040 

CACTTCTAAT TGAAATCGTC TTACTGCTGT TCTCTATTTA TAACACTTCG TATTGAATGA 5100 

ATTCATTATG CCTATTTGAC ACATTATTGA AGTTTTCCTA ATGCCTGGAT CCTTTATACG 5160 
TTACGGCTTC GTGCTATGTT TTGGTACATA AAGCTTTGAC ATATCGATAT TCTCCAACTC 5220 

TAACAGCTTA ATTTTATTAT TAATCGTTCC ACCGAACCCT GTTAAGCTAC CCGTTTTACC 5280 
GACAACACGA TGACATGGCA CGATAATAGA TAATGGATTA CTTCCGACTG CACCTCCAAC 5340 
CGCTTGGGCT GACATTTTTG GCTTGTTAAG CAGCTTGCCT ACTTTTTTGG CAATAGCACC 5400 

ATACX3TTGTT AGAGTCCCAT AAGGAACCTG TCTTAATTCA TTCCAAACAC ACTGTTGAAA 5460 
ATGACTACCT GTTGGCTTTA AAGGTATTGT 6ATTTCAGGA TTGTCACCTT TAAAATACX5C 5520 

GTCTAACCAC TGTGTCGCCT CTCTAAATAT C6CTAAAGAC GTATTTTCTT CCCTAGTACC 5580 

ATCACCTTGT TGATTTTCAA ACAAAACAGC GGTCAGACTT ACCCCATCAC TCAAAAGTTC 5640 

CAATCGTCCT ACAGGCGAAT CATAGTAACT CTTATACTCC ATAAAAATTC CCCCTTTTTC 5700 

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ATAAGTCGTC AATTACGTAT ATAAACACGT AATACCAGCT ATCACTTTGC TGCAATATAC 5820 

AGTTACATAT CTTACTACAC GTGCTAACCT CTTACTTTGT AAACCAAATC TTAAATTAAA 5880 

ATATTGAAAA TGCAATGAAT CCTTAATATT TTATTAAACC TATAATTACT TATTAAAAAT 5940 

AACACACAAT ATTCATAAAG TTTTAAAAAT ATTCTGTTTT ATCACCTACT ATTAGTGGAA 6000 

AAGTACAATT GCAATTGTAT ATAGTTTGCA TAACGCTTCA AAAGTAATTT CTTTTTTGTT 6060 

TAGTTCAAAA AAATTTAGAG GTGATGTTAT ATGAATAACG GTTTTTTCAA TAGCGACTTT 6120 

GATTCAATTT TTCGAAGAAT GATGAAAGAT ATGCAAGGTT CAAATCAAGT CGGAAACAAA 6180 

AAGTACTATA TTAATGGTAA AGAAGTTTCA CCTGAAGAAC TAGCGCAACT CACACAACAA 6240 

GGTGGCAATC ACTCTGCTGA ACAAAGTGCO CAAgcTTTTC AACAAGCAGC ACAAAGACAA 6300 

CAAGGGCAAC AAGGTGGCAA CGGCAATTAT TTAGAACAAA TTGGTCGTAA CCTTACX5CAA 6360 

GAAGCACGTG ACGGTTTATT AGATCCAGTC ATTGGTCGTG ATAAAGAAAT TCAAGAAACT 6420 

GCTGAAGTTT TAAGTAGACG AACTAAAAAC AATCCTATAT TAGTTGGAGA AGCTGGTGTT 6480 

GGTAAAACTG CGATTGTTGA AGGTTTAGCA CAGGCAATCX3 TTGAAGGAAA TGTACCAGCA 6540 

GCAATCAAAG ACAAAGAAAT TATTTCTGTA GACATTTCAT CATTAGAAGC TGGAACGCAA 6600 

TATCGTGGTG CTTTTGAAGA AAATATTCAA AAATTAATCG AAGGTGTTAA ATCTTCACAA 6660 

AATGCCGTAC TATTCTTTGA TGAAATCCAT CAAATTATCG GTTCAGGTGC CACAGGAAGT 6720 

GATTCAGGTA GCAAAGGGTT ATCTGATATT TTGAAACCTG CATTAAGTCG TGGTGAGATT 6780 

TCTATTATTG GTGCAACAAC ACAAGATGAA TATCGAAACA ATATTCTTAA AGATGCTGCA 6840 

TTAACGCGCA GATTTAATGA AGTGCTTGTT AATGAACCAA GCGCTAAAGA TACTGTTGAA 6900 

ATTTTAAAAG GTATTCGCGA AAAATTCGAA GAACACCATC AAGTAAAATT ACCAGATGAC 6960 

GTATTAAAAG CATGTGTTGA CTTATCAATT CAATATATTC CACAACGATT ATTACCAGAT 7020 

AAAGCAATCG ATGTGTTAGA TATTACAGCA GCACATTTAT CTGCGCAAAG TCCAGCTGTC 7080 

GATAAAGTTG AAACTGAAAA ACGAATTTCT GAATTAGAAA ATGATAAACG TAAAGCAGTA 7140 

AGTGCTGAAG AATATAAAAA AGCTGACGAC ATTCAAAATG AAATCAAATC ATTACAAGAT 7200 

AAATTAGAAA ATAGTAATGG TGAACATACT GCTGTTGCTA CAGTTCATGA TATTTCAGAT 7260 

ACTATTCAAC GATTAACTGG TATTCCAGTT TCTCAAATGG ATGATAACGA TATTGAACGT 7320 

TTAAAAAATA TTTCTAATCG TTTAAGAAGT AAAATCATAG GTCAAGATCA AGCTGTAGAA 7380 

ATGGTTTCAC GTGCAATTCG CCGTAATCGT GCTGGGTTTG ATGACGGCAA CCGTCCAATT 7440 

GGCAGTTTCC TATTTGTTGG CCCTACTGGT GTTGGTAAAA CAGAGCTTGC TAAACAATTA 7500 

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GACACAACAG CTGTTTCAAA AATGATTGGT ACAACTGCTG GTTATGTTGG TTATGATGAC 7620 

AATTCAAATA CGTTAACTGA AAAAGTACGC CGTAATCCAT ACTCAGTCAT TCTATTTGAT 7680 

GAAATCGAAA AAGCAAATCC ACAAATTTTA ACATTGTTAT TACAAGTAAT GGATGATGGT 7740 

AATTTGACTG ATGGTCAAGG TAATGTCATC AACTTTAAAA ATACAATTAT TATTTGTACA 7800 

TCAAATGCTG GCTTTGGCAA TGGCAATGAC GCTGAAGAAA AAGATATTAT GCACGAAATG 7860 

AAAAAATTCT TCCGCCCTGA ATTCCTTAAC CGCTTCAACG GCATCGTTGA ATTCTTACAT 7920 

TTAGATAAAG ATGCATTGCA AGATATCGTC AACTTATTAT TAGACGATGT ACAA6TTACA 7980 

TTAGACAAAA AAGGTATTAC GATGGACGTT TCTCAAGATG CGAAAGATTG GTTAATTGAA 8040 

GAAGGCTATG ATGAAGAATT AGGTGCACX3T CCATTAAGAC GTATTGTTGA ACAGCAAGTA 8100 

CGTGACAAAA TTACAGATTA CTATTTAGAT CATACAGACG TTAAACATGT GGATATAGAT 8160 

GTTGAGGATA ACGAATTAGT OGTAAAAGGT AAATAACGAC ACTTTAACAT ATCX3CGCATC 8220 

AAAAATGAGC ATCAGGTCGC CCTTGCCTGT GCTCATTTTT TTAATTATTT CCCTGGAAAA 8280 

TGATTCGCTG TGTGCTGTTC TGTTCCACAA CAATCACGAT TAATGTCACA TGTACCACAT 8340 

TTTCCTTGTT TTGAACGCTT GAAAAATTTT ACTAGTGTAT ATAAGGCATA TCCGAAAATT 8400 

GCTAAAAAAA TTAAAATGTT AATAATGACT GACACTTTAA CCACTCCTTA AACAAATAAA 8460 

TGTCCGACTT GATAAAAAAT GAATGTTAAG ACATATGCAG TGACTAGAGG ATAGGCAACT 8520 

GCAAGTGCCG TCCATTTCCA TGAATAAGTC TCTTTACGGA TTGCTGCTAC TGTAGAAACA 8580 

CAAGGAATAT ACAATAGTAT AAATATCATA AATGCATACG CAGATAGCGG TGTGAATTGA 8640 

TTTTGAATCA CATTAACAAG GCCTGCATCA CCTGATGAAT AGATAATCGC CATCGAACTT 8700 

ACGATAACTT CTTTTGCTAA AAATCCTGGC AcTAACGTAG CACCTGCTTG CCATGTTCCA 8760 

AATCCGAGCG GTTGCACTAA CATACCAAAG AAACTACCAA CCATATGTAA AAAACTTTGA 8820 

TTGATATTCA CATTGATACC ATGTGGTCCT ACATAACTTA ATAGCCAAAT GACTACTGAG 8880 

CCGCCAAAAA TAAATGTACC TGCTTTACGA ACAAAGCCCT TAGCCTTTTC CCAAGTACTA 8940 

CGCCACAACG TTTTAATGGA AGGCACACGG TATGTTGGCA ATTCCACAAT AAAGATTGCA 9000 

TTATCATTTT TTAAAATCGT CTTAGTAAGT ACTGTACTGA CTAAAAATGC CATAATAATA 9060 

CCTAAAACAT ACAGGCTTAA TACTACTAAA GATTGATTCT CTTTGAAAAA GATACCTAOG 9120 

AACAACGCAT ACACTGGCAG TCTAGCAGAG CATGACATGA ATGGTGCAAT TAATATCGTT 9180 

GTTAAACGCT CTTTTTCATT TTCAATACTG CGCGCAGCCA TAATACTCGG TACATTACAA 9240 

CCAAATCCGA TAATCATTGG TATAAAAGAC TTCCCGCTTA AACCGAACGA TTCCATAATA 9300 
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10 



AAAAAGAGCA CAACAATTTG TGGTACAAAG ACTAATACTG ATCCTACACC AGCAATAATG 9420 

CCATCTGTAA TTAAATCTTG TAAAAATGGT ATAACACCAA GATAATTCAT AATCGTCTTC 9480 

ACACTATCTG TAAATGTACC ACCTATAAAT GCATCGAGTT GATCCGACAA AGGTGTGCCA 9540 

ATCCATGTAA ATGTAGTTTG AAAGATCAAC CACATAATTG CTAGAAAGAT AGGCATCCCT 9600 

ATATATTTAT GTGTTAATAT CTTGTCTATT CTAGAGCTGA AATATTGCTT ATCTTCATCT 9660 

GGATACGTTA CCACGTCTTG CAATAACGTC TCAATATAAT GATTGCGTAT ACGCTCCATC 9720 

TCTCGACGAA CAGATACAGC CCCTACTTGT TCAGCAACTT GATCACGTAA ACTCGACAAT 9780 

TTATTTACAA CCTCTGAATT AAGTTCGTTT GCAATTTGCA TGTTATTTAA TAAGAATTGA 9840 

ATCX3CAATAA ACCTAGCTTG ATACTTATCA TGAGATGTCT CTGTCATTAT TATTTGACAC 9900 

ATATTTTTAA TTGTCTCTTC AATCTTCTCA CCATAATTGA TTTTAAAATO CGGTTGATAC 9960 

CCTTCCCCTA GATGCTTTAT TTCGCCAAGT AAATATTTTG TTCCTTTGCC TGTACGTGCC 10020 

ACAACTGGAA AAATAGGTGT TTTTAACTTT TTCATCAATT TATGATAATC GATTTTTATC 10080 

CCGCGCTTTG TAGCTACATC AATCATATTT AATCOGATGT ATATTGGTTG ATTAAGTTCT 10140 

AACAATTGTA CTGTTAATTG CATATTTCTT TTTAGTTGAC TCGCATCAAC AATGTTAATG 10200 

ATTCCTGAAA ATGAATCGTT TAATAAATAG TCTGTCACTA CAGTTTCATC TTTAGAAATC 10260 

GGCGATAAAT CATATGTACC TGGTAAATCA ATTAATTGTC CTACATTTTC TTTAAGTTTC 10320 

CCTACTTTTT TCTCTACCGT TACGCCACTC CAGTTGCCTA TATATTCATA CX5AACCAGTT 10380 

AAAGCGTTAA ACAAAGATGT TTTACCAACA TTAGGATTTC CTAAAATACA ATAATTTTCC 10440 

ATTCGTCCGG CTCCTATTCT TCTAATGCAA TAGAACAAGC ATCGCAATGT CTAATACTTA 10500 

ACTGTTGTCC GTTTACTTCA ATAATACATG GCCCTTTAAA TAAACATTTT TGTTTAATCG 10560 

TIATGATAGC GTCATCTGTT AACCCAAAGG CACTTAGACG ATACAACATA TTCTCATTAG 10620 

CAATATCCAT TCGCTTTATT TTATAAGCCT TATTCATTTC ACCATTTTTA ATGTTTAACA 10680 

TACTATTTCG CTCTCCTATT AGAAATAATA ATCATTATCA CTTAAAAATC ATAACCCTTA 10740 

AAATTGTAGC TCGCAATACT TTATTTAAAT AATTTTCATT TTTCATGTAA AATTTGTGAC 10800 

ATTGCAAAAA TGT 10813 
(2) INFORMATION FOR SEQ ID NO: 365: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6804 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 365: 

TGATGGATTA GCAGACATTT TACGAGCGAA TGGTTTCAAA GTGITTGGTC CAAATAAGCA 60 

AGCAGCTCAA ATCGAAGGCT CAAAATTATT TGCTAAAAAG ATAATGGAAA AATATAATAT 120 

TCCAACTGCT GATTATAAAG AAGTTGAGCG AAAAAAGGAT GCTTTAACAT ATATTGAAAA 180 

CTGTGAATTG CCCGTTGTTG TCAAGAAAGA TGGGTTAGCT GCTGGGAAAG GCGTTATTAT 240 

TGCAGATACT ATTGAAGCAG CCAGAAGTGC TATTGAGATT ATGTATGGTG ATGAAGAAGA 300 

AGGTACTGTT GTATTTGAAA CGTTTTTAGA AGGTGAAGAG TTCTCGCTAA TGACATTTGT 360 

TAATGGTGAT TTAGCAGTAC CTTTCGACTG TATTGCACAA GATCATAAAC GCGCATTTGA 420 

TCATGATGAA GGACCAAATA CTGGTGGTAT GGGGGCTTAT TGTCCmgTAC CACATATTAG 480 

TGACGATGTT TTAAmACTTA CAAATGAAAC AATTGCACAw CCCATTGCAA AGGCAATGCT 540 

TAATGAAGGT TATCAATTCT TCGGTGTATT ATACATTGGT GCTATTTTAA CTAAAGATGG 600 

TCCAAAAGTA ATAGAATTTA ATGCCCGTTT TGGTGATCCT GAAGCTCAAG TATTATTAAG 660 

TCGCATGGAA AGTGATTTAA TGCAGCATAT TATTGATTTA GATGAAGGAA AACGTACTGA 720 

ATTCAAATGG AAAAATGAAT CTATTGTAGG GGTCATGTTG GCATCAAAAG GATATCCTGA 780 

TGCATATGAA AAAGGGCATA AAGTAAGTGG CTTTGATTTA AATGAAAACT ATTTTGTTAG 840 

TGGATTAAAG AAGCAAGGTG ATACCTTTGT TACTTcAGGT GGTAGAGTTA TACTTGCCAT 900 

CGGAAAAGGT GACAATGTAC AAGATGCACA GCGAGACGCA TACAAAAAAG TATCACAAAT 960 

ACAAAGTGAC CATTTATTCT ATCGTCATGA CATTGCGAAT AAAGCACTAC AACTTAAATA 1020 

AGTAAATTTA AAATACTAAG aTTAGCTATG 7VACGAATCTA TAACGATAGA TTTTTTCATA 1080 

GCTTTTTTAG TTGTAGAGTC TAGGACATTG ATTTCTGTAC CAAATTTGTG ATTATGCATA 1140 

TGTAATACAA AAGAGGCGCC ACAACATGTT TGGATGAACA AAATAACATG TTTGTGGCAC 1200 

CTCTTTTGTT TAGTATGGAA TAAATOGTTT TCTTTTTCTA TACAATGAAT TTCTAATTTA 1260 

GTATCTATAC AATTATGGAT AAAATTTAAC CTACACGACC AAGACGAACA TCATCTATGC 1320 

CCGTGATGGG TAAGGTGATT GAACAATAAT ATGCCATAGT AATAATGGCA ATTAAAACTA 1380 

TAATAAAGAT TATATCTTTA TATGAGAAAG GTACGTTGTA ATAGTAAGTA CGAGGACCAT 1440 

CTCTAAATCC TTTCGACTCC ATCGCAACTG ATAATTGATG TGCCTTTCTA ATATTTTGGC 1500 

TTAATAGAGG TATAATTAAA TGCTTAAATC GCTTTAACCC TCTATAATTT GCCGCGTCTA 1560 

TCATCTGATA GCGCATTTTT AAAGATCTGC GAAgcTGTAt TAAAGAACTA ATCATTAAAG 1620 

GTATCATACG AATGGCAGCC ATGAATGCAT AAGCAACTTT TGATTTAACC TTTAAATGTT 1680 
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ATGAAATAGC AATGGTTCTT 


AATGATACAT GTAAACCACG AACTAAACTT TCTGTTGTAA 1800 

TATGGATAAA TCCGAATTTC AAAATTGTAT GGCTACCATT CCCGTATAAA ATCATGAACA 1860 

GGGAAGAGA6 TAATGCAAAG CCAATACTTA TAGTTATAAA AATTGCTGTA ATTTTAAACT 1920 

GAOTACCATT AAACATCAAT AAGAAAACTA ACATTAAGAT AGTGATATAA AGCATAAAAT 1980 

CGAAATTATG CACAAATATA ATAAAGAAAA ATAGTATAAT TCCAAGAAAT AGTTTCGTTA 2040 

TAATGTTGAC ATCATCAACA AATGATTGCC GAACTTTCCA TTGCTCATAC ATTCGTATCA 2100 

CCATCACAAT CTAGTAACGC ACCATCTGAA ATTTTAAGTC TTCTTGATGG ATAACGTTCA 2160 

ATTATTTCAT CGTCATGTGT AaCCATGACA ATACTTTGTC CCAAATTAAT TCGCTmiiG 2220 

AAAAGTTTGA TCAACTGGAA TGTATTATGG CTATCAAGTC CAAATGTCGG TTCATCTAAA 2280 

AAGATAATAT CAGCTTTAGA ACTTAGTGCG GTAgcTACGC TAAGGCGTCG TTTTTGACCA 2340 

ATAGACAACT CATAAGGATG TTGATCTTTT ACATTTTGTA AATCTAAAAG TTTTAAAAGT 2400 

TGTATCGTTT CATCATCACT TTGATCTTTA GAAAGGTGAT TAAAATGAAT GTTAATTTCA 2460 

TCATAAACCG AATTTGTTAT AAATTGTAAT TCTGGGTTTT GATAAACTAG GTACATGTGT 2520 

TTTGCTGCAT GTTTAATTTT TGTTAAACGC TGATTTTCAA AATAAACATC ACCTTGATAT 2580 

TTAATCAATT GCATAATTGA TTCAAGCAAG GTTGTTTTAC CACTACCATT TGCCCCTGTA 2640 

ATTGTAATCC ACTCACCTAG ACCAATTTCT AAATCTGAGA ATGAGAGCAA TGTTGATTTA 2700 

CCX3CGAATAA TACGTCCATT TTTAAATTGT AATAAGTGTG AGTTTGTTGT TGGAAAGTCA 2760 

ACACGACTTG GTGCGAATTC CCATGCACGT GGATGCCACA CACCATATTC ACTGAGTAAA 2820 

TGAACATACT TCTGTAATAT GATTTCAGGA CATTCATCGG CAATGATATT TCCGTTATAA 2880 

TCCATCAAAA TGACGCGGTC GACATGATTC CAGATGTGTT TAACTTTATG TTCAACGATT 2940 

ACAACCGTTT GATCTTCCCA AAGTTCAATT AGTTTAGTCC ATAAATCTTC TGTTGCTTGA 3000 

ACATCTAACA TTGCTGTCGG TTCATCTAAA AAO^ATGTTT TTGATTGTTG AAGAATGGTT 3060 

TCAACAATTG CCAATTTCTG TTTCATCCCG CCACTTAAAT CTTTGATATA CGTTTCAGGG 3120 

GTAACATTTA AATTGACCAT ATTTAAAGCA TTGATAATTA ACGCATCCAT GTCTTCACGT 3180 

GGTAATTGTC TATTTTCTAA AACGAATGCA AGTTCTTCX3T ATACTTTTGG CATACAAAAC 3240 

TGGCTATCAG GGTCTTGGAA AATAACGCCA CTTAATGGGT CAACGATTAG TTCATCATAT 3300 

TTCATAGGTA ATTCAATTAA ATTAGGAACA ATACCACTTA ATACATTCAG AAGTGTACTT 3360 

TTACCX3CAAC CAGAAGGACC GAGTAAAAGT ACTTTTTCTT TGTCTTGAAT AGTGATATTT 3420 

AAATGATCGA AAATTTTACG TTGACCACTT GGATATTTTA ATCGTAAATC ACTTACTTTT 3480 
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ATTTTGTTAC GCXTGTCTTA TCTAAAGCTT TTACTAAAAG GTAAGATAGG ACGCCGGCX»A 3600 

CTACTGCACC ACTAATTAAT CTAAATACGA TGAATAATGT TAAGTTCCAA CCTGCAACTT 3660 

CATTTAAATA ACCATAGAAA TAATCTATCG GGAAAGCCGC GATTGCTGTA CAAAAACCTG 3720 

CTAACATAGC TACCATAACT GAACGTGATT GATATTTAAA AATTGCAAAG ACAAGTTCAC 3780 

ACGCTAAACC TTGTATAAAA GCGTAAACGA TTGTCGGAAT ATCGAAACGA CCCATAATGA 3840 

TAGTTTCGCC GGCACCTGCA GCAAATTCAG CCAGTAAAGC AATACCTGGT TTTGGAATAA 3900 

TTAGATAGCA GACAATCGCT GCCATGAACC AAACCCCGTT TGTTAATTGT TCGAGGTGAA 3960 

GGCCTGTAGC TTGCACACCA TTGTAAACAA ACCACCATAA ATTGTAAATA ACTGCGAATA 4020 

CTACTGAAAT AAGTACXKyTT ACTAGTATTT CAGATAGCTT TAAACCTTTT GACATTTTTA 4080 

CATCCTCCTA ATAAAAAAAC GCACAACCAT CCATAGGAAA GTTATGCGTT CACAATATAT 4140 

ATTAGTAAAA CATATGTATA GTAACACTTT CCTACGCTAG TTCAAGCTAG ATCAGGTTCA 4200 

AAGGGTTTGA GGGCAAGCCT CATCTCAGTA TAAAACACCC CTAGTGTGTG CGATTTATTT 4260 

AATTAATTAT ACTGTAAGAC GTTTGTAAAC TTATGTCAAT AGGTTGTCTT CATGAAATTT 4320 

OGTTTAATTC GATTTAAAAT TTATAATATT AGCATTGGAT TTAAATTGAA GATGTAGTAG 4380 

GAATGTTAGT AATTAAAGAT ATAAAAATAT GTGACATGTA ATAATATTGA GCTGATAAAT 4440 

GAAGAGGGAT ACTTATCAAT CATACCTCTT TAACAACAGT GAAGAACCCG TGCATAATGG 4500 

CTTACGAATT ATAGTTTATA AGGAAGAAGA GGGATACATG CGCCGAGCAC ATGCATAAAA 4560 

GCCCCTAACA ACTAAAAGTT GTAAGGAAGG AGAGGGATAC ATGCGCCGAG CACATGCATA 4620 

AAATCCCCTA ACAACTAAAA GTTGTAAGGG GATTTAAATT AATTTAGTGT ATCTTGGATA 4680 

TCTTGTTTTG kTTGaTTAAT ATCTTCTGTT TTTTCTTCTT TTTTATCTTT TAATTTTTCT 4740 

TCAACTTCTT TAGCTTTTTC TGCTGCTTTT TTATTTTGAT TTTCATTAGA CATGATTAAT 4800 

TCCTCCCAAA TTGGATAATT ATTTATATAT AAATCTTACC CGGTTGTACT TTCGTTAAAC 4860 

TTTTCTAAGT CTATAGCACT ATTTATTCAT TTATCTAAAG ACAACAACAT TAGATTAATA 4920 

TATAATGATT TTGAGGTGAA CATAATGTCT TTTCTTAGGA AACACGCCGA AATTATTTTT 4980 

AGCTATTTAA TCGGTmwCGT TcACTCTTCA CTGGkcTCAT TATTTTAATT AACTTGCCAT 5040 

TAATTAAACA ATTAAATGGT GGTAAAAAAG TTGATACACA TGTTCATAAT GTGTGGGAAT 5100 

TTCTGAATGC ATTTTTCAGT GAAATTATTA AAGTAATGAG TCGATTTATA GGTAATTTCC 5160 

CtATAGTTAG TGCAATTGTG ATAATTATAT TCGGTATTTT AGTTATGTTG ATTGGTCATA 5220 

CATTACTTAG AACTATTAAG TATGACTATG ATATTTCTAT CTTTTTCTTA GTTATCGGTA 5280 
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5 



TTTTCATTAT TCCATTTACA ATTCATATAG GATATATCGT CTATAAAGAT GAATTGAATC 5400 

AGGAAAATGT AAAAAATCAT TTCATGTGGA TAATTGTGAG TTATGGTATA AGTTACTTAA 5460 

TTACACAAAT TGCATTGTAT GGCAQAATTG ATGCTAATGA AATAGAGTCA ATTGATATCT 5520 

TAAGTGTCAA TGCTTTCTTT ATAATTATGT GGTTACTTGG TCAAATGGCT ATTTGGAATT 5580 

TCTTGTTCTT GCGCCGAGCT TTACCTTTAA CAAAGCAAGA ATTAGGTGAA GAGGAGCCAG 5640 

AATTATCAAG AACAAGTAAA GGGAATGTCA CGAATCAAAC TAAAATTCAC TTGAAACAAC 5700 

TCCAAGATAA GACTACAGAA TATGCACGTA AGACAAGAAG AAGTGTCGAT TTAGATAAAA 5760 

TTAGAGCTAA AAGAGATAAA TTCAAAAAGA AAGTTAATGA TATTATCGAT ATTCAAGAAG 5820 

ACGATATTCC TGATTGGATG AGAAAACCGA AATGGGTTAA ACCAATGTAT GTCGAACTAT 5880 

TTTGTGGTGT CGTCATCTTT TTATTCACAT TTTTAGAATT TAATAATC6T AATGCATTAT 5940 

TTGTATCTGG TGATTGGAAA TTATCACAGA CACAATATGT TATTGAATGG GTTACATTAT 6000 

TAATTCTGTT ATTCATTATT ATCGCATATA TCGCTACAAC GTTAACnTC CACTTGAAAG 6060 

GTAAGTTTTA TTATTTACAA TTATTTATGG GGAGCATTTT ATTCTTTAAA TTGTTAACGG 6120 

AATTTATAAA TATAATGATT CATGGACTAT TACTTTCAGT GTTCATTACG CCAACATTAC 6180 

TATTAATGTT ATTGGCAATC ATCATTTCTT ATTCGTTACA ATTACGAGAG CGACCATAAT 6240 

TAAAAGCATT ATAAAAGTAC TATCTATTAA ACATTTTGAT GTGTACGCTA TAAGTTAGAT 6300 

ATATCTCTAA CTTACTTAGA TACAGGTCAA TGAAGTTTAT GGATAGTACT TTTTTTGTAA 6360 

CTAGATTTGA TTGATTCAGG TGATGTGAAT TAAGTATTGA rAATTGTATA CAAAGTTTTA 6420 

GTGCAAATAA AATAGTTGAA AAGTTATCCA TTTGTAAAAT CAAGAAAACT AGTAAATAGT 6480 

TGAAGCGACT TATGGaATTT GCGAAACGAT ATATAGTATT TCCTTTGTAG AAATTTtnACA 6540 

TATATCATTC AAATTACTAA TTTGTTAAAA TCAACAGTAA QATTAGAAGT AQATGATATT 6600 

GAAATTTGGC AAACAaTtTA ATCTATATAA AACTACAACG AAACACAGAA AGGAAGTTGT 6660 

CAGATGAAAA TAGCAACTCT GAACAAAGGC AAAGAAACAA AATATTTTAA TGGATATCCT 6720 

TTAATTGAAG AAGAGGATAT CTATTCACAA GATCATTTAA AAGAAGGAGA TATTTTTCAA 6780 

ATTGTGACTG ATAAATCACA ATAT 6804 



(2) INFORMATION FOR SEQ ID NO: 366: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1717 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 366: 

aaaAGAGACG CATTaAAACA AGCTATACAA ATTATCGATA AATTAACATG GGGTGTTTAG 60 

TGGTGGTTAA AGAAATTTTG AGACTATTAT TCTTACTAGC GATGTATGAG CTAGGTAAGT 120 

ATGTAACTGA GCAAGTATAT ATTATGATGA CGGCTAATGA TGATGTAGAG GCGCCAAGTG 180 

ACTTTGAAAA AATCAGAGCT GAAGTTTCAT GGTAATAGCT ATTATCATTT TTGAATTAAT 240 

TATATTAATG TGTGTAGCAA TAGCACTGGA GGTGTTGTAA ATATGTGGAT TGTCATTTCA 300 

ATTGTTTTAT CTATATnTTT ATTGATCTTG TTAAGTAGCA TTTCTCATAA GATGAAAACC 360 

ATAGAAGCAT TGGAGTATAT GAATGCTTAT CTTTTCAAGC AGTTAGTAAA AAATAATGGT 420 

GTTGAAGGTT TAGAAGATTA TGAAAATQAA GTTGAACGAA TTAGAAAAAG ATTCAAAAGC 480 

TAAAGAGAGG CGTTGGCTTC TCTGCTCTAT CyAAAATAAT GAAAGGAGCC saACATGTTA 540 

GaCmAAGtCA CTCAAATAGA AACAATTAAA TATGATCGTG ATGTCTCATA TTCTTATGCT 600 

GCTAGTCGTT TATCTACACA TTGGACTAAT CACAATATGG CTTGGTCTGA CTTTATGCAO 660 

AAGCTAGCAC AAACAGTTAG AACTAAAGAA GATTTAACTG AGTACAATAA AATGTCTAAG 720 

TCTGAACAAG CCGATATAAA AGATGTTGGC GGATTTGTCG GTGGATATTT AAAAGAAGGC 780 

AAACGGCGTG CTGGTCAAGT CATGAATCGT TCAATGCTAA CACTTGATAT CGATTATGCA 840 

GCCCAAGATA TGACTGACAT ATTATCTATG TTTTAT6ATT TTGCATATTG TTTATATTCA 900 

ACACATAAGC ATAGAGAGAT AAGTCCAAGA CTGCGTTTAG TGATTCCTTT AAAACGAAAT 960 

GTAAATGCAG ATGAGTATGA AGCTATTGGG CGTAAAGTCG CAGATATCGT TGGCATGGAT 1020 

TACTTCGATG ATACAACTTA TCAACCACAT AGGTTAATGT ATTGGCCTTC AACTAGTAAC 1080 

GATGCGGAAT TTTTCTTTAC CTATGAAGAT TTACCTTTGT TAGACCCAGA TAAAATATTA 1140 

AATGAATATG TTGATTGGAC TGACACATTA GAATGGCCAA CGTCTTCAAG GGAAGAGAGT 1200 

AAGACTAAAA GATTAGCAGA TAAGCAAGGC GACCCAGAAG AAAAGCCGGG AATTGTTGGT 1260 

GCATTTTGTA GAGCCTATAC GATAGAAGAA GCTATAGAAA CTTTTATTCC TGATTTATAC 1320 

GAAAAACATT CTACTAACCG TTATACCTAT CATGAAGGTT CAACTGCAGG TGGATTGGTG 1380 

TTATACGAAA ATAACAAGTT TGCCTATTCT CATCATAATA CGGATCCCGT AAGCGGTATG 1440 

CTTGTGAACA GTTTTGATTT AGTACGCATA CACTTATATG GTGCTCAAGA TGAAGAAACT 1500 

AAAACAGATA CTCCGGTTAA TCGACTACCT AGTTATAAAG CAATGCAGCa AAGAGCGCAA 1560 

AATGATGAGG TTGTTAAAAA GCAATTAATT AATGACAAAA TGTCTGATGC AATGCAGGaT 1620 

TTCGATGAAr GAGAAAATAG CGATGATGCA TGGTCTGAGA CGTTnGAAAT TACTTCGAAA 1680 
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(2) INFORMATION FOR SEQ ID NO; 367: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1847 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 367: 

AAGATTAGGC ATCAAGACAG GGTCGCGATT GTTTGAAATC CCACATAGnA ATGATATTTa 60 

CaTTATCAAT CCaAGTATGC GTAAATATCT TAATGTTTCA GTTGCTATTT CTAAGATTGC 120 

ATTGCGTTAT ATTCCACCTG AAGATTTACA CCAATATAGT ATTGAGGAAT TTTTTATGGA 180 

TGTTACTGAT AGCTATCATA GATTTAGTTC TACAGTACAT GCATTTTGCG AAAGACTTAA 240 

ACGTGAAATT TATGAAGAAA CAGGCATTTA TTGTACTGTG GGCATTGGTT CTAATATGTT 300 

ATTAAGTAAA ATTGCTATGG ATGTTGAAGC GAacATAGTc AAAATGGTAT AGCTGAATGG 360 

CGATATCaAG ATGTACCAAC GAAATTATGG CCAATTCmGC CCtTGCGAGA TTTTTGGGGT 420 

ATTAATCGTC GAACAGAAGC CAAATTGAAT AAAAGA6GAA TTTTTACTAT AGGAGATTTA 480 

GCGAAATATC CATATAAATT TTTAAAAAAA GAGTTCGGTA TTTTAGGTGT TGATATGCAT 540 

CTACATGCGA ATGGGATAGA TCAGA6TAAA GTACGTGAAA AGCACAAGAT CAGCAATCCA 600 

TCGATATGCA AAAGTCAAAT ATTAATGAGA GATTATCATT TTGATGAAGC AAAAGTAGTA 660 

ATGCAAGAGT TAATTGAAGA TGTTGCTAGC AGAGTTCGAG CAAGAAAAAA AGTGGCAAGA 720 

ACGATACATT TTGCCTTTGG CTATAGTGAT GAAGGCGGTG TACATAAGCA ATATACTTTG 780 

AAAGATCCAA CAAACTTAGA AAAAGATATT TATAAAGTAG TAATGCATTT CGCAGATAAA 840 

TTAT6TAATA AACAAGCACT ATATCGTACG CTAAGTATAT CTTTGAGTCA ATTTATTAAT 900 

GAGGATGAGC GACgTTAAGT CTGTTTGAAG ATGAATACCA ACGCAAACGT GACGAATGTC 960 

TAGCTAAAAC GATAGACCAA TTACATTTGA AATACGGCAA AGGTATTGTG TCCAAAGCAG 1020 

TATCGTTTAC AGAAGCAGGT ACAAAACACG GCAGATTAGG TTTAATGGCT GGACATAAAA 1080 

TGTAATGACT ATACGGTTTA AGTAATATAT AACTGTGATT CGTATAAAAT AAGTCTCTAA 1140 

AGATAAATAT TTCATATATC ACAATAGATT TTCACAATAA TATCTAAGAA TACATGGAAT 1200 

TTATCAAAAG AGACTTAATA ATTATTGGAT ATAACAATCA AAATCACTCA ATGCTTGCAT 1260 

ACCGCGTTCT CGGTCAGTAG GGTTTTTGAA ACTAATTTTT AAAGCACCGT ATATATCTTC 1320 

GCGTACTTCT AAGATTCTTA AGTTGCTTAT AGATATGTTA TGTAAACTCA GGATATAAGT 1380 
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TAGTCCACCT AGTTGTTTAG CGGGTAGTGC GTCGCGATAC GATTTAGCTT GGGCAAAAAA 1500 

TGATAACAAT TTTTCAGAAT CATTGCTTTC AATTAGTCTt TCTAAATCTT GAAACTGACT 1560 

TTTTAGCTGT CX5AATCATTT CTAAAATATA CGTTTTATTA CTCAAGGTGA TATCTTTCCA 1620 

CATTTGTGCA TTACTACTAG CTATACGAGT GATATCACGA AAACCACCAG CTGCAAGTTT 1680 

ATTAACTAAA TGATGTTCTT GACCGTTCTT TTGACTAACA TGAACTAAAC TAGATGCAAC 1740 

GATATGAGGT AAATGACTTA CGACGCTTGT TACGTAGTCG TGTTCTTCAG CAGTAGTTAC 1800 

AATAAATTTA GCAAGAGTAG GTGATAACAG TTCnTTAAC GTGTTTG 1847 
(2) INFORMATION FOR SEQ ID NO: 368: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 494 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 368: 

AACAAAAGGC ATAAGTTCGT GAATTAATGC GTATACAAGG ATAAAGCTTA TAACAGTAGT 60 

AATTGTTGCT ATCAAACGAA CAACATATAT TCTATTTTCA GATAGCAATC GATTCACTAA 120 

TCGATAATTT ACGTATACAA GAATTAACAA CAGCACAATA TAAACAATAA TCATATTTGG 180 

CCCCATTTAA TTTTTAAATT TGTTTTCACA TCATTTTCTA CTTTATTATC ATACTAATTT 240 

TAAAGGCAAA GGTGGACATC GGCAACCTCT CGTAAACTAT TTATCAAAAA TAAACGTATC 300 

TCATTGTTAT GATATTTATA AATCAATTCG TTTTTATTAA AGTCTTTTTC AACAAGTTTG 360 

TCACTATCTA TT7WVTAATC ACGCATGCAT CCTTTTAAGA AATCATCTTT ATAACTTGGT 420 

GTGTACCATT TTCCATCCTC TTCAATGACA ATGTTGCCAA TATCAAATTC AAGGACCTTG 480 

CCGTCCTCTG AAGT 494 

(2) INFORMATION FOR SEQ ID NO: 369: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2518 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 369: 

GTCCATATTT CCCGTCCAAC CAACTAAAAA TGGGTAAATC CCTGGATTTa AATCTACTTC 60 
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GATAATTTCn AATGTAGCGA CCATCATTAC GAGATGATTT GATATAAGCA CAGTTTGGAT 180 

GTTGACCAAT ACTATCGCCT TCTTCTTCGA TGATATCTAT TTTAATACCA TCATCAGCTG 240 

CAATTTCTAA TGAAGATTTA ATTCGGTTAT CAAATGTTGA ATATCCCATT GCTCCACCXa^ 300 

CAATAGCGAC ATCTGTACCA TGTCCTTGGT GTGTTTGAGC AAATGATTCA TAATAATGTA 360 

TTTCAATATT TTTAGGAATA TCTCCCAATA TTGCGCGTGC TGAATTCCCA ATCTTTACTG 420 

CACCAGCCGT ATGAGAACTT GAAGGGCCCA TCATAACTGG TCCGATAATA TCGAAAGCAC 480 

TTTGATAATC ATAGCTCTTT GCCATAATTA AACACTCTCC TTAATATGAT TCTTTTTGCT 540 

CGGCATTTTA AAGTTGATAT TCATTAAATT AAACTTATTA ATTAGTGTTC CAATAATATA 600 

GGCTAAGACA ATGCTGACAA TAATCACTGT TGCGATTGTT ACAATTGATG TCACTGCATT 660 

ATTAAAGCCA AACAATACGA TGGCGCCTGC AATTG6TGTT GCCATACCTT TGACACCTAT 720 

TACTAGTCCG CTAAATQTCA CGATACATOC GTTGACGACC CCAATCAGTG CATTTGTACC 780 

ATATAGTTGT ACTGGATATT GCGCTATTAA ATCAATTTGC GTC3UVTGGCT CAATACAAAC 840 

TGCAAATGCT TTTGACGGTC CACCAATGTT TAATTTTCGG AATAAAATAA GGTTAACAAA 900 

TGAGCTACCT GTACATGTTA GTGCTCCAAT AGCCATAGGA ACACCTGTCA GTCCTAATAA 960 

ACTTGTTAAT ACCATTGAAC TTAGCGGTGT CATACCTGTA ACAGGAATCA CTAGTCCTAA 1020 

AATGACCGCT AATGCATATG GATTGTTATC ACCTACCGCA GTGACAGCAC TACCTATTTG 1080 

TTTTAATGTT GCTAGCACAC CAGGTGTAAT GATTGATGCA AGTCCGAAAG CAATTGCTGG 1140 

TGCAAATAAG ATCACCACAA TTAAGTCCAA GCCTTCTGGA ACTTTCTTTT CAATCCATTT 1200 

AATTAAAAAA GcTACGCCAT AAGCTGCGAT GAATGCTGGT AATAATTTAA AGTCATGTAA 1260 

TACTAAACCA ACAATGACCG CAAATACTGG TGCAACGCCT AAGTTTAAGC ACGTTAGAAT 1320 

ACCTACTGcO ATACCGCTTA AACTTCCTGC TAAATCCCCA ATATCTTGTA GAAATTTAAT 1380 

ATCAAATACG CCACCAATAG CATAACTTAA OAATGCTTGT GGTAGAAATG TCGCACAAGC 1440 

TGCACCGGAT AATGCTTGTA GTCCTTGTTT ACCGTAOGGT GCATACTTTA AAAATAGCGT 1500 

CATGATCACT AAAACCAAGA CTAATGTGCC TACACCTAAC AGAATATCCA TTTCCCAAAA 1560 

ACCTCTCTCT ATGTTTATTT TATTTTCAGA CCATAAACAT CGTACACCCT TAAGAAAACG 1620 

TTTTCAACTT TTATCTGTTA TCAAATCaAA TATTTAAGTG AAATATTTCA TATTTGTGAA 1680 

AGATTTTTAA AATGGATTGT TTCAAAAACT ACTTATGTTG TCGTTAATAT TTACTAATTA 1740 

ACTTTTTACT CTATATTTCA AACAGTTGTG TGACAGTTTT TTGATAACTT TTTTACATCT 1800 

GAAAGTAAGT AATATTTCTA AAAACTTTTA ATATTTATAC ACTTTATCTT TCGAGCTATT 1860 
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ACACATATAT TTGCAATAAG ATAATTAAAG TAGGATATTA ri TTT A GnT TCTGATAGGA 1980 

ATGATGATAG TCTATAGGTT GAATCTTTAC TTTTTTTTAA AGCTAAATTT ACATCAACTT 2040 

AACAATGGTT GGTTATACTG AAGATGAAAA TATTTAGTAT AACTTAGTGG AGGCGATAAA 2100 

GGTGCAATTG AGTCATTCCG TTAAAGTTGC AATTTCTATC TATTTAOCAC TTATCTTTAT 2160 

AACGTTCACT TCTTATTTAG TCATTATTTT ATATACGAGT ATGACTGGAC ATGATGTATC 2220 

ACATTTCGTG TTAGATAGTC AGCATTCTCA TCATGGATCT CTTACX3CAAA AACATTTGAG 2280 

TCTTCCTGAA ATCTCATTTA AATAGTTTAT CCTCTGTGTT TCAACATTCA TTTCCCATAT 2340 

CGATTCATTT ATCTATCATC TAGACCACTA CATCTTAGAT GATTTTTTTA TTTTCTCATT 2400 

TCACTCTTTC TTAAAAGTCG ATATAATGAA TTAAATCATT ATCATACACC GACATATTTT 2460 

ATGTTGTTGG TGTTAAGTTT aAAGGGGTGA GATACTTGGC GAATaATCaT TCAGCTTT 2518 
(2) INFORMATION FOR SEQ ID NO: 370: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 790 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 370: 



ATACTACTGG ACCGTCnACC AATTGACAAA ACCTGAAAAC TGGATTTTTA TTGCAGGATT 60 

ACATCGAACC TGGAAGAACA ATCCCCAGGA GTTGATGATA TTCGAGATAT GGCATACAAT 120 

CAAGGTAGTT TAGATAAGAC AATTTATGAA ATTTCTAAAC GCACAGTACT ATTTTTAATA 180 

CAGAAAGATA TTACGGTATA TAATAAGACG ATTGACTGTT TAAATTATTA TAACTATAGT 240 

GACGAAAGAA TAAAGGATGA TTAAATGAAT TCACAAGAAT TATTAGCAAT TGCTGTGGAT 300 

GCAATTGACA ATACCCCAGG CGAAGATACG ATTTCTTTAG AAATGAAAGG TATCAGCGAT 360 

ATGACAGATT ATTTTGTTGT AACGCACGGA AATAATGAAC GACAAGTTCA AGCGATTGCT 420 

AGAGCGGTGA AAGAAGTAGC CAATGAACAA AATATAGAAG TAAAACGTAT GGAAGGATAC 480 

AATGAAGCGC GTTGGATATT AATTGACTTA GCTGATGTTG TGGTACATGT TTTCCATAAA 540 

GACGAAAGAA ATTATTATAA TATTGAAAAG TTATATCAAG ATGCACCATT AGAATCATAT 600 

AGTCAGGTTG CGTATTAATT ATGTCGCAAT ATGCAGAAAT GAGCCTAGTG TACGATCAAT 660 

TGACTCAAGA TCAACCATAT GAAAAATGGT TTGAAATTGT AAAAAATCAC TGCAAAGATG 720 

AATCAAATAT TTTAGATATT GGATGCGGTA CTGGTAGTTa ACAGTTCAAT TAGAAGCTTT 780 
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(2) INFORMATION FOR SEQ ID NO: 371: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1823 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 371: 

ATAGATGAAG GTGCAAATAT TGAAaTAGGT TATTTACCTG GACGCTTGAA ATGGTTAGTT 60 

GCTGATTTAT TAACTAAACA AGGATTAAAA GTAGTTAACG ACGATATGAC AGGAAGAACG 120 

TTAAAAGATC GTAAATTATT AACAGGTGAC AGTCCTTTAG CTTCAAATGA GTTAGGAAAA 180 

TTAGCAGTTA ATGAAATGTT AAATGCAATA CAAAATAAAT AATTAAATAT TAATTAGAGG 240 

AGCCTCATAT GTAAATGTAT GAGGGCTCTT TTTTTTGGCA AAATTTAAGT GATACTTGTA 300 

AAATAGAACC TATTATGAGT ATGATTTAAG AAAACGCTTG CAAAACTAAT AACCGCAACT 360 

AGCGATATGG AGGAAACATG ATGTCTTATA GCATTGGAAT TGATTATGGA ACTGCTTCAG 420 

GCCGTGTGTT TTTAATTAAT ACAACTAACG GTCAAGTAGT ATCAAAATTT GTGAAACCAT 480 

ATACACATGG TGTCATTGAG AGTGAATTAA ATGGTTTGAA AATACCACAT ACATATGCAC 540 

TTCAAAATAG TAATGATTAT tTAGAAATTA TGGAAGAAGG AATATCATAT ATAGTACGTG 600 

AATCAAAAAT AGATCCAGAC AATATAGTAG GTATTGGTAT AQACTTTACT TCATCTACTA 660 

TTATTTTTAC TGACGAAAAC CTTAACCCGG TACATAACTT AAAACAATTT AAAAACAATC 720 

CACATGCGTA TGTGAAACTT TGGAAACATC ATGGTGCATA TAAAGAGGCA GAGAAATTAT 780 

ATCAAACTGC TATTGAAAAT AATAATAA6T GGTTAGGCCA TTATGGATAT AATGTTAGTA 840 

GTGAATGGAT GATTCCCAAA ATAATGGAGG TCATGAATCG AGCACCAGAA ATTATGGAAA 900 

AAACGGCTTA TATTATGGAA GCGGGCGATT GGATTGTAAA TAAATTAACT AATAAAAATG 960 

TACGCTCGAA TTGTGGATTA GGTTTCAAAG CATTTTGGGA AGAAGAAACA GGGTTTCATT 1020 

ATGATTTATT TGATAAAATA GACCCCAAAT TATCAAAAGT AATTCAAGAT AAAGTATCTG 1080 

CACCGGTTGT TAATATTGGT GAAGCAGTAG GGAAACTGGA TGATAAAATG GCACAGAAAT 1140 

TAGGATTATC AAAAGAAACT ATGGTAAGTC CTTTTATTAT TGATGCCCAT GCTAGTTTAT 1200 

TAGGTATTGG GTCTGAAAAA GATAAAGAAA TGACTATGGT GATGGGAACA AGCACATGCC 1260 

ATCTTATGTT AAATGAAAAG CAACATCAAG TGCCAGGTAT ATCAGGTTCT GTAAAAGGAG 1320

CAATTATTCC AGAATTATTT GCTTATGAAG CGGGGCAATC AGCAGTAGGT GATTTGTTTG 1380

CTGTATTTGA ATTAATGAAT GAAAAGATAA AACATCAAAT GCCAGGTGAA AGTGGGCTCA 1500

TTGCTCTTGA TTGGCATAAT GGAAATCGAA GTGTATTAAG TGATAGCAAT TTAACAGGTT 1560 

GTATCTTTGG ATTAACTTTA CAAACTAAGC ATGAGGaTAT TTATAGAGCm TATTTaGaAG 1620 

CTACaGCATT TGGTACtAAG ATGATTATGC mACAGTATCA AGATTGGCAT ATGGaAGTAG 1680 

aAAAGGtATT TGCaTGTGGc gOTAtACCtA AAAAGAATGC TGTTATGATG GATATCTATG 1740 

CGAATGTACT GAATAAAAAA CTAATTGTTA TGGATAGTGA GTATGCACCA GCAATAGGCG 1800 

CAGCAATATT AGGTGCAGTC AGT 1823 
(2) INFORMATION FOR SEQ ID NO: 372: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1600 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 372: 



ACGATCATCT GCATCAGCGA ATTCCGATGC AaTTcATATG tCCtAATAAC AAAAGAATAG 60

GTTTTaAAAG ATATGCATAC CGTAATGaTA GATATAGTTT TAAACGTGAC TTCAAGCTAT 120

ATGaATGTGA TGACTGTTCA TCATGTTCTT TGAGACATCA ATGCATGAAG CCAAATTCGA 180

AATCCAATAA GAAAATTATG AAGAATTATA ATTGGGAATA CTTTAAAGCC CAAATTAATC 240

AAAAGCTTTC TGAACCAGAA ACGAAAAAAT CTATAGTCAA AGAAAAATTG ATGTAGAGCC 300

TGTTTTTGGA TTTATGAAGG CTATTTTGGG TtTCACTCGA ATGTCAGTTC GGGAATAAAT 360

AAAGTTAAAC GAGAGCTAGG TTTTGTATTA ATGGCACTTA ATATAAGGAA AATAgcaGCT 420

CAACGAGCTG TACATTATAA AATACATATC AAAAAAGCTG ATTTCTATCA AATAAATAAT 480

AGAAATCAGC TTTTTTACAT TGCCTAAGAA CTTTAAGGAA CTTAATGTCC CAAGCTCTTT 540

TTTGTTATAT CTAATTCGTA ATTTATGATT GTTTATTCGG TCCTTTGATG TTCACTAAAT 600

GTGACTTTAA ATCTTGTTCT AATTGTTGTA ATTCTTTTTC AGCTAATTGT CGTTCTTCGC 660

GACCGTGTTG TTGAATAATT AATGTTTCTT CAATTGTCTC AATAATGTTA CGCTGTGTAC 720

GTTTCAATGT ATCAAGATCA ACAATGCCAC GCTCATTTTC TGTTGCAGTT TCAATCGCAT 780

TTTGTTTCAA CAITTCAGCA TTTGCTGTTA ATAAATCATT AGTTGTATCA GTGACAGCTC 840

GTTGTGCAGC AACTGCATTA CGCTGTCTCA TTAATGTAAG CGCAATGGCC ATTTGATTTT 900

TCCATAGTGG AATACTTGTC AAAATTGAAC TTTGTATCTT CTCGGCAAOT GCTTGATTAA 960

AGTCATATAT GCGTTTATCT AGTCTATCTA TAAATTGCTG CATATCTGCA ACTTGTTGAA 1080

TATCCATTTG ATTAGTGGAt TgtGCGCTTG CTGTTGCAAT TGTGGTAGCT TTTCATTTTC 1140

TAATTGCAAC TTTTTTTGcT GTGCAGCAAT GATATGCAAT GATAAGTCAT CAAAGTATTG 1200

TTTGTTTTTA TCATATAGCG TATCTAATAA TTCAATATCT CTTGTTAAAT GTGTTTGATG 1260 

TTTCTGCAGT TGAATCGTTA TGCGATCGAC TTGAGCACTA ACTGATTGCA TTCTTGAAAA 1320 

GATTTCATTG ATAGACGACT TTGCTCTGCT AAAAATTCTT TTTAACATAG ATGGTTTATC 1380 

AGTATTTAAC TCATTTGGAT TAACTGACTT TAGTTTTGAC ATTAGATCTG ACAAAGTATC 1440

TCCAATAGGA CCAACATCTT TACTTTGTAC TTCATCCAAC ATTTGATGTG AAAATTGAGA 1500 

CATTTGTTTC TGkAAATCAG AACCAAACGC TAATAAACCT TCATTGTCTA AAGGGTTAAT 1560 

TTGTTTACTG ATTGTGTCTA CCTGTTTTTG TTGTTCAATT 1600
(2) INFORMATION FOR SEQ ID NO: 373:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1227 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 373: 

ATGTTGATAA TGGTAGCTTC TACAAGAATA AAGACCAACA AGTTGGTGCA ACAATTCTTG 60

aTAGTAAAAC TGGTGGTTTA GTTGCTATAT CTGGTGGACG TGATTTCAAA GACGTCGTTA 120 

ACAGAAACCA AGCAACAGAT CCTCACCCTA CTGGTTCATC TTTAAAACCT TTCTTAGCGT 180 

ATGGACCTGC CATTGAAAAT ATGAAATGGG CAACAAACCA TGCGATTCAA GATGAATCTT 240 

CATATCAAGT TGATGGTTCT ACATTTAGAA ACTATGATAC GAAGAGTCAC GGTACTGTAT 300 

CTATTTATGA TGCTTTACGA CAAAGTTTCA ATATCCCAGC TTTAAAAGCT TGGCAATCAG 360

TTAAGCAAAA TGCTGGTAAT GATGCACCTA AGAAATTCGC TGCCAAACTT GGCTTAAACT 420

ACGAAGGCGA TATTGGTCCA TCTGAAGTAC TTGGTGGTTC TGCTTCAGAA TTCTCACCAA 480

CACAATTAGC ATCAGCATTT GCTGCAATCG CTAACGGTGG TACTTATAAC AACGCGCATT 540

CAATTCAAAA AGTAGTTACT CGTGATGGTG AAACAATCGA ATACGATCAT ACTAGCCATA 600 

AAGCGATGAG TGATTACACT GCATACATGT TAGCTGAGAT GCTAAAAGGT ACATTTAAAC 660 

CATATGGTTC TGCATATGGC CATGGTGTAT CTGGAGTAAA TATGGGTGCT AAGACAGGTA 720

CTGGTACTTA CGGTGCTGAA ACTTATTCAC AATATAATTT ACCTGATAAT GCAGCGAAAG 780

AA6TTAAACA ATATGGTGAA AACTCATTTG TzGGACATAG CCAACAAGAA TATCCACAGT 900

TCTTATATGA AAATGTGATG TCAAAAATTT CATCTAGAGA TGGCGAAGAC TTTAAACGTC 960

CTAGCTCAGT AAGTGGTAOT ATCCCATCAA TCAATGTTTC TGGTAGTCAA GATAACAACA 1020

CTACAAATCG TAGTACACAC GGTGGTAGTG ACACATCAGC AAACAGCAGT GGTACTGCAC 1080 
AATCAAATAA CAATACTAGA TCTCAACAAT CTAGAAACAG CGGTGGATTA ACAGGTATAT 1140 

TCAACTAATC CACTCAACAT AAAATCCTCA GTTATACCAT ATTTATGGTG TAGCCGAGGA 1200 
TTTTnTTAGG TTCTTCATCT TTTATGG 1227 
(2) INFORMATION FOR SEQ ID NO: 374: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1953 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 374:

CCATATGGtG CAACATTCTT CGTATTTAGT GATTATTTAA AACCAGCGTT ACGTTTATCA 60

TCAATTATGG GATTAAATgC aACGTTCATC TTCACACATG AyTCaATTGC AGTAGGTGAA 120 

GATGGTCCTA CTCATGAACC AATTGAGCAA TTAGCTGGAT TAAGAGCCAT TCCAAATATG 180 

AATGTTATCC GTCCTGCTGA TGGTAATGAA ACAAGAGTAG CATGGGAAGT TGCCTTAGAA 240

TCTGAATCTA CACCTACTTC ATTAGTATTG ACACGTCAAA ACTTACCGGT ATTAGATGTA 300 

CCAGAAGATG TAGTTGAAGA AGGCGTTCGA AAAGGTGCCT ATACAGTTTA TGGCTCTGAA 360 

GAGACACCAG AATTCCTATT ATTAGCTTCA GGTTCAGAAG TTAGTCTTGC AGTTGAAGCT 420 

GCTAAAGATC TTGAAAAACA AGGTAAATCA GTACGTGTTG TTTCAATGCC TAACTC3GAAT 480 

GCATTTGAAC AACAATCTGA AGAATATAAA GAATCAGTTA TTCCATCAAG CGTAACAA7UV 540 

CGTGTTGCGA TTGAAATGGC TTCACCGCTT GGATGGCATA AATATGTAGG TACTGCAGGT 600 

AAAGTTATTG CTATTGACGG CTTTGGCGCA AGTGCACCTG GCGATTTAGT AGTTGAAAAA 660 

TATGGATTTA CAAAAGAAAA TATCTTAAAC CAAGTTATGA GCTTATAAGA ATAATTTATA 720 

AAGCGAGTAT GTTTAGAAGT CTAGGATGCA TAATCTTAGG CTTCTTTTAA AGTGTTGAAA 780 

TTTAGAGTAT AGCACTTAAA CTACATCATA AGTGATAAGT TATGAAAGTA TACTATTTCA 840 

GATTAATCTT TAAAAGCTCT GTTATAACAG CATGATTTTT GATATTATTT TTAGTATCGA 900

TATTAAAATA CTTGAATAAA CTAGTTCTTG AAATAATGTG ATGAATTTAG TAAAATICAG 960

TGCATTAATT TTAGGTTTAA TTGGAGGTTT CCTTTTAGCT AGAAAATATA TGATGGACTA 1080

CTTGAAGAAA AACCCACCAA TCAACGAAGA AATGCTTCGT ATGATGATGA TGCAAATGGG 1140

TCAAAAACCT TCTCAGAAGA AAATTAATCA AATGATGACG ATGATGAATA AAAATATGGA 1200

TCAAAATATG AAGAGTGCGA AAAAGTAAAT TCGCAATTGA TAGAGGCTAT TTTCCAGATA 1260

TGGAAATGGC CTCTTTTTAT AATCAAATTA ATAAGAATAA ATATGTTTAT TAAAATTAAA 1320

GTTAACAAAA TGACGAATAG ACTGAGAAAT GCTATAATTC ATTTTGTATG ATTTACAGAG 1380

AGTTTATTTA ACGAGAAGGT GTCyGCGTGC TCTATTTAAT ATTTTCAATC ATTGTAGCTT 1440

TATTTATGGG AACTATAGTT ATAGTTATTC GTATGAAAGC TCAAAATTAT CCGGTAAAkG 1500

A6AAAAAAAT AGTTTTGCCa CCgTTTTTTA tGGCgACCGG TGCATTGATG TACGTCGTTC 1560

CaTATTTTAG GCTAACAGGA TCGGAAATGC TAGAAGCCTT TATAATTGGT TTGCTTTTTT 1620

CtACAGTTCT AATTTGGACT TCTCGATTTO AAGTCAAAGG TACAGAAATT TATATGAAAC 1680

GATCTAAAGC ATTTCCAGTT ATTTTGATTT CATTACTTAT CATTCGTACT GTGATGAAAA 1740

TATTCATTAG TAATGAAATA GATCCTGGAG AATTAGGCGG CATGTTCTTT TTATTAGCAT 1800

TCTGTATGAT TGTTCCTTGG AGAGCA6CAA TGCTATATAA ATACAAAAAA CTAAAGAAAA 1860

CATTAATCAA TTAATTACTT TTaAAACCAC TTGTGATCGA CTTCTAAATC AGTCAATGAG 1920

TGGGTTTAAT nTTACTTGGA AAAGGnGGAA AGG 1953



(2) INFORMATION FOR SEQ ID NO: 375: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3787 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 375: 



ACATTTGATC AAnTTATCGA CATTAAAGAT GAATTCAnTT GATCGTnTCA ATGATTATCC 60

TGTTGAAGTA GCACGTTTGC TTGATATAGT GGAmATaAAA GTACACGCAT TACATTCAGG 120

TATCcACGTT GATTAAAGAT AAAGGGAAAA TAATTGATAT TCATTTATCT GTAAAAGCCA 180

CTGAAAATAT TGATGGCGAA GTGCTGTTCA AAGCAACACA ACCTTTAGGT AGAACAATGA 240

AGGTTGGTGT TCAAAATAAT GCAATGrCAA TTACTTTAAC GAAACAAAAT CAATGGCTTG 300

ATAGTTTGAA GTTTTTAGTT AAGTGCATTG AAGAAAGTAT GAGAATCAGT GATGAAGCAT 360

AAAGAAGCAT TTAATGGCGT TGTCGTGTTA ACTGCTGCAT TAATTGTCAT TAAAATTCTG 420

CAACAAGTGT ATCCAATTGT AGCATTAGGA ATGATATTAT CGATGAATGC CATTCCTAGT 540 

GCAATTACAC AAAATATAGG GAAGTATCAT AGTGACGAAG CATATGCAAA AGCAGTCGCT 600 

TATATACAAT TAGTTGGTAT ATTATTATTT ATTGCTATTT TTGTGTTTGC GAACAATATT 660 

GCACATATGA TGGGTGATGG CCATTTAACA CCAATGATTC AAGCTGCAAG TTTAAGCTTT 720 

ATATTTATAG GTATGCTTGG CGTGTTAAGA GGTTATTATC AATCTGCAAA TAATATGACA 780 

GTTCCGGCTA TTTCCCAGGT TATAGAACAA GTTATACGAG TAGGTATTAT CATTGTTACT 840 

ATTGTTATTT TTGTAGACAG AGGTTGGACG ATATATGAAG CGGGAACAAT TGCTATTTTA 900 

GCATCAACGA TAGGTTTTTT AGGTTCTTCA ATTTATTTAG TAGCGCACCG ACCTTTTAAG 960 

TTTAAAATGG TAAATAACAC T6CAAAGATC GTTTGGAAAC AGTTCGCACT TTCGGTTTTG 1020 

ATTTTCGCTA TCAGTCAATT AATCGTAATT TTATGGCAAQ TGATTGATAG TGTTACTATT 1080 

ATTAAGTCAC TTCAAGCGAT ACGOGTGCCA TTCGATGTTG CCATAACTGA AAAAGGAGTC 1140 

TATGACCGTG GTGCATCATT TATTCAGATG GGATTGATTG TAACTACAAC ATTTAGTTTT 1200 

GCGCTCATTC CTCTGTTAAG TGACGCAATC AAAATGAATA ATCAGGTACT TATGAATOGT 1260 

TATGCAAATG CGTCATTAAA GATTACGATT TTAATAAGTA CAGCAGCGGG AATAGGATTA 1320

ATTAATTTAT TGCCTTTAAT GAACGGTGTG TTTTTTAAGA CGAATGATTT AACCTTAACG 1380 

TTAAGTGTTT ATATGATTAC GGTCATTTGT GTATCGTTAA TTATGATGGA TATGGCATTA 1440 

TTACAAGCGC AACATGCTGT GAGACCTATT TTTGTTGGTA TGACGGCAGG ATTGGTTATT 1500 

AAATTTATAC TTAATATCAT TTTGATTCGT TTAAGTGGCA TTATTGGTGC GAGCATTAGT 1560 

ACTGTTGTAT CATTAATTAT ATTCGGTACG ATTATCCATA TTGCT6TCAC GAGAAAATAC 1620 

CACTTATATG CGATGAGACG ATTTTTTATC AATGTTGTTT TAGGTATGGT ATTTATGTCG 1680 

ATTGTTGTTC AATGCGTGTT AAACATAGTG ACAACACACG GTAGAATCAC TGGACTCATT 1740 

GAATTATTAT GTGCAGCAGT ATTAGGTATC ATTGCATTGT TTTTCTATAT TTTTAGATTT 1800 

AATGTTTTGA CATATAAAGA GTTAACTTAT TTACCATTTG GTTCAAAGTT GTATCAAATT 1860 

AAGAAAGGAA GACGTTGATG GCACATACCA TTACGATTGT TGGCTTAGGA AACTATGGCA 1920 

TTGATGATTT GCCGCTAGGG ATATATAAAT TTTTAAAGAC ACAAGATAAA GTTTATGCAA 1980

GAACGTTAGA TCATCCAGTT ATAGAATCAT TGCAAGATGA ATTAACATTT CAGAGTTTTG 2040 

ACCATGTTTA axSAAGCACAT AACCAATTTG AAGATGTCTA TATTGATATT GTGGCGCAAT 2100 

TGGTTGAAGC TGCTAATGAA AAAGATATTG TCTATGCGGT TCCGGGTCAT CCTAGAGTTG 2160

CTGAGACAAC TACAGTGAAA TTACTGGCTT TAGCAAAGGA CAATACTGAT ATAGATGTGA 2220

ATGATGGCTT CACACTGTTA GATGCGACAT CATTACAAGA AGTAACACTT AATGTTAGAA 2340

CGCATACATT GATTACGCAA GTTTATAGTG CAATGGTTGC TGCTAATTTG AAAATCACTT 2400 

TAATGGAACG ATATCCTGAT GATTACCCTG TTCAAATTGT CACTGGTGCA CGAAGCGATG 2460 

GTGCGGATAA CGTTGTGACA TGCCCATTAT ATGAATTGGA TCATGATGAA AATGCATTCA 2520 

ATAATTTGAC GAGTGTATTC GTACCAAAAA TCATAACATC GACATATTTG TATCATGACT 2580

TTGATTTTGC AACGGAAGTG ATTGATACTT TAGTTGATGA AGATAAAGGT TGTCCATGGG 2640 

ATAAAGTGCA AACGCaTGmA AcgCTAAAGC GTTATTTACT TGAAGAAACA TTTGAATTGT 2700 

TCGAAGCTAT TGACAATGAA GATGATTGGC ATATGATTGA AGAACTAGGA GATATTTTAT 2760 

TACAAGTGTT ATTGCATACT AGTATTGGTA AAAAAGAAGG GTATATCGAC ATTAAAGAAG 2820 

TGATTACAAG TCTTAATGCT AAAATGATTC GTAGACACCC ACACATATTT GGTGATGCCA 2880 

ATGCTGAAAC TATCGATGAC TTAAAAGAAA TTTGGTCTAA GGCGAAAGAT 6CTGAAGGTA 2940 

AACAGCCAAG AGTTAAATTT GAAAAAGTAT TTGCAGAGCA TTTTTTAAAT TTATATGAGA 3000 

AGACGAAGGA TAA6TCATTT GATGAGGCCG CGTTAAAGCA GTGGCTAGAA AAAGGGGAGA 3060 

GTAATACATG AGATTAGATA AATATTTAAA AGTATCACGG TTAATAAAGC GACGTACGCT 3120

AGCAAAAGAA GTAAGTGATC AAGGTAGAAT TACAATAAAT GGTAATGTTG CTAAAGCTGG 3180 

ATCGGATGTT AAAGTTGAAG ATGTGCTGAC GATTCGCTTT GGTCAAAAAT TAGTAACAGT 3240 

TAAAGTAACT GCATTAAATG AACATGCATC TAAAGATAAC GCGAAGGGTA TGTATGAAAT 3300 

CATTGAAGAG CGTCGACTTG AAGAAGCGTA AATTGGAGGT GACAAGCAAT GAAAAATAAA 3360 

GTAGAACATA TAGAAAATCA GTACACGTCG CAAGAGAACA AGAAAAAACA ACGTCAAAAA 3420 

ATGAAAATGC GTGTTGTTCg TAGGCGTATT ACAGTATTTG cGGGCGTATT aCTTGCGATA 3480 

ATTGTTGTTT TATCaATCTT GCTTGTTGTC CAAAAACATC GCAATGATAT TGATGCACAG 3540 

GAGCGAAAAG CGAAAGAAGC ACAGTTTCAA AAGCAACAAA ATGAAGAAAT TGCGTTAAAA 3600 

GAAAAGTTGA ATAATCTGAA TGACAAAGAT TACATTGAAA AAATTGCGCG TGATGATTAT 3660 

TACTTAAGCA ACAAAGGTGA AGTGATTTTT AGGTTGCCAG AAGACAAAGA TTCGTCTAGC 3720 

TCAAAATCTT CGAAAAAATA AATCCAAATT GATTCAAAAT TATCCGAGTA TAGACATTGT 3780 

GAAAAAA 3787 

(2) INFORMATION FOR SEQ ID NO: 376: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1644 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 376:

TAAACCATTT CAAACTGAGG AAOGCnAAGA CGGsACgTTT CCAGATTTAG AAGTATTTAA 60

AAATGAATGT GATTTAAGCT ATGACATAAC GTCACTTTAT ACTTTTAAGC AACCTGTATC 120 

ACCACACCTT GCATTTAAAA TGACAGATCA AATTTTTCTA AATAAGCAGC GTGTATTAGA 180 

TAAGGTAAAA GTTTTAGATA AGGAATTTGA TTTTATCTTA ATTGAGGGTG CTGGGGGAAT 240 

TGCCGTACCA ATATATGAAG GTACAGATGA TTTCTACATG ACTAAAGATC TAATCAATGA 300 

TTGTGCAGAT TGTGTCATCA GTGTGTTGCC ATCAAAATTA GGTGCTATTA GCGATGCCAT 360 

TGTTCACCAA GATTATGTTA ATCAGAATGT ATCGGCGAGT AATTTTTTAA TAATGAATCG 420 

CTATACAGAC AGCTATATTG AAAAAGACAA TCAAATGACG ATTGGAAAAT TAACAAATAA 480 

AACAGTCTAT ACATTTGAAG AACATGCCAC GTATGAAAAT TTCTCAGAAG CATTTTTAAA 540

ACAATTAATA GGAGTTAAAA ATGAATTACA CACAACAACT TAAACAAAAA GACTCAGAAT 600 

ATGTTTGGCA TCCATTTACA CAAATGGGTG TATATAGCAA AGAAGAAGCA ATCATCATTG 660 

AAAAAGGAAA GGGTAGTTAC CTTTACGATA CGAATGGCaA TAAATATTTA GATGGTTATG 720

CATCGTTGTG GGTCAATGTG CATGGTCATA ATAACAAATA CTTnAATAAG GTAATTAAAA 780 

AGCAACTCAA TAAAATTGCC CATTCTACGC TGCTAGGATC ATCAAATATT CCGTCAATAG 840 

AACTTGCGGA AAAATTAATC GAAATCACGC CAAGTAATCT AAGAAAAGTA TTTTATTCTG 900 

ATACAGGCAG TGCGTCTGTT GAAATCGCAA TAAAGATGGC ATATCAGTAT TGGAAAAATA 960 

TTGATAGAGA AAAATATGCC AAGAAAAACA AGTTTATAAC GCTAAATCAC GGTTATCATG 1020 

GGGATACGAT TGGTGCGGTA AGTGTTGGTG GTATCAAGAC CTTTCATAAA ATATTTAAAG 1080 

ACTTAATATT TGAGAATATT CAAGTAGAAA GCCCATCTTT CTATCGCAGT AATTACGATA 1140 

CTGAAAATGA AATGATGACA GCTATTTTAA CGAATATAGA GCAAATTCTA ATTGAAAGAA 1200 

ATGATGAAAT CGCAGGGTTT ATATTGGAAC CGTTGATTCA AGGTGCGACA GGCTTGTTTG 1260 

TTCATCCTAA AGGCTTTTTG AAAGAAGTCG AGAAATTGTG CAAAAAATAC GATGTCTTAT 1320 

TAATTTGTGA TGAGGTAGCA GTTGGTTTTG GGAGAACTGG AAAGATGTTT GCATGCAATC 1380

ATGAAGATGT TCAACCGGAT ATTATGTGTT TAGGTAAGGC GATTACTGGT GGCTACTTAC 1440 

CACTTGCAGC TACATTGACA TCTAAAAAAA TATACAATGC ATTTTTAAGT GATTCGCATG 1500 

GTGTGAATAC CTTTTTCCAT GGTCaTACAT ACaCCGGAAA TCAAATcGTT TGTaCGGTTG 1560

cATTaGaAAA TATaAGaCTT TATGaAAAAC GTaAGTtnAT TGTgCACATa TTGaAACGaC 1620 

(2) INFORMATION FOR SEQ ID NO: 377: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 431 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 377: 

ATGATTTTtA aAAATCATTA AGTTAAGGTr GATACACATC TTGTCATATG ATCAAATGGT 60 

TTCGCCAAAA ATCAATAATC AGACAACAAA ATGTGCGAAC TCGATATTTT ACACGACTCT 120 

CTTTACCAAT TCTGCCCCGA ATTACACTTA AAACGACTCA ACAGCTTAAC GTTGGCTTGC 180 

CACGCtnTTAC TTGACTGTAA AACTCTCACT CTTACCGAAC TTG6C0GTAA CCTGCCAACC 240 

AAAGCGAGAA CAAAACATAA CATCAAACGA ATCGACCGAT TGTTAGGTAA TCGTCACCTC 300

CACAAAGAGC GACTCGCTGT ATACCGTTGG CATGCTAGCT TTATCTGTTC GGGCAATACG 360 

aTGCCCATTG TACTTGTTGA CTGGTCTGat ATCcGTGAGC AAAAACGGCT TATGGTnTTG 420 

CGAGCTTCAG T 431
(2) INFORMATION FOR SEQ ID NO: 378:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2006 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 378: 
TTTnTATAAC GTATTATAAA TCGTTAAAAA TTTTGGTTGT GTTTGCGTCA CGTAGACAAC 60 
CTCCATAAAG TTACTTAATC ACTCTCATCA TACAATAATT TTTACTCAAA TTGGAAnAAT 120 
TATAAAAATT AAATATAGAT AGGCTTTGAA AATTAGTTTT ATACAAGGTT AGTAGCTGTA 180 
ACTGTAAAAT GTTCTTAATA TTGTCAAAAT GTAATGCTTG AAAGCGCTTT TAAAaAATAT 240 
TATTATATAC ATGGTTAGAC AAATAGACAA ATCACTATAC AAATATTGGG AGGAATATTT 300 
TATGAAATCA ACACCACACA TTAAACCAAT GAATGACGTC GAAATTGCAG AAACGGTTCT 360 
ATTGCCAGGA GATCCGTTAA GAGCTAAGTT CATTGCAGAA ACTTATTTGG ATGATGTGGA 420 
ACAGTTCAAT ACAGTGCGAA ACATGTTTGG TTTTACCGGA ACATATAAAG GTAAAAAAGT 480

TTCTGTCATG GGTTCAGGTA TGkGTATGCC ATCTATTGGC ATTTACTCTT ATGAATTAAT 540

CATTGATTTA TAT6ATGTGA TTaTTkCACA AGGTGCCTCT ACTGATTCAA ATTACGTTCA 660

ACAATATCAA TTACCAGGTC ATTTTGCGCC AATTGCTTCT TATCAATTAT TAGAAAAAGC 720

AGTTGAAACA GCAC6TGACA AAGGTGTACG TCATCATGTA GGTAATGTGT TATCAAGTGA 780

TATTTTCTAT AACGCGGATA CAACAGCGAG T6AAC6TTGG ATGCGTATGG GTATTTTAGG 840

T6TAGAAATG GAATCaGCTG CaTTATACAT GaATGCAaTT TACGCTGGTG TCX5AAGCATT 900

AGGTGTGTTC ACAGTGAGCG ATCATTTAAT TCATGAAACG TCAACAAcAC CTGAGGAAAG 960

GGAACGTGCA TTTaCAGATA TGATTQAAAT TGCACTGTCA TTGGTGTAGA TGATTATGAA 1020

TGTTGAATAT TCTAAAATAA AGAAAGCAGT ACCTATTTTA TTATTCTTAT TTGTATTCAG 1080

TTTGGTTATA GACAACTCAT TTAAATTGAT TTCTGTAGCC ATTGCTGATG ACTTAAACAT 1140

ATCTGTAACG ACAGTAAGTT GGCAAGCGAC ATTAGCCGGT TTAGTAATTG GTATTGGCGC 1200

TGTAGTATAC GCTTCATTAT CTGATGCCAT TAGTATACGC ACACTATTTA TTTATGGCGT 1260

GATATTAATC ATTATCGGAT CAATTATTGG TTACATTTTC CAACATCAAT TCCCATTACT 1320

TTTAGTTGGA CGTATTATTC AAACTGCCGG TTTAGCTGCT GCAGAGACAT TATATGTGAT 1380

ATAT(3TTGCA AAGTATCTTT CTAAAGAGGA CCAGAAGACT TACCTTGGCT TAAGTACGAG 1440

CAGTTATTCC TTGTCATTAG TTATCGGTAC ATTATCAGGT GGATTTATTT CTACGTATTT 1500

ACACTGGACA AATATGTTTT TAATTGCATT AATCGTAGTA TTTACGTTGC CATTCCTATT 1560

TAAATTATTA CCAAAAGAAA ATAATAC6AA TAAAGCTCAT TTAGATTTTG TTGGCTTAAT 1620

TCTAGTGGCA ACTATTGCTA CAACAGTCAT GCTGTTTATT ACGAACTTTA ATTGGTTATA 1680

TATGATTGGT GCCTTAATTG CGATTATCGT TTTTGCGCTA TATATTAAAA ATGCGCAACG 1740

TCCATTAGTA AATAAATCAT TTTTCCAAAA TAAACX5TTAT GCTTCATTTT TATTTATAGT 1800

ATTTGTAATG TATCCTATCC AATTGGGTTA TATTTTTACG TTCCCATTCA TAATGGAGCA 1860

AATTTATCAT CTGCAACTAG ACACAACATC ACTGTTATTA GTACCGGGgT TaTATAGTAG 1920

CAGTCATTGT TGGtGgCACT AAGTGGgTtA AAATCX^gCG rAATATCTGG AATTCCAAAA 1980

CCAAGCGGAT TATCACAGCC AATTAA 2006




(2) INFORMATION FOR SEQ ID NO: 379: 





(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4858 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

TGGGGAAAAA AAGACCCAGC AGTATTAGAA GAATCGTTAA ATATTTCTAT TGAAGAAATG 60

AATCGTATCA TAAAATTAGT CGAAGAATTA CTTGAATTGA CTAAAGGAGA TGTAAATGAC 120 

ATTTCTTCTG AAGCACAGAC CGTGCATATT AATGATGAAA TTCX5CTCGCG AATACACTCA 180 

TTAAAACAAT TGCATCCTGA TTATCAATTT GATACGGATC TGACATCTAA AAATCTAGAA 240 

ATTAAAATGA AACCTCATCA ATTCGAACAA TTATTTTTAA TCTTTATTGA TAATGCAATC 300 

AAATATGATG TGAAGAATAA GAAAATTAAA GTTAAGACAA GGTTAAAAAA TAAGCAAAAA 360 

ATAATTGAAA TTACAGATCA TGGAATTGQT ATTCCAGAGG AAGATCAAGA TTTCATITTT 420 

GATCGCTTTT ATCgAGTGGA TAAATCTCX^T TCAAGAAGTC AAGGCXX3TAA TGGACTCGGA 480 

TTATCTATTG CTCAAAAAAT CATTCAATTA AACGGAGGAT CGATTAAAAT TAAAAGTGAA 540 

ATTAACAAAG GAACAAOGTT TAAAATCATA TTTTAATCAT GACTGAGACG TCAATCAAAG 600 

TCATAGGATC AATTTTTTAA GTACACATTA GCTGTGACTA ATGTATAAGA ACAACTATAA 660

AACAAATAAA CAGTGGTTCT TTATCATTTC TGTTGTACTC CCAAAATTTA CAATAAAATA 720 

CATCTATAAA CCTAGAAGAA TCAACGCTTT TGTTGATTCT TCTTTTTAGC AGATAAATAG 780 

GTAAATCTAC TTTAACAAAT AACTAAATAG TGATATTATT ACATTGTAAG OGTTTCAACA 840 

TTTTTGTGGA GGGTGTAAAA TGACTAACGA AAGAAAAGAA GTTTCAGAGG CTCCTGTAAA 900 

CTTCGGTGCG AATTTAGGTC TAATGTTAGA TCTATATGAT GACTTTTTAC AAGATCCATC 960 

ATCTGTACCA GAAGATTTAC AAGTCTTATT CAGCACAATT AAGAATGATG ACTCAATTGT 1020 

ACCAGCTTTA AAAAGTACAA GTAGTCAAAA TAGCGACGGC ACAATTAAGC GTGTCATGCG 1080 

TTTAATTGAT AATATTCGCC AATACGGGCA TCTTAAAGCC GATATTTATC CTGTAAATCC 1140

TCCAAAAAGG AAACATGTAC CTAAATTAGA GATTGAAGAC TTTGATTTAG ATCAACAGAC 1200 

TTTGGAAGGT ATATCAGCAG GAATTGTTTC AGATCACTTT GCCGACATTT ATGATAATGC 1260 

TTATGAAGCA ATTTTAAGAA TGGAAAAACX3 TTACAAAGGA CCAATTGCAT TTGAGTATAC 1320

ACATATTAAT AACAATACCG AACGTGGTTG GTTAAAAAGA AGAATTGAAA CGCCATATAA 1380 

AGTAACX3TTA AATAATAACG AAAAAAGGGC ACTATTCAAA CAATTAGCGT ATGTTGAAGG 1440 

GTTTGAAAAA TATCTTCATA AAAACTTCGT TGGTGCAAAG CGTTTTTCAA TTGAAGGGGT 1500 

AGACGCACTT GTACCGATGT TACAACGTAC TATTACGATT GCTGCGAAAG AAGGTATTAA 1560 

AAATATACAA ATAGGCATGG CTCACCGTGG ACGTTTAAAC GTTTTAACGC ATGTCTTAGA 1620 

AAAACCGTAC GAAATGATGA TTTCAGAATT TATGCATACA GATCCAATGA AATTCTTACC 1680 

TGAAGATGGT AGCTTGCAGT TAACTGCTGG ATGGACTGGT GATGTGAAAT ATCACCTTGG 1740

AAGTCACTTG GAAATTGTTG CACCTOTTGT TGAGGGGCGT ACC5AGAGCAG CACAAGATGA 1860

TACACAACGA GCTGGGGCTC CGACGACTGA TCATCATAAA GCAATGCCAA TTATTATACA 1920 

TGGCGATGCT GCTTATCCTG GTCAAGGAAT TAACTTCGAA ACAATGAACT TAGGAAACTT 1980 

GAAAGGCTAT TCTACGGGTG GTTCATTGCA TATTATTACT AACAATAGAA TTGGATTTAC 2040 

TACAGAACCA ATTGATGCAC GTTCAACAAC TTATTCTACA GATGTGGCCA AAGGTTATGA 2100 

TGTGCCAATA TTCCATGTCA ATGCAGATGA CGTTGAAGCT ACTATTGAAG CAATTGATAT 2160 

TGCAATGGAA TTTAGAAAAG AGTTTCATAA AGACGTCGTT ATTGATTTAG TAGGTTATCG 2220 

TCGTTTCGGA CATAACGAAA TGGATGAACC ATCAATTACT AATCCaGTTC CTTATCAGAA 2280 

TATTCX3CAAA CATGACTCTG TTGAATATGT GTTTGGTAAA AAGCTTGTTA ATGAAGGTGT 2340 

CATTTCAGAA GATGAAATGC ATTCATTTAT AGAACAAGTC CAAAAGGAAC TAAGACAAGC 2400 

TCATGATAAA ATTAATAAAG CTGATAAAAT GGATAATCCA GATATGGAAA AGCCTGCAGA 2460

TCTTGCATTA CCGTTACAAG CAGACGAACA ATCATTTACT TTTGATCACT TGAAAGAAAT 2520 

AAATGATGCA TTGTTAACAT ATCCGGATGG CTTTAACATT TTGAAAAAGT TAAACAAAGT 2580 

TCTTGAQAAG CGTCATGAGC CGTTTAATAA AGAAGATGGT TTAGTTGATT GGGCACAAGC 2640 

AGAACAACTT GCATTTGCGA CAATTTTACA AGATGGTACA CCQATTCGCT TAACTGGTCA 2700 

AGATAGTGAA CGTGGTACAT TCAGTCATAG GCATGCCGTG TTACATGATG AGCAAACAGG 2760 

TGAAACATAT ACACCTTTAC ATCATGTTCC TGATCAAAAA GCGACATTTG ATATACACAA 2820 

TTCTCCGCTT TCAGAAGCAG CAGTAGTTGG TTTTGAATAC GGCTATAATG TGGAAAACAA 2880 

AAAAAGCTTC AATATTTGGG AAGCACAATA TGGTGATTTT GCAAATATGT CACAAATGAT 2940 

TTTTGACAAC TTCTTATTCA GTTCTCGCTC AAAATGGGGA GAACGTTCAG GATTAACATT 3000 

ATTCTTACCT CATGCATATG AGQGTCAAGG GCCTGAACAT TCATCAGCAA GATTAGAGCG 3060 

ATTTTTACAA TTAGCTGCTG AAAATAATTG CACAGTTGTC AACTTATCTA GTTCAAGTAA 3120 

TTATTTCCAC TTATTGCGTG CACAAGCGGC TAGTTTAGAT TCTCAACAAA TGCGACCATT 3180 

GGTTGTTATG TCACCAAAAA GCTTACTGAG AAATAAAACA GTTGCAAAAC CAATTGATGA 3240 

ATTTACTTCT GGTGGATTTG AGCCAATTTT GACAGAATCA TATCAAGCGG ATAAGGTTAC 3300 

AAAAGTTATT TTGGCAACTG GTAAAATGTT CATTGATTTA AAAGAAGCAT TAGCTAAAAA 3360 

TCCAGACGAA TCAGTATTAC TCGTTGCGAT TGAAAGATTO TATCCATTCC CAQAGGAAGA 3420 

GATTGAAGCA TTACTAGCAC AATTGCCAAA CCTTGAAGAA GTGTCATGGG TACAAGAAGA 3480 

ACCTAAAAAT CAAGGTGCAT GGTTATATGT CTATCCATAT GTTAAAGTGC TAGTTGCAGA 3540

AGAAATTCAT AAACTTGTTC AAAATAAAAT TATAGAAAAT GCATTAAAAA ATAACTAGGG 3660

GGAAATAAGT CATGCCAGAG GTTAAAGTTC CAOAATTAGC AGAATCTATT ACAQAAGGTA 3720 

CCATTGCAGA ATGGTTGAAA AACGTAGGGG ATAGCGTAGA AAAAGGTGAA GCTATTCTTG 3780

AATTAGAAAC TGATAAAGTT AATGTCGAAG TTGTATCTGA AGAAGCAGGT GTATTATCTG 3840 

AACAACTTGC AAGTGAAGGC GACACTGTAG AAGTTGGACA rGCAATTGCT ATCATCGGCG 3900 

AAGGTAGTGG CAATGCTTCT AAAGAAAATA GTAACGACAA TACTCCACAA CAAAATGAAG 3960 

AAACAAATAA TAAAAAAGAA GAAACAACAA ATAATTCGGT AGATAAAGCT GAAGTAAATC 4020 

AAGCAAATGA TGACAATCAG CAACGTATTA ATGctACGCC TTCTGCGCGT CGATATGCTC 4080 

GTGAAAATGG TGTGAATCTT GCTGAAGTAA GTCCGAAAAC AAATGATGT6 GTTCGTAAAG 4140 

AAGATATTGA TAAGAAACAA CAGGCACCGG CATCAACACA AACAACACAA CAAGCATCTG 4200 

CAAAAGAAGA GAAAAAATAC AATCAATATC CTACAAAACC AGTGATTCGT GAAAAAATGT 4260

CACGTAGAAA GAAAACAGCT GCCAAAAAAT TATTAGAGGT ATCTAATAAT ACAGCTATGT 4320 

TAACAACATT TAACGAAtTG AcATGACAAA TGTTATGGAA TTGCGTAAAC GTAAGAAAGA 4380 

ACAATTTATG AAAGATCATG ATGGTACTAA ATTAGGATTT ATGTCATTCT TTACTAAAGC 4440 

TTCTGTAGCA GCTTTGAAAA AGTATCCAGA AGTTAATGCA GAAATCGACG GCGACGACAT 4500 

GATTACGAAA CAATATTATG ATATTGGTGT AGCTGTTTCT ACAGATGATG GATTATTAGT 4560 

ACCATTTGTA AGAGATTGTG ATAAAAAGAA TTTTGCAGAA ATCGAAGCAG AAATTGcTAA 4620 

TTTAGCAGTT AAAGCaCGAG AGAAAAAACT TGGCTTAGAT GATATGGTTA ATGGTTCATT 4680 

TACGATTACA AATGGCGGTA TTTTTGGATC AATGATGAGT ACGCCAATTA TCAATGGTAA 4740

TCAAGCTGCA ATCTTAGGCA TGCATTCAAT TATTACAAGA CCAATTGCXSA TTGATCAAGA 4800 

TACAATCGAA AATCGTCCAA TGATGTATAT TGCATTAAGC TATGATCATA GAATTATT 4858 

(2) INFORMATION FOR SEQ ID NO: 380:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2222 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 380:

ATCAGTCACA CGGTAGGCAT ATAAAATGAG TCGTTTCTAC AACATTTTTA AACAGTTCAT 60 

TCAATATTAT TTTTATCTAA TAATnATATT GGGAGGATTA TACCTTTATA CACACCATGC 120

TTAATAATCa AGTATCTGAA CTATCAGTGT TAATTGCTTC TAAAGTTCTT AGAAAAGAAA 2040

TTTCTGAACA AGACCAAAAA GCATTGGTTG ACAAGTATCT AAAAGAGGCA GGCGATAAAT 2100 

AATGGTAAAA GTAGCTAACA AGTATgcTAA AGCATTATTT GACX3TGTCAT TAGATACAAA 2160

TAATTTAGAG ACTATTAATG AAGAATTAAC AGTTATAAAT GAAGCAGTAA AAGATAAAAT 2220 

TG 2222 



(2) INFORMATION FOR SEQ ID NO: 381: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1143 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 381:





TAACAATTCC TaTATTCATG TTTAATACGA AACACTACAT TTACATTGTA ATTCACTATC 60

TTTTGAAGTA ATAAAGTGAT TTGTTCAATC GATAGCTCAT TGCTTGTGTC GATTGTAACA 120

ATTATATGCA AGTTTTCAGG ATTAACACCT AATCTTTGAA TGATTTGTTT AATAGTATAA 180

TAATCCATCC AATAAAAATT ACTTCCTTTA ATATAAATGT TTTTAGGTTG ATACATTTGA 240

CTCCTTTTTG TAGGCTCAAA AGGTATATCA ATCTCGCGCA TACTTGAAGA ACTTTGATTA 300

GTATCATCAA ATAATTCAAT TATATTTTTA TCAATTTCAA CTAATTGGGA ATGGTTAATT 360

GAATGAGATG TTGGACTATA TCTTTTTCTA ATTAATTTAG GAGTGTTACC ATACGTTTCT 420

TTAAAGAGGT GTATAAAACG AGAATAATGA TTAAAACCAT GACTACTTGC GATTTCTTCA 480

ATAGGCTTCT TAGrAGTTAA AATATCAATT AAGCAATGCT CCAGTCTAAT ATGATTTAAA 540

TATTGAATAA AATTACTATA AGGTGTCGCT TTAAACATGT CACTTAAAGC TTTGTTTGTA 600

ATACTAACTT GATTAATGAC ATCTTTCCTA TTTATCTTTT TATGGTGGTT GTTTGTTAAA 660

TAATCGTGCA CTTCTTCGGC TACTAAATGA CGACTACCAT CGTATGTATT TAATGACATT 720

AATTCAACAC ACATGTAATT AATAATCTTA TCATTAGCAT TATAAGACTG TTGTTTAATT 780

TTGCTGTAGA TTAAATACTT AATCAAGATT CTAAAAGTGC TAGCAACCTC AnTGTTAAT 840

ATCTTGCCAC GCATCATATA GTTTGCATTC ATAAACTTCA AAAATGTCTT TGCACTTACT 900

TCAACAACAC AACATACACT ATCATCATTA CCATCAATTT GATACAAATC ATTCAACATA 960

ATAATGGTGA CATCATTTCT TTTTACATCA TATTGTTGTA GATTGATTGA AACTACACCT 1020

GTACCTTGTA GCCAATATAT AATTTTtAAG TTCGCTTCCT TCACACTACT CATTTTCATA 1080

ATT 1143
(2) INFORMATION FOR SEQ ID NO: 382: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 506 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 382:
GGACCACCTT TTTTACCTGT ACGACTAAAG CCAACTGTTT TAAAGGCACC AACAGCATGC 60 

CACACTTGAA TAATTTCTTG TGATGGTCTA AAGCGAACCG TATAAATTAA TGGATGGAAA 120 
TCATCAACAA AAATATAATC GGCCTTACCA AGTAAATATG GCAATCTAAA CTTGTCTCTC 180 

CATTTGCGTC TATCCGTAAT ATTCTCCTTA AATACCGTTT TAATATCATA ATCAAAATCT 240
ACTTTTTGGC GTAGTAACTC ATCATATACA TACTTGAAAT TCCCTGATAA ATTCGGACGC 300 
GAATCTGATG TGAATAATAT TGTTTTGCTT CTTTTAATAT GTAGTAACTT TGTAATATTA 360 

AAAATAGCTT TAAATAAGAA ACTTCTACTT TCAAATGAAG CTTTATGaCC TTGTTTaTGA 420
AGCCAGTGTG cACTTgTCGC AATGaCCCCT GaTTTCyCyT GagGtAAGGk GaTTTCmATA 480 
TCAAATACAA ATTCGTTAAC GTCACT 506 
(2) INFORMATION FOR SEQ ID NO: 383:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 421 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 383:



AGCATCTGCA AtGAGTcTCT AATACAACGA TACGTTTTGC ATCTTTAGGT ACTTTTACTG 60

TACCATTTTC ATCTTTTACC GAAATAGTAT CTTTAGTTGA TGATTCTTTT TTACTTGAAT 120

TATCCGTATT ACCACAAGCT GCAACTAAAA GTAAGGCAAC TATTAATCCC AATATACTAA 180

AAGTTTTTAG ACCTCTCATC GTTCCACTCC TTAATATGTA TAACTTCATT TATTATTTTA 240

TTGATAACAA TTATCATTGT CAAGTAGCGT TCAATCTTTT TTATATTTCT AAAATGTATG 300

ACTATATATT TCCTCTAATA ATTATGACTA CAATTAGCAC ATTTCCTTAG ACAAAATACT 360

GATAATGTAT CATTGCTATA TCATCTTTGC ATTAATACAA TTGACACCAT TTAGCATGAC 420

(2) INFORMATION FOR SEQ ID NO: 384: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 862 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 384: 
TGCACTTGAT ACTTTAGCAC ATGAAATTGC TAATCGATTA GTTGCAAATG ACAAGAATGA 60 
AGCAACTTTG GAAATGACTA ATAAAATGGC AACGATTCGT TTTACAGAAC CTACGCTGAT 120 
TGCATTAGCA GGGGGTAATG TCAAAGCTTA CACTGAGCAT ATGACTATAT CTCCATATAA 180 
ATTGTATTTG TTAGATAAAG GCGATGTTTT AAAGTTTAGA GAAACAAGIT ATACATCGCG 240 
AGTGTATTTA GCTGTgGGAG GCGGATTTGA ATTAGATGCA TGGTTAGGAT CTAACTCAAC 300 
CGACTTTAAT GTAAAAATTG GTGGTTTTAA AGGTAGAACA TTACAAGATG GCGATGAAAT 360 
AAAGCTTAA6 AGAGATTATA CAGCTCGTCA TCATAAGTTA TTTGAAAACC TTGCTCACAC 420 
GAAACAAACA GATTGGGGTA TTGATGGATA CGCCTTGTCA TTTAATTATA TGTCTGATGT 480
ATTTCATGTC 6TTAAAAATA AAGGTACGGA AGATTTTAAA GAAGAT6CCA TTCAAAGATT 540 
TGTGAAACAT GATTATAAAG TAACGAGCAA AGCAAATG6C ATGGGGAT6A TGCTTGAAGG 600 
TGAAAAAATC AAAGCTTTrT ATGAAGATAT GCCACC6TAT CAGACTGTCA AAAAAGGAAC 660 
GATACAAATT AAGCGTGATG GCACACCTAT TATCCTATTA AATGATCATT ATACGCTAGG 720 
TAGCTACCCG CAAATCGGTA CAATCGCAAG TTATCATTTA ACGAAATTAG CACAAAAACC 780 
GCAAGGATCA CGTTTGAAAT TTCAATTTAT AGATATTTTA ACGGCTGAAA AGAACCTTGT 840 
TAAgTATAGT AACTGGTTAA AC 862 
(2) INFORMATION FOR SEQ ID NO: 385: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1027 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 385:
AAATGAGTTG ATGATGGCTC CTATTCCATC TATCTCTAGG TATGACAGAA SATAAACGGT 60 
CTTCTACCTG TCTsACATTA TCTTTCCAAC GATTAATACC TAAGCGTTTA GAAACACGTT 120

CAGTTTTACG TCCTACACCT GCTAAACTTT CTAATTCCTT ATGTGTTTGT GGTATTTCTC 240

CATTAAATTG ATCAATCAAA GATTGACAAA GTTTCTTAAT ATTCTTAGCT TTGTTACGAT 300

ACAGACCGAT AGAACGAATA TCATTCATAA GTTCTTCATC ACTGACTGCC AAATAATCTT 360

CAGGCGTTTT GTATTTTTTA AACAGCTCAG TTGTTACTCT ATTTACTAGA ACGTCTGTAC 420

ATTGCGCTGA CAATAATACA GCAATAGTTA ATTCGAACGG ATTATCATGT TTTAATTCAC 480

ATTCTGCATC CGGAAACATA TTTGCTATAA CATCAATCAT TTCTAATGCT TTTTTCTTAC 540

TTACCATCAA GGTTCTCCCC GTTTAACCAA TCAAATTTAG GTACCGTTTT AACTGTGTGC 600

GTCATTTTCG GTTTATTGAA TTTTTCTCTT ATTTTTCTAG AATCGTCAAT TGTTTTGACA 660

TTGTTTTTCT TCCAATTAAG TAAAATACGA TCTATATATT TAAAGCTAAG TTTATTCAAA 720

CTATTCGCCT CGTCTAATGC CGCTTGTATA ATTGCAGTAT CGTGTTTATC AACATCAATC 780

CATTGATTTA ACGTTTCTAT TTCATATGGA GATAACGGCC TTGCAAATGT ATCCTCTAAA 840

ACTCTAAATA ATTGTTTAAA TTTTTCTTTA CTATTTAGCT CTTTCGTTTC CATACTTTGT 900

TGCTTCAATA TATGACTTAA TTTTTCQAAA AAAGGATCTA GATTCATATA TTCGGkAAAT 960

CTACCTTCyT CATCyTTTTG aACTkGtAAT tCTAGCAATT CACGTgTATC AAATTTTGGA 1020

TACCATT 1027




(2) INFORMATION FOR SEQ ID NO: 386: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1006 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 386: 




AAGGnTTGGA GGGAATTAAT nGATCSGCAAA TCCCAGaTAA AGTACACaAC GCATGGAATG 60 

TGATGCATTT AATATTTCCA GTAGTTAGTA CGACGTTTGC AAGCTTTAAA TCTATGTATG 120 

GGGGCATACC AAAAGATTTC ATAGACTACT TATTTATTGA TGAAGCAGGA CAAGCAATAC 180 

CTCAAGCAGC TGTGGGAGCA TTATATCGTT CAAAAAAAGT TGTAGCTGTA GGTGATCCGA 240 

TTCAAATAGA ACCGGTTGTG ACTTTAGAAA GTCATTTAAT TGATAACATT CGTAAAAATT 300 

ATCATGTTCC GGAATATCTA GTTTCTAAAG AAGCTTCTGT GCAGTCTGTT GCAGACAACG 360 

CCAATCAATA TGGTTTTTGG AAATCTGATG CTACTGATAG TAATCAAAAA ACCTGGATAG 420 

GCATACCTTT ATGGGTGCAC AGACGATGTT TAAAACCTAT GTTCACGATa GCTAaCCAAa 480 

GGTATGACGT TAAAGGAAaC GCAGTTCAAA AACAATTTGT GAAAGAGCAT GGTGA/^AAaG 600 

TAGTGGGATT ATTAGCTGAT GATTGGATTG AAGCAATTAA GGAAGGTAAA AATGAACCGA 660 

GCTCATTTGT AATATCGCCT TTTTCAGCAG TACAGCAACA GATTAAACGT ATGTTAAAGC 720 

AACAACTACC GACTAGAATT GATATTGAAC GTACAAAAAT TAATCAATGG GTCGATAAAT 780 

CCATTGGTAC TGTTCATACT TTTCAAGGTA AAGAGGCTCA GAAGGTGTAT TTTGTAATAG 840 

GTACTGATAA TACCCAAGAT GGTGCTGTGA ACTGGTCATG CGAAAAACCA AACTTGTTAA 900 

ACX3TTGCAGT GACAAGAGCT AAGAAAGAGT TTaTGTAATT GGCGACATGC AAAGAATACA 960 

GATX^AaACCA TTTTATGAGA CGATTTTTAzi AGnAAGAAAT GTAAAA 1006 
(2) INFORMATION FOR SEQ ID NO: 387: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 662 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 387: 

CGTTTCATCA AGTnATCGAT CCTAATTTAC AAAATATTCC GGTTAGACTT GAAGAAGGGC 60 

GTAAAATAAG AAAAGCCTTT AAACCAACTT CAAAAGATAG CGTTATATTA TCAGCAGATT 120 

ATTCTCAAAT TGAATTGCGT GTATTAGCAC ACATTACACA AGATGAGAGT ATGAAAGAAG 180 

CATTTATCAA CGGCGATGAT ATTCATACAG CAACTGCTAT GAAAGTATTT GGTGTAGAAG 240 

CTGATCAAGT CGATAGTTTA ATGCGTCGTC AAGCAAAAGC AGTTAACTTT GGAATTGTTT 300 

ATGGGATAAG TGATTATGGT TTAAGTCAAA GTTTAGGTAT TACTCGTAAA AAAGCAAAAG 360 

CATTCATTGA TGATTATTTA GCTAGTTTCC CAGGTGTAAA ACAATATATG TCTGATATTG 420 

TAAAAGATGC CAAAGCTTTA GGTTACGTGG AAACATTGCT ACATCGTCGA CGCTATATTC 480 

CTGATATTAC GAGTCOTAAC TTTAATTTAC GCGGCTTTGC TGAACGTACT GCTATGAATA 540 

CGCCAATACA GGGCAGTGCT GCAGATATCA TTAAACTGGC AATGGTTAAA TTTGCTCAAA 600 

AAATGAAAGA GACAACATAT CAAGCTAAAC TATTATTACA AGTACACGAT GAATTAATTT 660 

TT 662 
(2) INFORMATION FOR SEQ ID NO: 388: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 669 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 388: 



TCATCCAAAT TTTGGAAATT CCACATTTTA CATATCGTAA TTTTTTAGGA AACTAGTGAA 60 
TATAACAAAT CCCTCCTCTC ATTTTTAAAA TAGATATATC ACTTCCCCAC TTTTACTTAA 120 
CTAAACTGCA ACGGTTCCTA ATACCAAAAT CCTGCCCTCT ATTTTTATCA ATTCAAGCAT 180 
ACTTATTGAA AAATGTTAAC GTTTTCTTGA TAATCATTGT AAGCGCATTT ATTTTATAAA 240 
CTAACGTTTG AAATATACTA CAGGAGTGAC ACGTAATGAC TCAAATTACT GAAAAAGAAT 300 
TAAAAAAGAA GTATTTAGAT TTACTATCCC AAAATTTTGA TACTCCAGAA AAACTTGCAA 360 
CTGAAATTAT CAATTTAGAA TCAATTTTAG AATTACCTAA AGGTACGGAA CATTTCGTCA 420 
GCGATTTACA TGGTGAATAT GAAGCTTTCC AACACGTATT ACGCAACGGT TCTGGGAACG 480 
TGCGAGCGAA AATCAATGAT ATTTTCAAAG AGAGACTTTC AACTAAGGAG CTTAATGACT 540 
TAACTGCTCT TGTCTACTAT CCAGAAGrCra AATTtAAAAT TgATTAAAAG TGATTTCCAA 600 
AATTgCGGtC mACtTAATGt CyGGtATATC ACaACmATCG aACATTTAAT TGAGTTAATT 660 
AAATATTGT 669 



(2) INFORMATION FOR SEQ ID NO: 389: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1249 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 389: 

CACATGGCTG TTAGAGATAT GAATGGCCAT GCGTTACCTT TAACAAAAGA TGGCAATTTT 60 

TATCAAACGA ATGTAGATGC AAATGGTGTT AATCATGGTG GTAGTGAAAT GGTGCAAAAT 120 

AAAACAGGTC ATATGAGTCA ACAArGCCAT ATGAATCAGA ACACACATGA ACCAACAGCC 180 

ACACATGCAA CAAGGTCATA TGCAATCATC AAACCATCAA ATGATGAGTC CAAAAGCAAA 240 

TATGCATTCA TCAAATCATC AAATGAACCA AAGTAACAAA AAAGTTTTAC CAGCTGCTGG 300 

TGAAAGTATG ACATCAAGTA TTCTTACTGC AAGTATTGCC GCACTACTAT TAGTATCTGG 360 

GTTATTCTTA GCATTTAGAC GACGTTCAAC AAATAAATAA ACATAATACG ATTAATAATA 420 

GAAAAATCGT GTGATTATCT GaGGGAGCCT AGGACATAAA TCAATGTCCT AGGCtCnCTA 480 

AtGTTATATT GGCAGTAGTT GACTGAATGA AATTGCGCTT GTAACAAGCT TTTCCATTTC 540 

CXyVTTGTCTT ATACGTGTCA GTGTTAATTC AGATATTTCC TGTGGAATAT ACCACTTATT 660 

AATCATAATT GGATAAGGTG tTTGTCCGTA CAGTGTTTCA ATAATCAGCC AACAATGTGT 720 

ATCACCaTCA AACACGTGAC TATGATTTTk GAAGTGGGGC GCTTTGGTAA TAGACATTTT 780 

TAAATCTGAT TGATATGCAT TGCTATAAAT CGTTTGCTCA ACGAATGTCT TCATGTCGTC 840 

TTCGTTTTGT GTATTCACTT TAAATGTGTC AATGACATTT AACGGTATAA AGGTAAAGCA 900 

AAATGCATCA GCTTGCTTAG AATGATTGTC CTTTTTTTGA TAATAGCGTT CCATTGCAAT 960 

GACGGCAGAA GGATGGTTTG CAAACAAATG ATTTGTATAT TCACTTTCTA AATCAACACG 1020 

ATAATTAATT GATGACATAG ATACGCGAGC TAGCAATATT TGATCAAGTG GATGCTTAAA 1080 

TTGATCCATA CTTGAAGCGT GTTGGGCATT TGTTTGTGGA ATAACAAAGT GTCCCTTCCC 1140 

TCTTGTACTC TCTACGATGC CATCTTOGGC TAACAATTTt ATAGCTTGGC GCAAAGTCAT 1200 

ACGACTQGAC ATCAAAGCGC GCACAAAGTT CCTTTTCAGT AGGTAATGC 1249 

(2) INFORMATION FOR SEQ ID NO: 390: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1788 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 390: 



AATGATGATT CATTCAAAGA AATTAACGCT CGGTATATGC TTGGTATTAC TCATTATATT 60 
GATTGTAGGT TATGTCATTA TGACAAAAAC AAATGGTCGA AACGCCCAAA TTAAAGACAC 120 
ATTTAATCAA ACATTAAAAT TATATCCAAC CAAAAATCTC GATGATTTTT ACGATAAAGA 180 
AGGCTTTCGA GATCAAGAAT TTAAAAAGGG TGATAAAGGT ACTTGGATAG TTAATTCTGA 240 
AATGGTAATC GAGCCAAAAG GTAAGGATAT GGAAACGAGA GGAATGGTGC TCTATATCAA 300 
TCGCAATACT AGAACCACAA AAGGGTATTA TTTTATAAGT GAAATGACAG ATGACAGTAA 360 
CGGCAGACCA AAGGATGATG AAAAAAGGTA TCCGGtAAAA ATGGAACATA ATAAAATCAT 420 
ACCAACGAAG CCACTACCGA ATGACAAGTT AAAAAAAGAG ATTGAAAACT TTAAGTTCTT 480 
TGTACAATAT GGCAACTTTA AAGATATTAA TGATTATAAA GATGGTGATA TTTCATATAA 540 
TCCTAATGTA CCAAGTTATT CGGCAAAATA TCAATTGAAT AATGATGATT ATAATGTCCA 600 
ACAGTTaAGa AAAAGATATG ATATTCCAAC CAAACAAGGG CCGAAACTAT TATTGAAAGG 660 
CGATGGAGAT TTAAAAGGTT CATC06TAGT TcTAGAAGTC TTGAATTTAC CTTTGTCGAA 720 

AGGTATGAGT CAAACTGAAT ATCAAATAAA ACCTGGTAAT ATAACAAGTA ACTCTGAAGA 840 
AACAAGTTCG ATATCTAAAG TGAGCTGTGA AATATAGGTA GCCATTTCAA AAAATTTAAA 900 
GGTGAATTTG ATAATGTAGC TCAAGGAGAT TGGGTTAAAA AGGCGAAGAA TGAACTGGAT 960 
GATATTAGTA AGAAATTAAA AAATATTCAA AGAACGGAAG TTTAATAGCT TATATGAITC 1020 
TTGGAGCTAA GACAGCATGC GTTCATTCAT GCCATTATTA ATATAAGCAC CGCAACAAAA 1080 
AAGCTTCTAA TGTGATACAG GAACCTCATA TTCCGTATCA TGTTAGAAGC TTTTAATGTC 1140 
TAAAGAACAT CTACATTTTA TCATATTTTC TGACTTATTA AACTTTTATA TAATTAAATA 1200 
rrrCTTAATT ttccaaaata gtgataaatt tgtgaaatac ATCACAAATC CCTTTATTTA 1260 
TTTGGAAATT CATGTAATAT TAGACTTGTA AGAAGTTAAT AAATAGAGAG AGACGAGAGA 1320 
GTTTATATAA ATACTATATA AACATTGGAG TGATGATTAT GAGAAAAGAG ATTGAAGCGC 1380 
TTATTTTCTC AGACGTAAtA GCTATGATAT TTACGTGAAC ACTGGTGTAA ATCAAGGATT 1440 
AATTGGTGAC ATCAAAGATG GTTACCTAAC TATTGATTCT ATGCCTTACA TTGATGCTGA 1500 
GCGTTTQTAT CACTTTGCTA TGGAACGTAA ATCXSTTAGTC ACTAACTAGT TCTTATTGCC 1560 
AATGATTACT ACCCCTAGTC GGCGGCAATT GAAGTGTGAT TGATGTAACT TGCCCTCGTT 1620 
GGTGAGCAAT TGAGGGCAGA CCCCTTTAAT TAAGTAAACC CTAACTCCCC ACAAATCTGG 1680 
AACGATACTA AAAGCCACGT CCTATATTGG ATGTGGCTTT AGTCAkACTT ATATTATTTT 1740 
tAAAACGATT ACCTACAAGA TTTACATATA AAATTCTATC ATGnCTGC 1788 

(2) INFORMATION FOR SEQ ID NO: 391: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2407 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 391: 

GCAAGTTTAG TTAAACAGAT ATTAAAACCA GATCAATATA CAGATGCAAA TTCTCTATTA 60 

GAGATTAGTT TGCAAACAGG TATGTTTATT GCAGGTGGAT TATCAGGAAT ATTATATAAA 120 

ATAAATGGAT TCACTCTAAT TATAGCGATG ACTATAATGA TGTTTCTAAT CAGCATTTTT 180 

ATGTTATTTA GATTGCATGT AGATAAACCA ACTCATTCAG AGGAAGAATC AACAAATAGC 240 

TTATTGCAAG AATATTTATT AGGATGGAAA TTTTTAAAAG ACAACATGAT GATATTTATT 300 

TTTGGAGTTA TTTCAATCAT ACCAATGGTG TTTACAATGA TCTTTAACAT ATCATTACCA 360 

TATGGCATTG GAGGATTATG TGCAGGTTTA ATTTCAGCTA TTCTTTCX3AA GAAAATCTCA 480 

ACTAAAGTAT TGATATTTTT GTTATATTTC ATATTAGTCA TAAATTCAGC ACTATTTATT 540 

TGGATAAACT CAGCATTTTA CTTATTCATA GGATCATTTA TACTAGGCTA CTCAATTTCA 600 

TCAATCAGAA TTTATATGAA TACAGCTATA ATGAACACTG TTTCAGATAA ATATGTCGGT 660 

CGCTCATTTA CGATATGGAC GTCAATTTCA TTGTTACTGC AAAGTTTAAT TGCTCCATnT 720 

TTAGGAAGAT GGATTAATGA AATTAATGAT AAATTCGGTT TCTATATTAT ACTCATTTTA 780 

TCCTTACTCA TATTTGTaCA CTGCTGCTTG TTAACAAAAC AGACAAAATA AAATATGCAC 840 

ATAAAGAAGA GTGACCGTCA CTCTTCTTTA ACAAGCGACC ATTTATCXSaT GGGCTTAGTT 900 

CTCTCTGCAC CCACACTTCA CTACTTCACT TTTTCAAATC ATTTTTTATG GTCTTAAATA 960 

AATCAGTGAG ATTTGTTGCT TCGGTAAAGT CTAGAATTAA TATCATTTCT TTAGAACCTG 1020 

GATATGGCGA AACTAATGTA TTATCTTGCA ATTTCTGCTG GGCACTTTTA GTCGCCTTGA 1080 

CCAATAATCT ATTATCATAC AAACCACCTA TAACCACGCC ATCATAATAA ATAATATATT 1140 

CTCCCATCAT CTTTCTTGTC TTAACCGCGT TTGAATTCAC ATGATTTAAA AATAAATCAT 1200 

GTACATCTTT CTTAGTCGCC ATTGTAATCG CTCCTTCAGT TTTATGTTTA ATCACATTCG 1260 

TATTAATGAT TCATTTCGTG TTGCTCTTAA TTTTATCTAT AATTATATGC GTAGTTAAAA 1320 

TCAAACCTAT GGAAAAGAAA ATAATGATAA CAATGTTAAG AAATATAGTT ATAAAATTAT 1380 

AGTTTGGAAG GTATGCGAAT AGCAGAAAAA TAGGTATCGC AAAAAATAAA TCCCACCAAC 1440 

CTAAACTTTT TAAAGAATGC TTTAAACCTT CCATAATATC ACCTTTATAA ATTTGTCTTT 1500 

GTTATAAGAT AACTAAAAAA TCGCTTTACT GTAAAAGTAG CCAAAGAAAA TTCTGAATCA 1560 

TATTCATAAG TAGTGTATCA TTAATAATGA ACAATTTAAT ACTATAATCC TTGATCTTTG 1620 

TATTGATCAA CTTACCACAA CATTTATTTT AGACTACTCT TAQACTTCCC TTTCAAATGG 1680 

TTGCATCTAT TGAAATTCCT TTTGTATAAG TTAGGCTTTT GTGGTAATAT CATCATGCAT 1740 

AAAAAATCGA GATACTAATT ATAAAGAGGG TATAAATATA TTATGAAAGA AAATTTTTGG 1800 

AGTGAATTAC CACX5TCCATT TTTTATTTTG GCGCCAATGG AAGACGTTAC AGATATCGTC 1860 

TTTCGACACG TTGTAAGTGA AGCAGCTAGA CCGGATGTGT TTTTCACTGA ATTTACAAAT 1920 

ACTGAAAGCT TTTGCCACCC TGAAGGCATA CATAGTGTGC GCGGACGCTT AACTTTTAGT 1980 

GAAGATGAAC AGCCGATGGT CGCTCATATA TGGGGAGATA AGCCAGAACA GTTCCGTGAA 2040 

ACGAGTATTC AATTAGCTAA AATGGGCTTT AAAGGCATAG ACTTAAATAT GGGATGTCCT 2100 

GTAGCAAATG TTGCTAAAAA GGGTAAGGGT TCCGGCTTAA TCTTAAGACC TGACGTTGCT 2160 

GGCTACTATG AAATCGATGA ATGGAAAGAT TGGTTGAAGC ACGTCTTCGA ACAAGACATT 2280 

GCCAATTTAT CTATTCATCT TCGTACACGT AAAGAAATGA GTAAAGTAGA TGCACATTGG 2340 

GAATTAATCG AAGCTATTAA AAATTTACGT GACGAAATTG CACCAAATAC ATTGTTAACA 2400 

ATTAACG 2407 



(2) INFORMATION FOR SEQ ID NO: 392: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2424 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 392: 



ATGATGnATG GCnCCGCCAA GAAGTTGAAC CAGTCTATTG GCTAGAATTG CTCAAAAAGC 60 
GAGAcaTGTG GTATTCaTAT GTTaGTAgCt ACGCAAAGAC CATCTGTCAA TGTAATTACA 120 
GGTTTAATTA AAGCCAACAT ACCAACAAGA ATTGCATTTA TGGTATCATC aAGTGTAGAT 180 
TCGAGAACGA TATTAGACAG TGGTGGAGCA GAACGCTTGT TAGGATATGG CGATATGTTA 240 
TATCTTGGTA GCGGTATGAA TAAACCGATT AGAGTTCAAG GTACATTTGT TTCTGATGAC 300 
GAAATTGATG ATGTTGTTGA TTTTATCAAA CAACAAAGAG AACCGGACTA TCTATTTGAA 360 
GAAAAAGAAT TGTTGAAAAA AACACAAACA CAATCACAAG ATGaATTATT TGATGaTGTT 420 
TGTGCATTTA TGGTTAATGA AGGACATATT TCAACATCAT TAATCCAAAG ACATTTCCAA 480 
ATTGGCTATA ATAGAGCAGC AAGAATTATC GATCAATTAG AGCAACTCGG TTATGTTTCG 540 
AGTGCTAATG GTTCAAAACC AAGGGATGTT TATGTTACGG AAGCAGATTT AAATAAAGAA 600 
TAATTATGAG TAAC3GAGTTT TATATAATGA CACACTATCA TTTTGTCGGA ATTAAAGGTT 660 
CTGGCATGAG TTCATTAGCA CAAATCATGC ATGATTTAGG ACATGAAGTT CAAGGATCGG 720 
ATATTGAGAA CTACGTATTT ACAGAAGTTG CTCTTAGAAA TAAGGGGATA AAAATATTAC 780 
CATTTGATGC TAATAACATA AAAGAAGATA TGGTAGTtAT ACAAGGTAAT GCATTCGCGA 840 
GTAGCCaTGA AGAAAtAGTA CGTGCACATC AATTGaAATT AGATGTTGTA AGTTATAATG 900 
Arri'T'lTAGG ACAGATTATT GATCAATATA CTTCAGTAGC TGTAACTGGT GCACATGGTA 960 
AAACTTCTAC AACAGGTTTA TTATCACATG TTATGAATGG TGATAAAAAG ACTTCATTTT 1020 
TAATTGGTGA TGGCACAGGT ATGGGATTGC CTGAAAGTGA TTATTTCGCT TTTGAGGCAT 1080 
GTGAATATAG ACGTCACTTT TTAAGTTATA AACCTGATTA CGCAATTATG ACAAATATTG 1140 
TGGCACATAA TGTTAAAAAA GGTATTATTG CTTGGGGTGA TGATGAACAT CTACGTAAAA 1260 
TTGAAGCAGA TGTTCCAATT TATTA1TATG GATTTAAAGA TTCGGATGAC ATTTATGCTC 1320 
AAAATATTCA AATTACGGAT AAAGGTACTG CTTTTGATGT GTATGTGGAT GGTGAGTTTT 1380 
ATGATCACTT CCTGTCTCCA CAATATGGTG ACCATACAGT TTTAAATGCA TTAGCTGTAA 1440 
TTGCGATTAG TTATTTAGAG AAGCTAGATG TTACAAATAT TAAAGAAGCA TTAGAAACGT 1500 
TTGGTGGTGT TAAACGTCGT TTCAATGAAA CTACAATTGC AAATCAAGTT ATTGTAGATG 1560 
ATTATGCACA CCATCCAAGA GAAATTAGTG CTACAATTGA AACAGCACGA AAGAAATATC 1620 
CACATAAAGA AGTTGTTGCA GTATTTCAAC CACACACTTT CTCTAGAACA CAGGCATTTT 1680 
TAAATGAATT TGCAGAAAGT TTAAGTAAAG CAGATCGTGT ATTCTTATGT GAAATTTTTG 1740 
GATCAATTAG AGAAAATACT GGOGCATTAA CXSATACAAGA TTTAATTGAT AAAATTGAAG 1800 
GTGCATCGTT AATTAATGAA GATTCTATTA ATGTATTAGA ACAATTTGAT AATOCTGTTA 1860 
TTTTATTTAT GGGTGCAGGT GATATTCAAA AATTACAAAA TGCATATTTA GATAAATTAG 1920 
GCATGAAAAA TGCGTTTTAA TATGTTTATA ATAGAGTAGT ATGGGTATTT ATTATTAATG 1980 
ACATTATTAC ATGTTAATTA GQAGGCGTTT TTAATGGATT GGATTTTACC AATTGCTGGA 2040 
ATTATCGCTG CGATTGCATT CTTAATTTTA TGTATCGGTA TCGTAGCTGT ATTAAATTCT 2100 
GTTAAGaAAA ACTTAGATTA TGTTGCAAAA ACACTTGAOG GTGTAQAAGG TCAAGTTCAA 2160 
GGTATTACTC GTGAAACAAC AGATTTACTT CATAAAGTAA ACCGTrTAAC TGAGGATATC 2220 
CAAGGTAAAG TAGATCGTTT AAACTCAGTT GTAGATGCTG TTAAAGGTAT CGGTGACTCA 2280 
GTACAAACGT TAAACAGCTC TGTAGATCGT GTAACAAATT CAATTACACA TAATATTTCT 2340 
CAAAATGAAG ATAAAATCTC ACAAGTTGTT CAATGGTCAA ATGTTGCAAT GGAAATTGCA 2400 
GACAAATGGC AAAATAGACA CTAC 2424 



(2) INFORMATION FOR SEQ ID NO: 393: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 738 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 393: 
CnATAATATT CnTCAAnCCT TTAAATAGAG GAATAGACTG CCGACAGAGT CCGAGACTTT 60 
GTGGGTAGTT TTTTTAGTTT TGATAACGGA AGTTAGAGGC TCTCTGTCAA ATTGGGCAGA 120 
AGTGTAAAGT TTTCAACATA ATACTATTAQ TTCGGTCATG TATCGGACTG ATGGAAAAGC 240 

GTTTCACTTT TAATGACTCA TTAAGAACGG CCTGAAAATG TTTGGCGTAT TAAGTGCAAT 300 

GATAGTTTTG ACATTTAGTT TCTAATTGGT CATTACTGCC GAGCAAATCT AGTAGAGTAA 360 

TCATGTAAAT CTTTAATGTG CCATTTGATT CACTAGCGGT GTTAATAACT ACGGAAATTG 420 

CATTTCCGAC TGAAATTTTT GAAAAATATC AACGTACGCT ACAAATAAAA TTTTAAACTG 480 

TTATAAATGT GTCTCAATTT CATATGTTCA TCGACQATAT GAAGCGTATT ATGGTAAAAT 540 

GAAGAAATAA TAAACTTGTT AATAAATAAA ACATCACGAT TTGACTAAAG CACTTTATTA 600 

TTGTGTAGAT AATAGTTTTT TAACGAAATA AAAATGGCGA CTGGTTTTAA TAAATCAGCT 660 

AATGAATCAC TACACCTATA AGTATGAATA TAGTGATTAG AATGCTTTGT ATAGTTGGAT 720 

TTTGCAAAAT TGATGTTA 73 B 
20 (2) INFORMATION FOR SEQ ID NO: 394: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1270 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 394: 



AAAAGTTGTA ATTAAAAGTG GGATTTTACT TAAGnnAGAA GGAAACTATT TATATGACTA 60 
ATAAAAGAGn AGATGTCCGC AATATAGCAA TTATTGCTCA CGTTGACCAT GGTAAAACAA 120 
CTTTAGTAGA TGAGTTGTTA AAACAATCTG GTATATTCAG AGAAAATGAA CATGTCGATG 180 
AACGTGCAAT GGACTCTAAC GATATCGAAA GAGAGCGTGG AATTACGATT CTAGCCAAAA 240 
ATACGGCTGT TGATTATAAA GGTACACGTA TTAATATTTT GGATACACCA GGACATGCAG 300 
ACTTTGGTGG AGAAGTAGAA CGTATTATGA AAATGGTTGA TGGGGTTGTC TTAGTAGTAG 360 
ATGCGTATGA AGGTACAATG CCTCAAACAC GTTTTGTACT TAAAAAAGCG CTAGAACAAA 420 
ACCTGAAACC TGTTGTTGTT GTTAATAAAA TTGATAAACC ATCAGCACGT CCAGAGGGTG 480 
TTGTAGATGA AGTTTTAGAT TTATTTATTG AATTAGAAGC AAACGnTGAA CAATTAGAAT 540 
TCCCTGTTGT TTATGCTTCA GCAGTAAATG GTACAGCTAG CTTAGATCCT GAAAAGCAAG 600 
ATGATAATTT ACAATCATTA TATGAAACAA TTATTGATTC ATGTACCAGC TCCAATTGAT 660 
AACAGTGATG AGCCCATTAC AATTTCCAAG TAGCATTGTT GGACTACAAT GATTATGTTG 720 
GACGTATTGG TATTGGTCGT GTATTCAGAG GTAAAATGCG TGTCGGAGAT AATGTATCAC 780 

GATTAAAACG TTTAGAAATT GAAGSAAGCAC AAGCTGGAGA TTTAATTGCT GTTTCAGGTA 900 

TGGAAGACAT TAATGTTGGT GAAaCTGTAA CACCACATGA CCATCAAGAA GCATTGCCAG 960 

TTCTACGTAT TGATGAGCCT ACTCTTGAAA TGACATTTAA AGTTAACAAT TCTCCATTTG 1020 

CTGGCCGTGA AGGTGACTTT GTAACAGCAC GTCAAATTCA AGAACGTTTA AATCAACAAT 1080 

TAGAAACAGA TGTATCTTTG AAAGTTTCTA ACACAGATTC TCCAGATACA TGGGTAGTTG 1140 

CTGGTCGCGG TGAATTGCAT TTATCAATCC TTATTGAAAA TATGCGTCGT GAAGGTTATG 1200 

AATTACAAGT TTCAAAACCA CAAGTAATTA TTAAAGAAAT AGATGGTGTA ATGTGTGAAC 1260 

CATTTGAACG 1270 
(2) INFORMATION FOR SEQ ID NO: 395: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1365 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 395: 

AAGACCAGGA GAACAmGTAA AACAATATAC AGTTGaAATC GCTCGTAAAT TAATGGAATT 60 

TGATATAAAA TGCTCGTGAT TGCTTGTAAA TACGCAACTG CTGTnGCTTT AOAATATTTA 120 

CAAAAGACCT TATCAATCCC CAGTGATTGG CGTAATTGAC CAGGTGCTAG AaCAGCAATA 180 

ATGACTACTA GAAATCAAAA TGTATTAGTA CTAGGAACGG AAGGCACAAT TAAATCTGAA 240 

GCATATCGTA CGCATATTAA ACGTATCAAT CCACATGTAG AGGTACATGg CGTTGCCTGT 300 

CCAGGTTTTG TGCCACTTGT AGAACAAATG AGATATAGTG ATCCAACAAT TACAAQCATT 360 

GTCATTCATC AAACACTGAA ACGTTGGCGT AATAGTGAGT CTGATACTGT CATTTTAGGA 420 

TGTACCCACT ATCCATTGCT CTATAAACCT ATCTATGATT ATTTTGGTGG TAAAAAGACA 480 

GTGATTTCGT CTGGATTAGA AACGGCTCGT GAAGTTAGTG CATTGCTAAC ATTTAGTAAT 540 

GAACATGCAA GTTATACTGA ACATCCAGAT CATCGATTTT TTGCAACAGG TGATCCTACT 600 

CACATTACTA ACATTATCAA AGAGTGGTTA AATTTATCTG TCAATGTGGA ACGTATATCA 660 

GTGAATGACT AGGAGGATTT TTAATGAAAG AGATTGTTAT TGCATCGAAT AATCAAGGGA 720 

AAATAAATGA CTTTAAAGTA ATATTTCCAG ATTACCACGT AATAGGTATT TCAGAACTAA 780 

TACCAGATTT TGATGTGGAA GAAACAGGAT CAACATTTGA AGAAAATGCT ATATTAAAAT 840 

CAGAAGCTGC TGCAAAAGCA TTGAATAAAA CGGTCATAGC TGATGACAGT GGACTAGAAG 900 
GCGATGAAGC AAATATTGAA AAATTATTAA ATAAGCTTGG TAATACAACT GATCGTCGTG 1020 
CGCAATTtGT TTGTGTCATA AGTATGAGTG GCCCTGATAT GGAAACAAAA GTATTTAAAG 1080 
GTACTGTTTC AGGTGAAATT GCAGATGGAA AATATGGCGA AAATGGTTTC GGATATGATC 1140 

CGATATTTTA TGTACCGAAA TTAGATAAAA CCATGGCTCA ACTTTCAAAA GAACAAAAAG 1200 
GGCAAATTAG TCATAGACGA AATGCGATTA ATTTACTTCA AGCTTTTCTT GAAGGTGATA 1260 

AAAATGTCTA AATGGATTAT TGTGAGTGAT AACCATACTG AATCAGGCGT TTTATATCAA 1320 
ATTTATGAAA TGCACCCAGA TGCAGATGTA TATTTACATT TAGGA 1365 
(2) INFORMATION FOR SEQ ID NO: 396: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1383 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 396: 



AATTCCTGGT GCAATAATAA ATAGGATGAA AAAGATTCGG AAAATATGAT AACTCGTAAT 60 
CATAGCAACA TCGGCACCAG TAGCTAATGC AACTAAAACT ATCTGATTAA CCCCTCCTGG 120 
TGCTGCACCA AGaAACAATT CATTAATAGG ATTATTATCA AAGAAATGTA TGATATAAAC 180 
CATGATTAGC GCACCAATTA TCAACATAAT ATTTTGAATT GTAATTGCGA TTGCTAGTCT 240 
ACCTTTTAAA TCTGACAATA AATGCGCAAT TTGAACTCCA ATTCTAATCA TATATATTAG 300 
TTGTGCCATG TTCAACAACC AATGATCTAG TGTAAATGTT AAACCTGTAG AAAAATTCCA 360 
AACAATTAAT ACAATGAGTG GTGCTAATAA TTGAAATGTT GGAAACTTTA TTTTAGACAT 420 
AATTAGATAA ACTATAAAGA TAGCTATCGC TAAAATAACT ATTTGCCCTA TGTTTAATAC 480 
TTGTGATAAA GGCAAGACTT TTGITAACTT TCCATTCGCA TQCATOTTAC CATCATGAAA 540 
AAAATATGAA ATGAACGGTA CTAAAACAAC AACAAATATA ATTCGTGATG TTTGCGTTAA 600 
GCTAACAACT AACAAATTAG CACGTTTGTC TTGTTCAGCC ATGACCAGCA TTTGTGTTAG 660 
TGCTCCTGGT ATAACACTTA AAATAGCTGT TTCTGTATTA ATACGTGCAA TTTTTTTAAA 720 
AACAAATGCC ATTACTATTG CAATTAATAA TATCGAAATA GATACAACAA TAATCGAAAG 780 
CCAATTGTTT TTAATATCCA TAACGACATT TTTCGTAAAC GTTGATCCGA TTTGCACACC 840 
TAATAGTACA ATACCTAATT CACTAAGTAA GAATGGCCAT TTAATATCAA GTTTGAAAAC 900 
TTTTACACAA ATGATTGATG CGATAATAGG ACCAAACATA AATGGAAGTA ATACGTGCGA 960 

TATCATTGCC ATGTTTTCCA CTTCTTTCAA TAAAAAATAA AATGACTAAA TTGCTGCTTG 1080 

AGCTTCACGT TTGTTAAGAT AACAATATCC GCTAGCAGTT tTGACTACAA AGCATATATG 1140 

GaCTTTCACT ATCAAGTCGC CGCCCATGCC TTATATACAT TTAAAAnGAG CCTGAACAAA 1200 

GTTCAGGCTC TCAATTTGTC CGTATATTTA TTTTACAATA CGACTTAAAG CCGTATCAAA 1260 

TGCTTGAATC GTTTTTCAAT ATCTTCTTTC GTGTGTGCCG TAGATAAGAA TGTACCTTCA 1320 

AATTGAGATG GnGGnAAAAA CACACCTCTT TGnCATTCTC GGTACATTTC TGCAATAATT 1380 

TCC 1383 
(2) INFORMATION FOR SEQ ID NO: 397: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 415 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 397: 

TCCACTAAnA TGATTACAAT TGCATTAGTT TGGTGGAGTG CATTTACAAT CTTAACGGGT 60 

ATGATTAAGA ACCACGGTTT AATTaTTTAG TGAGATTCTT ATTTGGTGTA GGTGAGGCGC 120 

CAATGTACCC TTCTAATGCT GTGTTTAATT CATTTTGGTT CTCTAAAAAT GAAAAAGGTA 180 

GAGCATCAAG TGCATTATTA GCAGGATCAT ATTTCGGACC TGTATTAGCA CCAATAGTTA 240 

CAATTGCTAT TGTTAACGCA TTTAACTGGC AAGCAGTATT TTACATTTTT GGTGCAGTAG 300 

GTATTTTAAT GGCkGTATtA TGGGCGATTA TTGCCAAAGA CTTACCTGaG CrACATAGwa 360 

TGGTTAATGA AGCGGAGAAA CGTTTCATTA TGGAAAATCG TGATATCGTA GCTAC 415 

(2) INFORMATION FOR SEQ ID NO: 398: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1141 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 398: 

TTTTAGaTaA aTyCAATTnT CyATaCTaAA TgATTnTCTT ATTaCGTCAA TTcGCCTTTT 60 

aTTTTATCGT AATCTTTCCa CTGCAAAGCT AAAGCTTCTC CTATTCTAAG ACCAGAATAA 120 

AATAACAGTC TAGTTAGCTG ACGAGAAGTA TCATTTGTGA TTTGTTCTAC TTTTTCATCA 180 
AATGTGGGGT CGTATAAGAG CTTGTAATGC TTTTTGGCGT AATTGATAAC TGCTTTAAAA 300 
CCTGCCCACA CAGATCGTGC ATAGTCAACA GAAAGACCTG CATCGTTTAA CAAATAATTC 360 
CTGAAAGCAG TACATTGCGT AGTAGTGATT TTGCCAATAG GGATATTTCC GAACCTTTCT 420 
TTTATGTGAG TATTATATTC TGTAGTTCGC TTTTCTATTG AGCX3TGCAGA AAGATTTTCA 480 
TTTTTTAAAC GATCAAAAAA TATATATTCA AAGGGTTGAT TGTCCGAGTA TCCATATTTA 540 
ACArm'GTA TAAATTCGCT TTCAGCTAGT TTGGCATCTT TCTTACGTTC AAACCCACGC 600 
TTCATTTTTC GTTTGTTATT ACCGTATACA TCTTTATATC TAATGGAAAA ATACCATTTA 660 
CCTGTATTAT CATCCTTATA TACTGGCATT TTGCTTCTCC CTCCTCAAAA TTGGCAAAAA 720 
ATAATAAGGG TAGGCGGGCT ACCCGAAATT TAGTACTAGG TACTAAATGT GATATAATAA 780 
AATAAAAAGT AGGTGATGTT ATGACATTTA AAAACAATCA TAATTTCAAT GAATTAGTTT 840 
TAACGAATGA AGACATTAGA ATTTTAAAAA ATGTCTTAGA AGATGCAGTC AGTGTTTATG 900 
ATGAATATTC GGTATGTAAT GAAGAATCCG ATTTTGCTTA CTGTTTATTA AGAGACTTAT 960 
ATACATTAGA CAGCTTAGCT ATTTCGTCAA ATAATGTTTG AATTATCGAA TTGTACTCTT 1020 
CGATTTTAAT ACCATGCATA ATAGAGTTTC TGTGTTCAAT AGCAOCTTTG ACTGAATGTk 1080 
TTAAATGTTC TTCTATTAAA TCGTTGTTTk CCAtTTCGtk TAaAAATGTt CyTATATTCC 1140 
T 1141 



(2) INFORMATION FOR SEQ ID NO: 399: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 706 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 399: 

ACTGTGGTAG GTTTTTTATT TTGAAGTATT AATCATAACA GACTAATAAT CATGAGGTAA 60 

CTAATAACAC ATATTTAACT TGTATTCTTA AACTGGTATA ATAAATTTAT GTTGAAATGA 120 

ATATTGTATG ACAGGGTATT CACTTTTATT AAAAGGTAAA ATTAAATAAA GGTTTTATAG 180 

AACGTATTTA AATATATGAG GAGTAAACAA ATGGCTGATA GAAGGAATAA AGAAATTAAA 240 

ACAGGACGCT TTATTGCAAC TGCATCAATC GTATTCTCAA TATTATTGAT TATTCATTAC 300 

TTTGTTTCGT TGGATAATGC GACTGCCAAA GCATTACTTA ATTTAACGAA TCAAAACACT 360 

TCAGATAAAG CGATTGATTA CATTTTAAAC AGCTTTAGAT TCACTGGTAT TATGTATATT 420 



ATGTTTGCAG TTTATGTATC AAATAGTTTG TTTACGTTGA TTAATTTATC AATCACAATT 540 

CAAGCAATAA AAGCTGCACA CGGTGCGTAC TTAACATTGC CAATTTTAAT TGTTATTATA 600 

GGTTCGGTTG CATTAGCGAT TTATATGCTT GTTGTTTCTA TCAAACGTAA AAGTACATTT 660 

AATCGCTAGA AAATTGATTT TAACAATAAA AATATGAAAA AAAAnn 706 



(2) INFORMATION FOR SEQ ID NO: 400: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1187 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 400: 



ACACAATCTG AAGATTCACG TTGTGGTGCT GGACATGATC GAAAAATTAG AGCTGAACAA 60 
ATGAnAGAAA TCAGTGATTT TGTTAAAAAG AAAAATATCC CTAAAGATGA AACGGTATAT 120 
ATAGGTGGCG ACCTTAATGT CAATAAAOGC ACTCCAGAGT TCAAAGATAT GCTTAArAAC 180 
TTGAATGTAA ATGATGTTCT ATATGCAGGT CATAATAGCA CATGGGACCC TCAATCAAAT 240 
TCAATTGGGA AATATAATTA CCCTAATGGT AAACCAGAAC ATTTAGACTA TATATTTACA 300 
GATAAAGATC ATAAACAACC AAAACAATTA GTCAATGAAG TTGTGACTGA AAAACCTAAG 360 
CCATGGGATG TATATGCGTT CCCATATTAy aCGTTTACAA TGATTTTTCA GATCATTACC 420 
CAATCAAAGC CTATAGTAAA TAGTGCTCAA CTAACTAATA ACTTGCTtCG TTCTAAAAGG 480 
ACGAAGCGAG TTATATTGTT AAAATTTGAA TTGACTTACA TTTTAATAAA ATCATCTTAA 540 
CAACTTTAAT TTTTCaTTAA TACAaGTCTT TACTCTACAC TCAAACnAGA TTCATACACT 600 
GCACGTCATA ATAAATCTAT CTATTCAAAT ATAAATAAAA GTTACCTACT ACATTCTATG 660 
TAGCAGGCAA CTTTTATTAC TTATTTCTTT TCATTATCAT TAAGTACTTT TACAAACTTC 720 
ACATTATGTG TCTTCCAATC AACTTCATAT AATGCTGATA ATTTTTCTTC TTTTTTATCT 780 
ACATGGTTTT CACCAGACCA ATAGCCCCAG AAACCATGGC GATTCCAATC TATTTTAAAC 840 
TCATCCATTG ATCTTTTATA ATGAACAACA AATTGTGATT TACCTTTGTC TTTTTTATCA 900 
TGTGACATAA CAGCTAAAAA TTCTGGATTA AACCCTTCAG ACACAGTTAC AGGCATTTTG 960 
TCTTTAGGTG TGAAATTATC TTTCGCCCAT AAATTTCCAT TTCGTGTTAA AGAAAAAATT 1020 
TCACTTTTAG TTCTATTATC ACTATCATTA GTTAATTGTC TCGTATGGTC ATGTCCCATA 1080 
TTATTTATCA AATGTGCTTC TACTTTCCAA CCTACACCTT TATGTGACGT AGATTGATCA 1140 

(2) INFORMATION FOR SEQ ID NO: 401: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 847 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 401: 

CCAGAATTAT TTTTTCAAAA AGGACAATTT AACAATGTCG ATAACGTTAT CATAAGCAAA 60 
CCGATGAAAG GGACAATGCC TAGAGGTAAA ACGGAaGCTG AAGATCAACA GTATTATAAA 120 
ACATTGCAAA CTTCTTCGAA AGATCGTGCA GAAAATGTCA TGATTGTTGA TTTACTAAGA 180 
AACGATATAG GGAGAATATC ACAGAGTGGC TCAATTAAGG TGTATAAACT ATTTTTTATT 240 
GAGGCATATA AAACTGTATT TCAAATGACT TCGATGGTAA GTGGAACTTT AAAAAATAAT 300 
ACAGACTTAA CTCAAATTTT AACATCGTTA TTTCCTTGTG GTTCGATTAC AGGTGCACCG 360 
AAACTGAATA CAATGAAATA TATTAAACAA TTAGAAAGTT CACCTCGTGG TATATACTGC 420 
GGACAATTGG ACTATTACTT CCAACTGAAG ATGATAAAAT GATTTTTAAT ATTCCGATTC 480 
GCACTATTGA GTATAAATAT GGACAAGCGA TTTATGGAGT CGGAGCAGGT ATTACAATTG 540 
ATTCTAAGCC AAAAGATGAA GTGAATGAAT TTTACGCAAA AACCAAGATT TTGGAGATGT 600 
TATAATGCAA TTATTTGAAA CAATGAAAAT TGATAATGGA CATATCCCTA GACTTACTTA 660 
TCATACTAAT CGCATAAAAT GTTCTTCTGa GCGATTAAAC TTTAAATTTG ATGAACATGC 720 
ATGGCGAAAT GAATTAAACG ATGTAACAAC AAAGTATCAC AGTGGTCAAT ATAGACTTAA 780 
AATCGTATTA AATGCTGAAA GCAAATTTGA AACGATAGTG TCACCTTTAC CTGAGAAAAG 840 
TAGTTTT 847 
(2) INFORMATION FOR SEQ ID NO: 402: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 740 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 402: 

TGAAGATGAA GCAGAAGCAG AAGACAATCT GCTACGAGTA CAATCGAAGA AAGAAGAAGT 60 
GTATCGTCGA TTACTTGCTT CTAACTTAAC AAGCGTTCCT GAAAGtTTAT CATCATGAAA 120 
GTTAAACAGT TAAAAGATAA AGTGTCTAAA ATTGTGATTC AAATGAATAC ATTTGAAGAT 
GAAGCAAATG ATGTTCTTGT TAATGCTGTT TATGCAGAGA AATTAATTCA ATATGGAAAT 
AGATATCGTA AGGACTATAG CAATGTTGAT AAGAGCTTAA ATGAAGCTGA ACGATTATTT 
AAAAATAATC GCTATAAGCg TGCGATTGAA ATTGCAGAGC AAGCTCTTGA AAGTGTTGAG 
CCAGGTGTTA CTAAACATAT TGAAGAAGAA GTTATTAAGC AATAGAAACT AGTATGTAGT 
TATACTTAAA TAATATGAGC ACTCTGTCAA ATTGGACTGA TGAGTTTAAT AATTGAAGTT 
AGCCAACGAT ACGTTGTCTA GCTTCTTTTT TATATGGATA AATGaAAGGG ACAATAAATA 
TAAATAGCAA TTGTTTAAAG ATAAACGTAA TCAAATGTGT TGTTTTAATT AATATAAGTA 
GTGAAAAAAG CATAATCACA CAGCPGTTTA AATAGAGTGA AATAGTCTAA TTCTTATTTA 
ATAAGTAGAA ATAAGATTAT 
(2) INFORMATION FOR SEQ ID NO: 403: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 630 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 403: 

ATGCCAATAA TTAAACCATG TAATAAATAT ACGTATAGCG TACGACTACC AATATAAGTA 

TATAATTTTT TCTTTGTTGA CATTAAATTT AGAAACGCAG TCATTGCGAT TAATATAATT 

CCATATAATA TAAGTCGTTT AAAAGGACTG AATATACTCT GTCCTTCATT TTCAAGTGAA 

GTATATGGTG AACTTCCCAA TAACCAATCT GCATTGATAG GATGAATCAC GTAAACGATA 

AAAAACAAAA TAAAGGTAAT GATAGATACT GGTATTAGTT TTTTATTTTT AAAAATAGCC 

GTATGTTTTT TGGTGAAAAT GTAACCTAGA TAAAATATTG GGAAAAATAC GATTGTCCTT 

GAAATGCTTA AGTAGCTATC GATGTTATCT GAAAAACCTG CTCCAATAGA TATAATAATT 

GAAACTGATA GCACTTTATA TGGATTAAAT CTTCTAACTA TTACTAAAAT GACATGAAAG 

AAAAATAGCG TGATCAAAAA CCATAACGCA AATACTGGGT TAAAAGGATC AAGTTGTAAT 

TCGTCACTTT TACCTGTTAA GAAATAATAA ATTGAAAAGA ATGCAAAAAA TATCATATAA 

GGTACTATCA AACGTTTTGA AATTTTTTCT 

(2) INFORMATION FOR SEQ ID NO: 404: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6254 base pairs 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 404: 
ATTTAAACGT TCAGTTTCTA AAAGTGGAAC AATCCAAGAA GTACGTAAAC GTGAATTTTA 60 

CGAAAAACCA AGCGTAAAAC GTAAAAAGAA ATCAGAAGCT GCACGTAAAC GTAAATTCAA 120

ATAATTAATA CCTCTGTTGA CTCCCTCAAC ACGAATATTA ATTATATAAA ACAAACATCA 180

CAAGTTAGTG TCTGACACTA ATATGTGATG TTTTTTTGTT GTCAATTTTT AATTAAAAAA 240 

AGTTATATAG TTTATAAATA ATCAAATTGA TATTCTATAG GTTCTTATAA CTATAAAGTA 300 

TATTCAATTT CATGTATAAT TAATGTGAGG GCGAGGTGAA ATTGTGAGTT ATAATAATTT 360 

TTTACAAATG ACAACTATCT TGGAATCAAC GGCTGGAGAT ACTTGGGTTG AACAAGTTAG 420 


CAATATAATT GTTCAACCTA TTTTTACGTT AATATTAACC TGTTTGACAT TCTTAGGATT 480 

TGTATATCAA CTTTACTCTA AAAAAATCAA TGCAGCTGGT ATTATCGCCA CATTATCATT 540 

ACTTATTTTA TTTTTGGGAT TTCTAATCCA AGGAAATGTC AATATGCATT CTATCTTAAT 600 


ATTCTCAATT GGCGTTATAT TAGTTGTAAT TGAATTATTT GTAGTTGGTG CAGTAATTGG 660 

TATTATTGGC ATQATACTGA TAACTATAAG CATAACAACG CTCGGTGATA ATTTGCTATT 720 

TATGCTTGCG AATGTTATCG TTGCCTTGAT TTTAACGATT GTAGAATGGG TGATATTAGT 780

GAAGATTTTC AACAGAAAGA TTCCGTTTTT GGATAAAGTT ATCTTAAAAQ ATTCAACTAA 840 
TTCTGAGTCA GGTTACAATT CTCATGATAA CCGCTCGCAC CTCGTAGGAA AGACTGCTCA 900 

AACAGTTACA GATCTTCGAC CTGCAGGGAT TATTTTTTGT GAAAATQAAC GTATTGATGC 960

TGTTTCAGAT GGCAACTTTA TTTTGCGCAA TAAAACGGTA AAAATCCTTG AAGTTGAAGG 1020 

AACAAGAGTA GTTGTGAGGG AAGTAGATTA ATTAAAAGGA GCGATACCAT GTTTAGTTTA 1080 


AGTTTTATCG TAATAGCAGT TATTATAGTA GTTGCATTAC TTATTTTATT CTCATTTGTA 1140 

CCCATTGGTT TATGGATTTC AGCGTTAgcA GCTGGCGTTC ATGTTGGTAT AGGTACATTG 1200 

GTTGGTATGC GTTTACGTCG TGTATCTCCA AGAAAAGTTA TAGCGCCATT AATTAAAGCG 1260 


CACAAAGCAG GACTAGCATT AACAACAAAC CAATTAGAAT CGCATTATCT AGCAGGAGGA 1320 

AATGTTGACA GAGTTGTTGA CGCTAATATT GCTGCACAAC GTgcTGACAT TGATCTTCCT 1380 

TTCGAACGTG CTGCTGCAAT TGaCCTTGCA GGACGTGACG TATTAGAAGC GGTTCAAATG 1440

TCTGTTAATC CTAAAGTCAT TGAAACACCA TTTATCGCAG GTGTAGCAAT GaACGGTATT 1500 

GAAGTGAAAG CCAAAGCTCG TATCACAGTT AGAGCTAATA TTGCTCGACT TGTTGGTGGT 1560 


AGTAAGCATC ATACAGAAGT ACTTGAAAAC CCAGATAATA TTTCTAARAC AGTTTTAAGC 1680

AAAGGTTTAG ATTCAGGTAC TGCATTTGAA ATTTTATCAA TTGATATTGC TGACGTTGAT 1740

ATTAGTAAAA ATATTGGTGC AGACTTACaA ACTGAACAAG CATTAGCAGA CAAAAATATT 1800

GCACAAGCAA AAGCTGAAGA ACGTAGAGCT ATGGCTGTAG CAACTGAGCA AGAAATGAAA 1860

GCGCGTGTAC AAGAAATGCA TGCTAAAGTA GTTGAAGCCG AATCTGAAGT ACCATTAGCT 1920

ATGGCTGAAG CATTACGTTC AGGTAATATC AGTGTTAAAO ATTATTATAA TTTGAAAAAT 1980

ATCGAAGCTG ATACAGGCAT GAGAAATGCA ATTAATAAAC GAACTGATCA AAGTGATGAT 2040

GAGTCACCTG AACATTAAGT CGAGAGGTGA TTAAATGAGT GTCGGTATTC TAATTTTTGT 2100

CATATCAGTG ATCATTTCTA TCATTACTAC TATGCGCGAA AATAGTCATA AAGATAGACA 2160

AAATCAAAAG CCACCTCAAA AAACATCTAC CX3ATAATGAA CCAAAAAAAG GTGGCTTTTT 2220

TGAAGAAATT GAGCGAACGT TTAAA6AAAT AAGTGAAGAA TTAAATGAAG AAGAAAAGAA 2280

ATCATCGAAA CGAAAATATG ATGATACGTT ACCACCTTTA TTCGATGAAC TTCCAAAGGA 2340

AGAGCCTAAA TCGAAACCTG TTGTAGAACC TATGGCACCT AAAAAACAAC AAGAAACAAA 2400

ACCGATGACA GAGAAACCAA TCACAGTGCC TAAAGCAGAA CCGGTGGAGC AGAAACATAG 2460

ACCTTCTAGA CAAGATAATT CTGACGAAAT TAGACGTCAA TTAGAAAAAT CACTTAGAGA 2520

TGATATTAAA ACGATTCGTA CTGACATTGA TAGAGAAAAA GAAAAGCAAA TTGCTAAAAT 2580

GGAAAAACGT GCTAGAGATA TTATTGAGGA TAAATACTTA TCTGAACGTA CAAAACGTTT 2640

GAAATTAAAG CAGCTGCTTA ATTCTCAAAA TGTCQAAAAA GATTTGACTA AATCAGCGTT 2700

CCAATTTGAT AAAGATGAAG TAATCAATGG TATGATATGG TCAGAAATTT TAGCTAAACC 2760

AAAACAATTA TAAAATTTTT TGAAAACAAG CACTATCGTA ATGGTAGTTG CTTGTTTTTT 2820

TACGTTAAGG AAAATTAAAA AACAAAGAGA ATTTTTCGAG AAATATTAGT TATTTAAATT 2880

ACAGCAAAAA ATTGATTAGT CTAAAATTGA ATCTGCTTTT ATGACAAGGT GAAAAGTATA 2940

AATGATTATT TTAAATTAAA GAAAAATGAG TAAGTCAATG CAAAGATGTT TAAATCAATC 3000

AATTGCATGA TATAATTAAG TAGATATTAA AGCATCATAG AATGAATATA AATGATATAT 3060

GAAAAGGAGC GCGTGTATGC CTGGAATTAT ACAAATAGAC GATATGAACC AATCTCAAGC 3120

TTTAATTGGA AATAATGATG AACATTTAAA AGCAATTGAA GAGAGTTTCG ATGTTGTCAT 3180

CCATGCAAGA GGACAAGaAG TTGCCGTTAA AGGTACAAAA ATAGAAAACG TAGAAAAAGC 3240

GGAATCAGTA TTAATCAATT TGCTGAAGGT TATTGATTTA GGTAATAATA TTACAATTAA 3300

AGATGTTGAA GCAGCTATTA AAATGGCGCA TAATAACACA ATTCAACATC TGTTAGATTT 3360

GCAACGTATA TATGTTAATG CCATGAAAAA TAAT6ATTTA GTATTTGGTA TAGGTCCTGC 3480

TGGTACAGGT AAGACATTCT TAGCTGTAGT TTATGCAGCA AAGCAACTCC GTAAAGGTGC 3540

TGTTAAACGT ATTGTATTAA CAAGACCTGC TGTTGAAGCA GGAGAGTCAC TTGGATTTTT 3600

ACCAGGAGAT TTGAAAGAAA AGGTAGATCC ATATTTAAGA CCTTTATATG ATGGTCTATA 3660

TACTGTTCTT GGGCGTGAAC AAACAGAGCG ATTTATTGAA AGAgGCATTA TCGAAATAGC 3720

GCCACTTGCA TATATGCGCG GACGAACATT AGAAGATGCA TTTGTAATTC TTGATGAGGC 3780

GCAGAATACG ACACATGCGC AAATGAAAAT GTTTTTAACA AGACTAGGTT TTGGCTCAAA 3840

AATGGTASTT ACTGGTGACC AAACTCAAAT GGATTTACCT AAAGGTGTTA AAAGTGGACT 3900

TAAGGAAGCG GTCAGTAGGT TACACAACGT TAAAGGTATA AGTATATTGA AATTAGATCA 3960

GAGCGATGTA GTCAgACATC CATTGGTAAG TAAGATCATT GAACATTATG AAGGAGAGAA 4020

TTAAATGTTT ACGATAGATT TTAGCGATCA CACAGGCTTA GTTAAAGATG CTTGGTATAA 4080

ACAAATTGAA GATTTATTAG AATTTGCTAA AAAAGAAGAG CATATAGAAG ACGATGCTGA 4140

GCTTTCTGTT ACATTTGTAG ATAAACAAGA AATACAAGAA ATTAATCX3AA CATATAGAGA 4200

TAArGWTAAr GTTmCaGATG tAaTCyCaTT tGCTTTAGrA GrAGATGAGC CmGaGATkGA 4260

TtTTAGTGGT CTTGATATAC CACGTGTTTT AGGGGATATA ATTATCTGtA CGGATGTAGC 4320

GCAAGrACAA GCAAACAATT ACGGACATTC TTTTGAACGA GAATTAGGAT TTTTAGCATT 4380

ACATG6ATTT TTGCATCTAT TAGGTTATGA TCATATGACT GAAGCGGATG AAAAGGAAAT 4440

6TTTGGTCGA CAAGATACAA TATTAAACGC ATATGGATTA ACACGAGACT AATTATGAAA 4500

AGGTTTAAAT ATGCACTTGA TGGGCTGAAA ATCTTAATTC AAAAAGACTA TAAATTTCTT 4560

TTACATGTGT TTGCAATGAT TGTTGCTATT GTCTTTGGTC TCGTACTAAA TATTAATCGG 4620

ATTGAGTGGA TATTTATACT CATTGCTATT GCATTAGTTC TCACTGTTGA AGCTTTAAAC 4680

ACTGCTATTG AATATGTTGT CGATTTAGTG ACCGTTQAAT ATCATGATTT AGCTAAATAC 4740

GCTAAAGATA TTGCGGCTTT TAGTGTACTT ATAGTTTCAA TATTAGCATT TATTATAGGT 4800

TTAATA6TAT TTTTACCACA TTTTATAGCG TTATTTTAGG GAGGCATATA TGAGTTATCA 4860

ACCTCATTAT TTTCAAGAAG TTAGAAAAGC ACAACAAGAA TCATATTCGC CATACAGTCA 4920

ATTTAAAGTA GGGGCTTATT TAAAAmCQAA AGACgGTAGA ACTTTrTATG GTACCAATGT 4980

AGAAAATGCT TCTTATCCAT TATCGATATG TGCTGAACGA GCTAGTTTGG TATCGGCAAT 5040

TTCTCAAGGA TACAGACCAG GTGATTTTGA ATCAAtAACT GTAACCGTAG ATGCAGATAA 5100

ACCGTCATCA CCTTGTGGTG CATGTCGTCA AGTTTTGAAG GAATTATGTG ATGATGATAT 5160

ACCATTTGGA TTTTCAGGAA AGGATTTAGA ATAAATGACA GAACATAAAT CAGGATTTGT 5280

TTCAATTATA GGTAGACC^VA ATGTAGGAAA GTCAACATTT GTTAATAGAG TGATCGGCCA 5340 

TAAAATAGCA ATCATGTCCG ATAAAGCTCA AACAACTAGA AATAAAATTC AAGGTGTTAT 5400

GACAAGAGAT GACGCGCAAA TTATATTCAT TGATACGCCA GGTATTCATA AACCTAAACA 5460 

CAAATTAGGT GACTATATGA TGAAAGTCGC TAAAAATACA TTATCTGAGA TAGATGCAAT 5520 


CATGTTTATG GTTAATGCCA ATGAGGAmAT TGGACGAGGC GATGAATATA TTATAGAAAT 5580

GTTGAAAAAT GTTAAGACAC CAGTATTTTT AGTATTAAAT AAAATAGATT TAGTGCATCC 5640 

AGATGAATTA ATGCCAAAGA TTGAAGAATA TCAAAGTTAT ATGGACTTTA CAGAGATTGT 5700 


ACCTATTTCA GCATTAGAAG GGCTAAATGT CXU^TCATTTT ATTGATGTTT TAAAGACGTA 5760 

TTTACCCGAA GnACCTAAAT ATTATCCAGA TGATCAAATT TCAGACCATC CTGAACAATT 5820

TGTAGTGGGT GAAATCATTC GTGAAAAAAT CCTTCATCTT ACAAGTGAAG AAATCCCTCA 5880

TGCGATTGGT GTTAATGTGG ACCX^TATGGT TAAftGAAAGC GAAGATCGTG TTCATATCGA 5940 

AGCAACTATA TATGTTGAAA GAsGTTCGCA AAM^GGAATT GTCATTGGAA AAGGCGGTAA 6000 

AAAGTTAAAA GAAGTAGGaA AAcGTGCGAG ACGTGAtATA GaAATGCtTG TAGGCTCTAA 6060 

AGTTTACTTA GAATTATGGG TCAAAGTTCA AAGAGACTGG CGAAACAAAG TTAACTTTAT 6120 

TCGCCAAATT GGTTATGTTG AAGACCAAGA TTAATCTTAA AAGTGGTGAA GATAATTGTT 6180 


AATGCGCCAA AAAGGGATTA TCATCAAAGC AGTTGATTAT GGTGAATCTG ATAAAATTAT 6240 

CACGATTTTA AATG 6254 
(2) INFORMATION FOR SEQ ID NO: 405: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3710 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 405:
GTTGTTCTAA ATGTTTCTTG nATGAAACGA GTCAATGTAA ACTGATATTG CTGTATTTGT 60
TGCAGCAATT CATATTGGTC TGGTGTTGCA ATAACAGCAG CTTGAGTTGG AGTCGCAGCT 120 

CTGATGTCTG CAGCAAAATC ACTTAATGTA AAGTCTGTTT CATGACCAAC TGCTGATATA 180 


ATCGGTGTCT TACAATTATA TATTGCACGG ACGACAGCTT CTTCGTTGAA ATTCCATAAA 240 

TCTTCTATGG ATCCACCGCC TCGACCTACA ATAATGGTAT CTACACCTAA ACTATCTGCA 300 

TGTATTTGTT CAGCTAATGG AAAACGACTA TTTATCGTTG AATGGATATC TCXSAATTGCG
GCACCTGTAC TCGCTGTTAA AACTGCAATT TTTTTAGGAA ACTTAGGTAT TGATTTCTTA 
TTCGCTTTAT CAAAACAACC TTCTTCAGTT AATTTTTTCT TTAATGCTTC TAATTTTTGA 
TATAAGTTCC CTATACCATC TAATTGCATT TTATTTACAT AAATTTGATA GTTTCCACQA 
CGTTCAAAAA CAGAAACACG TGCTTCTAAT AAGACTTCAT CTCCTTCTTT AGGTTCGAAG 
TTTAATTTAG AAGCACTACC TTTGAACATC ATGGCACTTA TAACGCTTTC TTTATCTTTC 
ACATTAAAGT ATAAATGaCC ACTTGaATGc TTTTTGAAAT TTQAAAGCTC ACCTTTAATC 
AATACAGATT GGAGATGTGG ATCTTGATCA AATTTATATT TAATATATTT CGTTAAAGCT 
GAAACACTTA AATAATCTGA CATATAACAT CACTCAATTT TATTTTTTTA TATTACTCAA 
TACACCATTT ATAAATTTAT AATGATCATC ATCACTGAAT TGTTTTGTTA ATTCAACTGC 
TTCATTCATT ACGACTTTAG CAGGTGTATC ACTGTGTAAT ATTTCATATG TTGCCATTCT 
TAAAATAATA CGATCCGTTT TTAATAAACG TGCAATAGTC CAATCTTTTA AATAAGGACT 
AATTGTCTCG TCTAATACAG GTTCGTGATC TTTAACGCCA GAAACTAGCC AATGAATAAA 
TTCGAAGTCT AAATCTGGAT TATCGTCTTT AATAAAGCTT ATCGCTTCAT TTATCGTTAA 
ATCACTGTCC TTCATTTCTA ATTGAAATAA AGTTTGAAAA GCTTGCACTC GGGATTCTTT 
ACGACTCATT TTTAACTCCT TCAAACGTTT GTATTTTTCT TTATTTAATT ACTGAATTAG 
GTATGACATT ACTTTTCAAT AACGATTTGT GTAATGTGAA TATTAATTTG CTTAGGTTCT 
ATCGCTGTCA TATTAGAAAT TGAATTAAAA ATTGACGTTT GAATTTTGTT TGCAGTTTTT 
GAAATATTAA CACCATGTTT TAATGCACAA TATACATCTA TATATATGCC ATCTTCTTTA 
CTCTCGATTT TTAAATCACG GCTTAAATTT TTACGACTAA CTTTTTCTAA ATTTGTTTCT 
TTTAATTCAG CAAAATGGCC AGTGATGCCT TCGACTTCCG AAOTAGCTAT ACTTGCAATA 
ACAGATAGCA CTTCTGGCGC TATTTCTACT TTACCTAATT TTGAATTTGA ATAATCAGTT 
ACTTTGACCA TGGATTGACC TCCTATTAAC CTTCATCATT CATAATGCTA TTTTGCTCTA 
AAAAGTTTGT ATTAAATTTA CCGCTTCTAA ATATATCGTT ATTCAATAAT TTAATATGGA 
ATGGAATAGT TGTATCAATA CCAAGAACCA CAAATTCACT TAGTGCACGA ATGCCAGCCA 
TAATCGCTTC ATCTCGTGTC GGTTCATGTA TGATTAATTT CGCTACCATC GAATCATAAT 
ATGGCGGTAT CGTATAATTA GTATAACATG CTGACTCTAT TCGAACACCA TATCCACCTG 
GTGCAAGATA TTGCTCmATT TTACCTGGTG ATGGCATAAA GTTCTTGTAA GGATTTTCAG 
CATTAATTCT AAATTCAATT GCGTGTCCTG TTAATTTAAT ATCTTCTTGT TTATACGGTA
CAGTTACAGG ATGTTCTACT TGAATACGTG TATTCATTTC CATAAAATAA AATTTATTAT 2220

CATTTAAATC ATATATAAAC TCAATTGTTC CCGCATTTTC ATAATTTACA GCTTTCGCTG 2280 

CACGAACTGC GGCATTTCCC ATTTCACGAC GTGTTTCATC ATCTAAAATT GGGGAAGGTG 2340 

CTTCTTCCAC TAATTTCTGC ATACGTCTTT GAATTGTACA ATCACGTTCT CCTAAATGAA 2400 

TTACATTACC ATAGCTGTCC CCAACAATTT GGATTTCAAT ATGGCGGAAG TTTTCGATGA 2460 

ATTTCTCCAT ATAAAGTCCA CCATTACCAA ATGCAGTTTG AGCTTCTTGT TCTGTCATTC 2520 

GGAAGCCAGT TTCAAGTTCT TTTTCATCAC GAGCAACACG GATACCTTTT CCGCCACOGC 2580 

CA6CAGTAGC TTTAATGATG ACCGGATAGC CAATTTTTTT GGCGATTTTC TTAGCTTCTG 2640 

AGACGTCTTT CATTAAACCG TCACTACCAG GAACAACTGG AACATTGGCT TTGATCATTT 2700 

CTGCCTTAGC AACATCTTTG ATACCCATTT TTTGGATAGA TTGATAACTT GGTCCAATGA 2760 

ACTTCAATTG GcATgctTCG CATAATTCTG CAAAATCAGC ATTTTCAGCT AAAAAGCCAT 2820

AACCCGGATG AACGCCATCA CAACCTGTAG AAGTTGCAAT AGATAAGATG TTCGGAATAT 2880 

TTAAATATGA ATCTTTAGAC AAAGTGGGAC CTACGCAATA TGCTTCATCA GCAATTTGAG 2940 

TATGTAGCGC ATCTTTATCC CCTTCAGAAT AGATTGCAAC AGTTTGGATG CCTAAATCAC 3000 

GACAAGCGCG AATAATCCTA ACTGCGATTT CACCGCGGTT TGCAATTAAA ACCTTTTTCA 3060 

TTATTTCACC TTAAATAACG GTTGGCCATA CTCTACCATT TGTCCGTCTT CTACTAAGAT 3120 

TTCAACAATT TCACCTGAAA TTTCTGCTTG AATTTCATTA AATAGTTTCA TTGCCTCTAA 3180 

AATACACACT GTTGTTTCAT TTGAAACAGT GTCCCCAACT TGCACATATG CTTCTTCGTC 3240 

TGGAGATGGC GATTTGTAAA ATGTACCTAC CATAGGTGCA TTAATTGTTT TGTGATTATC 3300

TGAAGTTGGC TTTGGAGCTT CAGTTTTATT GCTATCAGTT GATTGTGCTT GAGGCATAGG 3360 

CATTGCCGCA GCTTCAACTO GCATTTGTGA GATTTGTGGC GTGATAATCT CAGTTTCTTT 3420 

TTCTTTCTTA AGCGTCACTT TGCCTTTAGT ATCTTCAATA TTGATTTCCG TTAAAGTTGA 3480 

TTTATCCAGA ATTTCAATTA ATTCTTTGAT TTCTTTAAAG TTCATTATTA CTQACTCCTT 3540 

CAGTTTGTTT TCATCTACCC GTCTATTTTA CTTGAGACAA CTCTTCAATT CAAGCATGTT 3600 

CATATTGCTG GCGACATTAT AAGTCTATCC CAAAGTTATA ATAAAACCAC ATTTTAAATT 3660 

AAAAACACTT GTGTATTTAT TACTTAACAT TGACTCATCT TAACTCTTGA 3710 

(2) INFORMATION FOR SEQ ID NO: 406:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1705 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 406:
GCTGATGTTC GTTGCCTTTn TCCACCAGAC AATTCAGAGG GATATTTATC ACTAATATCC 60 


AATATATTTA ATGCTTCTGC TACTTrTTCA TAACGATTTA ACATATGTTC TTTATCTAAC 120 

TTCTGTACCG TTAGTGGTAA CATTATGTTT TCTTTAACAG TCAATGTATG CAGTAAATTA 180 

TACTCTTGAA AAATAAAACC AATATCATGC TTGCGTATAT CAGATwATTC CTTGTTTGAA 240

AGCTTTTCTA ATTTTTTTCC TTTTAATGTA ATAGAACCTT GTGAAATATA ATCAATTGAA 300 

CTTAAAACAT TTAATAATGT CGTTTTCCCA GATCCAGAGG GACCCATAAT AGCAATAAAC 360 

TCGCCTTCTT CAATAGACAT ATTGATATCT CGCAACACTT CTTGTGCCAT TTTTTTAGTT 420 

CCATATATTT TTGTTAATTG TTTTACTTCT AAAATTGCCA CTTTAACACT CCTATAATTT 480 

ATCTTAACTT CATTTCCTTT AGGCTTTGGC ACTTGTATCT TCAATTTAAC ATATGACTAA 540 


CATCTATCTT ATTATAACGT TGAAGCTGCA TTGATGTATC AATTCTAAGT AACAAAACGC 600

ATGTTTAAAA TGACAAATTT GTCACTTCCG ACATGCGTTC AACAATTTCA TTTTGTAATG 660 

GGAAAATCAA TCTGACAGTT GTCCCCITAC CAACAGTCGA CGTGACTTGC AGGTGAATAC 720

CTAATT6ATC CTTTACACTA TTTACTAAAT ATAGACCCAT ACCTGAAQAC GTCGTTTCAT 780 
TTCTGTTAGC CGTTGACGTA AATCCTCGTT CAAATATTCG CGGCATATCT TTTTTACTAA 840 

TACCTCTGCC ATAGTCTTTA ATATATAACG AAACATGTT6 ATCATTTAAT TCTGTCCCAA 900
TTTCAATATT AAAATTCTCA CTATATTTCA ATGCGTTTGA CAAAATTTGT CTAATAATCA 960 

TACGACACCA TTTTATATCT GTATAAACAT AATCATCCAC TTTAAAGTCA ACATCAAAAC 1020 


CAATACCTTT AACCTGACTA ATATGTCTTG TTAATTOTAT TTCATCAATG ACCATGCGTT 1080 

TAAGTGACAC GTAATCAAAA TACATATCTT TAOGTTGAGA TTCTAATCTA GTAATATACA 1140 

GCTGTGTATC TAGCATCGAG TTTATACGAG ACCATTCATA TAGTAATGCT TGTwTTCTTT 1200 


CTTGATTTTT TTCTTGATCA ATTAATAATT TCATAGCTGT CACAGGtGTT TTTATGTCGT 1260 

GCACAAATTC TGTAATGGTT TGTTCATGCA TGTTCAATTG CAACTGTTGC TCAACAACCT 1320 

TTTCTTTGTG CGCTGAGATT TGACGATATA AATAATCAAC TGTATGACGT TGAAATGGCG 1380

TTTCCGCTAA ATCTTTATGT TTAATTTCTT CTATTTCTTT ATCTTTGTCA AAATGCTTAT 1440 

ATAATTTTAC TTCTTTAAAA TATGTCAATA AAAGAAAAAT CATTGTTAAA CTTAAATTCA 1500 


AAGAAACAAT ATAAAATAAA CTGTCTATTG GAAAATCATA ATCGATTAGA CTArTGCCTA 1560 

ACATAAGGAA GTTTAAAAAC AATATCCAAA AtATCCAGTT CATGCGAGAT TTCAAAAAAT 1620 

GCACTATCCA TACTAATTTC AGATA 1705
(2) INFORMATION FOR SEQ ID NO: 407: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1722 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 407: 



TCAATATATC TTAAAATTCA ATGATTAAAT CnATTATCAC TAGACATtlAA ATACATAAAT 60

CCTATTCCCC ATTTTCATTT nTTAATTCAT AAATGAATCA ATAACCACCT AATAACAAAT 120

CATATTATAC ACCTTTGTTC TCTATTTTTC TAAGGTTTAA AAAATATTTT TAGGTAAACC 180

TAAAAATAGA TGTAATAAAA ACGCCTCCTC AGATATTTAT ATATCTATGA AGACGTTTAA 240

ATACATTATA GATGGTCTGG TTCTGGGTGA ACGTATACTG AGGAAAtACC TTTTTTGTGC 300

AAATGATGTT CGACATTGTC ACAAATTTGA TGCGCTTCTA CTAAGGAAAG GTTAGCATCT 360

ACAACAATTG TGACATCAAT AAACACACTA CTTCCATGGT AACGCCCTTT AATACTTTTA 420

ACTTCTTGTA CTTCATCAAC TTCTAAAATA TCATTGCGAT ACGCTTCTAA TTCAGTTTCA 480

TTGAAACCAT CACTCAACAT AAAAATTGCT TCTTTAAAAA TACCAAAACC AGTATAAACG 540

ATTAGTAAGC CTAGTAATGT TGCTAAAATA ATATCGACAA TTGGGAAACC GATTTGCGTA 600

AAAATTAATC CTATCGCTGT TCCAATGCTG ACTAAACTAT CCGATAAATT ATCTTTGGCA 660

GCCGAATTTA AAGAACTACT TTTCGTTCTT TTCGCTAGTC TTTGATTGAC TGCAAATACA 720

ATCAACATTA CAAGACCACT GATTAAGCTG ACGATAATTG TTATTGCGTT AGGTACAACG 780

TCATCTTCTT TGAACAAACG AGGTGCATTT TGAATAACTA CTTGGATACC TACAAACATA 840

ATGACAAATG ACACCAATAA TGAAGAAATA TTTTCAGACT TCAAATGGCC ATAAGGATGA 900

TTTCGATCGG CAGGTTTAAT TGAAATTTTC AATCCAATAA TAACAGCTAA AGAAACGATA 960

ATATCTGTCA TATTGTTTAA TGCATCGGCT CTTACAGCTG CAGAGTTAAA GACAAAACCC 1020

GTGACATACT TAACAATAGA TAAGATTATA TATACAATTA AACTCAAATA AGCACCGCGT 1080

TGCGCCAATT TAAGATTTTC ATTATGAGAC ATGCGTTGAA CCACCTTGAA TTAGTATAGT 1140

AACAATATTA TGAATGATTC ATTTTAATTT TACAACGTTT TTAATTTTTA TAAATTTTTA 1200

TAAAATTAAA CTAATTTATT CATTCGCAAC CCCTAAAAAT AATTTTTAGC CTTTCTGCGA 1260

ATTTTATGAG CTAGAAAGGC GCCCAACTCT CCCTGTTTGT TAACTTTCGC CTCGAAAGTT 1320

CGAAWVTTAT GAGCTAGAAA GGCTTATGCA GTTGACGTTT TACGTCCAAC TGCGTTCCTC 1440

CX3TCTTCTTC AAATTTATTT GTnAGAAAGG CACCCAACTC TCCCTGTTTG TTAACTTTCG 1500 

CCTCGAAAGT TTCTATGTTA GAACCCTATG CATGAGTTGC GAAnTATCTA ATGTCGTGAA 1560 

CTAATTATAT AGAAGAAAAA GTGCATCAAT GACAAATTAA ATGAGATTTC TACTCTACCA 1620 

AACTCTCTTC GAAAGACAAT TTTCTCCTCT ATTTATTAGC AACTATTGCA TTTCTCCATA 1680 

TAGTACTTCC TTACTTAAAA TACGCTGAAT GTCTGAATTA AA 1722 
(2) INFORMATION FOR SEQ ID NO: 408:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5521 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 408: 

6CGAGACCCC CTGAGGGAGC AGTGCCCAGT CGAAGACCCG AGGCTGAGAC GGCACCCTAG 60 

GAAAGCGAcC ATTyCAATAC GAaTTgTGAt AAATAGAGAA CAGCAGTAAG ATATTTTCTA 120 

ATTGAAAATT ATCTTACTGC TGTTTTTTTA GGGATTTATG TCCCAGCCTG TTTTTTGTGA 180 

TTTTTAATAA TTTGAATATG GAAAATGTAT TATTCTCTCA TTTGTATAGA TTGTATTTAA 240 

TAAGTTAATG TAATCCTTGA GCTCACGATT AATAAAATTC TATAACCTTA ATTATTTTCT 300 

CGATACAAAG GGTTATTAAC TTTAATATAA GTATAATGAT GTGCCTCATC TTCAAGACGC 360 

ATTGTTGTAA CACACTTATA ATCTATAAAT GGCGCGAACA TGGTATCTTT AATTTCaTTT 420 


ATGCGATTCT CATTTACTTG ATTAGATTTG TGTGTCGAAA GTACAAGTTG ATCaAAAATG 480 

TTATCTAGTA CATCACGAAC GATATACCAC ATATGTCTTT CTAAGTTTGA ATCATTTGAT 540 

GCTTTAGAGA TTGTAAGAAT TAATTCGCCT AAATGGTTTT GAACGGTAGA ATAAAAGGCT 600

TTGTTAAACA CAGACGTTTT TGAATCAGTA AGAATTCTTG ATTTTTCATG GAAATGAGAT 660 

GTACTGTATA CCATTTCATT TAGTTGTGCT TTATCAATTC TTAAACCTTC AAAGTCTCTA 720 

ATATACATCG TGTCCAATAG ACCATCTTTC CGAAATGTTG CAATAGCATT TTGCAAGTGA 780 

GCCTCTAATG CAATGCCATA TTTAGTAACT AGTGGGATTA CGAGACCAAG CAATGCTTTA 840 

CTATAAGTTT CAACCCACGA TTTCGCCGAT GATTCAAAAT CAGACAATGA TGCAGCTGAT 900 

TGATAACGTT TAATCAATGT CACGATAGGT GATTCATTGT TAAATGGGTA GGTTGCAACT 960 

AAGCTTGAAG GAATCAATGG TGTGACTTCT TGTGGAATCA TTTGGTATAT ATTTTTTCTA 1020 

AAAAATGAAT ACCAGCAACT TCATCAATAA TTGTTGATGC ATAGGACTTA AATATGACAT 1140

CTTTCTCCAA AATATCATTT AAAATACGTG TCATTAGTGG ACCATTGTGC GTCGTTTGTT 1200

CTGATAATGT ACGAATCTCA CCTGTAATAT GAACGTTTGT CGACAATTTG ATGTGTGGCG 1260

ACATAGCTGG GTATTTAGGA ACTAATGTTC TGAAAGATAA ACCAGCATAA TAATCCaACXt 1320

TATGTTTTGC TTCAATGATT AATTCTTTAT CTACTTCTGC TTGATAATCA GAATGXAATA 1380

CGTCATCTAA TTGCCATGGA TGAACAATCA TAATGTGATA ATCATTAAAG TTAAACTTTG 1440

GCGTAAATTC ATTTTCTAAT TGTTTAATTA AGTCCGGAAA TAGTTGATGA ACAGTTGTAT 1500

CATAATCTTT AC3ACAGTGAC ATAGTACGGC TTAATTTACT GTGAATCAGT ACTATTTTCA 1560

ACTTAATAGG TTGATTGAAT TCTGAAGAAT ATAGGAATGT TTGTAATGCA TTTAAACCTT 1620

TACGTAATTT AGCCCCAGGA TGTAGCX3GAT GACCTTCAAT AACGGCTTGC TCTGAACGCA 1680

AGTAACTATC TTCGCTATTT TCGATAATAT TAAATAAAGG TGCAGAATCA TGTTGCATTG 1740

ACAGTGCTTG ATAGCTAATT GCAAATGTCA TATtAGTtGC ACTGTTTATT AAATCTT6CT 1800

GAAATTGATC ACTAGCAGCA TTTTTTAAAT CTGGTGCTTC AATTAAAATA CACTCAAGAA 1860

TTTCATTTGG ATGGTGTACT CGTGTAATCG TATTTGTAAT GTCATCTTTA ATGTAGAAAG 1920

GGCCTTCAAC ATCAATTCGA TCAAAGGCGT GTTCTCCAGT GATAGGAGCA TATAATGTTT 1980

GCTTAGCTTG TGGGAACTGG ATTTCTAGTA TATGAGTCGT TGAGATATCT AACATAATCA 2040

AATCACGACT CAATATTTTC TTACTTTGAG TGCGCGCTTT AACTAAGTTT TCGCGATGCA 2100

TTGATGTGAC CAATCTCTGA GTGACTTTAT CTCTTCCTTG TAAAATCATC TCTTTAAAAA 2160

TATTAGCCCA ATCGCTATTA TGTTGTTGTA AAAATAAATA TGTTTCTTGT TCTTCTTTGT 2220

TAAATTTTAA TGTCTGTTCT TTAAAAATTA AGTTCAAGTT CATAATTCAC CTCTATGAAA 2280

TATTTTACAA AAGCAAGATA GATTTGTATA ATCCATATTA ATGATAATGA yTCTTATTAT 2340

CAACAGAATG CGGGTGTAAG TTTTATGACA AAATATTTTT TTAGCAGTTC TTTTCTACTA 2400

TTTCTAGGTA ATTGGATTGG ACAAATAGGG CTAAATTGGT TTGTACTTAC CACTTATCAT 2460

AACGCAGTTT ATCTGGGGAT TGTCAATTTT TGCAGACTTG TACCAATATT ATTACTAAGT 2520

GTGTGGGCAG GGGCAATTGC CGATAAATAT GATAAAGGGC GATTGCTGAG AATTACAATT 2580

TCATCATCAT TTTTAGTAAC TGCAATTTTA TGTGTGCTCA CGTATAGTTc ACTGCAATTC 2640

CAATTAGCGT CATTATTATA TATGCGACAT TAAGAGGGAT TTTAAGTGCG GTTGAAACAC 2700

CTTTAAGACA AGCAATCTTA CCAGATTTAT CAGATAAAAT ATCTACTACA CAAGCTGTmw 2760

CATTTCATTC ATTCATCATT AATATTTGTC GTTCAATAGG GCCTGCCATT GCTGGTGTCA 2820

CAGTTTTATT ATGCTTACCA TTACATTTTA AAGTAACTAA AATACCTGAA GaTGCATCAA 2940

GaTACATGCC GTTAAAAGTT ATTATAGATT ACTTCAAATT ACATATGGAA GGTCGACAAA 3000 

TATTTATAAC ATCATTATTG ATTATGGCGA CAGGTTTTTC ATATACX3ACA CTTTTACCAG 3060

TTTTGACAAA CAAAGTATTT CCGGGGAAAT CTGAAATATT TGGTATCGCT ATGACGATGT 3120 

GTGCCATTGG TGGTATTATT GCAACGCTAG TTTTACCTAA AGTACTTAAA TATATTGGTA 3180 


TGGTAAATAT GTATTATTTA AGTTCATTTT TATTTGGCAT TGCTTTGTTA GGTGTGGTAT 3240 

TTCACAATAT TGTCATCATG TTCATTTGTA TTACATTGAT TGGGTTATTT AGTCAATGGG 3300 

CACGTACGAC AAATCGCGTT TATTTTCAAA ATAATGTTAA AGATTATGAA CGTGGTAAAG 3360 


TACTGAGTAT TaTTATGATG GgATAGAGGT ATGaTTCCAT kGGGAAGTCn ATTAATGAGT 3420 

ATATGTGCAG ATGTGTTTGG CATTGTTAGA ACTTTTTCAA TAATGGGAAT AAGTACTATA 3480 

TGCATTACAA TGGTATTCTA TTTTATAAAT AGAAAGTTGA AGTTAAAGTT GGAGGAAAGT 3540

AATCATGGTA TATCTTGAAT GGGCAAAGGC AGATAGAAAT ATTCAATATC GTGTAATTAA 3600 

CGCCATTATT AAAGAACGTA TTTACCCCGA GCAAACATTT ATTTCGCAAA AAGGATCTTT 3660 

AATTGAAATT CAGTATCATA TGCATGTGTT GACTATTGAA GTTGTTAGAA AAAGTGCATT 3720 

A6AACGCTAT GAGTTTACAG GTGATATTAC TTATTTAAAT AAAGGTGAAA CGTCATTAAT 3780 

TATAACTTTA GAAGGTTTAT TAGATGTGTT GAATCATGAC TTTGATATCC CTATTTCAGA 3840 


GCGACTACGC GAAGAGTTAA TACACAGTCG AGATAGTTTA GTTGAAACAT ATAAGCAAAT 3900 

GTCTCACAGA CAAACGTTAA TAAGTCmAAG TTTTAAATTT TCAAGGTTAC CACAAGATAT 3960 

TAACTTTTTT TCakGGTtAC AACATGTAAA AGATAGTGAT AAGACAGATG ATTTAACTTA 4020 


TTCTGAGAGT TTGGTACCAG AGGGGCATCC AACACACCCT TTAACCAAAA CGAAATTGCC 4080 

CTTAACTATG GAAGAAGTAC GAGCATATGC ACCTGAGTTT GAAAAAGAAA TCCCTTTGCA 4140 

AATTATGATG ATTGAAAAAG ACCATGITGT GTGCACAGCT ATGGATGGTA ATGATCAATT 4200

TATTATTGAT GAAATAATTC CCGAATACTA CAATCAGATT CGTGTGTTTT TAAAGAGTTT 4260 

AGGTTTGAAA AGTGAAGACT ATAGAGCGAT TTTAGTACAT CCTTGGCAAT ATGATCATAC 4320 

GATAGGGAAA TATTTTGAAG CATGGnTTGC TAAAAAAATA TTAATTCCAA CGCOGTTTAC 4380

AATACTTCCA AAAGCaACTT aTCatTTaGG ACGATGTCTT TAATTGATAA ACcATACCAT 4440 

GTTAAgTTGC CCgTCGATGC aCAAGCAACA AGTGCCGTTA GAACAGTCTC AACTGTGACT 4500 


ACTGTAGATG GACCAAAGTT AAGTTATGCT TTACAAAACA TGTTGAATCa ATATCCaGGA 4560 

TTTAAAGTTG CTATGGAACC GTTCGGTGAA TATGCAAATG TTGATAAAGA TAGGGCACGT 4620 

AGTGCAAGTC TAGTTAATAA AAATCCAATA GATCAAAAAG TTATCGTGGA TAGTTACTTA 4740

GAGTGGTTAA ATCAAGGAAT TACTAAAGAA AGTATTACGA CATTTATTGA ACGATACGCT 4800 

CAAGCATTAA TCCCGCCTTT AATTGCnTT ATTCAAAATT ATGGAATTGC TTTAGAAGCA 4860 

CACATGCAAA ATACAGTAGT GAACTTGGGG CCACATTTTG ACaTTCAATT TTTAGTGAGA 4920 

GATTTAGGTG GTTCTAGAAT TGATTTAGAA ACATTACAAC ATCGTGTATC AGATATTAAA 4980 

ATTACAAATG ATAGTTTAAT AGCTGATTCT ATAGATGCAG TGATTGCAAA ATTCCAACAT 5040 

GCTGTTATTC AAAATCAAAT GGCAGAATTA ATCCATCATT TTAATCAGTA TGATTGTGTT 5100 

GAAGAAACCG AATTATTTAA CATAGTACAG CAAGTAGTAG CGCATGCCAT TAACCCAACA 5160 

CTACCACATG CAAATGAGTT AAAAGATATT TTGTTTGGAC CAACAATTAC TGTCAAAGCG 5220 

TTGTTAAATA TGAGAATGGA AAATAAAGTA AAGCAATATT TAAATATTGA GTTAGATAAT 5280 

CCXSATAAAAA AAGAGGTGTA GTACTACATG GCACACX3TTA ACATAAATAT ATCGAAGATT 5340

AAaTATAACG CCAAAGTACT TCAAACAGTT TTTCA/VAGTA AAAATATGCA ATTCACACCA 5400 

GTAATTAAGT GCATAGCTGG TGACCGTACA ATTGTAGAAA GCTTAAAAGC GTTAGGTATC 5460 

AATCATGTTG CAGAATCCAG ATTGGATAAC ATAATTAGTA TTGCAGATAC AGGATTTAAC 5520 

A 5521 
(2) INFORMATION FOR SEQ ID NO: 409:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1261 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 409: 

AGGTCGTCTr GCaGmAGmGm TATTTGAOGA TCTTGCTTTC CCAAAACACG ATGATGATTT 60

TAACATACTG TCTGATTATA TTGAGACACA TGGTGATTTC aCATTGCCAA TGTCTGTATT 120 

TGATGATTTA TATGAAGAAT ATACGGAATG GCTAAAATTT TAATATAATT TTTAATAATA 180 

ATAGTTAGAA CCAGGGTGAT GCAATTCGTT ATCCTGGTTT TAATTTAAAA TAAACTAAGT 240 

TGTGACTAAA AATTAATCAA TTATAGTGAA ATATGGTGCG CTATCTTGCA TAAATTGATA 300 

TGATTAACTA CACAGAATTT AAAAGTACAT AATACATAAT AAGGAAGTGA TACAATGGAT 360 

GATAAGCAAC ACACATCTTC ATCCGATGAT GAACGCGCTG AAATTGCAAC AAGCAATCAA 420 

GACCAAGAAA CTAATTCATC GAAACGCGTT CACTTAAAAC GTTGGCAATT CATATCAATA 480 

CAAAAAATAA GTGGTTTAAA CAAAACTGAT CAAGCAAACT TAAATAAAAT TGAAAATGTG 600

TATAAAATCT TAAATAGTGA TTATTACAAA AAACAGGACT CTGACAAGTT AAGTAAAGCT 660

GCAATTGATG GCATGGTCAA AGAATTAAAA GATCCTTATT CTGAATATTT AACAAAAGAA 720

CAAACXMAT CCTTTAATGA AGGTGTTTCA GGTGATTTTG TAGGTATTGG TGCAGAAATG 780

CAAAAGAAAA ATGATCAAAT TATGGTTACT AGTCCTATGA AGGGATCTCC AGCAGAACGT 840

GCTGGCATTC GTCCTAAAGA TGTCATTACT AAAGTAAATG GAAAATCAAT TAAAGGTAAA 900

GCATTAGATG AAGTTGTCAA AGATGTTCGT GGTAAAGAAA ACACTGAAGT CACTTTAACT 960

GTTCAACGAG GTAGTGAAGA AAAAGACGTT AAGATTAAAC GTGAAAAAAT TCATGTTAAA 1020

AGTGTTQAGT ATAAGAAAAA AGGTAAAGTT GGAGTTATTA CTATTAATAA ATTCCaGAAT 1080

GATACATCAG GTGAATTGAA AGATGCAGTT CTAAAAGCTC ACAAAGATGG TTTGAAAAAG 1140

ATTGTTTTAG ATTTAAGAAA TAATCCAGGT GGACTACTAG ATGAAGCTGT TAAAATGGCA 1200

AATATTTTTA TCGATAAAGG AAAAACTGTT GTTAAACTAG AziAAAGGTAA AGATACTGAA 1260

G 1261



(2) INFORMATION FOR SEQ ID NO: 410:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2488 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 410: 


AAATATATTG AAnAGAllAAT TACTAA6ATT AAATCnTCTT AAAATATCCC TGAAATAACG 60 

TCCTAAAGAT TAAAGGAAAG AGGTTATAAG TTATGCCAAA ATTAATTTTA TGTCGTCATG 120 

GACAAAGCGA GTGGAATGCT AAAAACTTAT TTACTGGATG GGAAGATGTT AATTTATCTG 180 


AACAAGGTAT TAATGAAGCG ACTAGAGCAG GTGAAAAAGT AAGAGAAAAT AACATTGCCA 240 

TCGATGTAGC TTTTACATCG TTATTAACAC GTGCTTTAGA TACAACGCAT TATATTTTAA 300 

CTGAATCTAA ACAACAATGG ATTCCTGTAT ATAAAAGCTG GCGTTTAAAT GAACGCCACT 360

ATGGTGGATT GCAAGGCTTA AATAAAGATG ATGCTAGAAA AGAATTTGGA GAAGAACAAG 420 

TACATATTTG GCGTCGTTCT TATGATGTGA AACCACCTGC TGAAACCGAA GAACAACGTG 480 

AAGCTTACTT AGCTGATCGT CGATATAATC ATTTAGATAA ACGTATGATG CCTTATTCTG 540

AAAGTCTGAA AGATACTTTA GTTCGAGTGA TACCATTTTG GACAGATCAT ATTTCACAAT 600 

TTAAATATCT TGAAGATGTG TCAGATGAAG ATATCATTAA TTATGAAATT AAAACAGGTG 720

CACCGCTTGT TTATGAATTA ACGGATGATT TAGAAGTTAT AGATAAATAC TACTTATAAA 780

AaAAGAGCTG CAT6TACACA AGGAGTGAGT GTATATGCAG CTCTTAAAtA TGTGAAGTAA 840

TGTAAGGAAA TAGTTAAGTA TAGAGTTTAT ATTAACGAGC TAGGGATACT CGAAAATATA 900

GTTAGACATA CAATATAGTC AAATTAAAAC AATTATTTCG CTCTTTTATG TTGCTTAATA 960

ATCTTTAAAG CACGCTTTCT TGTTTTAATG TTAGGGCTAT TTAAATTACG ACGAGCAGTC 1020

TGTAAATCTA ATTTCATCTC TATCCCTCCT TGTAAATATA TTATGACCGA TAACTACTCA 1080

TATGTAAATA GTAATGATTA CGTTTTAAAG AAATTGTAAT AAAGTCGTGG TAATTTTTTG 1140

GAAAATGGGT ATAATTACGG GATATCTAAA AATGTGTGTC GTTTTTTAGA TGGTGAGGGG 1200

GAAGCTTTAA ATGTCGAAGA AACAAAAATT AACOATGATT ATTACTATGC TGATGGGTGG 1260

ATTTTTTGGA TTATTAAATG AAACACTATT AGTGACGGCT TTACCAAGTA TTATGAAAGA 1320

TTTTGAAATT TCATATACAC AAGTTCAATG GCTGACAACA GCTTTTTTAT TGACTAATGG 1380

GATTGTTATT CCTTTGTCCG CGCTTGTTAT ACAACGTTAT ACAACAAGAC AAGTGTTTTT 1440

AGTGGGTATT TCTATCTTTT TCTTAGGTAC ATTACTCGGC GGCTTGAGTC CGCACTTTGC 1500

AACATTATTA GTTGCTAGAA TTATTCAGGC GTTAGGCGCA GGTATTATGA TGCCATTGAT 1560

GATGACAACG ATTTTGGATG TTTTCCAACC ACATGAACGC GGTAAATATA TGGGGATATT 1620

TGGTTTGGTA ATTGGTTTAG CACCAGCTAT TGGACCTACT CTTTCAGGTT ACCTTGTTGA 1680

ATATTTTAAC TGGAGATCGC TTTTCCATGT TGTCGCTCCA ATTGCAGCTG TGACATTTTT 1740

AATTGGaTTT AAAAcGATAA AAAATGTTGG AACTACAATT AAAgTACCTA TTGATTTTAT 1800

TTCTGTCATT TTTTCTGTAC TAGGTTTCGG CGGGTTATTG tATGGAACXyV GTTCaATTTc 1860

AGAAAAAGGT TTTGATAATC CTAcGtATTA GTATCTATGA TTGGAGGCGT TGTTTTAGTC 1920

GCATTATTTG TAwTACXSTCA ATATCGGCTA TCAACACCAT TATTaAATTT TGCTGTATTT 1980

AAAAATAAAC AATTTACAGT TGGTATCATT ATTATGGGTG TCaCAATGGT ATCGATGATT 2040

GGTTCGGAAA CGATTTTACC TATCTTTGTO CAAAATTTAT TGCATCGTTC AGCTTTAGAT 2100

TCTGGATTAA CTTTATTACC AGGAGCAATT GTTATGGCAT TTATGTCGAT GACTTCGGGT 2160

GCTTTATATG AAAAGTTTGG TCCTAGAAAT CTTGCTTTAG TAGGTATGGC GATTGTTGTT 2220

ATTACTACGG CTTATTTTGT TGTAATGGAT GAACAAACAT CAACAATCAT GTTGGCAACA 2280

GTTTATGCGA TTCGAATGGT TGGTATCGCG TTAGGATTAA TTCCAGTAAT GACCCATACG 2340

ATGAATCAAT TAAAGCCAGA AATGAATGCA CATGGTTCAT CTATGACAAA CACAGTACAA 2400

AACTTTTCTC CAACTATGTC AGACTATA 2488
(2) INFORMATION FOR SEQ ID NO: 411:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1105 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 411:



TTTTACAATT TCAGATATCT CTAAAACACA TACCGATCCA ATGGCCAAAA TTATTCGTCA 60

AAAATTGAAA AAACTAGGAA TTCXSTAAAGG GATTCCAGTT GTATTTTCAG ATGAAAGTCC 120

AATTGTCATA AGAGAAGAT6 TAAAAGATAT AGTTGGAGAT AAAAATGCTA TCAATCGAAA 180

AGGACAAATG CCACCTTCTT CAAATGCCTT TGTGCCAAGT GTTGTTGGAT TAATTTGTOC 240

AAGTTATGTG GTGAATGACG TATTAAAAGA TATTCCAGTT CGTCGCATTA AAGACAAAGG 300

GCAATAATTC ATTTTGAAAG GGATAATTTT CAACGTAAGG CAAGTGTAAC CACACCATAA 360

AACTAAAACT GACTAGTTCG CAAGCATAGT AGAATATGCC TCGTGTACTA GTCAGTTTTG 420

ATTTGATTAT AGCTAAAATA TCAAAGGTAA TCGATTTAAT CTTATTTTTA CGGCCTTTGT 480

TTAAGTAAGT TGTTATATAT TTCTTTGAGT TGTTGTTCAC TTTTAGATGT CGTTTTTGGT 540

TCGTAATAAA TTTTGTTTTT TAGTTTATCA GGTAAATATT GCTGTGAAAC ATAGCCATTA 600

ACATATTGGT GTGGATATTT GTAACCAATA GATCGGCCTA GATCTTTAGC ACCTTGATAA 660

TGTCCATCTT TTAAATGGTT TGGAATTTGG CCCACATGAC CGTTTCTAAT ATCGGACAAT 720

GCACTATCAA TGGCACTCAT TGCTGAATTT GACTTAGGTG ATAAGCATAG TTCGATTACT 780

GCTTGGCTTA GTGGAATTCT AGCTTCTGGT AAACCTAGAC GTTCTGCTGA TTCAATAGCA 840

GCAAGTGTTC TCTGACCAGC ATTAGGTGAG GCTAAGCCTA TATCCTCATA GCTAATTACA 900

AGTAATCGTC GAACTATTGT AGGTAAATCT CCAGCTTCAA TTAATCGTGC TAAATAATGT 960

AAAGCGGCAT TGACGTCGCT ACCACGGATA GATTTTTGGA AAGCGCTCAT AACATCGTAA 1020

T6CATGTCAC CATCCTTGTC ACTTACAAAT GCACCTTTTT 6TAAACAGTC TTTAGCATCT 1080

TGCAATGTAA CATGTCGATA ACCGT 1105



<2) INFORMATION FOR SEQ ID NO: 412: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 579 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 412:

TAACGTCGAA CTTGAGCTGT TACGTTATGA CTCATAATTA TTTTAGCATA GTCATTTAAA 60 

TAAACTTCTG TTCTGTCTGT TGGATAAGCA AATTCAAGCA ATTGACTGTA ACTATCATTA 120 

ATGACTTCTT GATCAACATG ACTATCAAAA TATACAGCAT AATAATAAGT ACCATCAACC 180 

ATATATAACA AATCTTCAAA CTCTGTAGTT ATTGGATTGC TATGATATGC ATAATTAATA 240 

ACATCTTCTA AATCGTTAAA TTTCACAATG ATTGTTCTTG TATTTTTACG TGCTGAAGAC 300 

TTTTGACGTT TAGAACCTTG AGCTTCTTTT TCTTTTGTTC QTTGCTCGAA TAATTCTTCT 360 

AATTGATCTT CACCTTCTAA TGTTTGAGCT AACAATTCTT GAACTTGTTC ATCAAATkGA 420

TCAGTTGCAT CATCATCAGA CATATTCATC ATATCTTCAT TTTTAGATTT AGAAATTGTG 480 

ACTTCGACAC CTTTTTCAAA GGCATGTACT TGAATCCATA ATGGACCTnC TACAACAAAA 540 

TCTTCTACTT CGTTAATTTC ATCCATCATT GAnCAAAAG 579

(2) INFORMATION FOR SEQ ID NO: 413: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1342 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 413: 
GTGGTGAAAC TTCTTGCTTT 6TAATTTTAT AAAGTGGATC AATATCACTC TGACTTACAT 60 

CAGGTTGAAC TGTCATCTTT TTAGTTACTT TGTTTTCAAG CGTAATATTT GCTAATTcTA 120
GGtTTTACAG AATAATGAAT GTCATCAGCT AmCCCTTTmm CtTGATACTy CTCTTGATAT 180
TTTCCAGCTT TAGCATTTGA TAAATCAATC ACTACTCTTA AATCTTCTGG ATTTTCAATT 240

TTTATTATCT TTGATTGTGG TCCTGAAATA GTCACATTAA CTGTTTCAGG CGCTTTTGTT 300
AAATGCAAAT CTTTAGTGTT ATAAAGAATT TCAACGGGTA CATCITGAAT CGTTTTACTA 360 
GACTTTTGAC CAAGATTACC AGTGTTAAAG ATATTTCCAA AAACATTGTT AACAGATAAA 420 
AAGAAAAACA ATGCCAAAAG AAAGGCAATA AATCTCAAGC CCCATTTACT TTCTAGCATA 480 
TTATTTCACA CCTTTCTTTT GAAAGCGTGT GCCAAACCAA TGTTCAGCAA GCAACTCTTC 540 
AAAAATTTCG TTTGAAATGT CTCGTCGTAA TTTTCCATCA AATGTTACCG AAATATCACC 600 

AGTTTCTTCA GATACAATAA CGGTAAATGC ATCAGATACT TCTGAAATAC CAACCGCAGC 660 

TGCTGCTGCT GCAATCTTCG TGCCTTGAAT AATCATTGCA CCATCATGTA AAGGTGTGTT 780

AGGTATAAAG ACATTAATTA AAAGTTCTTG CGARATATTT GAATCCATTG CAATACCTGT 840 

TTCAATATAA TCTTGAAGAC CiX j l lT ClTr TTCAAAGACA ATTAATGCAC CTATACGTCT 900

TTTAGCCATA TATTGCACAG CCTTTGAAAC CGATTGAATC AATTTCTCTT CATCTTTACT 960 

ATACGTATTA GAAGTATAGC GTTTTAAAAA GCTACCTCTA CCAAGTTGTT CTAACGCACG 1020

TCTAATTTCT GGTTGGAATA TTACTATTAA AGCTAATACC CCCCATTGAA TAACGATATC 1080 

GAATAATTTA GATGTTGCAG TCAAGTTCAA TATCATACTT ATCTGCTGAC CAATAACAAT 1140 

TACTAATATC CCTTTAAGTA ATTGTATCGC TTTAGTTCCC TTAAAGACCG TGATGAGAAG 1200 

ATAAAGTACA TACCAAACTA TCAGTAAATC AAGGATACTC GTTACAATTT TTAACGTACT 1260 

GAGGTTTTGA AAAAAGTTGG AAAAATCCAT AACATCTCCT CCGGGTAATA TTTTTCCATA 1320 

ATACCCATTA TACCAATCAT TT 1342

(2) INFORMATION FOR SEQ ID NO: 414: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1073 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 414:

CT6TATAAAG ATGGAGGTGT TTTGCATGGT AAAACGTACT TATCAACCAA ATAAACGTAA 60 

ACATAGTAAA GTTCATGGTT TCAGAAAACG CATGAGCACA AAAAATGGCC GTAAAGTTTT 120 

AGCGCGCCGT CGTCGTAAAG GCCGTAAAQT TTTATCTGCA TAAGATCACT GACTCATCAG 180 

TGAtCTkTTT TTTCGTTTAA ATTAAGAATA AATAGAAATT TATGTTATAA GCTCAATAGA 240 

AGTTTAAATA TAGCTTCAnA TAAAAACGAT AnATAAGCGA GTGATGTTAT TGGAAAAAGC 300 

TTACCGAATT AAAAAGAATG CAGATTTTCA GAGAATATAT AAAAAAGGTC ATTCTGTAGC 360 

CAACAGACAA TTTGTTGTAT ACACTTGTAA TAATAAAGAA ATAGACCATT TTCGCTTAGG 420 

TATTAGTGTT TCTAAAAAAC TAGGTAATGC AGTGTTAAGA AACAAGATTA AnAGAGCAAT 480

ACGTGAAAAT TTCAAAGTAC ATAAGTCGCA TATATTGGCC AAAGATATTA TTGTAATAGC 540 

AAGACAGCCA GCTAAAGATA TGACGACTTT ACAAATACAG AATAGTCTTG AGCACGTACT 600 

TAAAATTGCC AAAGTTTTTA ATAAAnAGAT TAAGTAAGGn TAGGGTAGGG GAAGGAAAAC 660

ATTAACCACT CAACACATCC CGAAGTCTTA CCTCAQACAA ACGTAAGACT GACCTTAGGG 720 

TTTAGATACA ATTACGAGTA TTTCAACACC AATGGGTGAA GGGGCAATTG GAATTGTTCG 840

ATTGTCTGGA CCGCAAGCCG TTGAAATTGC TGACAAATTA TATAAAGGAA AACATCTTTT 900 

AAATGATGTT CCATCACATA CGATTAATTA CGGTCATATT ATTGATCCAG tCTAAAGAAG 960 

TGGTTGAAGA AGTTATGGTG TCTGTGTtAA GAGCGCCAAr AACATTTACA CGCGArGATA 1020 

TTATAGAgAT TAATTGTCAT GGTGGtATTk TAACGATTAA TAGAGTGCTG GGA 1073 



(2) INFORMATION FOR SEQ ID NO: 415: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3176 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 415:

CTTTACCAAT GCCAAATCCG AAGTAAAGTA TAGCAATAAA GATTACTAAT ACAATTCTGT 60

AAATQGCAAA TGGAATTAGT TTGATTTTGT TAATTAGATG CAAGAATGTT TTQATTGCAA 120

TTAGTCCAAC AGTAAAT6CA GCTAAAAAGC CTAAAATATA AAAAGGTATA TCAGCAATCT 180

GAATATCTTG ATAATGTTTT AATAAAGATA AACCACTAGC TGCTAACATA ATTGGAACAG 240

CCATAATAAA TGTAAAGTCC GATGCTGCTT TATGATTTAA TTTCATTAAT ACCCCAGTTG 300

AAATTGTTGA GCCTGAACGG CTGAAACCAG GCCACATAGC TACTGCTTGA gAAATACCAA 360

TTACAAATGC TTGGAAATAA CTGATTTGAT CTACTGTTTG TGGGTTTTTA ACTTTAGCTG 420

AGTATTTATC AGCAATAATC ATATAGATAG CACCTACGAA TAAGCCAATC ATAACAGTTG 480

GCACACTAAA TAAATGTTCT TCGATGAAAT CATCAAATAG TAAGCCTAAA ATACCTGCTG 540

GCACCATACC CACTAATACA TGTAATAAAT TTAAACGTCT TGGCTTTGAA CGTCTTTGTT 600

GATCGTTATC TCCTTCAACA TGTTTGTGTT TACCAATATG TAAAATCTCT AAGAAGCGTT 660

CGCGGAACAC CCATGCTGCT GCAAAGACGG ATCCTAATTG GATGACGATT TTAAATGTAA 720

ATGCTGACTG AGAACCTAAA AATTCAGATG ATTTTAACCA CATATCATCA ACTAGGATCA 780

TATGTCCAGT AGAGGAAACA GGTGCAAATT CTGTTAATCC TTCGACGACC CCTAAGATAA 840

TACCTTTTaT TAATTCAATG ATAAACATAA TGTACCCACT TTCATTACTC AATTTAATTT 900

ATTTAAATAT CAAAATTACC ATATCATGAT AGCATATTCA TTTAAAGACA TGCTAGTTAT 960

AGTTATAATA CTAGACTAAA GATGTATATA TTCATTTTCT TTTACATGTA AAACTACAAT 1020

ATTTTATTGA GCTATTTAAT TTGATTTTAA GGAAAACCTT TTATAATAGG TTTAGGTGAT 1080

TTCTTGGTCA GTACTGGTCT CGGCATAATC GTTATAACGC AAAATATTTT AATAGCAGAT 1200

TTTTTAGCTA AAATTATAAG ACATCAATTT CAAGGTTTAT GGATTGTATT ATTTATTTTA 1260 

TTAGGTGTTT TACTTTTAAG AGCAACTGTG CAATTTCTAA ATCAATGGTT AGGTGATACA 1320

TTAGCATTTA AAGTTAAGCA TATGCTTAGA CAGCX5GGTTA TTTATAAAAA TAATGGTCAT 1380 

CCAATCGGTG AACAAATGAC TATACTCACA GAAAACATTG ATGGTCTAGC ACCTTTTTAT 1440 

AAGAGTTATT TGCCTCAAGT GTTCAAATCA ATGATGGTTC CGCTCATCAT AATCATTGCA 1500 

ATGTTTTTCA TCCATTTCAA TACCGCATTA ATTATGTTAA TAACTGCACC ATTTATTCCT 1560 

TTGTTTTATA TTATTTTCGG TTTGAAAACG CGAGATGAGT CAAAAGATCA AATGACTTAT 1620 

TTGAATCAAT TTAGTCAACG GTTTTTAAAT ATTGCTAAAG GTTTAGTGAC GTTAAAGCTA 1680 

TTTAATCGTA CAGAGCAAAC AGAGAAGCaT ATTTaCGACG ATAGTACTCa GTTTAGAACT 1740 

TTAACAATGC GCATTTTaCG CAGTGCTTTT TTATCGGGAT TAATGCTCGA ATTTATAAGT 1800 

ATGTTAGGTA TTGGATTGGT TGCATTGGAA QCAACGCTAA GCTTAGTAGT ATTTCTVTAAT 1860 

ATTGATTTTA AAACTGCGGC AATTGCGATT ATTTTAGCGC CTGAATTTTA TAATGCAATT 1920

AAGGACTTAG GGCAAGCGTT CCATACTGGA AAACAAAGTG AAGGTGCCAG TGACOTTGTG 1980

TTTGAGTTTT TAGAACAACC GAACTATAAT AATGAATTTC TATTAAAGTA TGAGGAAAAC 2040 

CAAAAGCCAT TTATTCAGTT AACAGACATA TCATTTCGAT ATGATGATTC TGATAGATTC 2100 

GTATTAAATG ATTTAAATTT GGAAATATTT AAAGGTGATC AAATTGCACT TGTAGGTCCA 2160

AGCGGGGCAG GTAAATCCAC TTTGACACAT CTTATTGCAG GTGTTTATCA GCCAACAATA 2220 

GGTACTATAA GTACAAACCA GCGTGATTTA AATATAGGAA TACTTAGTCA ACAGCCATAT 2280 

ATTTTCAGTG CTTCTATAAA AGAGAATATT ACGATGTTTA AAGATATAGA AAATAATACT 2340 

ATTGAAGAAG TGCTAGACGA AGTAGGTTTA TTAGACAAAG TGCAATCTTT CACAAAAGGC 2400 

ATTAACACAA TAATAGGTGA AGGAGGCGAA ATGTTATCTG GTGGACAGAT GAGACGCATA 2460 

GAACTTTGCC GTCTTTTAGT TATGAAGCCA GATCTCGTTA TATTTGATGA GCCTGCAACT 2520 

GGTTTAQATA TTCAAACAGA ACACATGATT CAGAACGTTC TGTTTCAACA TTTTAAAGAT 2580 

ACAACGATGA TTGTCATTGC ACATAGAGAT AATACAATTC GCCATTTACA ACGACGCTTC 2640

TATATAGAAA ATGGAAGACT GATTGCTGAT GATCGCAATA TTTCAGTAAA TATAACAGAA 2700 

AATGGTGATG ACTTATGAAA ACACGACTAA AATTTCAAGT AGATAAGGAT TTATTGTTAG 2760 

CTATAGTTGT TGGTGTTTGT GGAAGTTTAG TTGCGCTCGC CATGTTTTTC TTAAGTGGTT 2820

ATATGGTGAC ACAAAGTGCA CTTGGTGCGC CACTATACGC TCTGATGATT TTAGTCGTTA 2880 

ATAAAGCTAC ATTTACAATG CTACGTGATA TTCGGGTACA GTTTTTCGGT AAATTAGTAA 3000

ATGTCATTCC TAATGTTTAC CGTAAACTGA GTTCTAGTGA TTTAATTTCA CGTATGATTA 3060 

GTCGTGTTGA GGCATTACAA AATATAkATT TACGTGTTTA TTATCCACCA GTCGTCATCG 3120 

GTTTGACAGC GCTAGTTACA GTCATAGTTT TGGCGTTCAT TTCAATCGGC CATGCG 3176 



(2) INFORMATION FOR SEQ ID NO: 416: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2109 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 416:

TCTTTATTTC AATTTCCAAT TGAATTTTTT TATTATTTAC GCATAGCTCT TAAAATTAAC 60

GTTACGATTG CAATTAAGAT AATTGAACCA ATTAATGCTG GCAAGATGTA AATACtTCCT 120

AATTCAGGAC CCCATTGTCC GAATAGTTTG CCACCTACCC ATGATCCAAT AATACCTGCG 180

ATAATATTGC CTAAAATACC ACCTGGGATA TCTTTACCCA TAATAGCACC AGCAGCCCAT 240

CCAATTAAGC CACCGACAAT TAACATTCCA ATAAATCCAA ACATAATTTT CAGTCTCCTT 300

TTTCTATTTA TTTTGCGTTA TTCTAAGTAG TACCCCTTAT TTACAATTCT AAAACAAATT 360

CAAATTATTT TTATCCAAAT ATTTTTAAAA GTAGTAATTG AATATCAATT TTATTCAATG 420

TAGCTATCGT TATTTAAAGT CTCTGTACCG ATAATATCAT ATACATTTAC ATTATTTTTT 480

CTGCCGAATT CATAGCTTGA TTATTTTATO TTATAGGACT AgAATATACA CATATTATTA 540

GAGCATCTTT GAATTTTAAA TCAAGAA6C6 AGGTTAATGA ACAATGAATA TGCATATTTT 600

ATATaACTTA CGAACTAAAC ATAATTTAGA AATTGACGAA TTAGCACAGC AATTAAATGA 660

GAAATATGGT ACTAAATATG AAGCACATCA AATTTGGGAA TGGGAGAATC ATCACCATGA 720

ACCTAAATTT AAAGATGCCA TGCATTTAGC TGACTtCTTT GATGCACCAT ATGAAATGTT 780

TTTAGAAAGT AAGGTTAAAG AATATCAGAA ACATTTAGAA GAAGTCGATA rTCGCATGGA 840

TAAATAGATG CAAATAAACC CTCACAACAC GTTTGGCATA TATCCTTTCA AATCTATACT 900

GGATATATTA CATTACGTTG TGAGGGTATT TTATTAATTA ATATGAATTA AGACATTTTA 960

CAA6CGTTAA TGCAACGAAT CTTTTTAGTG ATCTTGCTCA CTCTTTAATA CTTTACCGTT 1020

CTTAGCATCA ACAGTAACTT CTTGTTTTTT ATTACCTTTT TTCAAATCGA TATTGTAAAC 1080

AAGTTTGCCA TCATCTTTTT CAAGTGACCA TTCTTTAATA TCACCATCAA ATTCTTTTTG 1140

TCTTCTTTTT CAGTCTTTTT GTTAATCACT TTTTTATTTT TATCAGCAAC 1260

AAGTACTTCT GACTCTTCAC CAGATTTTTG TTGOGTCACT TTATAAGCCC ATTCACCATT 1320

AGAATTTTCA AATGAAATTC CTTTCAACTT TTGGCCTTTG TAAGTTTCTT CAGCTTTTTT 1380

CACAGCATCT TCTGGGCTTG TTTTAACATC TTTTAAAGCA ATAACATCTT TTGTTGTATT 1440

AGTGTCTTGA TTAGTATTTG ACTCTGTTGA TTTTGTTTCA TCTTTTGGAG TATCATTGCC 1500

ACATGCAGTA AGCACCACCG CTGACATTGA TAACACTGCT AATGATTTTA ATTTCATAAT 1560

ATCACTCTCT CTTCCTATTT TTGAAACTCA TAACAAAAGC TTATATGCTA TATAGATTGT 1620

ATTACCCCTT GTTTTTAATT TTATTCATAA TTATTACAAA TATTTTTAAA TTAATCGTCA 1680

TGCGTTACTT TCGTTCGTAC TCCTTTTATA AATGAACCAT GTAATATAA6 CATGCTATTA 1740

TCGACTCAGT TTGTCTAATG CrTTCTTTGG TACTTCTTCC TTTTCAACTT CTTCAAAAGT 1800

TTCTACATGA TGACCTTTAT GTGTAATTTT CAAGTATCTA TGCGGTTTAA CATCAAAT6T 1860

AGCAGTATAC ATTAATTGCG TCTCTTTCCC TTCTTTATTA AAAACACTTT GACTATAACT 1920

tcx;gagttga TCATCCATTC TAGTAGACAC CTCTGTCGTT TTAACATAAG AATCATCCTT 1980

CTTTACTAAT GGGTTAAATT GATCTGTTAT ACCATGGGAG TCTATTGTTT TTAAAATGAA 2040

TAAAGCAgCA TAAGCGCCAA TGATAGTAAG TACAAGATAt GTlcATTGTTT TTAAAATCGT 2100

TTCAAAAGA 2109



(2) INFORMATION FOR SEQ ID NO: 417: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 813 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 417:

GTTACATAAA TGAAAACAAA AAAGATAATT TTACTGCTTA TGCTACACCA GAACATAATT 60

ATCAATTTGG TGGTGCTATG ATAGAAAGTG AAAAATTAAG CGAGTTACTA AAGCCAGCCA 120

ATCAGTTAAA ATCACCAGAT GATATAAAAA AAGAACTAAA TAAAAAGAAG AGTCACTAAA 180

GTTAGGAGTT ACTTTAATGT CCAAAAAACA TGTTTTTATA ATTATTGGTG TCATATTGTG 240

TATATGTACA GTTTCTACGG TCATGCATTT TAAAATGAAA TATGATGAAA AAGAAAAACA 300

AAAAGCGATT TACTACAAAG AACAACAAGA ACGTATTACA CTCTATCTTA AGCATAATAC 360

TAAAGAAACG AACACGATTA AATCTGTACA TTTCACAAAC TTGGAAACAA GTCCTATGGG 420

ATCGCCTGAA CATAATTATC AATTTGGTGG CGCTATGATA AAAAGTQAAG GAGTAGATAA 540 

ATTATTAAAA CCAGCACATG AAAGAAAATC ACCAGAAAAA ATCAAAGAAG AATTAGATAA 600 

AAAAGAAGGC CACTAGGGTC TTCTTTATTT TTGATTTAAT CTTCCAATAA TCTATGTCAT 660 

TGCTATCXSAA GGTGTTTCGC AATTAATATA AATCACTTCA TCATCACCAA TACTTCCCCA 720 

GTTTTGTACA GTACATTAAC ACAAACGAAC CACGTTAATT TAAATGGAwT AaTAGTTTGG 780 

CCATTATAAG AACAATATAT ATCGAnTAAC AAT 813 
(2) INFORMATION FOR SEQ ID NO: 418: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 640 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 418: 

AAAAGCAATC GTTGGTGGTG CTAAATTCAT CGGCAACTCA TATGTAAAAG CTGGTCAAAA 60 

TACACTTTAC AAAATGAGAT GGAATCCTGC ACATCCAGGA ACACACCAAT ATGCTACAGA 120 

TGTAGATTGG GCTAACATCA ATGCTAAAAT CATCAAAGGC TACTATQATA AAATTGGCGA 180 

AGTCGGCAAA TACTTCGACA TCCCACAATA TAAATAAGCA ACATGAACAT AGGATCAAAA 240 

GTCATCCCCC ACTATCAATC ATGGGGGATG ACCTTTGATC CCTTTTTTAT ACATACACAA 300 

GCAAAAATAG CGGTGATTGT TTACCATCAA TTTTAACAAT CACCGCTACT TTTGCTTGTA 360 

ATTCATGATT CAATTTTTGT TGTGTGCACA ACGACACTAA ATTATGTGTT TGCTATTGTC 420 

GTGTTACAAC GATATGCGTC GTTGATTTAA CTTATCAAGT AATTGATTTA AATTGTCTAA 480 

TTCGACTTCC GATAAACATT GACATCTTGC TTCAATCAAT TCGCAACGTG CATTATTTAT 540 

TTGTGAAATT AATGTACGTG CTTGATCAGT CAAAATTAAT TCTTTACATC TTAAATCTTC 600 

GCTAGATTGT TGACTTGTGA TGTACCCnnT CAAAACTAAT 640 
(2) INFORMATION FOR SEQ ID NO: 419: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1159 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 419:

TTCTTGGATA CGTAATAAAT TACCTTTTGT CATAGGATTA AACATAGATC CAGGTAAAAC 120

ATAAACATTT CTATTTTGCT CTAATGCAAA ATCTATCGTG ATGTGACTGC CACTTTGTTC 180

CTTAGCCTCA GTAATTAAAA CCCCTTTTGA CAAACCXICTG ATAATTCTAT TGCGCTCAGG 240

AAATCTATAT TTAGCGATTG GTGTATGTGG TGGATATTCA GATATAACTA AACCTTTTTC 300

TTCTATTTTA TTTCTTAATG CTAATGTACT TTTGGGATAA TGTGTTTGAT GGCCAAAGGC 360

TAAAACTGCA ATTGTAGGGA GATTGTATTT TAAAGCTATT TGATGTGCCA TTGCATCAGC 420

TCCTTGAGCA AGGCCGGAAA CAATTGTTAA ATATTTGCTT TTATCATTTG ATAATAAAAA 480

TTCTAAAGAC TGTTGGGTAT AACTTGTAGA ATCTCTTGCA CCTACTACTG CCAAATGATG 540

CATATTATTT ATTAATTTGA TGTTCCCTTT ATAGAAAAGA AGTAATGGAA ATTGATATAT 600

TTCTTTTAAT AGCACTGGGT ATTCAGAATC CATATATGTA ATGTAACTTA CTTTTAATTT 660

CTGCAGTTCT TTAATAATAT CGTTATGATC AAGTTTTATA AAAGCATAGT ATTTACX3TAA 720

TAAATGAACA TTTTCTTCCC TATTCACCCA TTCACATAAA TAACTATCTT TTTTTCTTCC 780

CTCCTCTTTA ATTACATTAG GATATGCCAT TAAAAATTGA TGAATTTGTT TAGTCGAAAA 840

GTGTGCCCAG TATAACTTAA GCAAAAATAG TTTAATCAAT AAATCAACTC CTTTTTGTAA 900

TCATACAAAA TCATATTCTA TTTTTGTTTT ACATTTCTAA TACAAAAACA TTGTCGATGT 960

AATGTTATTT TAAGGAGTAA AAATACTGAC TTAAAAAGTG AAAAGTATGT TGGAAAGAAT 1020

TTAAAATTTC AATTTCCAAC ATACTTTATA ATTAAACCTT ATAAATAAGT TTTGCAAATT 1080

TATATAGAAT TGGTCTTACT GGTTTGATGA AATCACCAAT TAATTCTTCA ACATGTGCAT 1140

TAAAACCCTT TTTAAATT6 1159



(2) INFORMATION FOR SEQ ID NO: 420: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1879 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 420:

TCAATCAGCG ACTACAAGAA GTGCGGGTCT TCAAACAATT GATGTGACAA CACTAAGTGA 60

CCCCACTAAT ATTATCATGG GTATTTTAAT GTTTATAGGA TCTTCGCCAA GTTCGGTTGG 120

TGGCGGTATT CGTACAACAA CTTTCGCTAT TTTAATTTTG TTTTTAATTA ACTTTAGTAA 180

TAATGCCGAT AAAACATCCA TTAAAGTTTA CAATAGAGAA GTACACATTA TGGATATTCA 240 




EP 0 786 519 A2 



ATCAGCTACT GAAAATCGTA AGCTTACATT TTTACAAGTA TTTTTTGAAG TCATGTCTGC 360 

ATTTGGAACT TGTGGACTAT CGCTTGGTGT CACAAGTGAT ATTAGTGATA TTTCTAAGGT 420 

CGTACTAATG ATACTCATGT TTATAGGACG TGTTGGCTTA ATATCATTTA TCATTATGAT 480 

AGCAGGACGT CGAGAACCAG ATAAATTCCA TTATCCAAAA GAACGTATTC AAATAGGATA 540 

ATATAATAGC AATCTAAGTT TAGTTAATGT AGATTTTAAC TGGAACTTAG ATTGCTTTTT 600 

TAGTTTGTAT TTTTAACTTA TTTTATAAGA CQATTGGTTT CGAAAATGGT AAACTAGTAA 660 

CAATGAGAGG TGTAACATGA TGGAAAAAAA TGAAAACATT AATGTAGAGA TTTTAACTAC 720 

GTCAGATATG CATAGTCATT TCTTAAATGG TGATTATGGT TCAAATATTT ATAGAGCTGG 780 

TACTTATGTT AACCAAGTAA GAGCACAAAA TCATCGCGTC ATTTTATTAG ATAGTGGCXK3 840 

AAGTTTAGCT GGCTCGTTAG CGGCCTATTA TTATGCTATT GTTGCACCTT ATAAACGACA 900 

TCCAATGATA AAGTTAATGA ACAGAATGCA TTATGATGCT AGCGGTGTGA GTCCAAGTGA 960 

ATTCAAGTTT GGTTTATCAT TTTTAACTCG TTCAATTGCT TTGGCACGTT TTCCATGGTT 1020 

ATCAGCAAAT ATTGAATACA ATGTTACTAA GGAGCCTTAT TTTTCAACTC CATATTGTAT 1080 

TAAACATTTT GGTGACTTAA AAATTGCTAT CX5TAGGCGTC ACAGCAGATG GTTTAATGGA 1140

AAATGAGTAT TCTGAAATGG AGCAAGATGT ATCTATTGAA AAGACATTAG TGGCATCAAA 1200 

ACGTTGGATT AGATATATCC ATGAAGTTGA AGAGCCAGAT TTTTTGATTG TAATTTATCA 1260 

TGGTGGATTG AATAAAATTA GTAATAGTAC GAAAAATAAA AAGGCAAGTT CGAATGAAGC 1320

TGAAAAATTA ATGGAAGAAC TCGGTGTTAT AGATTTAATG ATTACAGCTC ATCAGCATCA 1380 

AACAATAGTA GGTCAAGATC ATGAAACGTA TTATGTTCAG GCTGGTCAAG ATGCCAAAGA 1440 

GCTTGTACAT CTTTCGATTA ATTTTAAAAA GAGAACAACA ACTTATGATG TTGAAAGCAT 1500 

TGATTCTAAA GTGATTGACT TAAATGAGTA TGAAGAGGAT CAAGAATTAT TAGATTTAAC 1560 

ATTCTATGAT AGAAAAGCAG TGGCTTATTG GTCACAGGAA ATCATAAGTG ATAAAGGTTT 1620 

GATGTTATCA GTAAATGGGT TACAAGATTT AGTCTGTCAA ACACATCCAT TTTCGCAATT 1680 

ATTACATGAT GCAATTCACC TTGCATTTGA TAATGATATA ACATGTGTCC ACGTGCCTAT 1740

GaACGGAGAG AAGGGGTTGA GTGGACAGAT TCGAAATGAA GrTTTGTATC aTGCATACCC 1800 

ATaTCCAGAT aAGCCAtGGG tATGACAATT aGTGGTCAAA ATaTCAAGGT ATnTTGGGGT 1860 

ATAGTTATTC ACCATTAGG 1879 

(2) INFORMATION FOR SEQ ID NO: 421:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2710 base pairs 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 421:

AATTCCCCCT TTTTCCCCAT CCATTTTnCC ATCCACTGGT CCTTACGGGA CCATATTATT 60 

TnAAATTGGA nAAAATTATt TTAAAGaTTA TwACtACTCT TtAATCATTT TrGTGAATTA 120 

AAAAAAGTAG TGCAAAAAGC AAAATATACT TTATACACTA CAAATCATTT ATTTATAATA 180 

AAGTTTCACC AAAAAATGTT CCAACTAATG AAACCGCTTG TTCAGCAGTA TGATTATTAC 240 

TGTCAATCAA TGGATTTACT TCAACTAAAT CCATTGAGGA AATTAAATGT GATTGATGCA 300

GTAATTCCAA TGCAAAATGG CTTTCTCTAT AACTAAGACC ACCCAAAACT CTAGTACCAG 360 

TCCCTGGCGT TTCAAGCGGA TCTAAAGCAT CAACATCTAA AGATAAATGA ACGCCATCAA 420 

CATTGCGTGA CTTCAAATAT TCTATTGTAT TTTCAATTAC TTCCTTTATC CCCAATTTAT 480

CAATATCTGA CATAGTAAAT GTTTTAATAT TATGATCTTT GATAAATTGT CTTTCACCTT 540 

TATCTAAATC TCTCATACCA ATTAGTAOGA TGnTTTCTGG CTTGATTACA TTACnATTTA 600 

ATTCTAAAAG TTCTTTGGGA CCTTCGCCTG TCAAAATCCT TAGAGGCATA CCATGAATAT 660 

TTCCACTTGG TGACTCTTCA GGTATATTTA AATCACCATG TGCATCATAC CAAATAACAC 720 

CTAAATTATT ATAATGTTTA CTTATTGCTG ATACTGAACC TACCGCAATA GAATGATCTC 780 

CACCAAGAAC TAGAGGAAAT CTGTTATTTT CAATTGATGC TGAAACCTCT TTATTTAATT 840 

TTTGATTAAC ATCTATAATT TCATCATAAT TTCTTAATCC TTTTTGTTCA CTATGAAATT 900 

TTTCAATGTT CACAGCAGGt ACCTTAATAT CCCCCTTGTC ATATACATCA AGGTCTAATT 960 

GCTTTAATCT TGAAATTAAT CCAGCATATC TAATTGCTGT TGGTCCTAAA TCAACACCTA 1020 

ATTTTCTTTG TCCAAATGTT GATGGTGCAC CTATAATATC AATTGCTTTT GTCTTTGTCA 1080 

TAAGCGTCCC CCTTTGCTCT ATGTAATTAA AGAATAATGT ATGCGCTTAC CATTATCAAQ 1140 

CAATAGCTAC ACATATAATC TGTTTATCTT ATTACTTCAT AAAAAAAGGT TCTTCATCTT 1200 

TTATGGTGGG AAGGTAAAAC TTCCTGCTTT TTTTAATACA CAAAAAGCGC AATTGCCTCT 1260 

ATAATTTAAA GTGACCAAAC CCAAACTAAA GGAGACAAGT GCGCCTATGT GTAATGATAC 1320

CTTAGAATTA CTAAGAATAA AAGATGAAAA TATAAAATAT ATAAACCAAG AAATTGACGT 1380 

CATTATCAAA GGAAAAAAAG CAACAGTGGT TAATGCTGTA CTAACGTATA AGCCTTCGGC 1440 

CTGTTATTGT TGTGGAGTTA AAAATGAAGG ACAAATTCAT AAACATGGTA AGCGTGTTTC 1500

TCGTATTACT TTACTTAAAA CTCAAGGGTA TAACACATAC CTCAACTTAG CTAAACAACG 1560 

GTGTTTTATC TCAAGATGTG TTACTCAAAA AGTTATAGAA GAAGCTACTA AAGTTAAAAC 1680

AGAGATTGAT ACTGCAGAAG ATAACTGTAT CTCTCCATCT ACTGTAAGTC GTATTAGAAC 1740

TAAAGCGGCT AATTCATTAC GAATTAAACC CTTTAATTGT TTGCCAGAAC ACATCGCTAT 1800

GGATGAATTT AAAAGCXSTTA AAAATGTAAC TGGATCAAT6 AGTTTCATTT TTATAGATAA 1860

TGATACTCAT GATGTTATAG ATATTTTAGA AAATAGAACT ACAAGATTCT TGCGTGCCTA 1920

TTTCGAGCGA TTCGATTTAA AAAATCGACA ACAAGTTAAG ACGGTTACTA TTGACATGTA 1980

TGAACCCTAT GTCCGATTAT TTCGCGACCT ATTTCCTAAT GCAGCTATTA TTTTTGACAG 2040

ATTCCATATC GTTCAACATT TAAATAGAGA ACTTAATAAG TATCGTGTAC AAGTTATGAA 2100

TGAATACCGT AATAAAAAAG GACCTGATTA TACAATTTTT AAGAATAACT GGAAAGTCCT 2160

ATTGATGGAT ACTAGTAAAA CCATATTTAG TAAATAGAGA TGGAATAAAT CTTTTMGGC 2220

TTATAAACGC TCATCTGACA TTGTAGAATT CATGCTTTCA AAAGACGATA TACTACGACA 2280

CTCCTACGAA CTTGTCCAAG GATTACGAAA AGACCTAAGG TTATGTAATT GGCCTAAATT 2340

TATTAATCGT TTAAATTCAG TTAGTAAAAA GTCTGTGAGT AAGGGTGTAT GGAAAGTGGT 2400

TAAATATTAT AGAAAACATC AAAGGATGTT AAGAAATACA ATTTATTACC CAGCATTTAA 2460

TAATGGTGCT ATAGAAGGAA TTAATAATAA GATAAAATTA ATCAAGTGAA TTTCTTTTGG 2520

TTACAGAAAT TTCAACAACT TTAAAGCACG TATAATGATG aTTTTCAGCT TGTACAAAGG 2580

AGaAAAAAAG AAGACAACCA AGCCCAATAA TGGACTGGCC GCCTAATAwA nGGGskCTAA 2640

AAGTTgTATT TTTAAAAATA GTTCCTTTAA ATTATATACC CziCCACATTT GGGGGAGGAC 2700

CTAAAAAAGC 2710



(2) INFORMATION FOR SEQ ID NO: 422:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1027 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 422: 
CATTTTAATT GtTaAAATTC CAAAAAAtGT ArGTGGATTw AAAGrAAACC CtGTGTTTTT 60

AAAAGGtACC ATTaAAATAg TTCCGATTGT ACCATCCCAC GTGAAAtTTT TAgTAtTGCC 120 
GGTGaGAGAA AATGCCAATG CAATCATCGC AGTTCATAAT CATCCATCCG GTGATGTAAC 180 

GCCCTCACAA GAAGATATCA TAACAACAAT GAGGTTGAAG GAGTGTGGTT TGATTTTAGG 240 

GGGTTACTTT GATGAAAATG ATTGAAGTTG ATAGAATTAA TGACOTATCT TGTGTATAAT 360

ACCTAOGAAG TACTTTCATT GGAGGAAAAA TAGTGACTCT ATTTATTATT ATCGGGGTTC 420 

TCGTGCCAAT GGTTTATACC ATGCAGTTAA ATATTAAAAA TGAACCTGTa ACAAAGCGCA 480

ATCTTTIAAT AACATTAGCT TTATCTACGT TAGGTATTTT AGTAACCGOG TTAGCAGGTG 540 

TAATOGTTAC GAAACAAGCT TTTCCTTTAT TAAGTGTAGC AATTGGCTCA ATTTTTACTG 600 

GAATCGTTTG GGGCCTTTTA CTAAGTGGTA GcTACX^CGCT GATACGATTT TTATCTAACG 660

CATITGGGCG TAAGTAATAG TAATCTGTTC ATCAAGTAGT ATCCGTGCTT GAAAACAAAC 720 

TAAAACTCCT AATGTGGAAC TAGTTAAAAA ATTCTAAACT AGTACATTAG GAGTTTTGTT 780 

ATGCAGAATA AATTTAATTG TTAAATTGAA AGTGCGGTAA AAATCCACTA TTTATTTGAA 840 

AAAAATCGAQ CAACCAAATT AAACCTTGAT ATCCTAAGTA AATACATAAT AAAACAAGTC 900 

CGACATAAAT TAAAAATCGC AAAATAGAAA GTCCAACTCT AAAAAGGAGG ATGACTAATA 960 

GTGCTATTAA AATAATTGTt AGTATACTCA ATGCTACAAA CCTCCTArTA CGCTTTTAAA 1020 

TCCATAA 1027 
(2) INFORMATION FOR SEQ ID NO: 423: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 569 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 423:

CGGCACATTT TTAATTTATA CATATTTTAA AACTAAGTAA CAGTTTGAAG AAATCGTAGT 60

TCAATAATGT TAATTGTGAA AATGTATATA AACATAAAAA AATCATGTAT AATATATGTT 120 

GTTAATTAAA CAGTTCGAAA GCGAGATGAC ATTATGGGAC GTAAATGGAA TAACATTAAA 180 

GAAAAAAAGG CCCAAAAAGA TAAAAACACA AGTAGAATAT ATGCGAAATT TGGTAAGGAG 240 

ATTTATGTTG CAGCAAAATC TGGTGAACCC AATCCAGAAT CTAACCAAGC TTTAAGGTTG 300 

GT6CTTGAAC GCGCTAAGAC ATATTCAGTG CCGAATCATA TTATTGAAAA AGCAATAGAT 360 

AAAGCTAAGG GTGCTGGAGA CGAAAACTTT GATCACCTAA GATATGAAGG ATTTGGCCCA 420 

AGCGGATCAA TGCTAATTGT TGATGCGTTA ACAAATAATG TAAATCGTAC TGCCTCTGAT 480 

GTCCGAGCTG CTTTTGGTAA AAACGGCGGT AATATGGGTG TATCTGGATC AGTTGCTTAT 540 

ATGTTGTCAT GTGGCCACAT TTGGTATTG 569 

(2) INFORMATION FOR SEQ ID NO: 424:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3264 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 424: 

TGGACCTATA AAATGGATGC ATCCAAATAT GGATGnGGGG TAnGGCGGGa AAATAGGaGG 60

TTATATAGTA AGTCATTTAA GTCGAATAGC CATCCTTTTT AAAATGAAAA AAATAGAAAG 120 

CTCAATAGTT TGTTAAAGCC TTTAAATAGC GTCGTTACGC AATTTTAGAA TGCTAAAAAT 180 

TGTCACAAAT TTCAAATTAC GTGCTAAAGC TTGAGATATC AATATTTATT GGCGATAGAG 240

TGTAATTTGA CTCCGCTTAC ATCTCAGTTT TATGTTTGTA ATTTGGTAGC ATAATATTAT 300 

AATAAAATAA AATTGTTAAT CTTTAATTTC AGTATAGATA TTTTTACGTG TAGTCACGTG 360 

TAAAATAAAT TCAATTAGGT TAGGAGACAT AATTATGAAA ACATTTGGTA AAAAGGTTGT 420 

ATTAATCGGA GATGGATCTG TAGGATCAAG CTATGCCTTT GCAATGGTTA CGCAAGGTGT 480

TGCTGATGAA TTTGTAATTA TTGACATTGC AAAAGACAAA GTAAAAGCAG ATGTTCAAGA 540 

TTTAAACCAT GGTACAGTCC ACAGTCCTTC ACCAGTTGAT GTGAAAGCAG GTGAATACGA 600 

aGaCTGTAAA GATGCAGATT TAGTTGTTAT TACAGCTGGT GCACCTCAAA AGCCAGGTGA 660

AACACGTTTA CAATTAGTTG AAAAAAATAC TAAGATTATG AAGAGCATCG TTAAGAGTGT 720 

TATGGATAGT GGCTTTGATG GATATTTCTT AATCGCGGCA AACCCTGTAG ACATTTTAAC 780 

AAGATTTGTA AAAGAATATA CTGGATTACC AGCAGAGCGT GTTATCGGTT CAGGTACTGT 840 

ATTGGACAGT GCACGTTTAC AATATTTAAT TAGCCAAGAA CTTGGTGTTG CACCTTCAAG 900

TGTTGACGCT AGTATTATTG GCGAGCATGG TGATACTGAA CTTGCAGTTT GGTCACAAGC 960 

AAATGTA6CA GGTATTTCAG TATATGACAC ATTAAAAGAA CAAACTGGTA GCGAAgcTAA 1020 

AGCGGAAGAA ATTTATGTGA ATACACGTGA CGCTGCTTAT GAAATTATCC AAGCTAAAGG 1080

GTCAACATAC TATGGTATTG CATTAGCATT GATGCGCATT TCAAAAGCCA TTTTAAATAA 1140 

TGAAAATAAT GTCTTAAATG TTTCTATACA ATTAGATGGT CAATATGGTG GTCACAAAGG 1200 

CQTTTACCTA GGTGTACCAA CATTAGTTAA CCAACATGGC GCAGTTAAAA TTTATGAAAT 1260

GCCATTAAGT GCCGAAGAAC AAGCGTTGTT CGATAAATCT GTTAAAACAT TAGAAGATAC 1320 

ATTTGATTCA ATTAAATATT TATTAGAAGA CTAAGCCTAT TTTAAGTATT AATTAGAAAT 1380 

ATATTAATGG TAAGAGGATC TATTAGTGTT GCAGTAACAC GTGGCACTGA TAGGTCCATT 1440 

TATGACTCGT GAAAATTTAG TGACATQAAT AAAAATGTTG AGGCGTCATT GAGTATAAAG 1560

GAAAGTAGTT CTGCATTAAT CACGAaGTAG AGCATGACmA CGAGGAATAA CTATAGGGaG 1620

ATGGT7TTGG AATGACX»AtG TCTTGTATCA ACATGGTACA TTAGGTACGT TAATGGCTGG 1680

CTTACTAGAA GGCACAGCTA CAATTAATGA ATTGTTAGAA CATGGGAATT TAGGGATTGC 1740

AACGTTAACA GGGTCTGATG GCGAAtAATA TTTTTAGACG GAAAGGCATA TCATGCTAAC 1800

GAGCATAAAG AATTTATAGA ATTAAAAGGC GATGAGAAAG TACCGTATGC ATCGATTACT 1860

AATTTTAAAG CGAGTAAGAC ATTTCCATTG CAACAATTAT CACAAGATGA TGTATTTGCA 1920

CAAATTAAAA ATGAAATGTT AAGTGAGAAT TTATTTTCGG CTGTTAAAAT TTATGGCACA 1980

TTTAAACATA TGCATGTACX3 AATGATGCCT GCTCAGCAAC CGCCATATAC ACGTTTGATT 2040

GATTCAGCAC GCAGACAACC TGAGGAAAAA AGACAAGATA TTCGTGGTGC CATTGTTGGA 2100

TTTTTTACAC CAGAATTATT TCATGGCGTA GGGTCTGCTG GTTTTCATAT ACATTTTGCG 2160

GATGATGAAA GAGCTTATGG TGGACATGTT CTTGACTTTG AAGTGGATGA CGTTGTCGTT 2220

GAGATACAAA ACTTTGAAAC ATTCCAACAA CATTTCCCGG TAAATAACGA GACGTTTGTT 2280

AAAGCGAAAA TAGACTATAA AGATGTGGCA GAAGAAATTA GA6AAGCTGA ATAGTCTAAT 2340

ATAATTAAAA GACCTTAGCG ATATTAGGAA CAGGTGGTTC TAAATGCATC GTTAAGGTCT 2400

TTTTATATTA TATATGTGCT TACATATTTT GTTGATACGC GCGTAAAAAG CTATAAATAG 2460

TATCAAGATA TGGGGTGTCA AGAATGTGTT CACGTGCACG TCGATAAATA AAGCCTTGAA 2520

TCGCTTCGAC TTCAAGAGGT TGCTGATGCG CAATGTCGTA ATACATGCTC GTTCCCATTT 2580

CGTCGGGATA ACCTTGATAT ATAGTCATAA TAGTATCGAC AGTTTGTTCT GAAAAGTTTA 2640

ACCCCTCTGC CTGTGCAACA CGACAACCAT CTAGTAATAG CTGTCTACAC AGTGTACGTA 2700

TTTCAGGATT ATGCATGATT 6CAACGGTCT GTCTACCGAG TGCTGTGATA GAATTAATGC 2760

CCAAGTTAAC TAACAATTTA TACCAAATAG CTTOTTQAAT ATTTGCTTCT AATACAATGT 2820

CGATTTGACT GTCTTGGACC AAATCTCTAA ATTGTCGAGT TAATGCATTA TCTTGTATAC 2880

GTAATTGATA ATCTCTGAAG TGCGTAACAA CATCGCCTTT CTTTTGACCA CTTATATAGA 2940

CAACTGCTTG GCATACGTTT TTAAATGAAA TTQACCATAA CCATTTTGGG 3000

CTAAAATGAT GAGCGTGTCC TCATGTGCAA GATGAGTTAA ATGAGGAATA ACTGCATCAA 3060

GTTGATGTGT TTTGACTGCA ATAATAATAA CATCAAATGT GTTTGTGACA TCTTCATAAC 3120

CTTTCACAAC AATATCTTGT GCAGGTGCAT GTGGTACAGT ATAATATGTT AITGTTTTGG 3180

CGTGTCTCCC GATAAGCGTT GTATGAGGCA ATGATTGTTG TAATTCATAG GCAATAGTTG 3240

(2) INFORMATION FOR SEQ ID NO: 425:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 796 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 425:

AGAACGAACC TTTAATACGT ACGTGTCGAC CACGAACTGT CATGACAAAT TGATAAAACG 60

AATAGTTTTT CATTAGTTCA TTGTCACATC AATCACTTTT GTkTCACCTT TAATCACAGC 120

ATTTTCATCA TAAATATTAA TTGAAGCTGC TTGATCAGTG TTAGTAATTA TAATTGGTGA 180

AATTACAGAT TTAGCGTTAT TATTAATATA TTCAAGGTTG AATCTTACTA ATGGATCTCC 240

GACGTTAACT TCGTCACCAC TAGACACTAA CACTTCAAAT CCTTCACCGT CTAATTGAAC 300

TGTGTCTAAA CCGATATGAA CTAATAATTC TAATCCGTTA TCTGCTTTTA ACCCAATCGC 360

ATGCTTAGTT GGAAAGACAT TGTCAACACG TCCTGCAATT GGAGACACAA CTTCTCCTTC 420

AGTTGGATTA ATACCAAAAC CTTCGCCCAT CArmTTGT GCGAATACAG GATCTGGAAT 480

ATCTTCAATT TTCACGAATT CTCCAGTTAA TGGTGCATAA ATTGCGATAT CTTTCTGAAC 540

TTCTTTGCCT TTTCCGAATA ATTTTTTAAA CATACTTTCC ACTCCTACTT ATCAAAATGT 600

GATATTAAAT CGCCATAACC CAATTCTTCT AACTTTTCAT ATGGAATAAA TTGAATTGCA 660

GCGGAATTGA TACAGTATCT TAAGCCGCCA CTTTCTTTAG GTCCATCATT AAAGACATGT 720

CCTAAATGAC TATTTGATTC TTCTGAACGC ACTTCAGTTC TCAACATACC AAATGATTTG 780

TCGACTAATT CTATAA 796



(2) INFORMATION FOR SEQ ID NO: 426: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3393 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 426:

ATCCAGCAAA TCTTTCTTAT CACGTTCTGT AATTTTCTTA GTATCCATCT TGATTAGCTT 60 

TGATAACTTT TCAGCCGTAT CCAACATTTC CGATTGTGTT GTTTTTCGAC CCCTAGTATA 120 

TGTAATAGCC ATTTTAGAAG CATTATCAAC TAAAACTTTC CCATTTCTGT CTAAAATAOG 180 
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ATGTGAGCCT TGTGCGATTT GTAAATAACC TAAACGTA6T ACTAGTACTG CAAAAATAAA 300 

TACAATCACA CCAAATATAA AGTTAATTCT CTTGTTAATT GTATTTTGAA CGATTTCATC 360 

ATTTGATTTT TCTTTTAGTC TTTTTAACAA AACTACCTAC CTCTATTCAA AGTCTTTCAC 420

TTTAAATCAT ATATGAATTT AQAAATTATT TCTATCTTTT TGACAAAAAA ATAACGGTCT 480 

CATTtAAGAG ACCXSAACAAG TAATCATACT TTATTTTGTr GCATTATATA ATTCGTCAAC 540 

TTTTTCCCAG TTAACTACAT TCCAAAATGC GCCAATGTAG TCAGGGCGTT TGTTTTGATA 600 

TTTTAGGTAA TAAGCGTGTT CCCATACGTC TAAACCTAAA ATAGGTGTTT TACCCTCAGT 660 

TAATGGATTA TCTTGGTTTG GTGTAGTCAC AATTTCTAAC TGGCCATTGT TTACGACTAA 720 

CCAAGCCCAA CCTGAACCAA AGCGTGCAGC TGCTTTGTCA GCAAATTCTT TTTTAAATTC 780 

TTCTAAAGAA CCCCATTGTT CTTTAATTTT TTCTACTACA GTACCTTTTT CTTCTGAGTT 840 
TGGTGAAAGT AACTCCCAGA ATAATGAATG GTTTAAATGT CCACCGCCAT TATTACGTAC 900 

AGCAGTTTGG ATGTTAGCTG GTACACTGTC TAAATTAGCA ACAATTTCTT CAATAGATTT 960 

AGATTCTAAA TCTGTACCTT CTACTGCAGC ATTTAATTTC GTAACATACG TGTTATGATG 1020 

TCTGTCATGG TGAATTTCCA TAGTTTCTTT GTCAAAATGT GGTTCTAATG CATCAAATGC 1080 

GTATGGTAAT TTTGGTAATT CAAAAGCCAT AAATAATCAT CCTCCTAAAA TGTCTGTAAG 1140 

TAAATAATAA CAAGCAGTGA CTGGTTCAAC AAAGAATTTG CTTAAATTCT ACTACTTATT 1200 

ATTTTCTCTA CTCATTTAAT ATAACTCAAA TCAAAAATAA TTAAACATTT TGTATATAAA 1260

AAGTTAACAG ATTTGCCATA AAATCATACG AACGGAGTAT GAAATGAACC TTTATCTTCT 1320 

ATAATTTAAA AAATGAGATT TATGCATACA TCGGACCAAA TGTGCATAAA TCTCATTTCT 1380 

TTATATTAAT CTTGGcAAGA CTCACATACA CCATAAACTT CAAGTTTGTG TTTGTGAATA 1440

TTAACACCAG GTAGTGATAA TTTTATCTGA TCTATTGGAC AATAATCTAT TACCTTTGTA 1500 

TCTCCAGACT TTTCACAGAT AAAATGATGA TGATGATGGT TTGTACAAGC GATTCTAAAC 1560 

TTCATTTCAC CATCAAGTTC TGTATTTTCA ATAATTCCTA AATCTTTAAA TAAGTQCAGG 1620 

TTTCTATATA TTGTGTCGAA TGAAATTCCA GGATAATTTT CATCCATAAC TTGTTGTATA 1680 

TACTTTGCGT TTATATACTT ATCTTCTTCG ACAAAAATAT CTAACATATC TTTACX5TTTA 1740 

TCTGTATATT TTAAACCGTT CTCTTTTAAA ATTTTAATAG CATCATTTGT ATTCATTGAT 1800 

ATTAGCTCCC TTTTTAAACT TCATTCGCAT TTTCTGATAA GCCATTGTAA TCATAAGTAA 1860 

AATAACAAGT AGAACTACAA TTACACCACC CGGAGAAATG TCCATATAGA AAGCTAGGAC 1920 

TAAGCCTAAT ATTACTGATA ATTCACCTAA AAATACACTT AGTAATATCA ATTGCTTAAA 1980 
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TATCCCTACA ACAC6CATTG AGGCAGAAAT AACCATCGCT ACAATAACAA TAAATAAAAA 2100 

TTGAATCCAT TTAGGAATGC CAATGACTTT ACTATATTCX: TCATCAAATG ACAATATAAA 2160 

TAATTCTTTA TAAAACAATG TAATAAACAG AACAACTATG ATGGCAATGA CAATAATCGT 2220

TGTTAAATCA CTTATATTCA CTGCGCTTAT TGAGCCAAAT AGCAATCCAA CAATTTCTTG 2280 

ATTGAACCCA TCAGCTAATG AAATGAAGAT 7GCACTCAAG GCGATACCAG CACTCATTAT 2340 

AATTGGAATA GCAATTTCTT GGTAAGCAGT GTATGACGTT CTTAATTTTT CAATTAGAAG 2400 

CGCACCTACT ATTGCGAATA AGATTCCAAA CCACATTGGA TTAATAAATA CTAGTGTTGG 2460 

CATAATAGTA AGTAAAAACA TACCGAAAGA TATACCACCT AAAGTTACAT GACTTAGAGC 2520 

ATCAGCTWTA AGTGATAGTC GTCTAACAAc GATAAAAGCA CCGaTTAGAG GCGCAATAAA 2580 

ACCTAtCAAG ATAcCACTAA TTAAAGAGTA CCTCATAAAA TCAAAATTCA ATAATGCATC 2640 

TATCAATTGT GACACGCCTT TCCATTTTAA ATAAACTCAA ATCTTTATTA ATTACAACAT 2700 

TCTCGATTAT GCTGATGATC GACAAAACGT ACAGGATGTC CATAAATTTT TGAAATTTCA 2760 

ACTTCATCAA GTGATTTAAA CTCATCA6TT GTACCATGGA AATGCAAATG CTTATTTAAA 2820 

CATGCTACTT CAGTAGCAGT ATCTGCTACA ACACCGATAT CATGAGTAAC TAAGATAATG 2880 

GTGATACCTT CTTGTTTTAA TTGATCTAAA GTATTATAAA ATTCACTTAC ATGTTTTGCA 2940 

TCAATACCAT TCGTTGGTTC ATCAAGTACT AATACTGcAG GTTCTGAAAT CAATGCTCGA 3000 

GCAATCATTa CACGTTGTTG TTGACCACCT GATAATTCTG CTATATTTTT ATGAATTAAA 3060 

TCACTTATAT TCAGTCTTTC TAGTACTTTA ATCACTTTTT CATTATCTTT GCTATTAAAT 3120 

GTTTGGAAAA GACGTTTTGT CTTTGTTAAT CCGCTTAAAA CAACTTCTTT AACACTTGCT 3180 

GGGAAACCTG AATTAAAGGC ATTTGCTTTT TGTGATACAT AGCTTAATTT AATTQATGTT 3240

TcTTATTTTT AAAATCAATA CCTTCAACAA AAATCTCACC ACTTTGTAAA GGTAATAACC 3300 

CTAGAATCAA CTTCAATAAT GkTGATTTAC CAGCACCATT TGGkCCAACA AwTGCtAAAA 3360 

ATTCACCTTT ATTTATTTTG AATGniinTAT ATT 3393

(2) INFORMATION FOR SEQ ID NO: 427: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1123 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 427:
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5 






CCCTGTTTCT TAGCCTTTTC GTATATGCGT TCATAAGAAG AATTCm^TT GATTTCAGAT

TGTAAAGCAC TGTTTTCACT TGATTGTTTA TCTATTTTAT AATCTAAATC TGCAATCTTT 180

CCTCGCGTAT CATACGCATC CATTTTTAAA GATAGCATAT AAATACTTAA CATAGCAATT 240

ACAGTAATCA AAGTTATGTA TAAAACTTTT TCAAATTTAG *ITAATTGTAC AACCACTTTT 300

CrCGAAACAG TCTTCTTTTC GGGCTTAGTT TGTGGTTGTT GCTTCGGTAT ACTATTATAA 360

ACTTGTTCGT CATATGGTTG GTACAcmrr TCTACAGCCA TTATAAATTG CTCCTTATTT 420

AAGTATTTCA GCTACACGTA ATTTCGCGCT TCGTGCTCTG TTATTGTCAT CTAAATCTTC 480

TTCTGTAGCG GTAATCGGTT TACGATTAAC ACGCTTTAAC TTAGGTGTAT ATGCTTCTGG 540

TATAACTGGT AATCCTCTTG GTACCTCTGG Af OTnrTT*f^ 2v TATTCTTGGA ACACCTGTTT 600

ACATAAACGA TCTTCTAAAG AATGGAAAGT C3ATTAC*a2AA ATCCTGCCAT CTACTTTCAC 660

TAATTCAATC GCTTGTTCTA TTGAATCTTC AAAAGCTTGAr AATTCATCGT TTACTGCAAT 720

TC6TAGTGCT TGAAATACTC GTTTTGCAGG TTTCTTCTTG CTTTTGCAGG 780

AATACCTTCT T^TATAATGT CAACTAATTC GTTATTGGTT GTTGTTCGCG 840

ATGTGCTTCG ATTCTTCGAG CTATCTGTTT TGAAAATTTC TCCTCGCCAT AGCGATAAAA 900

AATCTTCACT AACGCTTCAT ATGACCAATT GTTAACAATT TCATATGCTG TTAGTTCTTG 960

TGTTTGGTCC ATACGCATGT CTAATGTTGC 6TCATGGTGA TAACTGAATC CTCGTTCTGG 1020

AATGTCGAGT TGTGGGCTTG AAACACCCAA GTCGTAATAA ATTCCATCTA CTTTTTCAAT 1080

GTTTAAGTCT TTTAATATTT GAGTTAATTC ACGGAAGTTG CTA 1123



(2) INFORMATION FOR SEQ ID NO: 428:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 838 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 428:
TATATCCTCC AnACCATAAA AATGGAATTA TAGTGAAGAC CGCTATGATC CAGGATGACC 60 

ACGTATTATA GAATCTTTCg ATAGAGAAAC GAATCaACGT ATCCgACATC aCGAAATGAA 120 
ATTAGAAGAT TATAAAGATG AGTTAAGAAG AGAATATCTA AAACAATCT6 ACAGAATTGA 180 
AGGAGATGAA TAAGCGTGGG ACTTGATTTT AGTGGTTTAC CAGATTTAGC AGTATTGGAA 240 

CAAAT6AAGG AAAAAGAACA GATTAGTGAG GTTATTGCGC CTGAACATGT TCGTATGCAT 300 
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CATTTCAAAA AATTTGAAGA TGATTTTAAA AATGOGGCAC AAGGGGCTTG GGTGAAAAAT 420 

GCCACAGACG AATTAAAAGA TATTAGTAAT GATTTAGAAA AAATTCAAGA TATTAAAGTA 480 

TAAAAAGGTA TTAAGAAACA CAATAAGTAT ATAATCCATT TAATAATAAA TGAATTATAT 540 

AGTTCATAAT TTCGACTATA AGTGGCTATT AGCCATTATT TTTCGGGATC TATGTCAAAT 600 

CGGACTAATG AATTCAATAA TGGAAGTTAA GCAACCAATC TTTGTTTAAC TTCTTTTTTA 660 

TTTTTTGGAA AATAAAGTTT TGAACATAAT AAAATTTGAT TATGTTTTAA CGAATTTTGA 720 

TGTTTCTTGA ACTATATATC ATCTAGTCGT CATTTACAAT TGGTAAATAT GACTTCAAAC 780 

TGkATGAAGG TGATGGCGAT TAAAAGGCTC ATCCGTAGGT TCTAAAGAAC TAGAnTTT 838 
(2) INFORMATION FOR SEQ ID NO: 429: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1150 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 429:

nGTGTTCAAA TAATAGAATG GCTAATACCC CAACAATAAA TAGGAATAAA TATGTTGATG 60 

TCGTAATTTT AGTAAATAAT GAAAATCTAA AATTAGTAAC CCTATTTTGA ATATAAGCTT 120 

TAACTTCTAA TAATACTGGG AAGCCGATTG ATCCTAAAAT TATTAAAAAT ATAACTATCG 180 

CTTGTACAAA ATAATCATGT GCATAAGGAA TTAATGACTT ACCTGTAATA TCTAATCCAC 240 

CATTGGTAGT GGCAGAAATA GAAACAAAAA CACCTTGCAT GATTGCATAT TTTAAATCTG 300 

GATTATCTCG ATAAAAGTAA AATGCTAATA ACATGGCTCC TACGAGTTCG ATTACAAATA 360 

TTGATTTTAC AATATCAATA ATCAATTTGA CGGTACCACT CATTGTGTTT TTGTTATTAT 420 

CTAACATAAT TAACTGACGT TCTCTAATTC CAATATGTTT ACCTAGTACC ACCCATAACA 480 

TCGTACCAAT TGCCATGACG CCAATTCCAC CAATATTTAA TATCACGAGG ATAATTAATT 540 

GTCCAAATGT AGAATAGGTA TCGACAATAC TTATCGGAGA CAATCCAGTA ACACTAATTC 600 

CGGATACGGC AACAAATAAT GTGTCAATTG GATTTACTTC TACACCTGGT TTATGaACAT 660 

ACGGTAAGTT TAATAATAAA AATGCAATGA CAATTGCGAC GATATAGTAC AATACAATAC 720 

CTTGTTGAGG GCTTGATCTT TTTAAAAACT GGCTAAAAAT TGACACGTTG TTCACCTCAA 780 

CATTATTTTA GTTTAATATT AATTTCTTTC TCTTTACCAT CTCTATAAAT CTTCGCTGTA 840 

ATTGATTTCA AGTCATCTTT ATGACTAAAT ATAATCTGCC TAAAGCGTAA ATCATCTTCT 900 
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AAACCATTGT TGTCAACTTC ATCTACAACA ACACCGTTCT TAACTTTTCC TGGCAATTTA 1020 

ACTGCTTGTC TTTCAAAACT ATTTAGACTG ACAATATTCT TCATTTTAAC ACCTACATCG 1080 

GGATAGTCAA TTTTACCTTT tGTTTCTAAA TCTTTTACAA TCTTTtGTAC TTCATTAaCA 1140 

GGtATXXSCAA 1150 



(2) INFORMATION FOR SEQ ID NO: 430: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 797 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 430:


AAGAATATTT AGGATTTTAT GTTTCGCAAC ACCCAGTAGA TAAAAAGTTT GTTGCCAAAC 60

AATATTTAAC GATATTIAAA TTGAGTAACG CGCAGAATTA TAAACCTATA TTAGTACAGT 120

TTGATAAAGT TAAACAAATT CGAACTAAAA ATGGTCAAAA TATGGCATTC GTCACATTAA 180

ATGATGGCAT TGAAACTTTA GATGGTGTGA TTTTCCCTAA TCAGTTTAAA AAGTACGAAG 240

AGTTGTTATC ACATAATGAC TTGTTTATAG TTAGCGGGAA ATTTGACCAT AGAAAGCAAC 300

AACGTCAACT AATTATAAAT GAGATTCAGA CATTAGCCAC TTTTGAAGAA CAAAAATTAG 360

CATTTGCCAA ACAAATTATA ATTAGAAATA AATCACAAAT AGATATGTTT GAAGAGATGA 420

TTAAAGCTAC GAAAGAGAAT GCTAATGATG TTGTGTTATC CTTTTATGAT GAAACGATTA 480

AACAAATGAC TACTTTAGGC TATATTAATC AAAAAGATAG TATGTTTAAT AATTTTATAC 540

AATCCTTTAA CCCTAGTGAT ATTAGGCTTA TATAACTTTy ATCAACTAAT AAATTATGAT 600

ATAGTAAACT GATGGTTAGA TATTTyTyAA CCATCAGTTT GCGTktATAT TAGTTTTTTA 660

TGCTTATTAT TTTTATGAGT TTCACTTTAC ATTATTGATT AATCCATAAG AATAATTAGC 720

ACAAAAAGCA 6TATACATAA ATTGAGTAAA GAATTTTGTC GATATAACCG nGCGGAAAAA 780

TAATAACnTT TTGGATn 797



(2) INFORMATION FOR SEQ ID NO: 431: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1466 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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TTTCCAAGAG AAGTTGCGGA AGTAATTAAT AAAACX5CATC ATAATAAATT GGTCATTTCG 60 

ATGATTTCGk CACAAATCGA TGCGGATAGA ATGGATTATT TACAACGTGA TGCGTATTTC 120 

ACAGGTGTAT CATATGGTGC TTTTGATATG GAGCGTATTT TAAGATTAAT GCGACCTTCT 180 

AAAGATGAAG TACTAATCAA AGAAAGTGGT ATGCATGCAG TTGAAAACTT TATTATGAGT 240 

CGTTATCAAA TGTATTGGCA AATTTACTTC CACCCAGTTA GTCGTGGTGG AGAAGTGCTG 300 

CTTAATAATT GtTTGAAACG CGCAAAACAG CTTTATAATG AAGGCTATGA ATTTAAGTTG 360 

CATCCACATG ATTTTATTCC ATTTTTTGAA GAGACAGTTA CGATTGAACA ATATGTTGAA 420 

CTCGATGAAG CGGTAGTTAC GTATTATTTG GAAAAATGGA CAAAAGAAGA TGATGCTATT 480 

TTAAGTGATT TAGCAAGTCG ATTTATTTUIT CGAGACTTAT TTAAATATAT TCCATTTGAT 540 

GGCTCAATTA TTACAATATC AGAACTGCAA GAACTGTTTG AAGCAGGTGG TATTAATCCA 600 

GATTATTATT TTGTGAGTGA AGCATTTTCT GATTTGCCAT ATGACTATGA TCGACCGGGG 660

TCAAATCGCA AACCGATTCA TTTATTAAGA CAAGATGGTA CXsATTAGAGA AATAAGCAAT 720 

CAATCATTAG TCATTCATAG TATTACAGGC ATTAATCGCC AAGACTATAA ATTATATTAT 780 

CCTAGAGAAA TGGTTOCAAA GATTAAAGAT AAGACAATTA GAGAAGCTAT TGAAAATTTG 840 

ATTAATGAGC TTAATTAAAC AGGGCTAAAA TTGTTATCX5T TAAATATGGA GGTTATATCA 900 

TTGTCTGAGA AAAAAGGCTT TAATTTTAAT ATCATAAAAA ATGACCCTCT AGATGGTCAT 960 

AAAGGTACAA ATATTGGTTC AATTAGCTTA GACAATATTG CACCAGTTTT TATCGATGTT 1020 

GCTAACAAAG AAGCATTTAT TGATATTGGA GGCATGCATG CTCGTGCCAA AGTTGAAAAA 1080 

GGTGTGAAAT GGATTACTGA TAAAGCTGCT GTTGAAGGCG ATGAAGCTAA AGaATATTGG 1140

TTGTGTTGGG TAACAACAGA ACGTAATGAA CAAGGACCAT ATTACGCTGG TTTAACAGCG 1200 

TGCTATTTAT TAGTGAATAA AGCAATTCGT CGTGGTTATA AAAGTATGCC TGAACATGTT 1260 

AATATGATGG ATAAATCAAT GAAACATCAT ATTATCATAG ATCAAATTGG TGACGAGAAT 1320

AAAGCTATTT TAAAAGACTT TTTAATGAAC CATGATGAAG GTATGTGGAA GCATTCTTCT 1380 

GATGCTTTAC ATCAAGCATT TAATTAAATA TTAGAAACTA AAATTTCCCA ATTAATCTAT 1440 

AAAGATATGA TTCATTTCTC AATGAC 1466 
(2) INFORMATION FOR SEQ ID NO: 432: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2304 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 432: 

TTGTACQAAT ACAAAGTTGC ATTAATATAG AATCATTTAT GCTTTTTAAT ATCACAAAAC 60 

AAATTGATTA AACTAGCTAA AATAGTCAAA ATTGGAATCA AGATTACATA AATATCATAA 120 

CCTCTAGATA TTAGTATACA TAATATAATT AGCGACCCTA TTATAATTGC TAAATTATAA 180 

GCCAAACTAA TTTTATATTT CATTTCAATT CCCACCTTAA TAGCATTTAT TAAAATTCTA 240 

TAATGTTTAC ATAGACTTCT TAACAAACTT TAACCCAGCT AAGCTAAGTA TTGAAATAAT 300 

TACACCTCCT AATATAAATA TATTACTTAA GCTTAGTAAA GGTATAATTA TACTTATCAG 360 

TCCTAAAGAC AATGTATCCG CTOcATAATT CGaTGtAGAT QAGATACTAA ATACTTTCCC 420 

CATCAAATGA TTTGGCGTTT TTATTTGAAT GGCAACTGAT CTTGTTAGTC CCTCTATAGA 480 

TTGTCCAAGT CCCAATAATG TTGCACCTAT ATATAATATC GCCACACTTG GAAACACATT 540 

AATAATCGAT AAGCCAATTC CCCAAACTAA AACACCAATA CTAAATTTAA AGATTAATCG 600

CTTTTCTGAC AGCAAACCCA TAATCAATGA CATTAATAAA GATGCTATAC CTAAACATGA 660 

TGTAGCTAGT CCATATACAC CAACGCCCTC TTTTAATATA TTGGAAATAA ACAATGGTAA 720 

TACAACACGC CAAAGGCCAG TATTAATCAA TATGCAGGCA AATTGGATGA TTATAATAAA 780 

TGGAATTTCT TTAGATTGTT TCAAGAATTC CCAAGTTTCA GAAAAATCTT CTTTTGAGTG 840 

TCTATCAATC ATGTTGTTAT TTGTATATTT TAAAAGTGCA TTAAAAATAA ATCCTAAAAA 900 

TAGCAATATA CTACAAATAA AAAAGACGCC AACATTACCA ACTAGTATTA CAATGACACC 960 

AATTAAAGCA GGTAAAATAA TATTTGAGCC TCTTTGCAAA CTATCGATTA ACGCATTACT 1020

TGTTGCTAAA TGCTCCTCAT CAATAATTTC AGGAAGAATT GCCCTAAACG CAGGATCCGT 1080

ATAGCAGTTA ATAATGGTGA TAGCTGTAGA TATGGTTAGA AGCGTCAGAT AATTTAAATT 1140 

TGATGTTATT GCAAGTAAAG GAATAATTAT TATAATCAAA CTTAGTATAA GATCAGATAG 1200 

ATAAAGTATT TTCTTTTTAT TATGTTTATC AGAATATGCG CCACCGAAAA TACCAAATAT 1260 

AATAAATGGA AGTGTTTGAC TCATAACCAT CATTGATAAT TTTAAAGATG ATTGGTTTGT 1320 

CAATTCAACA GTAAACCAAA TTATTTGTAA CGAAAACAGC ACAAAACAAC TCCGACGTAA 1380 

GArATTACCA ATCAATAAAT ATGTAAAGTT TCTATTTTTC AAAACTTCTA AATACAACAT 1440 

ATTTATCACC TCTCATAAAA ATAATTGAAT GCATCCACCA GCTTTTTTAG ACCTTCTTCT 1500 

AAACTCTCTT TATCCAAAGC GCAATTAATT CTAATATAAT TTAGTCAGTT AAATATCAAT 1560 

TATTTCGAAA TATACATACT ACTTGAAACA CCATACATAA CCCCCAAAAT GACTACTCAG 1620 

AGGTTATATT CTACTAATTA TGATTATATT AAATATGAAA ATATTATCAA AAAAATCAAA 1680 

TCATTCTCAT CATATTCTAG GTTGTTTTTA ACAAACTAAA TATAGTGAAT GCAAATCAAC 1800 

TATTATTTAA ATTATGAATT ATTTTAATTC TTTCTTCTAC GAGCCAATAA CATTAATCCA 1860 

GCAATTCCAA TTATACTACT AAAGATCAAA CCTTTTTGCG TGCTTTCTAA ACCTGTTTTT 1920

GGTAATTCTG CTCGTTTTTT CTCTTGATTA GCTACTGATT CTTTAGCAAT TTTAGATTTT 1980 

TTAACTTTAT CATTTTTATC CATTGAATGA ACTGGGCCAT TTGGTTTTGC TCTGTCTTTC 2040 

GATAATCCTG GATTGTTAGG ATTTACTGGG CCACTTGGAT GAGTTGGTCT GCTCGGCTTC 2100 

TCTGGGTTTT CAGGTCCTTT TGGATCTTTT GGTTTCTCTC CACCGAACTC TACAATCTTA 2160

TCTACTGGTT GTTTtGTGAT CTCTTCTGTT GGTTGACCCT CGCCAACTTT TTCACCTGTT 2220 

AATGGGkTCA CTGTGAwTGG TGkTGTGAiirT GTCyTACTTC CTGGkTGTCC TTCTTGTTTC 2280 

ACTCGCTCTT CACCAGGTTG TAAT 2304 
(2) INFORMATION FOR SEQ ID NO: 433:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4733 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 433:


GTCTTAATAT TTTACAAGAT ACAGCCTGAC AATATTTATA TTACAAACCA ATATGGTAAA 60

GAGGCTGTGT CTCGATTAAA AAGAATTTTA GAGCAAAGTA TACCTAATAA AGAATATATT 120

GATTTATCAG AGATATCATT AATTACATTT TTTAAACCAG AATATAATAA ACAACATGTG 180

AATGAGGAGT TTAATTCTTT ATCTTCTTCG AAAATAAAGA ATTTAGCCAA AGAAAATGAT 240

GGTATTATTA TTTATTTAAA TTTTAAGGAT GTTAATTATC AACTGaTAaG TGAAGGCAAT 300

ACi'TTTTTTT CTGCAAAACC ATATTTGCAT TGTGTGTTCA ATAAAGATAT TTTAAATATG 360

GATAAAGTTG ATATAGAAAA TTTTTTCAAA AGTATAGAAG TCAAATACAG TAAATAAGAT 420

CAATTATTGA ATCCACCGAC CAGCAGATTT TTTTAAAAAA TTAATACCCC GTTAATACCC 480

TTTGCTTCAA TTTGATGAAA ATCAATGAAA TTCAAAATTG AAGAAATCCT TTAATATCAA 540

GGTGTACGAC AGTCTATATA ATCATGCGAA ATTCTAAAAT TTTCtQAcGT AAAAAAACAA 600

ATTCyTAAAG CAGCTCGTCG TTCACCTCAA TTCTCAAAAC GTTAATTGTC GGACGATATA 660

TATACAAAAC ACCTCGATGT TATGTCGAGG TGTTTTTTTG CGTTTGTGTG GGGAATATGG 720

AATAGAGTAT AAATGAATTA ATGTCTCAGG TATAGAATTA ATTCAACTAT GAATTATTGA 780
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TTAGCTTGCG GAGACCGTAC GATAGTGTTC TAATAATAGA TATTCAAGGG AAAGAGCTAT 900 

CGGATGATAA CCAGATGATA ACTCAAGAAC CATGrTTTAA GAGTGTTATC AAAATAAGGA 960 

GTGACTTAAT TGAAAGATTC TATATTTTGG AAGAAAGCTT TTATTyCTGT TTATTTTATT 1020

GTTGsGATGC TGGTGnTCT ACTTTTTAGG TTTTATATTA AaACAGATAA CTTTTCTATA 1080 

TATTTAATGA GTATCTTCTT AATTTGTTTA GGAACTGCTT CTATCATTTA TAACTATAAA 1140 

ACCAATCGAT AAATATAAGT ATGAAATGTA TAAGTCACAG TTAGATCTAA GTCTTGCTGT 1200 

GGCTTTTTAT ATCTCCATAA AACGACAAAT TCAAGCCCGA CATAAAACAG CATTTTCAGC 1260 

CCACCATAAA ACGACAATTT CAGCCCGCCA TTGACTAAAC ACCACATCCC AAAAATATCG 1320 

TAACAATCCT CTACATCAAT CAATCCAACA TCCCTCATAA TCACAACX3CA CAAAATCTAT 1380 

TCATGCATTT TTGGAATACT TAGTATTACA AATAACGATT TTTATTCATC TAACAAAGGT 1440 

TATATAATGT ACTGAAGGCA ATTTTTATGT ATTACAAATC TAATCGTACA TGTAAAATTT 1500

TGATAAACAT CATTAATTTT GCGTAACTAT CATTAGATTA CAAATCACAA AGTAATTACA 1560 

TGTAATACAC ATCTATACAT CACATTTGAA GGGAAATGaA TATAAATGAC TGATAAAAAG 1620 

TACACTGCAG CCOATATGGT TATTGATACT TTGAAAAATA ATGGGGTAGA ATATGTTTTT 1680 

GGTATTCCGG GTGCAAAGAT TGACTATCTA TTTAATOCTT TAATTGATGA TGGTCCTGAA 1740 

CTTATTGTCA CTCGTCATGA ACAAAACGCC GCAATGATGG CGCAAGGTAT TGGAAGATTA 1800 

ACAGGCAAAC CGGGTGTAGT ACTTGTTACA AGTGGTCCTG GTGTAAGTAA TTTAACTACT 1860 

GGATTATTAA CCGCAACATC TGAAGGGGAT CCTGTATTAG CGTTAGGTGG CCAAGTGAAa 1920 

CGnAnATGAT TTATTACGAT TAACGCATCA AAGTATTGAT AATGCTGCGC TATTAAAATA 1980 

TTCATCAAAA TATAGTGAAG AAGTACAAGA TCCTGAATCA TTATCAGAAG TTATGACAAA 2040 

TGCAATTC6A ATTGCTACTT CAGGAAAAAA TGGCGCAAGT TTTATTAGTA TTCCGCAAGA 2100 

CGTTATTTCT TCACCAOTTG AATCTAAAGC TATATCACTT TGCCAAAAAA CAAATTTAGG 2160

AGTACCGAGT 6AACAAGATA TTAATGATGT CATTGAAGCG ATTAAAAATG CATCATTTCC 2220 

TGTTTTATTA GCTGGTATGA GAAGTTCAAG TGCAGAAGAA ACAAATGCCA TTCGCAAATT 2280 

AGTTGAGCGC ACGAATTTAC CAGTTGTAGA AACATTCCAA GGTGCAGGTG TAATTAGTCG 2340 

TGAATTAGAA AATCATTTCT TCGGTCGTGT GGGCTTATTC CGCAATCAAG TTGGTGATGA 2400 

ATTATTACGT AAAAGTGATT TAGTTOTTAC AATCGGTTAT GATCCAATTG AATACGAAGC 2460 

TAGTAACTGG AATAAAGAAT TAGAAACACA AATTATCAAT ATTGACGAGT TCAAGCTGAA 2520 

ATTACTAATT ATATGCAACC GAAAAAAGAG TTGATTGGTA ATATTGCTAA AACGATTGAA 2580 
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CAATTAAGAA 


CACATATTGA TGAAGAAACT GGTATTAAAG CGACGCATGA AGAAGGAATT 2700

CTACATCCAG TGGAAATTAT TGAATCTATO CAAAAGGTAT TAACTGATGA TACTACTGTA 2760

ACAGTTGATG TTGGAAGTCA CTATATTTGG ATGGCACGTA ATTTCAGAAG TTACAATCCA 2820

AGACATTTAT TATTTAGCAA TGGTATGCAA ACXSCTTGGTG TAGCATTACC GTGGGCAATT 2880

TCAGCTGGAC TTGTGCGCCC TAATACGCAA GTTGTGTCCG TTGCTGGCGA TGGTGGCTTT 2940

TTATTTTCAT CACAAGATTT AGAAACGGCC GTACGTAAAA ATTTAAATAT CATCCAGCTT 3000

ATTTGGAATG ATGGAAAATA TAACATGGTT GAATTCCAAG AAGAAATGAA ATATAAACGT 3060

TCGTCAGGTG TAGACTTCGG TCCTGTAGAT TTTGTAAAAT ATGCAGAATC ATTTGGCGCG 3120

AAAGGTTTAC GAGTTACTAA TCAAGAAGAA TTAGAAGCGG CAATTAAAGA GGGCTATGAA 3180

ACAGATGGTC CAGTATTAAT TGATATACCT GTAAATTACA AAGATAATAT CAAACTTTCA 3240

ACAAATATGT TACCTGACGT ATTTAACTAA AATAAAGATA AATGTTAAAG AGGAGTGGGA 3300

GATTTTATGA CTAATGTTTT ATACCAACAT GGTACATTAG GCACATTAAT GGCAGGATTA 3360

TTAAAAGGAA CTGCATCAAT AAATQAATTA TTGCAACATG GTGACTTAGG TATCGCTACA 3420

CTAACAGGTT CAAACGGTGA GGTAATCTTT TTAGATGGAA AAGCTTACCA TGCAAATGAA 3480

CATAAAGAAT TTGTAGAATT AAAAGGTGAC GAGTTAACAC CATATGCAAC TGTAACTAAA 3540

TTTGTAGCAG ATACAAGCTA TGAAACGAAA GATAAATCTT CAGAAGCAGT TTTTGCAGAA 3600

ATTAAGGAAA AGATGTTGAG TGAAAATTTA TTTTCAGCAG TAAAAATTTC AGGCTTATTT 3660

AAAAAAATGC ATGTACGTAT GATGCCGGCT CAAGAACCAC CTTATACACG TTTAATCGAT 3720

TCAGCTAGAA GACAACCTGA ACAAACTGAA ACGTATGTCA AAGGTTCAGT CGTTGGTTTC 3780

TTTACACCAG AATTATTCCA TOGTATCGGA TCAGCAGGAT TTCATGTACA CTTTGCGAAT 3840

GATGATCGTA ACTTTGGTGG ACATGTCTTA GATTTTGAAG TAGAAGATGT TAAAGTAGAA 3900

ATCCAAAATA TAGAAACATT TGAACAGCAT TTTCCAATTC AAGATAAAGA TTTCACTAAA 3960

GCAAATATTG ACTATAAAGA TATTGCAGAC GAAATTAGAG AAGCTGAATA ATGAATCCAG 4020

AAATATAATG ACGGTTTATG AAAATTGACT TCATAATGCG CGATTTAGAA ATGATAGTTT 4080

GTAAATATGA TTAACCATGA CTACAATAGA ACAAATATAT TTATAATTAC GTCTAAGTAA 4140

TAAAATAAAT CCCTTCACTA TTAGCAGTAG TGAGGGGATT TATTAGGTTC CAGATATTTG 4200

AGATTTGCTG TTATGTTTAG ATTATAAATT GTGGTACACA CTCATATAAA ATTTACTATT 4260

GTATAGGCCA ATCTGTTACT ACGAGAAGCA AACAACAATA ATTTACAAGT TCAATAACTA 4320

AAAAGACAAA CGCCAATTTT TCAGCGCTTG CCCTATAAAA CTATTTTCAA ATTATTATTT 4380
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10 






TTTCTTTAGA TTCACTACTT TTTTATTACC ATCATTCAAA GTAAGCX3TAT AAGTTGCTGT 4500 

TTGGGCATTA TTAATTTTTT CTGTTGTAAC ACCACGTTGA GAAGCTAATT CATTTTTTAC 4560 

TTTACTGTCA ATTTCTTGAT AAAGAACATT TTTATTTTCT GGGAAGATAA AGTAAGTTCG 4620 

ATGTAATGCA GTAATACCAT CTACTGAAAT TGTGTAAGGA ACAGTGTGAT AACCATCCAC 4680 

AGTCATTTCT TTATAGCCGT TATTACTATC TGCAGATGCT TCGTGACTCG GTA 4733 
(2) INFORMATION FOR SEQ ID NO: 434: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1284 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 434:

AATAAATGGT AATTTTATAT CATCAACGGC TACAAATGGG GCAGTTTCAT ATATAAATAA 60 

ACACATCTAC TCAACTGnTC TTAATTAATT AATAAATQAT TTACCAGATG ACATATAACA 120 

GCCATTATTA GnTTAATTCT AATAGTTTAT TTAAATTTTC TTCGGTTGTC GCCCAACTGG 180 

TTGCGAATCT AACAACACGA TGTTGATCAT CGTATTTTTC CCAAACAGCA AATTTAACTT 240 

TTTGTTCTAA CTCTGCTATT TTCTCGTTAC TTAAAATAAA AAATTGTTGA TTGGTTGGAG 300 

AATCAAAGTA AAGACGATAG CCTTTATTTT TAAACCCGTC TTTCATCTTA TTTGCCATTT 360 

CGATAGCATG TCTGCTTATA TTAAAATATA AATTGTCCGT AAATAATTCT AAAAATTGTA 420 

TGCCTGTTAA CCGTCCTTTT GCTAAAAGGG CACCGTGGAT GCTTGATTCG AGTQGTAAAT 480

TGTTTCGGTT CATTATTTTT OGTAAAAACA ATGGCTTCCC CGCATAATGC ACCTATCTkC 540 

GTACCACCTA TATAAAATAC ATCACAATAT TTAGCGrTGt CTTTAATAGT CATATCTGAT 600 

TGGTCACTCA TCAATCCATA CCCTAATCGT GCACCATCCA TAAATAATGG AAGCTGATAT 660 

TGCTTACATA CTTTGCATAA CTCTTCCAAT TCTGaTTTAG AGTATAATGT GCCATATTCT 720 

GTAGGATGAG AAATATATAC CATTCCTGGG AATACCATAT GGTCCTTTTT AAAATCACTT 780 

TTAAATGTCT CCATGTAAGT TTCAACATCT GAAGCACTAA CTTTTCCTTC CTTAGAGGGT 840 

ATAGTAATTA CTTTATGTCC ACTATATTCA ATTGCACCGC CCTCATGCAC AGCAACATGA 900 

CCAGTGTCTG CTGAAATGAC CCCTTCGTAA CTTTCTAACA TTGAATTAAT AACAACCTGA 960

TTGGTTTGTG TTCCACCTaC TAAAAAACGA ATTGTAGCAT TTGGaCAGTC AATTGTATCT 1020 

TTAATCTTTT CAATTGCCTG AGCTGTGAAT TGaTCAAAGC CATATCCCGA AGCTTGTACA 1080 
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TCGAATGAAA TCACTACATT TCCCCCTAAA ACTAATATCA ACATTTTAAT AAGATAAACC 1200 

AATTTCAAAA CTAGTTCGAT ATTTAAAATG TATTATGGAT GGnTAAAGTT TGTATCGCAT 1260 

TATCGCGAAG TTGnATAAAT ATAT 1284 



(2) INFORMATION FOR SEQ ID NO: 435: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1072 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 435:



ACAGCTTTTG GTAArGGAGA AAwTCAaTAt GAAACAGTAT AArGCGTATT TclATCGATTT 60

AGATGGCACA ATGTATATGG GAACAGATGA GATTGATGGA GCAAnACAAT TCATCGATTA 120

TTTAAATGTA AAAGGCATTC CTCATTTATA CGTAACTAAT AATTCAACAA AAACACCTGA 180

GCAAGTAACT GAAAAATTAC GTGAAATGCA CATTGATGCT AAACCAGAAG AGGTTGTAAC 240

GTCAGCGTTA GCCACTGCTG ATTATATTTC AGAACAATCA CCAGGAGCAT CAGTATATAT 300

GTTAGGTGGG AGTGGTTTAA ATACTGCGTT AACCGAAGCG GGACTTGTCA ttaaaaatga 360

CGAGCATGTT GATTATGTAG TTATTGGACT TGACGAACAA GTTACATATG AAAAGCTTGC 420

GATTGCAACG TTAGGTGTAA GAAATGGtGC AACATTTATT TCTACAAATC CTGATGTATC 480

AATTCCTAAA GAGCGTGGTT TATTACCTGG TAATGGTGCT ATTACAAGTG TTGTAAGTGT 540

ATCGACAGGT GTATCGCCAC AATTTATTGG TAAACCAGAA CCGATTATTA tggttaaagc 600

ATTAGAAATT TTAGGATTAG ATAAATCCGA AGTTGCTATG GTAGGCGATT TGTACGATAC 660

CGATATTATG TCTGGTATTA ACGTAGGTAT GGATACGATT CATGTACAAA caggtgtatc 720

TAC6TTAGAA GATGTGCAAA ATAAAAATGT GCCACCAACG TATTCTTTTA AAGATTTAAA 780

TGAAGCAATA GCTGAATTAG AAAAATAGAT ATAGTCATTT TATAAAGTAG gtgaattgat 840

TTGGTAAAAA TAGTTGTTTC GAGGAAAATT CCAGATAAAT TTTATCAACA attaagtaaa 900

CTTGGTGAOG TTGTTATGTG GCAAAAATCA TTAGTGCCTA TGCCTAAAGA tcaatttgtg 960

ACaGCcTTCG TGACGCAGAT GCTTGTTTTA TTACATTAAG TGAACAGATC gatgcagaaa 1020

TTTTAGCGCA ATCACCAAAT TTAAAAgTAA TTGCGAATAT GGCTGTAGGA ta 1072



(2) INFORMATION FOR SEQ ID NO: 436: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3271 base pairs 
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IS 






(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 436: 

TAAAAACTTA CTTTAACACC ATTCCTTTTT AACTTTTTTC GTGTTTCxiTT TnCTTAAGTC 60 

CATCCATATT TTTAATGATG TCATCTGCTG TTTTATCTTT TAAATCTAAC ACTGaGTGAT 120

AmCGAATTTG TAGCACAGGA TCAAATCCTT TATGGAATCC AGTATGTTCA AATCCTAAGT 180 

TACTCATTTT ATCAAAGAAC CAATCATTAC CAGCATTACC TGTAATCTCG CCATCATGAT 240 

TCAAGTATTG ATATGGTAAA TATGGATCGA TATGTAGGTA TAGACAAOGA TGTTTTTTAA 300 

CATATTTTGA TAATTCATTA AAGAAAAAGT GTACGAGTTC TTGATTTTCA TAATCAATCA 360 

CTGGACCGCG ATTTGAATAA AAATACTTGA ACACTTTCAT AACAGGTACA GCAGTAAGtA 420 

AGCAAGCTGC AATGACCTCG TTATTATTGT TTTTTATTCC CACTAAATGT GTTTCATAAC 480 

CTTCAGCAAG CTTTAACTCA TAGTGGCCAA CAGTTTGCGT GaAATGACTG TATGGCATGC 540 

TATCTGTAAa GGCACCAAAC TCTTTAGCTG TTAAATTTGT AAACTTCATT ATCATTACTC 600 

CTATTTGTCT CTCGTTAATT AATTTCATTT CCGTATTTGc AGTTTTTCTA TTTCCCCTCT 660 

GCAAATGgCA AAAATAATAA ATCTAATCTA AATAAGTATA CAATAGTTAA TGTTAAAACT 720 

AAAACATAAA CGCTTTAATT GCGTATACTT TTATAGTAAT ATTTAGATTT TTGAATACAA 780

TTTCAAAAAA AGTAATATGA ACGTTTGGGT TTGCTCATAT TACTTTTTTT GAAATTGTAT 840 

TCAATTTTAT AATTCACCGT TTTTCACTTT TTCAAACAGT ATTCGCCTAA TTTTTTTAAA 900 

TCAAGTAAAC TTAATTATTC AATGTTTGTT GGATAGATTG TAAATATTTA ATGATTTCCT 960 

CACGCGTGTT AGATTTAAAT CGCTTAACGA TTTCGCTACC AATGACAATG CCATCTGCAA 1020 

CCTCTTTTAT ATCTGCAACA TGTTGTGGTG TTCTTATACC AAATCCTGCG ACAACTGGCA 1080 

CATTGGCTAT CGCTTTAATT GACTCAATTT TTCGTTTTAA TTCTGGATGA AACGCACCGT 1140

TTTGCCCTGT TGTCGCATTC ATCGTCACAG TATAAATAAA GCCTTCCGCA TGGGATACGA 1200 

TATCTTTTAT ACGTTTGTCA TCAGTAGTCA TCGCAACTAA CGATATGATT TTGACGCCAT 1260

AQTGACTAAA TTGTTGTTTT AAACGCTGCG ATAATTCATA TGGTAAATCA GGAATAATTA 1320 

AGCCGTAGAC ACCAGTATCT CGACATTTTT CAAAAAACGC TTGTTCTCCA TAATGACAAA 1380 

TAATATTATA ATACGTCATT AATACATAGT TACACTTAAT TTGATCACCA TGTTTTTCTA 1440

ATTGATTGAA AATATAATCT ATCGTGATGC CTTGTTTAAT CGCTTGTTGA CCTGCTTCCA 1500 

TGATAACTGG ACCATCAGCA ACCGGATCAG AGAAAGGTAC TCCAATTTCA ATTATATCTG 1560 
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10 






GTATAAATAA TTTAGTCATT TGCAAGACCT CGCTCTACCA TATATTGTCT AATTGTTTCC 1680

ATATCTTTAT CGCCACGTCC AGAAATAGTT ACTACAATAA TATCTTCTTT CGACATCGTA 1740

GGCGCTAGTC TTTCAACATA ACTCAGTGCA TGTGCACTTT CAATTGCAGG TATAATACCT 1800

TCATGTTTTG TAAAGTTGAT TAAAGCATTC ATTGCTTGTG TATCACTAGC ATTTTCAAAA 1860

GTTACTCTAC CAATGTCGTG GTAATAAGAA TGTTCTGGTC CAATACCAGG ATAATCAAGT 1920

CCTGCTGAAA TAGAATGTGC TAGTTGCACT TGCCCATCTT CATCTTGAAT TAAATACATT 1980

TTAGTACCAT GTAATACGCC AGGTGATCCT TTGCCAATTG CAAGTGCATG TTTATCAGTA 2040

TCATCGCCTT GACCTGCGGC TTCAACACCG TATAATGCAA CATCATCTTT AATAAATGGA 2100

TAAAATGTAC CGATTGCATT TGAGCCACCA CCGATACATG CTACAATTGC ATCCGGAAGT 2160

OGACCTTCTT TCTTCAATAT CTGTGATTTT ATTTCTTTAC CAATCACACT CTGAAAATCT 2220

CTAACAATOG TTGGGAACGG GTCTGGACCT AATGCAGAAC CTAATAAATA ATGTGTATCA 2280

TCTACATGAC TTACCCAATA TTGCAATGCT TTATTAACTG CATCCGATAA AGTCCCTTGA 2340

CCATCTTCAA CTGCCACAAC CTTTGCACCA AGTAATTCCA TTCTAAATAC ATTAAGTTGT 2400

TGTCTTTTAA TATCTTCACT TCCCATAAAG ACAACAAGTT CCATATCAAA TAATGCAGCA 2460

ACCGTAGCAC TAGCTACACC ATGTTGACCC GCACCAGTTT CAGCAACAAG CTTCTTCTTG 2520

CCCATTCTTT TAGCAAGCAA CGCTTGACCT AACGCATTAT TAATTTTATG GGCGCCTGTA 2580

TGATTTAGAT CCTCTCGTTT CAAATATATT TTAGCGCCAC CTAGGCTTTC AGTATATGAT 2640

GCAGCATATG TAAGTGGTGT CGCGCGTCCT ACATACTCTG ATAAATAGTA TTCCAGTTCT 2700

CTTTGAAACT CTGGGTCTGC TTTTGCCTCT TTATAAGCTT TTTTCAACTC AATAATTGCT 2760

GGCATTAATG TTTCTGGAAC ATATTGCCCT CCATATTCAC CAAAGAAACC TAATTCATCT 2820

GCTTCTGTTT GTATTTGTTT ATTCATTGTC TCTATCTCCT TTCACAATAT TTACAATTGC 2880

TGTCATTTTT TCTATATCTT TTCGCCCATT TACTTCTATA CCTGATGCAA GATCATAACC 2940

TTGATGTGAT AATTTAAGTT GATTAACTGT TTGAATATTT TCAGAGTTAA TGCCTCCTGC 3000

TATCAAATAA GGTATGTCTT TTATGTGCTT CAAAATAGTC CAGTCATATG TTTGACCGGT 3060

ACCACCATAC GACACTGAGG GTGTGTCGAT AATAAATAAA TCTACGAACC CTTTATATTT 3120

ATTTATGTTT TGGATTATGT TTTCATCTnG CAGtAAAGCT TTAGTGATTT TAATGCTTGA 3180

ATATkTCTTT TTAATTTCCT GTAtAAAAtC AATAGATTCt GtGTAACTGT ATTGTGkTAA 3240

wTGaCGwATg CtTAAwACgT GTGCCAATGG T 3271


(2) INFORMATION FOR SEQ ID NO: 437: 
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IS 






(A) LENGTH: 1553 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 437:

ATAATGAAAT AATACTGTGT TTTATCTGCG AAATGTAtCA TTTTCTAATc GTTTCACAGT 60

AAAATGAAAA GATAAAGTGT GTTTTTACTT GAATTTTGAC TAAAATTACT CTATATTTAT 120 

TAATTGAGCT ATGCTTATTA TTACAATTTG ATTACAAATT TTAAATTTGT TAATTGAATG 180 

ATAATATTAA ATAAAGAAAC TTACACAAGC AAATATGAGT TGTAGCCCAA AATACTTGTT 240 

AAATCAAAGT TGAAAGCTAC AAATAATGAA AATTATA/^C TTGAATCTGA AAGTAATTAC 300 

TATAATTATG ACAATGTTAA CTTTTAAACG CACTTATTAA TTAACTACAT AATGTTAATA 360 

TCTAATTTAT TCAAGTACTT TCGCAAGATT TATTATCTAA ATAACGGGGG AAAGAATCAT 420 

GAGTtCACAA AAAAgAAAAT TAGTCTTTTT GCGTTCTTCT TATTAACCGT AATAACGATT 480 

ACCTTGAAGA CGTATTTTTC TTATTATGTT GATTTTTCTT TAGGTGTTAA AGGTTTAGTA 540 

CAAAACTTAA TATTATTGAT GAATCCTTAT AGTTTAGTAG CACTGGTTTT AAGTGTGTTC 600 

CTATTCTTTA AAGGCAAAAA A6CATTTTGG TTCATGTTCA TAGGCGGCTT CTTATTGACG 660 

TTCCTATTAT ATGCCAATGT TGTGTACTTT AGATTCTTCT CTGATTTTTT AACGTTTAGT 720

ACTTTA/U^CC AAGTAGGTAA CGTAGAATCT ATGGGTGGTG CGGTTAGTGC ATCATTCAAA 780 

TGGTATGACT TTGTTTATTT CATTGATACG TTAGTTTACT TATTCATTTT AATATTTAAA 840 

ACAAAATGGT TAGACACAAA AGCATTTAGT AAQAAATTTG TTCCTGTCGT AATGGCGGCT 900 

TCAGTAGCAT TATTCTTCTT AAACTTAGCT TTTGCTGAAA CTGACAGACC AGAATTATTA 960 

ACACGTACAT TTGACCATAA ATATTTAGT6 AAATATTTAG GACCTTATAA CTTTACAGTA 1020 

TACGaTGGTG TTAAAACTAT CGAAAATAAT CAACAAAAAg CGCTAgCATC TGAAGATGAC 1080 

TTAACaAAAG TATTAAAtTA TACGAAAcAA CGTCaAACmG AGCCTAACCC rGAwTATTAT 1140 

GGGGTGGcAA rGAAGAAAAA TATTATTArG ATTCATTTAG AAAGTTTCCA AACCTTCTTA 1200 

ATTAATAAAA AGGTTAATGG TAAAGAAgTA ACACCGTTTT TAAACAAATT ATCAAGTGGG 1260 

AAAGAGCAAT TCACATACTT CCCTAACTTT TtCCATCAAA CAGGTCAAGG TAAAACATCT 1320 

GACTCTGAAT TTACAATGGA TAACAGTTTA TACGGTTTAC CGCAAGgTTC TGCCTTTTCA 1380

TtaaaaGGAG ATAATACGTA TCAGTCATTA CCAGCAATTT TAGATCAAAA GCAAGGCTAC 1440 

AAATCTGATG TCATGCACGG TGACTATAAA ACATTCTGGA ACAGAGACCA AGTATATAAA 1500 
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(2) INFORMATION FOR SEQ ID NO: 438: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1419 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 438: 





ACAAAATCAA TCAGCAAAAT GATTGGATTA AGACGCAAAT TGAGCGTTCA ATGGAAGGCG 60

AAACAGTTGG CATTAATGAT CAAAATATAG AAATATATAG TGAAcGtTgA GATTTATACC 120

ATACACTCGT ACCTTTAAAT CAAGAATTGC ATAAGTTGCG ACTTAAAACT CAAAACTTAA 180

CCAATGAAAA TTATAATATT AATGATGTGA AAGTTAAAAA GATTATTGAA GATGAACGTC 240

AAAGACTA6C ACGAGAACTT CACGATTCTG TTAGTCAGCA ACTTTTTGCG GCmAGTATGA 300

TGCTATCTGC TATCAAAGAA ACGAAtTAGA ACCACCATTA GACCAACAAA TTCCTATTTT 360

AGAGAAAATG GTTCAAGATT CGCAGTTAGA AATGCGTGCT TTGCTGTTAC ATTTAAGACC 420

GCTTGGTTTA AAAGACAAAT CTTTAGGTGA GGGTATTAAA GATTTAGTTA TTGATTTACA 480

AAAAAAAGTG CCAATGAAAG TTGTGCATGA AATACAAGAT TTTAAAGTGC CTAAAGGTAT 540

TGAAGATCAT TTGTTCAGAA TTACACAGGA AGCAATTTCG AATACATTGC GTCATTCAAA 600

CGGTACAAAA GTGACAGTAG AATTGTTTAA TAAAGACGAT TATTTATTGT TGAGAATTCA 660

AGATAATGGT AAAGGTTTTA ATGTTGATGA AAAATTAGAA CAAAGTTATG GACTTAAAAA 720

TATGCGTGAA AGAGCTTTGG AAATTGGTGC AACGTTCCAT ATTGTATCAT TGC CAC3ATTC 780

AGGTACACGT ATCGAGGTGA AAGCACCTTT AAATAAGGAG GATTCGTATG ACGATTAAAG 840

TATTGTTTGT GGATGATCAT GAAATGGTAC GTATAGGAAT TTCAAGTTAT CTATCAACGC 900

AAAGTGATAT TGAAGTAGTT GGTGAAGGCG CTTCTQOTAA AGAAGCAATT GCCAAAGCCC 960

ATGAGTTGAA GCCAGATTTA ATTTTAATGG ATTTACTTAT GGATGACATG GATGGTGTAG 1020

AAGCGACGAC TCAGATTAAA AAAGATTTAC CGCAAATTAA AGTATTAATG TTAACTAGTT 1080

TTATTGAAGA TAAAGAGGTA TATCGTGCAT TAGATGCAGG TGTCGATA6T TACATTTTAA 1140

AAACAACAAG TGCAAAAGAT ATCGCCGATG CAGTTCGTAA AacTTCTAGA GGAGAATCTG 1200

TTTTTGAACC GGAAGTTTTA GTGAAAATGC GTAACCGTAT GAAAAAGCGC GCAGAGTTAT 1260

ATGAAATGCT TACAGAACGA GAAATGGAAA TATTATTATT GATTGCGAAA GGTTACTCAA 1320

ATCAAGAAAT TGCTAGTGCA TCGCATATTA CTATTAAAAC GGTTAAGACA CATGTGAGTA 1380
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(2) INFORMATION FOR SEQ ID NO: 439: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 608 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 439: 

ATGTnGGCGG ACGTAAAGGA CATGTTTATA CTGATGATCG AGCATTAGAT ATTGATATCG 60 

TTCCGCCTGC TCAAGCAGAT GGTAAAGCTA CTAACCCCGA ACAATTATTT GCAGCAGGTT 120 

ATGCATCTTG CTTCAACGGT GCTTTCGACC TAATTTTAAA GCAAAACAAA GTGCGTGATG 180 

CTCATCCAGA AGTAACACTA ACAGTGAGAC TAGAAGATGA TTCAQACTCA GAAAGTCCTA 240 

AATTAAGTGT TTCAATTGAT GCGACAATTA AAAATGTTAT ATCTCAAGAA GAAGCTGAAA 300

AATATTTACA AATGGCTCAT GAATTTTGTC CATATTCAAA AGCGACTCAA GGAAATATTA 360 

ATGTCGATTT AAATGTAAAT GTTGTAGATT AGCATTAACT TAAAGAGATT ATTCAACGTT 420 

ATTAATAAAA TTCACATAAA ATTCAAATTt StCrAcCAAA AATTTTTGGT TGGyTATTTT 480 

TTCTATTCGT GATTGAAATT TCTGGCAATA TTTAACTGAA AATGATTGTA CCTTAGTCAT 540 

CATAAATGT6 ACCGGTTCCA ATACTGGCTT GACTTCTTCG CATACCGTCT ACAAATAAAA 600 

GTCCAGTG 608 

(2) INFORMATION FOR SEQ ID NO: 440: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 682 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 440: 
CCTTTCAAtT TCTCCCAAgA TTTACGCATG TCTGACCAAA TGTCAATACA ATACCTGAAT 60 
AATGATTTCT GGAGTAATTG CGAAAAATAT TTAAAAGTTG CTATAGATCA ATTTTCAAAT 120
TATAGTATCT CATCTCAAGT TTCTAACTAT CATTTCACAG TATTACTTGG AGATCGCCAA 180 
AAACCACTTA TGTATCTAAA TAAAAATCGC GGTGGTGATG GTGGCATACC AGGTTATATT 240 

ATGATTTATT TAGTGCCGAG TACAAGTACA ATTAATTCTA TGAAAAGCTT AATTGCACAT 300 
GAAGTAAATC ATAATATGC6 CTATCAATAT ATTGATTGGG ATGGCGGAAG TTTGATTGAA 360 
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ATTGGACCGT GGGTAAOVAA TACTAATXOA AGTCGTGATA ACGTAAAAAT TAAAAATACT 480 

ATTTATAATC ATTTACATTT AAAGCATATA TTTGAATCGA tGCCTTATCT CTATGGTGAT 540 

GATATTAATA AACTTCAAGG TAGGCCTATC GTTGGCTTAT CTCATGCTGC CGGGTATGCA 600 

TGTGGCTATC ACTTGGTAAA ATACTTTTTA CAAAAAACAA ACATACCTAT TGAAGTTGCT 660 

ACAACACTTC CAGCACAAAA AA 682 



(2) INFORMATION FOR SEQ ID NO: 441: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2574 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 441:

ATCTTCmr AACATGACAA ATTGCAACAA AATATTAATG ACGCCAAAAG ACATTTTTTC 60 

ACGTTCAATT AATTCTTCAA CCATTGTCTT TTGCGATATA GTTGGTTCTG ATTCAGACCA 120 

AGAAGCTAAC ATATCAATTG GACTCGTTTG TTCAAGTAAC TCAAACCATT CATCACTTTG 180 

TGGCTTTGGA TTCACTTCTG AAGATTTGCC CGCCGAAGAT GATGTAGCAG GAQATTTCAC 240 

CTGTAATTTA GGCATTTQAT TTTCGTGTTC CATTAAGTAA TACGAGCGTG CTTGTTTACG 300 

CATTTCTTCA AAGGATAACT GTTGTCCACT TGTAATTGAA TTTAAAATAA CATGCTTCAT 360 

GCCATCTGCT GTTAAACCAT ATAAAGTCGC GAGTTGTGTA ATTAAACGCT TTGCATCTTT 420 

GGTAACAATG TCTTGACTAA TAAAATGTTT ACCTAACATT TGTCTCAACA TTTCAAAGTC 480 

AAAAGATTCA TTTGATAAAT CGATACCTTG GTACGGTTCA TTAATCGGaA TATCACTTGT 540 

ATCGATATCT ATTTTTGTAG ACGGCACTTT AAAAACATCA GTAAATTGTC TTGTTACCTG 600 

TTTAAATTCA CTCAAATCAA TTTGTTGATA CTCAAAGTAT TTCTTCAACT CATGAAATCG 660

ACGATGCTCG ACTTCACTAT ATAAAAAGAT TGACAACATT GGATCATTAA AAAATAAATG 720 

TGCTGAAGGG GtTGaATTAA TTGGTAAACA AATTGTGTTT CTTGTTCATC ATOTTTGACA 780 

AACGCCTTTA ACAATCCAAT CGCTTCAAGT AAGTCCATTT GTTGTCTAAA CTCTAGTAAA 840

TTAATTTTAA GTTCATTCAT AAAAATATAA TGAGAAAGAA TCAATGTTTC ATTATGACTT 900 

TCTTTAACGA ATTGAGTCAT AAAATGATAT AAACCCACTG CTTGCGTTCC AATTAGCGGT 960 

GTATACAGTC GATTCAATAC CTCTAAATGA TTCGTATTTA AATCAAAGTG TTGCATAACT 1020 

TTGAATTGAT CCTTTGGTCT TAAGCCGAAT TCGAAGGCTT GTCGTCCCaT TTAAGCnATC 1080 
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5 



AAATTCTTTA TAGACTGATG CAAATCTAAC ATATGAAACT TGATCAACAT GCATTAACAA 1200

GTTCATAACG TGTTCACCTA TATCTCGTGA AGACACTTCC GTATGACCTT CATCTCGTAA 1260

TTGCCATTCA ACCTTGTTAG TTATGTCTTC AAGTTGTTGA TATCTAACTG GTCGTTTCTC 1320

ACAAGAACGC ACAAGTCCAT TAAGTATCTT TTCTCTTGAA AACTGCTCTC TTGTGCCATC 1380

TTTTTTCACA ACTATAAGCT GACTAACTTC GATATGTTCA AATGTAGTGA AACGTGTTCC 1440

ACAATTTTCA CATTCTCTTC GTCTTCGAAT GGCATTTAAT TCATCGGCAT GCCTTGAATC 1500

TACAACTTTA GATTGTGTAG AATTACATTT CGGGCATTTC ATTACATCAC CCTCTTTATT 1560

TTGATTATGC CTAATTATAC TATAAATCTA GAGATGAAAA AAGAATCCCT CAATTTAATT 1620

CATTTAACCA AATAATGAAA CAATAAAAAA CATTATATCG TTACTTATTA AGTAATTTGC 1680

ATGACAATAT TATTGTATTA AAAATAAAAA ACCTAACTCC GAA6TCAGAG TTAGGCTATA 1740

AATTAATTGT ATTAACTTGC ACTTACAGTT TCTTTTGATG TCAAAAGTGC TCCAATTTGC 1800

TCAGCAACAT CTACAACTCT ATTTGAATAA CCCCATTCAT TATCATACCA AGCAATAACT 1860

TTTACTTTAT TCCCTGACAT GACCATTGTT GATTTTGCAT CAATAATAGC TGAATTTGGA 1920

TTAGTATTAA AATCAACAGA CACTAGTGGT TGATGTTCGA CTTCTATGAT ACCTTCTAAA 1980

CCTGCATTTT CAAAAGCTTG GTTTACTTCT TCTGCAGTTA CTTCTTTTTC TAAATCAACA 2040

ACTAAATCAA CGAGC6ATAC ATTCTTTGTT GGTACACGTA ATGCCATGCC GTGTAATTTA 2100

CCTTCTAATT CTGGTAATAC TTCTTTTAAA GCTTTCGCCG CACCAGTAGA AGTAGGAATA 2160

ATGCTTTCAT TACATGAACG TGCACGTCTT AAATCTTTAT GTGGATTATC AATATTTTTT 2220

TGGTCATTTG TAATAGCGTG AACAGTAGTC ATTAAACCAT TAACTATTCC AAACTGATTA 2280

TTTAAAACTT TTGCAACTGG ACCAATGCAA TTAGTAGTAC ATGAAGCATT ACTAAAAATG 2340

TCAAATGCTT CTATATCTAA TTGGTTATCA TTTACGCCTT TAACTACCAT TTGAACATGT 2400

CCACCTTTTG nAGGACCAGT TAACAAAAsT TTTtTGGCAC CTGCTTTAAT ATGTGCGATG 2460

GCTTTATCAC CATGaTTAAA TTTACcAGTT GCATCTATAG CAATATCGAT ATCTAATTCT 2520

TTCCATGGCA AGTTTTCAGG ATTGCGATCA GCAACCAATT TAATTTTATG ATCT 2574



(2) INFORMATION FOR SEQ ID NO: 442:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3326 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
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ATCAAGGAGC GAAACATCCG AATGAAAAAG TTGTTGCTAC TGACTCAGCA ATGATTCCTT 1860

ATGCTGCTTG GCAGAAATTT GCTAGACCAC GCTTTGaTAA TATGATTAGT TTTAATAGTA 1920

CCAACGATTT ATTAACATGG TTACAAGATC CATTCaTCAA AGATATGCCA GGAAAACGCG 1980

TTAACATTAA TGATGGTGTG CCCAGGTTAG ATACTTTAAT AGACAGCCAT GTAGGTTATA 2040

AAAGGAAGTT AAATAGAAAA GATAACACAT ACGATACTGT ACCACTAATC AAAATAAAGT 2100

CGGTAAAAGA TACAGAAATT AAAAATGGAA AAAAAGTAAA AAAGACTATT AACATAACAT 2160

TAGATATGGA TGGGCGAATT CCAATAAATG TTTGGACAGG AGATTCGATT GCACGTTCTG 2220

GAAGAGGAAC TTTAATTAAA CTTAATTTAG AAAATCTTGA TGCGTTGAGT AAACTGATTA 2280

CTGGTGAAAC AAGTGGTATG TTAGCAGAAT GCGTAATCTT TTTAAATGAA AGTTTTAACA 2340

TCTCAGAAAA TGAAAATAAA AATTTTGCAG ATAGAAAGAa ACAATTATCA GAAGGATTTA 2400

AGGATAAGAT TAACTTATTT CAGTTAGAAG AAATGGAAAG AACTTTAATT AGTAAAATAA 2460

ACTCACTTGA AGAAGTTGCA GATGAAACAA TAGAAAGTAT TAGTGCTGTT AAACACTTAT 2520

TACCTGATTT TGCATTGGAT GCATTAAAAG AAAGAATTAA TGAGTTGTTT AAAGGTATAA 2580

AATCTTTTAT AGAAAAAGTG TATGATAGTA TAGATAATGA AATTTTAGAA ATTTTCAAAA 2640

ATATAGATCA CGACTTCAGA GATGGAGTAT CTGAA6AAAT GATGAAACAT TTGAAAGTAG 2700

TGAAACAGAA TATAGACCAA ATAAAAAATC AAAATGATAT TTATGGTAGG CAAATTGCAG 2760

ATATTAGAAG TATTATGAAA CAACAAGATG CAACAATTTT AGATGGAAAT TTTCAAATTA 2820

ATTGTAGCGG CGAAAATATG GTACAGGGTC TAGTTATACC TTCTAATTAT TTAGGAAGAA 2880

AAATGAAAAT ATTAAAAGAC CATATCGATG ATGGTATTAA AAAAATAGCA GACTATGTTC 2940

AAGGTATATA TGATGAATAT GCATCGAAAA TTGTCGATGT AATAAAATAT TTGATTAATA 3000

CAATTCCCAA AATACGTAAG AATTTAAGAC ATGCAATTGA AATGTTAAAT GTAAAAAAGA 3060

AAGAATTTTT GTCCCTGATT CCTAATGTAA CTTGTAATTA TATTAAAACT AAATTAGAAG 3120

AATTAGATAA TACTTTAGGC AAATGGGAGC CTTTTCTTAA TGATTTAAAA GCAGTGTCAC 3180

CAATTTTAGA TAACCATTTA GATGATATTG TTAAGAACAT GAAGCCTTTG ATTGTACAAA 3240

TGAyATwTGA ACCATCACAT TATGaCGATA TGTTTAATTC aAGAAAAGCT TTAACGcCAG 3300

TGTTCTCAAG CGTTTTATAA AGGTTG 3326



(2) INFORMATION FOR SEQ ID NO: 443: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5301 base pairs 
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 443: 

ACCTATAGCT GGCCCGTCAA GTCCTGAATC ATATGTCGCT ATAAACCCAG TTGCATGCCC 60 

AGCAATCTCG GTTTCAATTT CAAAATCATG CTCTTTCAAA CGATCTATTA AAGTTCGAGA 120 

CGCAAATATT TCTTCATTAC CAAGTTCAGG ACGTTCATGA ATTCTATGAC TGATTTCGAT 180 

ATAACTATAT TTATTTGTTT CTATATAATC GAGAATTTGT TGTTTTTCAC TCATTTTTTA 240 

CTATCTCCCT TTTACCCTAC ACTCATTTTA ATCATCCACA ATATTTTGTT CTTTCAAAAT 300 

GAATTATTAC TTATTCTATC GGTTTTATCT CATGATGTCA TCTAGTTTTT CTTTATTTAC 360 

AAAATTTTCT AATAACTAAA aGCCTTTCAT AAATTTATAA AACAGTTTCA AATTGTAAAA 420 

CATAAGCCCT ATTGTTACAA TTTAAGTATT GTTAGAAACA CAACTTACAA CAAACTTAAG 480 

TTTTATAATG CAACAAATCA TAAGCGTTTT ATAGTTTTGA GTAGTTAATA GGAGGAAATT 540 

CAAAATGACA AAAATGAATG TTGAAAGTTT cAATTTAGAT CaTAcTAAGG TGGTTGCCCC 600 

ATTTATTAGA TTAGCGGGAA CGATGGAAGG ATTAAACGGA GATGTCATTC ACAAATACGA 660 

CATTCGTTTC AAACAACCAA ACAAAGAACA TATGGATATG CCCGGACTAC ATTCATTAGA 720

ACATTTAATG GCTGAAAATA TTAGAAATCA TAGTGACAAA GTTGTTGATT TAAGTCCTAT 780 

GGGTTGCCAA ACTGGTTTCT ATGTATCATT TATTAATCAT GATAATTATG ATGATGTATT 840 

AAATATTGTT GAAGCAACTT TAAATGATGT GCTAAATGCT ACTGAAGTGC CTGCTTGTAA 900 

TGAAGTACAA TGTGGCTGGG CAGCAAGTCA TTCATTAGAA GGTGCTAAAA CTATCGCTCA 960 

AGCATTTCTA GACAAACGAA ACGAATGGCA TGATGTTTTC GGTACAGGAA AATAAATCTT 1020 

AGTCAATCAA GTTAATCAGA AAaGCAGTCG AACAATGATT TTACAATCGC CATTGTCCAA 1080 

CTGCTTTTTA TTATGCTTCA AAGTCAAAAA ATCGAACAAA TGAAAAAGTA AAATCTTTAA 1140 

CATTTGTCCG ATTTATTTGA GAACCACTAT AATTTCTTAA TTAGTCCCAT TAACACGAAC 1200

TGCATAGGTA ACCTTAAATA TAGTTGCCAT GTTGGCAATT GTTTATCTCC TAAAGGTAAC 1260 

TTTTTAACTG CCATATAGAT ATTAGCTGGG AATACAGCTA GCAAGAATAG ATTGATTGTA 1320 

TTTTTCAAGC ATTGAGATGG TCTTTTAATT AAAAGTGCAA GTCCAAATAA TATCTCAAAG 1380

ATTCCTGTAA CAAQAACCGC TGTTTTTCGA AGTGGCAAAC ATTTCGGTAT GATATTTCTA 1440 

AATTGTCGTT CTCGTGTAAA ATGCAATACA CCTATTACAC TAAAACCTAT TCCTAATAAA 1500 

TATCTTAGTA TGTTCAATCA GCTTCAACTC CTATTCTGTA ATGATTTTAT GAATTAATGT 1560 

AGGCGATACA ACATGaTCAG CAATTGTTAT GCTTGAATCT AATTTTTTAA CAACATCGTC 1620 
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AATTTTTTTA TTTAAAACAA TACCAACCGC TAAATCAATA TCATCCTCTT TTGTTAAACX5 1740 

TCCCGCTCCT AACATCATCG AAGCGACACC TATATCGTTA GAGACTAATT CAGTCACATA 1800 

ACCTGATTTT TTAGCTTTAT ATTCAATTTG ATATTGAGCT TGTGGCAAAC GCTCTGGATG 1860 

GTCAATAACA GTTTCGTCGC CACCTTGGTT TTTAATAAAT GTTTTGAATT TTTCTAATGC 1920 

TGCACCTGAA TTAATTGCCT CAATTAGCAA CX3CTCTCGCT TCTTCAAGCG TTTCAGCTTT 1980 

GTTTGCAAGT ACAACCAITT GAGAACCTAA TGTTAATACA AGTTCTGTTA AATCTTTCGG 2040 

ACCTTGTCCT TTCAACGTAT CAATTGCTTC TTGTAACTCA AGCGCATTGC CAATCGCACG 2100 

TCCAAGTGGC TGATTCATAT CAGAAATAAT CGCCATCGTA TTACGTCCCA CATTATTACC 2160 

AATACGTACC ATTGCGTGCXj CTAATGCTTC AGCATCTTCT AATGTTTTCA TAAATGCACC 2220 

GCTACCAGTT TTTACATCTA ATACAATTGC ATCTGCACCA GCAGCAATCT TTTTACTCAT 2280 

AATTGAAGAG GCAATTAATG GTATTGAATT GACAGTACCA GTAACATCCC TTAAGGCATA 2340

TAATTTTTTG TCTGCAGGAG TTAAATTTCC TGATTGTCCT ACAACT6CCA CTTTATTTTC 2400 

ATTAACCAAT TTCACAAATG TTGCTTCATC TATTTCAACA TGAAAACCAT CAATTGCTTC 2460 

TAATTTATCA ATCGTACCAC CTGTATGACC TAATCCACGC CCACTCATTT TTGCAACAGG 2520 

AACATCTACA GCTGCTACTA ATGGTGCTAA AACCAATGTA GTTGTATCTC CTACACCACC 2580 

TGTTGAGTGC TTATCTACTT TGACACCTTT AATATCACTC AAATCTATCA TATCACCAGA 2640 

ATTAACCATA GCCATCGTTA ATGCTGCACG CTCATCATCA TTCATATCTT GGAAATAAAT 2700 

CGCCATTGCT AAACTTGATG CTTGGTAATC AGGAATATCC CCTTTAACAT AGCCGCCAAT 2760 

AAAGAAATTA ATTTCTTCCG TTGTTAGTGT ATGACCGTCA CGCTTTTTCT CAATAATGTC 2820

TATCATTCTC ATTTTTATCA TCCTTTTCTT AAAAAGCTTA GGACAAAGCA TCTGCGCTTT 2880 

CTCTAGTCCA TTTTTAAAAG CACAAGOGAA AATTATTATA GCAAGCTATC 6ATTTCATTT 2940

TTAATATCAC AATTTCAATO CGATGTTATT ATTCTTAAAT AGATTGGTTA TAACGTTAAA 3000

GTCCCTATTA AATTATCTTA GAATCATCAT GGCATTTATG ATGTCTTAAA GCTGATATCG 3060 

ACATACTTAT ATATGGTTAC GATGTCCCAT GCTTACATAT TTTTATAAAA TTAGTAATCT 3120 

GAATCTGCTT CTAAACCTTG CATAATTTGa ACGCCTGCGC TCGCACCAAT ACGTGTCGCA 3180 

CCTGCTTCAA CCATTTTATT GAAATCTTCT AAATTACGTA CGCCACCTGA TGCTTTTACT 3240 

TCTATATCAG CACCTACTGT ATCTTTCATT AATTTAACGT CTTCTGCAGT CGCACCGCCA 3300 

CCTGCAAAAC CTGTTGAAGT TTTAACGAAG TCCGCACCAG CCGCTTTTGT TAATTCACTC 3360 

GCTTTTACAA TTTCGTCATG GTCCAACAAT ACCGTCTCAA TAATCACTTT TACTGTGTGA 3420 
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CCCAAGCTGC GTATGATATC GCAACTTGGG ATTTTCTGTA TTATCTACTT TGCAAQTATG 1440

ACGTTGGGTC TACTGCATAT TGATTACCGA TGCCACCAGA CATACGTTGG AAGTGTACGT 1500

GAGGCGCTGT TGAATTACCC GTACTACCTG AATATGCAAT TTGGTCACCA GCTTTGACTT 1560

TATCACCAGC TGAAACAGTT AAACGATTAT TATGCATATA CCATTGGTAG TTATTACTGT 1620

TCGCTTCTTT AATCGTTACT TGATTGCCGC CACCATAGTT ACTCCAACCT GCTTGTACTA 1680

CTGTACCATC AGTTAATGAG TAAACTGGTG AATTTTCAGG CATTGCATAG TCGACACCGT 1740

AATGCGCACC ACCACCGTGA TATTGTCCAT ATGGTTGTAG TTGTTTACGA CTTGTTAACC 1800

AGCTTGCGTC TTTCGCATGA CCACTAGcTG TCXXTTTTACT TGCTGATCCA CCATTTTGAT 1860

TAGATGTGCC ATTAGGATAA TTGACCTTTC CATTACCATC ATGGCTGTTA TACGCTTGGT 1920

TGTTGTTACT ATGTGAATAA TAGCTCGCGT CTGGACCTAC ATTTGATTGA TAACCATATT 1980

GATTAATATG CTGTTGGCTT TGACTCGCTG TGTAGTCATT GTTATCTCCT GCTGTTGCTG 2040

GATTCACATA TGTTTGGCCG CTTCCATTGG CATTTGCATT TTTTGGATAA CAGTTATAAA 2100

AATAATGCGT ATGTCCTTGA GCATCTACGA ATGTATAGCT ATATTCTTTA TTATCAAACA 2160

TTGCTTGATT CCAGTTACCA TCAGGTGTGT GATGATAATC CCCATTAGAA TCAATTGTAT 2220

AATAAGTACC ATAAGATACG TCTTGTGATT GTGTTGACAT TTGTGTATGT GCTTGTTGGG 2280

TGTTTGTCGT TTCTGCTGCA TCTGCTTGAT GCGCCATTGT AAATGTAGCG AAsCCATCGT 2340

TGCAATCGCT GCTGCTGTTA ATTTTTTCAT GTATAAAACA TCCTCCATTA AAGTTAAAGT 2400

TAGTTTTCAA TTAAACTGTA CTGCACATAC TAAAAGAATT AGACAACTGA GTAAAGGATT 2460

TAATTCTCAT TTTCCAACTA TTTAATATTC CCGAAATGTT TTACTAAACT CATTACATTG 2520

TCATTACAAA ATAGCCATAC ATTGATATTA AAATGACATC TCTCACTGCA TTCGTTTAAC 2580

CTTTTATAAA TTTTCAAAAT TAACAACTAA TC6TTCGTCC ATGTTCGCGA TTCAACGCTA 2640

ATGCATGATA GTAATCATCC ATCAAATCAT ATCAACCAAA TTCCATTATC AATCGCTATT 2700

GATTGTCATT CAACTTTCTA ATAGTGATAT GCTTCTCAGG CTTAAAAATC GTCATATCTT 2760

TTCTATTAAT TAAATCATCT GTGAGCTTTA ATGCTACTAA TTCATTGCTG CCATAATACT 2820

TAATATATAA CGTTCTTGTA GTTAAATTTA TTACX5GTCTG ATACATCGTA TAGTGATTGT 2880

CAGCATCATG CGGACGTACA ATTCCAATCG GTATATTTAC CGCATCTAAT AAATAAAATG 2940

CATTCATTAA ATCCATTTCT TTATCATTGT TTTGAGCAAT GTTTGCTTTC ATAAATGCCA 3000

TTCTCACAAA GCGCTCAGTT GAAGTAAATC CACCTGGCAA TCCAAATGTA CCTGCTTCAT 3060

TGCCTAAAGG TTCAATCGTT ACACCTTCCA ATAAATTTGC TGTTGCTGGA TAAGGAGAAA 3120

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CACCAATAGG ATTATCTTTT ATAACCACTT CACCCTCTTT AAATGAAACT TCGACTGTAT 3240 

GTCCAGTTGC ATCGGAAACA TGATAATQCA ATGGCQGAAC TTCACCGATQ TCATTTAAAT 3300 

ATACAGCTAC AACATGTATT TGGGATGCTT GTTGTTTCAT ATCTTCAATG CTTGTTGTAT 3360 

ATCCCAAAAT CCATGTCACA ATTTCATTTT GCGTAATATT CATCGCGTCC GCTTTGTGTG 3420 

TTGATCCATA TGAACTATAA CCTCGGAAAT ATTGTGTTGA AATGGCAACG CcATGTtCAT 3480 

TAACACCATC ACCATAAATA AAACCTTCCA TATCTGTTCC TGTGCCAATA AAGCCATATT 3540 

GCGTTTGGCC TGTCGTGCCA GTGCAAGATT TCCAACGATA ATTTCTAGGC GTCACTGCTG 3600

GCX3AACCATC TAATGGATAA TCATAATCCA TCGTGCGTCC AAGAAGTACT TGATTATTTA 3660 

AAGTTTGTAT TGTGAATCCT GTGCACATTG TTCTCACTCC TCTGTACCTT CATTTACTTT 3720 

AATCACTTTC AAATAAAGCT GTTTCACTTA AACATACTAT AAAAAATCAA TTATACAAGC 3780 

AATTAATTGA TATTCATTCT CAATAACTGT G6TATGATAT GTAAGGAAAT CATGACTTAT 3840

6T6TGA6TGA ACGATCATCT ATACATCCGT TCACTTCATC TCATGACTTT CTATATTTAA 3900 

TTTTTACAAG GAGTGACATC TGTGAATAAC ACACAATCTT CACCACGCAG TAATATTATT 3960 

ATTGCGATTA TGTTGTCTGC ATTAACATAT TGGTTGTrTG CACAATCATT TATTAATATA 4020 

GGACCTCTCG TTGGTCAAAC ATATCAAACC TCTCCTGCCG TGTTAAATTT ATCTATTAGT 4080 

TTAACTTCCT TCGCCACAGG TATCTTCATG GTGGCTGCAG GTGATATTGC TGATAAAATA 4140 

GGACAACTGA GAATGACATA CATGGGTCTC ATAATCAGTA TGTTTGCATC TCTTCTATTA 4200 

ATTATATCGG ACATCACTGC ACTGCTCATC ATCGGTAGAA TTTTACAAGG TCTATCAGCA 4260 

GCTATCTTGT TACCTTCAAC AGTTGGCGTG TTAAATAATC AATTTAAAGG AGAACATTTA 4320 

AGACGAGCGA TTAGTTATCT AATGATTAGT ACTGTTGGTG GCATCGGCCT AGCTGGTGTT 4380 

ATCGGCGGTT TAATTCCCTC AAATTTCGGA TGGCAAACGA ATTTCATCAT TAGTATAGTC 4440 

ATTGCTTTCA TTGCCATATT GCTTCTAAAA GGCACACCTG AAAAAGTAAG TCAACATAGC 4500

CACCGTCATC CATTCGATTA CAAAGOTATO TCGATTTTCO CTGTTATQAT TOGTAGCTTT 4560 

ACATTATTGT TAACACAAGG ATTCGAACAA GGTTGGTTTA GTACATTTTC AATCATTTGT 4620 

CTGAGCATTT TTATCaTCAc TACGTTGATA TTCATCATCA TCGAACGTCG ACATGAAGTA 4680 

CCTTTTATTG ATTTCTCAGT ATTACGCAAC CGTCCGTTCA TTGGTGCATT TTTAAATAAC 4740 

TTTGTTTTAA ATAGCGGTCT AGGCGTAACA GTGGTCTTTT TCATATATGC TCAAACACAC 4800 

CTTGGTTTAt CAgcTGCGCA ATCTGGACTT GTTACATTGC CATATGCCAT TGTGGCAGTT 4860 

GCGATGATTC GTTTAGGTGA AAAAGCAACA TTACGTTTCG GTGGCAAATT GATGCTCATC 4920
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TCACAATATG TCATTGCAGT TATCATTGGT TTCX3TCATAT GTGCGATAGG TAATGGTTTA 5040

GTCGCAACAC CTGGACTTAC GATTGCAATT TTCAGTATGC CTAATGAAAA AGTTGGTTTA 5100

GCTACAGGAT TATATAAAAT GAGTGGTACA TTAGGTGGCT CCTTTGGTAT AGCACTAAGT 5160

ACTACAGTTT TCAGTATGTT ACAACTAAAC TATGCACCAA GTGTAGCTGC AACCGTAACA 5220

TTTATAGTCA GCATTGTATT GATGATCCTT GGCTCATTGT CTGCATACAT GATCATTCCA 5280

AAAACAGTTA AATCTTAAAT ATAATAGAAG AAITATGTTT CGAAATATCT TTATCACTTT 5340

AAAATGATAT ACAAGAAATC CAAGAAAAAT AAGCGAACTG AATAAATAAA GATTCAATTA 5400

ACGCATCAGT ATTAGGATTC ACTCTAAAAC GATTAATAGT TTTATAAGAA GGTGTTTGAT 5460

CTTGAGCTAA CCACATCATT CGAATACTGT CATGAAGTAA TTTCTCTATT CTACGACCAG 5520

AAAATACAGA TTGAGTATAT GCATATAAGA TGATTTTTAA CATCATCTTT GGATGATAGG 5580

ATGTTGCGcC ACGATGATGT CTGAATTCAT CGAATTTGCT ATCAGGTATC GTTTCAACAA 5640

TTTCATTAAC ATGTCGCGAA ATATCATTTT GAGGAATTCT AACAGAAGTT TTTATTGGTA 5700

GTGTAAGTTG GGCAAAGTGT CTTATTTTTT TAAAGTATTT CAAAGTAAAA TTACATGTTA 5760

ATACGTAGTA TTAATGGCGA GACTCCTGAG GGAGCAGTGC CAGTCGAAGA CCGAGGCTGA 5820

GACGGCACCC TAGGAAAGCG AAGCCATTCA ATACGAAGTA TTGTATAAAT AGAGAACAGC 5880

AGTAAGATAT TTTCTAATTG AAAATTATCT TACTGCTGTT TTTTAGGGAT TTATGTCCCA 5940

GCCTCTTACT CTAATTATAT TCACTATCAA TTAGACAAAA TGGCCATTTT CAAATATCAC 6000

GCGTTGTTTC TGACCTTGAA TATATTTATT ATAATTCTCT TTTTGAAAAT CAGTTAACAT 6060

TAATTTAGAT GTACCGTATT TTAACACTTT TTGCATTGTT TCTATTCTCA TTTTTCTAAA 6120

TAACCATCCA TCTTTTAACA CAATACX5ATT AACAGCATCA TATGATAATT CTACTGTTTC 6180

TTTAATTTCA AATGTCTTGA ATGAAATAAT CGTGCACATT AAAAACGTAT CACCAAAGTA 6240

ATAAACATCT AAATCATCAC GTTTATGTTG TCCAACTAAC AAACGACCAT ATTCGAACTC 6300

TTTTTCTGGA TATTTCAATT CTAAAAAACT AATAATCTCT TCTTCTTTTA ATTTGAATTG 6360

CATTTAAAAA CATCCTCTCT TAAGTTTTAA CAAGCCTTAA TTAAAAATTT TTTCAATCAC 6420

ATAGTTCAAT ATACATCATT TCGTTATGTT TTTTAATACT TTGTTCAAAA ACAAATATTT 6480

TATTCTTTAA AATAATGACT TTTGTATTTT TAATCACAAT AAACATTTTA AAATTCTTGT 6540

TATCATAATC ATTAAAAGGT ATTAACCTTA ATAATATATT CTCTCGTCTC AACCTTAATC 6600

GTATACTTCA GACGTCTGTT TGTAGACAAT AAAAGTCATT CACGTCTTCA TATGTCATCA 6660

AATGTTTATC ATGATATGAT GAATATAATA ATCGGGTATA TAACTGTATG ATTAATTACA 6720



EP0786 519 A2 

TGGTGTATCA GTTACAACTG TGTCACATAT TTTAAATCAT AATGATAGTC GTTTTTCCGC 6840 

AACAACGATA AAAAACGTAC ATGCTGTTTC AGAACGTTTA GGCTATGCCC CTAATAAACA 6900 

TGCAAAACAA TTGCGCGGCA GTAAAATTCA AACTATTGGC GTCATTTTGC CTAGCTTAAC 6960 

AAATCCGTTT TTCTCAGCAC TGATGCAAAG TATTCATGAC CATAAACCAT CTGATGTTGA 7020 

TTTATGCTTT TTAACATCTA CAGCAACTGA TTTGTATGAC AATATTAAAC ATTTAATTGA 7080 

TCGAGGTATT GACGGATTAA TTATCGCACA ATACATATCA TCCCCGGACG CCCTAAATAA 7140 

CTATCTAAAG AAACATCATO TACCTTATGT CGTACTGGAT CAAAATGACC ATCAAGGCTA 7200 

TACAGATTTT GTTCGGACAA ATGAATATCA AGGTGGACAA CTTGCAGCAC AACATTTAAT 7260 

AGAACTCGGT CACAACCATA TGATAATTGT TGCACCATAT GACATGATGG CGAATATGTC 7320 

GACTCGTGTC GCTGGATTTG TCGATACTTT GCGCGCGAAT CAATTGCCAG AACCACAAAT 7380 

CGTCCATACT GAATTATCTA AGCGCGGTGG GCTAACCATT GTTGATGACA TCATGGTTCA 7440

ATCTGCCACT GCAATCTTCG CTATTAACGA TGAACTCGCT ATTGGCATTT TACGAGGACT 7500 

AATTGAACAT GGCATCAGTA TCCCGAAAGA TATCTCATTA ATAGGTTATG ACGACATTGA 7560 

TTATGCAGCG TACGTCTCGC CACCTTTAAC TACTGTGGCA CAACCTATAA CTGATATTGG 7620 

CAAAACATCT TTAACCTTAT TACTTCAACG ATTACAGCAC TTAGATAAAT CCATT6ATAT 7680 

GATTGAATTA CCAACGACTT TAAAAATTCG TGCAACAACT GGCTATCATC TTTCAAACTA 7740 

ACTACGTATC TTCCGAAATA TACTCATCAT TGTTAGGCCC TTAGCGTTGC TTTAATGCTG 7800 

AGGGTTTTTA ATCATAAITA TTTTACTAAG AAATTAAAAT AATAATGTAT GAATTTTTAA 7860 

ATATGATTTA AACGTTTTCA GTTTTTATGA AAACGCATGC ATTTTACAAA TAAAAATGGT 7920 

ACGATGGCAC TGGTAAAACG TTTTACTAAA AACAAATCAT GAGGTGTATA ACATGAGCAT 7980 

TGTTGCATTA CTTATCGGGT TAGGCCCCTT AATTGGCTGG GGCTTCTTCC CAACAGTCGC 8040 

TTCAAAGTTT GGTGGTAAAC CTGTACATCA AATTATCGGT GCTACTGTAG GTACX3TTAAT 8100

CTTCGCTATT TTATTAGCCG TAGTCACATC AAGTGGCTTC CCTACTGGAA CCAATTTGCT 8160 

ATTCGCCTTA TTATCAGGTG CAGGATGGGG ATTCGGACAA ATCATTACAT TTAAAGCGTT 8220 

CGAATTAGTC GGCTCATCTC GTGCCATGCC AGTCACAACA GCATTCCAAT TATTAGGCGC 8280 

ATCTTTATXX5 GGTGTCTTTG CATTAGGAAA TTGGCCAGGC ATTGGTCATA AAATCATTGG 8340 

ATTTACAGCT TTAGTCGTTA TTCTAATTGG AGCGCGTATG ACAGTTTGGA GTGAACGCAA 8400 

AGAAGCAAGT AACGCCAAAA ATTTACGTCG TGCAGTGGTA CTTCTGTTAA TTGGTGAATT 8460 

TGGATACTGG TTATATTCAG CTGCACCGCA AGCAACTTCT ATTGATGGCC TAACTGCCTT 8520 
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AGCAGAGAAT CCATTCCGTA ATAAAATTAC GTGGTTACAA ATTATTTCAG GTTTCTTCTT 8640 

TGCATTTGQT GCTTTAACAT ATCTTATTTC AGCACAACCT AATATGAATG GTTTAGCAAC 8700 

TGGATTTATT CTTTCTCAAA CATCCGTTGT GCTTGCTACA TTAACTGGTA TTTATTTCTT 8760 

AAAACAACAT AAAACGTCAA AAGAAATGGT TATTACAATC ATCGGCTTAG TACTCATTTT 8820 

AGTAGCCGCT TCTGTTACAG TATTTATAAA ATAAGGA6TG TAGATGTCAT GAAAAAATCA 8880 

GCTGTTTTAA ATGAACATAT TTCAAAAGCA ATCXXXSACAA TTGGTCATTT TGATTTATTA 8940 

ACX3ATTAATG ACGCTGGCAT GCCAATTCCA AATGATCATC GTCGTATCGA CCTAGCTGTA 9000 

ACTAAAAACT TACCACGCTT TATTGATGTC TTAGCTACAG TGTTAGAAGA AATGGAAATC 9060 

CAAAAAATAT ACTTAGCAGA AGAAATAAAA GAACATAACC CTACACAATT GCAACAAATT 9120 

AAACAATTGA TTTCATCGGA AATCGAAATC ATTTTCATTC CTCACGAAGA AATGAAAAGT 9180 

AACTTAGCTC ACCCATTAAA TAAAGGTAAT ATTCGTACTG GTGAAaCAAC GCCCTACTCT 9240 

AATATTGCAT TAGAATCGAA TGTTACTTTT TAAAAGTTAT AACTTGAAAG GAGCGTACAC 9300 

ATGACCAACA AAGTTGTTAT TTTAGGTTCA ACGAATGTCG ATCAATTTTT AACAGTTGAA 9360 

AGATATGCAC AACCAGGCGA AACATTACAT GTTGAAGAAG CACAAAAAGC ATTCGGCGGA 9420 

GGTAAAGGTG CCAACCAGGC TATTGCCACT GCACGCATGC AAGCAQACAC AACATTTATT 9480 

ACTAAAATTG 6CACTGATGG CGTTGCTGAT TTCATCTTAG AAGATTTTAA AGTAGCTCAT 9540 

ATTGATACAT CATATATTAT CAAAACAGCT GAAGCAAAAA CGGGCCAAGC CTTTATCACT 9600 

GTGAATGCAG AAGGACAAAA CACCATCTAT GTTTATGGTG GTGCGAATAT GACGATGACA 9660 

CCTGAAGATG TTATTAACGC AAAAGACGCT ATAATCAATG CAGACTTTGT CXStTGCACAA 9720 

TTAGAAGTAC CCATCCCGGC TATTATATCT GCATTTGAAA TTGCCAAGGC ACATGGTGTG 9780 

ACGACAGTAT TAAATCCTGC ACCAGCGAAA GCATTACCTA ATGAATTATT ATCATTAATC 9840 

GATATTATTG TGCCAAACGA AACAGAAGCC GAATTGTTAT CTGGGATTAA AGTAACTAAT 9900 

GAACAATCTA TGAAAGACAA TGCCAATTAC TTTTTATCTA TAGGCATTAA GACTGTTTTG 9960 

A1TACGCTAG GTAAGCAAGG TACATATTTT GCTACTAAAA ATCAAAGCCA ACACATCGAA 10020 

GCTTATAAAG TAAATGCGAT TGATACAACT GCTGCAGGCG ACACATTTAT TGGTGCATTT 10080 

GTCAGTCGCT TAAACAAGTC GCAAGATAAC TTAGCAGATG CTATTGATTT TGGTAATAAA 10140 

GCGAGCTCAC TCACTGTACA AAAACACGGC GCGCAAGCAT CTATTCCTCT ACTAGAAGAA 10200 

GTAAATCAAG TTTAAATGAA TCAAACACAG CTATGATATG AAGGTTTAGC ATATAACATG 10260 

CAACATTCGT ATATCATGGC TGTGCTTTTT TATCTTTATA AAACATCATC TATTAGAAAT 10320 
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TTTGTAATCT TTTTAACTTC CAAATTaTCG CATATAAATA TGCTATATTA ATGATAATAA 10440

TTATCAATTA AAAGGAG6TT ATGCTATGTC TAAAGAAGCT GGTCATACAT TTTTAGCTAA 10500

ATTAGGAAAA ACTCGTCTAC GCCCCGGTGG TAAAGAAGCA ACAGATTGGT TAATACAACA 10560

AGGGGCATTT TCACAAGATA AACAAGTGTT AGAAGTGGCA TGTAATATGT GCACAACATC 10620

TATTTATCTA GCTCATACAT ATGGCTGTCA CATTCAAGGC GTTGATATAA ATAAGAAAGC 10680

ATTAGAAAAA GCACAGGAAA ACATTTCAGC AGCAGGTCTT GAATCATATA TTCAAGTTCA 10740

ACAAGCGAAT GCTGTTAAAT TGCCCTTTGA TGACAATCAA TTCGATATCG TTTTAAATGA 10800

AGCAATGTTA ACAATGTTAC CCATCGCCAT AAAGGAAAAA GCATTACGCG AGTACTACCG 10860

AGTCTTAAAG CCTGGGGGTA TCTTGTTAAC ACATGATATT GTCATCGTTA ATGAATCACA 10920

TGCCACACAT GTTGTTAAAT CATTATCTGC AGCAATTAAT GTCAATGTCT CACOGCAQAC 10980

GAAACTTGGC TGGTTAGATT TATATAATCA AGCTGGTTTT AATCATGTGC ATTATCATAC 11040

TGGTCCAATG AGTTTAATGA CACCAAAAGG TTTAATTTAT GACGAAGGTA TTGTTGGAAC 11100

TATAAAGATT ATCAACAATG CTTTGAAAAA AGAAAATCGA CCAATGTTTT GTAAAATGTT 11160

TAAAACGATG ACTAAATTGC GTAAAGATAT GAATTATATT ACTTTTGTCG CTAAAAAAGA 11220

GCACTAA^TA TAATGCCACT AACTGTACTT TGTATCTATG TTTGACTATC ACTTTAATTT 11280

CTTTGTGACA CTAATCATCT ACTTAACAAT ATCGTTATCG TTGATTAGTA AGTCATCAAT 11340

TTTGGTTAAA GACTTTCATA AACACTCAAA CATTAACACT ATACATAGTT AGTGGcATTA 11400

TTTTTTyCTn AAAATTTTAA CmTCmCGGGr TtGGGAmCrG AAaTGrtAwT TcGCrtnAAtT 11460

TAwTcT 11466



(2) INFORMATION FOR SEQ ID NO: 445: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2176 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 445: 
TTACATAGTT AACACTAGTT AATCTATTAG TTAACATTAG TTAATAATTA GTTAATTTCC 60 
ATTTGTATTC TCATGTGATA AATTCTAAAA GCATACaATA AATTTAATAT GTAAAAAGAA 120 

AGGGAATACA CATGAAAAAT AAATATATCT CGAAGTTGCT AGTTGGGGCA GCAACAATTA 180 
CGTTAGCTAC AATGATTTCA AATGGGGAAG CAAAAGCGAG TGAAAACACG CAACAAACTT 240 

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CmTyCAAArG AcACaGACAT TACTAGCCAA CGATTTAGCT ATnACTCCAA ACCTTCCATT 2160 
GGATTTGOTA AAGGnT 2176 
(2) INFORMATION FOR SEQ ID NO: 446:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1557 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 446: 

AAAAGCATGG CTTAAATGAA GTACGCTATA ACAAATTACA AGAACATGCT ATTOTTATGC 60 

ATCCGGCACC TGTGAATAGA GGAGTAGAAA TACAAAGCGA TTTAGTAGAA GCTTCAAAAT 120 

CAAGAATTTT TAAGCAAATG GAAAATGGCG TTTACTTAAG AATGGCAGTC ATTGATGAAT 180 

TATTAAAATA GGTAAGGGGA CGAAAATGAT GAAATTAATT AAAAACGGTA AAGTATTACA 240 

AAATGGCGAA TTACAACAAG CAGATATTTT AATTGATGGT AAGGTAATTA AACAAATTGC 300 

ACCTGCAATT GAACCAAGCA ATGGTGTTGA CATCATAGAT GCGAAAGGTC ACTTTGTOTC 360

ACCTGGATTT GTCGATGTTC ATGTTCATTT ACGTGAACCT GGTGGTGAAT ATAAAGAGAC 420 

AATTGAAACT GGTACTAAAG CTGCTGCTAG AGGCGGATTT ACAACTGTAT GTCCAATGCC 480 

TAACACAAGA CCGGTACCAG ATTCTGTAGA ACATTTTGAA GCTTTACAAA AATTAATCGA 540

TGACAATGCT CAAGTACGTG TATTACCTTA TGCTTCAATT ACAACACGTC AATTAGGTAA 600 

AGAATTGGTT GATTTCCCAG CACTAGTAAA AGAAGGTGCC TTTGCOTTTA CA6ATGACGG 660 

TGTAGGAGTA CAAACTGCAA GCATGATGTA TGAAGGCATG ATTGAAGCTG CAAAAGTAAA 720 

CAAAGCCATC GTAGCACACT GTGAAGATAA TTCATTAATC TATGGTGGTG CAATGCATGA 780 

AGGGAAACGC AGTAAAGAGT TAGGTATACC AGGTATTCCA AACATTTGTG AATCTGTTCA 840 

AATCGCAAGA GATGTACTAT TaGCTGAAGC AGCAGGTTGT CATTATCATG TATGTCATGT 900 

TTCTACTAAA GAAAGTGTTA GAGTCATTCG TGACGCTAAA CGCGCAGGCA TTCATGTTAC 960 

AGCTGAAGTT ACACCACACC ATTTATTGTT AACAGAAGAT GATATTCCTG GTAATAATGC 1020

CATTTATAAA ATGAATCCAC CATTGAGAAG TACTGAAGAT AGAGAGGCTT TGTTAGAAGG 1080 

GTTACTAGAC GGTACAATTG ACTGTATCOC AACAGAcCAT GCACCACATG CACGTGATGA 1140 

AAAAGCACAA CCAATGGAAA AAQCaCCATT CGGAATTGTT GGTAGTGAAA CAGCATTCCC 1200

ATTATTATAT ACGCATTTTG TAAAAAATGG TGATTGGACA TTACAACAAT TAGTAGATTA 1260 

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TTATGCAGAT TTAACAATCA TTGATTTAGA TAGTGAACAA GAAATTAAAG GAGAAGATTT 1380

CTTATCAAAA GCAGATAATA CACCATTTAT CGGCTATAAA QTTTATGGAA ATCCGATCTT 1440

AACAATGGTT GAAGGCGAAG TTAAATTTGa GGGGGATAAa TArTATGCAA gCAAACGTTA 1500

TCTAGTGTTA GAAGACGGTC TTTTACQAGG CTACCGTTAG GTCTGATAAC TTACTGA 1557




(2) INFORMATION FOR SEQ ID NO: 447: 










(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1799 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 










(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 447: 






GCTAGAAATRl TTGmATGaCA ATACAACTCT GTTAAAaTGA TGGAC6TAGA CAAATATGCG 60

TATTGACGCT TTATTTTAAA AATTaACATG CTTATAACAT GTTTATAGAA GGAGATTAAC 120

CTATGAACTA TCaAGTTCTT TTATATTATA AATATATGAC GATTGATGAC CtGAACAGTT 180

TGCTCAGGAT CACTTAGCCT TTTGTAAAGC ACACCATTTA AAAGGTAGAA TTCTTGTTTC 240

TACAGAAGGT ATTAACGGCA CATTATCTGG TACAAAAGAA GAAACCGAAC AATATATGGC 300

ACATATGCAT GCCGATGAAC GATTCAAAGA TATGGTGTTT AAAATTGATG AAGCTGAAGG 360

ACATGCTTTT AAGAAAATGC ATGTACGTCC TCGAAAAGAA ATCGTTGCTT TAGATTTAQA 420

AGATGACGTC GATCCAAGAC ACACAACTGG CCAATATTTA TCACCTGTAG AATTTAGAAA 480

AGCTCTTGAA GATGATGACA CAGTCATTAT TGATGCACGT AATGATTATG AATTTGATTT 540

AG6TCATTTC CGAGGTGCAA TTCGTCCAAA TATCACACGT TTTAGAGATT TGCCTGACTG 600

GATTAAAGAG AATAAAGCGT TATTTGCAGA TAAAAAAGTG GTTACGTACT GTACTQGTGG 660

CATTCGATGC GAAAAATTTT CTGGATGGCT TTTAAAAGAA GGTTTCGAAG ATGTAGCTCA 720

ACTTCATGGC GGTATTGCTA CATATGGTAA AGATCCTGAA ACAAAAGGTG AATATTGGGA 780

CGGTAAAATG TACGTATTTG ATGACCGTAT CAGTGTTGAT ATCAACCAAG TTGAAAAnAC 840

AATTATTGGT AAGGATTGGT TTGATGGCAA ACCATGTGAA CGTTATATTA ATTGCGCTAA 900

CCCAGAATGT AATAAACAAA TATTAGTTTC TGAAGAAAAC GAAACTAAAT ATTTAGGTGC 960

ATGCTCTTAT GAATGTGCTA AACATGAGCG TAATCGTTAT GTTCAAGCAA ATAATATTAG 1020

TGATAATGAG TGGCAACAAC GTTTAACAAA CTTTGATGAT TTACATCAAC ATGCTTAGTT 1080

TTAATTAAAT ACCTTTCAAA ACACGCTTTG AAAATCCGAT TTATAAAGGT TTTTCAAGGC 1140
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TAAATTTTAA TACTGCGGGG TGTCTTAAAA TGCACATTTT AGTAACAGGG TTTGCGCCTT 1260 

TTGACAATCA AAATATCAAT CCCTCATGGG AAGCTGTGAC TCAACTAGAA GATATTATTG 1320 

GCACACATAC AATCGATAAA TTAAAACTAC CAACCTCTTT TAAGAAAGTA GATAATATTA 1380

TAAATAAAAC GTTGGCATCT AATCATTATG ATGTTGTACT AGCTATAGGA CAAGCTGGTG 1440 

GTAGAAATGC CATTACCCCA GAACGTGTCG CCATTAATAT TGATGATGCA CGTATTCCAG 1500 

ATAATGATGA TTTTCAACCT ATTGATCAAG CCATTCACTT AQACGGTGCG CCAGCTTATT 1560 

TTTCAAATTT ACCaGTTAAA GCAATGACTC AAAGTATTAT TAATCAAGGA CTTCCTGGAG 1620 

CACTTTCAAA TAGCGCAGGT ACATTTGTTT GTAATCACAC ACTTTATcAC TTAGGTTATT 1680 

TACAAGATAA GCATTACCCT CACCTACGAT TCGGATTTAT TCaTGTGCCA TACATACCAG 1740 

AGCAGGTcAT TGGTAAACCC GATACACCAT CTCATGnCCA TTGAGGAAAA GATnAGTTG 1799 
(2) INFORMATION FOR SEQ ID NO: 448:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1341 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 448:

ACTTGGTTTT TTATTGTTTA TAAATAAAAC TCACTTAATA ATGTTTTCAT AATCTTCTTC 60 

GACTACTTAA TTCTTTAAGA TATTCGTGAA AAGAGACATT ACACTAGTTA ATTTTCAAAC 120 

AATACAAAAA GCGTCTACCT CCTACATATA ATTGTAGCGG AGATAGACGC TTAATATTTA 180 

TTTAAAAATT ATTTTAAACC ACCGAATGTC ATAACATCAC GGGCAATCAT ACTTTCTTCA 240

TCTGTTGGAA TAACGACAAC TTTAACTGGT GAATGAQGAT AGTTAATAAA TCCTTCTTTA 300 

CCACGTAGTA AGTTTTCATT TTTCTTAGGA TCCCAGTAAA CACCCATAAA TTCTAAGCCT 360 

TCAAGAACTT TCGCACGAAT TTCTACTGAG TTTTCACCGA TACCTGCTGT AAATACGATA 420 

ACATCAACAC CATGCATTCT CGCAGCATAT GATCCAATAT ATTTGTGAAT TTTAGAAGCA 480 

AATACATCTA AAGCCATTTG TGAACGTGCT TTACCTGATT CAGCTTCTTC TGATAAGTCA 540

CGTAAATCAC TAGATGTACC TGATAATCCT AATAAACCTG ATTCTTTGTT TAAGATTTCC 600 

AATACTTGTT CAGCAGTTTT ACCTGTTTTT TCCATAATAA ATGGAATTAA AGCAGGGTCA 660 

ATATTACCAG AACGAGTACC CATTGTTACA CCAGCAAGTG GTGTGAAGCC aTTGATGTAT 720

CAATAGATTT ACCGCCATCG ATAGCTGCAA TTGATGCTCC ATTACCAATG TGACATGAAA 780 

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TATGGCTTGT ACCATGGAAA CCATACTTAC GAATGCCATA ATCTTTATAA TAATGATATG 900 

GCAAGCTATA TAGATATGCT TTTTCAGGCA TTGTTTOATQ GAATGCTGTA TCAAAAATTG 960 

CCACATGAGG GATATTTGGT AATAATTTAC GGAAAGCACG AATACCCAtC AAGTTaGCTG 1020

GGTTGTGaAG CGGTGCTAAT TcGCTTAATT CTTCAATTTC CTTTTCAACC TCATCAGTAA 1080

TAGCTACTGA TTCAGGGAAT TTTTCACCAC CATGTACAAC ACGGTGACCT GTTCCATCGA 1140 

TATCGTTAAT ATCATTAATA ATATTGTGCG CTTTAAAAGC ATCCAACATG ATATCAACTG 1200 

CCTCAACGTG ATCCTTGATA TCTTGTACTG TTTTAACTTT TTCCCCGTTG ACTTCAATTG 1260 

TAAAAATTGA ATCCTTCAAT CCGATTCTTT CTACTAAACC TTTTGTTACT AATTCCTCTT 1320 

CAGGCATTCT AATTAATTGA A 1341 
(2) INFORMATION FOR SEQ ID NO: 449: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1529 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 449:

ITritSAAGAT ACTACCGATG AAAATAGACA AAAGATTTTT CAATATTTAT CACCTGAAgA 60

AGTTGCAAAT TTCTTTGATC AATTAGATAT TGATGACGAT GAATATGAGT TGCTATTTGA 120

TAAGATGAAT GCGACATACG CAAGTCACAT ATTAGAAGAA ATGTCATACG ACAATGCAGT 180

AGATATTTTA AATGAGTTGA CTAAACCAAA AGTTGCTAGT CTTTTAACAT TGATGAATAA 240

AGATGACGCG AATGAAATCA AAGCATTACT TCACTATGAT GAGGATACGG CCG6CGGTAT 300

TATGACGACG GAgTATTTAT CACTTAAAGC GCATACGCCT GTTAAAGAAG CATTATTATT 360

GGTCAAAGCG CAaGCACCAG ACGCAGAAAC AATATATGTT ATATTTGTCG TTGATGATGA 420

TGGTAAATTA GTAGGTGTTT TATCGCTAAG AGATTTAATT GTAGCTGAAA ATGATGCTTA 480

TAITGAAGAT ATTATGAATG AACGTGTCAT TAGTGTGAAT GTAGCAGACG ACCAAGAAGA 540

TGTTGCTCAA GTTATGAGAG ACTATGATTT CATGGCTGTA CCTGTTATAG ATTACCAAGA 600

ACATTTGCTT GGTATCATCA CGATTGATGA TATTTTAGAC GTTATGGATG AAGAGGCTAG 660

TGAAGACTAC TCTCGTTTAG CCGGGGTATC AGATATCGAT TCGACTAATG ATTCAATCAT 720

TAAAACAGCA TTAAAACGTT TACCATGGTT GATTATTTTA ACATTTTTAG GAATGATTAC 780

TGCGACAATT TTAGGGAGAT TCGAAAAAAC ATTAGAAAAT GTA6CGCTAC TCGCAGCGTT 840

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10 



TCGTAACATT ACGAcAGGGG AAATTAATGA GCAAAGTAAA TTTAGAATTG CATTAAGAGA 960

AGCAGGAAGT GGTGTATTAT CGGGTGTTGT ATGTTCAACA ATATTATTTA CAATTATTGT 1020

TGCAATATAT CATCAGCCAC TTTTAGCATT AATCX3TTGCA GGAAGTTTAA CTTGTGCGAT 1080

GACGGTGGGG ACGTTTGTAG GTTCGATGAT TCCATTATTG ATGAATAAAT TAAATATCGA 1140

TCCAGCAGTG GCTAGTGGAC CATTTATTAC AACAATTAAT GATATTATTA GTATGTTGAT 1200

TTATTTTGGT TTAGCTACAT CATTTATGGC TTACTTAATT TAAGGAGGAG TTATGGAGTT 1260

TTTATCTTTA GTTATTGTTG TTTTAGCAGC GTTTTTAACT CCAATAATTG TCAATCGATT 1320

AAATATTAAT TTCTTGCCAG TTGTTGTTGC AGAAATTTTG ATGGGGATTG TGATTGGAAA 1380

TTCATTTCTA AATATAGTAG AAAGGGATTC AATTCTAAAT ATTTTATCAA CGTTAGGCTT 1440

TATCTTTTTA ATGTTTTTAA GTGGTTTAGA AATTGATTTT AAAGCTTTTA AAAAAGATAA 1500

ACGCGCACGT CAAGGACAAA ATGATGAT6 1529

(2) INFORMATION FOR SEQ ID NO: 450:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1827 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 450: 

TTCTGGAAAC CAAAGTATTG TCATCTTCTA CTAGTAGTAT nGGCATCCAT ATCACCCAAT 60 

ATCATTTAGT ATATTTCATA TTTTCTCCTG ATTTTAATCG ACTTTGAAAA TCTTTAATCC 120 

GGCAGTCAAC TTCAAAGCCA TGAATCATCA TTTTTTGCAT TGGTGCTTGT ATAAAGTAAT 180 

AAATCGGCCA AAATAATCGA GGGATATAAT CGTATAGATG TATATAAACG ACTGCCGACT 240 

CTTTGATTAA TCTAAATTCT AACTTCCCTT GATTAACTGT ATATTTTTTC ACTAAACTTC 300 

CACTCAATAA AATTAAAGTT ATTATTCCAT CAGCTGTTTC TTCTATTTTA AATACTGCGA 360 

GCGGTCGCAC CTTATTCTTA ATATATATCT TAAATTGATC ATGTGATTTT TCTGTTTTCA 420 

CAAAAGTTCC TTTAGTGTAA CCCATCCATG CAATAAAATG GTTTACAACG TTCTTTAATG 480

TCCATCCCTT TGGTAAAACT ACCTTCATCG TTGATCTAAC ATCATCATAC TTTGAAACTT 540 

GTAATTCTAC ATTAACTAAA GAACGTTTAA AAACTAAATT TGTTGTTTCT ACAGGTGTAC 600 

CATATGCACC TAGGCGTTCA ATTGTTTCAT TATCATAACG ACTCCCAGGT ATGTAGATTA 660 

CTTTTTTTAC TTGATTGATA GCCGCTGCTC GACCAAAATT ATCTGCTGCG ATTAATGTTA 720 



CAGCAATATC TATTTGATTC ATTGCTGCAA CAACCTGTTC GTAATGAAAT ATATCACACT 840 

GAATCCAAGT CATTTCAACA TCATCTGTTT kTTTATTGTC TGGATATTTT GATATAGCAA 900 

AAAGTTCAGC ATCATTTTCA ATCACTTCAC TTAAATACTT ACCAATATAT CCTGTTCCAC 960

CTGCTAATAA AACTTTAGGT TTCATCTAAA ATACTCCTTT AAACTGTAAC CAAAAAACAT 1020 

ATTACTCCAC CTTTTAGTTA CATATATATT ATAATAGTAG CAAATGTTTT AAAATTTCAA 1080 

AATACTGGAG GCTTTTTATG GCCCATATTA TACGTAGAGT TAGTATCAAA GATGTAGAAA 1140 

ATTTCATTTC AATGTTAGCG AACATATACG ACGAATCTCC GTATATGTTC TACACACCAG 1200 

GAGAATATGA TCCTAGCGTC ACATCGGCTA GTAAACAATT AGAAGAATAT ATCACTTCTC 1260 

CGCATAAAGT CATCTTCGTT GCTGAAAGTG ATGAACAACT CGTTGGCTTT GCCTTTGTTA 1320 

ATACGACACC ATTTCAACGC ATTAAACATG TTGCTAAAAT TGATTTAGGT GTAAAGAAAT 1380 

TATATCAACA TCGTGGAATT GGCCAAGCAC TTCTTGATGC CATTATGGCT TGGTGTTTAA 1440 

ACAATCAAAT ACACCGAATT GAAGCAAATG TACCACTCAA TAACCAACCT GCCCTCGAGC 1500 

TTTTTAAAAG TGCCGATTTT CAAATCGAAG GCGTTTTAAA AGATAAGTTA TTTATCGATG 1560 

GTAAATATTA TGATGACTAT ATGATGGCTA AAATTCTTAA TTAAAGCTAT TTTATCATAA 1620

TCTTGTATCA GAATCGTATA ACAACGAATT TAATGGTTAC CTAATACATT ACTCATACTT 1680 

ATCAATGTTA TCTAATCTCA AATAAATACG TACACTCTTA TTCATTTATC AAATTTAAAT 1740 

TCAAAATAnA ACACCACTAA TGTGTAATTG ATTAACTATC AACTAOGATT AGTGGTGCTT 1800

TATATATGTG GTTAGTTTTC CTnACTA 1827 
(2) INFORMATION FOR SEQ ID NO: 451: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 616 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 451: 

ATATAGATTA ATGTTTGTTA TTTGTACTGT GTTTGCCAGT GACATTTATT ATAAAACATA 60 

TGACGTTATT CTATGkTGTA CTCATTtTCT ATATTGTAGG TTTATTAACG ATTAGAAGTA 120 

TTATTAAAAA GTTGAAATAT CAGGAAACAT TATTACGAGA CTAAAAAACT TCCATTGGCA 180 

TGTATGTTGT AAAGGTGCAT GTAATGTTGA ACGCCAAATG ATACGGCGTT CAGATTACAT 240

TAGCATCTAT ACGTTAACAG CATAACCAAT GGAAGTTTTC TTCGAATCTA TTCTTTTATT 300 
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AATGCGTCTT TTGAAAAATG GTCATTAAAG GCATCAGATT GCTTAAAGTC TTCGTATGCA 420 

TGTCX3ATCAG CAAATCOGAA ATAAATTTTG TATGTTGTAC CTTTAGCAGG TCTTAACAAA 480 

CGATAGCTTT TAAAGCCACC AAAGTTTCTG AAATTATCGT CTACACTAAT CAGTTTCTTT 540 

TCAAGTTGAT ATGCATGATC TTCTGTTGAT GGaATGAAgA TTGCACaATA GAAATGaTGT 600 

TCACtGAATT CACCAA 616 



(2) INFORMATION FOR SEQ ID NO: 452: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 944 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 452:

TTAXWZ1P(V2TT A&Tr*&21'm&&^ aPHTTTYS^TVl CAAATTACAT TGAAGTTATT

GAGGGAGGAA TTGAAGAAAC GCAAACGTTA ATTCACTTAC CTTTTGACTA TGTCTTCTTT 120

ACAGGAAGTG nAAATGTAGG CAAAATCGTT TATCAAGCTG CCAGCGAAAA TTTAGTTCCT 180

GTGACATTAG AAATGGGCgG TAAAtCTCCa GTCaTCGtTG ATGAAACAGC GAATATTAAA 240

gTTGCTAGTG AGCGCATTTG TTTTGGGAAA TTCACTAATG CCgGCCaAAC ATGTGTTGCa 300

CCAGATTACA TTTTAGTACA CGAATCTGTA AAAGATGATT TAATCACAGC CCTATCAAAA 360

ACGTTGCGTG AATTTTATGG TCAAAATATA CAACAAAGTC CAGATTATGG CCGCATTGTA 420

AACCTTAAAC ATTATCATCG TCTGACTTCA TTACTTAACA GTGCACAAAT GAATATTGTA 480

TTTGGTGGTC ATAGTGATGA GGATGAAC6T TATATAGAAC CAACATTGTT AGATCACGTT 540

ACAAGTGATT CAGCAATTAT GCAAGAAGAA ATTTTTGGTC CTATCTTACC GATTTTAACG 600

TATCAGTCAT TGGATGAAGC AATAGCCTTT ATTCACCAAA GACCAAAACC TTTGAGTTTA 660

TATTTATTTA GCGAAGATGA AAATGCTACA CAACGTGTAA TAAACGAGCT ATCATTTGGC 720

GGCGGCGCTA TTAATGATAC ATTGATGCAC CTAGCGAATC CTAAATTACC ATTTGGTGGT 780

GTTGGTGCCT CAGGTATGGG ACGCTATCAT GGTAAATATT CATTCGACAC TTTTACACAT 840

GAAAAAAGCT ACATTTTCAA ATCnACACGA TTAGAATCAG GTGTCCATTT ACCACCATAT 900

AAAGGTAAAT TTAAATrCAT CAAAGCTTtC tTTAAAnATT AATT 944

(2) INFORMATION FOR SEQ ID NO: 453: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4820 base pairs 
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(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 453:

TCTCCAGTAG ACCTTGTGTA TGAACAGTTT CTTTCATATG AATGAACATC GTTTTTAAAG 60 

TTTGTTTCAA CTCAGCCTTA TCAGGATAAT ATCTAGAGAC AGTCGTCTCT GGCATCTCCA 120 

TTGTATGATA TTTAACCTTA TGCAGCGACC CATGATTTTC GTTATAAACA AATGTATGAT 180 

TCACTTCATC GAAATCATGA TCTTCTCCTG CAATCCAAAA AACTGGTACT ACTTGTTGCT 240 

TATGTGTATC CGTTAATTCC TTAGATAAAG TAATGATTGa AAATATTTTA TGGAAT6TAT 300

ACAATGGTCC CCOGAAAAGC CCTGCTTGTT GTCCACCAAT CACAACTTTT GAACCATTAG 360 

CTAAATGTTG TATGTTTAAT TCTTGTTCAC TTGAAAGCTT GAAAGCTTTA AATCACTCAT 420 

ATATTCACGA ATAACATTCG CTAATGCCGC TTCTCTTCCA TTATTTTCTT TAGACATCCT 480 

TTTTTCAAAA CTAGTTTGTT GAGCTGCATC ATATTGAAAT AATCCTGTTA TTACAGGGTC 540 

ACTGCTCTTT ATTTTTGGTA TAAACTGATC TTTTTCATTT AAACTAACTA CTTTACAGTC 600 

CATGTTTTTT CTCCTTAAGT ACGCGATTAC AAACTATAGT ATAAAGTCTA TACCGGTGAT 660 

TGACAATTTT ACGGCTTGAA AATCAATTTA ATCATGGAAA ATTTATAATA TTCATTGTTT 720 

TACATTTTCA AATCAATGAA AAACACAAGT GGTTTAATGT ATAATAATAG TAGTAAACAA 780 

ATAAGGGGTA GATAAATATG AGTGAAATCA AACGTCTTGA AATTAATTAC AAAACTGACG 840 

AATTATTCGA AAACTTTAGA GCGTTTGGCA ACAAAGACTT ATACATGGTC AATGAGTTAA 900 

ACGGTCAAAT GATTGATGCA AGTTCAGATT CACCATTTTA TGGCATATTT GTCGaGATCA 960

ATTAGGAGCT AGAATGGCAT TACTAAAAAA AGGTGATGTC GAAGAAATCT ACTTCCCAGA 1020 

TTTTGAAGAT TATATATTAT TATGGAAGTT AGAAGTATTA CCAAAATATC AAAACAGAGG 1080

GTAC6CTTCA GAATTGATTG ATTTTGCAAA GAGTTTCAAT ATGCCAATTA AAGCCATTGG 1140

CAGAAATGAT TCTAAGGATT TCTTTTTACA TCATGGATTT ACAGATGTGG AAGCTAAAAA 1200 

TATAGAGGGA CATGATGTCT TATTGTGGAA ACCATAAGAT AATAATATTC GACACTACGA 1260 

GCATGAAAAT GCATCTTTTC GTAGTGTCTT TTTTACAATT ACTTTCTTAA GCTAATATAA 1320 

GTaAATCATT TTCAAATTAT TTGTCTTAAC GTACAATATC ATTTAGTTGT TTCCATGrAT 1380 

TAATTTCATA ATCAGGTATA ATTCCTGGAT TATGATCAAA TCCTCTAAAa TTAAACCAGC 1440 

AAGTAGCTAT ACCcGCATTG ATTCCACCTA GAATGTCAGA TGTTAnAGAA TCTCCaACTA 1500 

TAATCGAGTG CTGtCTTTCA TCCTCACCAA TATCATTAAA AACATAATTA AAAAATTCCG 1560 

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10 



AOGGCGTCTG ATTTAACCTT CTCTTTTGCG TTTCGGTTAC ACCATTAGTA ACAATATATA 1680

AATCATGTCT TTTCGATAAT TCGACAATTG TTTCTAAT6T TTGATCAAAG TATTTAACTT 1740

TAGCTTCTGC TAATCCATTT CTAAATAACA CATCTGCACG ATGCCCATCA ACTTCCATTT 1800 

GATGATGTTT GAAGTAATTC ACAAATCGTT CTGATAATAC TTCAGACTTC GkTAATTTAT 1860 

TTTGkTGAAA AGCTTCCCAA TGTTGGTGaT TGaTTTTTTT AAATGkTAAA AAATCATCCy 1920 

TTGTTGCTTT ATGATTAAAA ACATTCGCCA TATAGTGAAn CGCCCATTCT TCTGcATCAT 1980 

AAAAATCAAC AATTGTATCA TCAAAGTCTA TCAAAATATT TTTATATCCC AATTTCCCCA 2040

TCTCCTATAT TGTCTATGTA TCTAAATCTT AACAGAGGCT CAAATTTCTG CAAATAAAAT 2100 

AAACTGAGTG CATAACATTA AAGTATGCTC ACCCAGTTTA TTTTAAAGAA TATTAGTTAT 2160 

TATATTAGAA TCCAAATAAT TTACCTAGTA AACCCACACC GTTAGCAACG ATGTCTACXyV 2220 

TACTTGTGCC TAATTTCACA CTATCATGTT GTTGTGCAGC TTGCACAGTA TTTGCGATTG 2280 

CTTCTGCTAG TCCAGTCATT TAAATCTCTC CCTCACCTTT GAAATAATAC TGATTACTTA 2340 

CATAACATAT TGAAATTAGA ATCCGAATAA TTTACCTAAT AAACCTACGC CATTTTCAAC 2400 

GATGCTCACA ATGCTTGTGC CTAATTTTGC GCCATCATTA TTAATT6CTG CAGTTACGGT 2460

ATCTTTAATT GCGTTAAATA AACCTTCCAT TGAAAACACT CCTTAAAATT TAAATTTGAA 2520 

GATAACAAAA ACGTGCGTAg yTTTTAAATC ACCGAAATGT TATTCGCTTA ACGTTTTGTT 2580 

GTTGTTATTT TAAAATAAAT TTGATGCAAT TAGTTTGTTT ATCCGCACAA CATCTTATAA 2640

TGTACTTAAC TGTATTTTAA AGAGAAAAGA AATACAGTTA GGCATTCAAA ACTGTATTTA 2700 

ACACAATTAA GTTGCCTGAA TTCGTATTTA AGTCTTATT6 AACCTTTTTA GATAAATAGC 2760 

TCTATAATAG TGAAAAATAT AAACATTTTT TATTTACAAG GTATTGCTAA TTTAAGTTCA 2820 

TTTAGATATA ATAATTCTTG TGTTGTTAAA CGTGTCCTGG TAGCTCAGCT GGATAGAGCA 2880 

ATGGCCTTCT AAGCCATCGG TCGGGGGTTC GAATCCCTCC CAGGACGTTT ATAGGTATTT 2940 

TTATACGCAT TACCAAACAA AAGA6TTCCG TGATTACGGG GCTCTTTTTG TTTTGAATTT 3000 

CAGTAATATA GTATGAT6CG TCACCAAAAC GTCCCCCGCA TAAGCCCCGA AAATACAGTA 3060 

ATTAAAACAA GCATGCTTAT TCGTTATAGA ATTTTTTGAC ACACAATTGA CACGCX5TCTG 3120 

ACACTTGTTT ATACATTTTT AATTAAGTAA TTTTGTGCTC AAATTTCATC TATACTGCAC 3180 

CTGAACTACA CCAACACTAC ACCAAGATTT TTAACACTCA CCATTTGCAT GCGTAGAGAT 3240 

TTTTATTATT ATATTATTCC TATAGATTTT GATACTATTC AAAATTTTAG GGACTTTTCA 3300 

GGGGCCCGAA ATCCTATAAT TATAATTATA TACATCTAAA AAAAATAACC ACGTCCATCG 3360 



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ATACTATTGG CAAATTTATA AAGTAGTTCA GCGTTTTTCA ATGACATATT GTCTAATGAT 3480

CTTTCATTTT TTCTCATTCT GTGTATTGTG CTTTGTGGAA CTCCTGTTTG TTTCGATATA 3540

TGTAAACTGC TCAAATCACT GTCTAATAGT TTTTGAATTT GATTTCTCAT TTTGTACACC 3600

CCCTTGTAAA TCTTCAATCA TCATTTAAAT TAAATAATTA CTTTTCCACA TATTCCAATA 3660

TTTAGGTTGC AAAGCATACC TCAAATATCA TTAATTTTGA GATTTAAATG TCAATTATGT 3720

GTCTATCAAT CCAATATACA TACTCTAATA ACGTAATAGT ACACACTCTT CTTATTAATT 3780

GGCCATAGCT ATCATGATAT AATTAGTGAA GAAAATCACA TAAGAAAGGT TGTAAATCAT 3840

GAGACTTCAA AAAGCACCTC TAGTAACGTC AGGACTAGTC TTAGGATTAT TAGGCCTGGG 3900

TAATCTATTA AAAGACTTAT CTCTTACTTT AAACGCTGTT TGCGGAATCT TTGCTTTCTT 3960

GATTTCGATT CACCTTTTAT GTACTATGAT CAAATATTTT AATAATGTGA AAGAACAATT 4020

AAACAGTCCT CTAGTTTCAT CAGTGTTCAC AACATTTTTC TTTTAGGTAC 4080

TACTTATTTA AATACATTTT TTAGTAACAT AACTTTTATC AATAGCTTAA TAACGCCTAT 4140

TTGGATTTTA TGCCTTGTGG GAATTAT6AC OCATATGATT ATTTTTTCAA TAAAATATTT 4200

AAAAGATTTT TCACTTGAAA ATGTTTATCC TTCGTGGACT GTACTTTTTA TTGGTATTGn 4260

TATCGCAGGA TTGACGGCAC CCGTTAGCGG ATATTTTTTC ATAGGTCAAT TAACA6TAAT 4320

ATATGGCTTT GTAGCTACTT GTATTGTCTT ACCTATAGTT TTCAAGCGAT TAAAAGCATT 4380

TCCATTGCAG ACGTCAATCA AACCGAACAC ATCGACAATT TGTGCACCAT TTTCTTTAGy 4440

CGCTGCAGCA TATGTTATAG CTTTTCCTAA GGCGAATGCT TTTATCGTAA TTATATTTTT 4500

ACTATTAGCT CAAATATTTT ATTTTTATAT CATTATACAA TTGCCTAAAT TACTAAAAGA 4560

ACCTTTTTCG CCCGTATTTT CAGCTTTCAC ATTCCCTTTA GTAATCTCAG CAACTGCTTT 4620

AAAGAACAGT TTGCCTGTAC TTATGTlcTCC AGACATTtGG AAAGGkCTTT TGTTTATG6A 4680

AGTGTTATTA GCCACTGTAA TAGTACTTAG AGTCTTTATA GGATATCTTC 4740

AAAAAAGGAA AAACAAGATA AATTTCTxiCG TAATGCGTCT CAGTAACACT ATTACCAAGA 4800

ATTAACACGT ATATTTAATA 4820

(2) INFORMATION FOR SEQ ID NO: 454: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4353 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear
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ATTTGGATCT TTAATATCAC CAATATTTTT AATATCTTCC GGATTCAATC CATATACTTG 60 

TACTGTATCT GAGTATTTAA TTGTGAAATA ATCACCTGAT TTAACTTTGT CATCAACTGT 120 

AATTTGTGAT TTTAATGATA AATAATCTTG GGCTGGTACG ATTTTATTGT TTTTATCTGC 180

ATCAACGACA GTtAATGTTG TATTTGATGT GATTAAATCA TTAACATTTT TAGCCTCTGT 240 

TGATGATGGC TGTACTGCTG CTATACGCAT TCTTGTATTC AAACGTTTAG GTGCTGTACT 300 

TTTTGGCAAA ATGATATCTG CATTATTTTC ATTATTTGAA TTACTATTGT TATCAACAAG 360 

AGTTTCATCA TTACTCTTGA TAGCATCACT TTTAACATTT AATGTAGTTG ATTCAGTTTT 420 

GGCATCTACC TTTTTGTTTT CCTCATTAGT TGGTTGAACA TTTACCACTG ATTTATTCTC 480 

TTGCAAATCA GGTTGTAACG CTTCTTGATT ACTTATAGTT TGTTTAGTGT TTAAATCTTC 540 

ATTCGTAGAT TTTGGTGAA6 CTTGCTCATC TGATTTGGCA GTTGAAACTT CAACTTTATT 600 

TCCAGTGGTA GATTGTACAC TTTCTTTTTC TATTAATTTA TTCCCATTTG AAGTCGTTTC 660

ATTACCTTGa GATGATACCA TTTCTTTTTG ATTATCATTT TTAGTATTGT CTTCTTGATT 720 

TAGTTGCTGC ATATCAACTT TATCACTCGA TTGATTATCA CTTGCTGAM TTGTCGCTcG 780 

TTCAATTCTT TATTAGTACT TTCTGCAGCC TTTGCTTCTT GGTTCCCCAG ACCAAAAATT 840

AATGTTSTAC CTACTAAAAT TGATGCTGTT CCCACTGTGT ACTTTCTAAT CGAAAATTTA 900 

TTTAATCGAT TGGATACCAT GCCTTTCCTT GTTATTGCCG TTTTATTTTC TCTGTTTAGC 960 

ATTAGATTAC TCCTAATTCA TCAAATTTTT AAATAATACA ATTGTTTTAA ATACAAAAAT 1020 

GTATATCAAT ATAGTATTAC ATTTTTAGAT AAAGCACAAT ACTTTAATTA TTTTTCTTTA 1080 

TCGTAAAACG TTATTTAACA TTTGTQTTTA AATAAAAGTT TTTATGAGTt TTGTAATCTT 1140 

TATTTAATCA TCATAAAAAA TAGTATTATT TGCCCTTGAA ATTAATATCT TAGCTTTTCT 1200 

AATTCATAGA CAATTACATT TCTGTAACAA ATTAAATTGT ATCTATTCCT TAAAGATTTT 1260 

TTGTTTTATA TCTGGGAATT TCTAAACAGA AAAAACCAGG CCACATGGAC CTGGTTAAGT 1320 

TAATCATATT ATTTATTTTG TTTTTTACGA CGACCGAATA ACAATAATGA TCCTAATGCC 1380 

GCGAATAATC CACCGAATAA TGTGCCATTA TTTGAATTAT TATTTTCACT ACCTGTTTCT 1440 

GGTAATGCTT TAGCTGTTTT ATGCTGATCT TTAACCGTAC TCATTGGTTT AGCCGGA6TA 1500

TGTTTACCTG CATCTGAATC TGAATCGCTA TCTGAATCTG AGTCGTTGTC TGAGTCCGAA 1560 

TCGCTATCTG AATCTGAGTC GCTGTCTGAA TCTGAATCGC TATCCGAGTC TGAGTCGCTA 1620 

TCTGAGTCTG AGTCGCTATC TGAATCTGAA TCGCTGTCTG AGTCTGAATC GCTATCTGAG 1680 

TCTGAATCGC TGTCCGAATC TGAGTCGCTA TCTGAATCTG AATCGCTATC TQAATCTGAG 1740 

TCTGAATCTG AGTCGCTGTC TGAATCTGAA TCACTGTCTG AGTCTGAGTC GCTGTCTGAG 1860

TCTGAATCX5C TGTCAGAATC TGAGTCGCTA TCTGAGTCTG AATCTGAATC ACTGTCTGAG 1920

TCCGAATCGC TATCTGAATC TGAATCGCTA TCTGAGTCTG AGTCGCTATC CGAATCTGAG 1980

TCGCTATCTG AGTCTGAGTC GCTATCCGAG TCTGAATCGC TGTCTGAGTC TGAGTCGCTG 2040

TCTGAATCTG AATCGCTATC TGAGTCTGAG TCGCTGTCTG AATCGCTGTC TGAATCTGAG 2100

TCGCTATCTG AATCTGAGTC GCTATCTGAG TCTGAATCGC TGTCAGAATC TGAGTCGCTA 2160

TCTGATGTTT CTTCTTCGTA GTAGCCATTA TCAAGTGTGA AATCATCATG ATCCGTAATT 2220

GTTACATCAA CTTCGCCACC ATCGGCATCT TTATCATCTT CAGTTGTATT TGTAGCTGTT 2280

TGAGTTAAGC CAGCAGGTTT TTCAAAGATA ACTTTGTATT TACCACTATC TAAATTATCA 2340

AAGCGGTATT TACCATTTTC ATCTGTyTCA GTTGTACCAA TTACTTCGCC TTTTTCGTTT 2400

TGCAAAGTAA CTTTAACACC TTTAATTCCT TTTTCAGTCG AATCTTGTTT ACCATCTTTA 2460

TTACTGTCGT ACCAAACATA ATCACCTAAA CTATATTTTG GTGTTTTGTA GAATCCACTA 2520

TCTAATGTCA TGTTGTCAGC GTCTTTAATG ACACCTGTTG TAGTTAGTCC ATCAGAATCT 2580

ACAGCATCAT CTGTACCTAC ATTTGCAGTT GTCGGTGTAT AACCGGCTGG TGTTGAAAAC 2640

TCTACACTAT AAGTTCCATT GCTTAAACCA GTGAACTGAT ATTTACCATT TTCATCTGTT 2700

GTCGTACGAT CTAATTCTTT ACCGTTACTA TCTTTAAGAA TGACATAAAC ACCTTTAATC 2760

CCTTTTTCAT TGGCATCTTG TTTACCATCT TTATTTGTAT CTTCCCATAC ATAGTCACCT 2820

AGATTATATT TCTTTTGGTC GCCATTAGCA GTTGATGAGC CATTCACATT TGAATAACTA 2880

TTTQACCAAC TATATTTAGT TTTGTCAGTG TCTAAAGTAT AATCAATTTT TCCATTATCT 2940

GTTGAACTAT TATCTGGATA AGCAACTTGT TGAATGATGT ATTGTTTATT GCTGCTTGTT 3000

TGGCCTTTCA TTAAATCGAC TGTAGCTGTT TTATTATCAT TACTATAAAT AACATCGAAT 3060

TGATCAGTAA CATCTTTAAG TTTTGAAGTA TCAGGGGT6A AACTATCCAC AAATTGATTT 3120

TGATCTGtCA CTtCGTAAAT TTTGAAGTTT TTTGCATTTG GATTAAATTT ATATCCAGTT 3180

AAATTAGTAA CAAACGTTTG TTTAGTATAT GTATTTTTAG GTTGATTTAC ATATGCAGTC 3240

ATATTACGCG ATAAATCTTC ATTGTTAATA TAGTTTGTAC TTGAAATAAG CGGTTGTGCT 3300

TTTTTATTAC CATAATCGAC AATGATTTCT TCQCTATATG TATCATTACC TAAAGTTACT 3360

TCCATTTTAT AAGCTGTTTT ATCAGTTGTT GCATTTTTAC GTTTCGCAAA TGCAACTTGT 3420

TCAAAGCTAC CTCTAACATT TGTATATTGA TCTACATAGT TCGTAAAAGT ATATGTTGTT 3480

GTGTTTGTT6 TACTATCATA AATACCTTTT GCAATAATAT TACCTTGGGC ATTATATAAA 3540

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GTAAATGTAT CGCCCTCTTT AACAGAATCA TCGATTGTGT AATTTGCTTT TAATTTTAAA 3660

ACATCACTTG AAGTTGCCCA AAATTCAGTT TTACCAGTAG TCTGATTAAC ATGTCCTTTA 3720

TCAATCX3CAA TGTCAATATT TGAAAAATGT ACTTTATCAT TAACATTTGT TCCTTGTTGT 3780

GGAGCTGCAA CAGTATTCAC TGCCATGCGA TTTAAAGTTC TTGGTTTAAT AGTCGTTGTT 3840

TTAGGTGTAG TTGAAACATC TTTTGCTTGT GTTAAATTAC TTTTATCAGT TTCATTACTA 3900

TATGTAGTTG ATGATTTATC ATTTGTTGTT ACATTGCTAG TTTTTGTAGT AGATTGATTA 3960

GCTGTAGCGT TTTGTGGTGA TTGCATGTTA CTACTAGTTT CTTTAACTGT TGCACTATCA 4020

CTCATTGTCTl CTTTAGGCTG ATCTGCAGTT GCAGTTTGCG TATTGTCTTT TA6TTGACGA 4080

CTATCAACTT TTTTAGTTGT TTTATTCTCA CTTGGGGCTO TCGTTTCATT TTTTGATTGA 4140

TTTAATTCTC CATTCGTATG TTCTGCCGCT TTAGCTTCAT GACCACTTAA CCCAAAAATC 4200

AATGTTGTCC CTACTAAAAT TGAAGCAGTA CCTACAGAAT ACTTTCTTAT CX3AAAATTTG 4260

TTTAATCGAT TTGGTATCAT GCCTTTnCTA TTTGTnGCTG TCTTTTTATA ATTCATTTAA 4320

TAATACTCCT TTAAAATATC AAAATTTGAT AAATATAA 4358

(2) INFORMATION FOR SEQ ID NO: 455:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1060 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 455:

TTGACTtCTT AATTCAGCAT TTTCTGCACT TAATGCTTTG TTCTTTTTAA TAAGTTGCTT 60

TCTTGCATAA ACTTCGGTAT CTATTTTACT ATTACTATAC CTTTGATTTA AAACTAATAT 120

ACCAATTAAT GCTACAATGA TAATGATAAG TACAACATAA AAAGACATTT TTTCACCAAT 180

CCTTTTTGAC TTCTTTAACT TTGTATACAA TAATAATTAA TAAAGATTAA TTGTTATTCA 240

ATTTCCCACA TTTTTATTAG TTGATTTTAG TTCATCATTG TTATAATCAA ATTATAAACT 300

GACAGATATT GATGTTCAAT GAATATGACG TGAAAGATTC GTGAATTCAA GTTTATGTCG 360

AATTTATGTT ATAACGGTCA TTFAAATGAC AOAATTAGGT CACTCATAGT ATTTTGAAGA 420

TTGAATTCAT TAATTTTAAA ATGTATAATG ATATTTGTGA AAGCGCTTGC TTAGGAGGTG 480

TATTTGAGAG TGAATGAAAT GAATGCTAAA GAACAATTAG TGGACAATTT AATGAAAACA 540

TCATCGCAAT TATTTAAATT TCACGGTGAA GTTGCCATGC AGCTTTTCTT AAATGATGAA 600

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AAAGTTATTC CGCAATCATA TGCGTTACTA TACATAGATA AGCAAGATCA AGCAATAGCT 720 

AAAGAAGATT TATCACTTTC AAAAATTGCA AAAGTTTATG TGCAATATGA TGATACAACA 780 

ATAATGAGTA TTTTCGTTTA TGATGTAGTA AACGATGAAT GGATTTTTAG ATTGGATCCX5 840

AATATACGTA TACCTAAGAG TAACATATAC TTCCATAGTT TAAATTGGGA TGTGGATATA 900 

TTAAACCGGA GtCGTCTAAT GTATGTCTAA TGCaCACCAT CAGaTCATCA TtATCCATTA 960 

TAAcGrGCAT AGTGCATAAn yACTwCAtTT TaTTaaATTG AGaGGgGCAC GATAGGTGCA 1020 

TCAGGACATA ATATAGGAAG CATCAACGCG TGAnCAGGTC 1060 



(2) INFORMATION FOR SEQ ID NO: 456: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1262 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 456:

ATAATnACTA AATACnAAAG TTTAACTGTC TTACTAATAA TGACTATGTT ATAATTTTAA 60

AAGTGATATT TTGGGTAATC GCTATATTAT ATAGAGGAAA GTCCATGCTC ACACAGTCTG 120

AGATGATTGT AGTGTTCGTG CTTGATGAAA CAATAAATCA AGGCATTAAT TTGACGGCAA 180

TGAAATATCC TAAGTCTTTC GATATGGATA GAGTAATTTG AAAGTGCCAC AGTGACGTAG 240

CrnTATAGA AATATAAAAG GTGGAACGCG GTAAACCCCT CGAGTGAGCA ATCCAAATTT 300

GGTAGGAGCA CTTGTTTAAC GGAATTCAAC GTATAAACGA GACACACTTC GCGAAATGAA 360

GTGGTGTAGA CAGATGGTTA TCACCTGAGT ACCAGTGTGA CTAGTGCACG TGATGAGTAC 420

GATGGAACAG AACATGGCTT ATAGAAATAT CACTACTAGT TTAGCTCTCC TAGATGATGG 480

AGAGCTTTTT TCATGAAAAG AACACTTAAA ATTAACGCCy TGTCTTGaTA tAATGACaCT 540

GCcTTGTTTT AAAATAGTAA GCGGATGCgT TAATGTATCA GCGATTAAAT TTGTTGGAAA 600

TGTATAAAAA ACACAAGCTA AGAATAAAAT ACCTGTATAA AAGGAGAATC ATATATGTTT 660

CAATTACTTG CAGTTTGTCC GATGGGATTA GAAGCTGTTG TTGCTAGGGA AATTCAAGAA 720

TTAGGCTATG AAACAAATGT TGAAAATGGT CGTATATTTT TTGAaGGAGA CGCAAGTGCA 780

ATTGTAAAGG CAAATTTATG GTTGCGCACA GCAGACCGAA TCAAAaTTGT TGTTGGACGT 840

TTTAACGCAA CAACGTTTGA CGAATTATTC GAACAAACCA AAGCGCTCCC TTGGGAATCT 900

ATAATTGATA AAGAGGGTAA CTTCCCAGTT CAAGGTAGAA GCGTTAAATC AACACTACAT 960

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10 



TATAACGAAA AAGGTTGGTT AAATGAATCA GGTGCCAAAT ACCCTGTTGA AGTTGCCATT 1080 

TTAAAAGATA ATGTATTATT GACTATCGAC ACATCAGGTT CTGGTTTGAA CAGACGTGGT 1140 

TATAGATTAG CACAAGGTGA AGCACCAATT AAAGAAACX3T TGGCAGCAAG TTTAATCCGT 1200 

CTTGCCAAAC TGGGAAAGGT GATTACACCT TTnAATTGGT CCCATTTGCG GTTCnGGTTA 1260 
CA 1262

(2) INFORMATION FOR SEQ ID NO: 457:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1142 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 457:

CCTGGCTGCT TTATCAGCAT CTACTACTTT AAAACCGAAT ACGGATAAGA GTTCTGATAC 60

TGTTGATTTT CCTGAGGCGA TTCCACCTGT TAGACCAATA ACTTTCGGCA TAATTTCACT 120

CTTTCTTTAT TTTTGACATA CTG6ACAATA ATGACTATTT CTTGTCGCGA TGATTTTTGT 180

TTCAATTTGA CTTCCACACA CTTTGCATAC CGGCTGCTTA TATACATTAA GATGCAATTG 240

CATCTCACCA GTTTTTCCAT CAGCATGACG ATAATCTGAA ATACTTGTAC CGCCATATTT 300

AATACCTTCT TCTAGTACTT CTCTAACATA ATAAAAAACC ATTTCTTGTT GTTGGTGTGT 360

TAAGTCTTTT ACTTTTTTAT CTGGTAAAAC ACCTGCACGA AACAACGCTT CACATGCGTA 420

AATATTTCCA CAACCTGCGA TTACTTTATG ATCCAAAATC ACTTGTTTGA TTGGTTTATT 480

CTTATTAGAC TGTTGATGAA TTCGATTTAA ATAATACGTC AATGCTTCAT TTGAAAAAGG 540

TTCAGGCX5CT ATTTCTAAAA ATGAAGGATA AGATGCTACA GACGCAACAT TTCTAATTTC 600

TCCAAAACGA CGTATATCTG AATAAATTAA CTTTTTGTCA TTTGACAACT CAAAAATAAC 660

ATGCCAATGC TTACGATAAT TAGGTATCAT AATATCTTCA AGTTCATCTA CAATGAAAAA 720

ACCGCCCGCC ATACCTAAAT GACTAATTAA TGTACGTTGT TCTCGTTTAT TATCTAGCTG 780

AAAAACGATA TATTTACTTC TTCGTTCTAC ATTTGTAATG GTATAGCCTT CCGATAAAGT 840

TTTAAAAGTA TCTAATTCAA TTCCTTTTAT AATTGTTTCC TTGCCTTGAG CTTTACCTTC 900

GATTACTTTA TCCGAAAATA TAACGTGTTC AATTTTTTGA TTTATAACGT AGGGTTCAAT 960

TCCTCTTTTT ACATGTTCTA CTTCTGGTAA TTCGGGCATA CCATTAACCT CACTTTATTT 1020

TGCATCATAC CAGGTTGCAC CATAACTTGA GTCTACTTTT AATGGAACAT CTArTTGCAA 1080

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IS 



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TT 1142 
(2) INFORMATION FOR SEQ ID NO: 458: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1814 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 458: 

CCTTTAGTAA ACAATCCTTC TTTAGTTTTA GTACGTTGTT CCAATCCAAA TAATTTGTAT 60 

TTCATTGCCT CGCCCGATTG AGTGCCGCTA AAGTTATCAT CTTTCATGTT AGGCGTGTTG 120 

GTAAACATGT GTATATCACT GITTAAACGG TCTTTATAAG CTTCGGTACC TTGTACATCG 180 

TATTGCTTAT AAATATAACC ACCATCAACA GAGCCTTCTG TTTCTCTACC TTCGCTATCA 240 

GCATAAACAG TCGGTTCTAA AAACAACACG TTAGCTTCCT TTTGTTTTCT AACTTCTACA 300 

GGATCTAAAT TTAAATTACC TTTAATAAGT AACATAGCGT CATTTAAATC ACTCATATAG 360 

TTAGCAsymy CTGATTCAGC ATTATCATAC AAATCAATTA AA6TGATTAC TTTCTCATAA 420

TCCCCTTTTC TTCTTTCGTT GTTGCTAAAT TCTGTAATAG GCATACGTTC GAAAGAGTGT 480 

GATTCAAAAC CGTTTTCACG TGGTGTGAGC TTCAATCCAT TTGTTCTACT GGTAAGATAT 540 

CTATAAACAC CGTGTGAAGT GAATAAATCA ACTGTAAACA CTTCATCTTC GTCAGTCTTG 600

TCTATTGGTT TAGITCITAA ATATCTAACG CCTGCGATAC TATTACGTTC AATTGTATTG 660 

TOGTATATGA CAAAAGTACT CATTGCATCA CTCTTGTATA AACGCGTTTC ATCATCTTGG 720 

TTTCTAATCA TTAACTCATA AGCTTTGCCA TAAATTGACA AATCTAATCC TAAAGATCTA 780 

TTOTGTGACT CAACATCATT TAAATCATTG AACGCCTCAA TAACTTCTAA TACATCTTTG 840 

TCATCATCTT GATATTGAAT TGGATTACCC AAGAAATAGC CGTTGATAAA ATCGCTAATA 900 

TAAGATGCGT AATCATGCGC TACACGGTTA TCTGCCATGT ACTCTTCTTT GCGTCGTGTT 960 

AACTCAACTA AGTTCTTAGT TTTACCTTCG TAATAATCAC TTAACACTTT CAATCTAGGT 1020 

CGTTGGTAAT CCATGTGATG TTCAATGTAT TTACTTACTT CATTAACGTT TTGTAATAAA 1080 

TCGGATTCCG TCCCGTCATA TGTGTAAACA ACATTGGCTT CATCATTAAA TAAGTAATTT 1140 

ATGTTTCCCC GTAGATCTGT ATCTGTTTCA AATTCGTTTA CTTTTAACAT TTGTTCCCTC 1200 

CTATAATCCT AGAGATTTTA TTGTGTCAAC TTTCGAACTG AGATTTGTGC GTTTtCTAAC 1260

CGGTCTGTAG AATCGTTCCA CTGAATAACG CAACGAATCG ATACAATGAT TGTATGTATC 1320 
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CTCTTCAATA GTCTTGAAAC AACGTTCATC AACAATGATT TCAAATTGCA TTAAGAATTG 1440 

TAACCCTTCT ACAACCGAGC CCTTCCCTTT TTTGGTTGGT AAAATCCTTT TAAGCCCTAG 1500 

ATTCCTTAAT TCAGCTATAC TTTTTTGTTC TGCACTATCT GCTGTAATTT CTTCTTTAGC 1560 

ATAACCAAGT TGCTTTATGA CATTAGCTAT TTCATCATTC AGCATACCTT GTTTAACATA 1620 

CTCTTCAATG ATGTATAACT TCTTTTTCTT TACATCTATT TTAGAATGTA TAAAAGCACT 1680 

AGGATCATTA ACGTAGCCAA AGTCCAATCC AAAATAAGAA GGTAAATGTC TTAACTCATC 1740 

TTTATTTATT AAACGTTTTT CATACTTAGG GAAAACCAAT TTGTCTAGTG TAGCAAATTC 1800 

ACCTAACGCA TAAA 1814
(2) INFORMATION FOR SEQ ID NO: 459: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 686 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 459:

AATTnAGATT ATTACCCTCC TTTAAAAATA TTTGTTTCAC AATTTTTTAT TACCTATTTA 60 

CTGGTTTCAT GTCTTATGGG GCATTTTACT TGTTGGTATT TGAAAATGTG CAAAATTTAA 120 

TCTTATATGT TTCTTGGCTT TTCATGACTA TGCTATTTAT GTTTATGAAT ATGCATTCAA 180 

TTATAGATAA AAAAGTACAT ATATTCTTAA AGTCTAATAA ATAGTTACAA ATTTAGTTAG 240 

TTTTCAATTG TTAATTAGGG GTGGTAAACA GTGCTTTGTO AATCTAOACA AATTTATAAA 300 

AATCCTAAAT ATCGAGTTAT TAGATATAAT AATGAATATT TCATGGTCGA TTTAGTAAGT 360 

ACTTGGATTA CTTATTTTTT CCCTATGATT AATTGGTTTT TGCCCAAAAA ATACGCAAAA 420 

ATTAGCGAAA ATGAATTTOA AAOGTTAAAT ATAGTCGAGC CTGTTAAAAA TAATGTTTTT 480 

TGGCCGGTTG CAGGAAGTTC AGTTCTATTT GGAATTATAT TGAGAAAGTA CGGTAACTTC 540 

TTTAATGTTC AGTTTGAAAA ACAACTAGCA ATCACTGTAT TTTTTATCAT GTTAATAGGG 600 

ATGTTAATTT TTTATTTTTA TCTAT^TAAA AAATTAACAT TAAAAATTTT TAATACCAAC 660 

GTGGGTAATA AGAATAGGAG TTGTAT 686 
(2) INFORMATION FOR SEQ ID NO: 460: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1300 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 460: 



ATCTGCAATT ATGGGCACAC CCAAGCTnAT GCAAGTAACT AAAGGAGAAG TACTTTTAGA 60 

CGGTGTAAAT ATTTTAGAaT TAGAAGTTGA TGAAAGAGCA AAAGCAGGAT TATTCTTGGC 120 

AATGCAATAT CCATCAGAAA TTACAGGTGT TACAAATGCT GATTTCATGC GTTCAGCAAT 180 

CAATGCGAAA CGTGAAGAAG GACAAGAAAT CAACTTAATG CAATTTATTA AGAAATTAGA 240 

TAAAAACATG GATTTTCTAG ACATAGATAA AGACATGGCA CAACGTTATT TAAATGAAGG 300 

TTTCTCAGGT GGASAGAAGA AACGTAACGA AATCTTACAA TTAATGATGT TAGAACCTAA 360 

GTTTGCaATC TTAGATGAAA TCGATTCAGG GTTAGACATC GATGCATTAA AAGTTGTATC 420 

TAAAGGTATT AACCAAATGC GTGGGGAAAA CTTTGGTGCA TTAATGATTA CACACTATCA 480 

ACGATTATTA AATTACATTA CTCCTGATAA AGTACATGTA ATGTATGCTG GTAAAGTCGT 540 

TAAATCTGGT GGTCCAGAAT TAGCAAAAC6 TCTTGAAGAA GAAGGATATG AATGGGTTAA 600 

AGAAGA6TTC GGTTCAGCTG AATAATCTTA TTAATACAGT ATCCATGAGA TGTTCATCTA 660 

TATATGATGA AAATGAACAT TTATACGAAA TAGTAAATTT CATCAAGTAO GAGGAAAAAG 720 

TTATGACAAC TGATATTTTG rACaTTyCTG AAGAACAACT TGTTGATTAT TCTAAAGCCC 780 

ACAATGAACC TTCTTGGATG ACAGAATTAC GTAAAAAAGC TTTGAAATTA ACAGAAACTT 840 

TAGAAATGCC AAAACCTGAT AAAACAAAAT TAAGAAAATG GGATTTTGAT TCTTTTAAAC 900 

AACACGATGT AAAAGGTGAT GTTTATCAAT CTTTATCACA ATTACCTGAG TCAGTAAGAG 960 

AAATTATTGA CGTAGATCAT TCTAAAAACT TAGTAATTCA ACATAATAAT ACGATTGCGT 1020 

ACACACAAGT TGATGATAAT GCATCGAAAG ATGGCGTTAT CGTTGAAG6T TTAGCAGACG 1080 

CTCTTATGAA CCATAGTGAT TTAGTACAAA AGTACTTTAT GAAAGATGCA GTAACAGTAG 1140 

ATGAACATCG TATCACAGCG CTACACACGG CATTAGTTAA TGGTGGCGTA TTTGTTTATG 1200 

TTCCTAAAAA TGTAGTTGTA GAACATCCAG TACAATACGT TGTGTTGCAC GACGACGAAA 1260 

ATGCAAGCTT TTATAACCAT GTTATCATCG TTACTGAAGA 1300 



(2) INFORMATION FOR SEQ ID NO: 461: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3135 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



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GACAGCAAAT TCAAGATACA TTAAATAAAG ATATTGTCAT AAAGCATATT CTTGTTCGAG 60 

ATAAATCTAA AAAGAC3ACCG CTAAATATTA GCCAATATCA TTTAACTGAA GATGTTAATG 120 

AAATTTTAAA TGATGATTCA TTAGATATTA TCGTTGAAGT CATGGGAGGA ATTGAACCAA 180 

CTGTAGATTG GTTAAGAACA GCACTTAAAA ATAAAAAACA TGTTATTACC GCAAATAAAG 240 

ATTTATTAGC AGTACATCTT AAACTTTTAG AAGATTTAGC AGAAGAAAAT GGTGTAGCTT 300 

TAAAGTTTGA AGCGAGTGTA GCAGGTGGTA TTCCGATCGT AAATGCCATA AATAATGGTT 360 

TGAATGCGAA TAATATTTCA AAATTTATGG GAATTTTAAA TGGTACCTCT AATTTTATTT 420 

TATCTAAAAT GACTAAAGAG CAAACGACAT TTGAGGAAGC ACTTGATGAA GCGAAAAGAC 480 

TTGGnTTGC TGAAGCGGAT CCAACTGATG ATGTAGAAGG GGTAGATGCA GCGCGTAAAG 540 

TTGTCATTAC ATCATATTTA TCATTTAACC AAGTCATTAA ATTAAACGAC GTTAAACGAA 600 

GAGGAATTAG TGGCGTAACT TTAACTGATA TTAATGTAGC CGATCAACTG GGGTATAAAA 660 

TTAAATTGAT TGGTAAGGGA ATATATGAAA ATGGCAAAGT TAATGCATCG GTAGAACCAA 720 

C6TTAATTGA TAAAAAGCAT CAATTAGCAG CTGTAGAGGA TGAATATAAC GCGATTTATG 780 

TTATTGGTGA TGCCGTTGGT GACACGATGT TTTATGGAAA AGGAGCAGGC AGTTTAGCAA 840 

CAGGTAGTGC CGTTGTCAGT GATTTATTGA ATGTAGCATT ATTCTTTGAA TCAGATTTAC 900 

ACACATTGCC ACCACATTTT GAATTAAAGA CAGATAAAAC ACGGGAAAT6 ATGGATTCAG 960 

ATGCAGAAAT TAATATTAAA GAAAAATCCA ATTTCTTTGT AGTAGTGAAT CATGTCAAAG 1020 

GTTCAATTGA AAATTTTGAA AATGAGTTAA AGGCAATATT ACCATTTCAC CGATCATTAA 1080 

GAGTTGCAAA TTACGATAAT CAATCATATG CCGCTGTTAT AGTTGGATTG 6AATCATCAC 1140 

CGGAAGAATT AATCACTAAG CATGGATACG AATTGACAAA GTATACCCAG TAGAAGGAGT 1200 

TTAATTATAA TGAGAAGATG GCAAGGATTA GTAGAAGAGT TTAAAGCACA TTTACCAGTA 1260 

AATGAAAATA CACCAAAATT AACATTGAAC GAGGGAAATA CACCACTCAT TCATTGTGAA 1320 

AATATGTCTA AAATACTAGG CATAGATTTA TATGTGaAGT ATGAAGGTGC CAATCCGACA 1380 

GTTCATTTAA AGATCGCGGT ATGGTAATGG CTGTGACAAA AGCAAAAGAG CAAGGTAAGA 1440 

AAATTCTAAT ATGCGCTTCG ACTGGAAATA CATCAGCGTC TGCAGCAGCA TATGCAGCGA 1500 

GAGCAGGTTT AAAAGCTATC GTCGTAATAC CAGAAGGTAA AATTGCATTA GGTAAATTGT 1560 

CGCAAGCAGT AATGTATGGT GCAGAAATCG TTTCTATTGA AGGAAACTTT GATGAAGCTT 1620 

TAGAAATTGT AAAAGAAATT GCAAAAAGTG GCGAAATCGA GCTTGTAAAC TCTGTCAATC 1680 

CATTTAGAAT CGAAGGACAA AAGACAGGCT CATTTGAAAT TGTACAACAA TTAGACGGTG 1740 

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AAGGCTTTAA AGAATATCAT GAAGCTAAAG GATCACAATT GCCGAAAATG TTTGGCTTCC 1860 

AAGCTGAAGG CGCATCACCA ATTGTTCAAA ATAAAGTCAT TAAAAATCCT GAAACGATTG 1920 

CAACTGCTAT TCGAATTGGT AATCCTGCTA GTTGGGATAA GGCGACTAAT GCTCTTAAAG 1980 

AATCAAATGG ATTAATAGAT AGTGTTACTG ATGATGAAAT TCTAGAAGCA TATCAGTTAA 2040 

TGACAACTAA AGAAGGTGTC TTTAGTGAAC CAGCGAGTAA TGCTTCTATT GCAGGTTTAA 2100 

TTAAATTGCA TAGACAAGGT AAATTACCTC AAGGTAAAAA AGTAGTTGCT ATTTTAACTG 2160 

6TAATGGATT AAAAGATCCT GATACTGCTA TTTCACTACT AGATAATCCG ATAAAGCCAT 2220 

TGCCAAATGA TAAAGATAGC ATTATCGATT ATATTAAAGG AGCTTTATAA CATGTCGAAT 2280 

GTTTTGGAGT TAACAATTCC TGCATCAACA GCCAACCTTG GAGTTGGCTT TGATTCTATA 2340 

GOTATGGCTT TAGATAAATT TTTGCATCTG TCTGTAAAGG AAACATCAGG GACAAAATGG 2400 

GAATATATTT TCCATGATGA TGCATCTAAG CAATTGCCTA CTGACGAAAC AAACTTTATT 2460 

TATCATGTAG CACAACAAGT TGCTTCTAAA TATAGTGTTG ACTTGCCTAA TTTATGTATC 2520 

GAAATGAGAA GTGATATTCC ATTGGCAAGA GGGTTAGGTT CXSTCAGCTTC TGCTTTAGTA 2580 

GGAGCTATAT ATATCGCAAA TTATTTTGGT GATATCCAAC TGTCTAAACA TGAGGTATTA 2640 

CAATTAGCX3A CTGAAATCGA AGGACATCCT GATAATGTTG CX3CCGACCAT TTATGGTGGT 2700 

TTAATCGCTG GATATTATAA TGATGTCTCG AAAGAAACGT CaGTtGCACA TATCGACATA 2760 

CCAGACGTGG ATGTGATTGT AACGATACCA ACTTATGAAC TAAAAACAGA AGCATCAAGA 2820 

CGT6CTTTAC CACAAAAATT AACACATAGT GAAGCGGTTA AAAGTAGTGC AATTAGTAAT 2880 

ACAATGATTT tGgCATTAGC ACAGCACAAT TATGAATTAG CAGGTAAACT CATGCAACAA 2940 

GATGGCTTTC ATGAACCGTA TCGTCAGCAT TTAATTGCTG AATTTGATGA AGTGAAAACA 3000 

ATTGCTAGTC AACATAATGC CTATGCAACT GTAATTAGTG GTGCTGGACC AACTATTTTA 3060 

ATATTTAGTC GTAAAGAAAA TAGTGGGGAA TTGGTTCGCT CTTTAAATAG TCAGGTAGTA 3120 

TCATGCCATT CTGAA 3135 
(2) INFORMATION FOR SEQ ID NO: 462: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1209 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 462: 

AGnTCAATAT TTAGATCAAG CCGTTTTAAG TAATTACGAA CAAGTTTATA TCATTCATGG 120 

TAAAGGTACA GGTGCACTTC AAAAAGGTGT ACAACAACAT TTGAAAAAGC ATAAAAGTGT 180 

TAGTGACTTT AGAGGTGGTA TGCCAAGCGA AGGTGGATTT GGCGTTACCG TTGCAACACT 240 

AAAATAAATT ATAATTTGAT AAATTAAATA GCTGCAGTTA AAATAATGTA AAGCAACAAG 300 

AATACATTTC AAACATGTTA TTTGAAATAA GCATAAAAAT TGAGCAAATA GAAATACATG 360 

AAGCATGTTA TCTGATATAA TTTGAACATC ATAATAATAA TTAAGGAGGA TTGGCATTTA 420 

TGGCAATCGT AAAAGTAACA GATGCAGATT TTGATTCAAA AGTAGAATCT GGTGTACAAC 480 

TAGTAGATTT TTGGGCAACA TGGTGTGGTC CATGTAAAAT GATCGCTCCG GTATTAGAAG 540 

AATTAGCAGC TGACTATGAA GGTAAAGCTG ACATTTTAAA ATTAGATGTT GATGAAAATC 600 

CATCAACTGC AGCTAAATAT GAAGTGATGA GTATTCCAAC ATTAATCGTC TTTAAAGACG 660 

6TCAACCAGT TGATAAAGTT GTTGGTTTCC AACCAAAAGA AAACTTAGCT GAAGTTTTAG 720 

ATAAACATTT ATAAGTTACA ACCAATGACG ACTGGGGCAT TTCTTTAATG AATTGCTCCA 780 

6TTTTTGTTT GTGTTTTTAA TATAAAAAGT TGAATGATAA GTCATCATAT TGTTTACGAC 840 

TTGAGAATGG TGGGATTAAT AAATCTATGA ACGTTAAATG ATAATCTAGC ATGCTGATAG 900 

ATTTGTAGCA GTTGGTTTGA TAAAACCATG TTCAATATTA CATGATGTGC ATGAAAAGTC 960 

ATACTCGAAG ATGTTGATTA TTAACTAGAA TTAGTGGTGA TAAATTTGAA GCACTTTTGT 1020 

AGCATCATTC ATTTTAAAAT TAGAAGGGGG GATATTTTTG GAAGACTATA AGCAACGAAT 1080 

TAAAAATAAA TTAAATGTCG TACCTATGGA ACCAGGATGC TATTTAATGA AAGATC6TAA 1140 

TGATCAAGTG ATATAT6TTG GCAAAGCTAA AAAGCTAAGA AATCGATTGC GATCATATTT 1200 

TCACGGGTG 1209 
(2) INFORMATION FOR SEQ ID NO: 463: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2410 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 463: 

AGTTCAACAC GACGAATTTT ACCTGAGTTT GTTTTTGGTA AGTCGTCAAC GAATTCAATC 60 

TCTCTCGGAT ATTTATATGG TGCAACTTCA TTTTTAACAA ATTGTTGTAG TTCTTTAACT 120 

AACGTATCAT CACCCGCAGT ATGGTCCTGT AAAATAACGA ATGCTTTAAC AATATTTCCT 180 
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GCATCTTCAA CTTCAAAAGG 


CCCAATCGTA TAGCCTGAAC TAATAATAAT GTCATCTCGA 300 

CGTCCTTCAA ACCAGAAATA ACCATCATCA TCTACATGAG CTAAGTCACC AGTGATGTAG 360 

TATTtACCTG TTTGCGCTTT CGCCGTACGT tCTGGCTCTT tATAATACCC TTTGAAAAGT 420 

GCTGGCAAAT CAAGTGGTAC TGCAATATTC CCTTTCGTAT TAGCAGGTAC GCTATTCCCC 480 

TCATCATCTA CTAC7VGTGAC CGAACTACCC GGAATGCCTT TACCCATTGA TCCAATCCTC 540 

TGTGGTGTAT CTTTTAAAAA GCCTATAAGC AAGGTACTTT CAGTCTGGCC ATATCCATCT 600 

CTTACAGTTA AATTAAAGTA TTTCTTGAAT TGTTCAACTA CTTCTCGATT TAGTGGCTCA 660 

CCTGCAGAAA CGGCACTATG TAAATGCGTT AAGTCATAAT CATTTAAGTT CTGTAATTTA 720 

GCCATCATAC GATATTCTGT CGGTGTACAA CATAAAACAT TAATTTGATA TTTTTGAAGC 780 

AATTCTAAGT ATGTTTCAGG ACTGAACCTT CCATTAAATA CAAAAGCAGT TGCACCTGAA 840 

CCTAATACAG ATAAGAAAGG ACTCCATACC CATTTTTGCC AACCTGGTGC TGCTGTTGCC 900 

CAAACTAAGT CATCTTCATT aATACATaAC CAATGTTTTG GTGCCATTTG TaAATGTGcA 960 

AATCCCCaTC CATGACAATG TGTAACGGCT TTAGGATTGC CAGTTGTACC AGATGTATAT 1020 

GACAGAATCG CCATATCATC ACGCGTCGTA TCTGCCATTT CTAGTTTGTT ACTTGCGTTT 1080 

TCTTTTTCAG CTTCAAGTGA AATCCATCCA TCTTTTTGAC CX5GCAATAAC AAATTTAGTT 1140 

AACGCATCAT ATTCTTTAAT TTTTTCAAAT TCAACTGTGA ATGGCTCTAG TGCAATAACT 1200 

GCATTAATTT CACCATGTGT GATACGGTAT TGTAAATCTT TAGTTCTTAG CATTTCAGAA 1260 

CATGGAATGA TTGCAACACC TAATTTTAAA GCAGCAATAT ATAATTCATA CGTCGCAATA 1320 

GATCX3TGGCA TCATAATGAG TACTTTATCG CCTTTAGATA AACCGTGCGA TGCTAAAACA 1380 

TTACCTACTT TATTAGACTG TTCAATGAGC TGTTGGTAAG TGACTGATAT ATCTTCGCCT 1440 

TCAGTATTAT GATATAAAAT TGCCTTTTTA TCTGGTATGT GGCTATATTT TTCGATTTCC 1500 

GAAATAATGT TATATTTTTC AGGCGCGAAT AGAGCTGACT TTTGCATAAC TAACTTCCTT 1560 

TCATACATCC ACTTTTCCTG TGATGAACAT TGTAATTTTA TAAATGAATT ATATACATCA 1620 

TACGCCTATC TTTACAGAAT TTTCAATTAA ATAGGGTTAA ATACCAAAGT CCTCGACACT 1680 

ACACTTTGAC ATGACGTAGC ATTCAAGGAC TTTCAAATGA TTGAGGGTTG ATATCTCGC3G 1740 

CTAGACCATA TCAGCTAATT CAATACGAAT ATTGTATGAT AATTCACGAT TAATTATTTT 1800 

TACATCTGCA CCTTTCGAAG TGCCACGATG CTTGTGTGTA TGCTTGTACT CAGCTGAATT 1860 

TTGCCAATGA TAAAATGCTT GCCTATTTTC CCACAGCGTA ATAATGATAT AGTGTCTACC 1920 

AGCTGTTCTA GGTCTTAAAA ACCTTAATGC TTTAAATCCA TCAACGTTTT TTAAATGCTT 1980 
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10 



ATTTAACACA CATAATGAAT CATTTGATAA ATCATTTATC GCTTCTAGCA CATCGTAATA 2100 

TGCAGfTGTCA TTATTTrTTT GTATTGTCAG ACAATCATCC AGTTCTTCTA TTACATAACT 2160 

TCTATATTCA TCATAAATTT TCATAATAAA TGCCTTCATT TCATTTATAT TTTTGGTCAT 2220 

ATTACTkTAT ATCTATTACT AAtkCATTCC CGTATTTATT AATTACAATC ATAGTTTGGC 2280 

TyCTTTTTAA AAGATAAGAC TTTGTAAAAA GTATTAATAT TTCATGCAAA TGGGGGACAG 2340 

GAGTCGCCCA CTATTTTTGT GTCTTCAATT TCATGATCAT TATTTAACAT TAGTCATGAA 2400 

AATAGCCGAC 2410 
(2) INFORMATION FOR SEQ ID NO: 464: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 590 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 464: 

TTTATTAATT (yrMMAKTV GAGTAAATTA TCTTTACATT CTAAATTAGT CTTAACTACA 60 

ACTAGTATCC TAATAATTAT AGGAGCTATT ACATTCTTTT TATTAGAACA GTTTAATACT 120 

ATGCAACATA TGGGACTAGT TGAAAAAATC GGAAATTCTT TTTTCCAATC AGTAACAACA 180 

CGAACAGCGG GTrTTAACAG TATAGATATA GCAAGCATTA rCAAATCTAC CGCATtAaTG 240 

TTAATGCTAC TTATGTTTAT TGGTGGTGCC CCTCTCAGTG CAGCTGGAGG AATTAAAATA 300 

ACTACTTTTG CAGTTGCGTT TATTTTTGTA CTAAATTATA kACGTAAAGA AAATAATGTT 360 

TCAGTATTCA ATAAAGAAAT ATCTGACAAA CATATAAAAC TATCTATTGT TACCATTAAT 420 

ATCTCATTTC TATTTATCAG CATCATTACT TTTATATTAT CGATAATTAA TCCGAACATA 480 

TCATTAATCA AGTTATTATT CGAAGTGGTT TCTGCATTCG GAACAGTAGG GTTAAGTATG 540 

AACCTTACCA CAGAATATCA TGGTATTACT AAAATAATTA TTATATTCGT 590 
(2) INFORMATION FOR SEQ ID NO: 465: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 905 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 465: 
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TATTCGATTT GAnTCTTTTA AAATTATGTT TAAAAACATG TCTAATGATT CCGCATCATT 120 

TTTTGCACCA TCAATAAGCG TTTCAGcAAA CCCCTTAATT QAAOTAATAG GTGTTTTTAA 180 

TTCATGTGAA ACATTTGCTA CAAATTCACG TCTTAGATTT TCAAGTTGTT TCAGATTTGT 240 

TATATCATGC ATCACaACTA AAATCCCtTG CAAACTTTTT TGAGACCTAG TTAAAATCGG 300 

AACGCATGAA ATATCAAAGT ACTTGGCATG GACTTGGTTT ATTGCAACTT CCAATTGTTC 360 

ATAAATAGGT TTTTCAACTT TAAAACTTTC TAAAATTAAT TGCTCAATTT CAGTATTAAC 420 

ATAGCCGTGA TAGCCTACTT GTTCAATATT ATGCGAGATG TTGAACTGTT CATAATACGC 480 

TTTATTTGCA ACAACGATTT TTCCATTTCG ATCTATCATT AAAATAGCAC TTGGAATATT 540 

TTCAATCGTT GTTTTTAAAC GGTTGGATTG AATTTTTTGC TCATTATTAA GCTTTTGAAG 600 

GCGTCGTGCT AAATCATTGG TAGACACAAA AAGCGCTTTA GTTTCTACAA CATTACTTTC 660 

AGGTACACX;T ATGTGATAAT AACCATTTGC CAACAATTGT GTTGCATAAG TAACTTCTTG 720 

AATGGGACGG ATTAATGTAC GCTTAAAACT ACGGCTTGCA AAATACAGAC AAATGAGTAC 780 

AACTAAACAT GTCAAAATAA GATATTTCCA CAACGTCCAA TGCATTTCTG TAATATCGTT 840 

ATTGTAACCT TTAATCCATA CATGATAACC GTTAACCTTC TTATtAAAAA TAAAAACGTC 900 

CCTTT 905 
(2) INFORMATION FOR SEQ ID NO: 466: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1016 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 466: 

TTTGGTTAGC CCATAAAAGA AAAAAACAAG TAGTCATTTT TAAACAACAT ATCAAGTCTA 60 

CCCAAGAAAT ACGTTTTGAC AAAGCGAAAG TGCTTGAACA CAAAGATGAA ATAGCAAATT 120 

TTATTTCTTT CGAACCACAA AGTTTTGAAT TTTATTATTT TACAGAATCG GAATTTTCAG 180 

AAGAACAATT AAATGAAGTT TCGCCAATTA GAATTAAATT CAATGTTATA AGACACACAA 240 

AAGATTTGaT AAAGCATATG CCGAATATAT TTTTGGcTAG ACTTATTTCA GAAGATAATG 300 

ATAAAAAGAC ATATATGTTC TATAAACGCA AAGTATTAAC CGATAACTTT TTAGATAAAT 360 

ATATGCAGAA ATTTTCACCG GCAACATACA CAATAATATT TGTAAATGTC TTAATATGGT 420 

TATGTATGAT TTTATATTTA AATAATTTTT CGGATGTAAA ATTATTAGAT GTTGGCGGGT 480 
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10 



ATTTTA^jm TGAACATATA CTTATGAATA TGCTTTCATT ATTTATTTTT GGTAAAATAG 600 

TCGAAGCAAT TATTGGTTCA TGGCGGATGT TAACTGTATA CTTTATTGCA GGGTTGTTTG 660 

GAAACTTTGT ATCACTATCA TTTAATACGA CTACAATTTC AGTTGGGGCT AGTGGTGCTA 720 

TATTTGGTCT GATTGGATCA ATTTTTGCGA TGATGTATGT TTCAAAAACA TTTAACAAAA 780 

AAATGTTAGG ACAGTTATTA ATTGCATTAG TGATATTAGT TGGTGTTTCT CTGTTTATGT 840 

CAAATATAAA TATTGTGGCG CATATTGGAG GATTCATTGG TGGTTTATTA ATAACTTTAA 900 

TTGGCTATTA CTATAAAGTG AATCGtAATA TTTTTTGGaT TTaCtAATTG GTATGCtTGt 960 

tATATTTAwT GCACyTCinAA TTAgAtTTTT ACmATTAAAG AAGATAATAw TTATAA 1016 
(2) INFORMATION FOR SEQ ID NO: 467: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 406 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 467: 

AACTTTAAAT TTAGACATCT TTAAAACCTC TCTTAAACCA TGCCTATATC TCAAGATGAT 60 

ATTTCAAATG AACAATACTA TTGCTTGAGA CCATTAATGA ATGATCATAA ATATTTCTTT 120 

CTATAAAATT AGCTTTCCAA TAACTGTGTT GTTGCATAAT ATCATTCACA AGTACACCAT 180 

TTTCGGAAGT ATGATTATCT TTATCTATAC TTAAAACAAT TTGTTTAGTT TTAGCATGGC 240 

TAAATTGTTG AAGACCCTTA CACGATAAAC GTATAGCGTC TGAATTCTCA TTTAACAATG 300 

CGGCTGGgCA AACAAATGAC ACATTGTACT TCATGTTTGA ACtTCGTTAC AATCaTCGTG 360 

kCatTTTGat AAATAACAAT CCCTCGTAAT kGATTAAGTA TATTAT 406 
(2) INFORMATION FOR SEQ ID NO: 468: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1378 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 468: 
AGATAATACA CTTGAAGTTG GAATGGTTTG TGACGGTTAT TTAATGCGAA TTGAAAACTT 60 

AACACCATCA AATTTCTTCA ACTCAGCAAG TGAAGATAC6 ATTACTAAAA TTAAATTAAA 120 
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AGGTACAGCG TTAAAACTAA GAGAAGCCAT CAATTATGAT GAAATGGTTA TTGTAGATAG 240 

TATGACGTAG TTCCTAATTA TGCkAAAAGG GATTGATGAA AAACTGAAGG GCTTTTCATC 300 

AATCCCTTTT ATTTTAGGGG AATT6AATAG ATAGTTTTAA ACTATA06AA TTATTAATAT 360 

TTGAGATTTA ATTGAAATAA GTTTTAAAAA TTGGAGGAGA TAGATTAAGC GAAGTC7VTTT 420 

AAAGGTGAAG TTAAGTGTAT TCACAAAAAn TAGCCACACT CATATGACAT CGGATGAGTG 480 

TGGCTTAAGG ATCTATGGGG GGAGGAAnCC ATAGATGTTT ACTTTGATAG GCCAGATTAA 540 

ATATCAAAGT ATGCGATTAT TTATAGCTTG ATGCAAAAGT GGTATGCCTA TTAAAAGTTA 600 

CTGCACATAG CTTTTAATAT TCCGTTCAAA GGAAAGGGGC ATACAATTGA ACAATCTGTA 660 

ATAGTACTTT TAACCAGCTA TGCTAAAAGT CTAGTAGGGA GAACAGTTGT CCAATCACAT 720 

AAGAACCTCT AACTTCGTTA GTACGATTAA GAAAAGCTTT TTAGTTAGTA TGTAATACAA 780 

TTTATTGACG CGCGTGAATC TCTTTTATAA GAGTGTGTAG GGAATGGCGT TGTATAAATT 840 

GTATTAGAAG AACTTCTAAC GCATCTCTGT GGTTAAAAGA GATGAAGGGA ACGACAGTTT 900 

aATTAAAACT GCATAAGAAC TTCTAGCTTT TCTCTCTCGT TCAAAGAGAA GCAGcTGTTC 960 

GCAGTTTAAT CAAAACCACA TAAAGCTTTT AACTTTACTC TTTGATTTAA AGAGTGATAA 1020 

ATGTTTACAG TTTAATTAAA ACTGCATAAG AACTTCTAGC TTTTCTCTTT CGTTCAAgAG 1080 

AAGCAGCTGT TCGCAgTTTA ATCAAAACCA CATAAAGCTT TTAACTTTAC TCTTTGATTT 1140 

AAAGAGTGAC AAATGTTTAC AGTTTAATTA AAACTGCATA AGAACTTCTA GCTTTTCTCT 1200 

TTCGTTCAAA GAGAAGTTCT AATACCACCA TATCGTGCX^A TCGGGAACGG TATATATATT 1260 

AATAGGAGGG TAATATATAT TTAACGCACXJ ATATGGGACT ATTAGCCTTC GACTTTGTTA 1320 

TOTTGATGTG TGGCCTAAAA TATTGGAGAT ACCAATATTT TAGGTTGCAT CAACATCA 1378 

(2) INFORMATION FOR SEQ ID NO: 469: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4171 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 469: 

TCCCAACCAA TAATCGTGGC AAAAATACGG ATATTGGTAT GGCTTAACAA ATTGCAAATA 60 

TCGTTTAATC ATACATCCCC CCTAATCTAT TGCCCTATCC TATTCaTAAG CATAAAAATG 120 

AATAGAGGTT GGATACATAA TTTGTAGATG TAAATTCTTC TTACAATTTA CATTTTTAAA 180 
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TTATATTCTA TTCAATTTAA TCTATGGATA 
CTGTGTCCCC ACACGACAGC AAAAGTTATC 300 

ATACTTCTTT ACATCACTAA GTCAATATAA ATGATTTAAT CAGTATTTAC ACTTTATTTG 360 

CTTAATACTG TCTAATTTTT TTGTAACGTT CTTTCCAAAC TTTGATAAAA TCTGGCGCGA 420 

ATGGGCCCTT CTTCTGTTCT ATCCATTGTT GAAGAATGTC CACGTTGCXJT CTTAAAATAA 480 

TATCAATATC ATGCGGATAA TTCATTTGAT TCATATGTTG CTCATATTCA TCTTCATCTA 540 

ATAAATGATA CTTTCCGTTT GGATATACTT TAATATCTAA ATCATAGTCT ATATATTTTA 600 

ATGCCTCTTC ATCACAAACA AATGGTGATG ACAAATTGCA ATAGTAATAA ATTCCATCTT 660 

CTCTAAACAT GCAGATAACA TTaAACCAAT ATTCTGAGTG AAAGTaAACA ATTGCCGGTT 720 

CACGTGTTAT CCAAGTTCTT CCGTCACTTT CAGTCACTAA CGTATGATCA TTTCCACCAA 780 

TGACAACATG ATCAGTACCC TTTAATATT6 TTGTTTCAGA CCAAACGCGA TGAATCTTAC 840 

CATCATGTTT ATAACTCTGA ATTTTAATGT TTTCCCCTTC TTTAGGTATG GATTCTCTGA 900 

CCATACTCCA CACCACCTTC TGTTAATTTA ACCATTATAA ATTATAGCAT ATTTCAGAAA 960 

TAGTATTATA TAAATACATA TTTTTACGAA ATAAGATTTT ACTACTTAAT AATTAAACTC 1020 

GGTAATATTG CTAAGTACTA CAACAGAGAT TTACATGTCC CATTTAAAGT ATATAAAATC 1080 

ATCACTTTTA TATATCAACA CTTTAACTTT TTGACATTGT TATTCTATGA GATTTAAAGA 1140 

TATCATTTAT ACTTTTTAAA ATTAATGTCA CTATGTTTTC CGATAATATT ACCAATCATC 1200 

GAATGTTACC CATTTATAAA TTGATAAATs TTTGACATAG GTACAGGGAA TGTATATTGA 1260 

TCTCGATCAC TTAAATCAAA CCAAATCATG TCATCTGGTA ATGTTTCAAT GTTAATTGCT 1320 

CCTGAAACGG CGTATACTTT AATCTTCCAT GTTAAATGAG TAAATTGATG CTTCAACTCA 1380 

AAAATAGGTG TTTCTACTGG TTGAATGTCA TGACCGATTT TTTCAGTCAT TTTACGTCTA 1440 

GCATGCTCAC TTTCAAACAT AGGAAATTGC CACATACCAT GCAATAATTT TTCGCTACX5C 1500 

TTTTGCAACA GATATTGACC TTGATTATTT CTAATTAAAA AGACGGATTG CTCAATTACT 1560 

TTTTTACTTA CATTTTTAGA TTTAACAGGT AACTTTTCAA ATGTACCTTT ATCAAATGCC 1620 

TCACAGTTTT CTTGAACTGG ACAAAATAAG CATAATGGAT TTTTTGGTGT ACAAATTAAC 1680 

GCCCCTAATT CCATCATAGC TTGATTAAAC GTTCCAGCTT CTGTAGTAAC ATACGGTAAT 1740 

AATTCTTGTT CGTACGATTT CCTCGTCGAT TGTAATTTAA TATCTCGATA GTCATCATTC 1800 

AATCTAGACC ATACTCGAAA AACATTTCCG TCTACAGTTG CTAGTGGTAC ATTATATGCA 1860 

ATGCTCATTA CTGCAGCTTG TGTGTATGGG CCAACACCTT TTAACGCTTT AAATTGATCA 1920 

GGATCTTTGG GAACTAAGCC TTCATATTTA TCATGAACTT CTTTAATCGC CGTATGAAAA 1980 
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GCTTGaCTCA AAACTTCCAC AGTTGGAAAT CGTTCAACAA AACGATGATA ATAGTCAATA 2100 

ACTGTTTTAA CTTGTGTCTG TTGTAACATG ACCTCACTTA ACCAAATATA GTACGGATTG 2160 

GTCGTTTGTC GCCATGGCAT TTCTCTTTGA TTTTCATCAA ACCAGTGTAT CAAATTTTCT 2220 

TTAAAACTAG ACTGCTGATA CATTTATAAA ACCCTTTCCT CACCAAAATT AATTGTCTTT 2280 

ACTCATAATG TTTTTATTGT ACATTAAAAT CATGGTTAGT ATGTAAGTTA ATTTAGTTAT 2340 

TTGCGAAATT GGATTATAAT AGTATATATA ATATTATGAA ATGAGTGAAC TGATATGGAC 2400 

ACTGCAACAC ATATCGCAAT TGGGGTGGGC CTTACAGCAC TTGCAACTCA AGATCCAGCA 2460 

ATGGCTTCTA CGTTTGGTGC AACAGCTACA ACCCTTATCG TTGGTTCATT AATTCCTGAT 2520 

GGGGATACTG TTCTTAAATT AAAGGACAAT GCAACATATA TTTC6CATCA TAGAGGTATC 2580 

ACGCATTCCA TCCCTTTCAC AATACTATGG CCAATTTTAA TTACATTTTT AATATTCACG 2640 

TTCTTTAGTG GAACCAACCC ATTTCATGTA TGGATGTGGG CTCAGCTCGC AGTATTTTTA 2700 

CATGTCTTTG TAGATATATT CAATTCTTAT GGTACACAAG CGCTTAGACC TATCACAAAC 2760 

AAATGGATTC AATTAAGTGT GATTAACACA TTTGACXTCTA TTATTTTCAC AGTTCTTTGT 2820 

ATTGGTATT6 TATTATGQGT TATAGGCTTG CATCCATTTG CAGTCTTCTT TCCTATAATC 2880 

GCTTTACTAA TCATTTATTA CATGATTCX5T TTTAAAATGA GAGCCGTAAT TAAGCAACAA 2940 

GCTTTAAAAG CAATTCAACA AGAGCATCAC CCTGTTAAAG TATTTCTTGC GCCAACAATA 3000 

AAATTTATGG AATGGCGTGT CGCGATACAA ACTGATGCAC ATGACTATGT TGGAAAAGCA 3060 

TATGGTAGGA ATGTGGTGTT TAGTGATAAA GTGGAACGTC AAACATTATC AACAGACTCC 3120 

ATTTTATGGA AAGTCAAAGG TAATAAAGAT ATACGTACTT TTTTAAACTT TTCATCAATC 3180 

TATCGTTGGC AAACAACAAC GTTAGCAGAT GGTTCTACTG AAATTCGTTT GATTGATTTG 3240 

CGTTATTTAA AAAATGATCA TTATTCATTT GTGGCAATTG CACATGTMC AAACGATAAT 3300 

GTCATAGACC ACTCTTATAT TGGCTGGGTA TTTACAGAAG ATAAGTTACA ACGTAAACTG 3360 

TATGCTAAAT AATTTCAAGT TATTATTCAC TAAAGTTAAT CTATAAAAAA TGAACAACCG 3420 

GGCAGAATGA AAATCAAAAC GATTTTTACT CTGTCCGGTT TTTTAATGTA AAACTATGAA 3480 

TGCTTTTACA AAATCTAAAA TTTATATTGT TGCTAACAAA CTACCTTTAA TGACTCGAAA 3540 

TATCAAAATC AGTATAGGAA AACAATATCT AGATGATATT CTAATTGTTT CTGATTCTCA 3600 

CAGATTAATT TACACAACAG GTCAGCTAAA CATCATGAAG AAGTATCCGC CTCGTCTGTA 3660 

CTATCATTTG AAACATCCTG TTGATTATCa GTTTGTGTCA CTTTAGATGT TTTATTaTAA 3720 

ATTGCGTGTG TCGTATACCT TGCTAATACG AAATTTACTG CAGCTATTAA ACATAAGAAT 3780 
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TGTGCTATAC CATTAACAAT GTAATACATT GGATTTAGCA TTAGGATGTG ATTGATAAAT 3900 

ACATGATTTG GATTTCGTAT GAAAATAATT GGTAACAATA AGAAACACAA TACACAAACC 3960 

CCATAAAATA TGATATTTAT TTTTTCAGwT AACAGTCGAA TAAGACCAAA AGTAACXXyVT 4020 

ATTAATCCTA CAAAAATAGT TGCCATCACA ATAAAATAGA ATAGCGCTAT ATATGATGTT 4080 

TCGAAGTTTA CTGGTTTAAC CAATGCACTA ATCATCGTCA AAATGACTAG CATAATAAAA 4140 

CTTAAAATAG ACATAATAAC TACTGGCGTC G 4171 
(2) INFORMATION FOR SEQ ID NO: 470: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9821 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 470: 

TCAAAATCTG CAACAGTTTC AGCAGAATaG GGGCTTTCAA 60 

AATAAtCAAA GGAGAATAAT TTATGACTAA AACTTTAAAG GTTTATAAA6 GAGACX?ACGT 120 

CGTAGCTTCT GAACAAGGTG AAGGCAAAGT GTCAGTAACT TTATCTAATT TAGAAGCGGA 180 

TACAACTTAT CCAAAAGGTA CTTACCAA6T GGCATGGGAA GAAAATGGTA AAGAATCTAG 240 

TAAAGTTGAT GTACCTCAAT TCAAAACCAA TCCAATTCTA GTCTCAGGCG TATCATTTAC 300 

ACCAGAAACT AAATCAATTA TGGTAAATAC CGATGACAAT GTTGA6CCAA ACATTGCACC 360 

AAGCACAGCA ACGAATAAAA TATTGAAATA TACAAGTGAA CATCCAGAAT TTGTTACTGT 420 

AGATGAAAAT ACAGGAGCAA TTCACGGTGT AGCTGAAGGT ACTTCAGTAA TCACTGCTAC 480 

GTCTACTGAT GGAAGCGATA AGTCAGGACA AATTTCA6TG ACAGTAACAA ACGGATAGGG 540 

ATTTAAGGCG CAGTATATCT GCGTCTTTTT TATTTGAATA AAAGGAGCTA ATACAATGAT 600 

TAAATTTGAA ATTAAAGATC GTAAAACAGG AAAAACAGAG AGCTATACAA AAGAAGATGT 660 

AACAATGGGC GAACAGAAAA ATGCTATGAG TATTTAGAAT TAGTAAATCA AGAGAATAAA 720 

AAAGAAGCAC CTAACGCAAC AAAAATGAGA CAAAAAGAGC GACAGTTATT AGTAGATTTA 780 

TTTAAAGATG AAGGATTGAC TGAAGAAGAT GTTCTGAACA AGATGAGTAC TAAAACTTAT 840 

ACAAAAGCCT TACAAGATAT ATTTCGAGAA ATCAATGGTG AAGATGAAGA AGATTCAGAA 900 

ACTGAACCAG AAGAGATGGG AAAGACAGAA GAACAATCTC AATAAAAGAC ATTTTATCGA 960 

ACATTAAGAA AATACAACGT TTCTGTATGG AGCAGTATGG GTGGACATTA ACTGAAGTCA 1020 
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AAGAAAAACA AAGTGAACAA AAAGTCATTA CAGGTACGGA TTTAAGAAAA CTTTTTGOAA 1140 

GCTAGAAAGG AGGTTAATAT GAATGAAAAA GTAGAAGGCA TGACCTTGGA GCTGAAATTA 1200 

GACCATTTAG GTGTCCAAGA AGGCATGAAA GGTTTAAAGC GACAATTAGG TGTTGTTAAT 1260 

AGTGAAATGA AAGCTAATCT GTCAGCATTT GATAAGTCTG AAAAATCAAT GGAAAAATAT 1320 

CAGGCGAGAA TTAAGGGGTT AAATGATAGG CTTAAAGTTC AAAAAAAGAT GTATTCTCAA 1380 

GTAGAAGATG AGCTTAAACA AGTTAACGCT AATTACCAAA AAGCTAAATC CAGTGTAAAA 1440 

GATGTTGAGA AAGCATATTT AAAGTTAGTA GAAGCCAATA AAAAAGAAAA ATTAGCTCTT 1500 

GATAAA7CTA AAGAAGCCTT AAAATCATC6 AATACAOAAC TTAAAAAAGC TGAAAATCAA 1560 

TATAAACGTA CAAATCAACG TAAACAAGAT GCX3TATCAAA AACTTAAACA GTTGAGAGAT 1620 

GCAGAACAAA AGCTTaAGAA TAGTAACCAA GCTACTACTG CACAACTAAA AAGAGCAAGT 1680 

GACGCrtACA GAAGCAGTCC GCTAAGCATA AAGCACTTGT TGAACAATAT AAACAAGAAG 1740 

GCAATCAAGT TCAAAAACTA AAAGTGCAAA ATGACAATCT TTCAAAATCA AATGATAAAA 1800 

TTGAAAGTTC TTACGCTAAA ACTAATACTA AATTAAAGCA AACAGAAAAA GAATTTAATG 1860 

ATTTAAACAA TACTATTAAG AATCATAGCG CTAATGTCGC AAAAGCTGAA ACAGCTGTTA 1920 

ATAAAGAAAA AGCTGCTTTA AATAATTTGG AGCGTTCAAT AGATAAAGCT TCATCCGAAA 1980 

TGAAGACTTT TAACAAAGAA CAAATGATAG CTCAAAGTCA TTTCGGTAAA CTTGCAAGTC 2040 

AAGCGGATGT CATGTCAAAG AAATTTAGTT CTATTGGAGA CAAAATGACT TCCCTGGGAC 2100 

GTACAATGAC GATGGGCGTA TCTACACGGA TTACTTTAGG TTTAGGTGCA GCATTAAAAA 2160 

CGAGTGCAGA CTTTQAAGGG CAAATGTCTC GAGTTGGAGC GATTGCACAA GCAAGCAGTA 2220 

AAGACTTAAA AAGCATGTCT AATCAAGCGG TTGACTTAGG AGCTAAAACA AGTAAAAGTG 2280 

CTAACGaAGT TGCTAAAGGT ATGGAAGAAT TGGCAGCTTT AGGCTTTAAT GCCAAACAAA 2340 

CAATGGAGGC TATGCCAGGT GTTATCAGCG CAgcAGaAGC AAGTGGTGCA GAAATGGCTA 2400 

CAACTGCAAC TGTAATGGCT TCAGCGATTA ACTCTTTCGG TTTAAAAGCA TCTGATGCAA 2460 

ATCATGTTGC TGATTTACTT GCGAGATCAG CAAATGATAG TGCTGCAGAT ATTCAATATA 2520 

TGGGAGATGC ATTAAAATAT GCAGGTACTC CAGCAAAAGC ATTAGGAGTT TCAATAGAGG 2580 

ACACTTCTGC AGCAATTGAA GTTTTATCTA ACTCAGGTTT AGAGGGGTCT CAAGCAGGTA 2640 

CTGCATTAAG AGCTTCGTTT ATTAGGCTAG CTAATCCAAG TAAAAGTACA GCTAAGGAAA 2700 

TGAAAAAATT AGGTATTCAT TTGTCTGATG CTAAAGGTGA GTTTGTTGGA ATGGGCGAAT 2760 

TGATTAGACA GTTCCAAGAT AACATGAAAG GCATGACGAG AGAACAAAAA TTAGCAACAG 2820 
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CAGATAAAA7 TAATAGCTAT AGCAAATCAT TGAAGAACTC TAATGGTGAA AGTAAAAAAG 2940 

CAGCTGATTT GATGAAAGAT AACCTCAAAG GTGCTCTGOA ACAATTAGGT GGCGCTTTTG 3000 

AATCGTTAGC AATTGAAGTT GGTAAAGATT TAACGCCTAT GATTAGAGCA GGTGCGGAAG 3060 

GATTAACAAA ATTAGTTGAT GGATTTACAC ATCTTCCTGG TTGGGTTAGA AAGGCTTCGG 3120 

TAGGCTTAGC AATTTTTGGT GCATCTATTG GTCCTGCTGT TCTTGCTGGT GGCTTATTAA 3180 

TACGTGCAGT TGGGAGCGCG GCTAAAGGCT ATGCATCATT AAATAGACX3C ATTGCTGAAA 3240 

ATACAATTCT TTCTAATACC AATTCAAAAG CAATGAAATC TTTAGGTCTT CAAACATTAT 3300 

TTCTTG6TTC TACAACAGGA AAAACGTCAA AAGGCTTTAA AGGATTAGCC GGAGCTATGT 3360 

TGTTTAATTT AAAACCTATA AATGTTTTGA AAAATTCTGC AAAGCTAGCA ATTTTACCGT 3420 

TCAAACTTTT GAAAAACGGT TTAGGATTAG CCGCAAAATC CTTATTTGCA GTAAGTGGAG 3480 

GCGCAAGATT TGCTGGTGTA GCCTTAAAGT TTTTAACAGG ACCTATAGGT GCTACAATAA 3540 

CTGCTATTAC AATTGCATAT AAAGTTTTTA AAACCGCATA TGATCGTGTG GAATGGTTCA 3600 

GAAACGGTAT TAACOGTTTA GOAGAAACTA TAAAGTTTTT TGGTGGCAAA ATTATTGGCG 3660 

GTGCTGTTAG GAAGCTAGGA GAGTTTAAAA ATTATCTTGG AAGTATAGGC AAAAGCTTCA 3720 

AAGAAAAGTT TTCAAAGGAT ATGAAAGATG GTTATAAATC TTTGAGTGAC GATGACCTTC 3780 

TGAAAGTAGG AGTCAACAAG TTTAAAGGAT TTATGCAAAC CATGGGCACA GCTTCTAAAA 3840 

AAGCATCTGA TACTGTAAAA GTGTTGGGGA AAGGTGTTTC AAAAGAAACA GAAAAAGCTT 3900 

TAGAAAAATA CGTACACTAT TCTGAAGAGA ACAACAGAAT CATGGAAAAA GTACGTTTAA 3960 

ACTCGGGTCA AATAACAGAA GACAAAGCAA AAAAACTTTT GAAAATTGAA GCGGATTTAT 4020 

CTAATAACCT TATAGCTGAA ATAGAAAAAA GAAATAAAAA GGAACTCGAA AAAACTCAAG 4080 

AACTTATTGA TAAGTATAGT GCATTCGATG AACAAGAAAA GCAAAACATT TTAACTAGAA 4140 

CTAAAGAAAA AAATGACTTG CGAATTAAAA AAGAGCAAGA ACTCAATCAG AAAATCAAAG 4200 

AATTGAAAGA AAAAGCTTTA AGTGATGGTC AGATTTCAGA AAATGAAAGA AAAGAAATTG 4260 

AAAAGCTTGA AAATCAAAGA CGTGACATCA CTGTTAAAGA ATTGAGTAAG ACTGAAAAAG 4320 

AGCAAGAGCG TATTTTAGTA AGAATGCAAA GAAACAGAAA TGCTTATTCA ATAGACGAAG 4380 

CGAGCAAAGC AATTAAAGAA GCAGAAAAAG CAAGAAAAGC AAGAAAAAAA GAAGTGGATA 4440 

AGCAGTATGA AGATGATGTC ATTGCTATAA AAAATAACGT CAACCTTTCT AAGTCTGAAA 4500 

AAGATAAATT GTTAGCTATT GCTGATCAAA GACATAAGGA TGAAGTAAGA AAGGCAAAAT 4560 

CTAAAAAAGA TGCTGTAGTA GACGTTGTTA AAAAGCAAAA TAAAGATATT GATAAAGAAA 4620 
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10 



GTTGGTGGTC TAACTTTAGA GAAGACCAAA AGAAGAAAAG TGATAAATAC GCTAAAGAAC 4740 

AAGAAGAAAC AGCTCX5TAGA AACAGAGAAA ATATAAAGAA ATGGTTTGGA AATGCTTGGG 4800 

ACGGCGTAAA AACTAAAACT GGTGAAGCCT TTAGTAAAAT GGGCAGAAAT GCTAATCATT 4860 

TTGGCGGCGA AATGAAAAAA ATGTGGAGTG GAATCAAAGG AATTCCAAGC AAATTAAGTT 4920 

CAAGTTGGAG CTCAGCCAAA AGTTCTGTAG GATATCACAC TAAGGCTATA GCTAATAGTA 4980 

CTGGTAAATG GTTTGGAAAA GCTTGGCAAT CTGTTAAATC GACTACAGGA AGTATTTACA 5040 

ATCAAACTAA GCAAAAGTAT TCAGATGCCT CAGATAAAGC TTGGGCGCAT TCAAAATCTA 5100 

TTTGGAGAGG CACATCAAAA TGGTTTAGCA ACGCATATAA AAGTGCAAAG GGTTGGCTAA 5160 

TAOATATGGC TAATAAATCG CGCTCGAAAT GGGATAATAT TTCTAGTACA GCATGGTCGA 5220 

ATGCAAAATC CGTTTGGAAA GGAACATCGA AATGGTTTAG TAACTCATAC AAATCTTTAA 5280 

AAGGTTGGAC TGGGGATATG TATTCAAGAG CCCACGATCG TTTTGATGCA ATTTCAAGTT 5340 

CGGCATGGTC TAACGCTAAA TCAGTATTTA ATGGTTTTAG AAAATGGCTA TCAAAAACAT 5400 

ATGATTGGAT TAGAGATATT GGTAAAGACA TGGGAAGAGC TGCGGCTGAT TTAGGTAAAA 5460 

ATGTTGCTAA TAAAGCTATT GGCGGTTTGA ATAGCATGAT TGGCGGTATT AATAAAATAT 5520 

CTAAAGCCAT TACTGATAAA AATCTCATCA AGCCAATACC TACATTGTCT ACTGGTACTT 5580 

TAGCAGGAAA GGGTGTAGCT ACCGATAATT CAGGAGCATT AACGCAACCG ACATTTGCT6 5640 

TATTAAATGA TAGAGGTTCT GGAAACGCCC CAGGTGGTGG AGTTCAAGAA ATAATTCACA 5700 

GGGCTGACGG AACATTCCAT GCACCCCAAO GACGAGATGT GGTTGTTCCA CTAGGAGTTG 5760 

GAGATAGTGT AATAAATGCC AATGACACTC TGAAGTTACA GCGGATGGGT GTTTTGCCAA 5820 

AATTCCATGG TGGTACGAAA AAGAAAAAAT GGATGGAACA AGTTACTGAA AATCTTGGTA 5880 

AAAAAGCAGG GGACTTCGGT TCTAAAGCTA AAAACACAGC TCATAATATC AAAAAAGGTG 5940 

CAGAAGAAAT GGTTGAAGCG GCAGGCGATA AAATCAAAGA TGGTGCATCT TGGTTAGGCG 6000 

ATAAAATCGG CGATGTGTGG GATTATGTAC AACATCCAGG GAAACTAGTA AATAAAGTAA 6060 

TGTCAGGTTT AAATATTAAT TTTGGAGGCG GACTAACGCT ACAGTAAAAA TTGCTAAAGG 6120 

CGCGTACTCA TTGCTCAAAA AGAAATTAGT AGACAAAGTA AAATCGTGGT TTGAAGATTT 6180 

TGGTGGCGGA GGCGATGGAA GCTATCTATT TGACCATCCA ATTTGGCAAA GGTTTGGGAG 6240 

TTACACAGGT GGACTTAACT TTAATGGCGG TCGTCACTAT GGTATCGACT TTGGTATGCC 6300 

TACAGGAACG AACATTTATG CTGTTAAAGG CGGTATAGCT GATAAAGTAT GGACTGATTA 6360

CGGTGGCGGT AATTCTATAC AAATTAAGAC CX3GTGCTAAC GAATGGAACT GGTATATGCA 6420 
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ATCAGGTGCT ACAGGTAATT TCGTTAGAGG 
AGCACACTTA CATTTCCAAT TGATGCAAGG 6540

GTCGCATCCA GGGAATGATA CAGCTAAAGA TCCAGAAAAA TGGTTGAAGT CACTTAAAGG 6600

TAGTGGCGTT CGAAGTGGTT CAGGTGTTAA TAAGGCTGCA TCTGCTTGGG CAGGCGATAT 6660

ACGTCGTGCA GCAAAACGAA TGGGTGTTAA TGTTACTTCG GCTGACGTAG GAAATATCAT 6720

TAGCTTGATT CAACACGAAT CAGGAGGAAA TGCAGGTATA ACTCAATCTA GTTCGCTTAG 6780

AGACATCAAC GTTTTACAGG GCAATCCAGC AAAAGGATTG CTTCAATATA TCCCACAAAC 6840

ATTTAGACAT TATGCTGTTA GAGGTCACAA CAATATATAT AGTGGTTACG ATCAGTTATT 6900

AGCGTTCTTT AACAACAGAT ATTGGCGCTC ACAGTTTAAC CCAAGAGGTG GTTGGTCTCC 6960

AAGTGGTCCA AGAAGATATG CGAATGGTGG TTTGATTACA AAGCATCAAC TTGCTGAAGT 7020

GGGTGAAGGA GATAAACAGG AGATGGTTAT CCCTTTAACT AGACGTAAAC GAGCAATTCA 7080

ATTAACTGAA CAGGTTATGC GCATCATCGG TATGGATGGC AAGCCAAATA ACATCACTGT 7140

AAATAATGAT ACTTCTACAG TTGAAAAATT GTTGAAACAA ATTGTTATGT TAAGTGATAA 7200

AGGAAATAAA TTAACAGATG CX3TTGATTCA AACTGTTTCT TCTCAGGATA ATAACTTAGG 7260

TTCTAATGAT GCAATTAGAG GTTTAGAAAA AATATTGTCA AAACAAAGTG GGCATAGAGC 7320

AAATGCAAAT AATTATATGG GAGGTTTGAC TAATTAATGC AATCTTTTGT AAAAATCATA 7380

GATGGTTACA AGGAAGAAGT AATAACAGAT TTTAATCAGC TTATATTTTT AGATGCAAGG 7440

GCTGAAAGTC CAAACACCAA TGATAACAGT GTAACTATTA ACGGAGTAGA TGGTATTTTA 7500

CCGGGCGCAA TTAGTTTTGC GCCTTTTTCA TTAGTATTAA GGTTTGGCTA TGATGGTATA 7560

GATGTTATAG ATTTAAATTT ATTTGAGCAT TGGTTTAGAT CTGTGTTTAA TCGCAGACAT 7620

CCTTATTATG TTATTACTTC TCAAATGCCT GGTGTTAAAT ATGCAGTGAA TACAGCTAAT 7680

GTTACATCTA ATTTAAAAGA TGGTTCTTCA ACTGAAATTG AAGTAAGTTT AAATGTTTAT 7740

AAAGGGTATT CTGAATCAGT TAATTGGACC QATAGCGAGT TCTTATTCGA CTCTAATTGG 7800

ATGTTTGAAA ATGGAATTCC TCTTGATTTC ACACCTAAAT ATACTCATAC ATCAAA7CAA 7860

TTTACTATTT GGAACGGTTC TACTGATACG ATAAATCCAC GATTCAAGCA CGATTTGAAA 7920

ATATTAATTA ATTTAAATGC GAGTGGAGGA TTTGAACTGG TTAATTATAC AACAGGTGAT 7980

ATTTTTAAGT ACAACAAAAG TATAGATAAA AACACTGATT TTGTTTTAGA TGGTGTGTAT 8040

GCATATCGAG ATATAAACAG AGTGGGAATT GATACAAATA GAGGCATTAT AACATTAGCG 8100

CCAGGTAAAA ATGAATTTAA GATTaAAGGA GACGTCAGTG ATATTAAAAC TACATTTAAG 8160

TTTCCTTTTA TTTATAGGTA GGTGATTTAA TGGATTATCA TGATCATTTA TCAGTAATGG 8220
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ATTATGAACT GAATGAAGCT AGGTACATCA CCTTTACAGT TTATAGAACT ACTCATAATA 8340 

GTTTTGTTTT TGATTTATTG ATTTGTGAAA ACTTCATAAT TTATCATGOT GAAAAATACA 8400 

CAATTAAGCA GACAGCGCCA AAGGTTGAAG GTGATAAAGT TTTTATTGAA GTTACGGCAT 8460

ATCACATAAT GTATGAATTT CAAAATCACT CAGTGGAATC AAATAAGCTT GATGACGACA 8520 

GTAGCGAAAC TGGTAAAACG CCAGAATACT CTTTAGATGA GTACTTAAGA TATGGATTTG 8580 

CAAATCAAAA AACGTCAGTC AAGATGACCT ATAAAATAAT TGGAGATTTT AAAAGAAAAA 8640 

TACCAATTGA TGAATTAGGT AATAAAAATG GCTTAGAATA TTGTAAAGAA GCAGTAGATT 8700 

TGTTTGGTTG TATTATTTAT CCAAATGATA CGGAGATATG TTTTTATTCT CCTGAAACAT 8760 

TCTATCAAAG AAGCGAAAAA GTAATAAGGT ATCAATATAA TACTGATACT GTGTCTGCTA 8820 

CTGTCAGTAC GTTGGAATTA AGAACAGCTA TAAAAGTTTT TGGGAAAAAG TACACAGCCG 8880 

AGGAAAAGAA AAATTATAAT CCTATTAGAA CAACTGACAT TAAATATTCA AATGGTTTTA 8940

TAAAAGAAOG TACTTATCGT ACCGCAACAA TTGGGTCTAA AGCTACTATT AACTTTGATT 9000 

GCAAGTATGG TAATGAAACA GTTAGATTTA CAATAAAaAA GGGCTCTCaA GGTGGAATAT 9060 

ATAAGTTGAT TTTAGACGGC AAGCaAATTA AGCaAATTTC TTGTTTTGCT AAGTCGGTTC 9120 

AGTCTGAmAC AATAGATTTA ATAaAAAATA TTGATAAAGG CAAGCACGTT TTAGAAATGA 9180 

TATTTTTrGG AGArGrCCCC AAAAATAGAA TTGATATATC TTCAAATAAA AAAGCTAAGC 9240 

CTTGTATGTA TGTTGGAACT GAAAAATCAA CAGTCTTAAA TTTAATTGCT GACAACTCAG 9300

GTCGCAATCA ATACAAAGCA ATTGTTGaCT ACGTCGCAGA TAGTGCAAAG CAGTTTGGGA 9360 

TTCGATATGC TAATACGCAA ACAAATGAAG ATATCGAAAC ACAGGATAAG CTGTTAGAAT 9420 

TTGCAAAAAA GCAAATAAAT GATACTCCTA AGACTGAATT AGATGTTAAT TATATAGGTT 9480

ATGAAAAAAT AGAGCCAAGA GATAGCGTAT TCTTTGTTCA TGAATTAATG GGATATAACA 9540 

CTGAATTAAA GGTTGTTAAA CTTGATAGGT CACATCCATT TGTAAAOGCA ATAGATGAAG 9600

TGTCTTTCAG CAATGAAATA AAGGATATGG TACAAATTCA ACAAGCGCTT AACAGACGAG 9660 

TTATTGCACA AGATAATAGA TATAACTATC AAGCAAATCO TATAAATCAT TTATACACTA 9720 

GTACTTTGAA TTCTCCTTTC GAGACAATGG ATATAGGGAG TGTATTAATA TAATGGCAAC 9780

AGAAGAAGTT AAAATCAAAG CGCTACTTGA AAACGATAAA C 9821 
(2) INFORMATION FOR SEQ ID NO: 471: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1017 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 471: 



rrATTGTTTT            TTCTCGGTTA 1 1 xAACCATC GGTTTCAATC CTAOGTGCTT 60

CACTTTCTAT            AaTTTCAGCT TGACCACCCG CTGTATAAAG GGTCAAAGTT 120

GCTAATCGAT            TATAGGACCA ACATCGATAT CAATATTTTG AATACGAAAA 180

TATGGTATTA CCTTTTCATC CAAAAATAAA ATGCCGTTTC GTACACGCAA ATGGTGTTTT 240

TCAAATGCAT ATCTGCAGTG CTTATATOGA TAGACTGGCG CTATAACAAG CGTGAAAACA 300

GCAACAAGTA            CACACTACTA ACAATGGATA AATGGTTATC TAAAATCTCC 360

CAAAATAGCC AGTTCAAAAT ATTAAATGCG ATTAAAAGTA CAAGCGCTAT GGGTATCCAA 420

AACAGCACAG CACTTAACCT CATCACTTTT TTAGCGTGTG 6TGACATAAA ATTATAATCC 480

CTCATCATTT TCACCTCTTA AATACCATGA TTTCAATTTA TTTGCATCTT CACTTCTTGC 540

GTATTTTAAG TTAATCGATT GGGCTCCCAC ACCTTTAGCA ACAATAAAGC TAAAATTATT 600

TAAATTGTTG CGTTTAAGTA ATGTATGTTG CCAAGTGTCA AATCCTATAA TGTGATGCGC 660

TTTAAAATAA TAAATATTTC GTTTCAATAG CTCGAAATTC TGGATAGTAA TTTGTTCTTC 720

TGTCATTTTA AAACCCGCAT GTTTGACATA AAGATATCCT TTGATCACAA ATAAACCAAT 780

AATGACTATT GTTATAATCG TAAATAACAA TAATAATTGA TTCCAAAAAT AACAGCCTAT 840

ACCTGCCATA GCTATGACAA TAATACTAGG TATTAAAAAG TGTCTGTGGA AACCTGACAA 900

AGGCATACCT TCATTAACTT GTTGATAAGA TAAATCTGGT ACTAAATTCT GGATAATTTG 960

ATATGCTTTG TCTCGTTTAA TAAACGGCaA TATCGGCACA CTACCTGAAT CATTGTC 1017



(2) INFORMATION FOR SEQ ID NO: 472:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6806 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 472: 

TAGGATTGAT TAATCCTTTC TTCAAAATGA TGAATGTGTA ATGTTAAATA TATAATTTCA 60 

GACTCACTGA TATTAACATC AAATTGTTTT TGTATCATAG TTAAAATTTT ATATGCTGTG 120 

TTATAGCAAA TCGGATAGTG ATTTTTAATC ATAGACACAA AATCATCTTG TGCATGTATA 180 

TATTCTTTTC TTCTTAATCG GCGAATTAAA AATTGTACGT GCCTTATAAA ACGTTGGTAT 240 
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TTAATAACAT TATTnGATCA AGGTCATCTC ATGCATAGAT AAATCTTCTG TATTaGATGC 360 

AATATGTAAT GCAATAAATC CTATCTCATC TTCAGGaAAA TGTaCATCTA ATGCTGCATT 420 

TAACTGATCA ATCACCTGTT TAGCAATATG ATATGCATCA CTATATAACT GCATAGTTTC 480 

CATAACAAAT GGATTGCTAA TAACTTGATT TTGTTTTAAG CGTTTATAAG CAAATATAAT 540 

ATGATCCGTT AATGAAACTA CAAGTTGTTT TGAATCAACA TTCATCGCAG TATTAGAAAT 600 

AAAATTCAAC GAATCAATAA TTACTTGTAA TACATTATCA TCAGCXSATTT CAACTAAACT 660 

TTTATAATGT GCTTTTTGTT GCTCACTTTC TAATTTATAA ATTTTCTCTA TTGTAATAGT 720 

TTGGTCGTTT AACGCCATTC CCTCTTTTTT GTTAAAACCA ATACCTTTAC CGATTAAAAT 780 

AACTTCTTGA TCATTATTAG TACATACTAC GACATTGTTG TTCAATGTTT TAGTAACAAT 840 

ATATTCTCCC ATTATCATCA CCTATTTTTC TATTATTAAG ATTATATATC GGAAATGTCT 900 

AACTTGAAAG TATAATAATT TAAATACTTG TGTTCTATTA CACGCCTATC CTATATGATA 960 

TATCTTAATT TAATTTCGAT GTCTCTCAAA GTGGAATAAC TATAATAAAA AAATCTGACT 1020 

CCCATATTTA CAAATAATTC TATTTATTTA TATATTATCT GAATTAATAC TCAATACAAA 1080 

CTAAAACGTA CTATTAAATT GTGCAAAGCT AAAACAAATT TATATTCATC TATCCAACAA 1140

TATGTCTTAT CAATGGTATA GTCTTTGCAC ACCAATGGAG GAAATAAATC TCAACCTTAC 1200 

TATATTAATA TATAATCAAA TCTTAGATTA ACTAGTGTAA TGATACAGAT GATAATTGAG 1260 

TACAAATTTA AAACCCTGAG ATTTTCGCTT TAATTTGAAA ACCTCAGGGT TTATTTGATT 1320 

TTTATATAAT GAATCGTTAC ATTAAAAATA TTTATTTATC AGAGTTCTTA TATTTGTTAG 1380 

CGCCCCAAGC ACTAATTCCA AATAAGTTAA TTTCTAAGTT TTCAGGTTTA AAGACAGGGT 1440 

TCTTGCCTTC TTTTTTCTGC TTTTQATAAT CTTTCATCAA TGCAAAAGCT ACATTGGACA 1500 

GTCCTATAAT GGAAATAATG TTTACAATTG CCATTAAGCC CATAAATAAG TCTGCCGTAT 1560 

TCCATACTGT TTCTGTTTTT ACAACTGCAC CGACAAAGAC AAGTACTACA ACAAGACATC 1620 

TAAAGATAAA TAATATTACA CGGTTTGTTG ATAAAAATTC AATATTAGAT TGACCGTAAT 1680 

AGTAATTACC TACAACAGAT GAAAATGCAA ACAGTGTAAC tGCTaTTGTT AAGAAAATAC 1740 

CTCCAGCAGA ACCTAAATGC TCATTAAGTG CTGATTGAGT AACTGCAACA CCTTGAGGTG 1800

CGTTATCACC AAATTTCAGT CCTGAATATA GTAAAATCAT GATTGCAGTT GCTGTACAAA 1860 

CCAACATTGT ATCAAAGAAC ACACCTAATG ATTGGATTAA ACCTTGCTTA ACAGGGTGTG 1920 

GTACGGCAGC AGTTGCCGCT GCATTCGGCG CAGAACCCAT ACCAGCTTCG TTAGAGAATA 1980

AACCACGTTT GATACCTTGA AGAACCGCAG CACCTACAGC GCCACCAGTT ACTTGTTCGA 2040 
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GCAATATTAC 


TAAAACCATA CCAATGTAAA TGATAGCCAT AATCGGTACA ATTAACGAAG 2160

ATAACGTAGC AATACTACGT ACACCACCAA ATATAATAAT AGCTGTTACG ATTGCTAAAA 2220

TAATACCTGT GATTACTGGA CTAATATTAT ATTGCGTATT TAACGACTCC GCAATTGTAT 2280

TAGATTGCAC TGTGTTAAAT ACAAATGCAA ATGTAATTGT AATTAAAATC GCAAATACGA 2340

TACCTAGCCA TTTTTGATTT AAACCTTTAG TAATATAGTA AGCTGGACCA CCACGGAATC 2400

CACCATCTTT ATCATGTACT TTATAAACCT GAGCCAAAGT CGCTTCTATA AATGCACTCG 2460

CTGCACCTAT AAATGCAATA ACCCACATCC AAAATACTGC ACCTGGACCG CCTAAAACAA 2520

TCGCAGTCGC AACACCAGCA ATATTACCAG TACCAACTCT CGAACCAGCA CTAATCGCAA 2580

ATGCTTGGAA TGGCGAAATA CCCTTCTTAC CATCTTCTAA AGTTTCTGGA CGTTCTACTA 2640

AAGCTCTAAA CATTTCAGGT AACATTCGTA ATTGAACGAA TTTAGAACTA ATCGTAAAGA 2700

AGAATCCAGC TGTCAATAAT AGACCAATTA AATATTGAGA CCATATTAAA TCGGTACCAA 2760

CATGGACAAA TTCTTTAAAC CATCCAGGTA TTAAACTATC GAAMCTTTC AAAATAAACC 2820

CCTCGCATCC TCTACATGAA TCATGTACCT TCTATAAAAT TAGACCGAAT TGAACCTTCA 2880

GTAAATATAG AGATACATCA TCATTTCTTA TACAATACAA GAGATTTATA TTAGTTTGGT 2940

CAAAGTATAT CGCTAATTTA ACGATAAGTA CTTGGTCAGC ATTTAATATA AATCCCTTGA 3000

ATTTAGTCAA AATTTAACAT TACTGTATTT TATCATTTAA TTTCGTGATT GCATATAGTT 3060

TTTAGCTAAT ATACATGTCT ATTACTTCAC CAAAATCATC TGTATCTACA ATGAATGAGC 3120

CATTTGTATA TTGTTCAGAT TTATGAATAT CATTAATTAA ACCATGTTCT TCATTTGATT 3180

TTGAATATAA TGTATATTGA CTATGTTCAC CTGTCACTAC ATGTGCAGCT ACAATACGAT 3240

GTGGATTTTT CTTTAATTCT TTTAATAAAG TTATTCCaCG TTGTGCTCTT TTAGCAACTT 3300

GTAAGATTTT AAAACTAATA CGTTTTAACG AGCCGCGTTG TGTGGCCATC AATATAGTAT 3360

CATTTTCAGA AACACCTTCT GTCATAACAA CGAAATCTTC AGCTTTAAGA TTTATTGATT 3420

TAACACCAGC TGCCCTTAAT CCGGTATCTG ATAGTTCACT TGTATTATAC GTTAATGACA 3480

TACCTTTATT AGTAATGACG GTAATTAATT GATCTTTTTC AAAGCGCATA ACACTAATCA 3540

AATCATCATT TTCTTTAACT TTAGTAGCAA TTAAAGGTTT ATTAAAACXSC GTTGTTTTAA 3600

ATAGAGGCAC TGTACTTTTC TTAATCATGC CATTTTGAGT CGCAAAAACA TAAAATGCAT 3660

CTGTATTAAA GTCCTTTTCA TTAAAGACAT TAATAACCAC TTCATCTTCT TCGATAGGAA 3720

CTATTTGTGA TACATGTTGT CCCAATTCTT TCCAACGAAT ATCTGCTAAT TTATGAACCG 3780

GTATAAATAG ATAACGACCT TTATTTGTAA ATACTAGTAC GGTATCTTGC GTATTTACTT 3840






TAAAGCTACG AATAGAAGTA CGTTTAATAT ATCCATGACG TGTCATACTT AAAATAACTy 3960

CTTCACTAGG CACCATAACT TCTTTGTCAA TTTTAATTTC TTCAATTTCT GCTTCAATTA 4020

AAGACAGTCG TTCAGATTTG AATTTCTTTT TAATTTCATT CAATTCTTCT TTTATGACAT 4080

TCAATAATQC ATCATGGTTA TCAAGAATAT GACGTAATTG TTTGATTAAT GCTTCAAGTT 4140

CTTTATGTTC ACCTTCAAGC GCAACTATGT CAGTATTTGT TAAACGATAT AACTGTAACA 4200

TTACAATTGC TTCAGCCTGT TCTTCTGTGA ACTCGTATAC TTCGATAAGG TTTTCTTTAG 4260

CGTCACGCTT GTTTTTAGAG CwACGAATCA ATTCGATTAC TTTATCTAAA ATTGACAACG 4320

CTTTAATCAA ACCTTCAACG ATATGCATAC GTTTTTCTGC ATTATCTAAT TCAAACTTCG 4380

TTCTATTTGC AACAACCTCA ATTTGGTGAT TCAAATAACT ATCTATAATT TQACGAATAC 4440

CCATCAATTT TGGACGACCA TCACTAATAG CGACCATGTT GAAATTATAT GAAATCTGTA 4500

AATCAGAGTT TTTATAAAGA TAATTTTTGA TTGATTCACT GTTCACATCT TTTTTCAATT 4560

CAATTGCTAT TCGTAAACCA GTTCTATCAG TTTCATCACG TACTTCAACG ATACCATCGA 4620

CTTTTTTGTC AGCACGTAAT TCATCGATAC GTTTTACTAA GCTACTTTTG TTCACTTCAT 4680

ATGGAATTTC AGTAATAATT AACTGTTTAC GTCCATTGCG TAAAGTTTCT TCTTCAACTT 4740

TAGAACGAAC TATAATTCTA CCTTTACCTG ATTCATAAGC TTTTTTAATA CCATCAATAC 4800

CTTGAATAAT ACCACCAGTT GGAAAATCAG GACCTTTAAT ATATTTCATT AATTGATTGA 4860

CTGTAATATC CGGATTATCA ATATATTTAA GTGTTGCTTG AATCACTTCA GCTAAATTAT 4920

GTGGTGGTAT ATCTGTCGCG TAACCTGCAG ATATACCTGT AGAACCATTC ACTAGTAAGT 4980

TAGGAAATCT TGATGGCAAT ACCATTGGTT CGAGTGTCGT ATCATCATAG TTTGGAATGA 5040

AAGAAACTGT CTCTTTATTA ATATCACGTA ATAACTCTTC AGCTAGTAAG CTTAACTTAG 5100

CTTCAGTGTA ACGCATTGCC GCTGGCGGAT CATTATCGAT ACTACCATTA TTACCATGCA 5160

TTTCTATTAA GACATGTCGT AACTTCCAGT CTTGACTTAA ACGGACCATT GCTTC6TACA 5220

CTGAGGAGTC TCCATGTGGA TGATATTGAC CAATAACATC ACCGACTGTT TTCGCACTTT 5280

TACGGAAATT TTTATCGTGT GTATTACCAC TTGAATACAT TGCATATAAA ATACGACGTT 5340

GTACTGGTTT TAAACCATCA CGAACATCTG GCAATGCACG CTCTTGAATA ATATATTTAC 5400

TATATCTTCC AAAGCGATCA CCTAAAACAT CTTCAAGTGA TAAATCTTGA ATTATTTCAC 5460

TCACTAGATT TCCTCCTCAT CAAATTGATC ATTTTCAAGC ACTTGTACTT CAGAATTATC 5520

TAAAATACTT TGGTCCTCTT GCATACCAAA CTCAACATGC TTTTCAATCC ATTCACGTCT 5580

AGGTTGTACT TTGTCACCCA TTAATGTTGT TACACGTTTA GATGAACGCA CTTCATCTTC 5640
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AGGGTTCATT TCACCCAAAC CTTTGTAACG TTGTAACGTG AaOCCTTTAC CAAGTTCTTT 5760 

TTGCAATTTA TTAAGCTCTT CGTCTGTCCA AGCGTATTCA ACTCGCTTTG TTTTGCCnT 5820 

ACCTTTTTCC AATTTATAAA GTGGAGGTAA AGCAATAAAT ACACGACCTG CTTGAACAAG 5880

CGGTTTCATA TATTTGAAGA AGAATGTTAA CAATAGCACT TGAATATGCG CACCATCAGT 5940 

ATCAGCATCA GTCATAATAA TTACACGATT ATAATTACTA TCTTCAATTT TAAAGTCAGT 6000 

ACCAACGCCT GCCCCGATTG TGTGGATAAT TGTATTAATT TCTTCATTTT TAAAAATATC 6060 

TTCTAGACGT GCTTTCTCTG TATTAATTAC CTTACCACGT AATGGTAATA TCGCTTGGAA 6120 

TTTGCGGTCT CGTcCAAGTT TTGCTGAACC TCCCGCAGAA TCACCTTCGA CTAAATACAA 6180 

TTCAnTTTT TCAGTGTTTT TACTTTGTGC AGGTGTTAAT TTACCAGATA GCAAAGTGTC 6240 

TTTACGCTTG TTTTTCTTAC CTGAACGAGC ATCTTCACGA GCTTTACGTG CAGCTTCCCT 6300 

TGCTTGTTGT GCTTTAATCG CTTTTTTCAC AAGTGATTTA GACAATTGTC CTTTTTCTTC 6360

TAAATAGAAT GGCAATTTGT CTGCAACAAC TGAATCAACA GCACTTCTAG CTTCAGAAGT 6420 

ACCCAATTTA GATTTCGTTT GTCCTTCAAA TTGCAATAAT TCTTCTGGAA TACGAACAGA 6480 

CACAACAGCT GTTAAACCTT CACGAATATC ATTACCATCT AAGTTTTTAT CTTTTGTTTT 6540

AAGTTCATTA ATACGACGTG CATAATCATT AAATACACGT GTCATTGCTG TTTTAAAACC 6600 

AACTTCATGT GTACCACCAT CTTTAGTACX3 TACATTATTT ACAAAACTTA AAATACTTTC 6660 

TGAATATTGA TCATTATATT GGAAAGCTAC GTCTACCTCT ATACCATTTG CTTCACCTGA 6720

AAATGTAGCC ACGTCATGCA AAACTTCTTT TCCTTCATTG ACATAACTAA CAAACTCTTT 6780 

GATTCCTTCT TATAATGGTA TGTCTT 6806 

(2) INFORMATION FOR SEQ ID NO: 473:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1716 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 473: 

GGGGCAnAAA TTCCAATACA CTCATTACCA AATATATACA CCACTTCCTG CTACAAGTnn 60 

TTTTACTTGA TCTTGGTCTT TTCCGCAGAA AGAGCATTTC CAAATTTTCT TCATCTTCCA 120 

TTGAATTTAA ACATTCTTTT TACACCCCTA TTCGTTAAAG ACTATACTAG ATTGGATGTT 180 

ACAATGCAAC ATATTAACAT ACAAACTTTT TGCTTAAAGA ATAGTAGCAG ATACATAAGC 240 
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CCtTCAACGA ACTTTGCGTT ATCTCTTAAT AAATCGATAA CTTTTPQGAT ACQAACATCA 360 

TTTTTAATGA TATCAGTATT ACCTAAAGTA TTmXSATAT CTTGAACTOA GATATTAAAT 420 

T6TTTACTCA TTTTTTCTAA TTCTTTATCG ATATCTTCAT CAGTAGCTTC GATTTTTTCA 480 

GCTTCAGCGA TCGCAGTTAA AGTTAAGTTA GTTTTAACAC GTTGTTCTGC ATCGTCTTTC 540 

ATTTGCTCTC TTAATTGAGT TTCATCTTGA CCTGAGATTT GGAAGTACGT TTGTAAATCT 600 

AAACCTTGTT GTTGAATTCT TTGTGCAAAT TCAGACACCA TACGATCTAA TTCAGTATTA 660 

ACCATTGCTT CAGGAATATC GATTGTTGTA TTATCAGTAQ CTTTTGTAAT CGCTTCTTCT 720 

TTTTCAACAT TTTCAGCATC TGTAGCTTTT TGTTCAGCTA AACGTTTACG TAAGTTTTCT 780 

TTGTACTCGT CTACTGTATT TGCTTCTGCA TCTAATTCAT TAGCAATTTC ATCTGTTAAT 840

TCTGGGACTT CTTTAAATTT AATTTCGTTA ACTTTTGTTT TGAAAGTTGC TTCTTTACCG 900 

GCTAATTCTT CAGCATGGTA TTCTTCTGGG AATGTTACGA CAACATCTTT TTCTTCGTCA 960 

ACTTTCATAC CTTCTAATTG CTCTTCGAAA CCAGGTATGA ATGAACCTGA ACCGATTTCT 1020 

AAATCGTAAC CTTCAGCTTG TCCACCTTCX5 AATTCTTCtC CGTCAACTGA ACCACTAAAG 1080 

TCGATGTTAA CTGTGTCGCC ATTTTCAACA ACACCATCTT CTTTAACX3AC CATTTCAGCT 1140

AAATGTCCTA AGCTGTGGTC AATCGCTTCT TGTAACTCAT CATCAGATAA TTCAGTTTCT 1200 

TGTTTTTCAA TTTCAAGACC TTTATAGTCT CCTAATTTAA CTTCTGGCTC AACTGTAACT 1260 

GTTGCTTCAA AAATGAAATC TTTACCTTTT TCAATTTGAG TAACACTTAC TTCtGGTTGT 1320

GCAACTGGTT TAATATC7VGT TTCGTCAATT GCTTCACCAT AAGCATCTGG TAATAAAATG 1380 

TCGATAGCAT CTTGATATAA TGCTTCTACA CCAAAGCGTT GTTCAAAAAT TGGACGTGGC 1440 

ACTTTACCTT TACGGAATCC AGGTACGTTA ATTTGTTTAA CCACTTTTTT GAATGCTTGA 1500 

TCTAACGCTT TGTTTACTTT TTCTOCAGGA ACAGTAACAG TTAATAAACC TTCGTTACCT 1560 

TCCTTTTTTT CCCAAGTTGC TGTCATGTAT ATATACCTCC ATGATTAACT AATTTATTTT 1620 

TTCAACTTCC CTATTATATC ATACGTCTAT TCCCTATACA AACATTGAAA TCACAACGTT 1680 

TATATATTTG TAAATCAACT TTTTTCGTCA AAACTA 1716 
(2) INFORMATION FOR SEQ ID NO: 474: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 795 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
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75 



TGACCAAGTA CGTTTCGAAG TTGCCATTAA AGCATTAAAC CCATCATTGA AAGCATTCGC 60

ACCTGTACGT GAGTGGGCAT GGAGTCGTGA AGAAGAAATC GATTATGCAA TTAAACATAA 120

TATCCCTGTA TCAATCAACC ATGATTCACC TTATTCTATC GATCAAAATC TATGGGGCAG 180

AGCX3AATGAA TGTGGTATTT TAGAAGATCC TTATGCTGCG CCACCAGAGG ATGCX5TTCGA 240

TCTAACAAAT GCTTTAGAAG AAACACCAGA TACTGCTGAT GrAATCATTT TAACGTTTGA 300

TAAAGGCATC CCAGTTCAAA TTGATGGCAA AACATATGAA TTAGACGATT TAATTTTAAC 360

GTTGAATGCA TTAGCTGGTA AGCATGGTAT CGGAAGAATT GACCATGTAG AAAATAGACT 420

TGTAGGTATC AAATCAAGAG AAATTTATGA GGCACCTGCT GCAGAAGTTA TTTTAAAAGC 480

GCATAAAGCA TTAGAAACGA TTACGTTAAC GAAAGATGTC GCACACTTTA AACCAATCAT 540

TGAGAAGCAA TTTGCTGAAC AACTATACAA TGGACTTTGG TTCTCACCTT TAACTGATAG 600

CTTGAAATTA TTTATTGATA GTACTCAGCA ATACGTAAGT GGTGATGTCA GAATTAAATT 660

ATTCAAAGGT AATGCCATCG TGAATGGTAG AAAATCACCT TACACATTAT ATGATGAAAA 720

ATTAGCAACT TATACAAAAG AAGATGCATT TnATCAAGAC GCTGCTGTTG GCTTTATCX3A 780

TATCTATGGT TTACC 795



(2) INFORMATION FOR SEQ ID NO: 475: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 887 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 475:

CGATTGAAAG AAGACGGTTC AGTTGAAAAG TTTCCAAAGC CAGTAATTAG CCAACAACCA 60

GAAGGATATA CGAGTCATTT TAGAGATCCT AAAGTTTTTA AATATGATGr QaAATATTAT 120

GCAATCaTTG GTGCmCaAAA TAATGATCaG CaAGGTCGAT TATTACTTTA TAATACTGAA 180

GATATAATTA ATTGGCATTA TTTAGGTGAA ATAAATACAG AGTTGGATGA TTTTGGATAT 240

ATGTGGGAAT GCCCAGATTA CTTTAATGTA GATAATCAAG ATGTCATACT TATTTGTCCA 300

CAAGGTATTG AACCAAAAGG CGATCAGTTC AAAAATATTT ATCAAAGTGG TTATATACTT 360

GGAAAGTTTG ATATTGAAAA GTTAACATAT GAACATGAAA ATTTTGTCGA GCTTGATAAT 420

GGTTTTGATT TCTATGCACC TCAAACATTT TTAGATGAAA AAGGCCGACG AGTACTAATT 480

GGATGGATGG GGTTACCGGA AATCGAATAT CCTACTGATA ATGAAGGATG GGCCCATTGC 540
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GCGTTGGAAA AATTACGTCA CAATAAAGAG ACAGCATTtA GGCtACGCAA ATAAATTTAC 660 

TCGAAAATTA CATCCGTATG AAGGTAAACA GTATGAATTA ATCATAGATA TTTTGGATAA 720 

TGATGCTACC GAAGTGTACT TTGAATTACG TACATCTAAG ACTTCTTCAA CATTAATTGC 780

TTATAACAAG CGTGAAAATA AAATAACATT AGATCGCAGC GACAGTGGTT TATTGCCGAC 840 

AAATGTTGAA gGTACGACGC GTAGTACGAT ATTAGACACG CCATTAA 887 



(2) INFORMATION FOR SEQ ID NO: 476: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1183 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 476:

TTGGAAnCAA AAAACCATTG GTAAACCGTG TTnAACCGGA TTTCCGATGG ACCTTTTAAA 60

ACnACCAAAT A6AAAGCTTT GATAAAAGGT AATTATGGTA CTGATAACAA ACAAGTTCAA 120

AAACATCATG ATTTAGTACG TATGCTTTTG ATGGATCAAG ATGC3TTTTTT AACTGAAAAT 180

AATAAAGTTG ATCATTTCAT TGATGGAAAT GATTTATATG ATCAAGTTTT AAAAGATATT 240

AAAAATGCAA AAGAtATATC CATTTAGAGT ACwATACTTT CGCTTwAGAT GGTTwAGGTA 300

AAAGAATTTT ACATGCTTTA GAAGAAAAAT TGAAACAAGG TCTAGAAGTA AAAATATTAT 360

ATGATGATGT TGGATCTAAA AATGTTAAGA TGGCAAATTT TGATCATTTT AAATCGTTAG 420

GTGGAGAAGT TGAAGCATTT TTTGCTTCAA AATTACCGTT ATTGAATTTC AGAATGAATA 480

ATAGAAATCA TAGAAAAATC ATCGTAATC6 ATGGTCAACT AGGTTATGTC GGAGGATTTA 540

ACATTGGTGA TGAATATCTm GGATTAGGAA AATTAGGATA TTGGAGAGAT ACGCATTTAC 600

GTATACAAGG GGATGCGGTT GATGCACTGC AGTTGCGATT TATTTTAGAC TGGAATTCGC 660

AAGCGCACCG TCCACAATTT GAATATGATG TTAAGTATTT CCCTAAAAAG AACGGACCAT 720

TGGGCAATTC ACCAATTCAA ATAGCT6CAA GTGGCCCGGC TAGTGACTGG CATCAAATTG 780

AATACGGTTA TACAAAAATG ATTATGAGTG CAAAGAAATC TGTATATTTA CAATCACCAT 840

ATTTCATTCC GGATAATTCA TATATaAATG CCATTAAAAT TGCTGCTAAA TCAGGTGTAG 900

ATGTACATTT AATGATTCCA TGTAAGCCAG ATCATCCATT AGTATATTGG GCGACATTTT 960

CAAATGCCTC TGACTTATTA TCAAGTGGTG TTAAAATTTA TACGTATGAA AATGGATTTA 1020

TACATTCTAA AATGTGCTTA ATTGATGATG AAATCGTATC AGTGGGCACA GCAAATATGG 1080
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CTAAAGATTT AAGGGTGGCT TATGAACATG ATATTACAAA ATC 11S3 
(2) INFORMATION FOR SEQ ID NO: 477: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2332 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 477:
GGTTATATGT TTGGTATTAC TCATTATATT GATTGGAGGT TGTGTCATTA TGACAAAAAC 60 

AAATGGTCGA AACGCTCAAA TTAAAGAAAA TTTCAACAAA ACATTAAGTG TATATCTAAC 120 
CAAGAATCTC GATGATTTTT ACGATAAGGA AGGTTTTCGA GATCAAGAAT TTGATAAAAG 180 
AGATAAAGGG ACTTGGATTA TTTATTCTGA AATGGTTATC GAACCAAAAG GGAAnAATAT 240 

GGAATCGAGA GGAATGGTGC TCTATATCAa TCGCAATACT AGAACmACGA AGGGTAATTT 300 
TATTGTCACC GAAATAACTG AAGATAGTAA AGGATATTCA CGTAGTAAAG AAAAAAAATA 360 

TCCTGTCAAG ATGGAAAATA ATCGAATTAT TCCAACAAAG CCTATACCGG ATGACAAGTT 420
AAAAAAAGAG ATTGAAAACT TTAAGTTCTT TGTACAATAT GGAAATTTTA AAGATTTTAA 480 
AGATTATAAA AATGGTGATA TTTCATATAA TCCTAATGTG CCAAGTTATT CTGCAAAGTA 540 

TCAATTGAAT AATGATGACT ATAATGTTCA ACAGTTAAGA AAACGATATC ATATTCCAAC 600
CAAACAAGCG CCCGAATTAA AATTGAAAGG ATCCGGCAAT TTAAAAGGCT CATCCGTAGG 660 
ATCTAAGGAT CTAGAATTTA CGTTTGTAGA AAATCAAOAA GAGAATATCT ATTTTTCAGA 720 
TTCGGTCGAA TTTACACCTA GCGAGGATCSA TAAATCATGA GTCAAACGGA ATATCAAATA 780 
AAATCTGGCA ATATAAAAGG TAACTCTGAA GAAACAAGTA CAGTATCTAA TATAAGTTAT 840 
GAAATAGAAA ACGCAAATAA CAGTGGTTTA AAACAAAATA AAATTGATAA ACAAATTAAA 900 

AAGTTACAAG AAAAAAATAA ATTCCCTAAA AATCTTTCAT ATCTTAAAAG TTATACGGAC 960 
CCCAAAACAG GCACGACTAC AAGCGCCTTT TTAAATAAAG ACACTGGCAA AGTTACTTTA 1020 
GGTATGACAG GTACTAATGT ACACAAAGAC GCAATATTAA AACAAACATT TGGTGTTCCT 1080 

TCTTATCAAG GATATATAGA CGTGAGTGAA ACgCTaAAAG ATATTGGGGC CQATGTCAAT 1140 
ATTGGCCTTC ATTCCGTCAC AGATAAAGAT CCACATTATA AAAATACCCA AGACTTTATC 1200 
AAAAATATCA AAAAAGACTA TGATATTGAT ATTATTACCG GACATTCGCT GGGCGGTAGA 1260
GATGCGATGA TTTTAGGTAT GAGTAATGAT ATTAAACATA TCGTTGTGTA TAATCCAGCT 1320 
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IS 



ATTGAAAA6T ACGATGGTCA CATTGTAAGA TTTGTGTCTG ATGAAGACGA ATTAGATGCA 1440

GGTGTCCGCA ATCATTTATA TGAAACTGCT GGAGAAAAAA TAGTACTTAA AAA7GGAGAA 1500

GGCCATGCAA TGAGTGGTAT TTTAATGAGC AGAACACAGG CTATAATCTT AGCTGAATTA 1560

AACAAAGTTA AAGGCTACCA AGACGAAAAT AATAAAGCAT TAAAATCCGT TCGTAAACAA 1620

ACQAGGCATA GATTACATAA AGTAGAGACG TTAAGAGCGA ATTGGATTCA AACAACGGGT 1680

GGATCACTCT CTTCCTCCCw ACAACAATTA TTAGAAGCTT TAACAGCACT AACCATTGCC 1740

GAAGGCTTAA ATCAATTAGT GAATGAAGAA AGCCAACATT TGAAAAAATG TATCACGCGA 1800

TGGCACATAA ATTTGGAGAC AACTGGAAAA AAGCGCAAGA AGTTGGAAAT GAAATTGGTG 1860

AAAAATTAAC CTCTGAAGAG GTTATAGATG rATTAAGAAA AGGTGGCGCX3 TATGAAAGTa 1920

AACTTGAAAC AGATCCCAAA AGAAAAATTG ATGATAAGAT AAAGAAATTA AATGATGTTT 1980

ATAAAAATTG TAATGGCTAT ATCGCAAAAA TTAAACAGAG TATCGAAGCA ATTGTTTCTA 2040

ATGACCAAAT GTTAGCGAGC CAGATTGATO GGATGATGTA ATGTTTACTA CGTATAAnAA 2100

TATTAATGAA CTTGAAAATG CCTATGATGA AGAAAGAAAA CAATTGAATG ATGCATTCAA 2160

TCAAATTGAT GAATTAAGAC ATCAAACACG CAAGAnATGT GAACAAATGT ATGATCATTT 2220

CTTATATCTC AAACATAAAA TGAATTrayms TGAAGACGCT ATGATCAGGA TGACACGTAT 2280

TATAGAATCT TTCGATAGAG AAACGAATCA ACGTATCCGA CATCACGAAA TG 2332



(2) INFORMATION FOR SEQ ID NO: 478:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 865 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 478: 

TTTACTACCC AGTATCTCTT TTTATAAATT ATATAGCCAC CACATATGGT GGAAAGTCTT 60 

TTTAATTAGA ATTTTGTTTT TTCAGTTAAG AAAGCTTCTA ACTCTGAGAT TGGCATACGA 120 

ACTTGTTCCA TTGAATCTCT GTCACGTACT GTAACTTGAT TATCTTCTAA TGAATCAAAG 180 

TCGAATGTtA CACAATAAGG TGTaCCGATT TCATCTTGAC GACGGTATCT TTTACCGATA 240 

GATTGTGATT CATCGAAATC GATTGAGAAT TTAGAACTTA ATTGCTCAAA AATCTTAATC 300

GtTCGCCAGA TAATTTCTTA CTTAAAGGTA AAATCGCTGC TTTATATGGT GCTAATGCAG 360 

GATGGAAGTG TAAAACTGTA CGTGCATCTT TACTACCTTC AACGCCTTCT TCATCATATG 420 
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GAATATATTT TTCGTTCGTT TCTGGATCAT GGTATCTGAA ATCTTCACCA GAGTGTTCAG 540 
CATGTTTACG TAAGTCGAAG TCTGTACGAC TTGCGATACC CCATAACTCA CCCCAACCAA 600 
ATGGGAATTT ATATTCAATA TCAGTTGTTG CATTTGAGTA ATGAGATAAT TCATCTTCAT 660

CATGATCACG TAAACGCATA TTTTCACTGC TCATATTTAA GCTTGTTAAC CAGTCACTTG 720 
CAAAAGTTTT CCAATAATTT TGCCATTCGA TTTCTTCTCC AGGTTTACAG AAGAATTCAA 780

GTTCCATTTG TTcAAATTCT CTTGTTCTGa AAATGaAGTT ACCTGGAGTG aTTTCaTTAC 840

GGaATGaTTT ACCAATTTGG ACCGG 865 
(2) INFORMATION FOR SEQ ID NO: 479: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1444 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 479:

TAATGAGTAA ATAAGTACCA GATAAAATCA TGAATATCAT CCAACATGCG GrTAACTCTA 60

CTAAATAATT AATAATAGTA TTTTCAGTAA ACAAAGAACT ATGTATACTT CGCATCACAT 120

TAGAATACGT ATGTTTCGCA TTTTGATCTG CAACAAATTG ATTGTTATGA TCTAGAAAGA 180

CGTAACGTTG ATTTCCTGCC ATATCACTCA GTGTAATTCG TTTGTTATAT GGTTCATCAA 240

GTATGCTAAC TTTACTTACA AAGAATCCTT CATATTGTTG TTCAACTTGA TGTACTGCAT 300

CATTTAATGT TTGATGCGTT TTTACATCAC TGTCACCAAA AAACTCATTC TTATAAATAT 360

TATTTTCAAC TTCTGGAAAG AACAAGTAAC CAATGCCCGA AATGGTTAAA GTGATTAACA 420

GTGGAGCAAT AAATATTGCT GCATAGAAAT GTAATCTTTG TAATGGATTA AATGTATTTT 480

TCATATTTCC CTCCCAATTG GCTATTATAC GGTGTCAATT CTGTGATGTG TGTGAACAAA 540

CTGTGACAAT ATTTATTTTC TAGAAAAATT TAACGATGAT TTGTGATTTT TAGAAAAATG 600

AACTTTTAAG TTGGAATGTT TGAAGAAAAT TGATTATTCG TATGTTTTAT CAAGCAGCTA 660

TGATAAAATT TAAACATAAT ACAATGCGAG CCATTTAACG ATCTATGTTT AAATGGACAT 720

CGATATTGTA TGAATTCGTT GTAACAAGCA AGCATTTCTA TGTGAACGAA CCAAAGGGGA 780

AAGTAACATG ATTAATAAAG AACAATTAGA TCTTTTATAT AAATTAAAAA AAGAAGTTGA 840

AAAGTCGCGA AATGAAGCAC TTTTACATAC AATTAACCAA GTAATTAAGA AAGTATATTT 900

GCAGCAATAT ACATGTTCGT TCGTTGGACA TTTTTCTGCA GGTAAATCGA CACTGATAAA 960
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TATTGTGTCA GTTTCAGACA ATCACX3ATAT TATTGCTAAT TTGCCGAATC AAACGTATGC 1080 

CAAATTATCT AATTATGATG AAGTAAGGGA AATGAATCGC CAAAATGTCG ACGTTGAATC 1140 

TGTAGAAATT AATTTTCAAT CAGCTAAATT TGAAAATGGG TTTACGTTGC AAGATACACC 1200

AGGTGTTGAT TCAAATGTTG CATCACATCA GTCAATAACA GAACAATATA TGTATACAAG 1260 

TAATATGATA TTTTATACGG TTGACTATAA CCACGTTCAA TCTQAACTTA ACTTTAAGTT 1320 

TATGAAGCAT ATAAATGATG TTGGaATACC TGTTGTGTTT ATCATTAATC AAATTQACAG 1380 

CATCCAAGAC GATGGAATTG TCATTCTCTA CGTnTTAAAT CTCGAGTTGG AAAAATCAAT 1440 

TGGC 1444 



(2) INFORMATION FOR SEQ ID NO: 480: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6309 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 480:

GCAGCAGCTT ATCGTGAGTT ATCATTATTA TTACGTAGAC CTCCAGGTCQ TGAAkCATAC 60

CCAGGTGACG TATTCTACTT ACATAGTAGA tTATTAGAAA GAGCAGCAAA ATTAAACGAT 120

GACTTAGGTG GCGGTTCAAT TACTGCATTA CCAATTATCG AAACACAAGC TGGTGATATT 180

TCAGCTTATG TACCAACAT^ CGTTATTTCA ATTACAGATG GACAAATCTT CTTACAATCT 240

GATTTATTCT TCTCAGGTGT AAGACCAGCG ATTAATGCCG GACAATCTGT ATCTCGTGTT 300

GGTGGATCTG CACAAATTAA AGCAATGAAG AAAGTT6CTG GTACGTTACG TCTTGACTTA 360

GCGTCATACA GAGAACTTGA ATCATTTGCA CAATTCGGTT CAGACCTTGA TGAATTTACT 420

GCAAGTAAAT TAGAACGTGG TAAACGTACT GTTGAA6TCT TAAAACAAGA TCAAAACAAA 480

CCATTACCAG TCGAACACCA AGTGTTGATT ATTTATGCAT TAACAAAAGG ATATTTAGAT 540

GATATTCCAG TTGTAGATAT CACACGTTTT GAAGACGAGT TAAACCACTG GGCAGAATCA 600

AATGCTACTG AACTGTTAAA TGAAATCAGA GAAACTGGTG GCTTACCAGA TGCTGAGAAG 660

TTTGACACAG CAATTAACGA ATTCAAAAAA AGCTTTAGCA AATCTGAATA ATAAACAAGT 720

TTAGTATAAG GTGGTGAGAT AGTGGCTTCT CTTAAAGAAA TAGATACTCG AATAAAATCA 780

ACCAAAAAAA TGAAGCAGAT TACGAAAGCG ATGAACATGG TATCAAGTTC AAAACTTCGT 840

AGAGCTGAAA AAAATACAAA ACAATTCACA CCATATATGG ATAAAATGCA AGATGCAATT 900
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ACTAGAAGTG GATATTTAGT TATCACGAGT GATAAAGGTT TAGCAGGTGC ATATAGTGCA 1020 

AACGTGCTTA AAAAATTGAT TACTGATATT GAAGCGAAAC ATCAAGATAG TAGCGAATAC 1080

AGTATTGTAG TTTTAGGGCA ACAAGGTGTT GATTTCCTAA AAAATAGAGG TTATGACATT 1140

GAGTATTCTC AAGTAGACGT ACCTGATCAA CCTTCTTTCA AATCTGTTCA AGCACTAGCT 1200 

AACCATGCTA TAGACTTATA CAGTGAAGAA GAAATTGATG AATTAAATAT ATACTATAGT 1260 

CATTATGTCA GCGTTCTTGA AAACAAGCCT ACATCTAGAC AAGTATTACC ATTATCTCAA 1320 

GAGGATTCTA GTAAGGGGCA TGGTCATTTG TCTTCTTATG AATTTGAGCC AGATAAAGAA 1380 

TCTATCTTAA GTGTAATCTT GCCTCAATAT GTTGAGAGTT TGATTTACGG AACAATATTA 1440 

GACGCAAAAG CAAGTGAGCA TGCAACACX5T ATGACTGCGA TGAAAAATGC CACTGATAAT 1500 

GCAACTGAAC TTATTGATGA CTTATCATTA GAATATAACA GAGCGAGACA AGCAGAAATT 1560 

ACGCAACAAA TTACTGAAAT TGTTGGTGGT TCCGCAGCGC TTGAATAATA TTTAAAGGAG 1620 

GAAAATAGCA TGGGAATTGG CCGTGTAACT CAAGTTATGG GTCCTGTAAT TGATGTTCGA 1680 

TTTGAACATA ACGAAgTTCC TAAAATTAAT AACGCCTTGG TTATTGATGT GCCTAAAGAA 1740 

GAAGGTACAA TACAACTAAC ATTAGAAGTT GCGCTGCAAT TAGGTGACGA CGTT6TTCGT 1800

ACAATTGCGA TGGATTCAAC TGATGGTGTC CAAAGAGGCA TGGATGTAAA AGATACAGGC 1860 

AAAGAAATTA GTGTACCTGT TGGTGACGAA ACATTAGGTC GTGTATTTAA TGTACTAGGT 1920 

GAAACAATTG ACCTTAAAGA AGAAATTAGT GATTCTGTTC GCCGCGATCC TATCCATCX»T 1980

CAAGCACCAG CATTCGATGA ACTTTCAACA GAAGTTCAAA TTTTAGAAAC AGGTATTAAA 2040 

GTAGTAGATT TACTAGCACC TTATATTAAA GGTGGTAAAA TCGGATTGTT CGGTGGTGCC 2100 

GGTGTAGGTA AAACAGTATT AATCCAAGAA TTAATTAACA ACATCGCTCA AGAGCACGGT 2160 

GGTATTTCTG TATTCGCCGG TGTAGGTGAA CGTACTCGTG AAGGTAACX3A TTTATACTTC 2220 

GAAATGAGTG ACAGTGGTGT AATTAAGAAA ACAGCCATGG TATTCX^GGCA AATGAATGAG 2280 

CCACCTGGTG CACGTATGCG TGTTGCATTA TCTGGTTTAA CAATGGCTGA ATATTTCCGT 2340 

GACGAACAAG GTCAAGACGT ATTATTATTC ATCGATAACA TTTTCAGATT TACACAAGCT 2400 

GGTTCTGAGG TATCTGCATT ATTAGGTCGT ATGCCTTCTG CAGTAGGTTA CCAACCAACA 2460 

CTTGCTACTG AAATGGGACA ATTACAAGAA CGTATTACGT CTACAACAAA AGGATCAGTT 2520 

ACTTCTATTC AAGCGGTATT CGTACCTGCC GATGACTATA CTGACCCAGC GCCTGCGACA 2580

GCGTTTGCCC ATTTAGATGC AACTACAAAC TTAGAACGTA AATTAACTGA AATGGGTATT 2640

TATCCAGCCG TGGATCCATT AGCGTCTACA TCAAGAGCAT TGGAACCATC AATTGTAGGT 2700 
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CAAGATATCA TTGCTATCTT AGGTATGGAC GAATTATCTG ATGAAGATAA ACAAACAGTT 2820 

GAACGCGCAC GTAGAATTCA ATTCTTCTTA TCTCAAAACT TCCACGTAGC GGAACAATTT 2880 

ACTGGTCAAA AAGGTTCTTA TGTACCTGTT AAGACAACAG TTGCAAACTT TAAAGATATC 2940 

TTAGATGGTA AATATGACCA TATTCCAGAA GATGCATTCC GTTTAGTTGG TAGCATGGAT 3000 

GATGTTATTG CAAAAGCTAA AGATATGGGT GTTGAAGTAT AACAATTAGG AGGAATGGAT 3060 

AATGAATACA TTAAACCTAG ATATTGTCAC TCCTAATGGT TCTGTTTACA ATCGTGATAA 3120 

TGTTGAACTC GTTGTTATGC AAACAACAGC TGGTGAGATA GGTGTCATGA GTGGACATAT 3180 

TCCAACTGTA GCTGCTTTAA AAACAGGCTT TGTAAAAGTG AAATTTCACG ATGGAACTGA 3240 

ATATATTGCT GTAAGCGATG GCTTTGTTGA AGTTAGAAAA GATAAAGTTT CAATCATTGT 3300 

TCAGACTGCA GAAACTGCAA GAQAAATTGA TGTTGAAAGA GCTAAATTAG CCAAAGCAAG 3360 

AGCAGAGTCT CACTTGGAAA ATGATGACGA CAATACTGAT ATTCATAGAG CCGAAAGAGC 3420 

TTTAGAGAGA GCAAATAACC GTTTGCGTGT GGCTGAATTA AAATAGTAAA TAAAGGGTCG 3480 

AAGATGTGAT TTCATATCTT CGACCCTTTT TTGAATTATA TTGATTTAAA GATACAAAAC 3540 

ATGAGAGGGG GGAAGGAATT GATAAAGAAC CATTAAAGAT TTATGATGTA GTGGTTCTTT 3600

ATCATTAAAC ACAGCTAATG TGTATTTAAA AATAGGaayA CATgAGTAAA ACTCATGTAT 3660 

AAGAAATACT AATTTCTAAA GAAAAAGTAT TTCTTTATGT TGGGGCCCCG TCAACTACTG 3720 

CCAAATACAA CACTATAGAG TCTAGACATT GATTTATGTC CGACTCCCAA GAATAGTTTT 3780 

ACTTTTTTAC AATCACTAAT AGATTGCTAA AATCAAAATT TCCTTCACCA CTATCTACAG 3840 

TCGACATTTC ATTTTTTGAA ATTATCTACA TTTTTTCATA CCAAGATATT TTATAGTTAT 3900 

GATATTTATG TAAAAAGAAT TATATAGTAA GTTAGCTTAA ACTTTACTAA AAACGGGTAT 3960 

TAAACTTTGT ATCATTATTT AAATTTTTCA TGTACAATGT AATACAGTAA TCTTATGAGG 4020 

TGATAAAATG GATTATATOG QACAATATGC AGTTATCCAT TTAGTGTTAC ATGTTGTATG 4080 

TATTTGTAIT GCCTATTGGG CTTTACAATC AATTAGATTA GATCAATTTT TTAAAAAAGG 4140 

ATACGCCACT CAATTACAAG TGTGTATGAT ATTTGTTGCT ATTTTATTAG GCACTGCAGT 4200 

AAGCAATTTT ATTGTAGATT TGTTACAATA CTCGACGCAG GTAAAATATT TAATAAAATA 4260 

AGTCTAACTC TATGATTTGT AATCAAAACT AGATATAATT AAATAATGAC TTAAAATAAT 4320 

TTTAAAATAG GGAAATGTAA AGTAATAGGA GTTCTAAGTG GAGGATTTAC GATGGATAAA 4380 

ATAGTAATCA AAGGTGGAAA TAAATTAACG GGTGAAGTTA AAGTAGAAGG TGCTAAAAAT 4440

GCAGTATTAC CAATATTGAC AGCATCTTTA TTAGCTTCT6 ATAAACCX3AG CAAATTAGTT 4500 
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GACGrTACAT ACAAAAAGGA CGAAAATGCT GTTGTCGTTG ATGCAACAAA GACTCTAAAT 4620 

GAAGAGGCAC CATATGAATA TGTTAGTAAA ATGCGTGCAA GTATTTTAGT TATGGGmCCT 4680 

CTTTTAGCAA GACTAGGACA TGCTATTGTT GCATTGCCTG GTGGTTGTGC AATTGGAAGT 4740

AGACCGATTG AGCAACACAT TAAAGGTTTT GAAGCTTTAG GCGCAGAAAT TCATCTTGAA 4800 

AATGGTAATA TTTATGCTAA TGCTAAAGAT GGATTAAAAG GTACATCAAT TCATTTAGAT 4860

TTTCCAAGTG TAGGAGCAAC ACAAAATATT ATTATGGCAG CATCATTAGC TAAGGGTAAG 4920 

ACTTTAATTG AAAATGCAGC TAAAGAACCT GAAATTGTCG ATTTAGCAAA CTACATTAAT 4980 

GAAATGGGTG GTAGAATTAC TGGTGCTGGT ACAGACACAA TTACAATCAA TGGTGTAGAA 5040 

TCATTACATG GTGTAGAACA TGCTATCATT CCAGATAGAA TTGAAGCAGG CACATTACTA 5100 

ATCX3CTGGTG CTATAACGCG TGGTGATATT TTTGTAC»TG GTGCAATCAA AGAACATATG 5160 

GCGAGTTTAG TCTATAAACT AGAAGAAAT6 GGCGTTGAAT TGGACTATCA AGAAGATGGT 5220

ATTCGTGTAC GTGCTGAAGG GGAATTACAA CCTGTAGACA TCAAAACTCT ACCACATCCT 5280 

GGATTCCCGA CTGATATGCA ATCACAAATG ATGGCATTGT TATTAACGGC AAATGGTCAT 5340 

AAAGTCGTAA CCGAAACTOT TTTTGAAAAC CGTTTTATGC ATGTTGCAGA GTTCAAACGT 5400

ATGAATGCTA ATATCAATGT AGAAGGTCGT aGTGCTAAAC TTGAAGGTAA AAGTCAATTG 5460 

CAAGGTGCAC AAGTTAAAGC GACTGATTTA AGaGCAGCAG CCGCCTTAAT TTTAGCTGGA 5520 

TTAGTTGCTG ATGGTAAnAC AAGCGTTACT GAATTAACGC ACCTAGATAG AGGCTATGTT 5580 

GACTTACACG GTAAATTGAA GCAATTAGGT GCAGACATTG AACGTATTAA CGATTAATTC 5640 

AGTAAATTAA TATAATGGAG GATTTCAACC ATGGAAACAA TTTTTGATTA TAACCAAATT 5700 

AAACAAATTA TACCTCACAG ACAGCCATTT TTATTAATTG ATAAAGTAGT TGAATATGAA 5760 

GAAGGTCAAC GTTGTGTGGC TATTAAACAA GTATCAGGAA ACGAACCATT CTTTCAAGGG 5820 

CATTTTCCTG AGTATGcGGT AATGCCAGGC GTATTAATTA CTGAAGCGTT AcTCAAACAG 5880 

GTGCGGTAGC TATTTTAAAT AGTGAAGAAA ATAAAGGTAA AATCGCTTTA TTTGCTGGTA 5940 

TTGATAAATG TCGTTTTAAA CGTCAAGTAG TACCTGGTGA TACTTTAACG TTGGAAGTAG 6000 

AAATCACTAA AATTAAAGGA CCAATAGGTA AAGGTAATGC TAAAGCTACT GTCGATGGTC 6060

AACTTGCTTG TAGTTGTGAA CTTACATTTG CAATTCAAGA TGTAAAATAA AACAAAAAAA 6120 

ACATTCAAAG ATTTAATGTG TTGGCATAAT CTTTGAATGT TTTTTATTTT ACTCTTCTAA 6180 

TTTTTCATCC TTTAACTTTG GTTTAGACTG CaTCATTCGA TTAAATGATT TTTTTAATTC 6240

TTCACCAGAT AATCCATCAT CAATAAGTTG GTTCTAATAA ACTTTCAGCA TACTGTTGGA 6300 
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(2) INFORMATION FOR SEQ ID NO: 481: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1717 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 481:





AGTTGCTACA CCAGACATGA TGGGTGAAGT TGGTAAATTA GGTCGTGTAT TAGGACCAAA 60

AGGTTTAATG CCAAACCCTA AAACTGGAAC TGTAACAATG GATGTTAAAA AAGCTGTTGA 120

AGAAATCAAA GCTGGTAAAG TAGAATATCG TGCTGAAAAA GCTGGTATCG TACATGCATC 180

AATTGGTAAA GTTTCATTTA CTGATGAACA ATTAATtGAA AACyTCaATA CyTTACAAGA 240

TGTATTAGCT AAAGCTAAAC CATCATCTGC TAAAGGTACA TACTTCAAAT CTGTTGCTGT 300

AACTACAACA ATGGGTCCTG GAGTTAAAAT TGATACTGCA AGTTTCAAAT AATAAATGAT 360

ATAAACAATT ACAGGCTGAA AGAAATATCT TTCAGTCTGT AAAAATATAT TGACAATAAG 420

TAATTTCCAA GTTATATTAC TTATTGTGAT TATTTTACCT AAGACAGTAG GAGTTATTTA 480

TAACTTAAAA TTTATCCTGC CGAGGCTAAA ATTGACTTGA ACGTGATGAT CTATGATCTT 540

TCAAGCACTT TTTGCOGTGG GTAGAAAGTG CTTTTTTTAT TAATTTTAAA AAAAGCACCA 600

AAAATTTAAA TGGAGGTGTC TGAATGTCTG CTATCATTGA AGCTAAAAAA CAACTAGTTG 660

ATGAAATTGC TGAGGTACTA TCAAATTCAG TTTCAACAGT AATCGTTGAC TACCGTGGAT 720

TAACAGTAGC TGAAGTTACT GACTTACGTT CACAATTACG TGAAGCTGGT GTTGAGTATA 780

AAGTATACAA AAACACTATG GTACGTCGTG CAGCTGAAAA AGCTGGTATC GAAGGCTTAG 840

ATGAATTCTT AACAGGTCCT ACTGCTATTG CAACTTCAAG TGAAGATGCT GTAGCTGCAG 900

CGAAAGTAAT TTCTGGATTT GCTAAAGATC ATGAAGCATT AGAAATTAAA TCAGGCGTTA 960

TGGAAGGCAA TGTTATTACA GCAGAAGAAG TTAAAACTGT TGGTTCATTA CCTTCACACG 1020

ATGGTCTTGT ATCTATGCTT TTATCAGTAT TACAAGCTCC TGTACGCAAC TTCGCTTATG 1080

CGGTTAAAGC TATTGGAGAA CAAAAAGAAG AAAACGCTGA ATAATTTTTA GCGTAAAAAA 1140

ATTAAAAATA ATOGAGGAAT TATAAAATGG CTAATCATGA ACAAATCATT GAAGCGATTA 1200

AAGAAATGTC AGTATTAGAA TTAAACGACT TAGTAAAAGC AATTGAAGAA GAAriTGGTG 1260

TACTGCAGCT GCTCCAGTAG CAGTAGCAGG TGCAGCTGGT GGCGCTGACG CTGCAGCAGA 1320

AAAAACTGAA TTTGACGTTG AGTTAACTTC AGCTGGTTCA TCTAAAATCA AAGTTGTTAA 1380
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TCCTAAAGTA ATCAAAGAAG CTTTACCTAA AGAAGAAGCT GAAAAACTTA AAGAACAATT 1500 

AGAAGAAGTT GGAGCTACTG TAGAATTAAA ATAATTCAAG TATCTTAAAC TTAATAATCA 1560 

AAGTTTTATA GCAAGTATTG CTATAATATA ATGATTCTTT GAGAAGTTAA AACCCCGTTA 1620 

TTTTGATAAC GGGgTtTTAT TCaTTTAAAG ACTGAGTGAA ATGTTATAAT TATAATGACG 1680 

AGTTACAAAG TGAAGATGAG GTGGGAATAA TGAGTCA 1717 



(2) INFORMATION FOR SEQ ID NO: 482: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1279 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 482: 





GTAAATCTGT TACTCGAAAT GTAACGATTA AAGAAAAGGG CTCATCTCAA ACATATATTT 60

TGTTAGGCTA TCCAACAAAA GCACAGAAGA ATAGTCATAG CAAATATAGT GGAGTCTTTA 120

TATATAAAGA CTTGAAATCA ATCGAAGATA CAAATAATGC TATTACGATT ATCACCATAA 180

TTAOGGCTGT TATTTTCTTA ACAATTACAA CAGTCTTTGC GTTTTTCTTA TCGTCAAGAA 240

TTACAAAACC TTTAAGAOGT TTAAGAGACC AAGCTACACG TGTATCTGAA GGGGATTACT 300

CTTATAAACC TTCTGTCACA ACGAAAGATG AAATTGGTCA ATTATCGCAG GCATTTAATC 360

AGATGAGTAC AGAAATCGAA GAGCATGTCG ACGCATTATC CACATCTAAA AATATTAGAG 420

ACAGCTTAAT TAACTCTATG GTAGAAGGTG TCCTAGGTAT TAATGAGAGT CGACaAATTA 480

TCTTATCTAA TAAGATGGCG AATGATATTA TGGACAATAT TGATGAAGAT GCTAAA6CTT 540

TCTTATTAAG ACAAATAGAA GATACTTTTA AATCAAAACA AACTGAAATG CGCGATTTAG 600

AAaTGAATGC ACGATTCTTT GTTGTGACCA CAAGCTATAT CGACAAGATT GAACAGGGAG 660

GTAaAAGTGG TGTTGTTGTG ACAGTTCGTG ATATGACTAA TGAGCACAAT CTAGATCAAA 720

TGAAGAAAGA TTTCATTGCT AATGTATCAC ATGAATTACG TACACCGATA TCATTACTTC 780

AAGGTTATAC TGAATCAATT GTAGATGGTA TTGTTACAGA ACCGGATGAA ATAAAAGAAT 840

CGCTTGCCAT TGTCCTTGAT GAATCGAAAC GTTTAAATCG TTTAGTTAAT GAATTGTTAA 900

AIGTCGCACG CATGGATGCT GAAGGGTTAT CCGTAAATAA AGAAGTTCAG CCTATTGCAG 960

CGTTACTAGA TAAGATGAAA ATTAAGTATC GCCAACAAGC TGATGATTTA GGTCTAAATA 1020

TGACTTTTAA TTAyTGTAAG AAGCGTGTTT GGAGTTATGA TAWGGATCGC ATGGACCAAG 1080



TTACTTGTGA TGAAAATGAA AGCGAAGATA TTTTATACAT TAAAGATACA GGTACAGGCA 1200

TTGCACCAGA ACATTTACmA CAAGTATTTg ATCGTTTTTA TAAAGTTGAT GCAGCGnAnA 1260

ACCCCGnGGT AACCAnGTA 1279




(2) INFORMATION FOR SEQ ID NO: 483:
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1144 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 483:






GAGCTGTTGT TACTTTGATG CCTGCAGCTT TATTAOQGCT GACTTGGTAA TGATAAGTTT 60

CAGCATATTG CTCAATATAT GCTATATCAT ATTGAATGGT ACGAGGTGAT ACACCAAGTT 120

GATTAGCAAT GGTATTGATT GGAATAAACG TTTGCTCATG AATTAAAAGA TACAAAATTT 180

CGATTTGTCT ATAACTTAAC AACGTAATAT CCTCCTATTT GTAATTGTAA GCGATTTCTT 240

AAAAACGTAG ATATGCAATC TCTTTCATAT TTTAATCCGA AAAATTGCAT ATCAAAATGT 300

TTATGGCGCA AGATTTTATA GGAACTTTTA AAATAAATTA r ATATTCATG TTGACAATTT 360

AAAAATGTCG CAGTATATTT AGTTAGACAT CTAACGAAAT GGTGGTGCAA TAAATGGAAT 420

TCACTTATTC GTATTTATTT AGAATGATTA GTCATGAGAT GAAACAAAAG GCTGATCAAA 480

AGTTAGAGCA ATTTGATATT ACAAATGAGC AAGGTCATAC GTTAGGTTAT CTTTATGCAC 540

ATCAACAAGA TGGACTGACA CAAAATGATa TTGCTAAAGC ATTACAACGA ACAGGTCCAA 600

CTGTCAGTAA TTTATTAAGG AACCTTGAAC GTAAAAAGCT GATCTATCGC TATGTCGATG 660

CACAAGATAC GAGAAGAAAG AATATAGGGC TGACTACCTC TGGGATTAAA CTCGTAGAAG 720

CATTCACTTC GATATTTGAT GAAATGGAAC AAACACTCGT ATCGCAGTTA TCTGAAGAAG 780

AAAATGAACA AATGAAAGCA AACTTAACTA AAATGTTATC TAGTTTACAA TAAATGATAA 840

GTGTGACTGG TAGAAATCAG TCACTTTGTC TTTAATATTA TAGTTAGATA TCTAATTGTT 900

AGTAAGCTAA TTATTGGAAA AGACAAGGAG TATTGAACAA TGAAAGACGA ACAATTATAT 960

TATTTTGAGA AATCGCCAGT ATTTAAAGCG ATGATGCATT TCTCATTGCC AATGATGATA 1020

GGGACTTTAT TAAGCGTTAT TTATGGCATA TTAAATATTT ACTTTATAGG ATTtTyAGAm 1080

GAyAGCCACA TGATTTCTGC tAatCTCTCT AACACT6CCA GTATTTGCTA TCTTAATGGG 1140

GTTA 1144
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1158 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 484: 



ACAACCAATT TTACTAAACA TGGTrrTCAT gCATTTTCAA TATTTAGATG TCGACCGCAT 60

AATCGAAGAA TCGCCGACAA TAGTACTTAT CGATGAGTTA GCACATACGA ATATTTCTAG 120

AGATCGTCAT GAGAAACGAT ATATQGATAT TGAAGAAATT TTAAATCATG GTATCGATGT 180

TCATACCACT TTGaACATTC aaCATATTGA AAGTTTAAGT AGTCAAATTG AACTGATGAC 240

CGGTGTACAT GTTAAAGAAC GTGTACCCGA CTATTTCATA ATGAGCGCCG ATGTATTAGA 300

AGTCGTAGAT ATCTCACCTG AACAATTAAT TAAACGCTTA AAAGCTGGCA AGGTATATaA 360

AAAGGATAGG CTAGATGTAG CATTTAGTAA TTTCTTTACG TATGCCCACC TAAGCGAAtG 420

CGTACATTGA CGTTAAGAAC AGTTGCCGAC TTGATGAGTG ATAAAGAAAA AGTCCGACAC 480

AACCATAAAA CGTCACTCAA ACCTCATATT GCTGTGGCAA TTAGTGGGAG CATTTATAAT 540

GAAGCAGTAA TTAAAGAGGC ATTCCATATT GCTCAAAAAG AACATGCGAa GTTCACTGCT 600

ATTTATATAG ATGTATTCGA AAAAAACAGG CAATATAAAG ATAGTCAAAA GCAAGTGCAT 660

CAACATCTCA TGCTTGCAAA ATCATTAGGA GCAAAAGTAA AAGTAGTTTA TAGCCAAACC 720

GTTGCATTAG GATTAGACGA ATGGTGTAAA AATCAAGATG TAACCAAATT AATTATCGGA 780

CAACATATTA GAAATAAGTG GCGAGACTTT TTCAATACAC CTTTAATTGA CCATTTAATG 840

TCCTTTGAAC ATAGCTATAA AATCGAAATC GTTCCAATCA AACAAATACC TGTTGAATTG 900

AAAATGAACA AATCACCCTA TCGTCCTAAA GGCAAACGTT TCGCCATAGA TATGTTAAAA 960

ATGATTTTGA TTCAAATAAT TTGTGTAATG ATGGGACTGT GGATTTATCA ACTTGATAAG 1020

CATGAGTCTA GTACGATTAT TTTAATGATT TTTCTCATCG GCATCATTTT ATTATCCATT 1080

TGGACGCGGT CCTTCATCAT TGGCTTTTaG CAGCAATTAt TAACGTATTT GTgTkTAATT 1140

ATkTTTTtAC GGAACCTA 1158


(2) INFORMATION FOR SEQ ID NO: 485: 







(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2224 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 485: 

ATCATATGGT CGATTTAACA GATCCAACGT ACTGCtAAAT AATTACATGA C6TTTAAACA 60 

TAGCATTGAT TATAACTATT TCTAAGTCTT CGCATTATTT GCGATGATGT GGGAATAGTT 120

ATTTTTATTT AAAAATATAA AAAAATAGAT GCAGCAAAAT TTTAAAGCAT TTTATTTTGA 180 

ACATATTAAA AGGGAGCGTA TCATAATGGA ATGTAATGTT TATATCGTAT GCATTACGGA 240 

TAAATAATAT ATAAATCATT CTTGAGGAGT GAAAGAATAA TGAGAGACTA CACAAAGCAA 300 

TACATTAATG GCGAATGGGT AGAAAGTAAT AGTAATGAAA CGATAGAAGT TATAAATCCA 360 

GCAACCGAAG AAGTAATCGG GAAAGTTGCT AAAGGTAATA AAGCTGATGT TGATAAAGCC 420 

GTCGAGGCGG CAGACGATGT TTATTTAGAG TTCCGTCATA CATCTGTGAA AGAAAGACAA 480 

GCGTTATTAG ATAAAATTGT AAAAGAmTAT GAAAACAGAA AAGACGATAT TGTACAAGCT 540 

ATTACGGATG AATTAGGTGC TCCTTTATCA TTATCTGAGC GTGTCCATTA TCAAATGGGA 600 

CTAAACCATT TTGTTGCAGC GAGAGACGCA TTAGATAACT ACGAATTTGA AGAACGCCGC 660 

GGAGATGATT TAGTTGTTAA AGAAGCAATC GGTGTATCTG GATTAATTAC ACCGTGGAAC 720 

TTCCCTACAA ACCAAACATC ATTAAAATTA GCAGCAGCAT TTGCGGCTGG TAQTCCAGTT 780

GTACTTAAAC CATCTGAAGA AACACCATTT GCAGCTGTTA TTTTAGCTGA GATTTTTGAT 840 

AAAGTCGGTG TTCCTAAAGG TGTATTTAAC CTTGTTAATG GTGATGGTGC TGGTGTTGGG 900 

AATCCTTTAT CTGAACATCC TAAAGTACGC ATGATGTCAT TTACAGGATC AGGCCCTACT 960

GGTTCTAAAA TTATGGAAAA AGCCGCTAAA GATTTTAAAA AGGTATCATT AGAGCTTGGT 1020 

GGCAAATCAC CATATATCGT CCTAGATGAC GTAGATATTA AAGAAGCGGC TAAAGCAACa 1080 

aCAGGCAAAG TTGTTAATAA TACTGGTCAA GTATGTACAG CTGGTACACG TGTTTTAGT6 1140

CCTAACAAAA TTAAAGATGC ATTCTTAGCT GAATTAAAAG AACAATTTAG CCAAGTGCGT 1200 

GTCGGTAATC CAAGAGAAGA TGGTACACAA GTAGGCCCTA TCATTAGTAA AAAACAATTT 1260 

GATCAAGTAC AAAATTATAT TAATAAAGGT ATTGAAGAAG GTGCTQAATT ATTTTATGGT 1320 

GGTCCTGGTA AACCAGAAGG ACTTGAAAAA GGATACTTTG CACGTCCGAC AATTTTTATT 1380 

AATGTAGATA ATCAAATGAC GATAGCACAA GAwGAAATTT TTGGGCCAGT AATGTCAGTT 1440 

ATCACTTATA ACGATTTAGA TGAAGCGATT CAAATTGCAA ATGATACAAA ATATGGTTTG 1500 

GCAGGATATG TTATTGGTAA GGACAAAGAA ACATTGCATA AAGTAGCTCG TTCTATTGAA 1560 

GCAGGTACAG TAGAAATAAA CGAAGCAGGT AGAAAGCCAG ATTTACCATT TGGTGGCTAT 1620 

AAACAATCTG GTTTAGGTCG TGAATGGGGC GATTATGGTA TTGAAGAGTT CTTAGAAGTG 1680 
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AGTGCACATG ACTAATTAAG TTTIGTGTAC TGTTTTAATT TTGCAATTTT TATAAATAGA 1800 

TTTTGTAATT AAAATAAAAA TTTGCTATAG TTATTCATGT ATTTAAAAC3G TTGGGGATTA 1860

GCATAATGGG ATTGTGCTAG CACAGTTATT TATGCATTGT CATGCCTATC TATTACTTAC 1920

TAACTAAAAA ATAATGAAAT GGGTGTAAAC TATATGCCTG AAAGAGAACG TACATCTCCT 1980 

CAGTATGAAT CATTCCACGA ATTGTACAAG AACTATACTA CCAAGGAACT CACTCAnAAA 2040 

GCTAAAACTC TTAAGTTGAC GAACTATAGT AAATTAAATn AAAAAGAACT TGTTCTAGCT 2100 

ATTATGGAAG CACAAATGGA XlAAAGATGGT AACTATTATA TGGAAGGTAT CTTAGATGAT 2160 

ATACAACCAG ATGGTTATGG TTTTTTAAGA ACAGTGAACT ATTCTAAAGG GGAAAAAGAT 2220 

ATTT 2224 

(2) INFORMATION FOR SEQ ID NO: 486: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1690 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 486:





ACATTACnTT GAATCGAAAG TTTCATAAAC GACTTGAATG CCAGTTTCTT TTTCAAATTT 60

CTTAATTAAC TCTGGATCAA TATATTCGCC CCAATTGTAT ACGTAAATTT TTTGATTTGT 120

AT6CACTTGT TCTTTAGATT TAAACCAATG ACTTAAAGTA AGACAAAGCA TACCCACAAC 180

TAATGCACCT ATAATGAGTT GTAAAAATCG TTTCATTATT TTACACCTCG CTTGATTAGT 240

TTTTTCTTAT TTATCACGTA TTGAATCAAA TAATATCCTA GTATTCCTAA TACAATAACA 300

GCAAACAATA ATGTTGAAAT CGCATTAATT TCCATACTAA TTCCTTTTCT CGCCATAGCA 360

TAAACTTCAA CTGATAACAC ACTAAAGCCA TTACCAGTAA CGAAGAAACT TACTGTGAAA 420

TCGTCTAGTG AATAAGTTAA AGCCATAAAG AATCCTCCTA TAATAGAAGG TAAAATATTA 480

GGAATAATAA TGTT6CTTAA TAATTGTGGT TCAGTC6CTC CTAAATCTCT TGCAGCATTT 540

AACATATTAT TATTCATyTC ATACAGTTGT GGTAAGACGA TAATCACAAC TATAGGTATG 600

CAAAATGCAA TATGAGATAT TAGAACTGTC CaAAAkCCTA AACCAAGACC AGTAAAATGG 660

CCAATCGTTG TAAACATAAT TAAGAATGAT GCACCTATGA CAACGTCGGA a-GATACCATC 720

AAGACATTAT TCAATGTTAG TAAAGTTACT TTAAACTTTT TATTTCTTAA ATAATAAATA 780

GCAATGGCAC CAAATGTACC AATAACTGTA GAAATTGAGG CTGCTAAAAG TGCTACAGCT 840
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10 



AATGTAAAAT GTTCAAAGTG AATCATATTA CCAGCCGAAT TGAATGAATA GAACATTAAA 960 

AAGAATATTG G6ATGTATAA AATCGCTAAA AGTATCCCGA TATACAGCTT TCCATACCAT 1020 

TTCATATGAT TCACCCTCTC CCATTAGATG ATTTTGTAAT GATTAAAATG AATGCCATAA 1080 

ATACAATTAA GAATATAGCT ATAGTTGATC CCATACCATA ATTTTGAATT GTTAAAAATT 1140 

GTTCCTCTAT TGCCGTACCT ATATTTATGA CTTTATTACC TGCAATTAAT CTTGTAATCA 1200 

TAAATAATGA AAGTGATGGA ATAAAGGTTA CTTGAATCXrC AGTCATAACA CCTTCTTTTG 1260 

TTAACGGCAT GATTACTTTT CTAAAAGTAT AGAAAGGACT GGCACCTAAA TCACTTGAGG 1320 

CCTGCAATAA ATTATTAGGA ATTGCTTTCA TGCTATTAAA TATAGGTAAA ATCATAAATG 1380 

GTATATAAAT GTAACTTGCC ACTACTAAAA ACGCACCAGT TGTAAATAAC AAATTGAATG 1440 

ATGGTAAATT AAATAAGTGG AAAATTGATT AATCACGCCA TCATGACTTA ATAAACCTAT 1500 

AAAAGCATAT GTCTTTAACA ATAAATTTAT CCATGTTGGA ATAATCATTA TCATTAATAA 1560 

GATATTTTGA AATTTCGAAC GAGTAATATA ATAGGCAGllT GGATAACTGA TAGTCAAGGT 1620 

AATAATTGTT ATTGAAGCGG CATATAAAAT TGAATATGCA AACATTTTCA AATATTTTGT 1680 

AGTAAAAATT 1690 
(2) INFORMATION FOR SEQ ID NO: 487: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2112 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 487:

ACGAAAAGAA ATATTATGAT GAACAAAAAG AAAGAATAAC GATTTATATG AAGTACAATG 60 

TGAAAGGTTA TAAAAATATA AGCTTCGCTA ATTTTAAAGA AAACCCAATG GATG6TTATT 120 

CTATTAGTGG TTATATAAAT AAtGaTAAAA AGTTATCATT TACAGCTGGT ATAAGATCTG 180 

TTGATGATTT TCAATTTGAT ACCGATATTT CTTATACAGA TGAATTGGGT AGAAAATTTA 240 

ATAAAAATCC TAAGTCAGTT TCTGAAATAA AAAAAGAGCA AAATACGTCC AATAAATAAT 300 

TGTTCATATT GTGATGAAAC AAAAATATAA GTCATTAGAT GAGTTTAACT ATGTTATAAA 360 

TATTTGTAGT ATCTATAAAA ATCTCGACAC TATTAAAATG ATAAAGTGCC GAGGTTTTCT 420 

TACTTATTTA GTTAATTCAA AGTTTATGCC AGATTCATAA GAATTTGTGA CACTTTTAAT 480

AGTGTAcCAT TGATTATTAC AATTTATCAA ATGGTCCTTT AGAAGGTATA AATAACAAAA 540 
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5 



TATTATTATG TTCAAAACTT TACGCTCCAA AAAGTAAAAA GGAAGTTAAG CAATGTTTAG 660

TTGCTTAaCT TCGGaTATTG AACGCATCAG TCCAATTTGA CATAGAGCCT TTTTTAGTTC 720

TTGATGTTTC TCTTTAAAAC CTTGCATATT TTACAAAAAG AAAAATTAGC AGTATAATTA 780

AGACAACGAA AATAAGTATT TACTTATACA CCAATCCCCT CACTATTTGC GGTAGTGAGG 840

GGATTTTTAT TGOTGOGGCT ATATGTCACC TATTTTGTAT TGCGTCTACT TAGCCAATAA 900

GAAAAAAACG CAATGGCACA GCCACTGATG ACTGGTGCTA TGATGTQAAC GaAAATAAGC 960

ATCACCTTAT ACACCTCCTC TCTGCGTCTA AATTGACX5SC TGAGaGrTAG GcGACTCTAC 1020

TATTATATCA TCGGCAAATA TACAAGCACA GTCACTTGCT TCTGATAAGT TATATGATTC 1080

TAGCTGATAG ATTGAATCGT CTACACTTAA TTGGACAAAT TCTATGAGAA TAGATATTGT 1140

TAATTTAAGA AAGTAGGCGA TTTTATTATG ACAAGAGAAA GAAGATCATT TAGTTCAGAG 1200

TTTAAGTTAC AAATGGTTAG ATTATATAAA AATGGTAAGC CTAGGAATGA AATTATACGC 1260

GAGTATGATT TCACACCTTC GACGTTTGTA AATGGCGGTT ATAAAATGTA GGAAAATGGA 1320

TAAAGCAACA TCAAAACACG GGTACATTCA ATCACCAAGA TAACTTATCG GATGAAGAAA 1380

AAGAGCTGAT TAAATTACGC AAAGAAGTTC AACATTTAAA AATGGAGAAC GATATTTTAA 1440

AGCAAGTAGC GCTGATTATG GGGCAAAAAT AGAAGTCATT CAAAAGAATG CACATCAATA 1500

TTCAGTATCA GCAATGTGTA AAGTCCTGAT AATACTAAGA AGTACCTATT ATGATTCTAT 1560

AAAAAGAAAA GATAATAAAA TCACTAAAGA TGATTCAAAC ATAGAACATG CCGTCATAAA 1620

TATTTTTAAT TCTAATAGAA AAGTCTTTGG TACAAGACGA ATTAAAAATC ATTTAAATGA 1680

CAAGGGTCTC ACTGTATCTG GACAAAAGAT AGGTCGATCA TGAAAAAATC TAGTTTCTGT 1740

TTATACGAAA TCTAAATACA AAAATCATCT AAAAGAAACT AATGAAAAAC GAATTAAAAA 1800

TCTTTATTAT TAGCTGCTGG TGTATTATTT GTTAGTCCAA TTTCATTATC TTATAATTCA 1860

GATGTAGCTC ATGCT6AAGA TAAGTTAGAC CATTCTCAAG CAAAGGTAAT ATATTTGAGT 1920

AACCAAAATT TATTTGATGA ACTTGAGAAA AAAG6TTATA AACTGGAAGA TATATTTACA 1980

AAAGAAGAAA TAAAAAAATA TAAAGCTGAA GACCAATTGA GAGCGGGTAA AACTCAATAT 2040

GTAGAAACAG 6TAAAGATAC TGCAACATTA TATCTTTCTT CTGCATATAC AAAAACAATA 2100

GCTGCTTTAG GT 2112
(2) INFORMATION FOR SEQ ID NO: 488: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 454 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

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(xi) SEQUBNCE DESCRIPTION: SEQ ID NO: 488: 

GTAGGCACAC ATCTGCCATA TAAACATTCT TTTATACTAG TGTTCTCATA TAGTGTAGAG 60 

TTATAGTCTC CTTCTTGAAT CTCXSAATAAT TCAATCAACC TATCAACCTT AGTCTCTTCC 120 

GTTACTTCTT TTTCAATATC AACTATCAAG GGGATATCAA TTGGAATAAA ACTTGACGTC 180 

GAACACTTAT TTGTATTTGG ATGAAAACGA AGGAATCCAT CACTAAATCC TGTTGAAAAA 240 

AATATTTTTC CTTGTGATAG ATCCGGATTT TCTCGCGCCC ATTTAATTAA TTCATCTAAT 300 

CTCATTTCTT TTTTAACTTT GATTTTCATT GTTATATCTC CTCTTGAACA GTAAATTTAT 360 

CGTTAAtTGA TACGTATCCA GTCACATTAC ATAAGATGCT ATCAACATCA AAAGTCACAC 420

AACAGTTGCG TTCAACATCA TTT6AATAGA ATCT 454 
(2) INFORMATION FOR SEQ ID NO: 489: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1372 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 489:

TTGTCAGAAT TAGAATGCTT TTGAGTTACT TCATAATACT CATCAGTTTT TTGTGTATCC 60 

TTTTGACTTT TATTTATTTC TTTCCACTTA CCAGTATGAC TTTCTTTTTT TACAGTTATT 120 

TTCGGTTTGT TTAAAAAATG ATAACCGATG TTCTTTTTGG TATCTGTGGA CTCTAAAAAG 180 

ACTGAATTGT TTTCCTGATT ATCAGAATTG GTTGTTTGTT TATCGTCTGT ATATAACGTA 240 

TACCCATTCG CTTTGCGATT TTCAATCGTT GTAACTGGAT TCTTTGATGA ATCAGTACTT 300 

TTAATGCCAG TTCCTAAGAA GACAATTTTA TCGTTTAATA TGAAATATGA TTTTTTGGCA 360 

GTTAAAGTTT TGTCCTGATT TTCAAAATCC ATTCCGATAC TAGCATGTTG GTCATCAACT 420 

TTTGTTCCGC CAACAAAAGT TTTACTCGAC TTTTTATCAT CCGTATCTTT TAATATTTCA 480 

TT6TCTAAAG TTGTTGTACC TGATAAACGT TTCATATCGG CTGTCACCCA GAAGTTATCA 540 

TGATAGTGTT TGACATCGCT GTTATATAAA TAAGACATTC CAGCACCAGT GTGCCAACCT 600

TTTAAATTCT CTCCGTTGAT ACTTTCATAG CGTGCTACGT TTTTCGACGT CATACTTAAA 660 

CCAAATGCAA AGTCTAAGTC TTTGTTATGA TAGGTGACAC 6ATCCATGTC ATTATATATT 720 

TTAAGTTGTT GTOTTAATCC GTTTTTAGAA ATACTGTTAT CTGTCATTAA AGACTTCATT 780
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ACTGAAGATT TGACAATCTT TTTATACTTA GCTTTTGTTG AATCATCCAT GGCATCACTT 900 

AATCTCAACA ATGATTTCAT TACTGTTGCA GATGCTGAGT GACTGGTTTC ATTTTCACGA 960 

CTGATAGCTC TACCTCGTGA TAAATCCATC ATTTCACCTT TATAAATGAG TGGCATAAAT 1020

CCGTCGTCAA TCCATGACTT TAAGGTTGTA TCATTTTGGG TTTTATCATT AAAAGGTGTT 1080

TCTTTTATCA TOGGCATCAT TTGAGAAATC CCCTCTAAGA GTACAACGCC ATAAGCACCA 1140 

GTGTATGGAA CGTCTTGATG ATCAATGTAA GAGCCATCTT TATAAAATCC ATTACGTTCT 1200 

TTACCAGTGG CAGAATCTTG AACGTAAGTG AAGACTTTAT TAAATGAATC TATAGACTTT 1260 

TTCATCATAT CTTTATCTTC TTCGATAATA CATTCTAAAA GTTTCACCTT AGAAATGTCT 1320 

ACTAnATTTC CGCCTTTAGC AAGTTCAGnT TTTCCTACAC AAGATAATAT TT 1372 

(2) INFORMATION FOR SEQ ID NO: 490: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 564 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 490: 





ACAACAATAT AGTAAAGCGT CGTTAATTAA ACAATGGGAC CAATTTGTTC GTCTTATATA 60

AATGTACTTT aCCTTTATTT TTACAAAAAT AGCATTTTCC TATGTCATTT AACTAAACAT 120

GTAAGTTCGT ATGAACGAGG TTTGTTAAAT AGATGATTCT AGGAAAATGC TiTiTrCrn' 180

TGACTTAGTT TAAAATATTT TGCCACTTTG TACTGATAGT AGTTGCATTG TACTGTTGTG 240

CAGATTCTAT GCTATTAATT GAAAACTGTT GCAATTTTTG AGTATTATTT AGTAATTGGT 300

OGACCTTTTC AACCATTTGA TTGATGTCAC CTTGAGGTAC TAAATA6CCA TTAAATCCAT 360

CTTGaATCAG TTCTGaTGGA CCATAATCTA CATCATAACT GATCACTGGT 6TACCTACTG 420

AAAgCGACTC TAAAATTGCT AAGCCAAAAC CTTCCATTTT ACTTGTCGAT AACATCAGTT 480

CTGCTTTAGC AATCTCTTCA TTAATATGCG TCTTAAAACC ATGAAATTTA ACATGTTCCA 540

GATATnATGA TAATCTTCTA CAAG 564


(2) INFORMATION FOR SEQ ID NO: 491: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1277 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(Xi) SEQUENCS DESCRIPTION: SEQ ID NO: 491: 



TATCCACCCC ACGAnAGCCC CGGAAACTTA TTGTGTTACA AGATATATAA GCAGAAACGA 60

ACAACAGTTA ACAAAA7AAA TGAAATTAAA CGTTTTAAAA AT6AAACAAA TGAAATCATC 120

TATTAGGTTA TGAAACTGTT TATAGCTTGA ATAGAAGCAT TTATTTTTTA GGAGGACAAT 180

TATTATGCGPT CAAACATTTA TGGCAAATGA ATCAAACATT GAGCGCAAAT GGTATGTTAT 240

CGATGCTGAA GGCCAAACAT TAGGTCGTTT ATCATCAGAA GTAGCATCTA TCTTACGCGG 300

TAAAAATAAA GTAACTTACA CACCACACGT TGATACTGGT GATTATGTAA TCGTTATTAA 360

TGCATCAAAA ATCGAATTTA CTGGTAACAA AGAAACTGAC AAAGTTTACT ACCGTCACTC 420

AAATCACCCA GGTGGTATCA AATCAATCAC TGCTGGTGAA TTAAGAAGAA CTAACCCAGA 480

ACGTTTAATT GAAAACTCAA TTAAAGGTAT GTTACCAAGC ACTCGTTTAG GCGAAAAACA 540

AGGTAAAAAA TTATTTGTAT ATGGTGGCGC TGAACATCCA CACGCTGCAC AACAACCAGA 600

AAACTACGAA TTACGTGGTT AATTAGAAGG AGGAAATGAC TTTGGCACAA GTTGAATATA 660

GAGGCACAGG CCGTCGTAAA AACTCwGtAG CACGTGTACG TTTaGTACCa GGTGAAGGTA 720

ACATCACAGT TAATAACCGT GACGTACGGG AATACTTACC ATTCGAATCA TTAATTTTAG 780

ACTTAAACCA ACCATTTGAT GTAACTGAAa CTAAAGGTAa CTATGATGTT TTAGTTAACG 840

TTCATGGTGG TGnTTCACTG GACAAGCTCA AGCTATCCGT CACGGAATCG CTCGTGCATT 900

ATTAGAAGCA GATCCTGAAT ACAGAGGTTC TTTAAAACGC GCTGGATTAC TTACTCGTGA 960

CCCACXTTATG AAAGAACATA AAAAACCAGG TCTTAAAGCA 6CTCGTCGTT CACCTCAATT 1020

CrCAAAACGT TAATTGTCGG ACGATATATA CAAAACACCT CGATATTATG TCGAGGTGTT 1080

TTTTTGGCGT TTTTGCGGCG AATATGGAAT GTGTAGAATA TAAATGAATT TTTACCTTCC 1140

CACCATAAAA GATGAAQAAC CATCAATGTG GAGAACAATA AATAGTTGGA TATTCTGTTA 1200

rrTTTTi'GGA AGTGGAAGTG GATTTGGAAT ACTTTACTCn AAACGATTAA AAG6TTTAAA 1260

AAAACAACAA AnAGAAA 1277



(2) INFORMATION FOR SEQ ID NO: 492: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 673 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 492: 



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TTATTGACAT TGTTTTTATC CAAAATTCAT TGTTAAGACA TTTTCTTTAT GAAATAATAA 120 

TnATATTGAA GTATATTTTT ATTATTATTA AAAATAAATA AGGGGATACT TATGAGCACA 180 

AATCAAACAT TTTTAATATT TGTTATAGCA ATTATTCTAC TTACATCTGT AATAGGAATT 240

GTTGGACGAT ACATGAGTCG TCAACGTCTA TTAAAATCTA TGGAAACATT ATGGCAAACG 300 

ATTTCTCCAT TAGAAGCTTT TATCAGACCG AACTCACATT TCGACTATGA GTATAAGCTC 360 

TACAAGGAAA AATTTGAATC ACATTCATTA GTTGATGATA AAACTTGGTC CGACTTAAAT 420 

ATGAATGCAA TCTTTCATAA GATGAATTAT AATTTAACAG CTATTGGTGA AATGAAGCTA 480

TATGCCTGTT TACGTGGAAT GCTTTCAATT ACGAACAAAT CATTACTTAG TTTATTTAAT 540 

GATAATGCTG AATTTAGAAA AAACGTAACA TATCATTTAG CTTTGATTGG tAAAACTGTT 600 
aTCCAACATT TCCAGACCAA ATCACACCGG kAAACGTCCA AATATATTGn TCTATGCCCG 660 
GTTTACCAGT ATC 673 

(2) INFORMATION FOR SEQ ID NO: 493: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1240 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 493:



AAAAAAAGTT AATGCTGAC6 GT6TATTAAC TTTTGATATT CTAGAAAATA AATATACTTA 60

TGAAGTTATT AACGCTATAG GGAAAAGATG GATTGTTAGT CATGTCGAAG GTGAAAACGA 120

CAAGAAAGAA TATGTAATAA CTGTCATTGA TAGGAAATCA GAAGGCGACA GACAACTGGT 180

TGAATGTACT GCTAGAGAGA TTCCCATAGA CAAGTTAATG ATTGATAGAA TTTATGTTAA 240

TGTAACAGGA TCTTTTACAG TAGAAAGATA TTTTAACATT GTGTTTCAAG GTACTGGAAT 300

GCTTTTTGAA GTCGAGGGCA AAGTTAAATC TTCAAAGTTT GAAAATGGTG GTGAAGGCGA 360

TACAAGGTTA GAAATGTTTA AAAAGGGATT AGAACATTTC GGTTTAGAAT ATAAAATAAC 420

6TATGACAAA AAGAAAGACA GATATAAGTT TGTATTGACG CCTTTTGCAA ATCAAAAAGC 480

GTCTTATTTT ATTTCTGACG AAtCAACGCC AACGCTATAA AACTCGAGGA AGATGCAAGT 540

GATTTCGCCA CCTTCATTAG AGGATATGGT AATTATTCAG GAGAaGAAAC ATTCGAACAC 600

GCTGGGCTCG TAATGGAAGC TAGAAGTGCA TTAGCTGAAA TATACGGCGA CATCCACGCA 660

GAACCATTTA AAGATGGTAA AGTGACTGAC CAAGAAACTA TGGATAAAGA ATTACAATCG 720
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TATCCAGAAG CAGACCCACA ACCCGGAGAC ATAGTACAAA TAAAATCTAC CAAACTAGGT 840 

TTGAATGATT TAGTCCGTAT AGTACAAGTT AAAACGATTA GGGGTATAAA CAATGTAATT 900 

6TTAAGCAAG ATGTAACGCT TGGTGAGTTT AATCGAGAAC AACGATATAT GAAAAAAGTT 960

AATACTGCAG CTAACTATOT TTCTGGATTA AATGATGTTA ACCTTTCTAA TCCTAGTAAA 1020 

GCGGCAGAAA ACTTGAAGTC TAAAGTAGCG TCAATAGCTA AATCAACACT CGATTTGATG 1080 

AGTAGAACTG ATTTGATTGA AGATAAACAA CAGAAGGTAA GCTCTAAAAC TGTGACTACA 1140 

TCTGACGGCA CTATCGTTCA TGATTTTATa GATaAATCmA ACATTAAaGA TGTAAAAmCG 1200 

aTTGGAACGa TTGGCGATtC TGTAGCTAGA GGATCACATG 1240 

(2) INFORMATION FOR SEQ ID NO: 494: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1311 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 494:

ACGGTGGATT TAGCCTAGAA GATTTAACGC ATCAAGGTAa ATTaTCAGCA TTTAGCTTTA 60 

ATGATCAAaC AGGTCAAGCA ACATTGATTA CTAATGAAGA TGAAAACTTC GTAAAAGATG 120 

AGCAACGTGC TGGCGTAGAT GCAAATTATT ACGCTAAACA AACATATGAT TATTACAAAG 180

ACACATTTGG TCGTGAATCA TATGaCAACC AAGGTAGTCC AATTGTTTCA TTAACGCATG 240 

TTAATAACTA CGGTGGTCAA GATAACAGAA ATAATGCCGC ATGGATCGGT GACAAAATGA 300 

TCTATGGTGA TGGTGATGGT CGCACATTCA CAAGTTTATC GGGTGCAAAT GACGTAGTAG 360

CACACGAATT aACACACGGT GTGACACAAG AGACAGCGAA CTTAGAATAT AAGGACCAGT 420 

CAGGCGCTCT AAATGAAAGC TTTTCAGATG TTTTTGGATA CTTTGTAGAT GACGAGGATT 480 

TCXTAATGGG TGAAGATGTC TACACACCTG GAAAAGAGGG AGACGCTTTA CGCAGCATGT 540

CAAACCCAGA ACAATTTGGT CAACCAGCTC ATATGAAAGA CTATGTATTC ACTGAAAAAG 600 

ATAATGGTGG CGTACATACG AATTCTGGAA TTCCAAATAA AGCAGCTTAT AACGTGATTC 660 

AAGCAATAGG GAAATCTAAA TCAGAACAAA TTTACTACCG AGCATTAACG GAATACTTAA 720 

CAAGTAATTC AAACTTCAAA GATTGTAAAG ATGCATTATA CCAAGCGGCT AAAGATTTAT 780 

ATGACgAGCA AACAGCTGAA CAGGTGTATG AAGCATGGAA TGAAGTAGGC GTGGAGTAAA 840 

AATATATAAA CaAOAAGAAG TAATGTTAAA CACTTATAAA TAATTAAATT TTAAATACAG 900 

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ATTAGATGAG AGGAGTCTGA GGGTTGTCTG CCGAAAGACT ACTCGGCAGT CTAAAATCAT 1020 

TACAAGTAGT AGATATGTGA TAATTAAATG CTGACTTAGA ATACAAAATT CATTTTAAAA 1080 

GTTGTCACAA AAAATTTACA TGTATTTTTA TTATCTTTTG CAAAACAAAG TGTTAAATTA 1140 

TAAATGAaAC ATGCATGAAT TTATTTTTTA ATACAAGAAA OGTAACTACC AAAGGAGTTT 1200 

ACAATATGAA GAAAAGTAAA CGATTA6AAA TTGTTTCTAC AATAGTTAAA AAGCATAAGA 1260 

TTTATAAAAA AGAACAAATc ATTTCATATA TTGAAGAATA TTTTGGTGTA A 1311 



(2) INFORMATION FOR SEQ ID NO: 495: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1761 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 495:

TGCACTTTCT AAAAATACTT GrCTTTACTTG TTCCAATTCC TTGTCAGACA TTGATTCGGP 60

TAATTTATTC ATTATTGCCT CAAATACGTT AATTATGTCA TCCATTTCAG TACTATAAAA 120

ATCAAACCAT TTTGCCGTAT CTTTTTCTCT ATTAAGCTTA TGATCACTTT GAGAGCGCTT 180

AGCTAATTCT GCATAAATAT ATGGACAAGG TGCCATTGCA GCAATTGTAT AAATAGCATT 240

TTCACGACTA TGCGCTTGGA AATACATATG TTTTATGTAA TGGTCGCCAC TTGGAGGCCA 300

AAcrnTOiT TTAATGATTT CTTCGTATGA TTCACCAACA ATTTGCGCTA AAATATCATG 360

CGCAAGTACT TCACCTTCAA CCATAAATTC TATTTGCTCT ACTAAAAATT TTACGTCATT 420

CATGCTATTC ATTTTTGGAA TTAACAAAGC ATATAAGTTT GTAAATTCTT TTAAATACGC 480

AGCATCAGCT TTTAAGTAAT GGCGTAATGC QTCAOcTCCT ATATCTCCOG ATAACATCTT 540

CTGAATAAA6 TCATCCTCAT AAATATCATT AATGATTGGC TTTGCAGCTT GGTACAATTT 600

TTGTGAAAAT TCCATTGTAA AAAATCCTCC CTAAATAAAA AAACTACTTC CAACATGAAA 660

GTAGTTTGAT GGCAATGTTG CTATACTAGC CCCATCACTT CAATAACTAC TTTCCTACGT 720

TGGTACTAAC CAAATCAGGT CATAAGGGTC TGAACAATTC ATCTCAGCCA TATCATTAGG 780

CTCCCCTAGT AGTTCCTTAG TATTCAATTG CAAATTAATC TTAGCAAACG GTTTCAACAT 840

TTTCAATTAT TGTTGCTCAG TTGTATTATT ATCTTTAAAT AATAATTCTA TAATGACATA 900

TATTTGCGAA ATAAAAAAAC CGGAACATAT CGAGAATTCC CCGATATATT CCAATCTAAA 960

AGTTACTTAT ATAACTATTA ATTAGCTATG CATAAATGGC TTATGCAGTA ACCCAATGTC 1020

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10 



TTGCTGGTGA TACACCTTTA TATTTAGCAG GTGCTACTGA ATCCCAAGTT GATTGTAAGA 1140

ATTGATACTT ACCAGCTGCA CCTGgATGTT GGrTTTACAQ CATGAATATT GCCACCTGaT 1200

TCACGTTGrG CAATTTGTTT TAGATGAGCa TTCACATTTA CTGATGAACC TTCTGATGAT 1260

TTTGATyCAG TTGGTGTTGC AGTAACTTGT GAATTGTTTG ATGTTGATGC TTGTGGTTGT 1320

TGAGTTTGAG CATTTTGTGG TGCTTCAACT TCTTGTGATT GTACTTGATT AGCTTGAACA 1380

GCTGATGGTG CAACATTATT AGTTGCAGGT GCTTGTGCAC TCATGTCTGC TCCATTAGTA 1440

CCTGTTGCAT GGTAATTCCA AGCAAAGTGT GTACCATCTG ATTCAAAGTG ATAAGTAAAC 1500

CCTTCATAGT CAAATGTATA ATTATAAGCC CCAGCTTCAA TTGGTTTTTG ATTTAATGTT 1560

TGATCATTTG ATTGCGCCAT TTGCCTGAAA GATGCTTTAT TTAAGTCCGC TTCACnTGCA 1620

TGGGCTTCGT GGACCTGCAT TTCCTGGCTA CGATTCCTAA ACCTACTGGC nAAnGATGAT 1680

GCGAGTAATG TTTTCTTCAT AATCTTAAAA TCCTCCTACA AGTGAATTTG TGTCTCTAAA 1740

AGTTTTACAG TGGACGACTG T 1761



(2) INFORMATION FOR SEQ ID NO: 496: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 794 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 496: 
TCATTTATGA AAAATGTOGG AnAGAnCCaa GaAAiTiaCAAT TGAgCGTGAA GAAAAAGCAA 60 
GACTTAAAGA AGAACAAAAG GCACGTCAAA ATGAACAGCC ACAAATAAAA GATGTGAGTG 120

ATTTTACGGA AGTGCCTCAA GAAAGAGATA TTCCAATTTA TGGGCATACT GAAAATGAAA 180
GTAAAAGCCA GAGTCAACCA AGTCGAAAAA AACGAGTGTT TGATGCAGAG AATAGTTCGA 240 
ATAACATCGT AAATCATCAT CAAGCAGATC AGCAAGAACA ATTAACAGAA CAAACTCATA 300 
ACAGTGTTGA AAOTQAAAAC ACTATTGAAG AAGCTGGTGA AGTTACGAAT GTATCGTATG 360 
TTGTTCCACC GTTAACTTTA CTTAATCAAC CTGCAAAACA AAAAGCAACA TCTAAAGCTG 420 
AAGTGCAACG TAAAGGACAA GTACTAQAGA ATACATTAAA AGATTTTGGG GTAAATGCAA 480 
AAGTGACACA AATTAAAATT GGTCCTGCAG TAACTCAATA TGAAATTCAA CCAGCTCAAG 540 
GGGTTAAAGT GAGTAAAATT GTAAACTTGC ATAATGATAT TGCATTAGCT TTAGCAGCAA 600 

AAGATGTTAG AATCGAAGCG CCAATACCTG GT06TTCTGC AGTAGGTATT GAAGTGCCAA 660 

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ATAAACTAGA AGTTGGATTA GGaAGAGATA TATCAGGTGA TCCAATTACT GTTCCACTAA 780 
ATGAAATGCC ACAC 794 
(2) INFORMATION FOR SEQ ID NO: 497: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1161 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 497:

AGCCAGTTTT GcATTTCGTC AAATCGCAAT GAATATATTG ATTGCTTGTC AAAATTTGGA 60

AGAAATTTAT TCTTCTTTTT CTTTTCAGCA GTTATCATTC CTAATTGTCC TTTCTTTTTA 120

TCTTAGTGAT AAAGAATCCA TCTGAATTAA AGTCTTGCGG CATGATTTGT AACGTTTTGA 180

CCAACTCTCC AGTTATCGGA TGTTGAAACG GTTCAAATTC GAAGTTTTTA TTATTTTTCA 240

AAAACGTATA AATCACGTTT TCATTTTCTA GTTGCTCAAT TGTACATGTT GAATAGATGA 300

TTTCTCCACC TATTTTTACA TTGTTTTTTA CATTTTCCAA TATTTCAAGC TGTAATTCAA 360

CTAGTOACTC AATATGTTGT TTGCTTTGAG TATACTTAAT CTCCGGCTTA TGTCTCATTA 420

CACCTAATCC GCTACATGGT GCATCAACAA GTATCTTATC GTATGTTTTA TCATAAGGTT 480

TTGTCGCATC ATGTTGAAAA 6CTTTAATAT TTGTTAATCG TAATTnTTTT ATATTAAAAT 540

TAATTAAGTC TATTTTGTGA TCATGTATAT CTGAAGCGTC AACTTGCCCT TCTGGCATTA 600

AAACTTCAGC AATGTGACAA GCTTTACCGC CAGGTGCACT ACATGCATCT AATACGTGAT 660

CATGTCGGTC TACATTCATA ATGTGTGCAa CAAACATTGA GCTTTTATCT TGAATTGAAA 720

CGAATCCATC TTTAAATGAA CGAGAATGAA TAATTGGTTG TCCTCCTATA TGGAGACAAT 780

AAGGTAAGTC ATGATCTTTT TCAACGTCAT AACCTTCGTC TTGCAACTTT TCAATAATAT 840

CATCTAATGA TGCTCGCGTC AGGTTGGCAC GCACAGTTGT TGATGTOGTT TCTAAAAATG 900

ACTGTAAAAT TTTTTCAGTT TCTTCGAGAC CATAATGTGT TGCCCAATGA TCTATAATCC 960

ACTTCGGCAT ACTATACTCG ATTGCCATTC rmTlTAGG ATCTGCAATT TCATTAAAAT 1020

CAGGTAAGTC ACTACGCATC ATTGTACGTA AAATACCATT TACGACATTA CCATTATGAT 1080

A6CCACCGCG TTCTTTTGCT ATTTCAACTG CTTCATTAAT AATGGCATGA TTTGGAACTT 1140

TATCTAnATA nACATATTGA T 1161



(2) INFORMATION FOR SEQ ID NO: 498: 



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(A) LENGTH: 1504 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 498:

AGCTCACGTC ATCTTCGQCG GCGCTAAATT AAAATAATCA ATTTCTGAGT TAAACTTTTA 60 

TTTACAACAT ACTATTACTA TACATTACAA ATTTTTAAAA TATACATTAC ACTCATTACT 120 

CAATGGmAAG CGTATGATTT CmCAGCCCCC CTAGCTTGTA GAAATCATAC TTTCCmTT 180

TCAATATATA TACAACTATT AAATCCCATA AGATIGCAGA GCACATAAGT AAATTTTTTT 240

AGAGCTTGAG GTTTGTTTAG CTTAAGCAAC CCATGAGCTC AAACACTTCC TGTTCACATA 300 

ACACTACAAA TCGCATTATG TTGCTTAATC TTATGTTTAT ATAAATTACA CACAATAAAT 360 

AGAAAGAATG TGAACATCAT GAATAAATTA TTGCTACTCG TTACATTTAT CATTCGTGTG 420 

GGTTCAGGTA TTGTTATGTT AATGCAAGGC TACGAAAAAT TAACGGGCGG ATTTACGCTG 480 

AAAGGTTTAG TACCAGTCAT CGCTAACAAT ACTGATTCAC CAGAGTGGTA TAAGTGGTTT 540 

TTCGCAAATA TAGTTGCACA TACGACGTCA TTATTTGATA TTGTTGTCCC ACTCGGAGAG 600 

ATTGCAATTG GATTAGGTTT AATTTTTGGA GTTTTTGCAT ATGCTGCTAG TTTCTTTGGA 660 

GCCTTTGTTA TGATAAATTA TATCTTAGCA GATATGATAT TTACQTATCC TCTTCAATTA 720 

ACTTTCTTTA TCCTTTTACT AATGAGTCAC TCATTGTTAA AACAGATTTC ACTTAAAGAA 780 

ATCATTAATT ACTTTAGAGG TCGTAAGAAC AGAGGTGAAA AAATAGATGA CCCACTTACT 840 

GATCGTGGAT GATGAACAAG ACATTGTAGA CATTTGTCAA ACCTATTTTG AATATGAAGG 900

TTACAAAGTA ACAACGACAA CTAGCGGTAA AGAAGCAATT TCTTTACTAT CAAATGATAT 960 

TGATATCATG GTACTTGATA TCATGATGCC AGAAGTTAAT GGTTACGACA TTGTCAAAGA 1020 

AATGAAAAGG CAAAAATTAG ATATCCCCTT TATCTATTTA ACTGCCAAAA CACAAGAACA 1080 

TGATACCATT TACGCCTTAA CTTTAGGTGC AGATGACTAT GTCAAAAAAC CATTTAGTCC 1140 

AAGGGAACTC GTTTTACGTA TTAATAATTT ACTTACAAGA ATGAAGAAAT ACCATCATCA 1200 

ACCAGTTGAA CAACTGTCGT TTGATGAATT AACACTTATT AACTTAAGTA AAGTtGTGaC 1260 

tGTAAaTGGT CACGAaGTCC CTATGCGTAT TAAGGAATTT GAGTTATTGT GGTATTTAGC 1320 

TTCTaGAGAA AATGAAGTTA TTTCTAAATC AGAATTACTT GAAAAAGTTT GGGGATATGA 1380 

CTATTACGAA GATGCTAATA CCGTGAATGT CCATATACAC CGTATTAGAG AnAAATTAGA 1440 

AAAAGAGAGC TTTACAACAT ATACCATCAC AACTGTATGG GGATTAGGAT ATAAATnTGA 1500 

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25 



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(2) INFORMATION FOR SEQ ID NO: 499: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1623 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 499:

ATTGAAAGCG ATAATTCGTA nTAATTGAGT TTGTTGAAAA ATTTAGGGTA ATGTAAAGAT 60

ATAAAAGATA CATAGAyTGG AGAGATATAA AGATGTTGAA TGAGATACAA ATATTAaATA 120

aTGGATACCC GATGCCTTCA GTTGGGTTAG GTGTTTATAA AATCTCTGAC GAAGATATGA 180

CTAAAGTTGT AAATGCTGCA ATTGACGCAG GCTATAGAGC GTTTGATACA GCATACTTTT 240

ATGATAATGA GGCTTCACTA GGACGAGCAT TAAAGGATAA TQGCGTCGAT AGAGAAGATT 300

TGTTTATAAC AACGAAGTTA TGGAATGACT ATCAAGGTTA TGAGAAAACA TTOGAATATT 360

TCAACAAATC GATTGAAAAT TTACAAACTG ATTATCTTGA TTTATTTCTA ATACATTGGC 420

CTTGTGAAGC AGATGGTCTA TTTTTAGAAA CATATAAAGC TATGGAAGAA CTTTACGAGC 480

AAG6TAAGGT AAAAGCAATA GGTGTATGTA ATTTTAATGT TCATCATCTA GAAAAATTAA 540

TGGCTCAATC AAGTATCAAA CCAATGGTGA ATCAAATTGA GGTACATCCA TA'ITITAACC 600

AACAAGAATT ACAAGAATTT TGTGATCGTC ACGATATTAA AGTGACTGCA TGGATGCCTT 660

TGATGAGAAA TAGAGGACTA CTAGACGACC CTGTCATTGT TAAAATTGCT GAAAAATATC 720

ATAAAACACC AGCACAAGTT GTATTACGTT GGCATTTAGC acacaataga ATTATTATTC 780

CAAAATCTCA GACACCTAAA CGCATTCAAG AAAATATAGA TATTTTAGAT TTTAATTTAG 840

AATTAACAGA AGTAGCTGAA ATTGATGCTT TAAATAGAAA TGCAAGACAA GGTAAAAATC 900

CAGATGATGT GAAAATTGGG GATTTAAAAT AACTGGATGT TAAATTTTAC GTTTATGAAT 960

GCCTTTTAAT GTGTACATTA AAATAAATGA GTTGGTTTTT ACTATTTGAT AAAACAATAC 1020

TCAGGTACAT TCAAAATCTT TTAAATAAAA AGGATGGACA TAGATGAAAA TTAGAGTCGT 1080

CATTCCTTGT TTTAATGAAG GGGAAGTCAT TACACAAACA CATCAACAAT TAACTGAAAT 1140

ACTTTCACAA GATAGTAGTG TGAAAGGCTA TGATTATAAT ATGCTTTTCA TAGATGATGG 1200

TAGTACGGAT ACCACTaTAG ATGAAATGCA ACATcrrocc ACAATAGATA GGCATGTCAG 1260

CTTTATTTCT TTTAGTAGAA ATTTTGGAAA AGAAGCAGCT ATGATTGCAG GTTACCAGCA 1320

TAGTACTGAA TTTGATGCAG TCATCATGAT AGATTGTGAT TTGCAACATC CACCTGAATA 1380

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IS 






TAGAAGTGGT GAAAATTTTA GTCGCAAAAC ATTAAGCCAT TTGTATTATA AGTTAGTTAA 1500 

TTGCTTTGTA GAAGAAGTAC AATTTGATGA TGGTGTTGGT GATTTTAGAC TTTTAAGCCA 1560 

AAGAGCTGTT AAATCCATTG CATCACTTGA AGAATATAAT CGnTTTTCAA AAnGGnTATT 1620 

TGA 1623 
(2) INFORMATION FOR SEQ ID NO: 500: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 605 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 500: 

AAAGTnGQTG AAnCTATATA CTTAATCTAT ATTTATATAT TAACCATTAG GGTTAAAAAT 60

TACTCTAGCA TTTATGAATA GATGGGAGTT TATTTTATTA TTATATAGGA GAGATGTTGA 120 

ATGACACATC 6CGCACTATT AGTTGTTGAC TATTCATATG ACTTTATCGC AGACGACGGC 180 

TTACTAACAT GCGGTAAACC TGGACAAAAT ATTGAAGATT TTATTGTTTC TCGTATCAAT 240 

GACTTTAATT ATTATCAAGA CCATATATTC TTTTTrrTGG ATTTACATTA TTTACATGAC 300 

ATTCATCATC CTGAAAGTAA ATTATTCCCA CCACACAATA TCGTAGATAC AAGTGGTAGA 360 

GAATTATACG GTAAAGTAGG TAAATTATAC GAAACAATTA AAGCGCAACC TAATGTACAT 420 

TTCATTGATA AAACGCGCTA TGATTCGTTC TTTGGTACCC CGCTTGATAG TTTATTGAGa 480 

GAAAGAAGTA TTAATCAAGT CGAAATCGTT GGTGTATGTA CCGATATTTG CGTGTTACAT 540 

ACAGCAATTT CTGCATACAA CTTAGGtTAT AAAATTTCAG TACCTGCTGA GGGAGTGGCT 600 

CATTT 605 
(2) INFORMATION FOR SEQ ID NO: 501:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1739 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 501: 
TAGGTTnAAA GCATAGnTTT nTCAAAAAGA CAAATCATTC ATATATTGGA GGATATTTTG 60 
GTGTAAGATA TAGTGCAACC ACAATTGCTA AAGACTTGAA GGAACTAAAT ATATATCGTG 120 
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TGAGAGAAAA 


GTTTAGACAC TATTGTGAAC ATGAAGTTCT AAGTTCAATC ATCAATGGTT 240

CATACATTAT CGTCAAAACC TCACCTGGTT TCX3CCCAAGG CATAAACTAT TTTATCGATC 300

AGCTAAATAT AGAAGAGATA TTAGGTACGG TGAGTGGAAA TGACACTACA TTAATCTTAA 360

CTGCCTCAAA TGATATGGCA GAATACGTAT ATGCAAAATT ATTTAAATAG ACATGTATCA 420

AATGAATAAT AAAAATTTGT TTCGTATCAC GTGTACTCAA GTTAGTTACC AAATATTAAC 480

rrGTGTACGC GTTTTTTTAT GGAAAGAAAG AATTCATAGT CATTCAATTG ACTGTATAAA 540

AAACTTTATA CAACATGTTT TTATGGGTAT TTTTGAATAA AAAATGTATA TTTTGACCCA 600

AAATACCTTT ATTTATGTAT AAAAATCCAT TATTATGTAT TGTATAACAA AAAGATATGA 660

AATTTTCGAC TTTCTTTATG TGAATATAAT CACATGTAAG CGTTTGAAGA TTGTCTATAC 720

TCTAAATGAA TTCAAAGATA AAAGGAGGAA ATAGACATGA CAGATGGTCC AATTAAAGTA 780

AATAGCGAAA TTGGAGCrTT AAAAACTGTG TTACTTAAGC GTCCTGGaAA AGAATTAGAA 840

AATTTAGTAC CTGATTATTT AGATGGATTA CTATTTGATG ATATTCCATA TTTAGAAGTA 900

GCTCAAAAAG AGCATGACCA TTTTGCGCAG GTGCTAAGAG AAGAGGGTGT TGAAGTACTT 960

TACCTTGAGA AGTTAGCAGC TGAAAGTATT GAAAATCCTC AAGTAAGAAG TGAATTTATT 1020

GATGATGTAT TAGCAGAGTC TAAAAAAACA ATATTAGGTC ATGAAGAAGA AATTAAGGCA 1080

TTATTTGCGA CACTTTCTAA TCAAGAACTT GTAGATAAAA TAATGTCAGG GGTACX3TAAG 1140

GAAGAAATTA ATCCGAAATG TACACATCTA GTAGAGTATA TGGATGATAA GTATCCATTC 1200

TATTTAGATC CAATGCCAAA CCTTTATTTT ACTAGAGATC CACAAGCCTC AATAGGACAC 1260

GGTATAACAA TCAATCGGAT GTTCTGGAGA GCACGACGAC GAGAATCAAT ATTTATTCAA 1320

TATATTGTAA AGCATCATCC TAGATTTAAA GATGCGAATA TTCCAATCTG GTTAGATCGA 1380

GATTGCCCAT TCAATATTGA AGGCGGCGAT GAACTTGTTT TATCTAAAGA TGTCTTGGCT 1440

ATAGGCGTTT CAGAACGTAC ATCTGCACAA GCTATTGAAA AGTTAGCGCG ACGTATTTTT 1500

GAAAATCCGC AGGCGACGTT TAAAAAAGTA GTAGCAATTG AAATTCCAAC TAGTCGAACT 1560

TTTATGCACT TAGATACAGT ATTTACAATG ATAOATTATG ACAAATTTAC AATGCATTCA 1620

GCCATTTTAA AGGCAGAAGG CaATATGAAT ATATTTATTA TTGAATATGA TGACGTAAAT 1680

AAAGATATTG CCATCAAACA ATCTAGTCnT TTAAAAGATA CTTTAGAAGA CGTACTAGG 1739



(2) INFORMATION FOR SEQ ID NO: 502:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1745 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

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(xi) SEQUENCE DESCRIPTION: 






CTGTACATAC 


AGCAATATCG 


TTAACAAATG 


TAAACCTAAA 


ATACTGTTAT 


TTTTATTACT 


AAGGTGGAAC 


GAATTCAGAA 


ATATCTGCTC 


AACTTATAAA 


TGAATAATTA 


GTACTAGACA 


ACTTTTTATT 


CATTGAAGTT 


AAGCGTAATT 


CACGTATGAT 


TGTTTTAGCT 


CCTACTTGTT 


GATGAACCTT 


GACATTAGGT 


AAATGTTTAA 


CTAAACTAAA 


CGTACCTTCT 


TTTTTACTAT 


ATCTATCTGT 


ACTTCTCTCA 


ATAATGTCTA 


CCGGAATGAC 


CGCTATTGTA 


TGTTCCATGC 


GTCAACCCAT 


AATGGTAACG 


TTTAATCATA 


TTGCTAAATT 


CACAAACGAT 


GATACCATTT 


TTTAAAGCTT 


TATCAATGAG 


ACCTTTATTA 


TGAATATCAC 


GTTTTGACAA 


TGCTTTTAAA 


GATTGTGCCT 


CTAAATCCAA 


ATTCGCAAGA 


TTTTGaTCAA 


CAAAGATTAC 


CTTATCCATA 


CCGCTTCCTG 


CAAATAAATC 


TAAACCTATA 


CCTTCTTTAA 


CTTTATCCAT 


AGTTGGTCTC 


TTACTTTTAT 


GTTTACCTGC 


AATGACGCGC 


ATTTAATATA 


ATTTATTGAG 


AAAAAGGAGA 


CACTTGGTGA 


AGGTCTAACA 


GATTTGTTCG 


AACGTATTGA 


TAAAATTATC 


TATTTTCATT 


TAGCAATCAT 


TATGAACCCT 


ACAACTGGCA 


ACGCTATTAA 


ATATCCATAT 


CCAGATTCAA 


CTGAAAAATT 


CGACATACCA 


ATTTTAGGTA 


ATTTATCGTT 


ATATTATAAT 


TATTTAATTA 


aACTTCAATA 


ATATTAATTA 


TATATTTCGT 






SEQ ID NO: 502:

AAAACAGTAT TTTAGGATTG TAAACCATGA 60 

TAAATTTCTT CTTCAATGCC TTTTCAACAT 120 

GATAAGCTGC AACTTCTTTA ACAATACTTG 180 

TCATATATAA CGTTTCAATT TCATTGTTCA 240 

CATATTCAAA ATCACTGACT GCTCTTAAAC 300 

CACAATAATC GACTAGTAAA CCACTAAATT 360 

CAGATTGTTC AATTAAATCC ATACGCTCTT 420 

TTTTAAGAAC ACAGACATGA ATTTCATCAA 480 

AATGACCATA AGTAATGGGG TCAAAACTAC 540 

TATTCTCCCT TTTCTAATAA CAATGTGTCT 600 

TTAAACGGTT GATAATCtAT TTCTTCATGA 660 

TCTTTCAATA AATTAAACTC TGAAATTAGT 720 

TAAGGTGGAT CTAaGAAAAT GACATCAAAT 780 

GCTCTATCTG CATTATTTTT ATAAACTTCA 840 

TTTGaTTTAA TAACTTTTAC AGCTTTAAAA 900 

CCTCGAGAGA GTGcTTCTAT TCCAAGCGCC 960 

CCTGACACAT CATATAAACT ATTAAAGATA 1020 

GTATTACGGC CTTCCATACT TTCTAAAGCT 1080 

ATGTTGTTCA CACTTCCAAT TCATTTAGTT 1140 

ATGATAAACC AATGAAACAA ACATTTATTA 1200 

AATTCATGAC GATGATTGAA TATAACCATC 1260 

CACCACAAGC TGAAAATAAA AAGTCATCTG 1320 

ATCATTTCCA AGCATTTTAT ATCATGATAA 1380 

ATAAAAAGTT TCAAATGATA AATGATTGTG 1440 

TCGATGTACA GCCCCCTCAA GCATTTCATG 1500 

GTGTGTTAAG GCTCCAAAAA TGGATACCAG 1560 

GTTTCTCTTT TTCGTAAGTT TTCTTTAAGT 1620 
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TTTGATCAAC ATCTTGCTCA TTCACATACA TAATTACAAA TTTACGATCT CTATTTGAAT 1740 

GAACG 1745 

(2) INFORMATION FOR SEQ ID NO: 503: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1035 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 503:

TCGTCTTTAA TCTTGCTGAC TTTAGAAGGC TTACGA6TGC GGTTACCATT TTTTGCATCT 60 

TTAACTGATT GAACTAAAGC TtGACGTGTA GATTTATCAG CTAAACTAAT TGCACCACCA 120 

ATTACGGCAC CAATTAAAAT ACCAGGAACA AATTTATTTT CCATAAAAAA CTACCCCTCT 180

TTCAAATTTG CATCTTTTAC OATGTAGTCT ATTAAATTAT CACAAGATGA TAATACCATG 240 

TCGTATACAC CTTCAAAATT ATTCGTGTAG TATGGATCTG GTACATCACT CTCTTCCATA 300 

TTACTAAATT CTAACAGTTT GAACAATTGT CCCTTAAGAT TAGGATTGAT AGATTTAATA 360 

TTATCAACGT TACTTTGATC CATAGCCACA ATGTAATCAA AATCATCTGT CGCTTCGAAT 420 

AATTCACTAA TCATGCCATC AAATGGAATA TTGTGTTTGT TGAGAATTTT TTGTGTACCT 480 

TCATGAGGTG GCTCTCCTAA ATTCCAGCTA CCAGTACCTC TTGAATGTAC TTTAATATCA 540 

TGAATATTTC TGTCTTTAAG TCTTTGTCGC ATGATTGCTT CTGCCATTGG AGAACGACAT 600 

ATATTGCCAA GACAGACAAA TGCTACATCT ACCATTTTGA TTCCTCCAAA CTATGTAGTT 660 

ATATCCCCAT TTTATAGCGA CTTTAAACAA TAAGAAAGCA GATTATATAA AATTCTATTA 720 

AAGTTTATTA AATTGTGATA CTTTGATAAC ATAACTATTA TTAGAGGTGA ACATTGTGGC 780 

TATGACAAAT 6AAGAGAAA6 TnTTAGCTAT TAGAGAGAAG TTAAATATtG TTAATCAAGG 840

ATTATTAGAT CCTGAAAAAT ATAAAAATGC AAATGaAGAA GAATTAACAG ATATATATGA 900 

TTTTGkTCaA yCAAGAGaAA GATTGTCGCC AAGTGaAGTG mCAGCTATTG CTGaCGCTTT 960 

AGGACAATTG CGACACGAAT AGGAGTGGGA ATTTTGACTA ATTACAAAGA AAAGTTACAA 1020 

CAATACGCTG AACTA 1035 
(2) INFORMATION FOR SEQ ID NO: 504: 






(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1284 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
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10 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 504: 
AnCCTGACAA GATTCTTATA TTCATAGATA TGAGAGCTAA AAATGAAATC AACAAACAnA 60 

CAATAAAGTA AACGATAATA GCCCATATAC CATTTTGTAA CCATATTACA AATTGTGTTG 120 

TATTATAGCC ATTTCCAGCT AATAATTGCT GGATAAATGC ATTATTGTTT AATGTATTTT 180 

CAAGATTAGC AAtCGATGTG TTATTACTGA ATGAAACAAG TGCTATAAAC ATCGTAATGA 240 

CAGTAAGCAC TAATAACATC ACCCAACATA ACCAACCTAG AACTTTTTCA GTTAATCTAC 300 

TTACTGGACG TTTAATTTGA GTAAATTGTT CTCCAGTCAT TCGTTACAAC TCCTTATAGT 360 

ACTTATCCCG TTATTATAAC TAAATATACA GTAAATAACT ACTATTTATG ATTTTATTTT 420 

AATGACATTT TGAAATTCAA AAA6TTTTCA TTGTATTCAC TTAAAACTTC AGGTCCTAAA 480

TCTTTATAAA CTTCAAGGCG TTCTTGCTCT TTCTTAGTCG GATAAAAACO ATGGTCGTCT 540

TTAATCTCTT TAGGCAACAA TTGTCGAGCA GCCTTGTTTG GCGTTGCATA GCCTACGAAT 600 

TCTGTATTTT GCTTGTTATT TTTAGCATCT AATAAAAAAT TCATAAATTT ATATGCACCC 660 

TCTTTATTTT GTGCCGTTTT TGGAATTACC ATATTGTCGA ACCATAAATT CGATCCTTCT 720

TTAGGAATAA CATAATTATA TTTATCCCCT TCTTGCACTA GAGGTGCTGC AACACCACTC 780 
CAAACAACCG CTATGTTACC TTCATTTTGT TGAAGCATCA TGGTAATTTC ATCACCTACG 840 
ACACCTCTTA CTTGTGGTGC TAGTTTGGTT AAATCTCGCT CTGCTTCTTT TAAATGGTGC 900 
GAATTACGGT CATTAAGATT ATACCCAAGT TTATTCAAAC TCATGCCTAT AATCTCTCTA 960 

GCACCGTCAA CTAGTAAAAT TTGGTTTTTA AATTTAGGAT TATACAATGA CTTCCAACTA 1020 

TCAAATGATT CATTTGGATA CTTTTCTTTA TTATATAAAA TACCTACAGT TCCAAAGAAA 1080 

TAAGGTAAAG AATATTTATT GCCTCTATCA AATGACATAT TCATATAATC TGAATCTAAA 1140 

TTTTTAATAT TAGGTACCTT ATTATGATCT ATTGGTAACA ATAAATGATC TCTTTTCAAT 1200

TTTTGAACTG nATATTCACT AGGAnAAGCA ACATCATAAT GTGTACCGCC AGTGCGAATT 1260 

TTGGnGTCCA nCGCTTCATT TGAA 1284 
(2) INFORMATION FOR SEQ ID NO: 505:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5763 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



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10 






ATAATTAACA TCTTTTGCTA TATACCACCA GTTTGATACA TAAAATATCG CAGCAATAAT 60 

ATCATGTTTA ACCCTAATGA TATTATCTGA TTTTAATAAT AAGGTTGCTG TCCCTACAAC 120 

CATTAATAAA ACTATGACTG CTGGTAATAA ACGTTTTAAA CGACGTATCC AAAAGCTTTT 180 

CAATTTAATG ATACCTGTGT CATCATACTC TTTGAGTAAT AAGCTTGTAA TTAAATAACC 240 

AGAGATCACA AAAAATGTAT CCACACCTAA AAAGCCACCT GtCAACCATT GCTTATTTAA 300 

GTGGTAAATA ATAATTCCTA GAACAGCGaT TGCCCTCAAA CCATOGAGCC CTGQAATATA 360 

TCTCATTTTC TTATACTTTG TAAAACCCTT TGTTTTGTTC ATTTTTGCAT TCTTCCCTTT 420 

TAAAACTGTT CTTCTTAGAT GCTTAATTAA ATTTAGTTAT GCTGTTTAAA AGAATATTGA 480 

AATGCATATG TATATTATTG AATTACGACA TCATCAAAAT CATATTGACT AAAATACTGT 540 

TAAATTAAAA AAATTACCAA TGATGATTCT TACTTCGAAA TCCAATTTGT AATGCAACTC 600 

GGCAACTTAA AAACTATGAA GTATTATGTA TTGTAATATA ACTGTAATAT AAATTCAATT 660 

TATTATAAAA ATTTTCAAGA AAATATTCAA CTAGAAAATG AATTGTGCAC TCTTGGAAGT 720 

GCAAGTCACT GTCTTAATTC ATATTTTTTG AAACAAGTTA GATATAAATT TTCAAAATAA 780 

AATCAGAAAC TAGAACATAA ATAAGGCTCC CTTCAAAATT TTCATTTTTC AATGTCTACT 840 

TTGAAGGGAG CTTATTCACA ATGAATTATA CTCTACAATG TTATATTGAC TGCGGGCCCA 900 

AACACAGAGA ATTTCGAAAA GAAATTCTAC AGGCAATGCA AGTTTATGTT AGCTCACACC 960 

AAGTGCAATC TTAGCGTAAC GTGACATCAT ATCTTTTGTC CAAGGTGGAC TCCATACGAT 1020 

ATTCACTTCA GTATCCTGAA TTTCAGGAAT CTCTGCTAAT ACTGTTTTAA CTTGaTCAAT 1080 

AATTTGAGGT CCCATTGGAC ATCCCATTGA TGTTAAAGTC ATATCAACTG TACATACGCC 1140 

TTCATCATCA ACATTCACTT TGTATACTAA ACCCAAATTA ACQATATCAA TTCCTAATTC 1200 

AGGGTCAATT ACCATTTCTA AT6CACCTAA GATACTATCT TTCAATQCCT CTTCCATCCA 1260 

TATCACCTCT TTAATGTCAT ATTATTCATA ATATATCAAA TATCCGACAA AACGCCAATA 1320 

AAATGCTATX3 ATGTATCTAT ATGAACTAAG CAACTTATGA GGAGAGAGAT ATGCAACCAC 1380 

ATTTAATATG TCTAGACTTA OACGGAACAT TATTAAACGA TAACAAAGAA ATTTCATCAT 1440 

ATACTAAACA AGTATTAAAT GAATTACAAC AACGTGGaCA CCAAATTATG ATTGCGACTG 1500 

GCAGACCTTA TCGTGCAAGT CAAATGTATT ATCATGAATT AAATTTAACG ACACCAATTG 1560 

TTAATTTTAA TGGCGCTTAC GTACATCACC CTAAAGATAA AAACTTCAAA ACTTGCCATG 1620 

AAATTTTAGA TTTAGGCATC GCACAAAACA TTATTCAAGG ATTACAACAA TATCAAGTAT 1680 

CGAATATTAT AGCAGAAGTG AAAGATTATG TTTTCATTAA CAATCATGAT CCAAGATTAT 1740 



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AAGAATCCCC TACCTCAATT TTAATTGAAG CCGAAGAAAG TAAAATACCT GAAATCAAAA 1860 

ATATGCTTAC TCATTTTTAT GCCGATCATA TTGAGCATCG ACGCTGGGGC GCACCATTCC 1920 

CTGTCATTGA AATTGTAAAA CTTGGTATTA ATAAAGCAAG AGGCATTGAG CAAGTTAGAC 1980 

AATTTTTAAA TATTGACCXSA AATAATATTA TTGCATTCGG TGATGAAGAT AATGATATTG 2040 

AAATGATTGA GTACGCGCGT CACGGTGTTG CTATGGAAAA TGGTTTGCAA GAACTTAAAG 2100 

ATGTAGCGAA CAATATTACA TTCAACAATA ATGAAGATGG CATTGGTCGA TATTTGAATG 2160 

ATTTCTTTAA TTTAAATATT AGATATTACT GTTAATTTAT AACTAATCAT TTTATAATAT 2220 

TTTAAAACAA TAGGAGGTAA GTTACGATGC CCAAAATAGT CGTAGTCGGA GCAGTCGCTG 2280

GCGGTGCAAC ATGTGCCAGC CAAATTCGAC GTTTAGATAA AGAAAGTGAC ATTATTATTT 2340 

TTGAAAAAGA TCGTGATATG AGCTTTGCTA ATTGTGCATT GCCTTATGTC ATTGGCGAaG 2400 

TTGTTGAAGA TAGAAGATAT GCTTTAGCGT ATACACCTGA AAAATTTTAT GATAGAAAGC 2460

AAATTACAGT AAAAACTTAT CATGAAGTTA TTGCAATCAA TGATGAAAGA CAAACTGTAT 2520 

CTGTATTATW^ TAGAAAGACA AACGAACAAT TTGAAGAATC TTACGATAAA CTCATTTTAA 2580 

GCCCTGGTGC AAGTGCAAAT AGCCTTGGCT TTGAAAGTGA TATTACATTT ACACTTAGAA 2640 

ATTTAGAAGA CACTGATGCT ATCGATCAAT TCATCAAAGC AAATCAAGTT GATAAAGTAT 2700 

TGGTTGTAGG TGCAGGTTAT GTTTCATTAG AAGTTCTTGA AAATCTTTAT GAACGTGGTT 2760 

TACACCCTAC TTTAATTCAT CGATCTGATA AGATAAATAA ATTAATGGAT GCCGACATGA 2820 

ATCAACCTAT ACTTGATGAA TTAGATAAGC GGGAGATTCC ATACCGTTTA AATGAGGAAA 2880 

TTAATGCTAT CAATGGAAAT GAAATTACAT TTAAATCAGG AAAAGTTGAA CATTACGATA 2940

TCATTATTGA AGGTGTCGGT ACTCACCCCA ATTCAAAATT TATCGAAAGT TCAAATATCA 3000 

AACTTQATCG AAAAGGTTTC ATACCGGTAA ACGATAAATT TGAAACAAAT GTTCCAAACA 3060 

TTTATGCAAT AGGCGATATT GCAACATCAC ATTATCGACA TGTCGATCTA CCGGCTAGTG 3120

TTCCTTTAGC TTGGGGCGCT CACCGTGCAG CAAGTATTGT TGCCGAACAA ATTGCTGGAA 3180 

ATGACACTAT TGAATTCAAA GGCTTCTTAG GCAACAATAT TGTGAAGTTC TTTGATTATA 3240 

CATTTGCGAG TGTCGGCGTT AAACCAAACG AACTAAAGCA ATTTGACTAT AAAATGGTAG 3300

AAGTCACTCA AGGTGCACAC GCGAATTATT ACCCAGGAAA TTCCCCTTTA CACTTAAGAG 3360 

TATATTATGA CACTTCAAAC CGTCAGATTT TAAGAGCAGC TGCAGTAGGA AAAGAAGGTG 3420 

CAGATAAACG TATTGATGTA CTATCGATGG CAATGATGAA CCAGCTAACT GTAGATGAGT 3480 

TAACTGAGTT TGAAGTGGCT TATGCACCAC CATATAGCCA CCCTAAAGAT TTAATCAATA 3540 

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GTTAGAATTA TGTTGGACTG GTACTACTAT 

GAATCAAAAA AGACATAAGG TCTTGGACTA 

TATGTCTTAT CAATTAGCCA ATACCGAATA 

CACCTAATAC CATTGTGATG ATAATTAATG 

GTTCATGTCT CTTTCTTTTA GCAATCGACA 

ACAGCATTTT CAATGTAAGC AACATATGAT 
ATATCCAAAA TCCTGAAATT AACGTCAACA 

GTTTGAAAAA TGGTGATCTG CCTTGATTTT

AAATAATCGC TAATACCCAA CTTAATATAT 
TAATTATTTA TATTATTAAA TTAAAGCTTC 

AATCATGATT TGATTATACA CGAATATATA

AACATGACAT TTTATTCCTT ATTAGAATAT 

TCTTAATATC AATATTTCAC TTTCATCTCA 

AACACCTTTT TAGAAAAGCA AGAAAAAACT 

TTAAATGATA ATTATTTAGC GATATAAGTT 

TTAACTTCAT CACCTGGTTG TAAAAATTTA 

GGTGTACCAG TTGCAATAAT ATCTCCCGGA 
ATTAATTCAT CAATTTTAAG AATCATTTCG 
TTAACTTTTG TAACAATATT TACATTTTCA 
TAAGGACCCA TTGGGCAACC GCCAGTTAAA 
CTTTGTGCTT TGCX5ATCAGT GATATCGTTA 
GCTAATGCTT TTGGAATCTT TTCACCAGAC

TAATCTAATT GATCAGTAAT ATCTTTATGA 
GATGACXSCTG CrrTTGTAAA TACATATAAT 
TGATCTTTGT AATTTCTACC AAAAGCAATC 
AATTCAATGT CATTAAATGA AATTTTATAG 
ACAACTGCTT TACGTACTTG TTCTTGAAAA 

AACAATGTTT TAGGATGGAA ATCTCCTTCT 
CATACAGCAT CTTCGCGTTT TACTTTAACG 

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CCAGTCCATT 


AAA A \J X A A 


AAPAT^TTTTA 
nnymrJ^ A X X X X *\ 




ATAATTGTCC 


ATGC CTTATG 


TCATATACTA 

X X t\ X AW X J\ 




ATTTTGATAT 


AGGsCCTAAC 


GGTA6AATGA 




CAATTGTTAT 


CCAAAACATT 


GTGTGAPTTT 
\j X u X utn^ X 4 X 




CTTCCATCAA 


TCCAACTACT 


GPA A PAPP AP 




TTGCCCCGCC 


ATTCATAAAT 
^%x xwnx/mnx 


GAn*GAATTA 




GCATAAATAA 


GCGTAAAATP 

wW\J X#WWw%X w 


ATfSTOP A A Pf2 
A X vy X VtV^AA WV9 


Anon 


TTGAAATGTT 


TAAGTATCTA 
xnnu xnx wx#% 


GPf3 AT A A ATA 


Af\an 
%\3oM 


GTAAATGTAA 


CATAPTTSATT 
wnx AVo X vnnkX X 


V«wV»V.L.LdAl. X 1 






TAfcr'Af*rm 


wl I\*4.ivVi 1 X 


4200 


TTCTACCACA 


PTT'f'TA'PA'T'n* 
wx xwx#%xnxx 


TnAr3AfV2AAfl 


4^qU 


TCTGAATCTG 


CTRTAAAATA 


R'^^^A ^ ^^^^^ 

ii X UilA W X AV- 1 


A ^ o n 
4 J20 


AAATGGTAAC 


ATT AT A A AT A 
nx XnX/i/VtXA 


Al 1 lAiUi 1 X 


A a A 


AACCAATCTA 


TATAAAt^APT 


X 1 A\J w 1 1 X 




GTCAGCGTTC 


CAATATTATP 


AATAf2TPAPT 

A/\ X t\\J X V-iAV^ 1 


A C A A 


GGTGGTTGCA 




AAPfSPPTPPT 
aaV_V>V»v, 1 VjV* X 


A C C A 


TGTAGTGCAA 


PATATTTTfiA 


rV\X X X w X X ^ X 


Aco n 

40ZU 


CCAGTGTTAC 


CATPTTGTPT 
wnx wx x^xwX 


A ATTTP A '1 Tl 'i 
A/%X X X WAX 


AC an 


GGTAATGGTA 


GTTPR' 1 'f "I"IT 

^7 X X W\J X w X X X 


AGTAAPfSATA 
avixaavwaXa 


A'7An 


cvrri'iXaKTA 


AAAATGPTTG 


ATPTTVZTTPA 

AX W X X\7X X WA 


A Ann 

40UU 


ATAATTGTAT 


AGCCX3TAAAC 


ATAATCTAAA 

x«^nx w x#i^vi 




TTACCAATAA 


CAATACCTAA 


TTPACCTTPA 

X X WAWW X X WJ% 


AQ7n 


TTTGGAATTG 


TTGCATTATC 


TPPTrtTT A A A 

XWWXwX X^l^t^X 


^ 70U 


TTTTCCACTT 


CATGATTTAA 


TTCGTTPGPA 

X X W\9 X X WV9V«A 




ACATTATTCG GAGGTGTTAC TGGTGGTAAA 5100

TCTTCAGCTT TGCCGCTATC TTCTGCTGCT 5160

TCTAAAGTAT GATTTTGTTG TAAACCAGCT 5220

GCAAAGTCAG CAAATACTTG TGTTAAATCC 5280

CCATATGAAG TTTTGTCATT ATACTTGAAT 5340
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TTCGTTATCA AATAACAAAT AAATAAGTAA GACAATTTTG AAAATGAGTT GTGTTCATTC 5460 

TGCTACAAGG ACTTTGCACT TAATCX3AAAT TATTTTTTAT TCTTTTGAAA ATCAAAATAC 5520 

TATAGTTGCA ATGTACCAAA TTTGAAGAAG TATAAATAAC CTTTAACTTC TTTATTAAGA 5580 

ATCGTTTGAA GCGTATTTTG ATAATATTTC ATCTGTATCT TATATTTATT TTTTAATTGT 5640 

GTACCAATTT CTTCATCTGT CATCCCACGG CGACGATTAA ATGCATCGGT TTTATAGTCT 5700 

ACAAAATAAT GCACACCATC TTTAACAAAG ATTAAGTCAA TCATACCTTG AATAATTGAG 5760 

ACG 5763 

(2) INFORMATION FOR SEQ ID NO: 506:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 422 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 506: 

CCATATGATT TTGTGCAATA ACTCTTTTTC TTTCTTCTTT TCGTAAAAAG TTGTACATCG 60 

CTTTTGTGTT TAAGAGACTA TTGTTTTCTT TAGGTTTTTG AACTTCACTC AGTGTATTTC 120 

TAGTTGTTAA CACTAAAATT CCAACTGTTG TATCTTTGTA TCTAGCCATA ACTTTATTCA 180 

GATGTTTGTC ATTTGTAATA ACTACGACAT AATTAAACAC TTCATAATAA TCATTAATTT 240 

GATTATCTAA TCTATCCAGC TTATCTAATT CTGTTTTAAT CTCATAGACA ACGCCTTTGC 300 

CGTTTAACAA TATAAAATCA GCaATACTTT TCCCTATGGG CATCTCAGAA AGTGCAGTAG 360 

TTCTATTAAT AGAATGTCGT CCTAGAAGGA GTkTATTAAG TATnGTGTTT TTGTAAAAAT 420 

AT 422 
(2) INFORMATION FOR SEQ ID NO: 507:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1188 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 507: 
GCTTATGTAT TCTCAAAATA TTTATGTGAT ACGCAAAGGA GACATGGTTA TTCGACCAGC 60 
ATTTGATGAT GACGATCAAA GAAACGGTAG TGAAATAATT CGGTTTGACA AAACGCGTAT 120 
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TTATCTTGGC AAGAAAGCAG AGACAAACCG CATTACTGGC ATTTCTAGTA AACCACCTAT 
TTTACTAACA CCATTATTTC CAACTTATTT TTTCCCAACA CATTCTGACA GACAAAATGA 
AAATATTTGG TTAAATATGC ATTATATCGA AAGTATTAAA GAATTAAAAA ATCGTAAATG 
TAAAGTGACA TTTATTAATA ATGAATCAAT CATTCTTCAT GTTTCATACC ACAGTTTATG 
GCATCAATAT AACAATTCCA TTTTTTACTA TTACATGGTA GATAAACAAT CTCGCATGAT 
ATCAAAAAAT CCCX5ACCAAC CAATAGATTA TAATAAAGCC ACATTGAATG TGTTTGAAGC 
ATTGACACGC TATTCTTTAT TTGAAGATAA ATAAATTGTT TATTTTTAAA ATATGCGGAA 
TGTTTTATAA ATATAGTGTA AATGTTCTGC ATATTTTTTT AAGGTATCTA TTGCAAATTA 
ACTTAATCTT GTTATAATAA TATTTGTGCT TGaTATTCAA ACACATACAA ATTAATCCAC 
AGTAGCTCAG TGGTAGAGCT ATCGGCTGTT AACCGATCGG TCGTAGGTTC GAGTCCTACC 
TGTGQAGCCA TTGGAAACGT ACTCAAGTTG GCTGAAGAGG CXKrCCCTGCT AAGGtGTAGG 
TCGCGAAAgG CGCGAGGGTT CGAATCCCTC CGTTTCCX5TT ACTTGCTAAA ATGGTATATA 
CCATTTTAnC TTTTTTGTTT ACTTATATAT AATGAATGAG AATTTCACTG TTCTTTTATA 
TCAATTTTAA AATTCTAAAA ACCTTTCCTA GATAATCTTC TCTAAGAAAG GTTTTTATAC 
TTGTTGAACT TATAATTAAT TTATTACATA GCAATATTTA CCTGTTTTTA ACTATAAAAT 
TATCACTACA TGAAATACGA TAATTCGGaT CTCTTAACTT CTCTGCaATT AATGtACTCA 
TTGgTTTCAT CGTATGATTC ATGTATAATA GCATTTkTTA AATAATTC 
(2) INFORMATION FOR SEQ ID NO: 508: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 840 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 508:
CCCAACTTCG GTTATAAGAT CCCTCAAAGA TOATGAGGTT AATAGGTTCG AGGTGGAAGC 60
ATGGTGACAT GTGGAGCTGA CGAATACTAA TCGATCGAAG ACTTAATCAA AATAAATGTT 120
TTGCGAACAA AAtCACTTTT ACTTACTATC TAGTTTTGAA TGTATAAATT ACATTCATAT 180
GTCTGGTGAC TATAGCAAGG AGGTCACACC TGTTCCCATG CCGAACACAG AAGTTAAGCT 240
CCTTAGCGTC GATGGTAGTC GAACTTACGT TCCGCTAGAG TAGAACGTTG CCAGGCATAA 300
TATTAATCCA CAGTAGCTCA GTGGTAGAGC TATCGGCTGT TAACCGATCG GTCGTAGGTT 360
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GGCGGTAACA CGGGTTCGAG TCCCGTAGgA GTCATACAAG CAGAAGTGAA ATATCGCTTC 480 

TGTTTTTTTA TTACATATTT ATTGTTGAGG AAGGTTGTCC GAGCTGGCCG AAGaGCACGC 540 

CTGGaAAGTG TGTAGGCX3TC ACAAGCGTCT CAAGGGTTCG AATCCCTTAC CTTCCGTAAA 600

GGcGCTTAAA TTGGTTTTAC CCATTTTAAG CGCTATTTTT ATTTTGGACT CAATCCCTTG 660

ATATATCTGC ATTTGAGCTA TTATCCTCAT TTTTACACTT CTTATTTATT TATATCCATT 720

TAAAATTTTT TAGCCACAAT GTGACTAATT TTTGaTGAAT AATCCTAATT TTAGtCACAA 780

GATTTTGAAG TTTAGTCACA AAACAAATCA TTCAGATTTT TTTCyATAAA TTTAGTTTCA 840 
(2) INFORMATION FOR SEQ ID NO: 509: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1373 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 509:

TTCTATGAAA CTATCAGCTG TTTCTTTTTC CTTGGTAAAA ATCGTTTTAA CGCCATGATA 60

CATGTATTCT CTTTCAaCAC CCGATTGGTC ATAAGCATCG AAAACACAAA TAATTTCATC 120

TGAAATAACT GCATTATAAT TTGCAATTGC ATCTATTAAT TGCATTCTAG CTTCTTCTAA 180

ATTCTCTTTT GCAATGGCGC TTAGCGTTGG TGATTGTCCT ATCATATTGT ATCCATCAAT 240

GATTAAGTAA CGTTCTTTCA TTATATTTCT CCAACATCAT GTCTTTTTCG AAATACTTCG 300

TACATCATTA AACTTGCTGC AACCGAAGCA TTCAAACTGT TTACATGTCC AACCATTGGA 360

ATCTTAATAT AAAAATCGCA TTTATCACTT ACTAGGCGAC TCATACCCTQ TCCTTCGCTA 420

CCAATTACAA TAGCCAATGA CATGTCCGCT TCTAGATTTC TATAATCTGT TGCATTATTA 480

GCTTCAGTGC CAGCTACCCA AAAGCCATTA TCTTTTAGTT CATCGATA6T TTTAGCTAAA 540

TTTGTCACTC GAATAACTGG TACATGTTCA ATTGCACCTG TTGAGGCTTT TGCAACTGTT 600

TGCGTTAGTG TAACTGAACG ACGTTTAGGA ATAATAACAC CATCAACTCC CGTTGCATCG 660

GCTGTTCTTA AAATTGATCC CAAGTTATGT GGGTCTTCTA AGCCGTCTAA TATAAGTACT 720

GTCAATAAAC CTTCTTTTTC TTTTTGCTGT TTTAAAAATT GATCGAAGTC AGCATATTCA 780

TATGGTGCAA TAAGCGCTGC AACACCCTGA TGTGGTGCAT TTGCTAAAAA ATCTAATTTA 840

GATTTTGGTA CAGTTTGAAC AATGATTTTT TGATCTTTTG CATTTTTTAA AATTTCATTA 900

ATTTGTTGCT TTTTAATACC TTCTTGAATC AATATCTTAT TTAyCGGATG CCCAGTAATA 960

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TTCX3TTTACT ATTGTTATTA TTTTATTTAA TAATGCCTCT AATCGTTCTT CTCTTTTTTC 1080 

TAAATAaAGA AAACCTATCA CTGCTTCTAT CkCTGAACTT TTACGATATG TTTGAACATC 1140

AGTGTTTTTA GCTTTAGTAT GACTTTTCGC GTTACGCCCT CGCTTCAAAA TATCCATTTC 1200
TTCGTCTGTA AACCATTCTT GCTCCATTAA ATATTCTAAC GTTTGCGCCT GACmTGGC 1260
AGATACATAT TTTTTAGACA TTTGATGTAG TTTATTAGGC TTACTTTTAA GCTTTAAAAC 1320

GATATAGGTA CXSTACATATT GATCTAaGAC TGCGTCnCCC ATATATGCTA AGG 1373
(2) INFORMATION FOR SEQ ID NO: 510:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 717 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 510:





TCAAGTGCCA ACTACACAAC TGATTTACAT TCTTTAGGTC AATATGTACA AGAAGGCCGT 60

CGCTTCTTAT TCGAAACAGT TGTAAAAGTA AATCATCCTA AATATGATAT TACTATTGAA 120

AAAGATAGTG ATGATCTAGA CGGATTAAAT TATTTAGCTG GTAAAACAAT CGACGAAgTT 180

AACACAAAAG CATTCGAAGG TACATTATTA GCGCATACTG ATGGTGGTGT TCCTAACATG 240

GTAGTGAACA TTCCACAATT AGATGAAGAA ACTTTCGGTT ACGTCGTATA CTTCTTCGAA 300

CTTGCTTGTG CAATGAGTGG ATACCAATTA GGCGTAAATC CATTTAACCA ACCTGGTGTA 360

GAAGCATATA AACAAAACAT GTTCGCATTA tTAGGTAAAC CTGGTTTTGA AGaCTTGAAA 420

AAAGrATTAG rAGAACGTTT ATAAAATACA TTACTTCAAA GATTAGTGAA GTTTGAAAAG 480

ATAGAACTAG ACGTTAACTA TTTAAAGCAT ATTTTCGAGG TTGTCATTAC AAATGTAAAA 540

ATGTAATGAC AACCTCGTTT TTATTTATAT GCAAGAACTA GGTTACTAGC TAaTGTGACA 600

AGAT6TTAAG A6AAAATTAA aGATAAAATA ATATCTGCCT TACAATAATA TTGTTATACT 660

ACTAGnGACT GATTTATTAG CATGATTACA TGTTAATGTT TCTTTACTTA GTAATTA 717


(2) INFORMATION FOR SEQ ID NO: 511: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2700 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

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AATCTAATTT TTCCTCCTAA TCTCGGATTG GATAGTGGTC AATTTTTCTG GCCCGCCATT 60 

TTAGCGTTTG TTCTAACTGG GATTGGTTTA CCATTATTAG GTGTGATTGT AGGTGCACTT 120

GATAAAGAAG GATATATTGG CGCATTAAAT AAAATTTCAC CTAAATTTTC AATATTGTTC 180

TTAATCATCA TTTATTTGAC TATAGGACCA CTTTTTGCAA TACCTAGAAC TGCATCTACA 240

TCTTTTGAAA TGACAATTAC ACCAATTATA CATAGCAATA GTAGTATCGC TTTATTTATA 300

TTTACGATTA TCTACTTCAT AGTCGTTTTG TATATTTGTT TAAATCCATC TAAGTTAATC 360

GATCGTATTG GTTCATTATT AACACCATTA TTATTGATTA CTATTTTAGC GATGATTATT 420

AAAGGATACT TAGACTTTAG CGGTAATAGT GCTGGAAAGG GCAATGAAGC ACTATATCAT 480

TCTAATTTTT CAAGTTTTGC TGAAGGCTTT ACACAAGGCT ATTTAACAAT GGATGCCATT 540

GCAGCAATTG CTTTTTCAAT GATTGTTGTT AATGCAGTAA AACTAACAGG CATTACTAAA 600

ACAAATCAAA TATTCAAACA AACTTTGACT GCTGGTTTAA TTGCAGCCGT AGCTTTAATT 660

TTCATATATA TTTCATTAGG TTATATTGGT AATCATATGC CAGTAAGTGA CATGACGTTA 720

GATCAATTGA AATCCAAAGA TCGAAACATT GGGACATATT TATTAACGAC AATGGCTTCA 780

ACAGGATTTG GTTCATTCGG AAAATATTTA TTGGGCATCA TTGTGGCGCT GGCATGTCTA 840

ACTACAGCAT GCGGGCTTAT TGTTGCAGTT TCTGAATATT TCCATAGAAT CGTACCTAAA 900

GTATCATACA AAGCATTTGT ATTAGTTTTC ATTTTAATGA GTTTTATTAT TGCTAACCAA 960

GGTTTAAATG CTGTTATCTC AATGTCAATT CCGGTATTAA GCATTGTATA CCCAGTAGCA 1020

ATAACTGTTG TATTATTAAT TTTAATTGCC AAATTCATAC CGACAAAACG CATTTCACAA 1080

CAAATTCCAG TTATTATCGT ATTTATATTG TCGATTTTCA GTGTTATTAG TAAGTTAGGT 1140

TGGcTGAAAA TTAACTTTAT AOAATGATTG CCTCTAAGAG CGTATTCTTT AGAGTGGkTC 1200

CCAGTAGCAA TTATTGCAAC GATATTAGGC TATCTAGTGG GCATATTTGT AAAACAAGAT 1260

CCAATTAAAT ATCAACAGGA ATAACGAATA ATATAAAAGA GGTTGGGACA TAAATCCCTA 1320

AAAAAACAGC AGTAAGATAA TTTTCAATTA GAAAATATCT TACTGCTGTT CTCTATTTAT 1380

ACAATACTTC GTATTGAATG GCTTCGCTTT CCTAGGGTGC CGTCTCAGCC TCGGTCTTCG 1440

ACTGGCACTG CTCCCTCAGG AGTCTCGCCA TTAATACTAC GTATTAACGT GTAATTTTAC 1500

TTTGAAATAC TTTAAAAAAA TAAGACACTT TGCCCAACTT GCACATAAAT GTAAAATTCA 1560

ATAAAATAAA TTTTCTGTGT TGGATCCCTT CGTATAATTT AATAAATACT ACTAAACTAA 1620

ATTAACGAGG TGCCTTATGT ATAAAAATTA TAACATGACC CAACTTACAC TACCAATAGA 1680

AACTTCTGTT AGAATTCCTC AAAATGATAA TACGCGATAT GTTAATGAAA TTGTTGAAAC 1740

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5 



AAGATATGCA TACCGTAATG ATAGATATAG TTTTAAACGT GACTTCAAGC TATATGAATG 1860

TGATGACTGT TCATCATGTT CTTTGAGACA TCAATGCATG AAGCCAAATT CGAAATCCAA 1920

TAAGAAAATT ATGAAGAATT ATAATTGGGA ATACTTTAAA GyCCAAATTA ATCAAAAGCT 1980

TTCTGAACCA GAAACGAAAA AAATCTATAG TCAAAGAAAA ATTGATGTAG AGCCTGTTTT 2040

TGGATTTATG AAGGCTATTT TGGGTTTCAC TCGAATGTCA GTTCGAGGAA TAAATAAAGT 2100

TAAACGAGAG CTAGGTTTTG TATTAATGGC ACTTAATATA AGGAA^ATAG CAGCTCAACG 2160

AGCTGTACAT TATAAAATAC ATATCAAAAA AGCTGATTTC TATCAAATAA TTAATAGAAA 2220

TCAGCTTTTT TACATTGCCT AAGAATTTAA TGTCCCAAGC CCTTTTTATC GAATAACTTA 2280

TT6TAAACCT TGTCTTTCTT GGTTATTGTT TTCGTTATTT TTTTCGTGTT TTTGTTTCCA 2340

CTCTTTTTGA GTCATTACAT CGTCAACTTG CAlXaTl'AACT TCAACTACAT CTAAACCAGT 2400

AATATATTTA ACTTGTTCTT TAACTAATTC AGTTACTTTA CGGAAGATTT TTGGTG^TGA 2460

TTCACCATAT TCTAAGATTA CTTTTAAGTC TACAGCAGCT TGTTTTTCAC CAACTTCAAC 2520

AGATACACCT TGAGTAACAT TGTTGCCACT TGAGAATGCA TTAGTGAATG TATCAGTTAA 2580

GCCACCTTTC ATATCTAAGA TACCTTTAAC TTCAOGTGCA GCGATACCAG CAATTTTTTC 2640

AACAACTTCA TCAGAGAATG TTAATTTGnT TTTGAATTGA GGCTCCTGAT TTnGTTCnTG 2700


(2) INFORMATION FOR SEQ ID NO: 512: 








(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1135 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 512:






ATCAAATACA ATAATTAAAA TAaTGGCTAA TACaCCTAaA ATaATAATcG AGTTaGAGTA 60

GCCTAAGCGA yCACCcTAAC aGTArACATT CTAGGCATAT ATTTaTCTTT AGACATTGAA 120

GCCGCTAACA TCGGAAAAGC TGTAAATCCA GTGTTGGCTG CTAATACCAA AATCATCACG 180

GTCGTTGCCT GTACAAAATA GAAGGCAGCA TTATCACCAA ATATTTGCAT AGCTAATTGT 240

GATAAAACTG TCGTTTCCGT TTGTGGcAAA ATTCCATAAA CATATGCTAA ACCAACGATA 300

CCAACTAATA AAAACGCTAA AATTGAACCC ATAGCAATTA ACGTTTTCAC AGCATTTTTA 360

GCACTTGGCT CTCTAAAATT GGTTACCGCA TTTGAAATAG CTTCAACACC CGTTAACGAT 420

GAAGCCCCTG ATGAAAATGC TTTTAATAsC AAGAATAATQ TTACTCCAGG AACCGCAGTT 480
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10 



AATATTAATA TCACTAACCC TATAATGAAA AGATATACTG GATAGGATAA TACGGTGGCA 600 

GATTCAGTTA AACCACGTAA ATTTAATATT AAAATAAAAA GTACAAGTAw ACATGCAATC 660 

AGTrCkTTAT GCCCATATAA ACTTGGGAAT GCAGCAACAA ATGCATCAGC ACCAGATGAT 720 

ATmCTAACAG CGACAGTCAG TATGTAATCG ACTAATAATG AGCCTCCTGC AAGCAATCCC 780

CATTTTTCTC CTAAATTCGT CTTGGACACC ATATACGCGC CGCCACCTTT AGGATATGCA 840

TAAATAATTT GCCTATAAGA CATAATTAAA GCAGCTAATA AGATCAGAAC AGCACCTGCA 900

ATOGGTAAAG TATACCAAGT TGCAACTGCA CCCACTACTG ATAGTGTAAT CAGTATTTGT 960

TCGGGACCGT AAGCCACTGA AGATAATGCA TCCGACGAAA GAATCGCTAA CCCTTTAAAC 1020

TTCGATATCT TTTCGTCTTT TAGTTCTCTG TTTTTTTTAG GTTGCCCTAT AATAAGtCTT 1080 

TTAAATTGAT kGAcATAAtC TCCTATTCCT TTTTATAGTT TTnGATGGAA ATCAC 1135 
(2) INFORMATION FOR SEQ ID NO: 513:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 918 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 513: 

TTATGGTAAA ACTTTAACAC AAGCATTACA TAAATCGCAA TCATTTATGT GGCAAAAACG 60 

TTGAATTTAA GCCATTTCTA TTCATTTTAC GTAATTATTA GCCGTATATG TGTAATAATA 120

CACATTTTAT TCAGATTTTT TATCGCGCTC CATTAAATCT TTTACGCATT CTTTTACTGA 180

GATATTTTCA AATAATACTC TATATAATGC ATTTGTAATT GGCATATCCA CATTTTTTTC 240

TTTAGCTAAA TGATAAACTG ATTTAGTTGT ATAAATACCT TCAACAACCA TATTCATTTC 300

AGATAATGCT TGATCCATTG ATTCACCTTG TCCAAGTTTA TATCCTAATG TGAAATTCCG 360

AGAATGTGTT GATGT6CAAG TAACGATTAA GTCACCGATA CCACCTAAAC CTAGAAATGT 420

CATAGGATCG GCACCTAACT TTTCACCTAA TCTACTAATT TCCGCTAAGC CACGAGTCAT 480

TAATGCAGCT TTTGCATTAT CACCGTAGCC AATTCCAQCT ACGATACCAC TTGCTACTGC 540

GATGATATTT TTCAATGCAC CACCAAGTTC AACACCAATC AAGTCATCAT TCGTGTACAC 600

ACGCAAATAA TCATTCATAA ATAAATCTTG CGTTAATTTA CTTACACTTT TATCTTTTGA 660

TGAAGCAGCA ACTGTAGTTG GTTGCTTGAC TACAACTTCT TCCGCATGAC TTGGCCCTGA 720

CAACACGCCA ATACCTGCAT TATATTCAGG TGAAATAGAA TCTTCAATCA TTTCTGACAC 780

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CAGCTTATCA TTAATTTGAG AAGCAACTTC TCGCATTGCT TTAGTAGGTA AAGCCATTAA 900 
GTAAATATCT GCAAATTG 918 
(2) INFORMATION FOR SEQ ID NO: 514:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 587 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 514:
CAATACTCGG TATTATAGAT AGTCCTACTA CATTTATATG GGTTTGTATC ATTACATTAA 60 
TTGCACATCA ACTTGAAGGT AATATCATTA CACCAAATGT AATGGGTAAA TCTTTAAGTA 120 
TCCATCCTTT AACAATTATC GTTGTTATTT TAGCAGCAGG AGATTTAGGT GGCTTTACAT 180
TAGTTCTGAT TGCAGTGCCA TTATATGCTG TACTTAAAAC GGTTGTTAGT AATATTTTCA 240
AATACCGCCA ACGCATTATT GATAAAGCAA ACAGTAATGT TAAGGACTAA TTCTGTGGAT 300
GTCTTTTAAG AATATAAGAT ACTATCGCAT CAAAAGTTGA AACTACAGCT TTTGAGGCGA 360
TTTTTTTGTG CATAAAAAAT CAGTCmAATG AAATATCAAA TAATTTTCCA TCAGTCCGAT 420
TATTATAAAA GCAAAAAAGC TTTGCTCACA TATATAATAA CGTGAACAAA GCTGTTGAAT 480
GATATTATTT AATTGCGTGG AATCCGCTAT CTACATGAAT ATTTTCACCT GTAACGCCAC 540

TTGATAAATC ACTTAATAAG TAAGCCGCAG TTTTACCTAC TTCTACT 587 
(2) INFORMATION FOR SEQ ID NO: 515: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 812 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 515:

GGTTCATTCC AAAAAAGTAC GCGATAATTA GCGAAGAAGA ATTTGAAAAT TTAAATGTTG 60 

TTAAACCAAA TAAAAATAAT GTnTCTGGT CAGTTATAGG AAGTTCGGTT TTGTTTGGAG 120 

TTACTTTAAG GAAATACATA CATGTTTTTG ATGTTCAATT AGATAAGCTA GTTGTAATGA 180 

TATTGTGTGC TCTCGCTTTA ATTTGTGTTA TAGTTTTTTA TTTTAACTTA AATAGAAAGC 240 

TTAAGTTAAA AGTGTTTGAT ACAAATATTG AAAAAnATAA GAGAGTTATA TAAAwAcCAA 300 
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10 



TTTCATTAAT TGCCCTTATG ACAATCGAAC CTCAAAATAT AATAATATTT ATTTATTGGA 420 

TTATQATGAC AATGCTTTTC TTTTTGTTAA ATATGACTTC GATAGGTAAT GAAAAAGTTC 480 

GCGTTATAAT QAAAAATAAT TGATTACATT TAAAATATTC TAAATGTTGT CGACACAATC 540 

CTTTTAAGAC GCTAGTAGAA TTTAAATGAC TTCTAATGTA TATGAAAGTG TATCAATATA 600 

AAACCAATTG AAAAGAAGTG GAGACATTGC TTTGTGAAAC TGAAAATATT AATAAGAATC 660 

CCAAATATAG AATTATCAAA TACAAAGATG AATATTTGAT GATTGATTTA GTAAGTACAT 720 

GGTTAGCACT CTTTTTCCCA ATGATTAATT GGCTGATTCC AAAAAAGTAC GTCAAAATCA 780 

GCGAAAAAGA TTTTGAAACT TTAAACATTG TG 812 
(2) INFORMATION FOR SEQ ID NO: 516: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 526 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 516:

GTTCTATTAT TTCTTCCAAT GGTAATGGAT TTTCGTAAAA TGATTGATAT AAGTTGATAA 60 

TCTCAATAAC TTTTAATTGA TCTGGAAACA TCGTTTTTTG GAACATTATA CTGCGATTTT 120 

CACTTTGTAA TAACTTGTCT TTATCAAATA TCTCACCAGA ATTAGCATTA ACATTACCAA 180 

TTAATATATC AATTAACGTT GACTTTCCAG CACCATTTTT TCCAATTAAA GCGATACATT 240

TACCTTGTTC AATATCGAAC GAAATATTTT TTAGAACACA TCTTTTATTA AATGACTTGT 300

TGATATTAGA TATTTGAATC AATTTAATCA CCTCTATTTT TTTCTTAATT TAATATTAGT 360

AAATTTATTA GATTTAAAAT AGAACACTTT GTCATAGATT TGAAATGACA AATGTCATTA 420

TTAGATkTAC ATAATATATT TATCGTtATT TTAATTTTGG GCAAAATAAA AAGAGCCTCT 480

ATAATCGrGc TCCTTACAAA TAaATTATAA AAttGGCGAA CTAAAT 526 
(2) INFORMATION FOR SEQ ID NO: 517: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4544 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 517:
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GGAACACCTA AAGAAAGAAA TGATGCATTA AACACAQAGG CTGATATCTA TGTAACCAAT 120

AAAGAAAATA CTAAATGGTT ATGCGATCAA TATAAAAAAG AATGGCCATT TGATATGGTT 180

GTAATTGATG AACTGTCTAC ATTTAAAAGT CCTAAGAGTC AAAGGTTTAA ATCTATTAAA 240

AAGAAATTAC CACTCATTAA TAGATTTATA GGATTAACAG GAACACCTAG CCCAAATAGT 300

TTACAGGATT TATGGGCTCA AGTTTATTTG ATAGACAGAG GCGAAAGACT TGAGTCTTCA 360

TTCAGTCGTT ATCGAGAAAG GTACTTTAAA CCAACACATC AAGTTA6CXSA ACATGTTTTT 420

AACTGGGAGC TAAGAGACGG ATCTGAAGAA AAGATATATG AACGAATAGA AGATATATGT 480

TTAAGCATGA AAGCGAAAGA TTATCTAGAT ATGCCTGACA GAGTTGATAC TAAACAAACA 540

GTAGTCTTAT CTGAAAAAGA AAGAAAAGTA TATGCAGAAT TAGAAAAAAA CTATATTTTA 600

GAATCGGAAG AAGAAGGAAC AGTTGTAGCT CAGAATGGGG CATCATTAA6 TCAAAAACTA 660

CTTCAACTAT CTAACGGTGC AGTTTATACA GATGATGAAG ATCTAAGACT TATACATGAT 720

AAGAAGTTAG ATAAGTTAGA GGAAATTATA GAGGAGTCTC AAGGCCAACC AATATTATTG 780

TTTTATAACT TCaAACATGA TAAAGAAAGA ATACTTCaAA GGTTTAAGGa AGCAACCACA 840

TTaGAGGATT CAAACTATAA AGAACGTTGG AATAGTGGAG ACATTAAGCT GCTTATAGCA 900

CATCCAGCAA GTGCAGGGCA TGGATTAAAC TTACAACAAG GTGGGCACAT TATTGTTTGG 960

nTGGACTTA CATGGTCATT GGAATTATAC CAACAAGCAA ATGCAAGATT ATATAGACAA 1020

GGACAAAATC ATACGACTAT TATTCATCAC ATTATGACCG ATAACACAAT AGATCAAAGA 1080

GTATATAAAG CTTTACAAAA TAAAGAACTA ACGCAAGAAG AATTGATGAA AGCTATTAAA 1140

GCAAGAATAG CTAAGCATAA GTAATGGAGG TATAAGATGG GAAAGGCATC ATACGATATT 1200

AAGCCAGGTA CATTTAAATA TATTGAGTCA GAGATATATA ACCTACAAGA GAACAAGAAA 1260

GAGATAAATA GATTGAGAAT GGAGATACTT AACCCAACGA AAGAGCTAGA CACTAACATT 1320

GTGTATGGAC CGTTGCAAAA AGGTGAACCA GTTAGAACAA CTGAACTAAT GGCAACAAGG 1380

TTATTGACTA ATAAGATGTT ACGAAACCTA GAAGAAATGG TCGAAGCAGT TGAAAGTGAA 1440

TACTTAAAGT TACCTGAAGA TCATAAGAAA GTAATTAGGC TAAAGTATTG GAATAGAGAT 1500

AAGAAGCTAA AGATAGAGCA AATAGGAGAT GCATGTCACA TGCATCGTAA TACAGTTACT 1560

ACTATACGAA AGAACTTTGT TAAAGCGGTA cGwtATCATG CAGGTATCAA ATAACATTGT 1620

GCAAAGATTG TGCAAAAGGC CTACAAATCT GTAGTAATAT GATAGTATCG GAAAGATGTA 1680

TAAAGTTATC TAAAAGTTAT ACGACACAAG TACACGAGGC ACATCGCTAT GCGtGTGTCT 1740

TTTGTTATGC AATCAAAGAG GTGTAAGAGA TGACCAAGCA TAATAACATT TATAAGCATG 1800
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AGATAGCATT AGATAGGGAT AATCATCTTT GTCAAATGTG TTTACXSTGAA GACATAGTAA 1920 

CAGATGCAAA CATAGTGCAT CATATTATTT ATGTTGATGA AGATTTTAAT AAAGCTTTAG 1980 

ACTTAGATAA TTTGATGTCA GTTTGTTATA GCTGTCATAA CAAAATTCAT GCAAATGATA 2040

ATGACAAAAG TAATCTTAAG AAAATTAGAG TATTAAAAAT TTAAATAAAA AAATAATTTA 2100

TTTTTATAGC CCCCTACCCA TCGGCTTAAA ATGTTTTTTC GACGGGTACC GGCGGGGGCC 2160

CTTCGCTTGC AACGCGGATA AACTTTTATG AAAGGGGGTC TTTATATGAA ATTAACAAAA 2220

AAACAGCTGA AAGAATATAT AGAGGATTAT AAAAAATCTG ATGACATATT AATTAATTTG 2280

TATATAGAAA CGTATGAATT TTATTGTCGG TTAAGAGATG AACTTAAAAA TAGTGATTTG 2340

ATGATAQAGC ATACAAACAA GGCTGGTGCG AGCAATATTG TTAAGAATCC ATTAAGCATA 2400

GAACTGACAA AAACAGTTCA AACACTAAAT AACTTACTCA AGTCTATGGG TTTAACTGCA 2460

GCACAAAGAA AAaAGATAGT TCAAGAAGAA GGTGGATTCG GTGACTATTA AAGTTTTAAA 2520

TGAACCTTCA CCAAAACTAT TAACAACATG GTATGCAGAG CAAGTCACTC AAGGGAAAAT 2580

AAAAACAAGC AAATATGTTA AAAAAGAATG TGAGAGACAC CTTAGATATC TAGAAAATGG 2640

AGGTAAATGG GTATTTGATG AAGAATTAGC GCACCGTCCT ATTCGATTCA TAGAAAAGTT 2700

TTGTAAACCT TCCAAAGGAT CTAAACGTCA ACTTGTATTA CAACCATGGC AACATTTTAT 2760

TATTGGCAGT TTGTTTGGTT GGGTTCATAA AGAAACAAAA CTGCGCAGGT TTAmAGAAGC 2820

TTTGATATTT ATGGGGCGAA AAAATGGTAA AACAACTACT ATATCTGGTG TTGCTAACTA 2880

TGCrcTTTCT CAAGATGGAG AAAACGGCGC TGAAATCCAT CTTTTAGCAA ACGTAATGAA 2940

ACAAGCTAGG ATTCTATTTG ATGAATCTAA GGCGATGATT AAAGCTAGCC CAAAGCTTAG 3000

AGAAAATTTT AGACCTTTGA GAGATGAAAT TCATTACGAT GCAACTATAT CTAAAATTAT 3060

GCCACAGGCX TCAGACAGTG ATAAGTTGGA TGGTTTAAAT ACACATATGG GCATTTTTGA 3120

TGAAATTCAT GAATTTAAAG ATTATAAATT GATTTCAGTT ATAAAAAACT CAAGAGCGGC 3180

AAGGTTACAA CCCCTTCTTA TCTACATTAC GACAGCAGGG TACCAACTAG ATGGACCACT 3240

TGTTAATAT6 GTAGAAGCX3G GAAGAGACAC CTTAGATCGA ATCATCGAAG ATGAAAGAAC 3300

TTTTTACTAT TTAGCTTCTC TCGATGATGA CGATGATATA AATGATTCGT CGAATTGGAT 3360

TAAAGCAAAT CCTAACCTAG GTGTTTCTAT CGATTTAGCT GAAATGAAAG AAGAGTGGGA 3420

AAAGGCXAAG AGAACACCAG ATGAACGTGG AGATTTTATA ACCAAAAGGT TTAACATCTT 3480

TGCTAATAAT GATGAGATGA GTTTTATTGA TTATCCAACA CTTCAAAAAA ATAATGACAT 3540

TATTTCCTTA GATGAGITGG AAGGTAGACC ATGTACTATA GGTTATGATT TATCAGAAAC 3600

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AACACATTCT TGGATTCCTA AGCATAAAGT TGAATATTCT AACGAAAAGA TACCCTATAT 3720 

AGAATGGGAA GAAGACGGAT TACTAACAAT ACAAGATAAT CCTTATATAG ACTACCAAGA 3780 

TGTTTTAAAT TGGATAATAA AGATGAATGA GCATTATGTT GTCGAAAAAA TCACTTATGA 3840 

TAGGGCGAAT GCTTTTAAAT TAAATCAAGA GTTAAAGAAT TATGGCTTTG AAACAGAAGA 3900 

AAcwAGACAA GGGGCTTTGA CCTTGAGCCC TGCaTTGAAG GATCTAAAAG AAATGTTTTT 3960 

AGATGGGAAA ATAATATTTA ATAATAATCC TTTAATGAAA TGGTATATCA ATAATGTTCA 4020 

GCTGAAACTA GACAGAAATG GGAACTGGCT GCCATCTAAA CAAAGCAGAT ATCGTAAAAT 4080 

AGATGGTTTT GCAGCATTTT TAAACACATA TACAGATATT ATGAATAAAG TTGTTTCTGA 4140 

CAAGGGTGAA GGAAACATAG mATTTATTAG TATTArAGAT ATAATGCGTT AAGGAGGTGA 4200 

ATGTTATCGC AAAAGAGAAT ATTGTCACAC GCATAAAGAA AAAATTGATA GACAATTGGA 4260 

TTGaTCAGTC AGCTTCTAAG CTTTATGACT TTAGCCCATG GAAAAATAAA TCTTTTTGGG 4320

GTGTAATCAA TAATACGCTT GAAACTAATG AAACGATATT TTCAGCTATT ACnAAGTTAT 4380 

CTAATTCGAT GGCTAGTTTG CCCTTGAAAA TGTAT6AAGA TTATAAAGTA GTTAATACAG 4440 

AAGTATCTGA TTnACTTACA GTGTCACCGA ATAATTCTCT GAGCAGTTTT GATTTTATTA 4500 

ATCAAATTGA AACAATCAGA AATGAAAAAG GTAATGCATA TGTG 4544 
(2) INFORMATION FOR SEQ ID NO: 518: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 881 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 518: 

CATGGCAGAA TATTGAAGCT GTAAAAAAAG GACATGTAAT TTCATATAAA GCAGAAGATT 60

ATTGGTTCAC AGATCCTATT ACATTAGAAC ATTTGAGAAG TAAATTAAAA AAAGAAATTT 120

TAAATAAAAA ATAATAGAAA TAAGTTGTAA AAATTTTCTT ATGCATTGGT ACTAATGTTT 180

TTAAGGAGTG ATTAAATGAA GCAACTGGTT GGAATTCCCG AATCAATGTT AATTCCTTTG 240

ATAGCTCGAG CAAAAGAGTA CGAAAACGAA AAACCAATAA TAAAAGACGC ACTATCAAAA 300

AAAATATTTG ATGGTTTAGA TGATATGTAC AAAAATGTTA CATGTGATGA CATGTCTCAA 360

ATTGGAATTA GTATACGTAC TGTGATAATA GATTGTGTTA CTAAGAGGCT TATCAAGGAT 420

AATAAAGATT TAATCX3TGGT CAATATAGGT TGTGGCTTAG ATACAAGGTT TCAAAGATTT 480

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IS 



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ACATTTTTTA AAGAAAGTAA TAGTTATAAG ATGATATCTA AATCTATGCT AGATTACAGT 600 

TGGATTGATG ATGTCAAAAA TTATAAATTT TTTAATAGTA AGTCAGATAT ATTGTTTArT 660 

ATTGAAGGTG TATTGaTGTA TTTTGATGAG AGTGTAATGA CTCAATTATT GGACACTATT 720 

ATCAAAAAGA TGGGAGATCA TAATTTGACA TTTGCGATTg AATTTTGCTC aAAAACAATT 780 

GCGAATAATA CmAAGAGACA TCAATCGGTA TCCAAGTTAT CCTCACCACC TGTTTTTAAA 840 

TATGGGTACC ATGATTTAnA AAAATTGGAT GAnATTTACC C 881 
(2) INFORMATION FOR SEQ ID NO: 519: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3122 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 519:

TGAATaaAAA TATATTAATA GATAAACACA AATGTGTCcA AATACCCCTA GAGGTATTTG 60 

ACnAGTTCCA TCCAACTGTT TAAAATACCC CTACAGGTAT TTTTAGGGAG GTTATTATGA 120 

AACAATACGG AGAAAaGTTT ATCGATGAAT TTAGTAAAGC AGAATTGGAA AAACTAGCCA 180 

AGCAAGGGCA ATTAATTGAC GTTAGAACAG AAGAGGAGTA TGCATTAGGA CATATCAATG 240 

GTTCCATACT TCATCCTGTT GATGAGATTG AGTCATTCAA TAAAGAAAAA AATAAAACCT 300 

ATTATGTAAT CTGTAGAAGT GGTAACAGAA GTGCTAATGC TAGTAAATAT TTAGCTAAAC 360 

AAGGTTATAA CGTTATAAAT CTTGATGGTG GTTATAAAGC TTATGAAGAA GAAAACGATA 420 

GTTATGATAC ACAAGAAGAA TATAAAAGTA TAGAAATTAA AGCAGATCGT AAACAATTTA 480 

ACTATCX3TGG TCTTCAATGT CCAGGGCCAA TTGTAAAAAT TAGTCAAGAA ATGAAGAATA 540 

TTGAAGTAGG TGACCAAATT GAAGTCAAAG TCACAGACCC TGGATTCCCT AGTGACATTA 600

AAAGTTGGGT GAAACAAACA AGGCATACTT TAGTTAAGCT TGATGAAAAT AACAATGGAA 660

TTAATGCGAT TATTCAAAAA GAAAAAGCAA AAGATTTAGA TATAAATTAT TCTGCTAAAG 720

GTACTACAAT TGTATTATTT AGTGGAGAAT TAGACAAAGC TGTAGCAGCG TTGATTATTG 780

CAAATGGTGC TAGAGCTGCT GGAAAAGATG TAACTATCTT CTTTACTTTT TGGGGGCTTA 840

ATGCATTAAA AAAAGTGCAA ACAGTTAATG TTAAAAAGCA AGGTATTGCA AAAATGTTTG 900

ATTTAATGTT GCCCAAAAAG AATATACGAA TGCCTCTTTC CAAAATGAAT ATGTTTGGTT 960

TAGGAAATAT GATGATGCGC TACGTAATGA AAAAGAAAAA TGTTGATTCA TTACCAACAC 1020

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TCATGGGTAT TCAGAAAGAA GAACTTAGAG ATGAAGTTGA GTACGGTGGT GTAGGCACTT 1140 

ATATTGGTGC TACTGAAAAT GCGAATCATA ATTTATTTAT CTAATTAAAT CTATTAATAA 1200 

AAGGAGTTGT TATCATGTTT TTTAAACAGT TTTACGATAA TCATTTATCT CAAGCATCAT 1260

ATTTAGTGGG TTGTCAACGT ACAGGAGAGG CAATAATAAT AGACCCTGTT CGTGATTTAT 1320

CGAAATATAT AGAAGTTGCA GATTCTGAAG GTTTAACAAT TACACAAGCT ACAGAAACAC 1380

ATATTCATGC TGATTTTGCT TCAGGAATTC GTGATGTGGC TAAACGCTTA AATGCAAATA 1440

TATATGTGTC TGGCGAAGGT GAAGATGCAT TAGGGTATAA AAATATGCCA TCAAAAACAC 1500

AATTTGTTAA ACATGGAGAT ATCATTCAAG TAGGCAATGT TAAATTAGAA 6TTCTGCATA 1560

CTCCAGGACA CACGCCTGAA AGTATTAGCT TTTTACTCAC TGATTTAGGT GGTGGTTCAA 1620

GTGTTCCGAT GGGATTATTT AGTGGTGACT TTATTTTTGT TGGTGATATA GGTAGACCTG 1680

ATTTAtTAGA AAAATCTGTT CaAATAAAgG GTTCTACAGA AATTAGCGCX5 AAACAAATGT 1740

ATGAGTCCGT TCAAAATATT AAAAATTTAC CAGACTATGT TCAAATCTGG CCGGGTCATG 1800

GTGCTGGAAG CCCTTGTGGT AAAGCATTAG GTGCCATACC TATATCTACA ATAGGTTATG 1860

AGAAAATTAA TAACTGGGCA TTTAATGAAA TTGATGAGAC TAAATTTATT GAATCATTAA 1920

CATCAAATCA ACCAGCACCA CCGCATCATT TTGCACAAAT GAAACAAGTT AATCAGTTTG 1980

GTATGAATTT ATATCAATCA TATGATGTTT ATCCTAGTTT AGATAATAAG AGAGTAGCAT 2040

TTGATCTTCG TAGCAAAGAG GCCTTTCACG GTGGCCACAC AAAAGGAACA ATCAATATAC 2100

CATACAACAA AAACTTTATT AATCAAATTG GTTGGTACTT AGATTTTGAA AAAGATATAG 2160

ATGTAATTGG AGATAAATCT ACTGTTGAGA AAGCGAAACA CACTTTACAA TTAATTGGGT 2220

TTGATAAGGT AGCAGGCTAT CGTTTGCCAA AATCAGGCAT TTCAACCCAG TCCGTTCATA 2280

GCGCTGATAT GACAGGTAAA GAAGAACATG TATTAGACGT ACGTAATGAT GAAGAGTGGA 2340

ATAATGGACA CTTA6ATCAA GCAGTTAATA TTCCGCATGG TAAATTATTA AATGAAAATA 2400

TrCCTTTTAA TAAAGAGGAT AAAATATATG TACATTGTCA GTCAGGTGTT AGAAGTTCAA 2460

TTGCAG7GGG TATATTGGAA AGCAAAGGTT TTGAAAATGT GGTGAATATT AGAGAAGGCT 2520

ATCAAGATTT TCCAGAATCA TTAAAATAAT TTAAGGATGT GGAAAAAATG AATAAGCATT 2580

ATCAAATTGT TATTATTGGT GGCGGTACAG CAGGTGTTAC CGTAGCATCA AGACTATTAA 2640

GAAAAAATCA AAACTTAAAA GAGAAAATAG CAATTATAGA TCCAOCAGAC CATCATTACT 2700

ATCAACCATT ATGGACGTTG GTTGGTGCAG GGGTATCTAG TTTGAAAAGT TCTCGTAAAG 2760

ATATGGAAAG TGTTATACCT GAAGGTGCTA ACTGGATAAA ACAGGCTGTT TCAAGTTTTC 2820

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TAGTAGCTCC AGGATTACAG ATTAATTGGT CTTCAATTAA AGGACTAAAA GAAAATATAG 2940 

GTAAAAATGG TGTTTGCTCT AACTATTCAC CTGACTATGT TAACgAAACT TGGAACCAAA 3000 

TTTCTAATTT TAAACAAGGA AATGCCATTT TTACGCATCC AAACACTCCT ATAAAGTGTG 3060 

GAGGTGCgCC TATGAAAATT ATGTATTTAG CTGAAGATTA TTTTAGGAAA CATAAAATCC 3120 

GT 3122 



(2) INFORMATION FOR SEQ ID NO: 520:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3982 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 520:

ATAnAGATAT AnATnAATAT ATTGAGGTCA AACGATGATA ATTAAAATTT TAACAATTCT 60

ATTACTACTT TGTATATTGA GCTATTTGGT TACAAATAGA AAGAAGCCTT TTCTGTTCTT 120

AAAGACACTC TTTATC5GIGTG TGGTATTTAT CTTTA.TA.CSfSA TATATTTCAC TGGCAATATC 180

TGCCGTAATT ATTTATGGTA TTATTCAATT TATCACAATT GATTTTGGTA GTTTTTTCTT 240

AATGGGTATT ATATTGATCT TGATTTCAAG TATATTCCAA TTATTTATAG TTAGATTACT 300

TTTTAGAAAA AAGAATGTCG ATTTGACAGA GGTTGTCGTT TTAsAsCATT TAATTCAATG 360

GTTCTTAGTT TACTTTGCGA TCTATCAAGC AGTAAATGAA AAAATGGACA TTAATGATAT 420

TAATATCGAC AATTTCCAAT CTGTCTTTTT TGACGTGTCT AATTTGAATT TAGTAATTCT 480

ACCAACGTTA ATCATTAGCT GGGTCACAAT ATTTAACTAT AGAATGAGAA GTTACAAATA 540

AAATCTATGA GATTATACCT TCAGACACCA ACATTCAAAT GGTGTCTTTT TTGTTGTGTG 600

GTTTTATTTT TGAAATTCGA AAAAGTAGAG GCATGAATTT TTTGACTAGT GTATAAGTGC 660

TGATGAGTCA CAAGATAGAT AGCTATATTT TGTCTATATT ATAAAGTGTT TATAGTTAAT 720

TAATAATTAG TTAATTTCAA AAGTTGTATA AATAGGATAA CTTAATAAAT GTAAGATAAT 780

AATTTGGAGG ATAATTAACA TGAAAAATAA ATTGATAGCA AAATCTTTAT TAACATTAGC 840

GGCAATAGGT ATTACTACAA CTACAATTGC GTCAACAGCA GATGCGAGCG AAGGATACGG 900

TCCAAGAGAA AAGAAACCAG TGAGTATTAA TCACAATATC GTAGAGTACA ATGATGGTAC 960

TTTTAAATAT CAATCTAGAC CAAAATTTAA CTCAACACCT AAATATATTA AATTCAAACA 1020

TGACTATAAT ATTTTAGAAT TTAACGATGG TACATTCGAA TATGGTGCAC GTCCACAATT 1080

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TCAAAATCTT GTGAGAGAAT TTGAAAAAAC ACATACTGTC AGTGCACACA GAAAAGCACA 1200 

AAAGGCAGTC AACTTAGTTT CGTTTGAATA CAAAGTGAAG AAAATGGTCT TACAAGAGCG 1260 

AATTGATAAT GTATTAAAAC AAGGATTAGT TAAATAAAAC TTCAATCGTT GCTGTTATCT 1320

GGAAATAATT AATTAAATGT TATGTTAATT TTTGTTAATG AAAAAAGTAA TCTATTTAAT 1380

GACAGGTTAA TGTAATTGTC CTGAAATTGA CTATATACTC AGTAAGTATC AATTTTAAGG 1440

AGAGCTTATA ATGAAATTTA AAAAATATAT ATTAACAGGa ACATTAGCAT TACTTTTATC 1500

ATCAACTGGG ATAGCAACTA TAGAAGGGAA TAAAGCAGAT GCAAGTAGTC TGGACAAATA 1560

TTTAACTGAa aGTCAGTTTC ATGATAAACG CATAGCAGAA GAATTAAGAA CTTTACTTAA 1620

CAAATCGAAT GTATATGCAT TAGCTGCAGG AAGCTTAAAT CCATATTATA AACGTACGAT 1680

TATGATGAAT GAATATAGAG CTAAAGCGGC ACTTAAGAAA AATGATTTCG TATCAATGGC 1740

TGATGCTAAA GTTGCATTAG AAAAAATATA CAAAGAAATT GATGAAATTA TAAATAGATA 1800

ATAAATAAAA CAGGTTGAGA CAAAAAATGG TCTTAACCTG TTTTCAATTT GCATATGT6A 1860

TAAATTCTAT ATCAAAATGC TTATGTATAA TGAATGACAT TTAAAAGTAG GGGAGACAAA 1920

TATAAATACA ATAGTTCCTA GGATTACTCT CAAAATAACT ATATCAATTA TTTACTTTGC 1980

TCTCCTATTT TTTAAAATAT GTACATGTTT AAACAATCAA AAGTGTACAA TATTAAATTA 2040

TCATTTCCAG TTCTAGTGCT ATATTGGTAG TAGTTGACTA AATGAAAATA AGCTTATAAC 2100

AAGTTTTTTC AATACTCXSTG GGGCCACAAC AGAGAGAAAT AGGATCACCA ATTCCAACAG 2160

ACAATGCAAG TTGGCGGGGC CCCAACATAG AGAAATTGGA TCACCAATTT CAACAGACAA 2220

TGCAAGTTGG GGTGGGGCCC AACACAGAAG CTGGCGAAAA GTCAGCATAC AAAAATGTGC 2280

AAGTTGGCGG GGCCACAACA GAGAGAAATA GGATCACCAA TTCCAACAAA CAATCCAAGT 2340

TGGCGGGGGC CCCAACATAG AAGCTOOCGA AAAGTCAGCT TACAAAAATG TGCAAGTTGG 2400

CGGGGCCCCA ACATAGAGAA ATTGGAACCC CAATTTCTAC AGACAATGCA AGTTGGGGTA 2460

GGACATCGAT AAAGAAATAC TTTTTCTTTA GCAATTAGTA TTTCTTATGC ATGAGCTTTA 2520

CTCATGTATT CATTTTTTAA GTACaCATTA GCTACAGCTA ATGATAAAGA ACCACTACAT 2580

AATAAATCAT TAGTGGTTCT TTATCATTTC TATCTCACTC TTTTACTGGA AGAAAAAGTT 2640

TACGTTTGTA GAACATGCCA CAATACCAAA AATAATTAAG AAAAATAAGA CGATAAGCAT 2700

GATGACACTT TTCAAACAAC CTCTATCAGT TTCTTTCGAT TTTCTTTGTT GAACCTTTTT 2760

ATAATCTTCA AGTAGTTTTG CGGCTTTTTT ATTTATATGT TTATTCATGA TGTTGACTCC 2820

TTATAATATA TGTTTAATTC ATTAAAATAG TTGAAAACAT GACTTGAAAT AAAGATATAA 2880

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AATTTGATGA TGATATTTGC TTTTTATTTT CCAAATGGAA TTTACTTAAA CTGATGCATT 3000 

AAAATATTAA TGAAGCACTA GAATACATAA ATGAATAGTa ATGGTGcACA GTATAGAATA 3060 

ATTAAGGCTA TATTAAGTAT AAATATCGTT AACTGTAAGC TATCTTTAGT TTTAATATAA 3120

ACTATTAGGA TAATCGACGT AAGAAGAATC ATATATATTA AT6ATGAAGA AGTCCATACA 3180

AAATCCGCAT CATTTGTTGT TAATAATGGG ACTATAATTA ATCCGAAATT AATCATGCAT 3240

GCTATATATA CTATAATGTT ATACACAATG TTAATTTTTG TTCACCACCT TATACTTCTA 3300

TTTTAAAAAC TTCTTTATAA TGATATATTG TTTAATGTTG AAATAATTAG ATTATCTAAT 3360

TTTCATTTGC TTTACATGTA AAAGGCTATA TATAGTATGC TCTTTATGAT TCTAAATGCT 3420

TTTTAATATT TAATGCTCAT CAACATTTGG ATTTTGAATA TTCAATTCAA AAACTTTATT 3480

AGCTACGTCA ATTGTAAAAT CAGAACCATA GTTGACATGA GCTACTTTTA ATTTTCCATC 3540

TAAATAATAG ATTGCGATTG CAACATCGTA AAATTCGTCA ATGACAAATA AACTCTTTTC 3600

GTTTGTTACA ACCTCATGCT CTCCTGAGTA TACAACGTTA ATTTCCCAAT CATTAAAAAC 3660

CATTTGTTAA CCTCCTTGAA CATTTAAATT GATTCAACTT AAGTTTAACT TATTCATACA 3720

ACTT.CGTACA ATATCTAGAT GAACATTAAT TGTATTTCTA GAAATCTTTT TCAATTATAT 3780

GTACTAATTA TACTTTTAAA TTTCTTATTT CAGTATAGTT TTAAAACGAT TTTAAAATAA 3840

TTCTGCAAAT ATATTAACAC ATAATGTGTT CAAAAAGTTT TGAACAATTT CAAAACTTTT 3900

ATATAAAGGG nTTGACAACA TGGATTCAAA TnTCTTATTT TAAAAATTAC CTCATATAGT 3960

GTCATGTTAG CCAATTTTTA AG 3982 
(2) INFORMATION FOR SEQ ID NO: 521: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1353 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 521:
AGCTTGGaTG ATTTAATaTG GtCCtTTCCC AaCcTTAGAT AATGAAAGAT tTGATAATCC 60 

TGAATATAAA GAAGCTATGA AAAAATATcA ACAGAGATTT ATGGCTGmAG ATGAGGCTTT 120 
GAAGAAATTT TTTAGTGAAG AGAAAAAAAT aAAAAATGGA AATACTGATA ATTTAGATTA 180 
TCTAGGATTA TCTCATGAAA GATATGAAAG TGTATTTAAT ACTTTGAAAA AACAAAGTGA 240

GGAGTTCTTA AAAGAAATTG AAGATATAAA AAAAGATAAC CCTGAATTGA AAGACTTTAA 300 

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GTTAGGTAAA ACATTTTATC AAAACTATAO AGATGATGTT GAAAGTTTAT ATAGTAAGTT 420 

AGATTTAATT ATGGGATATA AAGATGAAGA AAGAGCAAAT AAAAAAGCAG TTAACAAAAG 480 

GATGTTAGAA AATAAAAAAG AAGACTTAGA AACCATAATT GATGAATTTT TTAGTGATAT 540

AGATAAAACA AGACCTAATA ATATTCCTGT TTTAGAAGAT GAAAAACAAG AAGAGAAAAA 600 

TCATAAAAAT ATGGCTCAAT TAAAATCTGA CACTGAAGCA GCAAAAAGTG ATGAATCAAA 660 

AAGAAGCAAG AGAAGTAAAA GAAGTTTAAA TACTCAAAAT CACAAACCTG CATCTCAAGA 720 

AGTTTCTGAA CAACAAAAAG CTGAATATGA TAAAAGAGCA GAAGAAAGAA AAGCGAGATT 780 

TTTGGATAAT CAAAAAATTA AGAAAACACC TGTAGTGTCA TTAGAATATG ATTTTGAGCA 840 

TAAACAACX3T ATTGACAACG AAAACGACAA GAAACTT6TG GTTTCTGCAC CAACAAAGAA 900 

ACCAACATCA CCGACTACAT ATACTGAAAC AACGACACAG GTACCAATGC CTACAGTTGA 960 

GCGTCAAACT CAGCAACAAA TTATTTATAA TGCACCAAAA CAATTGGCTG GATTAAATGG 1020

TGAAAGTCAT GATTTCACAA CAACGCATCA ATCACCAACA ACTTCAAATC ACACGCATAA 1080 

TAATGTTGTT GAATTTGAAG AAACGTCTGC TTTACCTGGT AGAAAATCAG GATCACTGGT 1140 

TGGTATAAGT CAAATTGATT CTTCTCATCT AACTGAACGT GAGAAGCGTG TAATTAAGCG 1200

TGAACACGTT AGAGAAGCTC AAAAGTTAGT TGATAATTAT AAAGATACAC ATAGTTATAA 1260 

AGACCGAATA AATGCACAAC AAAAAGTAAA TACTTTAAGT GAAGGTCATC AAAAACXSTTT 1320 

TAATAAACAA ATCAATAAAG TATATAATGG GCA 1353 

(2) INFORMATION FOR SEQ ID NO: 522: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1987 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 522: 



GTCTGAGTCA GGTGCTGTTT GGTTAGATGC TGAAAAAACA AGTCCTTATG AATTTTATCA 60 

ATTCTGGATT AATCAATCAG ACGAAGATGT AATTAAATTC TTnAAATACT TTACTTTCTT 120

AGGAAAAGAA GAAATTGATC GCTTAGAACA ATCTAAAAAT GAAGCACCGC ATTTACGTGA 180 

AGCTCAAAAA ACATTAGCTG AAGAAGTAAC TAAATTTATT CATGGTGAAG ATGCATTAAA 240 

TGATGCAATC CGTATTTCAC AAGCATTATT TAGTGGTGAT TTAAAATCAT TATCAGCGAA 300

AGAATTAAAA GATGGATTTA AAGATGTGCC TCAAGTGACA TTATCAAATG ACACAACAAA 360 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 523:





TTAGAAGTCA AATCATTAnT GGCGTCnTAT CGAGTATACT ATTAACTTCA ACTATTTTAG 60

CAATTGCATA TATTTTAAT6 TGGTTTAACG GCCATATGAC aCTAACTTTG ACCTTAACQA 120

CAATAATTAC AAGCTGTTTA ACCTTATTAA TATGTAGTAT TTTTATTAAT CCACTTATAC 180

AAAAAATTAA GCAGTTTAAT ATAAAAACTA AGCAATTTGC TAACGGAAAT TACGCAAGCA 240

ATGATAAAAC GTTTAATTCA CCAAAAGAAA TTTATGAATT AAATCAATCT TTTAATAAAA 300

TGGCTTCTGA AATTACGCAA CAAATGAATC AAATTAAATC CGAACAACAA GAAAAAACAG 360

AACTGATTCA AAACTTAGCC CATGATTTAA AAACACCTTT AGCAAGCATT ATTTCATATT 420

CTGAAGGACT ACGTGATGGT ATAATCACTA AGGATCATGA GATTAAAGAG TCATACGACA 480

TATTAATTAA ACAAGCAAAC AGATTATCAA CATTATTTQA TGATATGACT CATATTATCA 540

CTTTAAATAC AGGTAAAACA TATCCCCCAG AATTAATACA ACTAGACCAA TTACTTGTAT 600

CAATATTGCA ACCATATGAG CAACGTATCA AACAT6AAAA CCGCACATTA GAAGTGAATT 660

TCTGTAACGA AATTGATGCA TTTTATCAAT ATCGAACGCC ACTTGAGCGT ATTTTAACAA 720

ACTTACTTGA TAATGCGCTA AAATTTTCAA ATGTTGGTAG TCGCATTGAT ATTAATATTA 780

GTGAAAACGA AGATCAAGAT ACTATCGACA TTGCTATTAG CGATGAAGGT ATTGGCATTA 840

TACCAGAACT ACAAGAACGT ATATTCGAAC GTACATTCAG AGTAGAAAAC TCTCGTAATA 900

CAAAAACGGG TGGTTCTGGA TTAGGCTTAT ATATAGCTAA TGAACTCGCG CAACAAAATA 960

ACGCAAAAAT CAGTGTAAGC AGTGATATAG ATGTAGGAAC TACGATGACT GTAACATTAC 1020

ACAAATTAGA CATTACGTCA TAATCCGATT TATTTATAAA ATAAAATGCA AAGACTAAAA 1080

AGAAGCTCCC ATTAATGAGG GCTTCTTTTT TTGTTTATTT AGAATAAACT TTATGGGTAT 1140

CCTTCTCATC ATTTTCAAGA CTTGAAAGAT TTGTAGCTTG AATAATATAT TTAGGACGTC 1200

CCTTAACTTC ATAATATATC CTGCCAATAT ATTCACCTAC AACACCAATT GAAATTAACT 1260

GTATGCCGCC TAATAATAAA ATAGCTGCAA TCGTTQAAAA ATATCCCGGA ATATTAACAC 1320

CAGATATCAT AATATTGATG GAATAAATAG A 1351



(2) INFORMATION FOR SEQ ID NO: 524: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 433 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 524: 

TTGTTGTCAG TTTAACGCAA CGTGTTACCG ACTTTTCAAG TACACAAcAA TATATATGTC 60

TATTCCATTT TTAGCCCcTG CCATTTTCAT CATTGGTGGT ATTATGTTGT TTATTTCAAC 120 

ATTTAATAGC TTAGATGAAA CTGCTGAAAA TAATAACAAA ATAAAGAAAC TAATGATTAA 180 

AGGACTTATC ATTATTAACA TTTCATTTAT CGTTATGATG GTTTTAACAC CATATTGGTA 240

CTTGTATTTA ATCGTCTATC TTATTTTCTT GTTGTTTTTA TTGTGGCAAA AGGTTTATAA 300 

ATTTTAATAC CAAAACTATT AAACACTTCT GATATTCTTA GTTCAAAATA TCAGAAGTGT 360 

TTTATAGTGT TATCTAGTTC AGATAAATAT TTCCTTACTT AAAAAAAGGC CCTCCTCTTA 420

TTTTGACCCC nAT 433 



(2) INFORMATION FOR SEQ ID NO: 525: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1845 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 525: 



CCCCCACTAT GATATGCTTG GCCTATTGCG TCAATCCCCT TATATTTCGG CAATAGAGAT 60

GGATGTATAT TCAATATTTT ACCTTCAAAT GAAGCTAATA AGTCTGGACC AATTAGACGC 120

ATGTAGCCAG CTAGAATAAT CCACTCTACC TTATCTTCAT TTAATAATGT TACTAAATGT 180

TGTTCATACG CTGCTTTTGA ATCAAATTGT TTTGGTTCAT TAATATAAAC AGGAATATCG 240

TGCTTTTTTG CTCTATCTAT ACAAAACGCA TTTTGATGAT CCGTATATAG CGCCGTAACT 300

TCAATATTTT CAAGTTTTCC TGATTCAACA TGCTCAACTA TATTTkCAAA GTTACTTCCT 360

GAACCTGATG CAAAAATCGC AATTTTAACC ATTGTTATAC CCCCAACAAT TCAATTGCAG 420

TTGACTCATT TTTCACAATA T6ACCAATTT GATAGGCTTC CACATTTTGT TCTGCTAAAA 480

TCTTCAAAGC GCGTGATACA TCTTTTTCAT CAACGATAAC CGTATAGCCA ATACCCATAT 540

TAAAAATGTT ATACATTTCA TTTGTGTCTA TATTGCCTTG TTGTTGTAAC CAATCAAATA 600

TTTTTGGTGT TGGAAATGAT GTAGTATCAA TTCTAGCAGC ATATCCGGCT GGCAATGCAC 660

GTGGAATATT TTCATAAAAA CCTCCACCAG TAATATGATT CATTGCCTTA ATAGAAACTT 720

CTTTTTTCAA AGCAAGTACA GGTTTGACAT ATAATTTAGT TGGTTCTAAA AAGACATCGA 780
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GCACTAAACT GTATCCATTT GAATGAATGC CACTTGACGC AAGCCCTATA ACAACTTGTC 900 

CCTCTTTCAC TTCTGAACCA TCTACATAGT CATCCTTTTC AACTGCTCCA ACAGCAAATC 960 

CAGCTACATC ATATTCGCCT TCGTGATACA TTTCACCCAT TTCAGCAGTC TCTCCACCGA 1020

TAAGTGCAGT ATTCGTTTCA ACACATGCAT CACTAATACC TTTAACAATT TGTTCAATAA 1080 

CTTCAGGAAC AACTTTGTTT GTAGCAATGT AATCTAAAAA ATATAATGGT TCTGCACCTG 1140

TCGTTAAAAT ATCATTAACA CACATTGCGA CTGCATCGAT ACCTATCGAA TCATGTTTAC 1200 

CATAGTCGAT AGCTAGTTTT AATTTCX3TAC CTACTCCGTC TGTTCCAGAA ACTAAAACTG 1260 

GCGCTGtCAT ATTTAATTGT GATAAATCAA ATGTAGCACC GAATCCACCT AAACCACCGA 1320 

TAACTTCTTT ACGCATCGTA CGtTTAACaw GaCtAGaCaT TCtTTCtACA GCTTCATAAC 1380 

CAGCATGAAT ATTTACACCA GATTGTTCAT ATGCTTTAGA CATTTAAATT TCCCTCGCTA 1440 

TCAAAAAAGT GTTTGTTTTT AGAAATATAT TGCTTTTGTC GATGACTTAA ATGCGCTTTA 1500

TAATTTGCTT CATAATCATA TAACCCTGCA GGATAATCTC CAGTGAAACT TTCTACACAT 1560 

AAGCCACTAT ATGGCGCGTC ATAATCTAAA CCAATTGATT CAATTAACCC ATCTACAGAT 1620 

AGATATGCTA ATGAATCAGC GCCAATATAA TCTTTAATTT CTTCAGGTGA TTTGCTTGCA 1680

GAAATTAATT CTGCCGTAGT TGAAACATCG ATTCCGTAAA AACTTGGAAA CATAAATTCC 1740 

GGTGATGCTA TACGCACATG CACTTJcATTT GCACCAGAAT CkTTTAACAT TTTCACAATG 1800 

CGTCGAATTG TCGTACCGCG nACAATGGAA TCATCAACAA GACTG 1845

(2) INFORMATION FOR SEQ ID NO: 526: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1141 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 526: 



TCATAAATnG TAAATGTTAA TCGTCATATA ATATTAATTT ACAACACCAT TTTGGTTATT 60

TGAAGCTTGT GGOGCTTGTT GTGTGCCACC TTGATTTTGA TTTGAGTTTT GATCTGTAGC 120

AGGTTGTTGT TGATTGCTGG AmtCACTGTT ATTAGTTGAm TCACTGTTCT CGTTAGATGT 180

CTTATCTTTA TCTGTCGTAT CATTTTGTTC TTTTCTCAAT AAACTATTAT CTAAAG6CGT 240

TAATGGTATT AATGAACCAT AATGATTAAT GACACGTTGA TCTAAGAAAT CATTTTTATC 300

ATTAATAGGT GATAATTCTA AGTCTTTACG AAGTAAGTTT GCATATTTTT GAATGcrrrc 360
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AGTTTTAATT TCAACATCAT TTGTTaAGTA CATTTTTGCT AAAGCTTTAA TTTCAGAGTT 480 

AGTTAAATTA TGCTTTGCAT TTTTACCTAC AATTTGAATC ACGTTATCAA GTTTATCAAT 540 

AGAATCAACT TCCTGTGCTT TTTGGAATAA AATCTTAATT AATTCCATTT GACGTTGTCC 600 

ACGTTTTAAG TCTGAATCAT GATGTCTAGT TCTAGCAACT GCTAAAGCCT CATCACCATT 660 

TAATTTTTGG TACCCTTTTT TAATTTTAAT CTTACCAGTA TCATCTGTGT TAGGTTCATT 720 

TAAGTCGTAT GGCACATCAT AGTATATGCC ACCAAGCTCA TTTACAGCCT CGACAAATQC 780 

TTTCATATTG ACTCTCACAT AATAATCAAC AGGTACATTC ATGGTAGCTT CTACCGAATC 840 

CATTGOGGCA ATTGGACCAC CATATGCATG TGCATGGGTA ATCTTATCGT AATAGCCAAC 900 

TTTAGGAATG TAGCTGATAG TATCACGTGG AATACTAAGC ATTCTAATTT GATGTTTTGA 960 

TTGATTAAAA GTAGTTAAAA TCATAGCGTC TGATCTAGAG TGTTCAGCAT CCTGTCCTTT 1020 

20 rr rr C TTCTT CCATCGTTAT CATCGATACC TAAGAAAAGA ATAGAGATAG GTTGTTCTTC 1080 

GGGATGACTT TATTATCTCT TAAGTTGGAT TGACGTTAGC ATTTTTGCTG TCTTGAGAAG 1140 

A 1141 
(2) INFORMATION FOR SEQ ID NO: 527: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1565 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 527:

GCACCATTAG CATTGGGCnC ACTGATTGGA GTAnCAGTTG TTGAAAATTC GGCGCCAACA 60 

AGTAAACroGG CmCaGGCaGC mATamCCCCa TATTATAcTT ATAATGGTTA TATTGGTAAT 120 

AATGCTAATT TTATTCTGGA TAAGAATTTT ATTAACGCGA TTAAGTATGA TAATGTGAAA 180

TTCAATGGTA TTAAATTAGC TAAAACGAAT ACGATAAAAA AAGTAGAGAA ATATGATCAA 240 

ACTTTTAAAG GTGTTTCTGC AAAAGGAAAC GAAGCAAGTC AATTGCAATT TGTAGTTAAA 300 

AATAATATTT CATTAAAAGA TATCCAAAAA GCTTATGGCA AGGACTTGAA AAAAGAAAAT 360 

GGTAAAACAA AGGAAGCTGA TAGCGGTATT TTTTACTATC AAAATGCTAA AAAGACATTA 420 

GGCATCTGGT TTGTCGTTGA TCATAATAGA GTTGTCGAAG TAACAGTTGG ACATACACCA 480 

TACAAAACAA GTAAATAAAA TAATGGCATA TTAAGGCTAG AGTGTGAGGA GTGATACCGC 540 

ATTCTAGTCT TTTTTATTAA ATAATAACGA TTATTGCGTC TTACATAGTT GTTTGAAATT 600 
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GATTAAGTAT ATAGAGCACT ATTTTGTATT TGTTAATATT TTCACAAAAA TAAAGCCTTG 720 

ATAAATTTTA AAATATAATT AAGCTCAATT TTTAAAATTT TATTTAGCTA CAGATAACAT 780 

TTTTAAAAAA GAAAAGAATC AATAAATAAA ATCAACGAAC AAAAAGTATA GAAATAAATA 840 

GAAATAATCA TTTACTTTTC TGAAAAATTA AATTAATATT TTATTTATAA GCTGTTTTTA 900 

AGATTTCAGG AGGAATGAAA TGTGaGGAAA TTTTCAAGAT ATGCATTTAC AAGTATGGCA 960 

GCATTAACCT TGTTGAGCAC TTTATCACCA GCAGCATTAG CGATTGATTC AAAAAATAAA 1020 

CCAGCTAATT CTGATATTAA ATTTGAGGTG ACTCAAAAGA GTGATGCGGT CAAAGCATTA 1080 

AAAGAATTGC CTAAATCCOA AAATGTAAAA AATATTTATC AAGATTAC6C TCTTACTGAT 1140 

GTAAAAACTG ATAAAAAAGG ATTTACGCAT TATACATTGC AACCGAGTGT TGATGGTGTT 1200 

CATGCACCTG aCAAAGAAGT GAAAGTACAC GCAGACAAAT CAGGAAAAGT CX3TTTTAATC 1260 

AATGGGGATA CTGATGCGAA GAAAGTAAAG CCAACGAATA AAGTGACATT AAGTAAAGAT 1320

GACGCAGCCG ACAAAGCATT TAAAGCAGTT AAGATTGATA AOAATAAAGC GAAAAATCTT 1380 

aAAGATAAAG TCATTAAAGA rAACAAAGTT GAAATCGATG GTGACAGTAA TAAATACGTT 1440 

TATAATGTTG AGTTAATTAC AGTGACACCA GAAATTTCAC ATTGGAAAGT TAAAATTGAT 1500 

GCTCAAACTG GCGAAATTTT AGAAAAAATG AACTTAGTTA AAGAAGCTGC AGAAACTGGT 1560 

AAAGG 1565 
(2) INFORMATION FOR SEQ ID NO: 528: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2870 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 528:

GGATAGGTAA ATCCATTCAA AATTGGTTGT ACTCATTTTA AATCATATTT TTTAATATAG 60 

AAACACATAA TTAAAGGAGT GATGATATTA ATGAATAAAC TTCGAGACAC TACTTTrCTA 120 

TCATATTTAT TTACTATCAT ATTGTGGGGA TCTGCATTTC CAATGATAAA GATTGCGTTA 180 

AATGATTTTA GTGCAGAATC ATTGTCGGCA TTTCGTTTAA TTTTAGCAAC AATAATTCTC 240 

TTGCCGTTCG TAATTATAAA GAAATTGCCT ACCCCTGAAC TAAGAGATAT CCCTGTTATT 300 

TTTATTTTAG GATTTTGTGG ATTTGTGATA TATCACACAG CTTTAAATTT TGGTGAAaCT 360 

TTGATTAGTG CAGGTATAtC TGGrATTCyA GTCTCTACAA COCCTATTTT TTCTAGTGCT 420 
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GCATTTATAG GAATATCCAT TATTTCAATA AGTAAAGATG ATTACACAAC TATTAATGTA 540 

TTAGGTGTTT TTATTATTTT ACTTGCATCT TTTAGTGAAA GTTTGTATTT TACTTTCCAG 600 

AAAAAATACA TAGAAAAATA CGGCTTCATC GCTTTCACAC TATaTACAAT AATGGCAAGC 660 

TCACCATTTA TGCTTATTTT TATTCCTGAA ATCATCAACG ATATACACGG CGCCACTTTT 720 

ACATCAATAG TATCGGTACT TTATTTAGCT ATATTCCCTA CTATAATTCC ATACGTTTTG 780 

CTTGCTTATA TTGTGAAGTC AGTTGGTGTC TCTGATGCAA CAATGTCTCT TTATTTAACA 840 

CCTATCGTTT CTTTATTATT ATCTTATCTG TTATTAGACG AGCTACCAAC AACCCTTGCT 900 

ATTATAGGCG GAATTATCAC TCTACTAGGC GTTAGTTTAA GTT^CTTCTT TCAAAATACA 960

TAATTATTCC AAGTCCCX3CA CCTCAGAATC CAAAAACATT CX3AGTGATAA AATTTTAAAA 1020 

ATCAAAAATA TAAAAATGAT CTAATTTCGT CAAATTTACC AATATAAATA CTAATATTTG 1080 

CAATTCACAA AGGGGTATAG TCTGAGTGTA TTCTAATACG AAAGGACTTG GTGGATATGT 1140

ATTACAGTTA TGGAAATTAT GAAGCATTTG CGCGCCCTAA AAAACCTGAA AATGTAGAAA 1200 

ACAAATCCGC TTACTTAATC GGATCTGGTC TAGCTTCACT TGCTGCAGCT TGTTTTTTAA 1260 

TAAGGGATGG TCAAATGGAA GGTTCGAAGA TTCATATTTT AGAAGAGTTA CCTAAAGCAG 1320 

GTGGTAGTCT TGATGGTGAA AATATGCCTT TAAAAGGCTA TGTTGTCCGC GGTGGTCGTG 1380 

AAATGGAGAA CCACTTTGAA TGTTTGTGGG ACTTATTCAG ATCTATCCCT TCATTAGAAA 1440

TCGATAACGC GTCTGTATTA GATGAGTTCT ATTGGCTAAA CAAAGAAGAC CCTAACTATT 1500 

CTCGCTGTCG TGTTATTGAG AAACAGGGTC AACGTTTAGT CACAGACGGA GACTTCACTT 1560 

TGACTAAAAC GGCGATTAAA GAAATTTTAG ATTTATGCTT AACGAATGAA GAAGATTTAG 1620

ATGATGTCAA AATAACAGAT GTATTTTCCG ATGACTTCTT TAATTCAAAC TTTTG6ATTT 1680 

ACTGGAAAAC GATGTTTGCA TTTGAACCGT GGCATTCTGC AATGGAAATG CGTCGCTATC 1740 

TAATGCGATT CGTTCATCAT ATTAGTGGTC TCGCAGACTT TTCAGCTITA AAATTCACTA 1800

AATATAATCA ATATGAATCT TTAGTATTAC CTATGGTTGA ATATTTAAAA TCGCATGGGG 1860 

TTCAATTTGA ATACGATGTA AAAGTCGAAG ATATTAAAAT AGATGTTACX3 ACAAGTCAAA 1920 

AAATTGCCCG AGAAATATTA ATTGACCGTA ATGGTAATGC AGAATCTATT AAACTGACTA 1980 

TAAACGATCT TGTCTTTGTG ACAAACGGTA GTATTACAGA AAGCTCTACT TATGGTGATA 2040 

ATGATACACC AGCGCCACCA ACTGACGAAT TAGGTGGTAG TTGGACACTA TGGAAAAATT 2100 

TAGCGCGACA AAGTCCTGAA TTTGGTAATC CTGATAAGTT TTGCCAAAAT ATTCCTAAAA 2160 

AAAGTTGGTT TGTTTCAGCA ACTTCTACAA CAAACAATAA AGAGATTATC GATACAATAG 2220 
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20 



TCAATGATTC TGCATGGCAA ATGAGTTTTA CAATCAATCG TCAGCAACAG TTTAAAGACC 2340 

AACCTCAAAA TGAAATATCT ACATGGATTT ATGCCTTATA TTCAGATGTA AACGGCGATT 2400 

ATATTAAAAA GCCAATTACA GAATGTAGTG GTAATGAAAT ATGCCAAGAA TGGCTGTATC 2460 

ACTTAGGTGT ATCAACTGAC AAAATTGAAG ACTTAGCAAA ACATGCATCT AATACGATTC 2520 

CTGTTTATAT GCCATATATC ACATCTTATT TCATGACGCG TGCTATCGGC GACAGACCTT 2580 

TAGTCGTCCC GCATCAATCT CAGAACTTAG CATTTATTGG TAACTTTGCA GAAACAGAGC 2640 

GAGACACTGT ATTTACAACA GAATATTCGG TTCGTACTGC CATGGAAGCT GTTTATCAAT 2700 

TACTAAATAT AGATCGTGGT ATTCCAGAAG TCATCAATAG TCCATTTGAT CTTCGCGTCT 2760 

TAATGGATGC CATATACGAA CTGAATGACC ACCAAGATTT GCGTGAGATT ACTAAAGATT 2820 

CGAAAATGCA AAAACTCGCA TTAGCAGGAT TCCTTAAAAA GATAAAAGGT 2870 
(2) INFORMATION FOR SEQ ID NO: 529: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3105 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 529:

CnGTTTAGCT GCAAAAAGTA AATAACGACA CTGTATTTGG AATTTTGCAA TTAGAAACAC 60 

TTTTGGGTGA CATTAACTCA ATTTTCAGCG AGATTGAAAG CGAATACAAA ATGTCTAGAG 120 

AAGAAATTTT AATTTTACTA ACTTTATGGC AAAAAGGTTC TATGACGCTT AAAGAAATGG 180 

ACAGATTTGT TGAAGTTAAA CCGTATAAGC GTACGAGAAC GTATAATAAT TTAGTTGAAT 240 

TAGAATGGAT TTACAAAGAG CGTCCTGTTG ACGATGAAAG AACAGTTATT ATTCATTTCA 300 

ATGAAAAGTT ACAACAAGAG AAAGTAGAGT TGTTGAATTT CATCAGTGAT GCGATTGCAA 360 

GTAGAGCAAC AGCAATGCAA AATAGTTTAA ACGCAATTAT TGCTGTGTAA GTTTAATAGC 420 

ATAAAAAGAG GTTTTCATTA AGTTGAAAAC CTCTTTTTGT TGTTGGCATT AATTTTTCAA 480 

ATGTTGACTA CTCAATCCTA AATTATAAAT AGTATAGCGC aCAAaTGCTT AAGAAATTTT 540 

TTCTATGGCA CAAATGAATG GAGCATGATT ACGTTGGTTT AAAAATTGAT ATTGCAAAAC 600 

TTGCGCATGC TTTTGATCCA AAGTACTCAA GTAATCAAGC AATGCATGCT TCTCAATTTG 660 

TCCTTCGCTA TGACCATGAT ATATAACAAG TACAATAATA CCTTCAATTG ACATTAATGA 720 

TAGCAATGAA TTAATAGCTT GGATTGTCGT GTCAGGCTTT GTCACGATAG ATTTATCACC 780 
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TATATGATGT TCAATATTTT CATGTCCATC TTTTATTAAA GAAACATGAT TGAAATCCTT 900 

AACTTTATCA CGTGTATTTT CCAAAGCTAA ATCTTGAATG TCGAAACCAT AAACATGTCC 960 

TTCTGGTACT TCTTCGGCTA AAAATAAAOT GTCATTGCCG TTACCGCAAG TTGCGTCTAC 1020 

AACAATACTT TCTGGTGTTA TATGTTGTTT AATAAGTGTT TTTGAAAAAG GGAGTATACG 1080 

TTCTAATTTC ATTGCTTCAC CTTAGACTTG TAACGCAAGC CTTGATAAGA ATTTCTACGT 1140 

GCTAATTCAG CATCGATGCC ATTTAATACT TCCCATTTAT TAACACTCCA CATTGGACCT 1200 

ACCATGATAT CTATTGGACC ATCACCGGTA ATTCGGTGAA CGATCATTTC AGGGGGAATC 1260 

ACTTCTAATT GGTCCACAAC TAGGTTTGTG TACTCTTCTT GAGTCATAAA AGTTAATAAA 1320 

CCTTTATCGT ATTGTTTTAC CATCGGTGTA CCTTTTAACA AATGAAGTAA ATGAATTTTA 1380 

ATACCTTGTA CATCCATTTG TGCAcTTCTT TGGCAGTAGC CATCATCATG TCATAGTCTT 1440 

CGCCAGGTAA GCCATTAATG ATGTGTGTAC ATACATTGAT ATTATGCTTA CGTAATTTTG 1500 

CCACACCATC ATAATAAGTT TTCATATCAT GGGCACX3ATT GATTAAATCA GATGTTGACT 1560 

GATGGATTGT TTGTAGTCCT AATTCAACCC ATAAGTATGT TCGTTGATTC AAATCTGCTA 1620 

AATATTCGAC AACATCGTCT G6TAGACAGT CAGGACGCGT ACCAATAGAT AATCCCACAA 1680 

CACCCGGTTC TTTAAGTACA GGTTCGAATT TTTCTTTTAA TACTTCAACC GGTGCATGTG 1740 

TATTTGTAAA TGCCTGAAAA TAAGCAATAT ATTTTCCTTC GTGCCATTTC TCATGCATCT 1800

TTTCTTTAAT TTCTTTAAAT TGTACTGCGA TTGAATCTGC ACGATTACCT GCAAAGTCTC 1860 

CGCTACCTGC AGCAGAACAA AATGTACATC CACCATGTGC TACAGTGCCA TCGCGGTTAG 1920 

GACAGTCAAA CCCGCCATCC AATGCAACTT TAAATATTTT TTGTCCAAAT TTATTTTTTA 1980 

AATGGTAATT CCATGTGTGA TAACGTTTGT TTTCAAAAGC GTATTGGAAA TGATTGCCCA 2040

TATGTCATTT TCCTTTCTAT AAAAAAAGAG TTCTAAGTAC AGATTTTAAC ATATTTTAAT 2100 

GTTATAGTGT TTATTATAGT TTGACAAAAA AGAGAGAGGA ACTATGAAAT ATGAATATAC 2160 

CTAAATCAGT CTGGTGGCTA GTAATTGGCA TGGCGTTAAA TATTACTGGT TCCAGTTTTT 2220 

TGTGGCCTTT AAATACAATT TATATGAAAC AAGAACTTGG AAAAAGTTTA ACTGTTGCTG 2280 

GTTTAGTGCT AATGATAAAT TCATTTGGCA TGGTTATTGG AAACTTATTA GGTGGTTCAC 2340 

TATTTGATAA ATTAGGTGGA TACAAGACGA TTTTAATTGG AACTTTCACT TGTCTTTGTA 2400 

GTACAACGCT ACTTAATTTC TTTCACGGGT GGCCTTGGTA TGCTGTATGG CTTGTAATGT 2460 

TAGGGTTTGG TGGCGGAATG ATTATTCCTG CGATATACGC TATGGCTGGA GCAGTGTGGC 2520 

CAAATGGCGG AAGACAAACG TTTAATGCGA TATACTTAGC GCAAAATATT GGTGTGGCTG 2580 
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ATCTTATTAT GTATGTTGTG TTTGCGCTTG TCGCGGTAAC GCAATTTAAT ATTGAAATTA 2700 

ATGCGAAAGT TAAATATCCA ACTCATTTAG ATATTACTGG TAAAAAGAAT AAAGCAAGAT 2760 

TTATTTCATT AGTACTAATT TGTGCAATGT TTGCAATTTG TTGGGTTGCA TATATTCAAT 2820 

GGGAGTCTAC AATCX3CTTCA TTTACACAAT CTATTAATAT TTCAATGGCA CAATATAGTG 2880 

TTTTATX5GAC AATTAACGGA ATAATGATTT TAGTAGCACA ACCATTAATT AAACCGATTC 2940 

TCTATCTGTT AAAAGGAAAC TTAAAGAAGC AAATGTTTGT CGGCATCATC ATTTTTATGT 3000 

TGTCGTTCTT TGTCACGAGT TTTGCCGAAA ACTTTACAAT ATTTGyTGTC GGTATGATTA 3060 

TTTTAACTTT TGGAGaATGT TTGTATGGCC AGCAGTTCCA ACTAT 3105 
(2) INFORMATION FOR SEQ ID NO: 530: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5532 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 530:

TAATGATTAA ACCAGATGCA GTACAAAGAA ATCTAATTGG TGAAGTAATT TCAAGAATTG 60 

AAAGAAAAGG ACTAAAACTT GTCGGTGGTA AATTAATGCA AGTACCAATG GAACTTGCTG 120 

AAACACATTA TGGTGAACAC CAAGGTAAAC CATTTTATAA TGATTTAATT TCATTTATTA 180 

CATCAGCACC AGTGTTCGCA ATGGTAGTTG AAGGTGAAGA TGCAGTTAAT GTATCTAGAC 240 

ATATTATTGG CAGCACCAAT CCTTCAGAAG CTTCACCAGG ATCAATTAGA GGTGATTTAG 300 

GTTTAACTGT TGGTAGAAAT ATCATTCACG GTTCAGATTC ATTAQAGTCT GCTGAACGTG 360 

AAATTAATCT ATGGTTTAAT GAAAATGAAA TTACTAGCTA TGCTTCACCA CGTGATGCAT 420 

GGTTATATGA ATAAAATATA AACTGTAAAC CTTTAC6ATT TATTTATAAA GGTAGAAAGG 480

GTTTTGTTAT GTGGTTAGTC ATTATGaTTA TACATAACAA GGCCCGTTTT TTATGTl'GTA 540 

GTAAATTACT TGAAAAATTT TATAGTTTTT kGGTAAcACG TAtTaAAAaG AGAGGAATAT 600 

TCTTTATCAA ATGAAACTAA ACAGAGAGAA GGGGTTGTTA AAATGAAgAA TATTATTTCG 660

ATTATTTkGG GGATTTTAAT GTTCTTAAAA TTAATGGAAT TACTATATGG TGCTATATTT 720 

TTAGATAAAC CACTTAATCC TATAACAAAA ATTATTTTTA TACTGACTCT CATTTATATT 780 

TTTTATGTAT TAGTAAAAGA ATTGATTATA TTTTTGaAGT CAAAGTATAA CAAAAGCGCT 840 

TAACATATGT ATATTTTAAT ATCATAATTT TTTTAAACGG ACTGATTAAC TTTATTAATA 900
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GATACGATTA TATTAAAACG GCTAATCATT TTTAATTAAT GATTATATGA TGCAACTGTT 1020

TAGAAATTCA TGATACTTTT CTACAGACGA ATATATTATA ATTAATTTTA GTTCGTTTAA 1080

TATTAAGATA ATTCTGACAT TTAAAATGAG ATGTCATCCA TTTTCTTAAT TGAGCTTGAA 1140

AACAAACATT TATGAATGCA CAATGAATAT GATAAGATTA ACAACATATT ATAATGTTAT 1200

CGTGGAAGTA TGAAAGGAGC GAGTGTGTAT GAGATACCTA ACATCAGGAG AATCACATGG 1260

ACCTCAATTA ACAGTTATTG TTGAAGGTGT ACCTGCAAAT ATAGAAATTA AGGTTGAGGA 1320

TATTAATAAA GAAATGTTTA AGCGTCAAGG CGGTTACGGA CGTGGACGTC GTATGCAAAT 1380

TGAGAAAGAT ACAGTAGAAA TAGTATCAGG CGTTAGAAAT GGTTATACAT TAGGTAGTCC 1440

AATTACTATG GTTGTAACCA ATGATGACTT TACGCATTGG AGAAAAATTA TGGGAGCAGC 1500

TCCAATAAGT GAAGAAGAAC GTGAAAATAT GAAACGTACT ATTACAAAAC CAAGACCTGG 1560

TCATGCAGAT TTGGTTGGAG GTATGAAATA TAATCATCGT GATTTACGAA ATGTGCTAGA 1620

GCGATCATCT GCTAGAGAAA CAGCAGCTCG AGTTGCAGTC GGTGCCTTAT GTAAAGTGTT 1680

ATTACAACAG TTAGATATCG ATATATACAG TCGTGTTGTT GAAATAGGTG GAATTAAAGA 1740

TAAAGATTTT TATGATTCAG AAACATTTAA AGCAAATCTT GATCX3TAATG ATGTTCGTGT 1800

AATTGATGAC AGTATCGCAC AAGCAATGCG AGATAAAATT GACXSAaGCTA AAAATGAAGG 1860

AGATTCAATT GGCGGTGTCG TTCAAGTTGT AGTTGAAAAT ATGCCTGTTG GTGTAGGTAG 1920

TTATGTGCAT TATGATCGTA AGTTAGATGG TAAGATTGCA CAAGGTGTTG TCAGCATAAA 1980

TGCTTTTAAA GGTGTAAGCT TTGGTGAAGG ATTTAAAGCA GCTGAAAAGC CAGGTAGTGA 2040

GATTCAAGAT GAAATTCTAT ATAATAGTGA AATTGGTTAT TATCGTGGAT CTAATCACTT 2100

AGGTGGTTTA GAAGGCGGTA TGTCAAATGG AATGCCAATT ATCGTTAATG GTGTAATGAA 2160

ACCAATTCCA ACGTTATATA AACCATTAAA TTCAGTAGAC ATTAATACTA AAGAAGACTT 2220

TAAAGCAACA ATTGAACGTT CTGATAGTTG TGCTGTTCCT GCAGCAAGTA TCGTCTGCGA 2280

ACATGTCGTA GCATTTGAAA TAGCAAAAGC ATTATTGGAA GAATTCCAAT CAAATCATAT 2340

TGAGCAACTT AAACAACAAA TTATTGAGCG CAGACAATTA AATATTGAGT TTTAACAACA 2400

AGAACAATTG AGGTGTAATC ATGAAATTAC AAACAACATA CCCTTCAAAT AATTATCCAA 2460

TATATGTTGA ACACGGTGCA ATTGACCATA TTAGCACGTA TATTGATCAG TTTGATCAAA 2520

GTTTTATATT AATTGACGAG CATGTAAATC AATATTTTGC TGATAAATTT GATGATATTT 2580

TATCATATGA AAATGTACAT AAAGTTATTA TTCCAGCTGG TGAAAAGACG AAAACATTTG 2640

AGCAATATCA AGAAACATTA GAGTATATTT TATCCCATCA TGTAACTCGT AATACAGCAA 2700
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A0C5AGGCGTG CACTTTATAC AAGTGCCAAC GACTATACTA GCGCATQATT CTAGTGTTGG 2820 

CGGTAAAGTG GGTATTAACT CAAAGCAAGG TAAAAACCTT ATCGGTGCAT TTTATCGTCC 2880 

AACTGCTGTG ATTTATGATT TAGTCTTTTT AAAGACGTTA CCATTTGAGC AAATATTAAG 2940 

TGGCTATGCA GAAGTTTATA AGCATGCGTT ATTGAATGGT GAATCAGCGA CGCAAGATAT 3000 

CGAACAGCAC TTTAAAGATA GAGAGATATT ACAGTCATTA AATGGTATGG ATAAATATAT 3060 

TGCTAAAGGT ATTGAAACGA AGCTGGATAT TGTTATTGCA GATGAAAAAG AACAAGGTGT 3120 

ACXSTAAATTT TTAAATTTAG GTCATACATT TGGTCATGCT GITGAATACT ATCATAAAAT 3180 

ACCTCATGGT CATGCAGTGA TGGTTGGCAT TATCTATCAA TTTATAGTTG CGAATGCTTT 3240 

GTTTGATTCT AAGCATGATA TTAATCATTA TATTCAATAT TTAATACAAC TCGGCTATCC 3300 

TTTAGACATG ATAACTGACT TGGATTTTGA AACGTTATAC CAATATATGC TAAGTGATAA 3360 

AAAGAATGAT AAGCAAGGTG TACAAATGGT CTTGATTAGA CAATTTGGAG ATATCGTTGT 3420

ACAACATGTT GATCAACTAA CATTACAACA TGCATGTGAA CAATTAAAAA CATATTTTAA 3480 

GTAGGTGAAT GAAATGGTAA ATGAACAAAT CATTGATATT TCAGGTCCGT TAAAGGGCGA 3540 

AATAGAAGTG CCGGGCGATA AGTCAATXSAC ACACCGTGCA ATCATGTTGG CGTCGCTAGC 3600 

TGAAGGTGTA TCTACTATAT ATAAGCCACT ACTTGGCGAA GATTGTCGTC GTACGATGGA 3660 

CATTTTCCGA CTGTTAGGTG TAGAAATCAA AGAAGATGAT GAAAAATTAG TTGTGACTTC 3720 

CCCAGGATAT CAATCTTTTA ACACGCCACA TCAAGTATTG TATACAGGTA ATTCTGGTAC 3780 

GACAACACGA TTATTGGCAG GTTTGTTAAG TGGTTTAGGT ATTGAAAGTG TTTTGTCTGG 3840 

CGATGTTTCA ATTGGTAAAA GGCCAATGGA TCGTGTCTTG AGACCATTGA AACTTATGGA 3900 

TGCGAATATT GAAGGTATTG AAGATAATTA TACACCATTA ATTATTAAGC CATCTGTCAT 3960 

AAAAGGTATA AATTATCAAA TGGAAGTTGC AAGTGCACAA GTAAAAAGTG CCATTTTATT 4020 

TGCAAGTTTG TTTTCTAAGG AACCGACCAT CATTAAAGAA TTAGATGTAA GTCGAAATCA 4080 

TACTGAGACG ATGTTCAAAC ATTTTAATAT TCCAATTGAA GCAGAAGGGT TATCAATTAA 4140 

TACAACCCCT GAAGCAATTC GATACATTAA ACCTGCAGAT TTTCATGTTC CTGGCGATAT 4200 

TTCATCTGCA GCGTTCTTTA TTGTTGCAGC ACTTATCACA CCAGQAAGTG ATGTAACAAT 4260

TCATAATGTT GGAATCAATC CAACACGTTC AGGTATTATT GATATTGTTG AAAAAATGGG 4320 

CGGTAATATC CAACTTTTCA ATCAAACAAC TGGTGCTGAA CCTACTGCTT CTATTCGTAT 4380 

TCAATACACA CCAATGCTTC AACCAATAAC AATCGAAGGA GAATTAGTTC CAAAAGCAAT 4440 

TGATGAACTG CCTGTAATAG CATTACTTTG TACACAAGCA GTTGGCACGA GTACAATTAA 4500 

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AAACTTGTTA GGGTTTGAAT TACAACCAAC TAATGATGGA TTGATTATTC ATCCGTCAGA 4620

ATTTAAAACA AATGCAACAG TTGATAGTTT AACTGATCAT CGAATAGGAA TOATGCTTGC 4680

AGTTGCTTCT CTACTTTCAA GCGAGCCTGT CAAAATCAAA CAATTTGATG CTGTAAATGT 4740

ATCATTTCCA GGATTTTTAC CAAAACTAAA GCTTTTAGAA AATGAGGGAT AATATAAAAT 4800

GGAAGATATC TATAAATTAA TAGACGATAT CAATCTACAA AAACTAGAAA ATTTAGACTC 4860

TCGTGTTAAT GAAGCAATAA CTACTGACAA CGATGACGCA TTATTTATTC TAGGAGAGAC 4920

ACTTTACAAT TTTGGATTAA TGCCaCAAGG TTTGGAAGTA TTCCGCGTGT TATATCACAA 4980

ATATCCAGAC GAAAGTGAAT TGCTGATTTA TTTTATTGAA GGTTTAATGT CTGAAAATCA 5040

AACTGAGGAA GCGTTAGAAT ATTTAAGTTA TGTTGAACCA TCACCTGAAA AGTTGATGTT 5100

AGAAGCAGAT TTATATCAAC AAATTAATAT GATGGAAGTT GCTATTGATA AATTACAAGA 5160

AGCACTTGAA CTAGAGCCAA ATGATCCAAT AATCCATTTT GCATTGGCTG AAATGTTATA 5220

TTATGATGGT CAATATTTAC GTGCTACCTC TGAATACGAA ACCGTTTTAG AAACTGGTGA 5280

ATATCAAGTT AATGGTGTAA ACTTATTCTC TCGTATGGCA GATTGTAGTT TACAAAGTGG 5340

kAACTATAGT GATkCcGATt CgCTTATACG ATGrAATTAA TGAAGATGAA ATGACTTCAG 5400

AAGATTATCT CAAAAGAGCC ATTTCTnACG ATAAAAATGA CATCACTCAA GAAGCAATTA 5460

AAATAATGAC TACATTACTT TCTAAAGATC CTGATTATAT TCAAgGCTAC TTGTATTTAC 5520

aATCaTTATA TG 5532

(2) INFORMATION FOR SEQ ID NO: 531:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 942 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 531:

AATTGGGTTA TACTATAGGT AAATTTAAGG AGGTAAGAAA ATGGATAAAA AAGAATTAGC 60

GAAATTTATA GGCAATAAAA TCAGATACTA TAGAACCaAA TTGaACTTAA CTCAAGATCa 120

ACTTGGAGAA AAACTCmACa CTAAAAArGC TACTATTTCA AATTATGAGA CAGGGTACAG 180

AACTCCTAAA CAAGATGATT TGTTTGAAAT TGCTCATATT TTAAATATCA GTATCGATGA 240

TTTGTTTCCT ACAAGAAATA ATAAAAAAAA CGACATCACT TCCATATACA ACAAACTCAC 300

ACCTCCCCGC CAAGAAAACG TACTTAACTA CGCAAATGAG CAATTAGATG AACAGAATAA 360
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
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AACTGGTGCT GGCATAGGAG AAGAATTATA TGATGACATA TTGCATGAAG AAGTATTTTT 480 

TAAAGAAGAC GAAACGCCAT CAAATGCTGA TTTTTGTATT TTAGTTAATG GTGATTCAAT 540 

GGAACCTATG TTAAAACAAG GAACATACGC TTTTATTAAG AAAGAAGATT CTATTAAAGA 600

TGGTACAATT GCACTCGTTG TATTAGATGG AGTAAGTCTT ATCAAGCGTG TAGATATATG 660 

CGAAGACTAT ATTAATTTGG TATCTCTAAA TCCGAAGTAT GATGATATCA AAGTCGCTTC 720 

GTTTAGTAAT ATTAAAGTAA TGGGCAAAGT TGTATTGTGA TTAATAGCGC CTATATGGCA 780 

CTTTAATATA AAAGACGTCT ATTTCAGCAG TGTTTAAAAG GAGTTTATAA TGAAAATAAC 840 

TAATTGCAAA ATAAAAAAAG AAACTATAGT ATATGAAGTT TTAACTAGTG GTAATCAACC 900 

ATTCACTTAT GAGTTACCTA AAGATTTATC GTCACATAAT GC 942 
(2) INFORMATION FOR SEQ ID NO: 532: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 417 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 532: 
TGGnAAATGC AAACCAAAAT ATGaTCCTCG TGcAAGTTGA AGCGGGACGT TATGAAGAAT 60 

GGGTAAAGAA TGGTTATTTT AAACCGTCAG AAGATAAATC AAAAGAAACA TATACAATTG 120 
TTATCCCGCC ACCAAATGTA ACTGGTAAAT TACATTTAGG ACATGCATGG GATACGACTT 180 
TACAAGATAT CATTACACGT ATGAAACGTA TGCAAGGATA CGATACGTTA TACTTACCAG 240 

GTATGGATCA TGCTGGTATT GCGACACAGG CAAAGGTAGA AGCTAAATTA AATGAACAAG 300 

GAATAACTAG ATATGATCTT GGTCGTGAAA AGTTTTTAGA ACAGGCATGG GATTGGAAAG 360 

AAGAGTATGC GTCATTTATT CGTGCGCAAT GGGCTAAATT AGGTCTAGGT TTAGATT 417

(2) INFORMATION FOR SEQ ID NO: 533:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 733 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 533: 
GATCCTGAAC CCGCATTTGT TTCCACTAAA ACAGTATGCC CACTTTCTAC TAAAGCGTGC 60 



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ATTTTCATAC CATCCACCTC CATAATCATC TTAACGCGAA CATTTTGAAA GCGCAATCAA 180

AAATCCACAA AATTCTAAAG GTTATTACAC TGACTTTTCC GAAAATTGTG GTAAAATATA 240 

ATTAAGAAAG AACAAGGAGG CACTTACTAT GATTACTTAC AAAAATATTT TAATCGCAGT 300 

TGACGGTTCA CATGAAGCGG AATGGGCATT TAACAGAGCA GTTGGTGTTG CTAAACGTAA 360 

CX3ATGCGAAG TTAACAATTG TGAATGTAAT TGATTCAAGA ACGTATTCTT CTTATGAAGT 420 

TTATGATGCT CAATTTACTG AAAAATCTAA GCATTTTGCA GAAGAATTAT TAAATGGTTA 480 

TAAAGAAGTA GCTACTAACG CTGGTGTTAA AGATGTAGAA ACGCGTCTAG AGTTTGGyTC 540 

TCyTAAATCT ATCATTCCTA AAAAGCTTGC ACATGAAATT AATGCAGACT TGATTATGAG 600 

TGGTACATCA GGCTTAAATG CCGTGGAAAg ATTTATTGTT GGTTCTGTAT CAGAATCTAT 660 

CX3TTCGTCAT GCGCCATGTG ACX3TGTTAGT TGTTCGTACT GAAGAGTTAC CAGCAGACTT 720 

CCAACCACAA GTT 733
(2) INFORMATION FOR SEQ ID NO: 534: 
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SO 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6060 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 534: 

TGATGATCCT GAAGCGCTAT TGGATAATTA CAACACTGAA GATGTTGATG CACACAATTA 60 

CAATAATATA AATCATGTTA TTTTTGCCTG CGATGCGGGT ATGGGTTCTA GTGCAATGGG 120 

TGCArGCATG TTACGTAATA AATTTAAAAA GGCGGGCATT AATGATATTA CAGTTACAAA 180 

TACTGCGATT AATCAATTGC CAAAAGATGC TCAATTAGTT ATCACTCAGA AAAAACTAAC 240 

TGATCGT6CT ATTAAACAAA CACCAAATGC CATCCATATT TCAGTGGATA ATTTCCTTAA 300

TTCACCAAGA TATGAAGAAC TTTTAAATAA TCTAAAAAAA GATGATCAAG CATAATAATT 360 

AAATAAATTA AAAAATGGAG GATACCGCCA TGTTATTGAG TACACGTGAA AAAGAAATGA 420 

TAGCCCtAtT GATTAAGTAC CACGGtCAAt ATATCACTAT ACACGACATT GCTCAGCAAC 480 

TTGCGGTGTC CTCTCGTACT ATTCACCGTG AATTAAAAGG TGTTGAAGCA TATTTAACTT 540 

CATTTTCATT AACTTTAGAA CGCGCAAACA AAAAAGGGcT ACGCATTGCT GGCACAGATT 600 

CTGATTTAAA CGATTTGAAG CAATCGATTG CACAACATCA AACCATTGAC TTATCTGTTG 660 

AAGAGCAGAA AGTAATTATT ATATACGCTT TGATACAAGC CAAGGAGCCA GTTAAACAAT 720 
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TAGAGCTTGA TTTAAATAAG TACCAACTAT CTTTATCTCG AAAGCGTGGC GAAGGCATTT 840 

ACTTGGTAGG TACTGAATCA AAQAAACGTG AATTTTTAAG TCAATTAATG GTGAATAACT 900 

TAAATAGTAC TAGCGTTTAT TCAGTAATTG AAAATCATTT TGTCTTTCAT TCATTAAATC 960 
AAATCCACAA AGACTTTGTT GACTTAGAGC GCATTTTTAA TGTTGAAAGA CtATTaATGG 1020

ACTACCTAAG TGCCTTACCC TACCAACTTA CCGAATCAAG TTATTTAACT TTAACTGTCC 1080 

ATATCGTGCT CTCCATTTCA CGTATAAAAA ATGGAGAGtA TGTCGCATTA AACGATGATA 1140 

TTTATGATTC TGTACAAAAC ACATTTGAAC ACAAAGTaGc AAGCGAACTT GCTGATAAAC 1200 

TTGGTCAAAT ATATGACGTC ACGTTTAATC AGGCAGAAAT TGCTTTCATT ACTATCCATT 1260 

TACGTGGAGC TAAACX3AAAA AATCTTAATO ATACATCATT AAATAATCGT TGTGAAGAAA 1320 

ACAAAATTAA AGCGTTTGTT AACAAAGTAG AAATGATTTC CGGTATGACA TTTGCAGATT 1380 

TGGATACTTT AGTAGATGGA CTGACGCTAC ACCTTAATCC TGCAATCAAT CGTTTGCAAG 1440 

CTAATATCGA GACCTATAAT CCGTTAACAG ACATGATTAA GTTCAAATAT CC7UW5ACTAT 1500 

TTGAAAATGT AAGATTAGCT TTAAATGATT GTTGGCCTGA TTTGATTTTT CCAGAGAGTG 1560 

AAATTGCTTT TATAGTtTTA CACTTTGGTG GCTCGATTAA AAACCAAGGT AATCGATTTT 1620 

TAAACATATT AGTCGTTTGC AGCAGTGGTA TGGGAACTAG TCGTCTATTA TCAACTCGTC 1680 

TAGAGCAAGT TTTTAGTGAG ATTGAGCGTA TTACACAAGC ATCAGTCAGC GATTTGAAGT 1740 

CACTAGATTT AAGTCAATAT GATGGCATTA TTTCTACTGT GAATTTAGAC ATCGACTCCC 1800 

CCTATTTAAC GGTAAACCCA TTATTACCAG ATAGTGATAT CAGTTATGTC GCACAGTTTT 1860 

TAAATACAAA GTCTACGTTC CAAGAGACGC ATGATAAATC ATCAAACATG ATTGATAAGG 1920 

ATGATGTTCA TGTTGAAACG AAAGATGTTG ATGGCAACAC ATCTTTTGAA AATGAACAAA 1980 

CTTCATACTT AACTTCAGTT TTCGAAAAAC ATTTAAGTGA CGAAAAATCA GAACAATTAT 2040 

TGCATCATAT GCGTTCGGGT TTAACTTTGC TTGATTCAGT GAAAATAGTT AGTACCGAAG 2100 

TTAAACAGTG GCAAACATAT ATCGCAGATT ATCTATATCA ATGCGATGTA ATAAACGATC 2160 

CAACGTCArr CGCTGAACTA CTAGAGCAAC GATTGATTGA CAATCCAGGA TGQATATTAA 2220 

GTCCATATCC TGTTGCAATA CCACACCTAA GAGACAATAT GATTAAACAC CCTATGATTC 2280 

TAATCACAGT TTTAGAAGAA CCGTTAACAT TGCCTAGTAT TCAAAATGAC AATCAAACAA 2340 

TTAAATATAT GATTTCCATG TTTATTTCTG ACAATGATTT TATGGCATCA CTGGTAAGTG 2400 

ACTTGTCCGA ATTTTTAAGT TTGAAATTAG AATCTATTGA TACTTTTATG GAAAATCCAC 2460 

AGGAACTTGA AACATTATTA AGAAACAAAT TTTTAGAACG AATTAAAAAA CAATTTATTT 2520 

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TAACAGCCAA AATGAAGCAA TTGAAAAAGC AGGTAAAGCC TTAGTTGATA GTGGTGCTGT 2640 

AACAGATGCT TATATTCAAG CAATGAAAGA TCGTGAGCAA GTCGTATCAA CATTTATGGG 2700 

AAATGGCTTA GCAATTCCTC ATGGCACAGA TGAAGCTAAA ACAAATGTGA TTCACTCAGG 2760 

TTTAACATTA TTACAAATCC CTGAAGGCGT TGACTGGGAT GGCGAAGTAG TTAAAGTTGT 2820 

CGTGGGAATT GCTGGTAAAG ATGGCGAACA TTTAGACTTG TTATCTAAAA TTGCAATTAC 2880 

ATTTAGCGAA GAAGAAAATG TGGATCGTAT CGTTCAAGCA AAATCTGCAG AAGAAATTAA 2940 

ACAAGTATTC GAGGAGGCAG ATCCATAATG AAAGCAGTTC ACTTTGGTGC TGGTAACATA 3000 

GGTCGTGGTT TCATTGGTTA TATTCTgCAG ACAACAATGT TAAAGTAACA TTTGCAGACG 3060 

TCAATGAAGA AATCATTAAT GCTTTAGCTC ATGATCATCA ATAC6ATGTT ATTTTAGCTG 3120 

ATGAGTCTAA AACAACGACX3 CGCGTGAAtA ATGtTGATGC AATTAATTCA ATGCAACCTT 3180 

CTGAAGCGTT GAAACAAGCA ATTCTAGAAG CTGATATTAT TACAACAGCT GTTGGTGTTA 3240 

ACATACTACC TATTATTGCT AAATCTTTTG CGCCTTTCTT AAAAGAAAAA ACAAACCATG 3300 

TTAATATTGT TGCTTGTGAG AATGCTATTA TGGCAACTGA TACATTGAAA AAAGCAGTAC 3360 

TTGATATTAC TGGCCCTCTT GGTAACnaTA TTCATTTTGC TAACTCAGCA GTTGATAGAA 3420 

TTGTACCATT ACAAAAGAAT GAAAATATAT TAGACGTTAT GGTTGAGCCA TTTTACGAAT 3480 

GGGTTGTTGA AAAAGATGCA TGGTATGGTC CAGAACTAAA CCATATTAAA TATGTTGATG 3540 

ATTTAACACC ATATATTGAG CGTAAATTAT TAACTGTGAA TACAGGACAT GCATATTTAG 3600 

CGTATgCTGG tAAATTTGCA GGTAAAGCTA CAGTTTTAGA TGCAGTTGAA GATAGTTCAA 3660 

TTGAAGCTGG CTTACGCCGT GTTTTAGCTG AAACTAGTCA ATATATTACT AATGAATTTG 3720 

ATTTTACTGA AGCGGAACAA GCTGGTTATG TTGAAAAAAT AATAGATCGT TTCAACAATT 3780 

CTTATTTATC TGATOAAGTA ACAOGTGTCG GACGAGGTAC ATTACGTAAA ATTGGCCCTA 3840 

AAGATAGAAT TATAAAACCA TTAACATATC TTTATAATAA AGATTTAGAA CGCACTGGTT 3900 

TATTAAATAC AGCTGCATTG TTATTGAAGT ATGATGATAC AGCAGACCAA GAAACTGTTG 3960 

AGAAAAATAA TTACATTAAA GAACACGGTT TAAAAGC6TT TTTAAGTGAA TATGCTAAAG 4020 

TTGACGATGG CTTAGCCGAT GAAATAATTG AAGCGTACAA TTCACTTTCA TAATTTATTG 4080 

AGCTTTGTTT GAAACAAGAA GTTTCCAACG TTATTCGTTA ACAATCAGTA ATAATGTAGT 4140 

AGTTCCCTTG AATTAACAAT ATTAAATTTC TGAACATAAA AAATACTCCC TTCAACATAG 4200 

ACACTTAACT TGTGTTATGT ATGAAAGGAG TATTTTTGCG TTAATAATTT GTTTTATTTT 4260 

CGAGCCACAG CCACCTATTC AATGGCTATT GGTCATTACT AAAACAAATT CATATTAACT 4320 
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TTGAATAAAT TTTATTCTTC AGTTTGTTGG TCTTTCTTAG TGAATCTTCT AATTAAGAAT 4440 

GCCATACCTG CACCTAGAGC TAATTCAGCA TATGGTAAAT CX5TCATTATG TGACATACCA 4500 

GTATCTG6TA AAGTTTTAGC TTGTTGTTTA GCTTTATTAA CTTTTCCTTG TTGAGCTGAT 4560 

TTTGTCTTAG CTTGGTGGTC GTCAGTGTTA GTTACATTAA GCATATCTTG ATTAGCACTA 4620 

TTGCTTCCAT TTGAAACTGT AGCTGGAGAT GCATTGGCAC CGTCGTTTTG CGTAgyTTTA 4680 

TTGTTTGCAG CTGAACCAAC TGATTTTTGC GTATCATTAG TATCTGCTGT TGCCGTATCA 4740 

TCrrTTTGGC TAACATTAGT TGAAGTCATT TTTTCTTTTG CTTCAGAAGA TGCAGATGTT 4800 

GATGGTTTAT TCGAAACTTC AGTATCAGCT TTGCTTGGCG ATTTATCTGC TTCGTTAGAT 4860 

GCAACGTTAG TTTCAGACTT AAGTTGTCCT GCATCAGTTT GATTTGTCGT ACTTTCTTCT 4920 

TTATCTTTTG ATGTATTAGA AGGTACATTT GGTTCTGTTA TGTCTGCTGA AGGCAATGTT 4980 

TCAGTTGTtG ATTCAACCAT ACTTTGATTT GTTGAATCAC TACCATCTTT TTCTGCCTTA 5040 

GCTTTATTTT CAGATTTTGG TTGTGCAACC TTGTCATTAG TTGATTGAGA TTCAGCACTA 5100 

TTATTTACTT CAGCATTTTG TTTTGAATCA TTTACAGATG CATTATCTTT GCTATCAGCA 5160 

GATGATGCTG CTTCTGTGCT CX3CAGTTGTT GGAGCCGTTG CTGTTGATCC TGTTGGTGCA 5220 

TTCTCGTTTG TTGCTGTAGT TGTACTATTG TTATTTGTTG TGCTTTCTGC TGGCGTTGCA 5280 

TTATCAGTTT CTGTTACAGG TTTATCAGTT GTGCCGTTAT TAGTTGATTC TACTTCTGGT 5340 

TTACTAGTTA CATCGTTATC CATTGTOGGA CTGTTTGTTG ATGCATCTAC ACTAGAATTO 5400 

TTATTAGCTT GCGGTTTATC ATTTGCATCA TCAGTTGCTG ATGTTGCTGT TGTTTCACCT 5460 

GTTGCCGCAT CACTATTATT TGGTGTTGTC GGAGAAGCGT CTGCTTTGCC ATTAGCTGTC 5520 

GTCTCAGATA CGTTAGGTTG TCCAGTATTT TCTGGTGTTG CATTAGCATT TGAATTTGCT 5580 

GTTGCATCAT TATTATCTAT ACCATTATTA GTATCATTAG CATCTGGATC ATTCTGAGGC 5640 

ACAATCGCTT CAATTGCAGG TATCGTTACA TTTTGTAATT CAGCAACTTC TGCATTTGTT 5700 

TGTGTTTTAT CTAATTTATC AGCAAATCTG TCAAAATATC TACCTAAATC CGTACGTGCA 5760 

ATTTCTTTCG CCGATGCATC TGCATCTGCA TTTTTAATTA TTTCTATTTG CTTGTTAACC 5820 

ACTTCTCTGA TTGCTTCCAA AGCATTTTTC TTAACTTCAG GATTAATACG TTGTGCTTTA 5880 

AGTTGTTCAA GCGCACTATT TTTGACAGTA GCGATTTCTG CATTTGTAGT TTGATCAGAA 5940 

ATATCTTCAG TTGCTTTTGA TAAAATGTCT TCTAAAGCAT TCGTAAACGC TTCTTTTTCT 6000 

TCAGTTGTAG CATCAGCGTT GACATTTACA CCTGCTTCAA TCTGGTCTAG TGCAGTTTCT 6060 
(2) INFORMATION FOR SEQ ID NO: 535: 




EP 0 786 519 A2 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 977 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 535: 

AACAAAGCCT TCCAATTATC TGCGTCGGTA GAACAAGTAT TAGCAACTTT ATCACCTACG 60 

CTAAACAGTC CTTACGATTT ATACGGCACG ACAAAAATGC TAGATATTAC ATTCGATTCA 120 

TTTGAACATG ATGGTACAAC GTACCCTGTC GACTATGCTA CGTTTGAAAA TGATTATGAA 180 

GATAATAAAG ATCCTGAGTT TAGACGTAAA AGTTTCAAAT CGTTTAGCGA TGGGATTCGA 240 

AAATATCAGC ATACTACCGC GGCTACATAT AATATGCAAG TACAACAAGA AAAAATTGAA 300 

GCTGATTTAC GTGGATTTGA ATCAGTCATC GATTATTTAT TACATAGTCA AGAAGTAACG 360 

CGTGATATGT TTGACCGTCA AATCGATATG ATTATGCGTG ACTTGGCACC AGTTATGCAG 420 

AAATATGCTA AACTTTTACA ACGTATTCAC GGATTAGATA ACATGCGTTT TGAAGACTTG 480 

AAGATTTCTG TAGACCCTGA TTATGAACCA GAGATTTCAA TTGAAGACTC AAAAAATTAT 540 

ATTTTCGGTG CGTTAAGTGT TTTAGGTGAT GACTATACAA ACATGTTACG TGAAGCATAC 600 

GATCAGCGAT GGaTTGATTT TGCACAAAAT AAAGGTAAAG ATACAGGCGC ATTTTGTGCA 660 

AGTCCATACT TTACACATTC ATATGTGTTT ATTTCTTGGA CTGGTAAAAT GGCTGAAGCA 720 

TTTGTCTTAG CACATGAATT AGGTCATGCA GGTCATTTTA CATTAGCTCA AAAACATCAA 780 

CCATATCTTG AATCAGAAGC ATCAATGTAC TTTGTTGAAG CCCCTTCTAC AATGAATGAA 840 

ATGTTGATG6 CCAATTATTT ATTTAACACA AGTGATAATC CAAQATTTAA GCGTtGGGTT 900 

ATTGGCTCAA TTTTATCTAG AACATATTAT CATAATATGG tACCCmTTTA TTAGAAGCnG 960 

CTTATCCACG GGGAGTG 977 



(2) INFORMATION FOR SEQ ID NO: 536: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1440 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 536: 

AGACAGTGAT yGaATTTCAT TTACAGTACA CAAATCATCG AAAAATTGGT AACATTCTTC 60 
TCTATTTTCT AACGTTAgwA TTgCATCAAA CAATTCATCT AACGCTGCAC CTCGTAATTT 120 
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AATCTATTAA AGTGTAGCGa TTTATATTTT ATTAAATCTG AATCGTTACT TTATTTAATT 240 

TTATGCTAAT CCAGCGCXSTT CGAAAATAGT GTCAACTTGa TTCAAATGAT GTTTAGGATC 300 

GAAACATTCA TCCAATTCTT CTTTTGTTAA AACACTTGTA ATAGACTCAT CTTGTTCGAT 360 

TAATTCACGG AACGGTGTTT TCGTTTCCCA AGATATCATC GCTTTTGGTT GTACTTTGTC 420 

GTATGCrrCT TCACGAACCA TACCTTTATT AATTAATGCT AATAAGACAC GTTGTGAGAA 480 

AATCAGACCA AATGTTTTAT CTATGTTATT ACGCATATTA TCTTCAAATA CAGTTAAACG 540 

GTCCACAATA TTTGTGAACG ATTCAATGCA TAATCTAGTG CtATTGTAAC ATCTGGTAAC 600 

ATAATACGCT CAGCAGAAGA ATGAGAAATA TsTCTTTCAT GCCATAATGG CACATTCTCA 660 

TAAGCTGTAG TAATATAACC ACGAATGACT CTTGAAATAC CTGTGATATT TTCAGAACCA 720 

ATTGGATTTC GTTTATGAGG CATTGCAGAT GAACCTTTTT GGCCTTTTGC AAATGCTTCT 780 

TCAACTTCTC TCGTTTCGGT TTTTTGAAGG TTACGTATTT CAACGGCAAA TTTTTCTAGT 840 

GATGTCGCGA TTAATGCTAA TGTCGCAATA TAGTATGCAT GTCGATCGCG TTGCAATGTT 900 

TGCGTTGATA CAGGCGCTGT GCCAATACCT AAATGTTTAC ACACATAACT TTCTATTTCA 960 

GGAGGAATGT TAGCAAAAGT ACCTACTGCA CCACTCATTT TCCCTACTTC AATTTCTTCT 1020 

CTTACTTGTT TGAAACGTTG TAAGTTACGT TGCATTTCCG TGTACCACAA TGCCATTTTG 1080 

ACACCAAATG TAGTTGGTTC TGCATGCACT CCATGTGTAC GTCCCATCAT CAATGTATAT 1140 

TTATAATTTT TTGCTTTTTC AGCTAAAACG TCGATAAATC TTTCTAAATC TTTTTCAATA 1200 

ATGTCATTTG CTTGTTTAAT AACGAAACTT AAAGCTGTAT CTACAACATC AGTAGAAGTT 1260 

AAACCATAAT GTACcACTTA CGTTCTTCAC CTAGCGTTTC AGAAACTTGT CTAGTAAAGG 1320 

CTACAACATC ATGGCGCGTT TCTTGTTCAA TTTCTTGTGC ACGTTCGACA TTTACCTTTG 1380 

CGTTTTGACG AATTTTTrGT ACGTCAGCTT TCGGTATATG TCCTAATTCA CTCCATGCTT 1440 
(2) INFORMATION FOR SEQ ID NO: 537: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 784 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 537: 
GATAAATCTA TnCAGTTTCC GTCCAAATAT CtGCaCCTAA AGCTTTTAaG TGTTCTACAA 60 
TATCTgTATA ACCTCTATAA ATATGTTTAA CATTGTAAAT TGTAGTTACA CCCTCAGCAA 120 
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CATGTAATGT TGATGGTTTT ATCGTTGCTG TGCCTTCGTC AACTTCAATA TTTGCACCCA 
TGCGCTTTAA TTCTTCAACA TGTTTAAAAC GCTCCGGATA AATCGTATCA GTTACAAATG 
AAGGACCATT TGCCATAAAT AATAATGGTG TAATAGGCTG TTGCAAATCA GTAGCAAAAC 
CTGGATATAC TAGTGTTTTA ATATCAACAA ATTGATATGG CGCATTATTA TTGATGCGAA 
TTCTTTCGTC TCTTACATCA ACATTCACAC CTAATTCACT AAATTTAGCA GTTAATGTrT 
CTACATGTTT CGGAACAATA TTATTTAATA TAACATTTTC TCCACATGCT GCAGCGATAC 
ACATATATGT GCCTGCTTCA ATTCTATCAG GTATAACTTG ATACTCAGAA CCATGTAATT 
CTTTGACGCC ATTGATTTTA ATTGTTGATG TACCCGCTCC CTTAATATTA GCTCCCATAC 
TTGTTAAGAA GTTAGCAACA TCAACTACTT CCGGTTCTTT AGCAGCATTT TCAATTACAG 
TTTGTCCTGT TGCATAAACT GCAGCTAGCA TAATGtnAAT TGTTGCACCT ACGCnAACCA 
TATC 

(2) INFORMATION FOR SEQ ID NO: 538: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3733 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 538: 



CAATCTCCAC CAATGACACG TCGTCATTCG TGACCTCATA CCArACAAAA AACAGTCTCG 

CAATCAAGAC TGTTTTCCAC TCAATATATT CATCCATTAG CGTAATAGAT TATTTGACTT 

CTGTAGCTAC AAAGaTTTTA CGTTTTTCCC AAACGCCTGT cTTTTCATTG TAATCATCAC 

AAGTAATTAA TGTTAATTGT TTATCTTTAC            ATCTAGAACT CCTACATCTG 

TAGGCTTAAC ATCTCTTATA CTTGTCATTT TATACTTACG TGTTTCATTA CCAACTTTAA 

AGTACACCAT ACTACCTTTT TTGGCTGCTT TAAGATTT6T AAATTGATAG TTCGGACGGT 

CAATGAAAGT GTGTCCTGCA ATTGAAATAT TTTGATCATC TAGTGATTCA TTTTCTTCTG 

CAAAGCTTAC ACCTCTATTT AATTGTTCAG GTGTTGCTGG TCCTGGATAT ACTGGTTCTT 

TAATATCAGC ATCTGGAATT TCAATATAGC CTGCCACTTT CGATTTATCT TTCGGAATTT 

GAGGTTTAGC TTGCTGCTTT TTATCTTTAC TCGCCTGTTC TTTTACATTT TTATCATATT 

GTTCAATCTT TTCATCTTTA TCTTTATCGT GAAGATAATT ATCGATATGT GGTTTAGCAA 

ACAAATATGC TGCCACTAGG aTAAGTACCA CACCAGCGAT TGTCATTAAT CGATTTGTCC 

AGTATACCAT TAATTTCAAA ATGACTCATA GCAATTCATT TTATACTATA AAATTTACAT 840 

GTATACTTTT ACGTTAGATT TCATTACACA TATTTGCATT CAAATAACGA AACGCTTTTA 900 

ATAATTACTA AGGGGGAATT GATATGATTA GATACGCTAA AAAAGAGGAT TTAAACGCTA 960 

TATTAGCGAT ATACAATGAT GCCATTATCA ATACTACAGC TGTTTATACT TATGAACCAC 1020 

AAACCATAGA CGAACGTGTC GCATGGTTTG AAACGAAACA ACGTAAGCAT GAGCCTATCT 1080 

TTGTATTTGA GGAAAATGQA AGTGTCTTAG GGTTTGCGAC GTTCGGTTCA TTTAGACCTT 1140 

GGCCAGCATA CCTATATACA ATCGAACATT CTATTTATGT CGATGCTTCA GCTAGAGGAA 1200 

AAGGTATTGC TAGTCAATTA CTACACCATT TAATTGTOGA AGCAAAAGCT AAAGGTTATC 1260 

GTGCGCTAGT TGCAGGCATT GATGCTTCCA ACAAAGCGAG TATTCAGTTG CATCAAAAGT 1320 

TTGCTTTTAA GCATGCCGGC ACACTGACCA ATGTAGGTTT TAAATTTAAT AGATGGTTAG 1380 

ATTTAGCATT TTACGAATTA GATTTACAAG ACTAGTAATG TTTGAATCAC ATAATATAAA 1440 

CAAGACAACC ATGTTAATTC CCTTAACATA ACAAGCCAAC ATATAAAATT TTAAACTTCT 1500 

CAGGGGAGTG GGACAGAAAT GATAAAGAGC CACTAATGAT TTATTATGTA GTGGTTCTTA 1560 

CACATTAGCC ACAGCTAATG TGTAcTTAAA AATAGGAATA CATGAGTAAA ACTCATGCAT 1620 

AAGAAATACT AATTTCTATA GAAAAAGTAT TTCTTTATCG TCGTCCCACC CCAACTCQCA 1680 

CATTATTGTA AGCTQACTTT TCGTCAgCTT CTGTGTTGGG GCCCAAAAAG CTTGTTACAA 1740 

GCGCATTTTC GTTCAGTCAA CTACTGCCAA TATAACTTTG TAGAGCATAT TACATTGATT 1800 

TACATTGTCC CTTTTATTTA TTCTTTTCAA ATACTATCCC CATAGCTTTG ATTTAACGCT 1860 

TTTTCTCAAT AACAAAACGA ATATAGTAGA ACATGAAAAC GATAATCATG CTGAGCGATA 1920 

AAGATTTAAA TAATAGATTG ACCCACGTTC CCTCAGTCGT ATATCCATAT GTAATCGTTG 1980 

TGTTAATGAT GAATGCTATA AAGATGATTG ATAGTCTTAG CATATCATCA CTCCTTTTAA 2040 

GTTATTTTAG ATATACGGGG GCGCTTTTGC AATCACTATT TTGATTAGTA TGCATTTTCC 2100 

ATAAATCTTT CAACTTCTTC AGAGATAATT AAGAAGCATC TATCTGGTAC TAATGATCCA 2160 

GACAGATGCT TCTrriTi'AT CAATAmTA TTGTTATCTC ATTAATTATT TTTAACCATA 2220 

TCTTCAGCTG TGCCAAAGAT TTTACGTTTA ATTGCTTCGC CAGTTGGTGT GCCTGCTAGT 2280 

CCACCCAATC CAGTTTCACG TAATGATGCA GGAAGGTTAC GACCAACCTT ATCCATTGCT 2340 

TCAATAACTT CATCAACAGG GATTCTACTT TCAATACCTG CTAATGCTAA ATCTGCTGAA 2400 

ATTAAAGCGT TACCCGAACC AATTGCATTT CTCATAACAC AAGGAATTTC AACAAGTCCG 2460 

GCTACTGGAT CACAAACTAA ACCTAATAAA TTACTTATCG CTAATGCCAT AGCGTGCCCG 2520 

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GAACCAACTT CAGxiTTGGCA GCCACCTGTT GCACCAGCTA CACTTGCATT GTTTGCTACG 
ACACGCCCAA ACAATGCTGA AGTGAATAAG AAATCAATCA TTTGCTCTTC TGTTAAATCA 
TGTGTTTTTT CTAATTTAAA AAGTGCACCG GGAATGGTAC CCGAGGAACC AGCTGTTGGC 
GTtGCACAAA TAATACCCAT CGCAGCATTG ACTTCATTTG TTGCAATGGC AcCtTTGcTG 
CGTCAATCAT TTCATATCCA GACAAAGCAT GATGTGTTTC ATTATAATCA CGTAGTTTAG 
CAGCATCATG ACCAGTGTAG CCCGTTACAC TTTCAACCCC ATCACCTGTC GTCCCTTTGA 
TTACTGCGTC TCGCATGACA TCTAAATTTT GTrTCATTTG CGCTCGCACT TCATCACGTG 
ATTTACCGCT TAATTCCATT TCTTCTTTAA CCATGATATC CGCAAATGAC ATATTATTTT 
CTACGGCATA ATCTATAGTC TCTCTAATTG AATCAAACAT GTTTATTCCC CCTCTAATTT 
ATATAGGAAA CGTTTACGTC ACTGTATTTC TCTTTAATTG TATTTAATGT TGATTCTGAG 
ATTGCITTAT TTAATGGTAT TACAACCAAG CATTTATCTT CATCTATCTT AATAAATTCA 
TCTTTACAGT CTAATTTCAT ATCGTTGATA TCATTGATGA AATGATTTAC TTGTGCTTTA 
GTCATATTTC CGTCAACAAC TAAAATTGGT AATCCATGAT TTAAATCTAC TTCTAGTCCA 
TTTATAT6AA TACCTTTAAT TTTAATTGTA CCACCACCGA TTGAAATACC GATAATTTCA 
ATGTAGCGAC CATCATTACG AGATGATTTG ATATAAGCAC AGTTTGGATG TTGACCAATA 
CTATCGCCTT CTTCTTCGAT GATATCTATT TTAATACCAT CATCAGCTGC AATTTCTAAT 
GAAGATTTAA TTCGGTTATC AAATGTTGAA TATCCCATTG CTCCACCCAC AATAGCGACA 
TCTGTACCAT GTCCTTGGTG TGTTTGAGCA AATGATTCAT AATAATGTAT TTCAATATTT 
TTATATCTCC CAATATTGCG CGTGCTGAAT TCXrCCTTTAC TGCACCAGCC GTATGAGAAC 
TTGAAGGGCC CAT 

(2) INFORMATION FOR SEQ ID NO: 539: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 525 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 539: 
TGGCTGTCTT CTCTATGAGT GTAGTAAGTA AGTTAACGGA TTTAACGCCA AGGCAAATAC 
GTTACTATGA AACACATGAA CTCATCAAAC CTGAAAGAAC AGAAGGTCAA AAACGTCTGT 
TCTCACTCAA TGATTTGGAA AGATTACTAG AAATTAAATC ATTATTAGAA AAAGGATTTA 
AAGAGATAAG AAAAAAGATG ATTGTAGATG CCACGCAAAA GCCTATTGGA GArACTTTGC 300 

CAATAAATCG TGGTGATTTA TCCCGATTTA TTAAATAAAA TTTGGAGGAT TTTAAAATGC 360 

CAAAACGTAC TTTCACTAAA GACGACATTC GTAAATTTGC AGAaGAGGAA AaTGTAaGaT 420 

ATTTAAGATT ACAATTCACT GATATTTTAG GAACAATTAA AAATGTTGAA GTGCCTGTAA 480 

GCCAATTAGA AAAAGTACTT GATAACGAAA TGATGTTTGA CGGTA 525 



(2) INFORMATION FOR SEQ ID NO: 540: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1408 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 540: 

TTGATTTGCT ACAAAGTATC TnCTCATTTC TGTATCCTGA AAAATCTTTA GTGTAATAAT 60 

GTTGTTCAGT TTTAATATTT TCAGTCATAG TTGACTACCT CCGTATATTT TGATTTAATT 120 

AAGTTGTATA TTTTGATGAA CACTTATTGT TACTTGTTGG CGCAAGTAGC AGTTTTTTCA 180 

TTCTTCATAA AAGTATTCCT TATAGAATAT GAATGTTGCG ATACTTGOGA ATCCTGCAAT 240 

TGaCCATGCT GtAGTGAAGT ATAGAAACGG CATAAGTACA ATCGCTAAGA CTGTGAAGCA 300 

TAGTACTGCT ACTAGGTAGC TTTTATAAAT GTTACTCATT TTCTTTTTTC AACTCCTCCA 360 

TTATTCTCTG GTCTGATAAG TCGTGATAAG GGAATTTTTT CCtAGCTAAT TGGACTGGTA 420 

TTCTGCCTCG TATCGCAATG TATCCTTCAT CTTCAAGCTC TTTATTCAGT TCTCTTATTA 480 

TTTGTCCTGC TTTGGATTTA GAAACAGATA AAATTACCgC AAGTTCTTTA GCTTGCAAAC 540 

TATTTTTCAT CATATCTTTT CCTCCTTTAA AATAACTGTT GATTCTCTGG GTTATCTGCT 600 

TCGTAATTAT CTGCAATAAT ACTTTTAGCG AAAAAGTCCA AACTGACCTT ATATAGGTTG 660 

TTCATAGATT TCTTTACGTT AACCCCTTCC TCAAGTACAT AAGGCACCCT AAAATCATTT 720 

ATAAACAGTC CGTTTTCGTC TAAAGTAACG GTTGGTAATT CAGGTTTGTT CCGTCTATAA 780 

ACTTCTCCTA GTGTAGGTTT TTGCTTTTCA GCTTGTTTAG TGAAGTCGGA AAATGCCTTA 840 

AGTAGTTTTA TTCCTGAATC AGGATCACTG TGTCGCTCAA TCQTTTCTGC TGTAGACTCT 900 

TTACTAAAAT CATTTCTATT GATTACAGGC TTTCTCGTAT TTCGTTCAAT CTTCCAAACC 960 

TTCCACGTCA CAACTGCCAT TGTGGTGAGG AGGGTTGTTT TGTATAGTGC GTTCATTTGT 1020 

AATTCCTCCT ATTAAGTTGT TTGTTCAATT GTGTGTGTTA TTCTTCTTCG TCTAAATCAA 1080 
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CGACTTCX3TA AGTGTGCTCA ATCTCGCCTG cATATGTCAC AGTAAGAGTA TCTTTGTX3T6 1200 

TGTATGTTTG ACTTTTGTTy TCTtTAACTG CATAAAGTGT TAATACTATA TTGTTTAGCT 1260 

TTyCTTTTTG TTCTGGTGTC ATTTACGCTC CCCCTAmATT AGCyTCATAA CCGAATTCAG 1320 

TCATGATTTC ATGTATTTTC AATCTGCCTT TTTGTGTCCA TCTAGTTTGT AAAACTGTGT 1380 

CTTCTCTGCC ATCAGAACGC ACAATTGT 1408 

(2) INFORMATION FOR SEQ ID NO: 541: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 432 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 541: 

GTTCGTCAAT GATTTTTATC CGATGCTATG AGCATTaTCA AATACAAAAT GCTCTCTTAA 60 

AAGCAGTTAT TGACTGAAAA TCTACTTCTA AGAGAGCACT TTATTTAATT ACTTAAGAAA 120 

TCTTGAAATT TCAATATACG ATGTTTATGA TAAGTCGCTT ATTTCATCTT TAGGCTTGTT 180 

ATTAGTAAGT AGTTTAATAC CACTGATTAA CCATAAAGCA AATGTAATTA TGTTACCACT 240 

TATTACAGCT CCAATAATCA ACAATATACC ACTCATTTTT TTGTTTTTAG ATGCTTTAAA 300 

CATACCGATT GCACCTAAAA TAATTGAAAT GATTCCAAAT ATGAATAGGG ATAAGAATAA 360 

TACAGTGAAA ATTGCTGCTG CTGTTTCTGC ATCAACTGGG nCAACCTCAC CATTAACTGT 420 

TGTTGGACAC AT 432 
(2) INFORMATION FOR SEQ ID NO: 542: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2426 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 542: 

ATAATCATGA AGTnGCTAAA nCGCCAAATA ATGATGGTTC TGGACATGTT GTGTTAAATA 60 

AATTCCTTTC AAATGAAGAG AATCAAAGCC ATAGTAATCG ACTCACTGAT AAATTACArG 120 

GAAGCGATAA AATTAATCAT GCTATGATTG AAAAATTAGC TAAAAGTAAT GCCTCAACGC 180 

AACATTACAC ATATCATAAA CTGAATAOGT TACAATCTTT AGATCAACGT ATTGCAAATA 240 
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GTATAAAAAG TCAACGAAAT ATTATTTTGG AAGAACTTGC ACGTACTGAT GATAAAAAGT 360 

ATGCTACACA AAGCATTTTA GAAAGTATAT TTAATAAAGA CGAGGCAGTT AAAATTCTAA 420 

AAGATATACG TGTTGATGGT AAAACAGATC AACAAATTGC AGATCAAATT ACTCGTCATA 480 

TTGATCAATT ATCTCTGACA ACGAGTGATG ATTTATTAAC GTCATTGATT GATCAATCAC 540 

AAGATAAGTC GCTATTGATT TCTCAAATTT TACAAACGAA ATTAGGAAAA GCTGAAGCAG 600 

ATAAATTGGC TAAAGATTGG ACGAATAAAG GATTATCAAA TCGCCAAATC GTTGACCAAT 660 

TGAAGAAACA TTTTGCATCA ACTGGCXSACA CGTCTTCAGA TGATATATTA AAAGCAATTT 720 

TGAATAATCC CAAAGATAAA AAACAAGCAA TTGAAACGAT TTTAGCAACA CGTATAGAAA 780 

GACAAAAGGC AAAATTACTG GCAGATTTAA TTACTAAAAT AGrAACAGAT CAAAATAAAA 840 

TTTTTAATTT AGTTAAATCG GCATTGAATG GTAAAGCGGA TGATTTATTG AATTTACAAA 900 

AGAGACTCAA TCAAACGAAA AAAGATATAG ATTATATTTT ATCACCAATA GTAAATCGTC 960 

CAAGtTTACT AGATCGATTG AATAAAAATG GGAAAACGAC AGATTTAAAT AAGTTAGCAA 1020 

ATTTAATGAA TCAAGGATCA GATTTATTAG ACAGTATTCC AGATATACCC ACACCAAAGC 1080 

CAGAAAaCGt TAACACTTGG TAAAGGTAAT GGATTGTTAA GTGGATTATT AAATGCTGAT 1140 

GGTAATGTAT CTTTGCCTAA AGCGGGGGAA ACGATAAAAG AACATTGGTT GCCGATATCT 1200 

GTAATTGTTG GTGCAATGGG TGTACTAATG ATTTGGTTAT CAOGACGCAA TAAGTTGAAA 1260 

AATAAAGCAT AATTATATTG GGGGAAGAGC ATCTATATAT TTTTTTAAGT ATATAAGACG 1320 

TCTTATTTCC CCTTAATTTA TTGTGAAGTA TATGCAAAAT GCAATGAATA GATTGTCCAT 1380 

CATTITAACG TTATAATGAA TTTAACGACT TAGAACTACA CAAGTAAAGG AGAATGAAGA 1440 

TGTCTCGAAA AACGGCGCTA TTAGTTTTGG ATATGCAAGA AGGTATAGCG AGTAGTGTAC 1500 

CTAGAATAAA AAATATTATT AAAGCGAATC AGAGAGCAAT TGAAGCAGCA AGACAACATC 1560 

GAATACCAGT CATTTTCATA CGTTTAGTGT TAGATAAGCA TTTTAATGAT GTCTCCTCGA 1620 

GTAATAAAGT GTTTTCAACA ATTAAAGCTC AAGGATATGC GATTACTGAA GCAGAT6CAT 1680 

CTACACGAAT ACTTGAAGAT TTAGCACCAC TAGAAGATGA GCCGATTATT TCTAAGCGAC 1740 

GCTTTAGCGC ATTTACAGGT AGTTACTTGG AAGTTTATTT ACGTGCAAAT GATATTAATC 1800 

ATTTAGTATT AACGGGTGTC TCTACAAGTG GAGCTGTATT GAGCACGGCA TTAGAAAGTG 1860 

TAGATAAAGA CTATTATATT ACTGTTTTAG AAGATGCTGT TGGTGATAGA TCAGATGATA 1920 

AACATGACTT TATTATTGAA CAAATTTTAT CACGCTCATG TGACATTGAA TCCGTAGAGT 1980 

CATGGAAAAG TAGTTTATAG TTAATATAAC GTCAATTAAA GCTCGGCAGT AATGTTTGAG 2040 

GAGGAACATT TGAACATAAA ATAATATATT TATATAAAAC GACCgAGGCG TTCGAACTGA 2160 

ATGCCCTCGG GTTTAATTGA ATAGAAATCG GACTTATGAA CGAAATATGT TTAAGTCGAA 2220 

CTCCTTGTTT ATACTTATAA ATTTTACGGG TTTAATATAA TACTTATTTA CCTGTAATAT 2280 

ATGCATAATT nCTTCAGTCG GTCAGCCTGT CGTTGCATAG TTCCTATGCA GCAAATGCAT 2340 

ATCCTAATCC TTTAACATTG GCATTnCTGC AAATGAACGC ATAGAATCCA TTCACTGTTA 2400 

ACTTTTTnCA ACAAATGTCT nACATG 2426 

(2) INFORMATION FOR SEQ ID NO: 543: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1874 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 543: 

GAGTTGGGGA ATGTGCTCAA AATATGCGGA CTTTATGCAT TyCGGAATTG sCCaATTGCA 60 

GCTTTAAGCT ATGGTCAAAA AAAGAGGGTC ACTATAGCAT CTGTTTTAGT CTTAAATCCG 120 

GAAATAATCA TATTGGATGA ACCGACTGCT GGTCAAGATT TCTATCATTA TAATGAGATA 180 

ATGTCATTTT TAATTGAACT AAACAGACAG GGGAAGACGA TTATTATGAT TACGCATGAT 240 

ATGCATTTAT TGTCTGAGTA TAGTTCAAGA ACAGTTGTAT TATCAAAAGG TCAAGTCGTT 300 

GCTGATACCA CGCCAgTATT GGTTTTAAAT GATAAAAAAA TCTGTGAGAT TGCATCATTG 360 

AGACAAACAT CGCTATTTGA AATGGCCGAA TATATAGGGA TTAGCGAGCC ACAGAAATTA 420 

GTACAATTAT TTATTAACCA TGATAGGAAG GTGAGACGcC AATGAATCAA TATAATACTA 480 

TAGGTTTTCA CCCGGGAAAT AGTCGTATTC ATCAATTAAA TGCGACTGTT AAACTTTTAT 540 

TCTTATTAGT TGTTTCTATT TCTGCAATGG TGACTTATGA CACAAGATAT TTAATTTTAA 600 

TTAGTGCTTC ATCTATTTTA TTGGTCAAAT ATGCTCATAT TGAATGGAAA CAAGTTCGCT 660 

TTGTTGTTAA ATTCATTCTG TTTTTCACAA TAtTAAATAT TATTGCCGTG TACATATTTG 720 

ACCCTGAATA TGGTGTGAAG ATTTATAATC AGCGTACAGA GTTAGTCAAT GGTATTGGTC 780 

GATTTACGCT AACATCACAG GAATTATTCT ATCTTTTTAA TCTAATATTA AAATATATTA 840 

GTACAGTTCC TTTAGCGTTA ATATTTTTAT TCACAACGAA TCCGAGTCAT TTTGCTGCAA 900 

GTTTAAATCA GCTAGGTGTG AATTATAAAA TCAGTTACGC AGTCTCACTA GCATTAAGGT 960 

ATATTCCAGA TATTCAAGAA ACATATTttA ATATTTCACA AGCGCAACAA GCAAGAGGAT 1020 
TACCTTTAAT ATTTTCTAGT ATCGAAAGAA TTGACACTAT TAGTACTGCT ATGGAGTTAA 1140 

GACAATTCG6 ACAGTATAAA AGGAGAACCT GGTACGTCAA AAAACAATTA AAAAAAGATG 1200 

ATTATGTTGT TTTGTGTTTG ACGTTAATAC TTCTGATGTT AGTAGTTACA TTATTCTTTT 1260 

TAAATAATAG TOGATATTTC AACCCGTGGC ATTAGTATTC ATATAAATAG TCTTTAAATA 1320 

GAAATAGGAG GGAGACATTT AATGATAAAT ACTGAAAGAT TAAATTTAAT GATTCCAAGT 1380 

TCCTCGCATT TAATTGAACT TTATAATATT TGTAGTCATC CACAAGCAAA TATATACACT 1440 

CCCAAAGGTT TACATAATTC CAAATTAGAC ACACAACGGT GGATTGAAAA ATGGCGAAAC 1500 

CATTGGCAAC AATATCAATT TGGTTACTTT GTATTGGTAA AAAAAATAGA TTGTAGTGTT 1560 

ATTGGTATTT GTGGATATGA ATATCGACAA TTAAAGCAAG AaACAGTATT AAATTTATTT 1620 

TATAAATTAC ATCCAAGTTT TGAAGGACAA GGGTACGCAT GTGAGGCTAT TACAGCAATC 1680 

ACAAATTTTG TGAATTATAT CGATCAAGAA ACAGTAAAAG TTATCAGGAC AAATAAGTGT 1740 

AACCAACGTT CAATAAATTT AGCAGAAAGG CTTAAATTCA AGCGAGACGA TACTATGGAC 1800 

GACATTATCA ATCAAGGAGA TATTGTGTTT TAAllAATAAA ATACTATGAC ATTATCTAAA 1860 

AAATAAAATT AAAA 1874 



(2) INFORMATION FOR SEQ ID NO: 544: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5280 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 544: 

TCAACATTTC TAACACCAAT GTGAAAATGA TCTATGTGAT TTGCAATGGC TTGATTTGTA 60 

ATAT6TGTGC CTAAATGACC TGTAGCACCT GTTAACATAA TATTCATTCA CTTCATCTCC 120 

TAATCTTTAT ATACATAACA TAATACTTAT TTGATGGTTT TCAAAACATT TGATTTTATA 180 

AAAAATTCTA ATCTGTATTT ATTGTCQACG TGTATAGTAA ATACGTAAAT ATTATTAATG 240 

TTGAAAATGC CGTAATGACG CGTTTTAGTT GATGTGTATC ACTAATATCA TTGAAAATTT 300 

TAATCaGGTA CTACGACAAT ATGATGTCTG TTTTGTGTCT GAAAGTTTTA CAGTTTTTAA 360 

AATAAAAATG GTATAAAGTG TGATTTGTAT AAAAAAGAGT CTCGACGGAT AAGAATTGAT 420 

TAATAACAGT TAGCATTTTA TTAATTACCT TAACAATGAT TCAAGTTTAG TTAAATGAGG 480 

TTTAATTTGA AAGGGGATAG CGCCTCAATA TAATGTAGGT AGATTGTTCA TATTACGTAA 540 

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AAGCAAGTTT AGCATTGGGA ATGTTAGCAA CAGGTGTAAT TACaTCGAAT GTACAATCAG 660 

TACAAGCGAA aGCAGrAGTT AAaCAACAAa GTGAaTCAGA GTTAAAACAC TATTATAATA 720 

AmCCAATTTT AGAGCGTAAA AATGTGACTG GATTTAAATA TACTGATGAG GGTAAACACT 780 

ATTTAGAAGT CACAGTAGGG CAACAGCATT CTCGAATCAC TTTACTTGGA TCTGATAAAG 840 

ATAAATTTAA AGACGGAGAA AACTCAAATA TAGATGTGTT TATCCTTAGA GAAGGTGACA 900 

GTAGACAAGC AACAAATTAC TCAATTGGTG GCGTTACAAA ATCAAATAGT GTGCAGTATA 960 

TTGATTATAT CAATACGCCA ATTTTAGAAA TCAAGAAAGA TAATGAAGAT GTACTTAAAG 1020 

ATTTTTACTA CATTTCAAAA GAAGACATCT CATTAAAAGA ACTTGATTAT AGATTAAGAG 1080 

AACGT6CGAT TAAACAACAC GGCTTGTATT CAAATGGTCT TAAACAAGGT CAAATTACAA 1140 

TTACAATGAA TGATGGCACA ACACATACAA TCGATTTAAG TCAAAAACTT GAAAAAGAAC 1200 

GTATGGGTGA GTCAATCGAC GGCACTAAGA TTAATAAAAT TCTAGTAGAA ATGAAATAAT 1260 

ACTTTCTAAC AACAAAGCGC TATGTTGAAT AGTGCTTGTT ATGGAAATAT ATGGAAGTTA 1320 

AGCGACGTAC TGTTGCTTAG CrTCXTr ' nT TGAGGGGAAA AGTTACAAAA CTCACACAAA 1380 

CAGTCGCACC ACGCATTATC TTTTGCTTAA ATAGCTTAAT CATATTTTAT GAATAGTTAA 1440 

AAACAGGTTA ATGTGAATAT CCGAATACAG CTCCTATAAT ATGGGTGTAT GATTCAAATT 1500 

ACGTAATAAA ACAATCTAAT TATAATAGAT TGGAGCATAC AACTATGAAA ATGAAAAATA 1560 

TTGCAAAAAT AAGTTTGTTA TTAGGAATAT TAGCAACAGG TGTAAACACT ACAACGGAAA 1620 

AACCAGTTCA TGCCGAAAAG AAACCTATTG TAATAAGTGa AAATAGCAAA AAATTAAAAG 1680 

CTTATTATAA TCAACCTAGT ATTGAATATA AAAATGTGAC AGGTTATATC AGTTTCATTC 1740 

AACCAAGTAT TAAATTTATG AATATCATAG ATGGTAATTC TGTTAATAAT ATTGCTTTAA 1800 

TTGGCAAAGA TAAGCAACAT TATCATACGG GTGTACATCG TAATCTTAAT ATATT7TACG 1860 

TTAATGAGGA TAAGAGATTT GAAGGTGCAA AGTACTCTAT TGGGGGTATC ACGAGTGCAA 1920 

ACXSrTaAAGC TGTCGACCTA ATAGCAGAAG CAAGAGTTAT TAAAGAAGAT CATACTGGTG 1980 

AATATGATTA TGACTTTTTC CCATTTAAAA TAGATAAAGA AGCGATGTCA TTGAAAGAGA 2040 

TTGATTTTAA ATTAAGAAAA TACCTTATTG ATAATTATGG TCTTTACGGT GAAATGAGTA 2100 

CAGGAAAAAT TACAGTCAAA AAGAAATACT ATGGAAAGTA TACATTTGAA TTGGATAAAA 2160 

AGTTACAAGA AGACCGTATG TCCGATGTTA TCAATGTCAC AGATATTGAT AGAATTGAAA 2220 

TCAAAGTTAT AAAAGCATAA CACATATACT TGATGACGAA ATAAGTTGAA ATTGAAATAG 2280 

AGAGGTTAAG TGACGATCAA ACGTTGCTTA ACTTCTTTTT AATGCTTAAA AATTATTTCA 2340 
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GATCAATATA GCGGTAAAAA GCTAGGTACG ATTAAATATG ATGACTACGG TATTATTGCT 420 

AAATGGTTTA CATGGGGCAT TCCGCTTCAC GAAGGTCATT TATTCGGCAT TTTAAATAAA 480 

ATCATTAATT TATTTGTATG TATCGCTTTA TTAGTAGCCA TTGGCATGGG GTTTGTCTCT 540 

TGGATAAAGC GTACAAAAAA TACTGCAGTA AAAGTACCAC ATCGCGTAAA AAAACCAGCA 600 

TCTATATCAC TCATAATATG TTTAATTGTA TTAGGATTAT TAATGCCATT ATTTGGATTA 660 

TCACTTATCC TTGTATTTAT AATTGAATTA ATATTATATA TTAAAGATCX3 TCGTGCTAAA 720 

CAATAATGCA CTTAAAGTTT TGAACTGACX3 AAATTTACAA AATGgATTCT CGTCTCTCTA 780 

ATTACtTAAA ACGGGgTtCy AaTAATAAAT CgTACTGaTG GgAAAGTTTT TACTTTTTAt 840 

CTGtCCGAtT TTTTnGAAwT TGAAGATAAA AAAGCATCTA AAACGC 886 

(2) INFORMATION FOR SEQ ID NO: 546: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4336 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 546: 

GGCATTTGTG TCCTTATATA AGGAACTGTG tTAAATACAT TACTGTTGTT AAGTTGTTTT 60 

TGTAATTCAA AGAGCAGAAC AGAGTAACAT CATCAGTTGT AGTAAACGAT AATCCGGTAA 120 

AACAACTAAA TGAAATAATG AAAGTCATTT AACCTGAACA TTAAAATATA TTTGTTTTTC 180 

ATTAAGAATA ATTCAAGTAT ATTTAAATCG AGGTTAATTA TCGTATGAAA CGATGCACGT 240 

TATAATAAAA ATGTATGATT CAAATTACGT AATGAAAACA ATCCAATATA TTAAGATTGG 300 

AGCAAATAAA TATGAAATTT ACAGCATTAG CAAAAGCGAC ATTAGCTTTA GGAATTTTAA 360 

CAACAGGAAC TTTAACAACA GAAGTTCATT CAGGTCATGC AAAACAAAAT CAAAAGTCAG 420 

TAAATAAACA TGACAAGGAA GCATTATACC GATACTACAC TGGAAAGACT ATGGAAATGA 480 

AAAATATTAG TGCTTTGAAA CATGGTAAAA ACAACTTACG TTTTAAGTTT AGAGGTATTA 540 

AGATTCAAGT TTTACTGCCT GGAAATGATA AAAGTAAATT TCAACAGCGT AGTTATGAGG 600 

GGTTAGATGT TTTCTTTGTT CAAGAAAAAA GAGATAAGCA CGATATATTT TATACTGTTG 660 

GTGGTGTAAT ACAGAATAAT AAAACATCTG GAGTTGTCAG TGCACCAATA TTAAATATTT 720 

CAAAAGAAAA GGGTGAAGAT GCTTTTGTGA AAGGTTACCC TTATTACATT AAAAAAGAAA 780 

AAATAACACT AAAAGAACTG GATTATAAGT TGAGAAAGCA TCTAATTGAA AAATACGGAC 840 

ATAACCTTGA TTTAAGATCT AAATTAAAAT TTAAATATAT GGGGGAAGTC ATAGAAAGCA 960 

AACAAATTAA AGATATTGAA GTTAACTTAA AGTAAATCAT TACGAATAAT TAAAAGTAAT 1020 

TGAAGCGGCT TAACGGTGAA ATGTAAATTG GTGCGCATAG CTTATACAAA AAGGATGCAT 1080 

CAATCGATAT CGTCGTTAAG CCGTTTTGGT TTGTGTGTCA TGAATCCTAT CCCAATCTCC 1140 

ATAAAGGTAA AATTTCCACC ACCAACATCA AAATTCTCCA CATCGCAACA TAACCAAATG 1200 

TTATAATAAA TCTATTACAC AAAGAGATAA ATTACTTATT CAAAGGCGGA GGAATCACAT 1260 

GTCTATTACT GAAAAACAAC GTCAGCAACA AGCTGAATTA CATAAAAAAT TATGGTCGAT 1320 

TGCX3AATGAT TTAAGAGGGA ATATGGATGC GAGTGAATTC CGTAATTACA TTTTAGGCTT 1380 

GATTTTCTAT CGCTTCTTAT CTGAAAAAGC GGAACAAGAA TATGCAGATG CCTTGTCAGG 1440 

TGAAGACATC ACGTATCAAG AAGCATGGGC AGACGAAGAA TACCGTGAAG ACTTAAAAGC 1500 

AGAATTAATT GACCAAGTCG GTTACTTCAT TGAGCCAGAA GATTTATTCA GTGCGATGAT 1560 

TCXSTGAAATT GAAACGCAAG ATTTCGATAT CGAACACCTG GCGACGGCAA TTCX5TAAAGT 1620 

TGAAACATCA ACATTAGGTG AAGAAAGTGA AAATGACTTT ATCGGTCTGT TCAGCGATAT 1680 

GGATTTGAGT TCAACGCGAC TAGGTAACAA TGTCAAAGAA CGTACTGCTT TAATCTCTAA 1740 

AGTCATGGTT AATCTTGACG ACTTACCATT CGTTCACAGT GACATGGAAA TTGATATGTT 1800 

AGGTGATGCA TATGAATTCC TAATTGGGCG CTTTGCX3GCG ACAgCGGGTA AAAAAGCAGG 1860 

CGAGTTCTAT ACACCACAAC AAGTATCTAA GATACTGGCG AAGATTGTCA CAGACGGTAA 1920 

AGATAAATTA CGTCACGTGT ATGACCCAAC ATGTGGTTCA GGTTCACTGT TGTTACGTGT 1980 

TGGTAAAGAA ACACAAGTGT ATCGTTATTT CGGTCAAGAA CGTAACAATA CTACATACAA 2040 

CTTAGCACGC ATGAATATGT TATTACATGA TGTGCGTTAT GAGAACTTCG ATATCCX3TAA 2100 

TGATGACACA TTGGAAAACC CAGCCTTTTT AGGCAATACA TTTGATGCGG TTATTGCGAA 2160 

CCCACCGTAT AGTGCGAAAT GGACTGCAGA TTCAAAGTTT GAAAATGACG AACGATTCAG 2220 

TGGTTACGGC AAACTTGCGC CTAAGTCTAA AGCAGACTTT GCCTTTATTC AACACATGGT 2280 

ACATTACCTA GACGATGAAG GTACCATGGC CGTTGTACTC CCACATGGTG TATTATTCCG 2340 

AGGTGCTGCA GAAGGTGTCA TTCGTCGTTA TTTAATTGAA GAAAAGAACT ACTTAGAAGC 2400 

TGTGATTGGT TTGCCAGCGA ATATTTTCTA TGGGACAAGT ATTCCAACAT GTATTTTAGT 2460 

ATTTAAAAAA TGTCGCCAAC AAGACGACAA CGTACTATTT ATCGATGCAT CCAATGATTT 2520 

TGAAAAAGGA AAAAATCAAA ATCATTTAAG CGATGCCCAA GTCXy^CGTA TTATAGACAC 2580 

ATATAAGCGT AAGGAAACAA TTGATAAATA TAGCTACAGC GCGACACTAC AAGAGATTGC 2640 
TAGCTACAGC GCGACACTAC AAGAGATTGC 2640 
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GATTGAT7TA GATCAAGTCC AACAAGATTT GAAAAATATC 6ATAAAGAAA TCGCAGAAAT 2760 

TGAGCAAGAA ATCAATGCAT ACCTGAAAGA ACTTGGGGTG TTGAAAGATG AGTAATACAC 2820 

AAAAGAAAAA TGTGCCAGAA TTGAGGTTCC CAGGGTTTGA AGGCGAATGG GAAGAGAAGC 2880 

AGTTAGGGGA TCTTACAGAT AGAGTAATTA GGAAAAATAA AAACTTAGAA TCGAAAAAGC 2940 

CTTTAACAAT ATCCGGACAG TTAGGTTTAA TTGATCAAAC AGAATATTTT AGTAAATCAG 3000 

TTTCGTCGAA AAATCTAGAA AATTATACAC TAATAAAGAA TGGAGAATTC GCGTATAACA 3060 

AAAGTTATTC TAATGGATAC CCATTAGGGG CTATTAAAAG ATTAACTAGA TATGATAGTG 3120 

GTGTATTGTC CTCTTTGTAT ATTTGTTTTT CTATTAAAAG TGAAATGTCT AAAGACTTCA 3180 

TGGAAGCATA TTTTGATTCG ACACACTGGT ATAGAGAAGT TTCTGGAATT GCAGTTGAGG 3240 

GTGCAAGAAA TCACGGATTA TTAAATGTTT CTGTGAATGA TTTTTTTACT ATTCTAATTA 3300 

AATATCCAAG TTTAGAAGAA CAGCAAAAAA TAGGCAAGTT CTTCAGCAAA CTCGACCGAC 3360 

AAATTGAATT AGAAGAACAA AAGCTTGAAT TACTTCAACA ACAGAAAAAA GGCTATATGC 3420 

AGAAAATTTT CTCACAGGAA CTGCGATTCA AAGATGAGAA TGGTGAA6AT TATCCAGATT 3480 

GGGAAAATAG CAAAATAGAA AAATATTTAA AAGAGAGAAA CGAACGTTCT GACAAAGGGC 3540 

AAATGCTTTC AGTAACTATA AATAGTGGCA TTATAAAATT TAGTGAATTG GATAGAAAAG 3600 

ATAATTCAAG TAAAGATAAA AGTAATTATA AAGTAGTTAG GAAAAATGAT ATTGCATATA 3660 

ATTCTATGAG AATGTGGCAA GGGGCTAGTG GTAAATCAAA TTATAATGGG ATTGTTAGCC 3720 

CTGCATATAC TGTGCTTTAT CCAACACAAA ATACTAGCTC ATTATTTATT GGATATAAGT 3780 

TTAAAACACA TAGAATGATT CATAAATTTA AAATTAATTC ACAAGGATTA ACATCAGATA 3840

CATGGAACTT AAAATATAAA CAATTAmAAA ATATAAATAT AGATATACCT GTATTGGAGG 3900 

AACAAGAAAA GATAGGTGAT TTCTTTAAAA AAATGGATAT ATTGATAAGT AAACAGAAAA 3960 

TGAAAATTGA AATATTAGAA AAAGAGAAAC AATCCTTTTT ACAAAAAATG TTCTTATAAC 4020 

TTTGATAAAT ACATAGATTG CATAAGAATA AAATTTGTAT AATTTAACAT AAAAGTTGTA 4080 

AAAGTAAAGT GAATTAAAAA CGAACATTAA ATTTAGGCAC TGTGAAAGCG CAGTGTCTTT 4140 

TTTGTGTCGA AATTGTGTAC AGAATAAGTA 6TTAAATAAA GATTAAGTTG AGATAAAGTG 4200 

TTATTCGTAA ATAAAAGAGA GTAGATCGAT AGGAATTGAA TGATATTAGT TAACTATTTA 4260 

TTAAATTACT TAATAATGAT TAATTTTTAG TTAAAGTAAG TTTAATGTGA AGCACGACCA 4320 

TTGCTCATTA TAATGA 4336 
(2) INFORMATION FOR SEQ ID NO: 547: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 487 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 547:

TAAGCTATCT GAGATAATTG CTGATAACAT TAAACCGGCA ATTTCAGGTT TAATTTCAAA 60

GCCACGTTCT CTAAACATTT TGTATAAAAT TGTAGCTGTA CAACCAACTG GTTCAGCACG 120 

ATAACATAAA GGACCAGCAG TTTCGAAATT TGCAATTCTG TGATGATCAA TTACATGCTT 180 

AATTGTAGCA GAGGCAATCG TATCAGAACT TTGTTGGaAT TCGTTATGAT CAACTAAGAT 240 

AACATCTTGA CCATCTAAAT CATCTGTTAA TAATTCCGGA GCAGGTACAT TAAATGTATC 300 

TAACGCGAAT TGAGTTTCTG CACTCACATC ACCTAAACGG TATGCTTTGG CTCCTGAATT 360 

ACCTCGAAGT TGTTCaAATT CTGCCaTAAT AATCGCAGAT GAAATTGCAT CAGTGkCTGG 420 

aTTCTTATGT CCGAAAATAT ATGTTTTAGC CAnTGTCAAA TATCTCCCTT GTAAATTGTA 480 

TTCTTTA 487 
(2) INFORMATION FOR SEQ ID NO: 548: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 871 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 548:

TTGGTGGTGC AGCAGTTATA GCAATTGTTT TAGCATTCAT TGGTAAGTTC ACTGCATTAA 60 

TTTCTTCTAT ACCTACmCCA GTTATGGGAG GAGTATCTAT ATTACTTTTC GGTATTATTG 120 

CAGCAAGTGG CTTAAGAATG TTAGTTGAAA GCAAAGTAGA TTTTGCGAAC AATCGAAATT 180 

TAGTTATAGC TTCTGTAATT TTAGTTGTAG GTATCGGTAA TTTAGTATTT AACTTAAAAG 240 

AAATTGGTAT CAACCTTCAA ATTGAGGGGA TGGCATTAGC TGCACTTTCA GGAATTATTT 300

TGAACTTAAT CTTACCTAAA GAGAAAAAAC AAAACAATTA AGATTTACAA ATTAAGGAGG 360

GCGCTTTTAT GAATCATTTA TTATCAATGG AACATTTATC TACAGATCAA ATATACAAAC 420

TTATCCAAAA GGCAAGTCAA TTTAAATCTG GTGAACGTCA ACTACCAAAC TTTGAAGGGA 480

AaTATGTCGC AAATTTATTC TTTGAAAATT CTACTCGrAC AAAATGTAGT TTTGAAATGG 540 

CAGAACTTAA GCTAGGGTTA AAAACGATTA GCTTTGAAAC ATCAACATCA TCTGTTTCAA 600 
TCATTAGACA TCCGTTTAAT AACTACTATG AAAAATTAGC GAATATTAAC ATCCCAATTO 720
CGAATGCTGG TGATGGTAGT GGACAACATC CAACACAAAG TTTACTTGAT TTAATGACGA 780 
TATATGAAGA ATATGGATAT TTTGAAGGCT TGAATGTATT GATTTGTGGA GAcATTaAAA 840 



ATTCACGTGT CGCACtAGTA ATTACCAnAG T 871
(2) INFORMATION FOR SEQ ID NO: 549:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 400 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 549:
TCAATCTAAT ATATACTTCA TGACTTCCCQ CCAAAGGCGC AATCGTAGGA TTAGTCTGTT 60

TATCGATAAG ATCTATTAAT ATTGTTTCTA CTTTAGATTC ACCTATTCCC GCAAATCTTA 120

ATAGTTCAGA ATGTATAATT CGATTATGGT TTATAAAATG TGACAACAAT TCATTTTTCA 180

CCATTGGTTG CATTTCTTTC GGTGGACCTG GTAATAAAAT AATTTGTTTG TTTTCAAAAT 240

TCACCATCAT TCCTGGAGCC ATGCCATGAT GATTTGTTAA TACAGTTGAA CCTTCAATTA 300

CTAAAGCCTG TTGTcTATTA TTAGGTGTCA TTTCTtGTCC TTGTTCCTCA aAAwAGCTTT 360

CAATATATTG AAAGAAGGCT CATCAATAAC TAAATCTnTA 400
(2) INFORMATION FOR SEQ ID NO: 550: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1523 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 550:

ATACCTCCAG CTAGAATACC AGCGTATTTC ATAAAATACT TCCTCCATTC AACTATATCT 60

ATATTTAATT ATTTAAATTT CGTTGCATTT TCCAATTGAA AACTCATTTT AAAATCAAAA 120

CTCTAAATGT CTGTGTATTA CTTAAAATTA TACATATTTT GCTTATATTT TAGCATATTT 180

TGTTTAAACC TATATTACAT TATATCAGAC GTTTTCATAC ACAAATAATA ACATACAAGC 240

AAACATTTCG TTTATTATTT ATATCACTTA ACTAATTAAT TTATAATTTT TTATTGTTTT 300

TAAGTTATCA CTTAAAAATC GTTTGGCaAA TTCGTTGTGA CGCTTGTCCA TCTTCTAATG 360
ATGTGGACAA TAAATCTTTT TTCTGCGGAT TATCTAACTC ATAATCAACA TGTGTCACAT 360

TATAACGTGC TTTTTTAGAA AGGCTAGCTA AAATTTGCTC GTGAAAAGCA GTTAATGAAT 420 

CTAAATCTAA TTTGATTTGT AATAGGAAAT TGTTATTAAG TAAATATAAG TCGTTTTGAT 480 

AACGCGACAA TTTGTTTAAT ACTTCATAAG CTTGTCTAGT CGTCTGAACT ACTTCTCTGA 540 

AAAGTATTTT CTTTCTATTC TGTTGGTGAA TATGTTTTTT TGTAATAGGA CGTTCTTCGC 600 

TATAGTAATC ATAAATTTTC TCTAACTTTT CGACACX3TTG TTITAAATTA TGACTATCTT 660 

GTTTAATATT ATTAAACTCC GTCGTATCAT TTAATACTAA TTTAAACCAC ATAAAAATAT 720 

CTGAGGATAT ATTTAATGAA TTATAGTAAA TTTTTGTTTC AAATTTAGGT GGTAGAAACA 780 

CAAAGTTAAC TAGAlGATGAA CTTATGACAC CAATCATTAC AAGTACAAAC CTGTAAAAGG 840 

CGGTAATATA GAAAGAACCG GTATGTTGTC CCATAATGAT TAATGCTGTT ACACTCGCCA 900 

AAGTAGCAAC ATGTGCTAAA TTAAATTTAA ATAAAATAGC AATAAGTACT ATGACGGTAA 960

CACCCATAAT GATAAAATTA TCACTAAAAA TTGTTACCAT TGTAACAGAT AGTATGGCAC 1020 

CTATAATGTT ACCCAATGCT TGATCAGAAA CTGTTTTAAA TGAACGATAA ATACTAGGTT 1080 

GCATTGCACA ACAGCACTGA CACCAGCTAA GGCTTTCAGA CCAACATCAT CCGGTAGTAA 1140 

AGAAGCGATA GACATAGCTA AAATAATGGC TATACCAGTT TTAAAAATCC GAGCTCCTAG 1200 

TCTCAAAAAT AATGACGCCC CTTTTAAGTr TATTGAATAT CTAATATTCG TATTCATTAC 1260 

TGTTATACAC TTACTAGTTA CAAAATTCAA GCTTATTTAT AGTTGTTAAA ATAAATCATA 1320 

CATAATACTG ATAGCGATGT AAAACTTTAG TCAGAGATTA AAATAGTATA AATTTGTAAA 1380 

ATAAAAACTC ACATAGTGAC ATATCAAGTT AAACGTTAAT AGTTAACGAT ATAAAATGAA 1440 

TCTACTATGT GAGCATTTGC TTTATTTTAA TTCAATTAAA AATATACTTC CTTAAAAGTT 1500 

ATTTCATTTG ACTAAAAGCA TAGTCTGCAG CTTTTAAAGT TTGTTTAATA TCTTCTTCTG 1560 

TATGTTCAGT TGTTAAGAAC CAAGCTTCAA ACTTAGAAGG TGCTAAATTG ATACCTTGAT 1620

TTAACATTAA TTTGAAAAAT TTACCGAACG CTTCGCCGTC AGAATGTTCA ACTTGATCAT 1680 

AATGTGTGAC TTTTTCATCT GTAAAGTACA ATGTTAAAGA TCCATAAATA CGATTAATTG 1740 

TAGCTGTGAT ATTATGTTTT TCGATTAATT TAAGTAAACC TTCTTCTAGT TGTTGGCCTA 1800 

AGCTGTCTAA TTTTTCATAA ACACCGTCTT GTTCTAGTAC TTCGAGTAAT GCAATACCTG 1860 

CTTTCATAGA TAACGGGTTA CCAGCCATTG TACCAGCTTG ATATGCAGGT CCTAGAGGTG 1920 

CTACTTGTTC CATAATATCT TGACGTCCAC CATAGCCTCC AATTGGTAAA CCACCGCCAA 1980 

CAATTTTACC AAATGCAGTT AAATCAGGGA TAACACCTAA TAAATCTTGA GCGGCACCGT 2040 

AAATTTCATT AACCTCTTCT AAAAATCCAG GTTGAGGCAT TACCATTCCA AAGTTACCAA 2160

CAATTGGTTC TACTT^TACT GCGGCAATTT CATCACCCCA AAATTCAATT GCTTCTTTAT 2220 

AGGCGTTAAT ATCATTGAAA GGTACAGTAA TGACTTCACG TGCGACGCTT TCTGGAACAC 2280 

CAGCTGAGTC TGGAGAACCG AGCTGAGATG GGCCGCTACC TGCTGCAACC AATACTAAAT 2340 

CAGAATGGCC ATGATAAGAT CCAGCAAATT TTATAATTTT ATTTCTTTTA GTATATGCAC 2400 

GTGCAACACG AATTGTTGTC ATGACTGCTT CTGTTCCAGA ATTTACAAAG CGAATTTTCT 2460 

CAAGAGATGG AATTGCATCA CGTAATTTTT TGCTGAATTC AATTTCTAAT TCAGTCGGTG 2520 

TACCAAATAA AACACCTTTA GCAGCTTGTT CTTGAATTGC TTTAGTAATA TGAGGATGTG 2580

CATGCCCCX3T AATAATTGGA CCGTATGCTT GAAGGTAATC AATAAATTTA TTGCCATCGA 2640 

CATCATATAA ATATGCACCG TGTCCTTCTT TCATAACAAC AGGTGCACCX3 CCTCCTACAG 2700 

CTTTATAAGA ACGAGAAGGG GAATTGACAC CGCCTA6AAT ATATTCGTTT GAAAGTTGTT 2760 

GTAAACGTTC ACTTTCACTA AAATTCATTT ATATCAACCT CTTTTAATTT AATATTTTCA 2820 

TCTAATATCG TATCATAAAA TTATTATAAT GAAGAAAAAG GTGATTATAT GTTGCAAAAA 2880 

GGAGAACAAT TTCCAATATT TAAATTAGAA AATCAAGACG GAACTGTCAT TACAAATGAT 2940 

ACATTAAAAG GTAAAAAGGC GATTATATAT TTTTATCCTA GAGATAATAC ACCTACTTGT 3000 

ACCACAGAAG CTTGTGACTT TAGAGACAAT TTAGAAATGT TCAATGATTT AGATGTTGCA 3060 

GTATATGGTA TAAGCGGTGA TTCAAAGAAA AAACACCAAA ATTTTATTGA GAAACACGGA 3120 

TTGAATTTCG ATTTATTAGT AGATGAAGAT TTTAAATTAG CTAAAGAAAt GGCGTATATC 3180 

AGTTAAAAAA ATCATTTGGC AAAGAAAGTA TGGGCATTGT AAGAACGACT TTTATAATAG 3240

ATGAACAAGG TAAAGTA7TA GATGTTATCG AGAAGGTTAA GGTTAAAACA CAAATAGAAG 3300 

AACTTAAAAA CATTTTGGGG T6ACATATAT GAAAGTTGTT GGGTTAAATC GTATGCGTGA 3360 

AGTTGAAACT GAATTACAAC AACGCTTTTC AGATTTAGAT TTTAAATTTT ATAAAAAAGC 3420 

ATCAGAAATA CCTGAGAGCG ACTTGGCTGA TTTAGATATA TTAGTTGGTT ATGATGGCGG 3480 

TATCAATGAG GCATTTTTAC GACGTTGCCC GAATTTAAAA TGGATTGCAT GGTTTGCAAC 3540 

GGGTGTAAAT ACATTGCCGT TAGATTATAT TGCAGATCAC GGCATACTTT TAACTAATGG 3600 

AAAAGGTGTT CAAGCTAAAC AATTATCTGA ATACATTTTA GCTTTCATTT TAGATGATTA 3660 

TAAAAAGATG AAACTATCAT ATGATAACCA ACGACAACAT ATATATGATT CGAAAATAAC 3720 

TGGTAAACGC CTATCAGGAC AAACAGTTTT ATTTTTAGGT ACAGGTGCAA TTGCTACTAG 3780 

AACTGCGAAG TTAGCAAAGG CTTTTAATAT GAATTTAATT GGTCTGAGCA AGTCAGGTCA 3840 
T6CTGACATT ATTATAAATG CTTTACCAGA AACGGAAGAA ACGATTCaTT TaCTAAAGAA 3960

AAAACATrrr GAATTAATGA AAGATGAAGC ACTTTTTATA AATATAGGAC GAGGTAGCAT 4020 

AGTTAAAGAA QCGCTCTTAA TAGAAGTATT AAAAAGTAAA GTTATTCGAC ATGCATATTT 4080 

AGATGTGTTT GAAAATCAAC CTTTGAAACC TAATCATGAA TTATATGAAT TGGATAATGT 4140 

AACTATAACA GCGCATATAA CTGGTAATGA TTATGAAGCA AAGTATGACT TATTAGATAT 4200 

TTTTAAAAAC AATCTAGTTA ATTTTCTCAA TAAGAATGGT CTAATTGAGA ATGAAGTTGa 4260 

TGCTAAAAAA GGCTATTAAA TGArATCATC ATGTAAATAT TGACACGCGC GCAATACTAC 4320 

AGTTATATTT aTAGTAAgTt AATaATgATT ATATAAGAaA GATGGTgATA TAGATGAGTG 4380 

TTGAAATAGA ATCAATTGAA CA7GAACTAG AAGAATCAAT TGCATCATTG CGACAAGcAG 4440 

GCGTAAGAAT TACACCTCAA AGACAAGCAA TATTACGTTa TTTaATTTCT TCACATACTs 4500 

ATCCAACAsC TGaTGaAATT TATCAAGCAC TTTCACCTGa TTTTCCAAAT ATAAGTGTTG 4560

CGACAATATA TAATAACTTA AGAGTGTTTA AAGATATTGG AATTGTAAAA GAATTAACAT 4620 

ATGGAGACTC ATCAAGTCGA TTCGACTTTa ATACACATAA TCATTATCAT ATTATATGTG 4680 

AACAATGTGG TAAGATTGTT GATTTTCAAT ATCCACAGTT AAATGAAATT GAAAGATTAG 4740 

CTCAGCATAT GACTGACTTT GACGTAACAC ATCATCGAAT GGAAATTTAT GGAGTTTGTA 4800 

AAGAATGCCA AGATAAATAA TTTAACTTTG GTAGTATGAC AAATTAAAAA AGCGTTACTw 4860 

ACTTCATATA AGTAAGCGTA ATATTTAAGA nGTTAAACGA CATGaAAGTt GTTTAACTTT 4920 

TTT 4923 
(2) INFORMATION FOR SEQ ID NO: 552:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 917 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 552:

TCCGGCTTTA AAAACTTTTC CCAATTCCAG CTTGGGCCTT TGGCATTAAT ATTAACCTCC 60 

TGGTTCGGAT TAATTGGGAA CCTAACCnTT TAGGCAATAA TTGGTTTAGG CAACTTCCAA 120 

ATTGGTGGTT CAACCAACGT CTTTGGATAC CyTGcTCATT TAATTCTAAA ATGgTyxGAA 180 

CGCATTTTGG TACCCAAAAt GgTGACGTTC GTTTGCACGG TCTAATAAAT TGTCTAAGTT 240 

GTCGATTGGT TTCATTAATT GAaCACCATT TTGCTCTTCA GCAAGACCTT TGTCTACTTT 300 
GCGATCCATT GTTTGTTCGA AAAGAATAGC ACCCAAAATT TTATCTGGTG AGAATGAAGG 420

TGAAGTTACC ACACGTGTAC GCATATCGTG AACAAGTTGG AACATTTCGT CTTCATTGCT 480 

ATATTGATCT TCGTTTACAC CATATTCTTT AAGTGCTTTT GGTGTACTAC CACCACTTTG 540 

GTCTAATGCG GCAATAAAGC CTTTTCCATT TTTCATTTTT TCTAATTGCT CTTTATTCAT 600 

ACTTTCCACT CCTTAACTTT TCAATACACC TCCAGTATGA TAAAAATGAG AACATTTCTC 660 

AAGTCATAAA CCTTGAAAAG TGTATAAAAT GTGAAAAATA ATTGTCAGTT TAATTAAAAA 720 

TATTATTTTA TTCTAGGTAT GACTAACGCC ATTAATGACA TAAAGAAAAT ATGTGTAATA 780 

ATCCAACCGA TTAATTCTGT CACACTAAAT TGAAAAATTG GACGTTGCX3C AATAAATACT 840 

AAAAAGGGAT ACAATGCTAT AAATAAGAAA AATAAAGGGA TATAACATAG ATAGTAAAGC 900 

CTTTTAGAAG TATGAAA 917 
(2) INFORMATION FOR SEQ ID NO: 553: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 432 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 553:

ACTCCCATAT CCACTGGCAA GATTTATAAT TTGTAATTCT AAAGTATAAT AGCCTATATT 60 

rrrCAAAATC TCTTTTTTTA TGATTAACAT CCCTCTAAGT GCACCTAAAG GTTTTTATGT 120 

AATGTATTAA ATATTTCCAT TTATACATAT ACATCTTGTA TAAAAAGAAA GAACTCCATA 180

TATACTCAAA AGGTATACTG AAGCTCTATA TATTATATAA AATTATTATA CTATTTTGAA 240 

AAACATAAGT AAATCACTAA ACATGATTTC TTTCACTTTT AGAAAACTTT TAATACTATA 300 

AAAGCACCCA CTCAGTCACT AGTTTGGGCA GTTATTGTAT GCCTATTGaA CTCAATGCgT 360

ATATTACAAT ACCTTTTtCG CATATTCATA TAAGacTTTG CATCTTTAAG CTTAATTGCT 420 

ATCTCTTTCT CT 432 

(2) INFORMATION FOR SEQ ID NO: 554: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1374 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 554:

TTTTGCTTTA TGTTAAATGG ATTATTaTTA GAATAAAAAA TCGGTGATGA GCTAAAAAAG 60

TGTGTAGGAT GTTTTCCkAA CCCAATTTTT ACATCCGAAG ATATCGAACA ATATCTTCCT 120 

ACTTCTACAT TATTAAAATC ACTACCAAAT CCAATATAAC TGTATTCACC AATGTGAGAA 180

TTCCTGATTT TACACCATCT ATCTATATAG TTATTGCCAT CAAATTTTGA GTTTGTAATA 240 

TACGCCAAGC GATGAATCTT AACATTCGAT TCTTTAGAGG ACTGGTTTTT CAGCAAACCA 300 

ATTATCTTTT CAATCGCTAT CCTCATCGTC ATTTCCTCCA AGTATTTCGA TTGATAATAT 360 

CTTTATAGCT TTGAATAATT TTAACTACCT TTGTCGAAAC GTTAGTGTCT TTATAATCAA 420 

TAGCATCAAT CATCGGTTCG TTATTGTTTT GCATCTCTCT TGCTAGTTCA ACGGATTGGA 480 

TTAGATTGTT ATAGGTAATA CCACCTACAA TAACCGTACC TTTATCTAGT ACTTCCGGTC 540 

TTTCXGTGGA AGTTCGAATA AGGACACCAG GGAACTTCAA AATAGACGAC TCTTCTGACA 600 

ATGTTCCACT ATCTGATAGC ACAACAAATG CATCTTTTTG CAATGCATTA TAATCAAAGA 660

AACCAAATGG CTTTAACTGT TTAACTAATG GATCAAATTC AAATTTACTT TCTTCAATTT 720 

TCTTCCAACT TCTTGGATGC GTTGAATAAA TCACAGGCAT TTTATACTTT TTGGCAATAT 780 

CATTTATCGC ATTCATTAAT GATTTAAAAT TCTTTTCATT ATCGATATTC TCTTCTCTAT 840 

GCGCAGaTAC TAAAATGTAT T6TTGCGGTT CTAATCCTAG TTTATTTAAA ACGTCACTGT 900 

GATTAATTTT ATCTCGATGC GCTTCTATCA CTTCTGTCAT CGGTGATCCT GTyACAAAGA 960 

TATTCGCTTT ATTGAAGCCT TCATCTAATA AATAAC6TCT GCTATGTTCC GTATAAGGTA 1020 
GATTCACATC ACTGACATGG TCAACAATTT TACGATTGAT TTCTTCAGGT ACATTCTGAT 1080 

CAAAGCATCT ATTACCCGCT TCCATGTGGA ACACAGGAAT CTTTAATCGT TTAGCAGATA 1140
CTGCTGCTAA ACAACTATTT GTATCACCAA GAATTAAAAG TGCATCTGGT TGTTCGCGTA 1200 
ATAAAACATC ATATGTCTTC GCAATAATAT TCCCCAtCGT TTCTCCAnGt TACTTCCAAC 1260 

TGCCTCTAAG TAGTGGTCCG GTTGTCTTAA TTCCAAATCA TCAAAGAAAA TTTGATTCAA 1320
TGTATAATCA TAATTTTGAC CAGTGTGTAC TAATATCTGA TTAnAAATAT TGAT 1374 
(2) INFORMATION FOR SEQ ID NO: 555:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1472 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 555:
TTACCTCTAG GCATGTCCCT TTCACGGTTT GCTTATGATA ACX3TTATCGA CATTGTTAAA 120

TTGTCCTTTT TGAAAAAATA ATTCTGGTGA GATAGACGCT ACTTGGaTTT CaTCaGTTTG 180 

TAAtAACGCA GtATAAtTAC CaTTACTAAA tTGaGTTTUVT CGTTCaTATA AAGTACTAAT 240 

AGGATAATAA ATGTTATCCG TTAAgCGCGC CGTATAGTTC ACTTGATACG TTTCGCCTTC 300 

AACAATTGCT TGTTGGACAC GTTTAATATT AGTCATCATA ACCTCAGAAG ATTCAACAAA 360 

TGAAAAATGA TACTTTGATA CATAAGAAGT TTGATGTTCA TATGTTGAAT TTATGCTTTC 420 

CGCTTTTTCA AAACTATAAG CTGCTGCATA AATATCATCT TTAGCTAATG AATGTGTACA 480 

CATAGCATGA TTAAAATACT TTGCCGCTTC GTAACTTAAA TATAACGAGA CATATCTACC 540 

TTGTCGTkGt GCGCTTGTGC AAAGTGTATC ACTTCTCCCA CATCAGCCAA CTTAGTAGCA 600 

ACATACTTCT TTATAAATCC CTTTAATTGA ATATGGTATT GCTTATATTC ATTTTCAGTT 660 

AAATAGTAGC GATAATTATA TTCTATTCTC ACAGTAATCA CCTACCTTCG ATAAAAATAA 720

TTCAACTTGT CGATAACCGT ATTCACTCAA AATAGATTCA GGATGATATT GCACACCAAA 780 

AACCGGAAAT CTAATATGCT CAAATGCCAT AATAATCGCT TCATCGTTTT TTGCTGTAAT 840 

CTTTAAGCAA TTTGGAAAAG TCGCTCCGTC AGCAATTAAT GAATGATAAC GCATTACATT 900 

GAAATTTTGA GGCAGTCCTT GAAAAATACC TTCATTGGTA TGGCGTAACT GTGTAGTATG 960 

TCCGTGTACA GGATGATAGC CGTGAATGAT ATTTCCACCA AAATAAGACA CGATACATTG 1020 

AAATCCTAAA CATACACCTA GTATAGGTAC ACGCTGATAA AATTGTTCTA ACACTTCATT 1080 

CAAGATAGGA TAATCATCCG GaTTACCCGG CCCAGGCGAA ATAACAATTG CTTTTGGCTT 1140 

CATATTAATG ACGTCTTCTA TCAGCAGATT ATCAATACCA ACAACTTGAA CTGTTAGTTT 1200 

CGTTTGAGTC TTAATATAGT CTATTAAATT ATATGTAAAT GaATCATTAT TATCTATGAC 1260 

TAGAATCATT GTATACTCCG TTCTAAATGT GTTTTATTTT TATAATATGT ATTGGATGTA 1320 

6CTAAAACTT TAAAAGCATT GTCATTATCC TGACCTTGAT TTTAACTAAT ATATGGTATA 1380

rrCTATTCAT CGTACATAAA TGAATATCAG AGGTTCCTAG CTGAAACCCT CTATAAAAAA 1440 

CTAGGCCATT GAAATTTCAA ACAITCGTTG GG 1472 
(2) INFORMATION FOR SEQ ID NO: 556: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1054 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 556:

AGAACAGCAA GGATTACTTA CTGAGGAATT AAAQAAGGAT ATTTTAAAAC AGAACAAATT 60

ACAACGTGTT GAAGACCTAT ATAGGCCTTT TAAACAAAAS AAAAAGACAA GGGCAACTGA 120 

GGCGAAACGT AAAGGGTTAG AGCCATTAGC GATATGGATG AAGGCACGTA AACATGAAGT 180

CTCAATTGAA GAAAAAGCAC AACAATTTAT AAATGAAGAA GTGCAATCGG TTGAAGATGC 240 

TATCAAAGGT GCACAAGATA TTATTGCGGA ACAAATTTCA GATAATCCTA AATATAGAAC 300 

AAAAATTTTA AAAGATATGT ATCATCAAGG TGTGTTAACT ACATCTAAAA AGAAAAATGC 360 

TGAAGATGAA AAAGGTATTT TTGAAATGTA CTATGCATAT AGTGAGCCAA TTAAACGCAT 420 

TGCTAATCAT AGAGTTTTAG CTGTTAATCG TGGTGAAAAA GAGAAAGTAT TATCTGTAAA 480 

GTTTGAATTC GATACGACAT CAGTAGAGGA TTTCATTGCA CGTCAAGAAA TCAATCATAA 540 

TAATGTAAAT CGCAGTTATA TTTTAGAGGC GATTAAAGAT AGCTTGAAAC GCTTAATTGT 600 

CCCTTCGATA GAGCGTGAAA TCCATGCTGA TTTAACTGAA AAAGCTGAAA ATCATGCAAT 660

AGATGTTTTT AGTGAAAACT TAAGAAATCT ATTACTGCAA CCTCCAATGA AAGGTAAACA 720 

AATATTAGGC GTAGATCCAG CATTTAGAAC AGGTTGTAAA TTAGCAGTCA TTAACCCATT 780 

CX3GTACTTTT ATAGCAAAAG GTGTGATTTA TCCGCATCCA CCAGTTTCTA AAAAAGAGGC 840 

AGCAGAGAAG GATTTTGTAC AAATGGTTAA AQCGTATOAT GTGCAATTAA TTGCAATTGG 900 

CAATGGTACT GCAAGTCGTG AAACAGAACA ATTTGTTGCA GATTTAATTA AAAAGCATCA 960 

GTTGCCAGTA CAATTCATCA TTGTCAATGA AGCGGGCX3CT TCAGTATACT CAGCATCAGA 1020 

AATTGCTAGA GATGAATTTC CTGATTTTCA AGTG 1054 
(2) INFORMATION FOR SEQ ID NO: 557:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1057 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 557:

AAATGTCAGA ATACAAGAAA AAAATAATTG AATTAATTGA AAGTAATTTA ACAGGATATG 60 
AAATTTCTAA AAAAACTGGA GTTTCTCAAT ACGTACTTTC ACAATTAAGA CAGGGCAAAC 120 
GCGAAGTAGA TAATCTAACC CTGAATACAA CAGAAAAATT ATATGAATAT GCCAATAAAG 180 

TTTTGTAATT TAACTAATGT ATAAATTAAT CAAGCTATGT TTATTTGATT TAACTATTAA 240 
TAAAAATCAT ATGGTGAATG GATATTATAA TAATTAAAAT ACAAAAATAG TAGATTCCAA 300 
AAGGGAAAAT AAGTGTTAAG TTTTAAATGA TAAAAAAGAT TGGAATGGAT CGTCTTGAAA 420

TGCTCCCTTC AAAGTTTTCA TTTTTTCAAT GTCGACTTCG AAGGGGGCAT TTTCATTAAA 480 

TTGTTATAGC TTTTTATATT TGTATAATGA ACATATAAGT TTAAGAAGGT GCGAGTGAAG 540 

GAAATAAAAA AGCTCAAATG TACCAAATTG TTAATCTTAA TAAATCTCTA CTTTATAAAG 600

ATTGAATGGA CATTCGAGCG TTAATCAGTC AGGAGGGACT TTCCCTCCTA CAATTTAATA 660 

ATAATACTTG CTTCACCACT ATACAAGGAG TGAGTTGTTA TGTTCAAAGT GAATTATTCG 720 

ATTTTAAGTT ATTATCCAGA ATATAATATC GCAGTAAGTT GGCAACGTTT AAGAGAAGGA 780 

AAAACAATAA AAAACAAGAT TTAATACTGC TGCX3TCATGA GGCGCTTGAA CATTATTTGA 840

TGAATAAGTA TAATTTCAAC TATGATTATG CACATAAAAT TGTATCAAAA AAATACGATT 900 

ATTCAATTTT TATAAAAaAG AAGGTGGATT AAATGCTTAC ATTAATAAAA TTGGAAAGAA 960 

GATGaACAGG tTATAATATA TGraTATATa CCTGaAGATG aTATAAGTAC CGGGTAAAGG 1020

GTCCCGTACC TTTTTAATTA AAAAAGTTCC AGGGGGT 1057 
(2) INFORMATION FOR SEQ ID NO: 558: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3754 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 558: 

CTGTGCTGTA TTTACtTTAA TTTGACGACA TTTGAGAAGT ATTATGATGG nTGTAGTTAT 60

TTTATGAAAG TAATGTATTA ACAATCGAAT TACTAAGTCT ATATTGGTAG GAATATCGAT 120 

GTTTAGTTTA AATGGAATGC ACTATTTAAG TTTTTAATAT GQAGATGTTT GTGACTTTTC 180 

GATGATTAAG ATTTTTATAG GTGTGCATCA TTTCAAATAA ACTTTGTGTT TAAAATTGAG 240 

CTTAGGAAAT CGATAGGTTT AGATGAGGAT ATTGTTGAAG TTATGTGTCT TGTATCCTTA 300 

GTTGTTATAA AAGCGCAAAA AATAGCACCG CTTTCTCTTT ATCTGTGTAG AAAGGATGCT 360 

ATTATTGTAA AACAATAGTT TTAATTTnAT TTTCTGATaT ATCATATGTr ATTcTACCTG 420 

TATCAATTTC TATCGAATTA TAACCATCAA AATTATCAAC TTTATCATCA AAGTCTATCA 480 

CTTTTCCAAT TAATATTTTA TTATTAGTAA GCGTTAATTT GACTAATTTG CCTATGTATG 540 

ACTGTAAATT CATATTTAAT CACTCCTTTT TAATATACGG AACTACATGA AAACCAGTTT 600 

TAGAATAATG AACCTTACCT AATTTCGTTT CAATATATTT ACCATTCACA TAAGATTTAC 660 
AAACTGGAAT ATCAATAAAT TCTTTTTTAG TCATCTTTTC ACTTTCATTA GAATCTATTA 780

TAGTATAACT TGGTAATAAA GATGAATTAT TTTTCTTATT ATTATATTCA TTTGTGCATA 840

AAATATGACG CTTTTGCTTA ACATTATTCA ATTTCATCGT CATTTTACCA TTACTTATCA 900

TTTCAAGCAT TTCTTTTTTA GCTTTTTGTA GAAAGGATGC TATTTTGTAA TTATTTTAAT 960

AATCTAATAC TTTTTATCTC AGTTTCATCA AACGAATAAA CAGCAAAATC AGTCTGTATA 1020

TCTACAACAA GATTTCCTGT TTCGCTCTCG AATTCATTTT CATAGTCAGT TACAAACCCT 1080

TCGTATTCTT CATTATTCAA AAGCGTGATA ATAACATCTT TACGATATGC ATCmCAATT 1140

CTCAATACTT TTCACCTACT TATCAATATA AGGTACTATA TGaGcACCTG GTCTTCX3AAT 1200

AATGGCACTT cCCTTTCTGG TTTCAATATA CACATTTTCX3 ATATGTATTT TTCCAATAAT 1260

TTGATTAAAA TTAATAATCT CTTTCAAATC AAATCGCTCA TCACTTAATA TTAGATTGCC 1320

TGTTGACATT TTTTCTCTTA ACAATTCATT CAATAAATCT ATAGAAAGTA TTGTATAGCT 1380

AGGCAATTTC TTATTATTTA AAATGGCTCT TTTTTTATTT TCATTATATA GGTGATGACC 1440

TAACATATAT CTATTTTATT TTCCGCAATT TATTTCTATT TTTATTTTAC CATTTTTAAT 1500

CATTTCTTTC ATCTGGTTTT TAGCTTTTtC CTGtAATTAT GCTTCTTTTA CTTCTACTTG 1560

ATATTTACCt TCACGCTCTT TAAAGAACTT GTCCCGCCAA TTGCCAACAT GTGGCACTGT 1620

GGTACTTCTA CACCAAGGAT GCATAGGTGG CGCATTCACA CCTGGTATCA TATCTTTAAC 1680

TTTAAATATT TTTCCGTTAA GTGAATGACA TAATTTAGAT GTTTTACTAT CTATTTTGGC 1740

AACATATTTA TATTCGCCAT CTTCACCAAG TTCTTTTAAA TATGTTAACT TTTGTGCTTC 1800

TGCATTTTCA GTAAATAGTT AAAAAAGCGT ATAAAAATAG CACCACTTTC TCTTTAkCTG 1860

TCTAAAAAGG ATGCTATTTA TCTTTTGAAT TTGAATTCTT TTTCGCTTTT TCTATACTTT 1920

CAAATTCTTC AACTAATTCT TTAAAAGATT CACTCAATTC TTTTGCAGTT ACATTTCCAT 1980

CTAATTGTGA ATCTAACATA ATTAAAATCA TCTCACTTTA TATTTAATCa TATTTATACT 2040

ATAAAGTTTT TTCAATAATT TTTCAATATG GCTATCATTA ATGATATTAA TATGCGTAAA 2100

ATATTTAGCA CAAAATTTAC TCACTATTTT ACCATGAAAC CTATTTGACT TGGTAATAAA 2160

TTTTACTTGT CCCTTATTAG TAACGATTGT CATTGATTTT ATTGATGGAT GCTTAAAAAA 2220

TGTAAATAAA TCATATTCTG AAAATCCTGA CTGTCCAGGA TGGTTATGTA ACATAACAAT 2280

TGAATTCGGT TTACTGTTAA ATAATAATTC GGTTGCTTGT TACCCTGGCA CAAAAGATAC 2340

ACTATCTTGA TTGACATATA CTTTTGTAAA TTTACCATCT TTTAACAAAT AAGCTACTTC 2400

ATTGCTATCA TTGTTTTCCA TGAGATATAC CTCCGTTTAT AGtCTGTCXSA CTGATATTCC 2460

GAAGGTTAAA ATTTTTAAAG TAAACTTTCT TTATCCCCAT GCTACGAGTT CAGATTCAG6 2580

AAATAGCCCT TTACTAGTAT TTATGTATAT TCTGTCTATG GCATGAATAA AATAATTATC 2640 

TCnTCTATTT tTTTCTAAAC TAGATTTTTC AGCATTGATA ACTTCAAGAC TATCTATATC 2700 

CATTTGAATA ATACCAGGCT TAATATTTTC ATCATTATTA GGAAAATATT TATATGTAAC 2760 

ACTTTTATCA TTAATTTCTT TTATTTTTAA TATTAGCAAT CATTTCCACC TCTAATTAAT 2820 

TAAAATACTA TAATTATATT TTATTTCTGT AAGTTTATGT GCCTCTATAT AGTGTAAATT 2880 

ATACTTATTC ATTAGATAGT GTTCAAGAGC TTCATGTTTC TACATTATTA TATCCATTTT 2940 

TTTAATATTT TTCCCTTCTC TTAAACGTTG CCAACTTTGA GCCATATAAA AGTCAGGATC 3000 

AAATTGTTTA AATCCACTTT CTAATAAATA CTTATTTTCA AATATATGTT CATAAACTCT 3060 

TTGAATTAAA TTTTTATTTA TATTAgTATT TTTAGCAATT TTAGAAATCT CTATCTGTTT 3120 

ATCTOGATTT CTAACTGAAT TATAATAAAT TTGAGCATGT CTGTTCCTTT TGATACCGTA 3180 

TTCATCACTT TTATTATTAA GTGCACCTGA TTCAATAAAA CAACCTTCTA CTTGATATTT 3240 

ACCTTCACGC TCTTTAAAGA ACTTGTCTCG CCAATTGCCX5 ACATGTGGCA CTGTGGTACT 3300 

TCTACACCAA GGATGCATAG GTGGCGCATT CACACCTGGT ATCATATCTT TAACTTTAAA 3360 

TATTTTTCCG TTGAGTGAAT GACATAATTT AGATGTTTTA CTATCTATTT TGGCAACATA 3420 

TTTATATTCG CCATCTTCCA CCAAGTTCyT TTAAATATGT TAACTTTTGT GACTCtTCyr 3480 

TTTCAACGAA TAATGAAAAA AGCATATAAA AATAGCATCG CTTTCTCTTT ATCTGTGTAG 3540

AAAGGATGCT TTAATACCaT GCTATTTTAT AATTTTcGGG AAATTcTTGC TTCTCGATAA 3600 

AGTCTCTTAC TACAGAAAAA GACTTATTAC GATATAACAT AAAATATTCT TCATTTTCTA 3660 

TTTCAGATAA TGAGAAATCT ATAACCTCTG CATCTTTTTT ATTAAAAGTT ACTGAACCTT 3720 

TACCGTTACT TATATCATCT TCAGGTATAT ATTC 3754 
(2) INFORMATION FOR SEQ ID NO: 559: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 815 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 559:
ATTTAACTAA AACTATAaAT AATCAAATGA TATTGGAAGA TATTaGCATA GATATCGAAA 60 
AAGGTAAATT GACTTCTTTA ATTGGACCTA ATGGTGCGGG TAAGAGTACT TTACTTTCAG 120 
CTGATTATAA AAATAATGAC TTGTCGAAAA AAATATCTAT ATTAAAACAA ACAAACCATA 240

CTGAAATGAA TATTACX^GTA GAGCAGTTGG TAAACTTTGG ACGATTCCCT TATTCTAAAG 300 

GTCGTTTGAC GAAAGAGGAT CATGATATTG TCAATGATGC GCTAGATTTG TTGCAACTAC 360 

AAGATATCAG AAATCGTAAT ATTAAGTCAT TATCTGGTGG ACAACGTCAG CGTGCATACA 420 

TTGCAATGAC AATAGCACAA GATACTGAAT ATATTTTGCT AGATGAACCA TTAAATAATT 480

TAGATATGAA GCATGCTGTT CAAATTATGC AAACGTTAAA AATGTTAGCG CATAAAATGA 540 

ATAAAGCGAT TGTCATTGTG TTACATGATA TTAACTTTGC GTCCTGTTAT TCAGATCAQA 600 

TTGTAGCATT GAAAAACGGA CAACTAGTTA AGTCAGATTT GAAAGATAAT GTCATTCAAA 660 

GTAGTGTTTT AAGTGATTTA TATGACATGA ATATTCAAAT TGAACATATA AGAAATCAAA 720 

GGATTTGTTT ATATTTTAAG GATTGATAAT TTGGAGaCAC TTTAAAGGGG TGATGCGCCA 780

ATTAAAGAAG GGTTAAACGT AAAGCATTTA TTTAT 815

(2) INFORMATION FOR SEQ ID NO: 560: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 919 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 560:

GAAACGAATA ATAAATTTAC TGAGTTATTA GTTGAAAAAG CTAATAAACA TGATGATGTT 60 

CTCGATAmGT TGATTAATAT TTTAAAATAA GCGATACACA CTACTAAAAT TGTATTATTA 120

TTATGTTnAA TTGACnCCTC CTAAATTTGC AAAGATAGCA ATTTAGGAGG CGTGTTTATT 180 

TTTATTGACG TCTAACTCTA AAAGATATAA ATTAGACATT TACAAATGAT GTAAATAAC6 240 

CAATTTCTAT CATCGCTGAT AACAATTCAT GGTTTAATAT GCAATGAGCA TATACTTTTT 300

AAATAGTATT ATTCACTAGT TTTAACAATC AATTAATTGG TATATGATAC TTTTATTGGT 360 

TATTTTTATC CCATAGTGT6 ATAATTACTA TTTTTCATTC ATAATAAAGG TTTAAAGCAT 420 

GTTAATAGTG TGTAAGATTA ACATGTACTG AAAAACATGT TTAACATAAT GATATAAGGA 480 

GTGACGTACA TGATCCGTCT AGGTAAAATG TCAGATTTAG ATCAAATCTT AAATCTAGTA 540 

GAAGAAGCAA AAGAATTAAT GAAAGAACAC GACAACGAGC AATGGGACGA TCAGTACCCA 600 

CTTTTAGAAC ATTTTGAAGA AGATATTGCT AAAGATTATT TGTACQTATT AGAGGAAAAT 660 

GACAAAATTT ATGGCTTTAT TGTTGTCGAC CAAGACCAAG CAGAATGGTA yGnyGACATT 720 

TATAAAGGAG CTGCTACAGA ATTATTCAAT TATGTTATTG ATGTAGTTAA AGCACGTGGT 840

GCAGAAGTTA TTTTAACGGA CACCTTTGCG TTAAACAAAC CTGCACAAGG TTTATTTGCC 900 

AAATTTGGAT TTCATAAGG 919 



(2) INFORMATION FOR SEQ ID NO: 561: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 518 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 561:
ATCATATAAA CCGGCTGCTT CTAAAAACGA AAATACTGTT ACTGGACCTA AAAATTTAAA 60 

CCCGTATTGT TTTAAATCTT TAGATAGTTG TGTTGCAGTA TCATCAACTG TGATACGATC 120
AGAAGCATGT TCATACTGCA AATCTTTAGG CTTACCATTT ACATATGACC ATAAAAATTT 180 
ACTAAAACTA CCATATGCTT GTTCAATTTT TAAATACCCT TGAGCTTGAT TAACAATTGC 240 

TTCTAATTTT TTACGATGAT GAACGATATT TGGAAAAGTC ATTAAGCGGT CGATATCTTG 300
AGCGGTCATT TGTGCTACCT TTTCTGGTTC GAAATCATAA AATGCTTCTT CATAGGCTTC 360 
' mrriTiTi ' AAAATAGTTA ACCAAGATAG CCCAGCATGT TOTGarrCTA ATGCTAAAAG 420 

TTTAAACAAT GCCTTGCnAT CATAGAGOGG TTGTCCCCAT ACATGGATCm TGATAGTCTA 480 
AGTAGACTGG GATCTTTAGT ACCAAATGCG CATTCATT 518 
(2) INFORMATION FOR SEQ ID NO: 562: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1539 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 562:

CTTATTTAAA AGTAAATCAA TCAATGTATT ATAATCCGAA TAGTCCGCAT AAAGCTGGTT 60

TGCaGGCAAA tCAATTACTA CAACAAGCAA AAACCCAAAT TAATGCAATG rTTaATTCAA 120

AAACAAATTA TGATGTTGTA TTCACTAGTG GTGCACTGAA TCCAATAATC TTGCTTTAAA 180

AGGTATTGCC TATCGTAAAT TTGATACAGC GAAGGAAATA ATTACATCCG TGTTAGAGCA 240

TCCGTCCGTA TTAGAGGTTG TAAGATATTT GGAAGCACAC GAAGGATTTA AAGTTAAATA 300

CAAAGTCGGT TTAGTAACAT GTATGTATGT AAATAATGTA ACTGGACAAA TACAGCCTAT 420

TCCACAAATG GCTAAAGTTA TAAAAAATTA TCCTAAGGCA CATTTTCATG TAGATGCGGT 480 

TCAAGCATTC GGCAAAATTT CAATCGATCT CAATAACATA GATAGTATTA GTTTAAGTGG 540 

ACACAAGTTT AATGGTTTAA AAGGACAAGG CGTCTTACTT GTAAATCACA TTCAAAATGT 600 

TGAACCAACT GTCCATGGTG GTGGTCAAGA ATATGGTGTT AGAAGTGGAA CAGTTAATTT 660 

GCCAAATGAT ATTGCAATGG TTAAAGCGAT GAAGATAGCT AATGAAAACT TTGAAGCATT 720 

GAATGCATTT GTTACTGAGT TAAATAATGA CGTCCGTCAA TTTTTAAATA AATATCATGG 780 

AGTTTATATT AATTCTTCAA CTTCAGGTTC ACCATTCGTT TTAAATATTA GTTTTCCTGG 840 

CGTAAAAGGT GAAGTATTAG TTAATGCTTT TTCAAAATAT GACATTATGA TATCTACX5AC 900 

AAGTGCTTGT TCATCTAAAC GTAATAAATT AAATGAAGTA TTGGCTOCAA TGGGATTATC 960 

AGACAAATCT ATTGAAGGTA GTATAAGATT ATCATTTGGG GCTACTACAA CTAAAGAAGA 1020

TATAGCGAGG TTTAAAGAAA TATTTATCAT CATTTATGAG GAAATTAAGG AGTTGCTAAA 1080 

ATAATGAAGT ATGATCACTT GCTTGTTAGA TACGGGGAGT TAACATTAAA GGGTTCAAAT 1140 

AGAAAGAAAT TTGTAAATCA ATTAAGAAAT AATGTAAATA AGTCaTTAAA AGGACTTGAT 1200 

GGGTTTGTCX3 TTAAAGGCaA ACGAGATCGT ATGTATATTG AACTTGAAGA CCATGCaGAT 1260 

ATAAATGAAA TAACATATCG ATTATCAAAA ATTTTCGGTA TTAAATCTAT TAGTCCAGTA 1320 

TTAAAAGTAG AAAAAACAAT AGAGGCAATA AGTGCAGCGG CAATTAAATT gCGCAGaATT 1380

TGAAGaAAAC AGCACATTTA AAATTGATGT GAAGCGTGCC CGATTAAAAT TTCCCCAATG 1440 

GATACGGTAT GGAATTACAG CGTGGAATTG GGGTGGTGCC AGTATTGGAG CACTTCGCCA 1500 

TATTTCCAGT GGATGTCCAA CGTCCCAGnC CCAGGAATT 1539 
(2) INFORMATION FOR SEQ ID NO: 563: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 968 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 563:

ATAnCGTATA CATGTGTTCT TTAAAATTGT GATAAGGAGT TTAGGATGGT TTATTTAAAA 60 

TCAATAGATG CCATTGGATT TAAGTCTTTT GCAGATCAAA CCAATGTTCA ATTCGATAAA 120 

GGTGTAACTG CAATTGTTGG TCCAAATGGA AGCGGTAAAA GTAATATTAC AGATGCTATT 180 

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ATCTTCTCAG GTGCAGAACA TCGCAAAGCT CAAAATTATG CTGAAOTACA GTTAAGATTA 300 

GATAATCATT CTAA7U\AGCT CAGTXTTTGAT GAAAACGAAG TTATTGTAAC AAGAAGATTG 360 

TATCX3AAGTG GTGAAAGTGA GTACTACATA AATAATGACC GTGCAAGATT AAAAGATATT 420 

GCCGATTTAT TTTTAGATTC TGGATTGGGA AAAGAAGCGT ATAGCATTaT CTCGCAAGGT 480 

AGAGTTGATG AAATACTAAA TGCTAAACCA ATTGATAGAC GTCAAATTAT TGAAGAATCG 540 

GCTGGTGTAC TTAAATATAA AAAACGTAAG GCTGAATCAT TAAATAAACT TGACCAAACA 600 

GAAGATAATT TAACGAGAGT AGAAGACATT TTATATGATT TGGAAGGTCG CGTAGAACCT 660 

CTAAAAGAGG AGGCAGCTAT AGCTAAAGAA TATAAGACAC TTTCACATCA AATGAAACAT 720 

AGTGACATTG TAGTTACAGT sCACGATATT GATCAATATA CAAATGACAA TAGACAATTA 780 

GATCAACGTT TAAATGATTT ACAAGGCCAA CAAGCAAATA AAGAAGCTGA CAAGCAACGT 840 

TTAAGCCAAC AAATTCAACA ATATAAAGGT AAACGTCATC AACTTGATAA TGATGTTGAA 900 

TCgCTTAATT ATCAATTAGT AAAAGCTACX3 GAAGCCTTTG AAAAATATAC GGGACAATTA 960 

AATGTTTT 968 
(2) INFORMATION FOR SEQ ID NO: 564: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 436 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 564: 

TTGTGTGTAT GCATTCAATG TGCTCTGTTT GTAAATGGCT AGCTATATAA TTTAgGATTC 60 

GAGGATCGTC ATCGACAACA AGACATTGCA CCATAGCTAT AAACTCCCTT ATCTTTTTCA 120 

TTTATTATAC ATGTAAAATA TTTTTGCGTA AAAAAACAAT TGTTCATATT GAGTTCATAT 180 

TTCAACCTTA TACTGACGCT AAAGAAGAAA TAGGGAGAAG TGAATCGATA TGAAATTAGC 240 

GATAAAAGAG ATTATGTTTT ACAAATTTCG TTATATTTTA ATCACATTAA TCATTCTTTT 300 

ATTAAGTATT ATGGTGTTAT TTATTAGTGG TTTAGCTCAn GGOCTTGGTA GGGaGAATAT 360 

TTCGTwATTT GAACACTTTG GATAATGATG aaTATGtTGT TCaAAAAATG AAAGAGCCGC 420 

aAATTGaGAA ATCGCA 436 


(2) INFORMATION FOR SEQ ID NO: 565: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2554 base pairs 

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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 565: 

ATGTTTTGTC AATATCAATT GTTTGAGATA AATCCGCTTG TATAAAACGA ACTTTATCAT 60 

CATTAAATTT GCTAGTTAAT TCATTTATAT CAGTACGATA ATATTGCACA TAAACTTCAA 120 

ATCCATCAGT TAATAATTGT TTGACTATCT CAGAACCAAT TGAACCAGAA CCACCTAATA 180 

CTAATGCTTT CATTACTTTT TAATCTCCAA ACGACTATCG ACTTGCTGAT CTAAATTTAA 240 

ATATAACGAT GACGTTTCGT TAATACTATC TAACGTGATA TTTTCAACAA TGTCTAACAT 300 

GTCAAACAC6 CTAACACCTT CAAAATACAA TTTAGTATAT T6ATTAGCAA TATATTCAGG 360 

TGAGTTTAAA CTTGATATGA ATTCACCTAT AAATTGCTTT TTCAAAAGTT CAAATGCTTC 420 

TGCATCTTGG AAAtTGCCTT TTTTATCACG CAACTCATCT AATAATAATT TTTTTAATTT 480 

ATCTGGTTCT TCAGTAGCAC TTGTCACGAT TGAAAAACTA TACGTCGGCT CTAGTACAAA 540 

TTGATAACCA AATGTATCAT CGATAAGTCC TTCGTTTAAT AAATTCTGAT AAAAATCTGT 600 

TTCTTCCCCA AAAATTAACT CAAAGAATAA TGACATTTCT AAATCACGTT GTACATATTT 660 

TTGAGGCGCT TCTTGTAATG GTTTATTTTT AAAACCAAGC ATTAGTCTTG GTGATTGAAT 720 

TTTCATAGAT TCAGTAACAA ATGCTTCTTT AACATCcTCC GGTTCATCAA CAAGTCCTCG 780 

TTCGATTTTG GGTTGGTTAA CTTTATTACG AGCATCCTCG TGTTGTTTTA CTATTCgACA 840 

TATTGCTTCA GGATCCACAT CGCCAACAAC AAATAAAACC ATATTTGATG GATGATAAAA 900 

CGTTTCATAA CATAGATACA AATCATCTTT TGTAATATCG TATATACTTT CTACACTACC 960 

GGCAATATCA ACACXSTATTG GATGTTGTTG ATACATTGCA CGCAAtGTAT TaAACATTAA 1020 

TTTATATCCA GGTTGCTCTT GaTACATTTT TATTTCTTCT GCAATAATAC CTTTTTCTTT 1080 

ATCAACAGTT TCTTTTGTAA AATAAGGCGT TTCaACCATT GTAAGTAAAC GTTTAATGTT 1140 

GTTTTCAATA TtATCAGTTG CACTGAACAA GTAGCTTGTA CX^TCAAAGC TTGtAAACGC 1200 

ATTTGCTTGT GCGTTATCTT CAGCAAACGC AGTAAATAAG CTTCTTCTTC TTTTTCAAAT 1260 

AATTTATGTT CcTAAAAAGT GAGCAACTCC ATCAGGTACA GTAACAAATT GGTCTTGTCC 1320 

AAGGGGTTTG AATTGATTAT CTAATGAACC AAATTGTGTA GTGTAAGTGA CAAATGTCTT 1380 

TTGAAAACCT GGlTkGGGGA TAATAAATAA TCGTAAACCA TTTTCTAATt CTTGTTCGAA 1440 

tACTCTTTCG TCTATTAATT CATAATAACG CTCTTTCATT ATTTATCCCC TCCTTTTGTC 1500 

AACACATAGA TtGTATCTAA AAATGCTTTT TCAGCAACAG AAACAATATC TTCGCGACTT 1560 
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TTATGCATAA TCTCTATAAT ACTTTTCGGA CGATCTTCAG ATTCATATCG ATGAGAAATG X680 

ATTACTTTTT TAGCTAACTC TAATTrTTCT TCAGTGAAAT CTCCTGCTTT TATTTTTTCA 1740 

AATTCACTTA TAATAGTGTC TTTTGCAGTT TCGTACTTAT CACTTGAAAC CCCACTCAAA 1800 

ACAAATAAAT AGCCATTTTT GCCATCAATT TGTGAATGTA TAGAGTACGC TAAACTTTGC 1860 

TTTTCTCGCA CTTCATTAAA TAAAACAGAT GAAGGATCTC CTCCAAACAT CATGTTAAAT 1920 

ACAACAAAGG CAGCATATCC ACTTTGTCCA TATTGTGTTG GAAAACGGTA TCCCATATTT 1980 

AATTTAGCTT GATCCACGTC ATCATATTCA ACAATATAAT CAACTTCTTC ATCGTGTAAA 2040 

TGATGAGTAG AATGTTGGAA TTGATGTTTA TCGAATGGTT TAAGTGCAAA TTTTTCACGT 2100 

ATTTGTTTCT CAACACTTTC AGGTTCTACA TTGCCGACAA CATAAACAGA ACATTGATCA 2160 

TTATTAATCA TTGATTGATA TGTATGATAT AGTGTTTCAG CAGTAATATG TGGGATTTGT 2220 

TCTAGTTGTC CTGTAGATAA GTATTTATAT GCTTCATTTT CAAACATATG GTCGAGTAAT 2280 

TTTAAAAACG AATATTGTGC TTTATTATCT ACCATTGCTT CTATTTTTTT GGCTAATAAT 2340 

GTTTTCTCTT GGTTAACAAA ATTATCATTG AATGCTTTAT TTTCAATTAA TGGATTCCAA 2400 

ATGATTTCTT GtAATAAATC TAATCCTTGa TTAAATAATG AwTCACCGkT TCyTAAATAA 2460 

CGkkCaTTAA caATTyCTAA tGaAAATGtA ATgACaTGCT GaTCTTTGAA TTTTGAAATT 2520 

GTACTATTCA CATACGCACC ATATAAATCG GCTA 2554 

(2) INFORMATION FOR SEQ ID NO: 566: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1424 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 566: 

TnTCGCTATT TTnAATTGGT TTTGTATGGT TTAAGTTATA TCAATATACA ACAnACCCTA 60 

AAGCTGATAT CCCAGGTATC ATTTTTAGTA CGATTGGTTT TGGTGCTTTG TTATATGGTT 120 

TCTCAGAAGC TGGCAACAAA GGTTGGGGTT CAGTAGAGAT AGAAACAATG TTTGCGATTG 180 

GTATTATCTT TATTATTCTA TTCGTTATTA GAGAATTAAG AATGAAATCA CCAATGTTGA 240 

ATTTAGAAGT ATTGAAATTC CCaACATTTA CATTAACAAC AATTATTAAT ATGGTTGTAA 300 

TGTTAAGTTT ATATGGTGGT ATGATTTTAT TACCGATTTA TTTACAAAAT TTACGCGGAT 360 

TcTCAGCATT AGATTCCGGA TTGTTATTAT TACCTGGTTC TCTAATTATG GGTCTACTAG 420 
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GACTTATGCA 


ACATGGGAAT TAACTAAATT AAATATGGAT ACACCATATA 540 

TGACAATCAT GGGTATCTAT GTACTTCGTT CATTTGGTAT GGCATTTATA ATGATGCCAA 600 

TGGTAACTGC AGCTATTAAT GCGTTACCGG GACGACTTGC CTCTCATGGT AATGCTTTCT 660 

TAAATACGAT GCGTCAATTA GCAGGCTCTA TAGGTACAGC AATCTTAGTT ACTGTAATGA 720 

CAACACAAAC TACACAACAC TTATCAGCTT TTGGGGAAGA GTTAGATAAA ACGAATCCTG 780 

TTGTACAAGA TCATATGCGT GAATTAGCAT CACAATATGG CGGACAAGAA GGCGCAATGA 840 

AAGTGTTACT ACAATTTGTA AATAAACTAG CAACGGTTGA AGGTATTAAT GATGCATTTA 900 

TAGTTGCAAC GATATTTAGC ATCATCGCCT TAATTTTATG TTTATTTTTA CAAAGTAATA 960 

AAAAAGCAAA AGCTACAGCT CAAAAGTTAG ATGCAGATAA TAGTATCAAT CATGAATAAA 1020 

TAAAATAAAT TAATTGAAGT GTGACTAATC AAAAATTATG TTGTGGGGAC ATGATTTTTA 1080 

AAGTATCGGT GCCAAATATG GTTATCX5ATA CTTTTTTTAT TlXSTlXaATTT ATAGAATGTT 1140 

AGAG6AATTA TATTAAAATT TGGCATTGAC GTAGTAGGTC ATTAATAAAG AAAAAOCAGG 1200 

AAGTGGGTCA ACGAAATGAA TTTTGTGAAA ATAACATTTC TGTCCCAATC CCTACTATAT 1260 

AACATTATTT TAAACGAGGC ATGCGATTAC GGAAGAATAA GCTTATAACA AGTAAACCX^V 1320 

TGCTACAGCC AAGTAAAATG ATGCCGTTAT GAATAGCGTC ACTTgCTGTA ATCACTTGAT 1380 

CTGGTGGTAC ATTTAAATAA TATTTTTTGA AAACATCTGC AATT 1424 



(2) INFORMATION FOR SEQ ID NO: 567: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 676 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 567: 
TCTCCCATGT TCTGCTAAAT GACGCATCAC TTTTACTTCA TGAGGCGTCA ATACACGTCC 60 
TTCACCAGCA TTCAAACCGA CAACATTTAA AGGCCCATAT TCAATACGAG ACAGTTTCGT 120 


CACTTGATGA CCAAAATGTT CGAACATTCT TCTGACTTGG CGATTACGAC CTTCTGTAAT 180 
TGTAATTTCA ACCAATGTTG TGTTTTTATC TTTATCTTGT TTCTTAACTT TCACTTCAGC 240 
CGGTTGCGTC ATACCATCTT CTAATTCAAT ACCTTTTTCT AGCGCTTTCA CTTCTTCTCT 300 
CATTAAATAA CCTTTTAATT TCGCAACATA TTTTTTCTTA ATTTGATATC TTGGATGTGT 360 
CATTAAATTA GTAAATTCAC CATCATTtGT GAGTAATAAC AATCCAGAAG TATCATAGTC 420 
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ACGTCCTCTA TCATCAGATA CACTTGTkAT CACTTGAGTT GGkTTATGGA AkAAAATGkA 



540 



AAtTTTGTCT TCTAGTTCTA TTTTAATACC TTCAACTTCA ATCGTATCTG ATGGCTTCAC 600 

TTTTGTTCCT AATTCAGTGA CAGTCGTACC ATTCACTTTC ACTTTTCCTT CAQAAATTAA 660 

AGTTTCTGCC TTACGT 
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(2) INFORMATION FOR SEQ ID NO: 568: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 454 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 568: 

GAAACGGTTC TACCAAAAAA CAGTAAGGGC TAAACCCAAT CATGGTAAGA CAAAAAGTAC 60 
AAATAGCCAT GCCCAAGTTG AACTCGCTGT ACGCCTATTT CTTTCTAAAA AGATAATAAT 120 

AAAAGCCAAT ACTAAATTAA TGATGAATCC AATGGCTAAA ATAATAGTAA ATAACGTTCC 180 

TAAATCGTTT GAAAATGTAA ATCGCATAGT CTTTTCTCCT ATAAAGAAAG GCACAAAAAA 240 

ACATTTTGCA CCTTTCACGT CATATTATTT ATTCACAGAT AAAGTTAAAA TTGcATTGAA 300 

TTCTTCTTCA TTATTTGGGA ATGTTCTTTC TTCTATTTCT TTAATAGTAA TATTTACTAA 360 

TTTTAAATTT GTAGCTTCTT CAGAACTTAA AAAAGCATTA ATGTTTTTTT CTAATAACTC 420 

kAAAGTCTCA GCTGtAAAaG TTTTAAGTTT AATT 454 
(2) INFORMATION FOR SEQ ID NO: 569: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 894 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 569: 

ACGATATTAC CACCCTCTGA AATAACATCA ATCATGTGAT CTGTAAAGTC CCAAGGATGA 60 

CTTGTTGTGA AACGAACTCT TGGAATCGCT ATTTTAGAAA TTGCTTGTAA AAGATCTCCT 120 

AAGTCATATT CTATATCCTG TAAATCTTTA CCATAAGAAT TTACATTTTG ACCTAAAAGC 180 

GTTATTTCTT TGTAACCTTC ACGAGCAAGT TCACGTACTT CATCTATAAT GTCTTCAGGT 240 

CTACGGCTTC GTTCTTTACC TCTTGTAAAT GGAACAATAC AATATGTACA AAACTTATCA 300 
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TCAATAACGT CTCCTTCTTT AGACCATACT TCAACAACCA TTGCTTTAGA TAAGTATGCT 420 

TCTTCTAAAA TTTCTGGTAA ATGATGAATA TTATGTGTAC CAAATATCAT ATCTACATTT 480 

TGATACXSATT TTAAAATTTT ATTCACTACT GACTCTTCTT GTGACATACA ACCACAAACA 540 

CCGATTAAAA TATCAGGTCG TTCTTTTTTC AAATrCTTCA AATTACCTAT TTCACTAAAC 600 

ACTTTGTTCT CGGCATTTTC TCTAATCGCA CATGTATTAA TTAAAATAAC ATCTGCAGTG 660 


TTAATATCAG TCGTkGCTTG aTAGCCTAAT GCyTCmAGTA TACCAGCAAT GACCTCAGTG 720 

TCATGTGCAT TCATTTGACA TCCATATGTT TTAATTAAAA AT6TACGCTC GTTCCCCATA 780 

CCGCGATA1T TTyCATCAAT TtGGgAAATC nCTATTATAA CGAACTTCTk GTtTACCnCC 840 

rrrrrf nCGC tcctttaaaa ttaaggcggc tgataaacag gtccaaaata ttac 894 

(2) INFORMATION FOR SEQ ID NO: 570: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 441 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 570: 
TATCAATCCC ACAGCACATG CTGAACAAGA TCAAACATGG GAGAAGATTA AAGAACGCGG 60 


TGAACTTAGA GTGGGTCTTT CTGCAGATTA TGCACCAATG GAATTTGAGC ATACAGTTAA 120 
TGGTAAGACT GAGTATGCAG GTGTAGATAT TGATTTAGCT AAAAAAATTO CGAAAGATAA 180 
TAATTTAAAA TTAAAAATCG TCAATATGTC ATTTGATAGT TTGTTAGGAG CTCTTAAAAC 240 
TGGAAAAATT GATATTATTA TTTCCGGAAT GACTTCAACG CCTGAACGTA AGAAGCAAGT 300 
TGATTTTTCA GATTCATATA TGATGACTAA AAATATCATG CTTGTAAAGA AAGATAAAGT 360 


TAATGAATAT AAAGATATCm AAGACTTTAA TAATAAAanA GTxiGGGGCAC AAAGGGACTG 420 
AACCAGAAAA AATCGCTCAA C 441 
(2) INFORMATION FOR SEQ ID NO: 571: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1205 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 571: 

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AAGTCGTTCA TCTTTTTCTA CAAGGGTGTA AAAATAATCA ATCATATCGT ATAACGCTTC X20 
TTTACCAATG ATGTCATATG GTGTTGTTGT CATTTAATCA CCCATTTTCA AAAATTTACT 180 
GTTACGAACT TAAGTTAATA TATAACTAAT ATAACATGAT TTTAAACATT TGAAAGAAAT 240 
ATGCATATTT GCCAATTTAA TTTATATTGT TTGAAAGTGT TTCTTTTTTC TTGAAAAAAC 300 

GTTGAACTTT ATTTAAAGGT tOATGATGTT CGAGGTTTAG TTCGTTtAAT AAAGATtGGA 360 


ACTTTTGTAA ACCTTGATTA TAGTCTTTAA CTTCGAACTC TAACTCATAA TCCGTAGTAT 420 
CGAAATACTC ACTTTTATCT AAAACCAGTA AATCACCTTT ATATTTAGTT TCTTGGCGAT 480 
ATGTCGTTAA TGCACCAAGT ATTGATAAAG TTGTATCTTT TACACCAAAC TGTTCAACTA 540 
TAATTTGACG AATGTCATCT GGAAGATTGT CGTTTGAAAT AATCAAGTTC ATCTCTCGTT 600 
TAATGTCGAC GATATAGTTG TATTCTAATA GACCAACCTT TGCTGGTGTC TTTAAAGTCA 660 
TTTCATATTG ATTGTCTTTA ACTCTTATGC GTAGTGCAGA GCGATGTTCC TTTAATTTGA 720 
AATCGGGTGT ATCAATATAG TAATTGACTT GCTTAnAAAG CACACTGTCT TTAAAATATT 780 
TCTCTTGCAA TTTATTATAG ATTGAtGCAG TTATCATTTG TtTAAATTCT ATtTCATGAT 840 


TTGTTGCCAT GATATGTATA CACCTCGTAT CAAATTCAAT TTATCTTAAC TATATTATGA 900 
ATGACAAAGT TGAATTTTAA AAGTAATTTC CTTTATCTAT TATCAATGTT AATTTGACCA 960 

TTAAAAATAG TGTTCGTAAG TGTTTTGTAT TATTGaATTG TGTTAAAATG TTATGGAATA 1020 
AGAGGAGGAT TAAGCATGsG TTTwTATATT AATGAAATTA AAATTAAAGA TGACATACTT 1080 
TATTGTTATA CAGAAGATTC TATTAAAGGA TTATCTGAAG TAGGACAAAT GCTCGTTGAT 1140 

AGTGATAATT ATGCCTTTGC GTATACATTA GATGATGGTA AAGCGTATGC TTATCTCATT 1200 
TTCGT 1205 
(2) INFORMATION FOR SEQ ID NO: 572: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 570 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 572: 
TGAAGAAATA GCAATGATGA AATGCCGTAT GTTATATGAG ACGGGGTCAT TTCTTGAATT 60 
AAGAGAAGAA ACAATTGTCT TATTGAAAAC TGGCATACAA CAATATGATG CATTGATGAT 120 
TTATTACGTG AAAAGTTTGA TTGGTTTGGG ACAATATTTT GAAGCGGTAG AAGTAATTCA 180 

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ATTTGCTAAG TCAAAATTAA TTGAAGATGA AAAACGATTG ACTCAGTCAT TAGCTGATTT 300 

TGrTACGTTA TCAATGAGGG AACAGACGCA CTTGATTTTG AAGTTAATAG ACAATGGTCA 360 

TTTTCAATTT CAAGAAACGG TATTATATAT AT3cAAaAyCT AATaCGTACa GTtATAACCT 420 

CATTAGTTTA ATGATTGAGT ATTTAAGGTT CGCAAATTGT ACACAAGAAC TGACAATTGA 480 

AAAGTATGGT ATGGATGTAA CTTTTGTACC AGCTAATTTA AAAGGGCTAG AACATACAAC 540 

ACTTAAAGAA AAAGTTATAC CTAACX5TTAT 570 
(2) INFORMATION FOR SEQ ID NO: 573: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 939 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 573: 

GTTGAATGGT TAGCAGCTGC AGTTGTATTA TATTTCTGTG GTGTAATTGT TGACGCTCAT 60 

GTATCATTCA TGTCCTTTAT TGCAATATTT ATCATTGCTG CATTATCAGG TTTAGTCAGC 120 

TTTATTCCTG GTGGTTTCGG CGCTTTCGAT TTAGTTGTAT TACTAGGATT TAAAACTTTA 180 

GGTGTCCCTG AGGAAAAAGT ATTATTAATG CTACTTCTAT ATCGTTTTGC GTACTATTTT 240 

GTACCGGTAA TTATTGCATT AATTTTATCA TCATTTGAAT TTGGTACATC AGCTAAGAAG 300 

TACATTGAGG GATCTAAATA CTTTATTCCT GCTAAAGATG TTACGTCATT TTTAATGTCT 360 

TATCAAAAGG ATATTATTGC TAAAATTCCA TCATTATCAT TAGCAATTTT AGTATTCTTT 420 

ACAAGTATGA TCTTTTTTGT AAATAACTTA ACGATTGTkT ACGATGCTTT tATATGATGG 480 

AAATCACTTA ACGTATTATA TTCTATtGGC AATTCATACT AGTGCTTGTT TATTACTTTT 540 

ACTGAATGTA GTTGGTATTT ATAAGCAAAG TAGACGTGCC ATTATCTTTG CTATGATTTC 600 

AATTTTATTA ATCACAGTGG CGACATTCTT CACTTACGCT TCATATATTT TAATAACATG 660 

GTTAGCTATT ATTTTTGTTC TGCTTATTGT AGCTTTCCGT AGAGC6AATA GGTTGAAAC6 720 

CCCAGTAAGA ATGAGAAATA TAGTTGCAAT GCTTTTATTC AGTTTATTTA TTTTATATGT 780 

TAACCATATA TTTaTTGCTG GAACGTTATA TGCATTAGAT ATTTATACGA TTGAAATGCA 840 

TACATCTGTA TTGCGCTATT ACTTCTGGCT TACGATTTTA ATCATCGCTA TCATCATAGG 900 

TATGATTGCA TGGTTGTTTG ATTATCAATT TAGCAAAGT 939 
(2) INFORMATION FOR SEQ ID NO: 574: 

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10 



(A) LENGTH: 1059 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 574: 

GAATTAATTA AATATTACAC ACAGCCTCAT TTTTCATTTT CAAATAAATG GCTATATCAA 60 

TATGATAATG GAAACATTTA TGTTGAACTT AnGAGATATT CATGGTCAGC ACATATATCT 120 

TTATGGGGCG CTGAAaGTyG GGGAAATATT AATCAGTTAA AAGATCGTTA CGTAGATGTG 180 

TTTGGACTAA AAGACAAAGA TACTGATCAG TTATGGTGGT CTTATAGAGA GACATTTACA 240 

GGTGGCGTTA CACCAGCCGC AAAACCTTCT GATAAAACTT ATAATCTTTT TGTGCAATAC 300 

AAAGATAAAC TACAAACGAT TATTGGTGCG CATAAAATAT ACCAAGGCAA TAAACCAGTA 360 

TTAACATTGA AAGAAATCGA TTTCCGTGCA CGAGAAGCGT TAATAAAAAA TAAAATATTA 420 

TATAACGAAA ATCGTAATAA AGGTAAGCTT AAGATCACCG GTGGCGGTAA TAACTACACT 480 

ATTGATTTAA GCAAAAGATT ACATTCAGAT CTAGCAAATG TTTATGTTAA AAATCCTAAT 540 

AAAATAACTG TTGACGTCCT CTTTGATTAG TATATGAAGG TGACTTATAC TTCATGCACT 600 

TTAATTCCAA ATCAGATTAT TTAAATGATA ATTTTTAAAG TGTATGATGT ATATAATAGG 660 

TAAAATTTTC TATATATTTA AATGGAATTG GGAGTAGGAA TGTGACAGAA ATAGTATTTT 720 

ATAAAATTTA TTCtTGTCAC TCCCCAACTT GCACATTATT GTAAGCTGAC TTTCCGCCAG 780 

CTTCTATGTT GGGGCCCCGC CAACTTGCAT TGTCTGTAGA aTTTCTTTTT GAAATTCTCT 840 

ATGTTGGGGC CCCGCCTATA ATTGAAAAAT GCTTGTTACA TGGGCATTTT CATTCGGTCA 900 

ACTACTACCA ATATAATATT GtAGaGCCTA AGACATTGAT TTATTATGTC TTAGGCTCTA 960 

TTCCTTCATT TAATGATTAA nTTATTATAG CAATACTTTA TTGTCCCATG ATTAGTGTTC 1020 

TTTTAATGAG ACATAGTAAC TATAAAGTTT AATAATCGT 1059 
(2) INFORMATION FOR SEQ ID NO: 575: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 574 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 575: 
GTTTGCTTTA GGTCCTGTTT CATATTTATA CTTCGAAGGA TTTACCTTTT TGaAGTCTGG 60 
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TTCAACTkGC TTTTTATTCT TTTCGAAATC AGCTGGTTGA GTAGTTATGA GTTCATTATT 180 

TTTATTAGmA TAAATCTTAC CATTAACATA TTTATAATCT TTTGTTATAA AGTCACCATT 240 

TCTGAATGGA ACTACTTGAT TATGACCTTT AGAGAATAAA TCAGTACCGA ACATTAAATA 300 

GTTCTTCGTA TCTATACCAG CCAAATGTAA AATTGTTGGC ATTACATCGA CTTGACCAGC 360 

ATATTCATTA TTGATACCAC CAGATTTACC AGGGATTTTA ATCCAGAAAC CAGTTCTGTT 420 

TAAATCTGTA AATTTAGCCG GTGTGATTTT TTCACCTAAT AGTTTTTCCA TGGCATTGTT 480 

ATGGTTTTCA GAGATACCAT AGTGGTCACC ATAAATCATA ATCACTGAAT TGTCATATAA 540 

TCCTTTTTTC TTCAAGTCAT TAATATATTC TTCT 574 
(2) INFORMATION FOR SEQ ID NO: 576: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 796 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 576: 

CAATGTTTTA TAGTACAATA TATTTTnAAT AATACTCGTT AAGGAGAATG ATATGATATC 60 

AATTCACGCA ATTTCGACAG GAAAAATCCA AGATTTGCCT TATAGCTCGA AAAGACCGAT 120 


GCGCTCTGCT TTAGATAAAA CTAAGATTTC ACAAACAACA TGGTTATCTT CAACTGGTTT 180 

CACTGGTGAT GAACAGGCTT ATAAAGATCA TGGTGGACCA CATAAAGCAG TTTGTGGGTT 240 

TAGTAAGCAT AATTATGCAC TGTATCAAGA TGATTTACCT ACACTACCTA CTCATGCGAT 300 

GTTTGGAGAG AATTTAACAT TTGATTATTT AGACGAATCT GATGTTTACT TTGGTAATCA 360 

ATATCGTTTA GGTGAAGCGT TAATTGAGGT TTCTGAAATT AGAGAACCAT ACTGGAAAAT 420 

TCAAGCAAAA TATAATATTC CTGATTTAGT GAAGCGCATG TCTACATCTG GTAAAACAGG 480 

TTTCTATTTC CGGGTATTAA AACAAGGCTA TGTATCTCCA AATGATCAGC TTTACTTAAT 540 

ACAAGAAGCA CCAATCGAAC ATCGTTTATC TGTACAACAG CTTAATGACC TTTATTATAA 600 

TGATAGACAA AATCAAGaTA TGTTACGATA TGCACTAAAC AATCCATTTC TGTCACCAAC 660 

AAGACGCGAT AAACTTCAAA AAATGTATAA CAGAACATTG GAAATAATTA CCTTTCATTn 720 

ATAAGTGTTA AATGAACTTT TCAAAACAnA AAGGAATCAA CTTCACACAT CGTTTGTATG 780 

AATAGTCTTA TCTATA 796 
(2) INFORMATION FOR SEQ ID NO: 577: 

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(A) LENGTH: 1095 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 577: 



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30 



AGAAATTATG ACAAAATATA AAGATGGAAA 6TTAGTTTAT GCATCAGTCG AACCAGGATC 60 

TTAC6TAATA CATAAAGATO ATGCAATTAA ATATGACGAT TATTCTAAGT TAAAAAAATT 120 

AAGTCAGCTA ACTAAACTTG ATCATCCAAA ACCAGTTCCA TATAGCGTaC TCAAATCAAA 180 

TCTTTCGGAG TACCTTTAAC AAGCGTTTCA TTTATGACAC ATGGATCAAA GGATACTAAA 240 

GATGAAGTGT TGCCGGCATT GGCCTATTTC ACTTTTTCAC CAAAAAATTA TGAAGACAAG 300 

TCTAATCCAG ATCCAAAAGT TTTAAATTTA GTACATATGG ATTTCTTAAA TGCATCTAGT 360 

GATTTTGGTA ACGCACATTT TCiTTGTTTTA AGTAAATATA TTAAAGAGTA TGAATCAAAC 420 

TATGAAACAG CGTCAGATGA TTCTTTAAAA TAGTATTTAC TGTGTGAAAA ATAAATAGTG 480 

TACTACATTA AATAATCGCA ATAATAATCC CGATAAACAA TCAGCATTAC TGCTTATCAC 540 

ATAGAGTTCG TAATAACTAT AACTCTATGA TTCGCAAATA ATAAATGATT GTCATCGGGA 600 

TTTATTTTTA TCAATTTATA AAGTGACATT ACCTTGTTCA TCAGCAGGTT TGAAAACAGT 660 

AATCACTGCA CTAATAATTG CTAAAATGTG TGGGATACCT GTCCAACAGA ATATTAAGTG 720 

TAGAATACCT TGCATATTCT TGCCG6CATA AAATTTATGA ATACCAAAAC TACCTAAOAA 780 

CAATGCTAAT AAAATATAAA TAACTTTGTT TACTTGCATT TCTTTCCCTC CAGTTGAATT 840 

GCTTATAATG ACATTAGCTT CTCTTTTTAT TATACCCACT TTTAGTTCAA ACATTCTAGT 900 

TTAAGCATTC CCAATCATCT AAATTTCAGT TATTCAATCC TTACAATAAA TTTAGGATTA 960 

CATTTCAGTT GCATTGTATT ATTTTACGTG TGAAATATAC GTAATGAATC ACATGACAAy 1020 

CTyCAAATTG AAAAATATAC ATTCTATGAT GTAAGGTCGC ATTTTTAATA TATTTACGTn 1080 

AAAATAGTTT GGATG 1095 


(2) INFORMATION FOR SEQ ID NO: 578: 





(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 489 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 578: 

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ACTGCCAAAA TAATCATAGC CATGTTCTAC AGCTGCTTTC GCTACAATAT CCAAACGCAT 120 

TTCAAAACAA GCGGTACAAC GTAAGCCGCC TTCTTTTTCA TCAGCTAATT CTTTATCCTT 180 


CACCATTTTC ACAAACTTAT GTGGTTCATA AGGTGCTTCA ATATACTTCA CATTCGCACC 240 

AGTCTTGCGA TTAAAATCTT CCACAAATTG TTCTTGCACT TTAGCACGTC GTAAGTACTC 300 

ATTTTTCGGA TGAATATTTG AATTCGCGAA ATAAATTGCA ATGTCTGCAT ATTGTGTTAA 360 


AAACTCTAAT GTATATGTAC TACAAGGTGC ACAACAACTA TGCAATAAGa TTTTAGGTCT 420 

GATTGCTTCT CTTTCCCACT GsCCGATTAA TnTCTTCAAC ACCTTGTCCA TAATTAATTT 480 

GTKlGATTTn 489 
(2) INFORMATION FOR SEQ ID NO: 579: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1287 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 579: 

CGTACTGCAT CATTTGTTGA AAAATTTGCA AACTTCTTCA AGATGTTATG GCTTAGACTT 60 

AAAGCGATGA AGCACTACAA AGCCTTAAAT AAAGAATCTA AGAAGCAAGA ATTTGAAAAT 120 


TCATTCAAAG ATGTTCAAAA AATTATGCGT ATTGTGAATC ACAATATTAT TTTACGCTTA 180 

AAAGAAGAAC AAAATAGTAC AAATGTACTT GAGGTTAGCT TAGTCATTAA TCATTACTAT 240 

GATATGAGTC GCTCATTAAA GTGGCGTGCA CAACGTCGAA AAGAACGTCA AGAAAACAGC 300 

AATCAAATCA TACCGCAAGC TATGTTCCAT AACCACAAAT TGGAAGCATT GTACTTACAA 360 

CGTCATCTTT TAGATGAATT AATTCGCAAA AATAAAATCA ACAATATCGT TGCAGCTCAA 420 

ATTCGAGAAA ATATCAATTa CAACGAAATT GTCTTGTCTT TACAGTCCAA ACATTaAGCA 480 

AGaCwTaCaw TmCCCCCGTA CATATGAGAC AAAGTCATTA TCATCTCATA TGTACGGGGT 540 

TTTTATATTC AACATCAAAA AATCAGATTG ATGAAAAGTA AATAACCTTT CATCAATCCG 600 


ATTTGATTAT AGAATCTATT TTTTAAGTTT AAATGGAATT GTACATACGT TAACATTCTT 660 

TTGATAAATT AAATACAATT TCATACGCAA ACTAGTTTGA TTGTGTAATA AATTATGCCA 720 

ACGTTTCTTA GTAATAAATT CTGGTATCAC TACTGTAATC ATATAGTTTT GATCGTTGGC 780 


TTTACGATTA ATCTTATCGA TAAAACGTGA AATTGGTCGT ATAATACTGC GATATTCCGA 840 

ATGTAAAATA ACTAATCTTA CATCTGGGAA ATGACGTTTC CATTTCTCTT GGAATGCTTT 900 

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TGCGTAATAA ATAGACTTAT CAACTGCTGT TGTAATACTT GTTATCGGCA CAATTGCTAA 1020 

ATTACGATCT ACCACGTCCA CATTAAGAAC ATCAATGTCA GAACGTAATT GTTCTGCGAT 1080 

ATCTCGATAA TGkTTGkTAA TTTTCAAGAA GAAAATCACC ACGAACGGCA AGAAAATAAG 1140 

TATCGGCCAT ACTTGGCTAA ATTTAGTTAT GAGTAAAATC cATAAnAACA ATAAATGTCA 1200 

CGATACCACC AAGTAAGTTC ACAGACAACT TACTTAACCA ATTCTTAGGA CGTTCATGAA 1260 


TCCATTTAAT AACCATACCG AATTGTG 1287 
(2) INFORMATION FOR SEQ ID NO: 580: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1223 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 580: 
GCCTCCTTTC CTATATCCAT GGGTGTCCTA GTTGGGAATG GCTTATTCCC TATCCCAAAC 60 


ATCTGTCAAT TCAAGACTAT CACTGTATAC TAAATCGCCT nTCTATACnT CCATnCATTT 120 
CATTTTCCAG TTGGAACAAT TTGTTTAATT ATTTTAAATA TCATTATCAA TCCAGAAGTA 180 
TTTACTATTC ACTTTTACAA TAATCAATCA TTTAACTACA CATGGGTAGT CGGTGGTTTG 240 


CTTGGGGTTA GCTTTTTAAC TGGCAATTTA TTGTTATTGC CAAAATTAGG TGCAACATTA 300 
ACTGTAATTG CAACAGTTGC GGGTCAAATT ATTATGGGTG TCATTATTGA TACATTTGGA 360 

TTATTTGGCG CTACAATTCA TGATTTTAAT TTAATTAAAG CAATTGGAGT ATTGTTACTC 420 
ATTGTCGGCA TCGTCATAAT GAATCAATTT AACAAGAATA ATTTATTACT AACTGATCAA 480 
AAGTATTTAC TGTTTTGGCT TCTATTAGGA TTTATTTTTG GTTTCTTTCC ACCTATTCAA 540 

ACGACAATTA ATAGTGCTTT AGCTAGTCAT ACTCATTCAC CAGCCTTTGC ATCATTAGTA 600 
TCATTTACAA TTGGGTCAAT AGCGCTATTG ATTTTAACCG CTATTTTTAA TCGTTCTTTA 660 
AAACTAAAAA CAAGTCATTT AAAATTCGGT AAATTAAAGC CTATCTATTT TACTGGOGGT 720 


ATACTTGGCA TGGCTTTTGT AACAGCTAAC ATTATCTTAA TGCCTCATAT GGGTGCAGCA 780 
TTAACAACAC TTATTGGGAT GTTTGGCCAG ATTCTAATGG GCATATTGAT AGATCACTTT 840 
GGATTATTTG GTTCACCTAA AATAGCAATG ACATCCAGAA AAACTATTGG TCTATTATGT 900 


ATTTTGACAG GCATTATACT TTTAAGATTA TTTTAAATTA ACTTTTAGCT TATCATTTTA 960 
ACTTGTAATT ATTTTTAAAA GTGATAAGCT ATTTTTTTGT GGTCTAAAAA TCTTTAGAAA 1020 

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CAACTCATTC TTAAGACCTA AATTAATGTT ATnTTTTAAT AATTTACACC AAATTAATAG 1140 

CAAAAATTAT GTTATTCGTG CTAATATTTC ATAGTTQGTT ATTCAATTAA TTAAAAATAA 1200 

GTCAAAATGC ACAACTTTTT ATn 1223 



(2) INFORMATION FOR SEQ ID NO: 581: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 454 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 581: 
CACCTTTTTA TCATGCTTAG TTATCAATAA ATCTATATTT TGCTGTTTTA CAATTTTTTT 60 
AACTTTATCA ATCTCATTAT CTTGGACTAA ATAAATATAT GATCTTGCAT CTGTTGCTAG 120 
AGCTTGTTCG TGTTTTTCTG ATAAAACATA TGTGATGGAA GCGTGAATAA TAATGCCTAA 180 
TGTAACAAAA CTGATAATTA ATATACTGCT TATCAATAAC ATTAAGCGGT GGTGAAACTT 240 
CATCATTGTT CTTTAGGTCT TTCCAATTTA TAGCCTAAGC CACGCACAGT TTTAATAAGT 300 
TGTGGCTTCT TAGGATTATC TTCTAATTTA TCTCTTAAAT GACTOATATG TACATCAACA 360 
ATTCTTGAGT CTCCTGCAAA TTCATAATTC CATACCGTAT TTAACATATG CTCTCTCGTA 420 


ATGACTCTGC CTTGTCTTTC TATCAAATAA AGCA 454 
(2) INFORMATION FOR SEQ ID NO: 582: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 452 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 582: 
TTTATAAGAT TTTATTTACA AATAATTGGT TTTCATATGT ATAAACACTT TTGACTTTCA 60 
AATCTTAACG ATAATTCTAT TACAATACAA TCCCCTATTA GAATGATTTA TGTAATAAAA 120 
AAAGCGGAGT TTCCCCCAGC TTTTCTAAAC GACTACATAA AATATAAGAT TGCAATTAAA 180 
TGCAATAGTG ATGCTATTAC AATAAAAATA TGCCAAATCA TATGAAAATA TGGTCTATTC 240 


TTTTGTGCAT AAAACCATGC ACCAATTGTA TAAGACACAC CACCTAAGAA AATGAATAAT 300 
ATGAATATCC ATGATGTGCG AATAAAAATA ATTGGTAACA AGATAATACC TACCCAGCCC 360 
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AAAATCCCCC AAAGTGTCGT TCCCCATAAT AA 452 
(2) INFORMATION FOR SEQ ID NO: 583: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1472 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 583: 

CGCTTTTTGC AATAAGTCAT TAGCCGCTTT TAAGCCTTCT TCTTTTCGAT CTACAACAAG 60 

TAAAATAAAT GGCTTTAACG CTTCTTCTTT TTCACTTTCA AGCATATCTG GTTTTTGAAC 120 

CATTTCAAAT GGAGATTTCA ATCCATTATT ATCGCTCATT TCAATAATTG CATCATACTG 180 

TGCTTGTGAC ATACTTGCAA TAGCCTGTTT TGCATTTTCT TGAAGGAAAT ATAAGTTTTT 240 

CAATTTAGGA TGCTTATTTA ATGTACTTAA TGTAATCGGT GTAATGTCTT TCTCATAA6A 300 

CACTTCAATC ACTGTACTAT TTGTTCTACC AGGAATTGGT GGTTTTTCAT GAATATGCTT 360 

TGATACTTCT CCAATTCCAA CGACAGATTG ATTTTTCGTT CGATTATAAA AAATAATATT 420 

GTCGCCTTCT TCTAACTGAG TATAAAAATG ATAACCATTA CGTTTAATAC CGTTGTACGT 480 

GTGCGTATAA ATCGTATATT GGTTTCCAGG TTCAAATTCT TCAGTTTCAG CTAAAAAGAA 540 

ATAACGCGGT ATCTTAATTT CGCCTTTACC AAGACCACTT ATTAAATCAA ACTCTTCTGC 600 

AGTGATTTGA TTGAACAATG TCTCTTTCAT ATnACTTATA CGAAATTCCA AAGCTTCACT 660 

ACGCTTTAAA TAATCTGCTG TTAATGGTTT CAATTGTTCA TTAAAACGAA ACTGTACACG 720 

TATTTTATTT TGTGCACCTG TTTCAACACT AATAATTTCA CCACATCCAA GTAGTCCAGT 780 

ATCCGTCTGA ACTTGATAAA AGATGACTTG ATCTCCTACT TTAGCCTTTT TAAACGCTCT 840 

AAATCCTTGA GATGGGTTAA AATGTGCGCC TGATTCAAAT AAAGCTGTTT GTCCTACTAA 900 

CGGTTCATTA TGATTCCAAC GGTTATATCC ACAATTCAAC CAAAAATAAT TCGTTTCTGC 960 

TGTCATCTTA ATACTCCTTA ACCTGAATAA ATTTTAGAAA CACTATGAAT TACATTCTTT 1020 

tAGTGTTTCT TATGCAGTTG GACGCGTATG CGAACAACTG TATACCCTTT GTTCACTGCG 1080 

ATTTTAATCG CATTTCCTAT AACATTGTAG CGCCCAGGAC ATTAATTTAC GTCCCAGACC 1140 

CTTATCGTTT TCACTTCTAA GTAAGTCGAA CTATTrTGCT TTACAACAAG TGCGACTCTA 1200 

AATACAGTTG GACACACATA CGAGCAACTG TATACCTTTT AATCAGTTTT CTATATTTTA 1260 

TTTATTATAT CTGTCTTAAT GATAAAAATT GTTACAAACA GTTTAACATA TTTAGCTACC 1320 

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ATGATACCAC TATGCTTGCn TATCTCTATA GCGCCATTGA TACACATTTT TAAATATCTA 1440 
TACTGCCGTT AGAATTTTAT CATGTCTnAA TT 1472 
(2) INFORMATION FOR SEQ ID NO: 584: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 787 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 584: 

GGTAGTGAAT GGGGTTCAAG ACAACAGTAT TGGAAGTACG AATGAATCAC AGTTTTTAGG 60 

AAATTATATT GTAATTAAGC ACGCA6AAAA TGAGTATAGC TTAATAGCTC ATTTACACCA 120 

ATATTCAATC ATTGTGAATG AGGGGCAAAA TGTTAAATAT GGTGATATCA TTGGGAAGGT 180 

TGGGAATTCT GGCAATTCTA CTGAACCTCA TATACATTTT CAAGTAATGA ATGATAAGAA 240 

TATTGAAGCA TGTACATCTT TAAAAATTCG ATTTATAAAT AATCGAGAAC TTATCAAAGG 300 

GGAT6TGGTC TGCGGATTAC AAGCTGAATG ATGGCGATAC TTATAAAATC TCGACACTAT 360 

AAAAATGGTA TAGTGTCGAG ATTTTCTTGC TTATTTAGTT AATTCAAAGT GCACOCCGGA 420 

TTCATTAGAA GTCGACGTAT TTTTGTTTGT AATAGAGTAA CCGGTCATTG AAATTTTAGA 480 

TTCAATATCT GAAGCGGAAT TTGTAGATTC AGGATTATA6 AAGCTACATT CATAAQTGTT 540 

ATCATCTTTC TTTTTAAGTA TAAACATACC TTTGGCTTTA ACTTCGACTT TAGTGTTGTT 600 

AATGTCAAAA GTTTGAGTGC TATTACTATA ATTAACACCA GCCCAAACCG ATTCATTATC 660 


TTTCACAACG GGGAAGTCAT CTTCTTGCTT AACGACGTnA CTTCATCTTT CTCTGTCTTA 720 

AAGACATCTT TAGATAAGCC TGGATACAAC ACATATCCAT ATTTATTGTC AGAATTAGAA 780 

TGCTTTT 787 
(2) INFORMATION FOR SEQ ID NO: 585: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 830 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 585: 
ATGCTAAATT AGGGGGAATA TTTACGATAA AGAGACCAGA AAAATAATAA AACACATGCT 60 
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10 



TAAAATGAAG ACTATTTTTT ATTACAAGAA AATGTATCTA GTAAACTTAA AGTAGCAAGA 180 

CCTAATAAAT TTAATGCATG TTGTGCACCT TTTTTACCTT GGCCAGCTTC GAAATGTTTG 240 

TAAGCAGCTA CACTTAAAAT GCCTATCGTT GATAGTGATG CAAGGCGAGA AATGTTTTTA 300 

TTGATAAAGC TAGCTGAGTA TAAAGCAGCA GTAGTTGCTT CTGCAATGCC GACGTATTTT 360 

ACAAGTTCTT TTTGCAAGCC AAAAGTATGT TCAAACAGTT CAATCATACC CTTATCTTCT 420 

TGCAATTTAG GTTTACTGGC TTGGTATAGC TCTTTCGCAA GTTTTAAATT CGTTGCGTAA 480 

CGCAAAATCA TATTTAATTC CTCCCAATAT TTGATTTTTT GTGAAAGATG ATTACTTTAT 540 

CATTTTTACC CGTTTCTATA AAAATGAATC AATTATGTAA CGTATGTGTA GTTTAGGAAT 600 

GTTTGCTATG GAAATATAAT TCTGTTCACT CAAAATGTAT GAAATTAATG TGTAGTTTTG 660 

TOGAGTTGCT CTTTTAATTT GGTTAGATTG TTTTTTAQAG AAGCGGTACT ATTTTTAAGT 720 

GCATCAACAG ATTTACCTTC GTTTTGAGAC ATTGAGTTTA TTACAGCACG AAGTTCTGTT 780 

TCTAGTATGT CaGCGTCGCt TTAGCATTAG AACTTAaTAt TTAtAcTCTT 830 
(2) INFORMATION FOR SEQ ID NO: 586: 



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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 412 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 586: 

TTAGGACGTT TTTACAATCA GTACAATGAG CTCATTGTTA TTAGTCCTTT AACGGCGTCT 60 

TTTAATGCTG GCOCTACATT TGGGCGATTT CATCATTTAA TTGATACTGA AACTTTAGCA 120 

AAATTAGAAC ATGAAAAAGG ACATTATTAT CAGAAGATGA TATGTGATQA CAATGTAGAA 180 

ATGATTTCTA TAAATAACAT ACCGAAATAT CCGAGAAATC ATAATGTATT AACTAATCAT 240 

GACTCATACG AATATTCATT GAATTTAGGA AGTAGTAATA GTTAITCAAA GTATGAGCTT 300 

ACCTTAGATG ATATTTATGT TGGTGCTACC TTTtAACAAA TTATATTTAT ATTCTAGCCm 360 

ACtAAATAAA AGGGkaCtaT TTGaATCaAA CmATaTGtAT TAACCTTTTT TA 412 
(2) INFORMATION FOR SEQ ID NO: 587: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4709 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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GTCAAACACT AATAATGGCG AAACGAGTGT 
GGCGCAAAAT CCAGCACAAC AGGAAACGAC 1800 

ACAATCATCA TCAACAAATG CAACTACGGA AGAAACGCCG GTAACTGGTG AAGCTACTAC 1860 

TACGACAACG AATCAAGCTA ATACACCGGC AACAACTCAA TCAAGCAATA CAAATGCGGA 1920 

GGAATTAGTG AATCAAACAA GTAATGAAAC GACTTCTAAT GATACTAATA CAGTATCATC 1980 

TGTAAATTCA CCTCAAAATT CTACAAATGC GGAAAATGTT TCAACAACGC AAGATACTTC 2040 

AACTGAAGCA ACACCTTCAA ACAATGAATC AGCTCCACAG AGTACAGATG CAAGTAATAA 2100 

AGATGTAGTT AATCAAGCXK5 TTAATACAAG TGCGCCTAGA ATGAGAGCAT TTAGTTTAGC 2160 

GGCAGTAGCT GGAGATGCAC CGGTAGCTGG CACAGATATT ACGAATCAGT TGACGAATGT 2220 

QACAGTTGGT ATTGACTCTG GTACGACTGT GTATCCGCAC CAAGCAGGTT ATGTCAAACT 2280 

GAATTATGGT TTTTCAGTGC CTAATTCTGC TGTTAAAGGT GACACATTCA AAATAACTGT 2340 

ACCTAAAGAA TTAAACTTAA ATGGTGTAAC TTCAACTGCT AAAGTGCCAC CAATTATGGC 2400 

TGGAGATCAA GTATTGGCAA ATGGTGTAAT CX5ATAGTGAT GGTAATGTTA TTTATACATT 2460 

TACAGACTAT GTAAATACTA AAGATGATGT AAAAGCAACT TTGACCATGC CCGCTTATAT 2520 

TGACCCTGAA AATGTTAAAA AGACAGGTAA TGTGACATTG GCTACTGGCA TAGGTAGTAC 2580 

AACAGCAAAC AAAACAGTAT TAGTAGATTA TGAAAAATAT GGTAAGTTTT ATAACTTATC 2640 

TATTAAAGGT ACAATTGACC AAATCGATAA AACAAATAAT ACGTATCGTC AGACAATTTA 2700 

TGTCAATCCA AGTGGAGATA ACGTTATTGC GCCGGTTTTA ACAGGTAATT TAAAACCAAA 2760 

TACGGATAGT AATGCATTAA TAGATCAGCA AAATACAAGT ATTAAAGTAT ATAAAGTAGA 2820 

TAATGCAGCT GATTTATCTG AAAGTTACTT TGTGAATCCA GAAAACTTTG AGGATGTCAC 2880 

TAATAGTGTG AATATTACAT TCCCAAATCC AAATCAATAT AAAGTAGAGT TTAATACGCC 2940 

TGATGATCAA ATTACAACAC CGTATATAGT AGTTGITAAT GGTCATATTG ATCCGAATAG 3000 

CAAAGGTGAT TTAGCTTTAC GTTCAACTTT ATATGGGTAT AACTCGAATA TAATTTGGCG 3060 

CTCTATGTCA TGGGACAACG AAGTAGCATT TAATAACGGA TCAGGTTCTG GTGACGGTAT 3120 

CGATAAACCA GTTGTTCCTG AACAACCTGA TGAGCCTGGT GAAATTGAAC CAATTCCAGA 3180 

GGATTCAGAT TCTGACCCAG GTTCAGATTC TGGCAGCGAT TCTAATTCAG ATAGCGGTTC 3240 

AGATTCGGGT AGTGATTCTA CATCAGATAG TGGTTCAGAT TCAGCGAGTG ATTCAGATTC 3300 

AGCAAGTGAT TCAGACTCAG CGAGTGATTC AGATTCAGCA AGCGATTCCG ACTCAGCGAG 3360 

CX3ATTCCGAC TCAGACAATG ACTCGGATTC AGATAGCGAT TCTGACTCAG ACAGTGACTC 3420 

AGATTCCGAC AGTGACTCAG ATTCAGATAG CGATTCTGAC TCAGACAGTG ACTCGGATTC 3480 
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CGATTCTGAC TCCGACAGTG ATTCCGACTC AGACAGCGAT TCAGATTCCG ACAGTGATTC 3600 

CGACTCAGAT AGCGATTCCG ACTCAGATAG CGACTCAGAT TCAGACAGCG ATTCAGATTC 3660 

AGACAGCGAT TCAGATTCAG ATAGCGATTC AGATTCCGAC AGTGACTCAG ATTCCGACAG 3720

TGACTCGGAT TCAGATAGCG ATTCAGATTC CGACAGTGAC TCAGATTCCG ACAGTGACTC 3780 

AGACTCAGAC AGTGATTCGG ATTCAGCGAG TGATTCGGAT TCAGATAGTG ATTCCGACTC 3840 

CGACAGTGAC TCGGATTCAG ATAGCGACTC AGACTCGGAT AGCGACTCGG ATTCAGATAG 3900 

CGATTCGGAC TCAGATAGCG ATTCAGAATC AGACAGCGAT TCAGATTCAG ACAGCGACTC 3960 

AGACAGTGAC TCAGATTCAG ATAGTGACTC GGATTCAGCG AGTGATTCAG ACTCAGGTAG 4020

TGACTCCGAT TCATCAAGTG ATTCCGACTC AGAAAGTGAT TCAAATAGCG ATTCCGAGTC 4080 

AGTTTCTAAC AATAATGTAG TTCCGCCTAA TTCACCTAAA AATGGTACTA ATGCTTCTAA 4140 

TAAAAATGAG GCTAAAGATA GTAAAGAACC ATTACCAGAT ACAGGTTCTG AAGATGAAGC 4200

AAATACGTCA CTAATTTGGG GATTATTAGC ATCAATAGGT TCATTACTAC TTTTCAGAAG 4260 

AAAAAAAGAA AATAAAGATA AGAAATAAGT AATAATGATA TTAAATTAAT CATATGATTC 4320 

ATGAAgnAAc rCCTTAAAAG GTGGCTTTTT TACTTGGATT TTCCAAATAT ATTGTTTGAA 4380 

TATAATTAAT AATTAATTCA TCAACAGTTA ATTATTTTAA AAAGGTAGAT GTTATATAAT 4440 

TTGGCTTGGC GAAAAAATAG GGTGTAAGGT AGGTTGTTAA TTAGGGAAAA TTAAGGAGAA 4500 

AATACAGTTG AAAAATAAAT TGCTAGTTTT ATCATTGGGA GCATTATGTG TATCACAAAT 4560 

TTGGGAAAGT AATCGTGCGA GTGCAGTGGT TTCTGGGGAG AAGAATCCAT ATGTATCTAG 4620 

TCGTTGAAAC TGACTAATAA TAAAAATAAA TCTAGAACAG TAGAAGAGTA TAAGAAAAGA 4680 

TTGGATGATT TCAATATGGT CCnTTCCCA 4709 
(2) INFORMATION FOR SEQ ID NO: 588: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1554 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 588: 
CTTTTTTAAt TAwCgGaAtA TTGtCaTgaT tAcAcTTCGt TAGGGTTTAc gTCGtAATTT 60 

CATTTaaTAA gCgCTTCAcC ATTAAAtGTG gTAmCCTTTA ATTCGCCAGT AGAAACATCA 120 
CAGTAACTAA GCGCAATTTC AGGTTGATTC ATAACAAAAC TTAAAATATA GTTATTTTGT 180 

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CGTCTAACCA TACCTTTCGT TTGTTTCGGA TCTTCCATCT GTTCACAAAT AGCTACTTTA 300 

TATCCATTAT TAACAAGTGT ATCTATATAA CTATCTGCAG AATGATACGG AACACCACAC 360 

ATCGGAATTG GATTTTCTTT TTTAGCATCT CTTTTAGTTA AAGTAATTTC AAGTAyACGT 420

GATGCCTCCT TGGCATCTTC ATAAAACATT TCATAGAAAT CACCTAGTCT AAAAAATAAT 480 

AAGCAATCTT GGTATTCTGA TTTTATTTTT AAATATTGCT GCATCATTGG TGTAACATTA 540 

GACATATTAT TTCTTCACAA CCCTTGTCTC TTTTTAAAAT TTGTCTTTAC AATATATTCG 600 

TTTGTAAGyT TTTTAATTAT TAATTATTTA ACTTATACAT TTTAACATAC TTACTTTTAC 660 

AAACCTATTC ATACCATATA ATCACGAAGC ATCTTAAATG TATAAGAAAA CGCCTCAAAC 720 

CTAATAAAAT GTGTCAATAG CATGTTTAGA ATTAAATTAA AATTCTAACA TTCAAGACAT 780 

TTAATTAAGT AAGGGCGTTC AATATTAAAA TGAACAATGA CTCTGTTTGA AATCATATAT 840 

CATAAAATTA TTTTATAAAC CTTTGAAGAA TACCACGTTT TTTTAGAGTA ATTAATAAGA 900

AATAACTTAT AATAGATCCG ATAGCACTTG aGACTATGaA CGTAATCATT AACGGTTTAA 960 

TGAAGAAGTC TTGAAGCCCA AGGaAATATG CTAATGGtAT aCaAATTAAA cTTCCgATGA 1020 

CaCCAGTTCC aAGTACTTCa CCGACCGCGG CCaTAAATAT ATGTTTACGA TATnygTAAA 1080 

ACATACTAGC CAATAAAACT CCAATCATAC TACCCGQAAA TGCAAAAGst GTACCAGTAC 1140 

CAAAAAGAAC TCTTAAAATT GATGATATAA GCX5CTTGAGC TAATCCATAC CAAGGACCTA 1200 

CTATGACCGC ACTTAATACA TTTACAAAAT GCTGTACTGG TGCTGCCTTA ACTGGTCCTA 1260 

GAGGAATGAT GATAATACTG CTTAATACAA CATTTATTGC AATTAAAAGT GCAGTTATAG 1320 

CCAGTTTTCT TGATTTCATA TGATTGTTCT CCTTTTTGTT TGTAATTAAT CACTATGCTT 1380 

GGCTTTATTA TGGTCATTTA AACGTGTTTC CATTGTTGAT ACAAACATTT TCAATAATTG 1440 

ATTCGCTTCA TATTGTGAAG TTTGAAACTG TTCAACTATG GGCAATGTAT TTATTTCTGC 1500 

TTCTAtACTC TGAATGGtAT GTTCCGACTG nTCCAGCGCA TTTGTTTCCC GnAA 1554

(2) INFORMATION FOR SEQ ID NO: 589: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 638 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 589:
TTTTGACGAA AAAAGTTGAT TTACAAATAT ATAAACGTTG TGATTTCAAT GTTTGTATAG 60 

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AACTAAAGTT ATTGAAGCGG CAACGTTAGA TTTACTAACG GCATTTTTAG ATGAACAGTA 960 

TCAGAAACAA GAACAAGATT ATAGTCAAAA TTATGCATAT GTACX3CAAAG TAGGACATAC 1020 

CGTTTTCAAA TGGGTTGCTA TCGGTATGAC AACGTTAAGT GTTTTATTAA TTGCATTCTT 1080 

AGCCTTTTTA TATTTTTCAG TAATGAAGCA TAATGAGCGC ATTGAAAAAG GATACCAAGC 1140 

ATTTGTAAAG GATGtTATAC GCAAGTACTA AATACGTATG ATGATTTAGA TGGTaAAAAt 1200 

TgaTAAAGAG GCACTTTACA TTTATGCCAA AAGTTATATC CA 1242 
(2) INFORMATION FOR SEQ ID NO: 591: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 744 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 591: 

TTCCAGATAG AGCCTTAGTT GCTGCCGCTG AATTGTCTGA TCGTTACATC ACTGATCGTT 60 

TTTTTACCAG ATAAAGC6AT TGATTTAGTT GACCAAGCAT GTGCAACAAT TCGTACGGAA 120 

ATGGGATCAA ATCCAACTGA ATTGGATCAA GTTAATAGAC GTGTCATGCA ATTAGAAATT 180 

GAAGAAAGCG CACTTAAAAA TGAATCTGAC AATGCGAGCA AACAGAGATT ACAAGAACTA 240 

CAAGAAGAGC TTGCCAATGA AAAAGAGAAA CAAOCAGCAC TTCAATCTCG TGTAGAATCA 300 

GAAAAAGAAA AAATAGCAAA TTTACAAGAA AAACGTGCGC AACTAGATGA AAGTAGACAA 360 

GCGTTGGAAG ATGCACAAAC AAATAACAAT TTAGAAAAAG CTGCTGAACT ACAATATGGA 420 

ACAATTCCTC AATTGGAAAA AGAACTTAGA GAATTAGAGG ATAATTTCCA AGATGAGCAA 480

GGTGAAGATA CAGATCGAAT GATTCGTGAA GTTGTAACAG ACGAAGAAAT TGGCGATATT 540 

GTCAGCCAAT GGACAGGCAT ACCAOTTTCA AAATTAGTTG AAACAGAACG TGAAAAATTA 600

CTTCACTTAA GTGACATCTT GCATAAACGT GTTGTAGGTC AAGATAAAGC GGTTGACCTG 660 

GTTTCAGATG CAGTAGTTAG AGCAAGAGCA GGTATTAAAG TnCAAACAGA CCTATTGGTA 720 

GTTTCTnATT CCTAGGTCCn ACTG 744 
(2) INFORMATION FOR SEQ ID NO: 592:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1449 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 593: 

ACCAAAAACA CTTTCTAAAG AAATTGTTTC AGAACCATTT GCTAAAAACC AATTACGTGA 60

AAAAGCTCGT CTAACTAACA TTAAAGGTTT AGAGATTCCA AGAGTTTCAT ACACTTTAGA 120 

CGATGATGAT TTCATTACAG ACGTAGAAAC AGCAAAAGAA TTAAAAGCAA AAGGTGATAC 180 

AGTCAAGTTC ACTACTAATA AATTCAAAGT ATTTGCTGCA ATTTCAGATA CTGTAATTCA 240 

TGGATCAGAT GTAGATTTAG TAAACTGGGT TGAAAACGCA CTACAATCAG GATTAGCAGC 300 

TAAAGAGCGT AAAGATGCCT TAGCAGTAAG TCCTAAATCT GGATTAGAAC ACATGTCATT 360 

TTATAATGGA TCTGTTAAAG AAGTTGArGG AGCAGACATG TATGATGCTA TTATTAACGC 420 

TTTAGCAGAT TTACATGAAG ATTATCGTGA TAACGCAACA ATTTATATGC GATATGCAGA 480 

TTATGTCAAA ATTATTAGTG TTCTTTCAAA TGGAACAACA AATTTCTTTG ACACACCAGC 540 

AGAAAAAGTA TTTGGCAAAC CAGTAGTATT TACAGATGCA GCAGTTAAAC CTATTGTGGG 600 

AGATTTCAAT TATTTTGGAA TTAACTATGA TGGAACAACT TATGACACTG ATAAAGATGT 660 

TAAAAAAGGC GAATATTTGT TTGTATTAAC AGCATGGTAT GATCAGCAAC GTACATTAGA 720

CAGTGCATTC AGAATTGCAA AAGCAAAAGA AAATACAGGt CCATTACCCA GCTAAGCCCC 780 

AAAAGGTTAA TGTAACAGCT AAGGCTAAAT CAGCTGTAAT ATCAGCCGAA TAGGGGTGAT 840 

GAAATGAGTT TAGAAGAAAk TAAATTGTGG TTGAGAATTG ACTATAATTT CGAAAATGAT 900

TTAATTGAAG GTCTCATTCA ATCGGCTAAG TCTGAATTAC TATTAAGTGG GGTnCCAGAT 960 

TATGACAAAG ATGACTTGGA ATACCCGCTT TTTTGTACAG CGATTAAATA TATCATTGCA 1020 

AGAGATTATG AAAGTCGTGG ATACTCAAAT GACCAATCTA GAAGCAAGGT GTTTAATQAA 1080 

AAAGGATTGC AAAAAATGAn TTTGAAATTA AAAAAGTGGT AGGTGATTTT TAAATGGAAT 1140 

TT/^TGAATT TAAAGATCGC GCGTATnTTT 1170 

(2) INFORMATION FOR SEQ ID NO: 594: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1393 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 594: 
TGAAGGGTGT TTTGTTTATG GCCAATTGCT GTGTTATTAA AnCGAATGTT TCGAATGGAA 60

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

ATCATTCOGG AAGAGGACAT GTCTTCAGCT TCTGATTTGA AAGCACAGTT TTATCX3CGGT 180

ACTTTGCAGC GCTTTTACCA AtOtlTTGCAC GCAGAAAAGC TTACACCTTA TGTTATGTCT 240

TATGACGATA TCATTTCATT TTGTAAAGAA AACAATATCT CTGAAGTAGT GACTGCGGGT 300

GATATTATGA GTTATCATCT TGAAGAATAT GATATTTTAC ATCAACJGTTC TTTATTCAAT 360

GAAGCACGCA TTGCCGTTAC TTTGATACGT GGGAATCATT ACTTTAAAGC GAGTAAAACA 420

ATGAATCAAC AAGGGGAGCC ATACAATGTT TTTACTAGTT TCTATAAAAA ATGGCGACCT 480

TACTTGAGGC ATAGAGACGT ATATCACTAT GATTTAAAAT CATTCGAAAA CTTTGTCATT 540

GCATCACCTG ATGATTTAGT GTTTGATGAC ATAGCATTTG GATCCTCACA aATAATTGAA 600

CAGatAAATG GCAACATTTT TTAGATCAAG ATATACAGAA TTACGAAAGC GGAAGAGACT 660

ATTTACCTGA AGTATTAACA AGTCAGCTAA GTGTTGCTTT AGCATATGGA TTATTAGATA 720

TTATTGAAAT TTTTAATGAT TTATTGGCGC GTTATGATGA AGATGAGGCA AACTATGAAG 780

CATTTATAOG TGAACTCATT TTTAGAGAAT TTTATTATGT GTTAATGACA CAGTATCCTG 840

AAACCTCATA CCAAGCTTTC AAACCTAAAT ATCGACAGAT AAAATGGTOG CAAAATGAAG 900

CGGATTTTAA TGCATGGTGC GAAGGGCAAA CAGGATTTCC AATCATTGAT GCAGCAATAA 960

TGGAATTGAC ACAAACTGGT TTTATGCATA ATCGAATGAG AATGGTTGTG TCGCAATTTT 1020

7AACCAAAGA TTTATTTATA GATTGGACAT QGGGAGAAAA ATTCTTTAGA AAGCACCTTA 1080

TTGACTATGA TGCAGCATCA AATATTCATG GATGGCAATG GTCTGCTTCT ACAGGTACGG 1140

ATGCAGTGCC GTATTTTAGA ATGTTTAATC CAATAAGACA GAGTGAACGC TTTGATGCTA 1200

AAGCTTTGTA TATCAAAACA TATCTTCCGA TTTTTAATCA AATTGATGCA AAATATTTGC 1260

ATGATACACA ACGCAATGAG TCCAACCTTT TTGAACAGGG GATTGAATTA GGTAGTCATT 1320

ATCCAAGACA AATGGTAGAT CATCAAGAAA AACGTACACA AGTTTTAGCT ACATTTAAAG 1380

CGCTAGACTA ATT 1393



(2) INFORMATION FOR SEQ ID NO: 595: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1484 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 595: 
TTAACTTGAA AGATTTCCCC GACATCTATA ACTTGTATAA TGTGTATATG TCGTTTTTAG 60 

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AAAAATTAAA AAAAGAATAT CTTGATTGGT ATAATCAGAC CTTAGAGTTT TCTAATTTAT 180

CAAACAATGT AGTAAGvATA GATACTCCTT TTAAAGATAA TTCTTTAGAT AATTTAATAA 240

TTTACGCTTT ATACGATCAG TCCAGAGACA TGATTACACT GACAGATGAC GGCTATACTA 300

TATtTGATTT AGAAAATAAT GGTArrrcrr TAAATAAATC AAAAAAACGT AAAAAGATTT 360

TTGAAGAGCA CCTTTCAGCT TACGGTATTA AATATAACXSA TAAAACTCAC GAAATTTTTG 420

TTCAAACTAA CTTTAAAAAT TTTAATAAAT CGAAACATAA TTTATTACAG TGCCTTATAT 480

TTGTTAATGA TATGTACTTA CTTTcTAATC CTAAGTCACA GAACATATTT ACAGAAGATG 540

TTGCAAACAA ATTGGATGAA CATAACATTT ATTACGGAAG AGATTTACCT ATTATAGGAA 600

GCAGTGGTGT TGTTCATAAT TTCGACTTTT TTATTAGCGC TAAGAAAAAT CAAAAAGAAA 660

AATTTATCAA TGCTATTTCT AACCCTAATA ATTCTATGAT TATTAAGTCG AAAATAACGG 720

ATGCTATGCA AGCAAAAAAA ATAAAAAGAC ACAGGCAAAA TGAGTTTATT TTTATTTTAA 780

ATGACTCAAA AAAAGAAATA AATGAACATA ATAAAAATCT TCTTCATGAA AACTATATTA 840

GTACAATAGA TTATAGCGAA TTAGATGAAA AGATAGGTTT ATTGATTTAA TATATATAGA 900

CGTGATAATA TCAATGTTTA TATTAAATGA AACGAATTTG AAAATTTCGA AACAGCTTAA 960

GATAGCAAAT TGAATAGCCT TATTGATAAT GCAGAATCGT CTACACTTAG TTGAACAAAT 1020

TCTATGAGAA TAGATATTGT TAAACTATTT GGGTAGGCGA TTTTATTTTG ACAAGAGTCA 1080

GAAGATTATT TAGTTCATAG TTTAAGTTAC AAATGGCTAA ACTTTATGAA AATGGTATGA 1140

CTAGAATCGA AATTATACTC GAATACGATT TAACACTCTC AATCTTCTCG AACTGAATAA 1200

ATACTGGGTC ATTCAATCAT CAAGACAACT TAATAAGTGA TGATAAATGT TCATACCATT 1260

TTGCAAATAT AACCAGATAC TCTCAACAAA ATGTAGAGTT CAAAATTAAA AATGTCCCTA 1320

ATTGGCACAT TGCTGTATGA TAATAAATTC AACTTAGATT TCAACAATAT AACTAATGAT 1380

GCTTATATTT ACCGCGATAT CAAACCAAAC AGCCCATCCA CAAACAAATT AGAATATATC 1440

irtTGTTAATAA TAGTAAATGA TATATCTTCC AAAGACCGCA AACT 1484



(2) INFORMATION FOR SEQ ID NO: 596: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1435 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 596:

CTTTATCaTT AAATGCCaTT TTGaATCCTT CTTGAATTTG TTTGATTTCC TTTTCAGTAT 120

TATTAACCTT GTTCAAGACA GTGCCATTAA TTTTCTTCTT GAGTGGACCA ACCTCATCTT 180

TATTAGTTGA TTCATGAATC GTTAATCCAA TGTGTGGCTG TATTCTATAA CCATCATTCG 240

CTATAGTTGA AACATATTGT GaTAATTGTA ATGGTGTATA GGTATCATAT TGACCAATTG 300

ATAAATCTAG ATAATTACCT GGATTATTTG TTAATGGTTC GATTTGACCT CTTGTTTCAT 360

TTGGTAAATC TATCCCTGTT TTCACACCTA AGCCTACTTG ATTTAATCCT CTTCTTAGCT 420

TTTGGGCAGG TGAACTTATG TCTGAAGGTA AAGCCATACC AGAATAATAA GGGTCTCCCG 480

CTAATTTTAA TGCTGTTTTA AACATATATA CGTTTGATGA ATGCATCAAA GCTTGCTTAT 540

CATTAATAGT TACATGCCCG TTTTTATTGA AGTATGATCG TTTTGTCAAA CCACCTTGGA 600

AATGTAATGG TTCATCGACC ATTGTTTCTC CAACTTTGAT AGCTTTATTC TGATAACCXK3 660

CTAATAATGT TCCACCTTTT ACAGAAGATC CAACOGCAAA TTGAGAAGTA AACGTACCAA 720

TGTCATAATC AGTCATTTTA CCACTCTTAT TAATCTGCTT TCCGGCAAGC GCAAGAATGT 780

CTCCATTTTT AGGATTTTGT ACAACCATCA TTGcATTATC CATATCTTTG GCACCTTGAC 840

TGCGAsTTCT TAATTTGTTT ATCTAATAAT GCTTCTACTT CTTTTTGAAG ATCTATATCG 900

ATCGTTAATT TCAAATCTTG ACCGCGA6CG CCAGGrTTTA ACACTTCTGA AGATGTAACT 960

TTACCAGATT TGTCCGTTGT GTATTTCATT TCTTTCTTCT TACCACGCAA TACATCTTCA 1020

TATTGATATT CTAGGTAAGA TTTTCCAACA CGATCATTGC GTGAATATCC TTTGGATAAG 1080

TAATGTTCTG TCAATTCTTT TGGaATACCT TCAGCAGGTG TCGATACATC TCCGAATATA 1140

CCTCTTAAAG TATCGCCATA TGGATATTTT CTATCCCAAT CCATAGACGT GTTAACACCT 1200

GGTAATTTGG AAAGTTGCTG AGAAACTGCT GCATACTCTT TTTCACTGAC ATCTTCATTT 1260

TTTATCATTT GTGGATCTAA AACTGTTCCT GGCATTCATC TCTCGAAAAA TAGCTAAAAC 1320

TTGGTAAATC TTTAGAAGAC AATTCATCTA ATTGTGGATT TTCCGGATTT CGGATAACAG 1380

TTGGTnTATC CATAATGGAT CCTGGTTTAA TACCTTCCAn CnGGCXSAACA TAGCC 1435



(2) INFORMATION FOR SEQ ID NO: 597: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 516 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 597:
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AGAAAAAGAT TTAGAAAAGA TTACCATTCG CGACATAACA ACACGCGCTG ATATCAATAG 120 

AGGGACATTT TACTTACATT ACGAGGATAA ATATATGTTA CTCGCAGATA TGGAAGATGA 180 

GTATATTTCA GAACTAACGA CATATACTCA ATTTGATTTG TTACGCGGTT CTTCArTTGA 240 

AGACATTGCG AATACTTTTG TAAATAATAT ACTCAAAAAT ATTTTCCAAC ATATTCATGA 300 

TAATTTAGAG TTTTATCATA CTATTTTACA ATTGGAACGC ACGAGTCAAT TAGAATTGAA 360 

AATCAACGAA CATATTAAAA ATAATATGCA ACGTTACATT AGTATCAATC ATTCTATCGG 420 

AGGCGTTCCA GAGaTGTATT TTTACAGTTA CGOTTCCGGA GCAACAATTT CaATTAlUlTA 480 

AATACTGGGG TAATGGACAA ACAGCCCATT TTCAGT 516

(2) INFORMATION FOR SEQ ID NO: 598: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1955 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 598:

AGTACTTTAC CATGAGGTAA GACTTTATTC CCTGCTTTTA TAGATTGTGC TTTTATTTGT 60

GTGCCAGAAC CAATAGtAct GGTcTAATGA TTTTGCACTC ACATTATCAA TAGCTTTTTG 120

TTTGTCTTGA CCTTCAACAT CTGGCACTTT ACTATACTCT GCATTAGATG TGTCATCTTT 180

TGATTTACCT ACATTTAAAT ATTTCAAAGT ATTTTCCATT ATTGGTTTAA ACGCTTTACT 240

AACACCTAAT TCATAAGCTT CTTGGTCATT TTTTTGTGCC AAGCTCATAC CAGOGTATAC 300

AATAACTTTA GGATTTTTCT TCGGCGCGTC ACCCATAAAA CTTACAAAAT ATGGGTTTGG 360

ACCTTTAACG TATCCACCAC CATTAGGTGC AGCGACTTGT GCTGTACCAG TCTTACCTTC 420

GACCTCATAA CCATCAATAC GATAGTTTGC AGCGTGACTC TTCTTACTAT TCACAACTAA 480

ATCCAATTGC TTTTCAACTT TTTCAGCAGT ATCmTGTr ATTGGTTTGC CTGCGATTTG 540

TTTTTGCCCT TTATAAAATT GTCTTTTACT AACAGGATTT TCAACGCTAT TCACAAACCA 600

TGGTTTTAAC ATATTACCAT CATTAAAGAA CGCTGATTGC GCTTGTAACA TTTGAACAGG 660

TGTTACTGTT GTCGATTGAC CAAATGATGA CGTTTTTTGt TGCAACTCAT TACTCCATCC 720

AATTTGaCCA GGTGCTTCTC CATCAAACAT ACCTTTA6TT GaTTTTCCaA ATCCaAATCG 780

TTCATACCAA GATTTCATTT TGTCTGCACC AACTAAATCT TGTAAATGCA TCATCAATGT 840

ATTAGATGAA TAAGTAAATC CGAGTGACAT TGGGATTTCA CCCCAACCGA CTCTATTCCA 900
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AAAAGCACCT TCTTGAATAG CAGCTGCTAA CCCATATGAT TTAAATGTTG ATCCAGGCTC 1020 

GTATGTGTTT TGATAAAGGT CATTTGCCCA CTTTTTACCA AAGTCTTTAC CAGTTTCAGG 1080 

ATTAAATGTT GGTCGCTGAC TGTATGcTAA AATTTCTCCA GTTTTGGCAT CCATGACAAC 1140 

CGCAAATAAA TCTTTCGGCT GGTATCTTTC AACCATGCCA TCTAAAGCTT CTTCAACAAA 1200 

TACTTGAATA TTTGAATCGA TTGTTAAATG GACATCATCA CCACGTTTAG GCTGCTTCTC 1260 

TTTTTTAGTA TTTGGTGCXSA TATATCCCCA AATATCATGA ATATATCTCA ATGATCCTTT 1320 

AGATCCACTT AAATAACTAT CAAAAATCTT TTCAACTCCA AGTGCACCTT TAAGTTCACC 1380 

A6TATCCX3GA TTTTTCTGAG CTCTACCAAT TAAGTGTGAT GCAAAATTGC CATTTGGATA 1440

AAAGCGTTCT GTTTCAGGCA ATAAAGAAAT ACCAGGCAAA TTCATTTTCT CTATTTTCAA 1500 

TTTGTCCTGA TACGTTAAAT TTGTTCCTTT GCX5TCCAAAT TCAATTTGGA AAGCTTTCTT 1560 

TTGACTAAGT CTCTTTTCAA TTTCCTCTGG CTTCATATTA ATGACTGTAG ATAATTTCTT 1620

TGCAGTCTCT TTTTTATCAA CTACATGCCT AGGTTTTTTA GAATTGGCAC TC3GCCTTTTT 1680 

ATCTATTACT GCAACAAGTT TATATCTTTC TACATCTTCT GCTAGCACTT TACCATlcACG 1740 

ATCATATATC TTTCCTOGTT CTGGTTGTTG TGCATTCTTA ACTAAATACT TTTCATTTGC 1800 

CTTCATGACT AAATCTTGAC CATTAGAATG TCCAGTAATC ATGATATATG AAATTCTTAA 1860 

AACCAATATA AAAAAGAGCA GTCCGAATAA ACCAACAAGT AGGACTGCCC CTATTTTATT 1920 

TTTTTTAATT TTAATTTTTT TGGTCGCCAT TACTA 1955 

(2) INFORMATION FOR SEQ ID NO: 599: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 704 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 599:
TAGCAAGAGT AGGTGATAAC AGTTCTTTTA ACGTGTTTGC TGCTTGCTCA TTTCTTGGCT 60 
CATTGTAGAC TAAAATATAA TAAGCGTTTT CAAATAAGTG CTTTTTAGCA TTTAGTACAC 120
CAGATTTATG ACTACCAGCC ATTGGATGAC CACTGACTAA ATGAATATTA TGCTTTAATA 180 
AATTGCATTC GTGTTGCTGT ATCATTGCTT TAGTACTACC AGTATCAGAA ACAATAACAC 240 

CAGGTTTAGT TGGCATATCT ATAAGCTCGC TAAGATATTT ATTTGTGATA GCAACAGGTG 300 
TTGCATAAAT AATTACATCG GCTTTTTTAA TAGCTTCACT ATAATTTAAA CATTTTTCAT 360 
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TATTAGTATT AGGGTTATGG TATTTTATAT TGCTAGCAAG ACTTCCACCA ATTAATCCAA 480 
GCCCAACAAA TAAAACTGTT GTCATATAAA TCACCTTATT TCGAAATTTT CAGAATAATA 540 

ACATTGTAAA TGAGCTGTTG ACACAGTGCA ATAGTAAATA AAAATCGATA ATAGCATTAA 600 
TAGATAAACG GAGATAAATC ATCTACAATA AAGAGTATAG TAACACAATG GCAACGGAGG 660 
GGTAAATCAA TGGAACCAAT ATTAGAAATG ATtAAAACAT TAAC 704 

(2) INFORMATION FOR SEQ ID NO: 600: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1158 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 600:

CCTCTAAAGG ATCACGAACC TTTTTCATCG TGACTAAGCC TATAATACCC TTAAAcCTAT 60

TATTATTAAC TTTTACCTCT GTGTATTCCC TATCAATTAG GCGACGCCAG TGACGTTTAT 120

TACTTTCACA GTCACCAACT CCTTGTCATT ATTATATAAA ACTACATAAA 180

TGATGTCATG TCATGATACA GTTTTATAGT TTTTGTTAAT CATATGAAAA TATTTATTAT 240

TATTTTACTA CAACTCGCTT CAATTTACTT AAAATAGACA ATATTAATTA GATAGTACAC 300

ACATTTCTTC ATAAAAGTGA TTTTTCAAAA ATATAAATAA CACACTCTTA TCGTTTTCAA 360

AATCATTTAA TGCTATTTTC ATTAAAAACA GCTGAAGCAT CAAATCTATT CTGATTCAAT 420

CAASAATACA TATAAAAATG AAGTGACTCA AAGGTTTATT AACAACCTTC AAACCACTTC 480

ATTGATCACT TTTATTTTAA AGCATATTTT TCGATTACTG ATTTAAGATG CGGATATTGC 540

GTAATTAATT CAGATTGCTT AAACAATTCA AACTGCTTAA ACTCAAATGC CGGTTGACAC 600

ATACAACCTA CTAAACTAAA AGTATTTGAT ATTTCGATTG AAGAAGCAAA AATTGTTCCT 660

TTAGGCACTA CATATTGCAA TACATCTCCA TTTTGGATAT CAGTACCCAA TGTTGCAGTC 720

GTATATTCCC CATCCGGATT TATCATATGA ATTGTTAGAG AATCGCCAGC ATGATAGTAC 780

CATACTTCAT CAGCATCAAT TCGATGAAAA TGCGAAATAT TGTCATCTGT AAGTAAAAAA 840

TAAATACTAC TAAACGGCGC TCTGCGTCCA TCTTTCAATA CTTCTCGAAT TGTCTCTCTA 900

TAGAAACCAC CTTCAGGATG TGATTCAAGT TGCAATTcAT CaATCCATTG TTCTGCTGAT 960

TTCATTATTT CAAATCCACA TTATGGAATA CGTTTTGTAC ATCTTCTAAA TCTTCTAATG 1020

CATCGATTAA TTTTTCAAAT GTTACTTGGT CCGCTTCAGA AAGTTCAATA TCTGTTTGAG 1080
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GAACGACTGC AAATTGAT 1158 
(2) INFORMATION FOR SEQ ID NO: 601: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 827 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 601:

TGATACTTAT TTTTCAATAT TATTTGGAAT AAAnTCTTnT AATTGTGGTG TGAAATAAAC 60

ACCGAAATTC TTTTCAGATT CATCTTTTcC ACTACCTGCA TATAAAATAC CAATCAGTyC 120

ATGTTTAGAA TTCAATACAG GAGATCCTGA ATTACCTGGT TGTGCATACG CATCAAATTC 180

CATAAAOGTT CCACTGATAT GGTTAATCGT TCCTGTCGAT TCAAACATTT TATATTTTGT 240

TTGTGCACCC TTTGGATAAC CAATAACAGA AATTCTATCT TTCACTTTTG CTCCGTCTGC 300

AAATTTTGTA TAACTAACGT TCTTATTAAA ATTCAAACCT TCTGTACTTG TTTCATGAAC 360

ATGAACTATC GCAAGGTCTT CTTTTCCGGG ATATTCTACA AT6TCTTTAA CGTCGTAGTT 420

TCCTCCGCCT TTACCTTTAC TCGAATGATG TGCT6ATACT CTATTTTTAA AAATATCATT 480

ACTTTTAGCG ATATGTTTGT TAGTTACGAT TGTATTTTTA CCAACAACTA CACCAGTACC 540

ACCCACAAAT GCTACCACTG aATTGTATGG TTCCTTAGTT GCATCGGTAA TTTCTTTGAC 600

ATTCTTTTCT GCTTTGGCAA TTGAATGArG CTGATTAGAA ATATTTTCAG CAAAACCAAG 660

AGATGTTAAA ATAGTTAAAG CAGTTAAACC TTTAACCATT ACATTTTTAT TCATATTTTA 720

CCTCCTTCCA AATTTATTTT nATAGTTAAT GTATCTAGAT ACACATTTAC TATATACATT 780

TTAAATAAAA ATATTnATTA AATATTAATT AACCACTTAT IlAACGTT 827



(2) INFORMATION FOR SEQ ID NO: 602:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 796 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 602: 
nGGnACGTAA AGTATTTATG TTAAATGGCA nAAATATATG AATGATGAGT TACCTGGATT 60

ACCAATGTTC CAAGGTAAAT CGATAACTAT TGTTAACGAT AAAGTACGAA ACTTAGACAT 120 
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CGACGATTTT GTCGAGGTTT ACTTATATGG GGGGATGTTA TGAATAATGT ATTGTTAGAG 240 

GTTAAAGATT TAGAAACATC ATTAAAAATA AATAATGAAT GGTTAGCAAC TGTTGAAAAT 300 

ATTTCTTTTG AATTATCTAA AGGAGAAGIT TTGGGTATAG TAGGGGAATC TGGTTGOSGT 360 

AAGTCCATAT TAAGTAAGTC AATTATTAAA TTATTACCAG AAAAGATATC TAAACTAAGT 420 

AATGGAGAAG TTATATTTGA TGGTAAACGA ATCGATACGC TCAATGAGAA GCAATTGCTA 480 

GATATTOGAG GAAATGATAT TGCTATGATT TTTCAAGAAC CTATGACTGC TTTAAATCCT 540 

GTATTTACCA TAAAAAATCA ACTTGTGGAA TCTATAAAAT CACATAAAAA AATTTCTAAA 600 

AAAGAAGCAA ATAAATTAGC AAAAGATTTA CTAAAAAAAG TTGGAATTGC TAGACAAGAT 660 

GAAATATTAA ATAGCTATCC TCATCAATTA TCTGGTGGTA TGAGACAAAG AGTAATGATT 720 

GCAATGGCCA TTTCATGTTC TCcTAAATTA TTAATTGCTG aTGAACCTAC AACAGCATTG 780 

GATGTCACGA TTCAAG 796 
(2) INFORMATION FOR SEQ ID NO: 603: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1618 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 603:
GGATGTGTTC ATCCCAAATT GaAAcAgAaA TTTATGATCA AAtGATGTGG ATTGGTTTTA 60 

AaCCATTCAa AATTaCCAAT ATTAmaCAAG AATCTGAAGA CATTAAATCA TTTtACAGTT 120 
GAAACTGAAG AATATGACTT TAGTGAATTT ACACCAGGCC AATACATCAC AGTTGATGTT 180 
TCTAGTGATA AACTTCCATA TAGAGCTAAA CGTCACTATT CTATCGTATC AGGTGAAAAA 240
AACCATTTAA CTTTTGGCGT TAAACGTGAT GTCACAACAG AACATGAAGG CGAAGTTTCA 300 
ACAATTTTAC ACGATGAGAT TAAAGAAGGC GATATGATTA ATTTAsTGCG CCTGTAGGTG 360 
GATTCGTATT AGAGAATACG ACTGAACCAC AACTTTTCTT AGGTTCAGGT ATTQGTGTTA 420 
CACCTTTAGT AGCTATGTAT GAAGCTGCCT CTGCCAAAGG TTTAGATACA CAGATGGTTC 480 
AAGTTGCTGA AAATGAACAA CATTTACCTT TCAAAGACAA CTTCAACAGT ATCGCAAGCC 540 

ATCATGACAA CGCTAAATTA TATACACACT TAnAgATAAA CAAGGCTATA TTOGTGCTGA 600 
AGAATTACAA GTATTTTTAG CAAATAAACC TGAAATTTAT ATCTGTGGTG GTACAAAATT 660 
CTTACAaTCT ATGATCGAAG CACTTAAATC TTTAAATTAC GATATGGATC GCGTACACTA 720
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

ATATGAAACT TAATCAACGT TACGTAAAAG TATTTGCATT ATATTTCX5TA AGTATTGTTA 840

CTGCAAATAT TATTGTTAAA AATAATAATT TAATTAAAAC AtTGATACAA ACCATAGCCG 900

GGTACACGGT CTTTGCAGTT GGTTTGAAGT ATTTAACTAA ACGTAAAAAT AAATGACATC 960

TATACCAAAA CAGCTATGAC TTTACTTTGT CATGGCTGTT TTTTAATTGG GAGTAGGACA 1020

GAGATGATAT TTTCGCAAAA TTTATTTGGT CGTCCCACCA CAACATGCAT TGATGTATGC 1080

TCACTGAATT TCATAAGAAA GGAGTTCACA AGATGACCGT AGATATTGGA CGGATTTATG 1140

ACAATAAAGA TAATACCGAC GCTATTCGTA TCCTAGTCGA TAGAGTCTGG CCGAGAGGTA 1200

TTTCGAAAAG AACTGCTAAC CTAGATTATT GGTTAAAAGA CATTGCCCCT TCTACTGAGT 1260

TGCGACAATG GTTCCAACAT GATCCTAAAC TTTTTGGAGC TTTTAAAGAA AAATATGAAA 1320

AAGAATTACG TGATCAGGAT GCXXIAAAAAG ATGCTTTTGA AAAATTAAAG GATATTGTAA 1380

ATCAGCATAA TCATGTTCTiy TTGTTATATG CAGCAAAAGA TACTAAACAT AACCAAGCTG 1440

TAGTACTACA GCAGTTGCTC AATACTTAGT TATATATGTT TATCGTGAAT CACTATAAAT 1500

GTCGCTAACT TCATTTCTAA TATTTATTTT TAAAAAGCAT AATTACTACA ATTAATTGGA 1560

ACTTTAATAA TTACTAAACT TGGAACAATA TTTTACTTTA ACAAAATAAA GTTTTAAA 1618



(2) INFORMATION FOR SEQ ID NO: 604: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2115 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 604:

ATTATAATGC AAATATTGAA GCATTTAAAA CAGTCGCTAA AGCAGTAGGC AAAGAGAAAG 60 

AAGGCGAGAA GCGTCTGGAA AAGCATGATA AAATATTAGC GGAGATTAGA AAGAAAATTG 120 

AACAGAGTAC GTTAAAATCT GCATTTGCAT TCGGTATCTC AAGAGCAGGT ATGTTTATTA 180 

ATAATGAAGA TACATTTATG GGACAATTCT TAATTAAAAT GGGTATTCAA CCTGAaGtcA 240

mAAAArAmAA AACTACGCAT GTTGGTGAAC GCAAG6GTGG TCCTTATATA TATTTAAATA 300 

ATGAAGAACT TGCCAATATC AATCCAAAAG TTATGATTTT AGCCACTGAC GGAAAAACGG 360 

ACAAAAATAG AACGAAATTC ATTGATCCTG CAGTrTGGAA ATCATTAAAA GCTGTGAAAG 420 

ATAACAAAGT TTATGACGTT GACCGAAATA AGTGGTTGAA ATCAAGGGGG ATTATCGCAA 480 

GTGAAAGTAT GGCAGAAGAT TTAGAAAAAA TTGCAGAAAA AGCAAAATAA AAATACAGCG 540
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TGTGATGATT TACCTGAAAG TTTATTTCGA ATAAATTTAA TTACATAACC GACAAGGATT 660

GTTTTAACAG TTdTTTMJT GAATT6GC6C ATCGTTACAT ACCTCATTTC TCTATATCTT 720

ACGAACTATA TACCCATTCA TATATGCTTT TTAAACGTCA TTGTCACAAT TTAATTTTTA 780

GGGAATATAA TATAACCATC TTTATCTGCT TTTTTAGTAA AAATGACAAA AATTGCATGT 840

ATTATTGaGA TGATGGTAGG GATACCTGTC CAGAAAAATA ATAAGTGAAA AAGACCTTGT 900

CCAAATTTAT CAGCATAAAA TTTATGaATA CCTAAACCTC CAAGAAATAA TGCAACmATa 960

ACATAAATGG CTTTATTGAC TTTCATTTGT AAtCCTCCTT AACTATAATT CTACTTAAAT 1020

TCGTTGTGAA AACCAATATT TCTAACTTTA GAATTTTCAA ACTTTCTAAA ATTATAAGTA 1080

TATCTTTTTA AAATAAGCTA GAATTTCTAT ATAATAAATG TTAATAACGT AAAAGGGAAT 1140

GATGACATAG TGATACGTCA AGCACGTCCA GAGGACCGAT TTGATATTGC GAAGTTAGTT 1200

TATATGGTTT GGGATGATAT GGAATTAGAA TTGGTAAAGC ATCTACCTAA AGACATGGTA 1260

TTAGATGCAA TTGAAAAAAG CTGTGTTGAT GCAACATATC GAACTTTTTA TCAGCATATT 1320

TTAGTTTATG AAGTAGAAAA TAAAGTAGCA GGTTGTATTA TTAGCTATAG TGGTGAAAAT 1380

GAATTGAAAT ACGAAAAAGC ATGGGAACTA CTTGACTTGC CAGAAAAAAT AAAACAATAT 1440

GGCACGCCAT TACCTGTAAA AGAAGCTAAA GACGATGAGT ATTATATAGA AACAATTGCG 1500

ACATTTGCAG CATATAGAGG TAGAGGCATC GCGACAAAGT TATTAACGTC ATTACTTGAA 1560

TCAAATACAC ATGTTAAATG GAGTTTGAAT TGCGATATTA ATAATGAAGC AGCATTAAAG 1620

TTATATAAAA AAGTAGGCTT TATATCTGAT GGACAGATTG AATTATACAA GCACATGTAT 1680

CATCATTTAA TTGTTAAATA AAATACTCGA CAGTTCGATG TAAGTCGATT GCCGAGTAGT 1740

ATCATTTTCT ATTAAATGCC TGCAAATAAT GCACTAATAT AAATACCTAA TGCATATAAT 1800

AAACCGAAAA ATGTATTTGT TTTACCAGCA GCAGCCATTG CTGGCATCAT TGTAGGCGGT 1860

GTATCATTCT TCTTGAAACX3 TCTGATAACT TTAACAGGCA TTGGGAATGA TAACAACGCA 1920

AGTAAGTAAA ATAATGAGCC ACCAGGTTTA ATAATGATCG TAAGTACAAT AAAGGCATAA 1980

GCGATAAAGT ACATGATTGC CATAAATGTT AAAGAAGCAT TTTTACCTAA TAGAATGGGT 2040

AAAGTTTTGC GACCACTTGC TTTATCTTTG ACACGGTCGC GAATATTGTT AGCCATATTA 2100

ATTAAACCGA TAGTG 2115



(2) INFORMATION FOR SEQ ID NO: 605: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1003 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 605:

TATGTCTCAA ACTCAAATTA ATCAAATGTT CCAGCAGAAA AACATGTCTA CTGAATTAAA 60 

ACGTCGTTAT GCACAAOGTT TATTACAGTT TCCACATGTA CACAATAAAG AATACTTGAA 120 

ATCTTATGCT AAAAACCCTA AAGAAACTAA AGATAGTTAT ATTTCTGGTT TTAAAGAGAA 180 

TCAATTGATT AAAATAGAA6 CGATTAAATC ATTGTTTGCA ATGGATAAAT CTCCATTAGA 240 

ACATGTTAAA CCTGCTACAA AACCAGACGC TTCTTGGGAT GAGATGAAAC AAAAAGCAGT 300 

TGAAATTGGT AAAGCTGATA CTACATCGAA TAAATTTGGT ATTAQAGATC AATACTGGAA 360 

ATTAATTCAA GAAAGTAAGC GTAAAGTTAG ACGTGACTAC GAATTCAATG TTAATTCTCC 420 

AGAATTCCAA GATTTAGAAT TACTTGTAAA AACAATGCGT GCTGCTGGTG CAGATGTTCA 480 

ATATGTAAGT ATTCCATCAA ACGGTGTATG GTATGACCAC ATTGGTATCG ATAAAGAACG 540

TCGTCAAGCA GTTTATAAAA AAATCCATTC TACTGTTGTA GATAATGGTG GTAAAATTTA 600 

CGATATGACT GATAAAGATT ATGAAAAATA TGTTATCAGT GATGCCGTAC ACATCGGTTG 660 

GAAaGGTtGG GTTTATATGG ATGAGCmAAT TGCGAAACAT ATGAAAGGTG AACCACAACC 720 

TGAAGTAGAT AAACCTAAAA nTTAAAATAC AAATAGCACA TAACTCAAC6 ATTTTGATTG 780 

AGCGTATGTG CTATTTTTAT ATTTTAAATT TCATAGAATA GAATAGTAAT ATGTGCTTGG 840 

ATATGTGGCA ATAATAAAAT AATTAATCAG ATAAaTAGTA TAAAATAACT TTCCCATCAG 900 

TCCAATTTGA CAGCGAAAAA AGACAGGTAA TAACTGATTA TAAATAATTC AGTATTCCTG 960 

TCTTTGTTGT TATTCATAAT ATGTTCTGTT AACTTAATAT CTT 1003 
(2) INFORMATION FOR SEQ ID NO: 606: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1097 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 606: 
GCTnTTATAT ITAAAAAATT TATTnGCGGA TAATTACCTT TATTAAATCC CACCCATTAG 60 
GAwTaCGAGT AATAGGACTA ATACTGGAAT CACATAATGT AACATAACGT CCCTCCTTTA 120
ACTTAATTTT AATTGTAATC AAATTTGACA ATAAGTCAAA mCATTAATAC CTATGATArG 180 
TATCATTTAT TAACATATGT ATCATATTTT TAATCTTGCG TAATTTTTAT CGTTAACTAT 240 

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5 






CTTTTGTCAT TTTAATTATQ TTAAGATAAT AGTAGTTATA GAAGTTCAAT CTATAGGAGG 360
CATAGCATGG ATATTCCAAA AATCACGACA TTTTTAATGT TTAATAACCA AGCTGAAGAA 420
GCTGTTAAAC TATACACAAG CTTATTTGAA GATAGTGAGA TTATAACAAT GGCTAAGTAT 480
GGTGAAAATG GACCTGGTGA TCCCGGGACT GTACAACACT CAATATTTAC ATTAAATGGA 540
CAAGTATTCA TGGCGATTGA TGCTAATAGT GGCACAGAAT TACCAATGAA TCCTGCX3ATT 600
TCATTATTT6 TTACAGTAAA AGATACTATT GAAATGGAAC GACTATTTAA TGGATTAAAA 660
6ATGAAGGTG CCATTTTAAT GCCAAAAACG AATATGCCAC CATACAGAGA GTTTGCTTGG 720
GTTCAAGATA AGTTTGGAGT AAGTTTTCAA TTAGCATTAC CTGaGTAAAA GGATTCGCAC 780
AGCtTGgAAT GATAAAGAAA CACTTTTTCT TATGCATGCG TTTTACCTAT GTATTTCTAT 840
TTTAAGTACA CATTAGCATT TTAGTTTCGC TCATTTTTAA AATCACAAAG ATAATTTGTA 900
ACTTAAATGA TGATACGTTA TTTAAAAACA CGATACTTCG TTTCAATGAA CGCATTAAAT 960
AATAAATAAA CACCTCACCA TAAGAAAGGA CTACTTTCTT TTGTGAGGTG TTTatTTGTC 1020
GACAACTTGA TTATGATTGT TTCATTTTTT GAATAAGTTC ATAATCAGGT GTAGCATACA 1080
AAGTTTTTTG ATTGTCA 1097



(2) INFORMATION FOR SEQ ID NO: 607:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2031 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 607:



GAGGTCTAGC ATGTCTCGTT CAAAAAAATA CTTTTACTTA TCTAGCTTAA TGATTATTTT 60
AAGCTTTTTC TTTAATACAA ATAACGTTTT CCTAAGTGGA CTTTTTAATT CTTTTATTAA 120
ATTAATACTT TTCTGCAGTG TTATTAACTC AATTGTACTA ATTTTGTCTA TAATTTTTGC 180
AGATOGTTCA ATTAAATCAC TAAAGCCTGA TGCAGATTGG ATTAGAATTG CGAGTAAAAG 240
TTTGCCTTGG aTTATTCTAA TTGTTATTTT AGTACATATC TTTTCAATTG TTCGTACATT 300
CGGTTTTATT TAAAAAAGTT AATATGTCAT TGTAGCCTTA ATACAAAACA ATACAATGTA 360
TCATGCTATA ATGAGTAAAA CAATTTGATA ACGTTGTTGC GTATAAAAAA TATTAGATTT 420
TCGAAATCAT AACTATGCAT CTAATCGCTA TAGTTATACA ACAAGATATA ACATATAATG 480
AGGTTTGATA ATOCATCGAC AATTTTTGTC GTCGCGTTOC CAAAACCTCT TTTTTAAATT 540
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10 






TTTAGGCGCT TTATTACGTT ACCTCATTTC TTTTCTGAAT ACTGACGGAG GTTTTCCAAT 660 

CXX5AACACTG ATAGCCAATT TGACTGGTGC CTTTGTAATG GGATTGCTAA CAGCCTTAAC 720 

AATTGCATTT TTTTCAAACC ATCCGACCCT AAAAAAAGCT ATTACX5ACTG GTTTTCTTGG 780 

TGCTTTAACG ACTTTTTCAA CATTTCAATT AGAATTAATA CATATGTTTG ATCATCAACA 840 

ATTTATAACr TTACTACTAT ATGCTGTAAC AAGTTATGTC TTTGGTATTT TGTTATGTTA 900 

CGTCGGTATA AAACTAGGTG GTGGTTTATC ATGATATCAA TCATTTTAGT CATGATTGGC 960 

GGCGGTTTTG GCGCAATTGC TAGAAGTGCC ATTACTGATT ATTTTAATCA TAAATTTACT 1020 

TCAAAGTTAC CTATCGCAAC ATTGATAGTA AATCTAGTTG GTAGTTTTTT AATTGGATTA 1080 

ACTATAGGCT TATCAATTTC AATCTCATGG TTCCCTGCGT TCTTTGTTAC CGGTTTTTTA 1140 

GGTGGCTTAA CAACTTTCTC AAGGTTAGCC AAAGAACTTA CACTAATGAT GACGCCAAAA 1200 

TTTAATATTA ACCTTTTTCT CAATTATTCA CTTTTACAAT TCATCATTGG ATTTATAGCT 1260 

TGTTATATTG GCTATCATAT TTAAAAATAA AATGCTTCAT TCAGCAAATA GGTAAATTAC 1320 

GACACCTTCC TGAACGAAGC ATTTTTTAAT TTTCATGCAA ATTTTTAAGC ACCATATAAT 1380 

GCCTACCAAA TTTCAATAAT CTTTGTTGCC GTTTAAATAA TGTGAATGTC AATAAATTCT 1440 

CCAAACTAGT CGAAAATAAA GGGAGTGGGA CATAAATCCC TAAAAAAACA GCAGTAAGAT 1500 

AATTTTCAAT TAGAAAATAT CTTACTGCTG TTCTCTATTT ATACAATACT TCGTATTGAa 1560 

TGGcTTCGCT TTCCTAGGGT GCCGTCTCAG CCTCXSGcTTC GACTGGCACT GCTCCCTCAG 1620 

GAGTCTCGCC ATTAATAcTA CGTATTAACA TGTAATTTTA CTTTTACATA CTTTAAAAAA 1680 

TAAGACACTT TGCCCAACTT ACACTACCAA TAGAAACCTC TGTTAGAATT CCTCAAAATG 1740 

ATATTTCGCG ATATGTTAAT GAAATTGTTG AAACGATACC TGATAGCGAA TTCGATGAAT 1800 

TCAGACATCA TCGTGGCGCA ACATCCTATC ATCCAAAAAT GATGTTAAAA ATCATCTTAT 1860 

ATGCATATAC TCAATCTGTA TTTTCTGGTC GAAGAATAGA GAAATTACTT CATGACAGTA 1920 

TTCGAATGAT GTGGTTAGCT CAAAATCAAA CACCTTCTTA TAAAACTATT AATCGTTTTA 1980 

GAGTGAATCC TAATACTGAT GCGTTAATTG AATCTTTATT TATTCAGTTC C 2031 
(2) INFORMATION FOR SEQ ID NO: 608: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 687 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



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10 



TCCCCGGGAA TCXIAACCCXK: GAnCTCCTGC GTGACATGCA GGCGTGTTAA CCGCTACACT 60 
AOGAGACCTA TTAnATTAAA AACTATGTAT TGCGGGAGGC GGATTTGAAC CACCGACCTT 120 
CGGGTTATGA GCCCXIACGAG CTACCGAACT GCTCCATCCC GCGCTAATAT TATTTTGAAT 180 
TACCTAATTA ATATACCATA ATCAAAAACC TAAAGTarrG AACTTTTTGA ATTTAATTTA 240 
AATGTTATCT CTTAAATAAT TACTTAAATA TCGTAGCAAC ATGTTCTCTG TTGAACACAA 300 
ATATTAGTAT ATTCATTTTT 6TAGTGTACX3 TCAACGACAT TTTCAAAGTT TTTTGTGTAA 360 
AAAAACXKrrr CTTATTCCCT TTTATCATAT AAGTGTCTAA TAGTTGTCAT AAATAGTGkT 420 
AAAGCATTTA AAAAGGTATA GGAGTTATAA AGTTTACAAC GCCTATACCT TCTGAAAAAG 480
AAATTATAAA AGCTTGTTAC ACCGCATATT CTTTCAGTCA GCGACTACCA ATATAACATT 540 
GTAGCCCtAA GACATTGCTT GACGCCTCAn TTACAACAAT TTTTCAAAAT CAGCAGCTAC 600 
CTACTGACAC AACATAACAC AACCCnTACA CTAnCTATCG TGTCATGTAA TCTTGCATCC 660 
GATCTTGCAA CGCTGTAAAT GTTTCGA 687 
(2) INFORMATION FOR SEQ ID NO: 609: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 843 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 609:

GTGTTGAATA CTTTGACAAA ATGAAATCAT TTGTTCTTTC GTTTCAAATT TAACTGTTTG 60

AATTAAATCA GTTCGTTTTT CGTAGTACTT CGGCGTTGTG TTCATATTCA TTTTTTCTAA 120

AAATAAACTA GTAAACAATG CACCTTTAAG ACTCTGACTG ACAACGTGTG GTGCTAAAAA 180

GAAACCTTGA TACATTTCAA GCAATGCATT TAATGATGCA CCCGCTTCTT TACCAATACC 240

AGGTGCTGTC AATCTATAAC CACATCGTTC AATTAAATCT TTTCTACCAG CAATGTATCC 300

ACCAATCTTA GCTAAACCAC CGCCAGGGTT TTTAATTAAT GATCCTGCTA TTAAATCGGC 360

ACCACATTCT ATAGGTTCAC GTCTTTCAAC AAATTCCCCA TAACAGTTAT CCACAAATAT 420

TAAAATATTA GGATGCACGT TTTTCAACCT AGTAATTACC TTTTCAATTT CATCTAGCGG 480

AATTGAAGGT CTTTGATCAT AGCCTTTCGA ACGTTGAATC GCTATTACTT TGGTGCGCTC 540

AGAAACCCCa TCTAACACAC TTTCAATATC GATCTTACCT TCTTTAAGTG CAATATCTTT 600

ATACGATmCG CCATGCTCCA TTAAACTTTC AATACCATTT CCGTTTACGC CAATGACTTC 660

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ACnTGTAAT GCAATAGTAA TCGCATGCGT ACCTGAAATA ATTTGCXX3AC GAACAATTGC 780 

ATCTTCTGCT TTAAATGCCT GCGCATATAT TTCTTAATTG AACGAATTTA GAACTAATCG 840 

TAA 843



(2) INFORMATION FOR SEQ ID NO: 610: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 775 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 610: 



GATTATTATA AACCGAATGA AACAAAAAAG TTTTTACATA TTTTCAACAA CTTTATTTCA 60
ATAAAGCATC AAGTTGATCT AAAGTTTGAT TCATACCTTG TTCAACACCC ATGTTTATGA 120
CTTGTTGAGC GGCTTCTTTT GTTGGAAATA CCGATGTTGA TGTCACTGTC GTTTTCGTTG 180
TATTGCTCTT GGAAAATGAC AAAGTAATTT TCATGCTTGG CATTTTTGTA TCTTTTGCAC 240
CTTGAGGTGT CGCAAATGAG TCAATATATT CAATTAAATA TGGACGCTTA ACCGTTTTAT 300
ATTCTGCTAA TGTATAGCTG GTCATTGTAG GTGTTTTAAT CGCATAAAAT GCATCACCGC 360
CTGAAACAGC ATTAAAGCGA AACACTTTGG TACTAGCGTC TTTTGGAtGA AACCATTTTT 420
CAAATAATGC TTTkGTCGTA TAGGCATCAA ACACCTTTTC AATCGGTGCT TCTATCGTTC 480
TTGAAAAAAT AATTTTATTA TCTTCAACTT TAATCOTCAT CTTCCCACTC CCTATCTTTT 540
ATATTTACTC ATCTTAACAT GCATTGAGTA ACATATGTTA CCAAATCATT TGTTATAAAC 600
TATAGTAATA TTAATCATTC CCTCTGTnGA AATTGATAAC TACGAAATAA AAAACACACT 660
CTATTCAGCT ATTAACAGTT GAGTAGAGTG T6CAAACCTG TTGAATTAAT GCGCTAACAT 720
TTCTTCTTTG ATTTGGTCTT nATnTAATTT TGAAGGATAA TATGTTGGCC AGTTA 775



(2) INFORMATION FOR SEQ ID NO: 611: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 728 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 611: 
ATCAAAATTT GATACTAAAG GGCTCACGCT ATTTTATGTA TTTATTGGAT TGATTATGTT 60 
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TGTGGCAATG TGTTTATTTA AAGTAGAAAA ACATGTTTCT TCACCATTTT TACCTGTGGT 180 

TGAATTTAAT CGTTCGATTA CTTTAGTTTT TATAACTGAC CTTCTAACAG CTATTTGTTT 240 

AATGGGATTC AATTTATATA TTCCAGTCTA CCTTCAAGAA CAACTAGGAT TATCTCCATT 300

GCAAAGTGGA TTGGTTATTT TTCCTTTATC TGTAGCTTGG ATTACATTGA ATTTTAATTT 360 

ACaTCGAATT GAAGCAAAAC TATCAAGGAA AGTTATTTAC TTACTATCAT TTACATTGCT 420 


ACTAGTAAGT A6TATTATCA TTTCATTTGG TATTAAATTG CCGGTACTTA TAGCATTTGT 480 

GTTAATTTTG 6CAGGATTAA G7TTTGGATA TATTTATACG AAAGATAGTG TGATTGTCCA 540 

AGAGGAAACT AGCCCATTAC AAATGAAGAA AATGATGTCA TTTTATGGAT TAACTAAAAA 600

TCTTGGGGCA TCAATAGGTT CAACAATAAT GGGATATCTT TATGCGATAC AATCAGGAAT 660 

CTTTGGTCCA AACTTACACA nTGTGTTAAG TGCTGTTGCT GTAATTAGCA TGGGCTTATn 720 


GTTTTATG 728 
(2) INFORMATION FOR SEQ ID NO: 612: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 913 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 612: 



TATCAGAGCA AATAAGCAAG TTAGATAAAA AGATTTCTGA CTTACAATTA ATTAGAOGCT 60
CTGTATGTGA ATTTATTAAA GGACTCTCTC TAATAGATAC CAGCATTTTA AACAAGACAC 120
TACAGTCACA ATATGATAAA GAAGCATCTA TAAAATATGG TCATACGAAA GCATATCAnT 180
CATTTATTAG ACGTAAAGAC AGCTTACAAT CGCAGGATAT CAGACATAAA TTGACAACTA 240
TCTTCAATAA ATTTAATCAT ATGTCTTTGA GTCATTATCC AATCCAAGAT TGTAGTGATC 300
TCGTATTTGA GTGGAAGGCA TTTATGAACA CTATCGCTGA TTTTGmTGAT GAAACATTAT 360
GCTGTATTGC TAAAACATAT GAAGAT6ATA CGCGTTTCAA AGATTACTTT AATTCATATG 420
ATAATCAAAA TTTAGCATCA TACATTTCAG AAGCTGTTAA TTATTTTTTG AGCAATGTGA 480
ATAAGAGCGA CAATTTTTAA TCCTCATACA GATGCTACAA CACTGAATAC ACCAACTAAA 540
ATAATTGTTC AAAGTAGAAA ACTAATCATT TATTCTCTAA ATCAATAACT ATATTAAAAG 600
TTATACCTTT GCAAAGCGAA TTAGTATAGG TTACCGAAAG GAGAAAGGAT TAGGTTCCAT 660
TCGATTTATA AAAAAATATA TTTCGAAACA TACAAAAGCG CCAGAAGATA ATCATTTCCT 720



TCAATGCATA TTCAACTAGC TCAGGTTTAC TCTTTAAGCC AAGCTTTGTC ATAATATGCG 
TCTTATGTGC TTCTACTGTT TTCACAGATA CAAATAATTT CTCTGCAATT tCTTTATTCC 
CGTAACCTTT GGC 

(2) INFORMATION FOR SEQ ID NO: 613: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 654 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 613: 
ACAAGTGTTG ATGTCATAGT AGCGTCAAwT GTQCTTTATT TCTOOQACAC ATTTAAATTT 
TTCCACCCCA TTACTTGGAG CTTAATCTTT ATTACAATTT TACTATTATT AAACATTTTT 
TCTGTAAAAT CATTTGGAGA AACTGAGTTT TGGTTATCAT TGATTAAAGT GTTAACAATT 
ATCGTATTCG TTATTTTTGG CTTTTTAATG ATTTTCGGTA TCTTAGGTGG TCATACATAT 
GGATTTGAAA ACTATACAAA AGGCCAAGCA CCGTTTGTTG GTGGTATCTC TGGTTTCTTA 
GGCGTATTAT TAGTCGCCGG ATTTTCGGTT GGTGGTACAG AAGTAGTAGC AGTAACTGCT 
GGTGAATCAG ATGACCCTAA AAAGTCTATG CCTAAGGCAA TTAAACAAGT ATTTTGGCGT 
ATTCTTTTAT TCTATGTCTT ATCAATTGCA GTAATTGGTG CAATTATTCC GTACACAGAT 
CCATCATTAT TAAGAGCAAG TAGTTCAATA AGTCAAAGCC CATTTACAAT TGTATTCGAT 
AGAGtAGgCA TAGCCTTTGC AGCATCAGTA ATCAACGCGG TTATTTTAAC TTCATTATTA 
TCCGCTGCAA ATTCAGGTGT TTATACAACA GOCAGAATGT TGTATTCCTT AAGT 
(2) INFORMATION FOR SEQ ID NO: 614: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 642 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 614: 
TCGATCTTTA TTGATAATAA TTAAATTGTC GCCTTTAAAA TGTGATATTA ATCCTGCGGC 
AGGnTGTACA ACGAGTGATG AACCTAGTAC AACAAGGGTG TCAGCATGTT CAATTTTATT 
TAATGCCCTT ATGATGGTAG GTTGATCTAA CATTTCACCG TATAATACGA TGTCCGGTCG 
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ATAAGATTTA TGACATACAT TACAATAAAA ACGATTTAAC GTGCCAT6TA ATTCATCAAC 


300 




ATGTTGACTT CCAGCGTCTG AGTGCAAACC ATCGATATTT TGCGTGATGA CACCTAAAGA 360
TTGTTGATTA CGTTCTAATT TTGCAATCCA ATCATGAACG ATATTGGGCA TCGTATCGAC 420
AAATAGTAAG CGCTTATGGC AGAAATTGAT AAAACCTTCA GGATCATCTT CTAAATAATC 480
ACGGCTTAAC AAGTATTCTG GCX5AAAGCCC ATCTTTTGaA ATTTCATCAA ATAAGCCACC 540
CATTGaACGG AAATCZX3GAA CGCCACTTGC GACAGATACA CCAGCACCTG TAAAAAATGT 600
AATACGATTC GAACTATCTA TAATATGTTT TAGTGTCTCT AA 642


(2) INFORMATION FOR SEQ ID NO: 615: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1210 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 615: 




TTTCCCCnCC CCnCCAAATA TCCAAnGGAA CTTTAATAGT CCAATTGGCA CAGTAAAACT 60
ATGGCATTTT ATAAGTATAA TATATCTGTA ATTTATGGTC AATTAGTAAA TTGTTTTTTA 120
TTTGAAACAT ATTTCACATC AAAATCACAA AGACTTTTAG ATTTTGtTCT AAAAATCTCT 180
TAATAATTTA TTTAATGAGA AGAGTTGCTT ATATAGTAAA TTGTGAAGCC GTTAAAACAA 240
CGTTACAAAA CCTATATCTT TAATACGGAA CCATATGGTA TGAATCAAGG AATACTTAAA 300
CTAAAACTTC TCTATCAGAT TTATTTGTTG CGAAATCAAC AACTTTAATT GCTTGCCCTT 360
CATTTAATGG ATAATTTGCT TGCGTAATTT TAACTTTTAC AATTTGACCT ATGAGTGATT 420
CGTCACCTTC AAATTGTACT TTCATATAAT TATCTGCATA TCCAACTAAT GTACCTTCTG 480
TGtCACCCTG TTCCTCAGGA ATTACTTCAA GCACATCTTG ATCAAATTTA GACGCATATA 540
ACTTTCCGAG TTGATTGCTT AGCGTAATTA ACTTATGCAC CCGTTCATTT TTAATTTCTT 600
CATCAATTTG GTCATCCATT CTTGCAGCTG GCGTGCCAAT TCTAGGAGAA TAAGGGAAAA 660
CATGCAGTTC AGAGAACTTA TGCTTTACGA TAAAATCATA TGTTTCTTGG AACTCAGCTT 720
CAGTTTCACC TGGGAAACCA ACAATTACAT CACTCGTAAy TGCCAAGTCT GGTAAAGCTT 780
TATGCAATTT TGTTAATCGT TCTGAAAATC TATCCATTGT ATACTTACGT CTCATACGTT 840
TTAATACTGT ATCTGAACCA GATTGTAATG GAATATGCAA ATGACGCACA ACTTTTGTTG 900
AACGTTCTAA AACGTCAATT ACTTCATCTG TAAGTTGACT TGCTTCAATT GAAGAAATTC 960
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CTTTTAAATC TTGACCATAT CCACCTGTAT GAATTCXCGT CAATACAATT TCCTTATATC 



1080 



CTGAATTCAC TAGTTGCGTC GCTTGTTCAA CTACTTTTTC CGGATCTCTT GAACGCATTA 1140
AGCCACGAGC CCATGGAATA ATAcAGAATG TGCAGAAGTT GTTACAACCT TCTTGAATTT 1200
TTAATGACGC 1210



(2) INFORMATION FOR SEQ ID NO: 616:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 652 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 616: 

TAATAAAATA CAATACTTTT CAATACAGAG AATCCCGCAA TGTCGGGATT CTTATTTATG 60 

CTGATTTTGT TTTrGTCTAT GATCAGGGAC TTtCAGGGAC TCAATTAATT ATCACATTCA 120 

TATTTTCTTA TGACCAAATT GATACATATG ATGCTATTTA ATGAGAAAAT TAGGCATCAC 180 

TTGGTTATTG AATTTCCTTC ATTAAACTTT CCAGCTCAAT TTAATAGTTA GTCGACTATT 240 

ATTCATTAAA CACTTTTTAA TCATAAAAAA GTGTTTTTGa TAATTCACTa CCaAT^CAC 300 

CTTCTTACTT ATAATTCTAT TTGTTCACCA TTTCTAATCT TATCGGCTAA ATCATTCAGT 360 

TTTCTTAATC GGTGATTTAC ACCTGATTTT GAAATTGGAC CAGTTGATAC CATTTCTCCA 420 

AGCTCTTTCA ACGAAATTTC TTGATGTTCT ACTCGAATTC TAGCAATCTC TCTCAACCTG 480 

TCTGGTAAAT TTTCAATACC AATTTCTTTA TCAATCAATT TAATGCTCTC AACTTGTTTC 540 

ATCGCAGCAC TAACTGTTTT ATTTAGATTG GcCGTTTCAC AATTAACGAG TCGGTTAACA 600 

GAATTACGCA TATCTCTTAC AATACGTACG TCTTCAAATT TTAATAACGC TT 652 
(2) INFORMATION FOR SEQ ID NO: 617: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 798 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 617: 
TnACCACTTT AAAATAGCGC TTAATAAAtG AAGGGGGCAA GTCTTATGAC GtTTTACaAT 60 
TTCATCaTGG GTTTTCAAAA TGrTAACACA CCATTTGGTA TATTGGCCGa ACACGTTAGT 120 
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TCTAATTACA CAGATCATCA ATTAATTGAA ACTACAAATA GAGCTATTAG CTTATATATG 


240 




GCAAATTAAT TTGAGTA6TA CCAATTATGA TGTATTAGTG CATCCCAAAT ATCTTTTGTT 300
TTAAAGTTTA TTTCATCATT TCTTATCXy^ AATGGTGTAA TAATGTCTTT ATCTAACCAA 360
GTGTTGATAA GTTCATTTGG TACACCATCT AACAACATTT CACTTTTACT AATTATAAAA 420
CATTCCCAGT CAAGTGAAAC ATTTTGTGGA TTCACATTUVT TACATTGATT ATGATTATCC 480
ATAAACACTC ACTCCTTTAA AATTCTGTAC TCTTCATTGC GTTTTACCCC GTCACATTAT 540
CTTTTAAACT AAAATCATCA TTACTTATGA AAAAAATGTA CATCAAAAGC AAAGGTTTTC 600
GCTACCGAAA AAGTTTAAAT AATGGTttAA TATATttGGT ACTCATTTTA ATAAAAAGAG 660
AATACATTTT GAGCTATCAA TACTTTTTAT TGAAGAGGTG TTATTsyTGG CTAAAACGTT 720
ATATTTAATG CGCCACGGAC AAACTTTGTT TAATTTTAAG GGACTAATTc AGGGATTTGG 780
AGATTCGCCG CTAACAGA 798




(2) INFORMATION FOR SEQ ID NO: 618: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1786 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 618: 






ATACGCTCAA TTGATAAAAA TTTAATATCT GCCATTTGTA TACGCATCGC TTCGAATGTT 60
TCCGTTGCAA TATCAAATGA TAATTGTGAC TGGAACCTTA AACATCGAAT CATACGTAAA 120
GCATCTTCTT GGAATCGTTC CTCAGCTATA CCTACAGTTC TTATTATTCG ATTATTAATA 180
TCTTGTTGAC CATCAAAATA ATCATACAAT TTGTATGCTG TATCCATTGC TATCGCATTC 240
ATCGTGAAAT CTCGTCGTTG CAAATCTTCG TATAAATCAC GAACAAATGT AACACCACTT 300
GGTCTACGGT GATCGACATA ATCTTCTTCA GCCCGGAATG TTGTCACTTC ATAATTTtCA 360
TCATtAAAaa CTACATTtAT CGTGCCAtGT yCTTnACCTA CAGGTATCGT ATGACTAAAG 420
ATAGATTCTA TTTCATCCGG CGTTGCACTT GTTGTGATAT CTATATCATG AATATTTCTT 480
CCCATGACAT AATCTCTTAC AGAGCCACCT ACATAATATG CTTCAAAACC ATTGTCTTGA 540
ATTTGTTCTA ATATAGGCCT TGCCTGTTCA AATAATGATT TATCCATATT ATTACTCGCC 600
TTTACTTTTG TTATGCTCAT TTAGCATTTT TTGATAATAA TACTCATATT GATCTGTAAT 660
AAGTTCTGAT CCAAAACGTT CAGCAATATC TGCTAGCATG TTTTTCTGAA GTTTGTTGTA 720
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ATCTACGACA AATCCAGTTT CACCATGTTT AATAACCTCT TTAATTCCAC CGGCATTTGA 840 

ACCAATTGGA ACQACGCCTG TwTTCATAGC CTCAAGTAAA GTTAGTCCAA AGCTTTCTTT 900 


TTCACTTAAT AATAATACTA AGTCAGATAA TTGGTAAAAT TCACTTACX5C AATCTTGTTT 960 

CCCTAAAAAT AAAACATCCT CTTCTACGTT TAACTCTTTC GTCAATTGAC GCATTGGCAC 1020 

TAATTCAGGA CCATCTCCAA GTAAAATTAA TTTACTAGGT ATCTTTTCAC GTACTTTTGC 1080 

AAATGTTTCT ATAATAGTAT CTATGCGTTT TACTTGTCTA AAATTCGATA CATGTATTAA 1140 

CACTTTTTCA TCTGGTGCTA TACXZAAATTG TGATTTTAAT GCTGTGTTAT GTTTAGTTGG 1200 

AAACTCATTT TCACGTACAA AATTATAAAT CGGTATAATT TCTTTGTTAG TTTCAATAAT 1260

TTCATGTGTT TCTTGTGCTA AAGATTTACT CACACTTGTC ACAATATCAC TTTTTTCAAT 1320 

GCCAAATTTA ATTGCACCTT GGAGTGAATG ATCATAGCCC AAAACAGTAA TATCAGTACC 1380 


GTGTAGCGTT GTCATAATTT TTATATCTTT ACCTQACATC TCACQAGCTA AAATCCCACA 1440 

AATTGCATGA GGTACAGCAT AGTGCATATG CAACAAATCA AGATCATATT CTTTAATAAC 1500 

TTCAGCGATT TTAGTACTTA ACGTAATATC ATACGGTGGA TACTGAAATA CTGCATATTG 1560 


ATTCACTTCA ACTTGATGAA AAATCATATT CGGTAATGGT TTTCTTATTC TAAACGGGAT 1620 

ATTTGAAGTG ATAAAATGTA CTTCGTGACC TCGCTCTGCT AATTTAATTC CTAATTCTGT 1680 

GGCAATAATT CCAGAACCAC CCATGGACGG GTAACATGTT ATACCTATCn TCATTCOCTT 1740

GCCCATCCTT TCTTTCTATT TCTCnCTATG ATnCTCGATG CGTAGA 1786 
(2) INFORMATION FOR SEQ ID NO: 619: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 844 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 619:
ACAGGTATGG ACTTnGCTCA AATGACACGA CATTATTTAT CAAGACCTAT TGCTATAATC 60
TTTTGGATCA TTGCAGAACT AGCAATTATC GCTACAGATA TTGCTGAGGT TATTGGTAGT 120

GCTATTGCTC TTAATCTCCT ATTTAACATA CCTTTAATCG TCGGTGCACT AATAACTGTA 180

CTTGATGTAT TTTTACTACT TTTTATAATG AAATATGGTT TTAGAAAAAT TGAAGCTATT 240 

GTTGGTACAT TAATTTTCAC AGTGTTATTC ATCTTTATAT TTGAAGTCTA TATTTCATCA 300 

CCACAGTTGA ATGCTGTGTT AAATGGATTT ATACCACATA GTGAAATCAT TACAAATAAC 360 




TTACATTCAT CAATTGTACA ATCTAGAACA TACTCAAGAC ATAACAATGA AGAAAAAGCG 
CAAGGaTTAA ATTTGCTACQ ATAGATTCX5A ACATTCAGTT ATCAATCGCA TTTGTAGTCA 
ATTGCTTATT ATTAGTGtTA GGAGCATCAC TATTTTkCAA CTCAAATGCT GACGATTTAG 
GTGGTTTCTA TGATTTATAT CACGCCTTAA AAACTGAACC TGTACTAGGT GCAACAATGG 
GTGCAATCAT GAGTACATTA TTTGCAGTTG CATTATTAGC GTCAGGTCAA AATTCAACGA 
TTACTGGTAC TTTAGCAGGA CAAATTGTAA TGGAAGGATT TTTAAGATTA CACATACCAA 
ATTGGTTAAG ACGTTTAATT ACACGTTCTC TTGCTGTCAT TCCTGTTATC GTATGCTTAA 
TCAT 

(2) INFORMATION FOR SEQ ID NO: 620:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 574 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 620:
TGTATTGCAG TCATGCCATA AATTTTTTGA ATAATAAACG GAGACGCAGA AATATAAGTA 
AATAATATTA CAAATGTCAT ACCTTGGAAT GAGCATTGGT AATACAAAAC GTGGCGTCTT 
CAnTAATATT TTGAAGTTTT TAAACATTGT CTTTAaTCCA CTACTTGACT CACGATTTGT 
CACTGTTAAT GATTCAGGTA CTTTTAATAA AGAACCTATG ACCATGACGA AGCCAAAGAT 
AGTCAGAATG ACAAAGACCA TACGCCAGAC AGAATAATTT AAAATTATGC CCCCTATTGT 
TGGCGCAACA ACTGGTGCAA TACCATTAAC AAGCATCAAT AATGCCATAA ACTTAGTTAG 
TTCATTACCA CTATACATAT CACTTGCTAT AGCTCTTGAA ATAACTGCTG tGCGCCACCT 
GTCACTCCTT GAAGaAATCT TAATGCAACC ATCAGCCAAA TATTATGTAC AAAAACAATA 
CCTAAACTTG CTAATGTAAA AATAATCATA GCTATAATAA GCGGCTTTCT GCGCCCAGTT 
GAATCTGAAA TTGGACnAGC AACCAAATTA CCAA 
(2) INFORMATION FOR SEQ ID NO: 621: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 523 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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AAGCAAGAGA GAGTACAACA TTTATATGAT ATTAAAGACT TACATCGATA CTACTCATCA 60 

GAAAGTTTTG AATTCAGTAA TATTAGTGGT AAGGTTGAAA ATTATAACGG TTCTAACGTT 120 


GTACGCTTTA ACCAAGAAAA TCAAAATCAC CAATTATTCT TATTAGGTAA AGATAAAGAG 180 

AAATATAAAG AAGGCATTGA AGGCAAAGAT GTCTTTGTGG TAAAAGAATT AATTGATCCA 240 

AACGGTAGAT TATCTACTGT TGGTGGTGTG ACTAAGAAAA ATAACAAATC TTCTGAAACT 300

AATACACATT TATTTGTTAA TAAAGTGTAT GGCOGAAATT TAGATGCATC AATTGACTCA 360 

TTTTCAATTA ATAAAGAAGA AGTTTCACTG AAAGAACTTG aTTTCAAAAT TAGACAACAT 420 

TTAGTTAAAA ATTATGGTTT ATATAAAGGT ACGACTAAAT ACX^GTAAGaT CACTATCAAT 480 

TTGAAAGATG GAGAAAAGCA AGAAATTGAT TTAGGTGATA AAT 523 
(2) INFORMATION FOR SEQ ID NO: 622: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1871 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 622: 
A5AAGTGTGA nAAAAATTTA AnAGAGATAT GCACATAGAT GACGCATTGC TATATCCAAG 60

CAATTGAGAA AGCTGCTGAT GCTCCAAATC aCGGAATGAG GGAACCATGG AGAGTTGTGC 120 

ATGTTCCGAA AGACAGATTA GGAGATATGA GTAAGGATAT TTCTAAATTT GCATTTCCTA 180 

ATGAATTAGA TAAGCAACAA TGTCATTATG ATGCAGTTAC GAAACTAGGT GGCATGTTAT 240 

TGCTTATTTT AAAAACAGAT CCAAGACAAC GTCAAAATGA TGAAAACTAC TTTGCATTTG 300 
tGCATATGCA CAAAATCTTA TGTTGTTACT TTATGAAGCG GGAATAGGTA CATGTTGGAA 360 
aTCGCCATTA TATATCTATG ATCCTAAAGT AAGnAAACAC TTGGTATAAA GnAAGATGAA 420 
GTTCTTGCTG GATTCTTATA TTTAACGGAT TTAGAAGnAG ATATGCCTAA AGCACCACGT 480 
AAAAATAGAA ACTTAATTAC ATTATATTAA TATGTATAAT TATAGAAACA TTAATAAAAG 540
CTGAGTCATG AATTGATGGA CATCTATCGA GTTAGAGATT TAATCTAACT TACTAGAGTC 600 

GGTACAATAA CAGTCTCAGC TTTTTATTGT GCAGTATATA CACATTTTTA TTTTAGTATT 660 


TATTtAAAAG TTtCTGCTaA AAATGATTCA ACTT6TTCAG GTGACTTAGC ATTTGCTGAA 720 

TGAAGGTGTG CAATTTTATC GCCGTTTTTA AATACTAGCA AGCTAGGGAT ACCCATAACT 780 

TCATTTTCAA CAACTACATC TTCTAATTCA TCACGATTAA CAGTATACCA TTGGTAATCA 840 




CCTGCCTCAA ATTTAACAAT TACAGGTGTA TCGCTATTAA TTACAGATTT AAATGATTCA 960
TTACTTTTGA TTGATTGCAT TGTAACAACT CCTCTAGATA GTTTAATAAT TTTTATTATA 1020
GCTAAATTTA TATCATAATA AAAAATTTTA GCTTCAAAAT GAAAGCCTTT TGCTTTGGAA 1080
AATGATATAT TTATTTTAAA TACATAAAGG AGGTTGCAGT CGTATGATTA AATTTTACCA 1140
ATATAAGAAT TOTACAACTT GTAAAAAGGC AGCAAAGTTT TTAGATGAAT aTGGCGTAAG 1200
TTATGAACCA ATTGATATCG TTCAACATAC ACCTACAATA AATGAATTTA AAACAATAAT 1260
TGCAAATACA GGCGTAGAAA TTAATAAATT GTTTAATACA CACGGCGCX3A AATATCGTGA 1320
GCTTGATTTG AAAAATAAAT TACAAACTTT ATCAGATGAT GAAAAGTTAG AGTTGTTATC 1380
ATCTGATGGT ATGTTAGTAA AGCGTCCTCT AGCAGTAATG GGCGATAAGA TAACATTAGG 1440
ATTTAAAGAA GATCAATATA AAGAGACTTG GTTAGCGTAA GTGaAATGTA AGCGTTTACT 1500
AAATATCTCG ATATTTAGAT TCATTACATG TAAAATGAAA TAAGCTATAC AATTGTTAAT 1560
TTTTATAAAT ATAGTTGAAT AGCATCTAGC CTTATGGCAT CATTAATGAT GTAAAGATTA 1620
ATTAGGAGGG GATTCTCTTG GCAGTACCAA ATGAAyTGAA ATATTCAAAA GAGCATGAAT 1680
GGGTTAAAGT TGAAGGTAAT GTAGCAATAA TTGGAATCAC AGAATACGCA CAAAGCGAGT 1740
TAGGTGATAT TGTTTTCGTT GAATTACCAG AAACAGATGA TGAAATTAAT GAAGGGGATA 1800
CGTTTGGTAG CGTAGAATCA GTTAAAACTG TATCAGAATT ATATGCACCA ATCTCTGGTA 1860
AAGTAGTTnA A 1871

(2) INFORMATION FOR SEQ ID NO: 623:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 451 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 623:

TTGCTGTAGA AGATAAAGAG TCCCATCAAT GGATAGGCTT TATAGGTTTG AATTATATTC 60 

CAGAAACAAG CGATTATCCA TTTAAAGAAT TACCGCTTTA TGAAATAGGT TGGCGCTTGT 120 

TGCCAGAATT TTGGGGAAAA GGATTAGCAA CTGAAGGCGC AAAGGCAACA TTGAAGTTAG 180 

CAGAAGAACA TCAAATATAC GATGTCTATA GTTTTACAGC AGAAGCAAAT AAAGCTTCAC 240 

AACGTGTAAT GGAAAAAATT GGCATGACAG TGTATGATCA TTTCGAATTA CCCAATCTAA 300 

GTAAGTATCA TTTATTaAAA AGGcAAGTGC GCTATTACAT TAATCTTCcG AAAGTGGAAA 360 
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ATTAgGGGGT GtTTnGTkTn ATTTTTTTAA n 451 
(2) INFORMATION FOR SEQ ID NO: 624: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 665 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 624: 

CGATAGTTAT CTAAAGCCAT TTTnGATTGT GTTATGAAAT CTAATGATGC GTGATAATTT 60

AATGCrACAT AACGrTaATA TAAAATATCA ATAGTGAACA TTTGAGCAAA TAATGAAGTT 120 

GTTGCTCCCA TATGCATCTC ATTTTCATCA GTTTTCCCAT AAGTTAAAAC AATATTTGAT 180 

GCCTGTGCTA CGGGATTATC CCTTGTACTA OTAATTGTAA TTATAGGTAT ATGGTAGTCA 240

TCAATAACTT TAACCATTGA TTGCATTTCA CTTTGCGTAC CATTGTTAGT AATAAGAATA 300 

aCACTGTCGT TCGAATTGTG AGTTGCTAAT AATGTAGCAA AAATATGTGT TTCTTGAACA 360 


AGTTGAATAT TAAGACCTAT TCTTGATAAC TTTTGGTATA AGTCGGTAGC AACTACAAAA 420 

GATGCGCCAA AACCATATAT AAAAATCGTC TCAGAACGTT TTAAACAATG ACATATTTGA 480 

TCAATAGTTT TATCATTTAA TTCGTTATTT GCATGATTAA GTGCGCGTGT AGTACGTGTA 540 


TGGAGTTTAG TTCTTAAAGA TTCTGTGCTT TCGTTATTCA TTAATTCAAC ATTGTAAATT 600 

GATGATGCTT TAGGAACATA TTTAGATATA TTTATTITCA AGTCGTGAAA ACCGCCATCA 660 

GTAAT 665
(2) INFORMATION FOR SEQ ID NO: 625: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2549 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 625: 
GATGCCAATT AACCGTGCAT ATAATGTTGA GAAGTTAATC GAAGCAATTC AATATTATCA 60 
AGAAAAAACA AATCGTCGTG TTACTTTTGA ATATGGTCTG TTTGGTGGTG TGAATGACCA 120
ACTAGAACAT GCAAGAGAAT TAGCACATTT AwTAAAAGGC TTAAACTGCC ATGTTAACTT 180 
AATTCCtGTC AACCATGTTC CAGAAAGAAA TTATGTGAAA ACGGCTAAAA ATGATATCTT 240 

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TTCGGATATT 


GACGCAGCTT GTGGTCAATT 


AAGAGCAAAG GAACGACAAG 


TAGAAACGAG 


360 




GTAAAGACAA ATGCTAGACX; CACAATTTTT TACTGATACT GGACAACATA GAGATAAGAA 420
TGAAGATGCG GGTGGTATTT TTTATAATCA AACTAATCAA CAACTTTTAG TTCTGTGTGA 480
TGGTATGGGT GGCCATAAAG CAGGAGAAGT TGCAAGTAAA TTTGTTACAG ATGAGTTGAA 540
ATCyCGTTTT GAAGCGGAAA ATCTTATAGA ACAACATCAA GCTGAAAATT GGTTGCGTAA 600
TAATATAAAA GATATAAATT TTCAGTTATA TCACTATGCA CAAGAAAATG CAGAATATAA 660




AGuTATVisjO 1 


ACAACATGTG 


TTTGTGCACT 


TGITTTTGAA 


AAATCAGTTG 


TGATAGCAAA 


720 


TGTCGGTGAT TCTAGAGCCT ATGTTATTAA TAGTCGACAA ATTGAACAAA TTACTAGTGA 780
TCACTCATTT GTTAATCATC TTGTTTTAAC GGGTCAAATT ACGCCGGAAG AAGCATTTAC 840
ACATCCACAA CGTAATATTA TTACGAAGGT GATGGGCACA GATAAACGTG TGAGTCCAGA 900
TTTGTTTATT AAGCGATTAA ATTTTTATGA TTATTTATTA TTAAATTCAG ATGGATTAAC 960
TGATTATGTT AAAGACAATG AAATTAAGCG TTTGTTAGTA AAAGAAGGTA CAATAGAAGA 1020
TCATGGTGAT CAATTAATGC AATTGGCATT AGATAACCAT TCGAAAGATA ACGTTACTTT 1080


25 


CATALTTCoCIj 


GCTATTGAAG 


GTGATAAAGT 


ATGATAGGTA 


AAATAATAAA 


TGAACGATAT 


1140 




AAAATTGTAG 


ATAAGCTTGG 


CGGCGGTGGC 




TTTATCTTGC 


TGAAGATACG 


1200 


30 


aIa^vI lnAW4 


TTAAAGTTGC 


AATTAAGGCG 


ATTTTTATAC 


CACCTAGAGA 


AAAAGAAGAA 


1260 




GTTTTGAACG 


AGAAGTACAT 


AAU i \^Pk 1 vJAC 


AGCTATCACA 


TCAAAATATA 


1320 






TCGATGTTGA 


TGAAGAAGAT 


GACTGTTACT 


ACTTAGTAAT 


GGAATATATC 


1380 


35 




CTTTGTCTGA 


GTATATTGAA 


ALt X CAl vjoVsl. 


CATTAAGTGT 


TGACACAGCG 


1440 




ATT A ATTTT A 

AX A AAli X X X A 


CGAATCAAAT 


ATTGGATGGC 


a TT A A A <^ ATfi 


CGCATGATAT 


GCGTATTGTA 


1500 




CATAGAGATA TTAAGCCACA AAATATATTA ATTGACAGCA ATAAAACGTT GAAAATATTT 1560
GATTTTCGAA TTGCTAAAGC TTTAAGTGAG ACGTCTTTAA CTCAGACTAA TCATGTGTTA 1620
GGTACTGTGC AGTACTTTTC GCCAGAACAA GCAAAAGGTG AGGCAACGGA TGAATGTACA 1680
GATATTTATT CTATAGGTAT kGTGTTATAT GAAaTGCTTG TTGGTGAACC ACCCTTTAAT 1740
GGAGAAACTG GAGTTAGCAT TGCGATTAAA CATATTCAGG ATTCTGTGCC AAATGTGACA 1800
ACAGATGTAC GTAAGGATAT TCCGCAATCT TTAAGTAATG TCATTTTACG CGCTACAGAA 1860
AAAGACnAAG CGAATCGTTA CAAAACAATT CAAGAAATGA AAGATGATTT GAGTAGTGTT 1920
TTACATGAAA ATCGAGCGAA TGAAGATGTC TATGAACTCG ATAAAATGAA AACGATAGCG 1980
GTACCTTTGA AAAAAGAAGA TCTAGCAAAG CATATTAGTG AACATAAGTC GAATCAACCT 2040
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10 






AGCCAGAAGG TACGGTGTAC GAACCAAAAC CTAAAAAGAA ATGAACACGA AAGATTGTGC 2160 

TCTTATCACT AATCTTTTCG TTGTTAATGA TTGCACTTGT TTCTTTTGTG GCAATGGCAA 2220 

TGTTTGGTAA TAAATACX5AA GAGaCACCTG ATGTAATCGG GAAATCTGTA AAAGAAGCAG 2280 

AGCAAATATT CAATAAAAAC AACCTGAAAT TGGGTAAAAT TTCTAGAAGT TATAGTGATA 2340 

AATATCCTGA AAATGAAATT ATTAAGACAA CTCCTAATAC TGGTGAACGT GTTGAACGTG 2400 

GTGACAGTGT TGATGTTGTT ATATCAAAGG GCCCTGAAAA GGTTAAAATG CCAAATGTCA 2460 

TTGGTTTACC TAAGGAGGAA GCCTTGCAGA AATTAAAATC GTTAGGTCTT AAAGATGTTA 2520 

CGATTGAAAA AGTATATAAT AATCAAGCG 2549 
(2) INFORMATION FOR SEQ ID NO: 626: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2286 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 626:

TGCTTACTTC GCCTTCAATA CGTACTAATT CATGTCCACA ACTTGGACAA TGGGTTGGCA 60 

TATGATATGT GACAGCATCC TCAGGTCTAC GTTCTGGAAT ACTACGTACA ACTTCAGGTA 120 

TGATGTCACC TGCTTTTTTC ACTACAACAC TATCACCAAT TCGAATATCT CTGTCATGAA 180 

TTAAATCCTC ATTGTGCAAA GATGCTCTTG aTACAGTTGT ACCAGCTACT TTTACTGGTT 240 

CTAAAATAGC AGTAGGTGTG ACTACACCTG TTCGTCCAAT ACTTAATTCA ATATCTAATA 300 

ATTTAGTTAC TACTTCCTCA GCTGGAAATT TATAAGCAAT GGCCCATCTA GGAGATTTTT 360 

GTGTGAATCC CATCTCATCC TGTTGATCTA AATCATTAAC CTTAATAACA ATCCCATCAA 420 

TATCATAAGG TAATGACTCT CTTTGGCTTG TCCATTTTTC AATATACTCT AAAACACCAT 480 

CGATATTATT TACACGCGCT CTATTTTTAT TCGTTGTAAA ACCTAATTTA TCTAACTCAT 540 

CTAATGCTTC ACTTTGCGAA CGCGCATTGA AATCAGTGAA ATCATTGACA CTATATATAA 600 

ATACGCTTAG CTTTCGTTTT GCCGTTAATT TAGAATCTAA CTGTCTTAAT GATCCCGCAG 660 

CAGCGTTTCT TGGATTnGCA AATAACTGCT CATCATTTTT TTCTTTTTCT TCATTTAATC 720 

GTAAAAATGA ACGTCTCGGC ATATATGCTT CACCACGAAC TTCTACATTT AATGGtTCTT 780 

TCATTTTCAA AGGTATCGCA TGAATTGTTT TTaAATTTTC GGTAATATCT TCACCTGTTG 840 

TTCCATCACC ACGTGTTAAA CCTTGaACGA AGTATCCATC AACATATTTC AATGATACTG 900 
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GTTGGTCGAA 


TTTTCTCAAA TCATCCTCAT TAAATGCATT CCCTAAACTT AACATTGGCG 1020

TGTCATGGTT GACTTTATTG AAAGAGGCTT GGGCTTCACC GCCAACTCTA ACTGTTGGAG 1080

AATCTACAGT CTTATACTCA GGATGCTCCT CTTCTATTTT AATCAGTTCA TGAAGTAATT 1140

TGTCATATTC ACTATCTGGT ACAGATGGAT TATCCTCTAC ATAGTATTCA TAACTGTATT 1200

GATTTAATAA ATCATGTAAC TCGTTCACAC GAGACGATAA ATCAGCCATC CCTTAATCCT 1260

CCTTTTTTtC AATTGGTGCA AATTGCGCTA ACAAACGTTT TGGCCCTTGT GATTTAAAGA 1320

TAATATCTAG TTCGATTGAG CCATTTTTCT CGTTTACATT ACTCACCATG CCTTCTCCCC 1380

AGGCTTTATG CATCACTTTG TCACCTACAT TCCAATCAGA TGACAATACT TGTTTTTTCG 1440

TTGACGTTGT TCGTTGACTA AATCCGC6TT TAGCAAAAGG TTTTGCCTTA GGTT6TATC6 1500

TTTGTCGTTT GCCACTTGAA TGATTTTCTA ATAGTGATTC TGGAATTTCC TTTAAAAATC 1560

TGGATGGCAT ATTTGACTGA GGGCGACCAA ATAACATTCT TGATGTCGCA TGAGTGATAT 1620

ATAACACCTC TTCAGCCCTT GTAATTGCTA CATAACAAAT ACGACGTTCT TCTTGCATTT 1680

CATGATCATC TTCACTCTTA ATCGCTCTAA TATGTGGGAA TAAAGATTCT TCCATCCCCA 1740

TTATAAAGAC AATTGGAAAT TCAAGGCCCT TAGCCGAGTG CATCGTCATT AGTGTTACGC 1800

CATTTTCAGT ATCTGCCTCA TCAATATCAG CTACTAAC6A TAAATCCGTT AAAAAGTTAA 1860

TTAATGACTG TTCTTCTAAT GGGGTATTTT CCTCATAGTC TTTTGGTACT GACATAAATT 1920

CATCGATGTT TTCTAATCTA CTTCGAGATT CTAATGTATT TTCACGTTCA AGCATTTCTC 1980

GATAGCCAGA CTTTTGTAAT ACTTCATCAA CTATTTCATG AATTTCTAAA AATTCTTGTT 2040

CTTTTATCAA GCTTTGGATT AACTCGTAAA AATTAAGACA CTCTTGTGTC ACCTTTTTTG 2100

aCAATCCGAT AAAATCAGCT TCTCCAAGTG CATCAAACAT ACTGATATTG TTTTGAAGTG 2160

CATAGTTTTG AACTTTTTCA ACAGATGAAG GACCTACACC TCTTTTTGGA ACATTaATAA 2220

TACGTTGCAA ACTAATGTCA TCATTACTAT TGGCAATTAT ACGCAAATAA CTTAATAAAT 2280

CTTTGA 2286 
(2) INFORMATION FOR SEQ ID NO: 627: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 627: 

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TGACATTAAA AGTTGGGTGA AnAAACAAGG CATACTTTAG TTAAGCTTGA TGAAAATAAC 120 

AATGGAATTA ATGCGATTAT TCAAAAAGAA AAAGCAAAAG ATTTAGATAT AAATTATTCT 180 

GCTAAAGGTA CTACCAATTG TATTATTTAG TGGAGAATTA GACAAGCTGT AGCAGCGTTG 240

ATTATTGCCA ATGGTGCTAG AGCTGCTGGA AAAGATGTAA CTACCTCCTT ACTTTTTGGG 300 

GGCTTATGCC nTTAAAAAAG TGCCACCGTT AATGTTAAAA GCAGTTGCCA AAATGTTTGA 360 


TTnATGTTGC CCCAAAAGAT TTACGATGCC CCTTCCCAAA 400 
(2) INFORMATION FOR SEQ ID NO: 628: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 453 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 628: 
AATAATTTGG GCGCTnTTTG CGTCGGGATA TTATACCGCT TCCTTAATTG TTCAACATTG 60 
TAATCACTGT TTTTCAATTG ATATTTTGCA GAGTAAATTG GTACTTCTGG GTTATATGAC 120 
ACTTCGTCCT CTTTATAGTT TTCCAATTCT TTGAAATTCC CGTATTGTAC AAAGAAGTTA 180 
AATTCTTCGA TTTCTTTTTT TACTTTTTCG TCATCGATTG GTTTTAATGG AATCATTTTA 240 


TTAkTTTCCA TTTTCACAGG ATATCTTTTT GTATGATTGT GTGTCATTCC ATCGCTATCT 300 
TCAACAACTT CTCTAACAAT ATAATGCCCT TTAGCCGTTC TAGTATTTCT GTTAATTTCT 360 
AAAACTGCTC CTCTrGATTC CAGATTTTCT CCTTTTAATT GGATTTTCAT TTCAGATCTA 420 
ATTAGcCAAG TACCTTTATC ATCTtTTTTA AAT 453
(2) INFORMATION FOR SEQ ID NO: 629:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1221 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 629: 
TCATATGCAT TTGCAAAATA AACGCCAGAA GCAAGGTTTA GAATTGGGCC GTCCGTTTTG 60 


CTCAaTTCAC TTGcATTCAA TAATTCATGC TGATCATGAT CAATTTGCTT ATCTAATTCT 120 
GCaATTTTCT TCATTTGCTT ATCTGATTTG TTTTCTTTCG CCATCATTTG ATCACGACGA 180 

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GCTAATGGTA CTAATACTTT ATCTCCATAT GTGTCAATAA AGTTATAAAA ATAATCATCT 300 

GTTTTTGATA CAAATCCAGC ACGCTCTTCA GTTTCACGAT ATAAATCTAA GAAAAGATTG 360 
AACTCATCAC GTTCAAGGAA TCTGACTTTA ACACCATAGT TTATOGCTTT ATTAATATTA 420

CGTTTACGTT GACTATCAAA TGTCTTTTTC AATGTTTCGG GTGTTTTACC TTCAAGGTTT 480 

AATACGCCCA TCCATCGTAC TTGGCTCGAT GTATCATACT CAGTTGTAAA GCCATGATGC 540 


TCGTAACCAT GTGATTTAAA CAAGTTTACT AGGGCATCAT TTTTCTCGCG ACCTTCAAAT 600 

GGCACGATAT CTTTATCATA TAGATGATAT AACCAATACG GATCTAATTT AACATATAAA 660 

CATTGATGTT GCTGTAAATA TTTATCTAAC TCTTTTAAAT AATAATCAAC TAATCCTAAA 720

TCTGAAAAAT CCATTACTGG ACCACGATTC GAATAGTAAA CATAACTTCC CATAGTAGGA 780 

ATTTTAGAGA AAAGGCTTGC TGCAATTACT TTGTTATTGT CGTCTTTAAT ACCTAATAAA 840 

ACTACTTCAA AGCCATCATT CTCACGGGTA ACTATATTTT CTTTTACTTG GAAATAATGA 900 

CTTTCCAATG ATGGATTTTG TACAAAGTTG TCAAATTCGG TAACAGTTAA CTCTGTAAAT 960 

TTCATGTTTT GATAATTCCt TCCTAAAAAA TTCTGTCTTT AACTTTTTTA AGTGCGGTAT 1020 


ATGCTGCGTA AACAGGTTTA TTAATTGGTT TAATAAAGTC ACCAACATAT TCmATAATTT 1080 

CAGCATTGTA ACCTTTTTTG AATTTAACTA CACCAGCATC TTCAGCATCT TCTGTAAATT 1140 

TrCCACTAAC ACCATAGAAA TTATAACGGT CAATGCATGA TTTAATGCAT AATTAATCAT 1200 


TTCCCATTGC ACTGCATAAC T 1221 
(2) INFORMATION FOR SEQ ID NO: 630: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1121 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 630: 
TGGCCCAAnT AACACAAGTA ATTGCTGCTA ATGGTAACAT CACAAAAAAT GAAATCGTAA 60 


CTACAAATGT TAAACCTTGG AATACACCAA CCATTTCTGG TAATCGTTTA CTATAGTATC 120 
TATTGTGAAT CCAAGTAATA ATAGCTGAAA TAATAATACC ACCTAGAATA TTCGTATCCA 180 
ATGTGGCAAT ACCTGCAATT GATTTTAAAC CAGGTACATT TTCAACGCCT TTTTCTAAAT 240 


TAGCGCCAAA CGTATGTGGC CATTGTGTTA AAATGGCATT TATAAATGTA TTAAACATTA 300 
AGTAACCCAT CAATGCTGCA AGTGCTGCAT GACCTGGTGC TTTTTTAGCT AAAGAAAGTG 360 

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TTACTGACCA AAATTTAAAC CAAAACGTAT GTTGATCTGC TAAACTCCCC ATGATTGTAG 480 

GATTTTTAAA TAAT6TCGCA AAGCCAAGCA CTATCCCAAA GAAAGCGAAC ATTAATACCG 540 

GTACAATCAT TGCACTACCG AAACGCTTTA TCGCATTCAT CTTCTATTCC CTCCATATCA 600 

TCTTTCCTAA CAATACATCT AATTAGATTC ATTTATAAAT AGATGTCTTA CTATTTAAAT 660 

ATAATATATA GTAAACGCTT ACACACCTAC AACGACATTG ACGTATTTTG AAAGTATTTT 720 

GTATAATCAG ATTATCTTTT CATATAGTGA AAATTTTTTC ACGACCTTAT ATATGACATC 780 

GTTGTATTTG TAATACATTC GTTTTAAACG CATAATCAAA CCTATATCAA TACACAAATA 840 

TATATAATGA CATACAAGAT TTTAATGTAA TAACGATCTA TTACACATTT ATTTTCAAGG 900 

AGGTTGAATA TGTTTTTAGA TGAACACATT AATCGAAACT TTGATAAACT TAATGATAAT 960 

GATTTACATA TCGCTCACTT TATCAATACA CATATAGATG AATGTAAAAA TATGAAAATA 1020 

CAAGATTTAG CGCAATTCAC ACATGCCTCG AATGCAACCA TTCACAGATT TACACGCAAA 1080

TnAGGrriTG ACGGTnATAG TGGATTTAAA TCGTACCTTA A 1121 
(2) INFORMATION FOR SEQ ID NO: 631: 






(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4005 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 631: 

AACCTTCCAT TTTACTTGAT CGATAACATC AGTTCTGCTT TACAAATCTC TTCATTAATA 60 

TGCGTCTTAA AACCATGAAA TTTAACATGT TCCGATAAAT GATAATCTTC TACAAGTTGT 120 

CGATATTCTG ACAAACCATT TCCATGTCCA TAAATATTCA ATTGAATATT GGGATGTTTT 180 

GTTACTAATT GCTTGATTAC TTCAATTTGA TGTTTAATTT GTTTATTTTC AACGAGGCGA 240

GCAATTGATA TGATATGATT TTTCTCCTTT TGATTGATGT CAAATTGATA CTTTAAATTT 300 

GCCACGTAGC CAACCGGAAT ATTGATAACT GGTATTTTAT TTTCAATATA TTGTGAAATA 360 

TCTTGGCATT GCTTTTCTGT TGATACAACA ATCGCTTTAT AACGTGTTAA ATTATTAAAC 420 

ACTGTTTTAT AAAAACTTTT TATACCATTA CCGGCACCGG ATAAATGTGT ACTATGGAGC 480 

ACAACAATAA CTGGAATACT TTGATTTAAT CCCGCTATAA CATTTCCTAA TTCATGAGGA 540 

CGATCTAATA TGATTTGATC ATTATTTTTA CATAATTGAT GGAGAAAATA TTGAACTAAT 600 

TCATCTTCTG TATCAAAAAA TTGTTGATGC TGGTCTTCAT TTAAGATAAC CTTTGTGAGC 660 


TAGTAATTTT CGAGTACAAT CCGTTGTCCT TCACCTAAAA TTCGAGAACA ACTTAAAAAG 780 

CCTCTTCCAT CATACAATTC GCGTTTTACT TTTCTTCTTT TATGATCAAA ATAATTCACA 840 

TAATTTAATT GATGATACTG TTTATCTAAA AAATGAGCAT ACATTACAAA TTGCTCTTCA 900 

TCATATATTC TGACATCATT TGAATTTTCC ACAAATTTCA ATGTGTACCT ACATGACTTT 960 

TCCCAATACT GTATCCAGTT AACTTGCTTT GTCTTTTTAT AATTGATTGC TTTTTGAAAA 1020 

TAGTCATACA TTGTAAATAC ATCATTTTCA ATCTGATGTT GCTTCGCATA TGTGTATGAA 1080 

TAAGGATTCC ATTTAACATA TACACATTTT GAAGATATGC CGTGTTGTTT GAACAACTTC 1140 

AATCTATTTA TTTGCXSCTTT TTCTACACCT GTAATTTTAC TTTCTAAAAT TGTTCCTAAA 1200 

ATGTAATTCA TATTATCGCC TCATATAAGT TTTATTCCGT ATCTTTATTG TTTATTTrAT 1260 

ATGAAAAATA CATCTATTGC ATGTGTAATT ATAAAAAAAC CAGGCCACAA GGACCTGGGT 1320 

CATATTGTAT TATTTGTTTT GTTTTTTGCG ACGACCGAAT AACAATAATG AACCTAATGC 1380

TGCAAATAAT CCACCAAATA ACGTTGCGTT ATTTGAGCOG TTATTTTCAC TACCTGTTTC 1440 

TGGTAATGCT TTTGCTTTAT TGTGATGGTC TTTAGTAGTA CTCATTGGTT TAACAGGTGT 1500 

ATGTTTTCCT GCATCCGAGT CTGAATCGCT GTCTGAATCA CTGTCTGAGT CTGAGTCGCT 1560 

ATCAGAGTCT GAGTCGCTGT CCGAATCTGA GTCGCTATCT GAGTCTGAGT CGCTGTCTGA 1620

ATCTGAATCA CTGTCTGAGT CTGAGTCGCT ATCTGAGTCT GAATCGCTGT CTGAATCTGA 1680 

GTCGCTATCT GAGTCTGAAT CGCTGTCTGA ATCTGAGTCG CTATCTGAGT CCGAATCGCT 1740

ATCTGAATCT GAGTCGCTGT CTGAGTCTGA GTCGCTATCT GAGTCTGAAT CGCTGTCTGA 1800 

GTCCGAATCG CTATCTGAAT CTGAGTCGCT GTCTGAGTCT GAATCGCTAT CTGAATCTGA 1860

GTCGCTATCT GAGTCTGAAT CGCTGTCCGA ATCTGAGTCG CTATCTGAAT CTGAGTCGCT 1920 

GTCTGAATCT GAATCACTGT CTGAGTCTGA 6TCGCTGTCT GAGTCTGAGT CGCTGTCTGA 1980 

GTCACTATCT GAGTCTGAAT CGCTGTCTGA TGTATCTTCT TCGAAGTATC CGTTATCAAG 2040

TGTGAAATCA TCATGATCCG TAATTGTTAC GTCAACTTOG CCACCATCTG CATCTTTATC 2100 

ATCTTCAGTT GTATTTGTAA CTGTTTGTGT TAAGCCAGCA GGCTTTTCAA AAATAACTTT 2160 

GTATTTACCG CTATCTAAAT TATCAAAGCA GTATTTACCA TTTTCATCTG TTTTAGTTGT 2220 

TCCAATTACT TCGCCTTTTT CATTTAATAA AGTAACTTTA ACATCTTTGA TACCTTTTTC 2280 

AGTTGAATCT TGTTTGCCGT CTTTATTACT GTCGTACCAA ACATAATCAC CTAAACTATA 2340 

TTTTGGTGTT TTATAGAAAC CACTGTCTAA TGTCATGTTA TCTGCATCTT TAATGACACC 2400 

TGTTGTTGTT AAACCATTAG AATCTTTTTC AGTATCATTT CCAGAAGTTA CTGAAGTTGG 2460 

TTGATATTTA CCATTTTCAT CTGTTGTAAC TGTTTTTAAA ACTTTGTCGT TTTCATCTTT 2580

TAACGTAACT GTTACACCTG AAATGCCCTT TTCATCTTTA TCTTGAACAC CGTTTTTATT 2640

TGTATCTTCC CATACATAGT CACCTAAGTT GTAAGTCGGT TTGTAGAAAC CAGAGTCAAT 2700

AGTATCGTTA TCTTTATCTT TAATGACACC TGTTGTTGAT GTACCATTTG AATCTATACC 2760

TTCATCAGTT CCTGAACCTA CTTGTGTTGG TGTGTAACCT GATGGTGTTT CGAATTCAAC 2820

TTTATAAGTT CCATTTTCTA ATCCAGTAAA TTGATATTTA CCATCTTTAT CTGTTTTAGT 2880

TGTTTGTAAA ACTTCACCGT TTTCATTTTT CAATGTAACT GTTACGCCTG AAATACCTTT 2940

TTCAGTTGAA TCCTGCTTAC CATCTTTATT TGTATCTTCC CATACATAAT TACCTAAATT 3000

ATATTTTGGT GTTTTGTAGA ATCCACTATC TAATGTCATG TTATCAGCAC CATTAATAAC 3060

ACCTGTTGTT GTTAAACCAT TAGAGTCTTT TTCAATGTCG CTACCAGATG TTACTGTAGT 3120

CGGTGTATAG CCTTCTGGTG TAGTAAATTC AACTTTATAA TTACCATTAT CTAAATCAGT 3180

AAATTTATAT TTGCCATCAG CGTCTGTTGT AACTGTTTTT AaCAgTTACC GTTTTCATCT 3240

TTTAATGTTA CCGTTACGCC AGATATACCT TTTTCATCTT GGTCTTGGAT ACCATTTTTA 3300

TTTGTATCTT CCCAGACATA GTCACCTAAG TTGTATTTAG GTTTGTAAAT ACCTAAGTCT 3360

GCAGATAAGT TATCTTTGCC ATTAACTGTA AT7UVCTGAAG ATAAGCCGTT TGAATCTAAT 3420

TCTTCGTTAT TACCTTGTTT TGAAGGGGTT ACTTCATAAC CTTTTGGTAA GTTTGAAAAT 3480

TCTACACGGT AATCTCCATT AGGTAAGTTT GGAATCAAGT ATGACCCATC TTCTTTAGTA 3540

ACTGCTTCTC CTACTTTTGT ATTTGTATTA TTATCAAATA CAGTTACAGT TACATTGCCA 3600

ACGCCTTTTT CTCCTAATTC TTGAACACCQ TTTTTATTAG TATCTTCCCA TACGTAGTTA 3660

CCAATTTTAT ATACTTCTTG ACCAGctCCG CCACTTTGGT TATTAGTAAA TCCTAAAGCA 3720

TTGCCAGTAG AAACGGATTT ATTACCTGTT GAAGATAAAG TAGCCATTTG AACAAGTGTT 3780

GGGCTTTCGC TATTTGTATA TTGGAATTTT GTATTAACCA TTACAACATA AGCAGAATCT 3840

GCATTTCCAA AATCAATAAC AGCGCTATTG TtGTCGCCAT ATGTAATTTT CTGCAAGTAT 3900

TGATTTGTTA CATCTGTAAG CTCTTTAGTA TTCACATCGT ATCCTTTATT TAATGTATAA 3960

CCTTTAGGAA CTTGATATAT TTTTATATCT GTTACATCTT TATTT 4005



(2) INFORMATION FOR SEQ ID NO: 632: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1440 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 632: 

TATTAGGTTA CTCTAGTTTC CAAAGCGGGA ATTTTAATGT TATTAACAGC AAGGACAGCA 60 

AAAGCAATAT CGGCGCATTG ATTGAAAATC CAGGAATATA TCCTTTTATG TCTGGATATG 120 

AAAACTTGAA GTTATTGAAT GAATCAAAAA ACACTCAAGA TATCGATAAA ATTGTCTCAC 180 

AACTTCATAT GGATGAATAC ATTCATAAAA AAGCTAAAAC GTATTCTCTT GGTATGAAAC 240 

AAAAATTAGG AATTGCTATA GCATTTTTAA ATAAACCTCA ATTCATTATC TTAGATGAAC 300 

CAATGAATGG CTTAGATCCA AAAGCTGTGC GAGATGTACG TGAATTGATT GTCCAAAAAG 360 

CGCAAGAAGG TGTTACTTTC TTAATTTCGA GTCATATTTT AAGTGAATTA GTTAAAATCA 420 

CAAACTCTAT CCTTATTATT AACAAAGGTA AAATTGTTAC AGAAACATCG GAAOAAGAAC 480 

TTAAACAATT TAAAGATAAT GATTTAGAAA ATGTATTACT AGAAATCATA GAAAGGGAGG 540 

ACCAAGCATA AAATGGGAAC TTTAATTAAA CAAGAATGTT TCAAATTATT TAAAAAGAAA 600 

TCAACTTTTA TCGCACCTAT TGTCTTTATT CTACTAATGG TTGCTCAAGG TTATATTGCT 660 

ACAAAATACA ATGAAATTTT TACGCCACAG GAATCTTTCA CATCTGCTTA TAATGGTTTT 720 

TCATGGTTTG CATTTTTATT AATTATTCAA GCAAGTACAA TCATTTCAAT GGAATTTCAT 780 

TACGGTACGA TTAAAAATTT ACTCTATCGT GAATATTCAA GAACAACTAT GATTGTTAGC 840 

AAAATCATCA CATTATTTAT TATTTCTTTA ATTTATTTTG TTATTACAAT TATTGCTTCA 900 

ATTGTTATTG GGTCTTTATT CTTTAATGAT TTAAATATAT TTGAAAGTAG CGGTAATCAA 960 

TTATCTTTAT TGAATCAATT ATTATTAGTT AGTTTAGGCA CATTTGTTGG CGTTTGGTTA 1020 

GTTTTAAGCT TAACGTTGCT ATTATCATCT GCAACAAATT CAACGGGAGT AGCCATTGCT 1080 

GTAOGTATTG TTTTTTATTT TGCAAGTTCT ATTTTAGCAG TTATTCAAAC GGCACTTTTA 1140

GAAAAAATAG ACTGGCTAAA GTGGAATCCT ATTAATATGA TGAATATTAT GCTTCAAACA 1200 

GTTGAAAAAG GCTTTAGTAA GTCGACAAAA TTAGAACTTC ATGAATTGTT TATTGGTAAT 1260

ATTGCTTATA TTTCTATTTT CTTAATACTT GTAGTATTTA TTTTCAAGAA GAAAAATATT 1320 

TAGTAACTTA AAGTATTAAA TGTCTAAATA CACACATATT CCATCGTAAT TCAAAATCAT 1380 


TTTCAAATCC CTTCACCCAA ATAATGGTGC GGGGATTTTT TCATCCAAAT TTTGGAAATT 1440 

(2) INFORMATION FOR SEQ ID NO: 633: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1323 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 633: 



GCTGACATAA TTGCATCAAA TTTCACATCC CCATAAAAAT CGCTACCACA TAACCTACGA 60

TAATACCTAC AAGAACTGGA ATTAAAGATA GGAATCCTTT AAAAAATCCT TGAACGACTA 120

TTGTTACAAG CAAGGTTATC ATTGCAACAA TTAAGAAACT GATATTGTAA CCTTTCATAT 180

CTCCAGGATT TTCATACATT GCCATATTGA CTGCAGTAGG CGCTAAGCTT AAACCAATTA 240

CCATGATGAC TGGTCCAACA ACAACTGGTG GTAATAATTT CATTAACCAT GCTGTCCCAC 300

TTAATTTGAT TAGAATCCCG ATGATGACGT ACATAACACC ACTCATGaAT AATGCTACAA 360

GCATGTCTCC TAAGCTATGC GTACTTAATC CCGTGATAAT TGGCGTGATA AATGCAAAGC 420

TAGATCCCAA GTATGCTGGT ATTTGCGCCT TCGTTATTAA GATATAAAGT AATGTACCGA 480

TTCCCGAAGC TAGTAACGCT GCTGATATTG GTAGTCCTGT TAAGAATGGT ACTAGTACTG 540

TTGCGCCAAA CATCGCAAAT AAATGTTGTA AGCTTAAAAA TGCCCATTGC GCTGGTTGTG 600

GTTTTTCATT TACATCTAGT ACGGGTTTTA CTGTTCGTTC AAACATTTCA TCATTTTGCA 660

TAATATTCAT TTCCTCCGAT AATAAAAAAA TCTCTTTACA TCAGTATATG TAAAGAGACA 720

AAAAGTGTGA CAAGTTGCTA CAAGTCATTT TOGTCCATAG AAATTGACTT ATAGTTGTCG 780

AACATGAGGG TATTATTAGA TAAACAAGCA TATGAAAACT TATTTATCAT TCAACTCCCC 840

CACCTTTTTC AGTCTCTCGT ACTGAATTAA AAGGGGtATT ATTTAATTAT AACTGCATTT 900

CTTTGATCcA TTtCTTCyAA ATAGACACTT ACCGTTTCCT CTTTAGAAGT AGGTAwATTT 960

TTACCAACAA AATCTGCTCG AATTGGTAAC TCACGATGTC CTCGATCAAC CAAAGCAGCT 1020

AAACCAATTT TAATAGGTCT AGCATTTAGC AAAATAGCAT CAAGTGAAGC ACGAACCGTT 1080

CGACCAGTAT ACAGCACATC GTCAATAATG ATGACTACTT TATCTGTAAT ATCTGTGTCG 1140

ATGTCTATTG CGTCTTTTGT CGTAAGTGAT GACATGTGcT CTATATCATC TCTAAAGTAT 1200

GTAATATCAA TTGTTCCAGT AGGTATACGT TGTTGCTCAA TTTGATGAAT TTtATCTTGT 1260

ATACGATTCG CTAAATATTC ACCTCTTGTn TTGATACCTA AAAGATTAAA TTATCAGTAC 1320

CTT 1323



(2) INFORMATION FOR SEQ ID NO: 634:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 761 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
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10 






ACCCATCTCA TCGTATTTTG AATTTAATAG ACGAGATCGA TGTATATCTG AATTCATCCC 60 

AACTATGGAT TAATGTTGGT ACATCATtAA cGCATAACCA ACATTTTGAG CAGTTGTTTT 120 

ATAAGTAACG TGATTTTTAT CTAATTGCCC TCTTAATGCG TCCTCTGTAA ATTCAACACT 180

ATCAGAACCA TTAGAGGTCG CTTCATATAA GTTATTAGAT GCAATATGTG CTAAATCGCT 240 

ATTGATTTTC AATGGTTTTA ATCCTTTTAA TTTTCTCATT TCATTCGTTA CTTCATAAAG 300 

AGAAATTAAT TGATTTGGAT TTTGCTCAAC TGGACGCTTA TTATGCTCTT CTGACGTAGA 360 

ATTAGAATTT AATTGATAAG GTTCAATATC TGCTAACATT TCTTTTGTTA AAAATCGTAC 420 

ACTTAGCACC TTTTTCGATT GTTGATCAGA ATACACTTGT GCATATATGT CGCCATATTT 480 

AATCaGTGTT TGTGTTTTTA AATCTTCATC TGAAAGTTCA AATTCATATT TTTTACCATC 540 

AACTTTAAAG GACGGTTCTG GATTAATACT TGTATGATTA AAAATTTCTG CAGAATGTTG 600 

TCCTATTTTT AACGGACTAA CATTGACTTT CTCACCTGTA GCATACACTG AAACGATTTC 660

TTCACGTTTA GTTGAAACAA TGTAATAACT GTTTTTGTCT TTAAACACAT AATTrTTGTA 720 

rCCATCTCTA AAAGGGTAGA CrCGATCTGC TTGTCCAAAT T 761 
(2) INFORMATION FOR SEQ ID NO: 635: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 827 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 635: 

nTAAATATAT TTATATATTA TAGAATAGAA AGACCTGAAG ATTGAATATC TTTCGCAAAG 60 

CCTTTAACTG TATCTACTGA TAATTCGTTA ATATCGCGAC CTAAGTTTGT ATTCACTTTT 120 

TTCACAACAT CTGCTGGGCA TGTAATAATA TCTGCACCAA TTTCATCAGC TTGAATCACA 180 

TTGAATAATT CGCGGCAACT TGCCCATAAT AATTTAACGC CGTCTTTACT ATGCGTAACT 240 

TTnACAGCCT CTkTCATTAA TGGTaATGGA TCTACGCCTG TAtCTGCAAT ACGTCCTGCA 300 

AATACTGAAA CATATGTTGG CACACCTTCA GTTACTGCTT CAGTTATTTC TTTAACTTGT 360

TCAATTGTGT AAACAGCCGT AACGTTTAAT CTCACATTGT CAGCTGAAAG TTTTTTAATT 420 

AAAGGAATCG TTGATTCACC TTTTGTATTT ACAATAGGAA TTTTAACAAA TACATTTTCG 480 


CCATATTGTT TTAAAATTGC TGCTTCTTTk TCCATAGTTT CTAAATCGTC TGCAAATACT 540 

TCAAATGArA TTGAAGCATC TGGAATTTCT TTCACAGCTT CTTCAGCAAA AGCTTTGTAA 600 

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TTTTTATAAG CTGCTTTCAT TnCTTCAATA TCTGCACCGk CCGCAAATAC TTCTACATTT 720 

AGTTTAGCCA TATAAyATAG CXTTCCTTGAT TCTTATTAAA ATTTTAACAA CATCTGCATG 780


kCTTTTTCTT ACAACCATTT GTAAAAAATG ATTTTTATTT CTTtGTT 827 

(2) INFORMATION FOR SEQ ID NO: 636:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1478 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 636: 

TTAGTCGGTA TAACCATCGG CTAGGTGGTT TTTGTATTAA AAAAGTGGAT aCCAAAATTT 60 

ATTAATAATT ATTTTAATGT TAGAAAAAAA CTAAATAAAA ACTCGCTAAT GATATCCAAT 120

AATATGTATA CAAAACGA6A CATATATTGC ATATGATTAA CGAGATACTG AAAATATTTT 180 

ATCACCCCTA AAATGATTAT TCATTTTCAG OGGTAATTCG ACCTAAAGTC AAACTTACAA 240 

TAAAACCGAT GATAAATACT ACTAATGAAA CGAACCACAT CACGATATTA GTTGGTAAAC 300 

CTGGAAATAC TGCAAAGAGG GAGCCAACAA CAAAACCAAT GATTAATGCA AAAGTCATTA 360 

GTTTATGATG TGTTAGGAAA TACTGGATAA TTTTGCTTGA AATAATGAAT CCAGCAAGCA 420 


CGCCAAATCC GACTGCAAGT AATATAGGAA GACCTGCAAA GTTAAGTTTA ACAACTTCAG 480

ATATTGCTAG CATGACCGTA CCATAGACGC CAAATACTAA TAACATAAAT GACCCTGAAA 540 

TACCTGGGAG TAACATAGCA CTAGATGCAC ACATACCTGC AATAAAATAT TTAATAATAA 600 

GACTAGTTGA TAGAGTAAGT GTTTCTCCAG CATGTTTATC ACCATTATTC ATTAATGTAA 660 

TAACAATTAA GATAGCGATA CCAGCTATAA CCATCATGTA ATGTyTAGTT GTAAATGACG 720 

TTTTATAGTT AGAAATTTTC AATAAATATG GAACGATACC AATGATTAAT CCACCAAAGA 780

AAAACATAGT TGGAATATGG TGTTGGCTTA ATAAATAATT AAAAAGATTA CTTAGTGATC 840 

CCATTGCCAG TAACATTCCA ATTATAATGG GGATTAAAAA TGTAAAACTT GGCCAAAAAC 900 

GTCGTGAGAA TATGCCGCTA ATTGAAGCGA TAAATTGATT GTAAATACCT AACAATAATG 960 

CGATAGTCCC ACCGCTAACA CCAGGTACCA AGTCACTCGT TCCCATAGCA AAACCTTTTA 1020 

GAATATTAAT CCATTTAAAT TGTTGCATGA ATAACTCCTT TCAAACGATT GGAATAAAAT 1080 


CATAAATAGC ATCATACCAT ATTACAAATG TCCTAGTGAA ATGATAACAT ATTTTAAATT 1140 

CATAAAATCC ATTGAGAAAT TATGTGCACT TATTATCATT TATATTTTTA AAGAGAGCGG 1200 
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AGGTATAAGT AAGTTATAAT TAACTGAACG CATTATTACA AAGTCTTTTT GACTACAAAT 1320 

TAAAATTATT ATAAACTAGT TAAGAAAACT TTATATTTTA CGGAGGGAAT ATAAAATGGC 1380 

ATCAACATTA GAAATyAAAG ACCTACATGT GTCTATTGAG GATAAAGAAA TCTTAAAAGG 1440 

TGTTAACTTG ACAATTAACA CTGATGAAAT ACATGCQA 1478 



(2) INFORMATION FOR SEQ ID NO: 637: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1995 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 637:
ATTACAGCAT CTTCTCTAGG TAGATTATTA AAAGATAGAG GTCTAAATGT AACAATTCAA 60
AAATTCGATC CATACTTAAA TGTTGACCCA GGTACAATGA GTCCTTATCA ACATGGTGAA 120 
GTATTCGTAA nGGATGATGG TGCAGAAACT GACCTAGACT TAGGACATTA CGAAAGATTT 180 
ATTGATATTA ATTTAAACAA GTTTTCAAAT GTGACAGCCG GTAAAGTGTA TTCACACGTA 240 
TTGAAAAAAG AACGTCGTGG TGATTACTTA GGCGGAACAG TTCAAGTTAT TCCGCATATT 300 
ACAAATGAAA TTAAAGAACG TTTATTACTT GCAGGGGAAA GTACGAATGC AGAOGTTGTT 360 


ATCACTGAAA TTGGCGGTAC AACAGGTGAT ATTGAGTCAT TACCGTTTAT TGAAGCGATT 420 
CGTCAAATTC GTAGCGATTT AGGTAGAGAA AATGTTATGT ATGTTCACTG TACATTACTG 480 
CCTTATATTA AAGCTGCTGG AGAAATGAAA ACGAAGCCAA CACAACATAG TGTTAAAGAA 540 


TTACGAGGCT TAGGTATTCA ACCAGACTTA ATCGTTGTAA GAACTGAATA TGAAATGACA 600 
CAAGATTTAA AAGATAAAAT TGCATTATTC TGTGACATTA ATAAAGAAAG TGTTATTGAA 660 

TGTCGTGATG CAGACTCTTT ATACGAAATT CCATTACAAT TAAGCCAACA AAATATGGAT 720
GATATCGTTA TTAAACGTTT ACAATTAAAC GCGAAATATG AAACACAGCT TGATGAATGG 780 
AAACAGTTGT TAGATATCGT TAATAATTTA GATGGT/^TVAA TTACAATTGG TTTAGTAGGT 840 

AAATATGTTA GCTTACAAGA TGCATATTTA TCAGTTGTTG AATCATTGAA ACATGCTGGA 900
TATCCTTTTG CCAAAGATAT TGACATTAGA TGGATTGATT CAAGTGAAGT AACAGATGAA 960 
AATGCAGCCG AATACCTTGC AGATGTCGAC GGTATTTTAG TACCAGGTGG ATTTGGTTTC 1020 


CGTGCAAGTG AAGGTAAAAT TAGTGCAATT AAGTATGCTA GAGAAAACAA TGTACCATTC 1080 
TTTGGTATTT GTTTAGGAAT GCAACTTGCA ACAGTTGAAT TTTCAAGAAA CGTATTAGGC 1140 




TTACCAGAAC AAAAAGATAT CGAAGATTTA GGTGGTACAT TACGCTTAGG CTTATATTCA 1260
TGTTCAATTA AAGAAGGCAC ATTGGCACAA GATGTTTATG GTAAAGCGGA AATTGAAGAA 1320
AGACATCGTC ATCGTTATGA ATTTAATAAT GACTATAGAG AACAATTAGA AGCAAATGGT 1380
ATGGTGATTT CTGGTACAAG tCCAGATGGA CGTTTAGTAG AAATGGTAGA GATTCCGACA 1440
AATGtTTCTT TATTGCTTGT CAATTCCACC CAGAATTCTT ATCTAGACCA AATCGTCCGC 1500
ACCCGATTTT TAAATCATTT ATTGAAGCTT CATTAAAATA TCAACAAAAT AAATAAATTT 1560
GCTAATAAAA CCGGTACTTT CATTGTTAAA CATTGAAAGT ACCGGTTTnT CGTATAATTT 1620
TAATATTATG TTAGTGACAA GGTATGAAAT AACAATAGTG ACTTTTATAA TTCTAAGTCT 1680
CTTGTCATTT CAATCATTTG TGTATAAATG TCATAGTATA CATAATTCAA TGCCATCGCA 1740
TGTGGTyQGA CAATCTTATC GTAATCTTCA GTGTAGACTA TAGGTcTTGG TGTAGATAAA 1800
TCGATAAAAT GTACGAGATG ATCAGGGAAA TCATCTGTTT TAGGTTTGTT GCTTATTAAG 1860
ACCACATCQA TATCTAAGTC GATAAGTTTT TGAATATCTA ATOCAACTTO ATyATTATAA 1920
AATGGTGCGA ATAATAATAC ACGATCAGTT 6AGTCAATTT CTTTAAwkTC TTTAATAGCG 1980
TaAGTTTnCG GCTAG 1995

(2) INFORMATION FOR SEQ ID NO: 638:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1107 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 638:
ATTAGTGATG AAAGTCAGAT TGAAGCTTIA TTAACAGCTG AAAAATATTC AGAAATGATT 60
GGTGAATAAT CACCGTGTAA CTCCTTAATC TAAGATTGAG GAGTTTATTT TTAGTCTGAG 120
AAAATAAATG ATATGAAAGA AAAATTATTA GGTACTATTA TTTGGAGTAT TGCTACATTT 180
TATTATTCAA GAATOATGGA AATAATGAAT TTAGCTATTT TAAAAATAAA AATTGGGGGA 240
AGTTAATATG CTAAACATTC AAGACGTTAn CATnCTTTCT AAAAAGGAGC AAAAAGCATA 300
TAACCGTTTC GTAGAATCTG TAGAAAACGG TAATTTACCA GTACTACCAT GTATTGAAAT 360
GGATCTAAAA GAGATGCAAG AAGAAACATT AAACCAGAGT AAGATTGGTG GAATGCCATT 420
TTTAAAATCT TTTAAAGATA TACCATTAGA TGAAAATAAT GTACCAATGG TATTGCTAGC 480
ACAGATTAAT TTGGATGATC TTCCAGAACA ACAAGAATTA TTTCCTGTAA AAGAAGGGAT 540
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AAAG 904 
(2) INFORMATION FOR SEQ ID NO: 640:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 436 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 640:

CGATGTCTTT ACTATTAGAC TTAGCCATTG GTTTCACCTC TCCAAAAATT GTAAATGTGT 60
ATCATCAATA TGAAAGTTAC ATAAAACTGA CATATTTCTT TAAAATATCA ACGCCATTGA 120 
TAACTTCCTG TTTTAATTGA TACGCTGTAA CAAAATACTA TAGTTAGTGC TTACATGTAT 180 

ATGTTAAAGC AAGCAGTGGT AAATGTAAAT TATAATTATT CATTAACTTT GCAATATATT 240
AAATCTTTTA TTCATAGAAG ATAAATATCA AATCAATCAT AATTATTTGA CAACAAATAG 300 
CTAACGATTG TTTTAATCTA CATTTG6CTT ATAGCATTTT AAACCTATAC TCTATTTTGA 360 


TACAATATAA GT6TAAAATC AATCATAAAA AGGATATTCA ATATCTGCAT CCAAGAAAAA 420 
CATTACAATT ACCTTT 436 
(2) INFORMATION FOR SEQ ID NO: 641: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 442 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 641: 

GTTATTAAAT TCAGAGTQGT AGCAAATTAA AGTTAnTCAA GA6TTAAGAT GAATTTAATT 60 

CATGAACACG TCTATTATTT TTATAATTGT AGCAAATAAA GCTTTACATC AAGGAGGTAA 120 

TTAAATATGT TCAAAAAATA TGACTCAAAA AATTCAATCG TATTAAAATC TATTCTATCG 180 

CTAGGTATCA TCTATGGGGG AACATTTGGA ATATATCCAA AAGCAGACGC GTCAACACAA 240 

AATTCCTCAA GTGTACAAGA TAAACAATTA CAAAAAGTTG AAGAAGTACC AAATAATTCA 300 

GAAAAAGCTT TGGTTAAAAA ACTTTACGAT AGATACAGCA AGGATACAAT AAATGGAAAA 360 

TCTAATAAAT CTAGGAATTG GGTTTATTCA GAGAGACCTT TAAATGAAAA CCAAGTTCGT 420 

ATACATTTAG AAGGAACATA CA 442 
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TTGATGTATG TAGTATCTCT ACCATTGATA CCGCTCAACT TTTTATTAAC TCGATACTTT 1560 

GGTGTCTTCG TATCAATATT AATAAACTTA GTATTATCAG TCATTTGTGT CTTGTTTTTA 1620 


ACATTGAAGA GTTTATTTTG GGTGTTGCCG TGGGGGATAA TGCAGAGAAT CCCGCTTATT 1680 

ACGCTTGGTA TACTACCTAA TGGtTTAGTT GTAAACCATA ATTCAAAATA CTTTAATGAT 1740 

CTCAATGCCT TATATATTTC GATTATTGTT AGCATCATTA TTTTCGCGAT AGTAACATTT 1800 


TTAAATAATA AGAAAAGTTG GCGATTAAAA TGATAATTAA CGAATTAAAG TCATGTAAGT 1860 

TGAAATTTTC TAAGCAAGCG CTCACATTTG TACCCATTAT TGTAACCATA TTGTTTATAT 1920 

TATTTATAAA TTGGTATTTA AACGTAAATT TATGGAATGG TCGACAAATm AGTTTGTTTA 1980

CAGCGAGTTT TAATGCAATT ACATCGCTAT TAATTTCTAT AAACGTCTAT CAAGTTATCA 2040 

AITTTGAAGA AAATATTGGT CACTTTAATC ATATTTTAGG AAAAGCTAAT AGGCTAAATT 2100 

GGTTAAATGC ATCAATGATT TTTACTTATA CTATTACAGC CATATGTATT CTATTAGCAT 2160

CAATTAATTT ATTGTGGCAT TCACATGATA TGAAAATAAC ACTTATGTTT ATAGGcGTaT 2220 

CATTGTTTTT CAATGTaATT ATATTACTGC TACTTTTTAT TTTTAGTATT TTCATTAAAG 2280 


ATGTAATGGC TATTGTTGTC GGAGTTTTAA TGTTTATTTT TAACGTTTAT TTTGGATTAG 2340 

AAGTGCTTGG AGATCATTCG TGGTTCTATT TACCAATCAC ATATGCTACA CGTTACX5TTT 2400 

ATATGTTTAn CGAAGGGGAG TATACCAGTT ACATTAACAT TGGGCAATCT ATATnATTAT 2460 


CACGnTGCCG AT 2472 
(2) INFORMATION FOR SEQ ID NO: 643: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 646 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 643: 
ATAATATAGG AATTATTTCG ATnACAGATT TTACGAATCG TGCTACGATG CAAAATGAAn 60 


ATAAAQATCC ATATGGCGAA AAGTTAGCTT ATGGAATTGC TTTTAATGGC AGTGTGGATA 120 
TGCAAGGGGA TAAACAAGTC ACAATTCCAA AATATAGTGT AGTTACAATT ACTGGCGAAA 180 
ATAGTAAAAA TTATCGTGTT ACCGCCGATA ATAAGACTTA CTATGTTAGT AAAGATAAAT 240 
TAGAATATTT TAACCCGGCA GGTTTATATC AAACGCATAG TTTTAAAAAA TTAGCACCAT 300 
ATATGAAATC AAATTATAGT AATTACTATG CATACTTTAA TAGTCAATTA CATAAAAAGC 360 
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10 



CACAACAACC GATACAATTA CTTTTCAATG ATAATAATCA GTTATACGGT TTTGTTTATC 480 

CAATTGTAGA TAAAAAAGAA TTAAAAGATA AGTTTAATAT TAACAATAAC ATTTGGATTA 540 

CTAAAGTTGG GAATGGATAT TGTATTGCCA ATTTGAAAGA AGACAAATGG ATTTATATTG 600 

AATTGTAGGT GTAAAGATGC TAGATAATAT TATTTTATAT TTTAAA 646 
(2) INFORMATION FOR SEQ ID NO: 644: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 426 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 644: 

TAATATCGGA ATTTGATAAT GAAGATATCT AATTTTTTAA TATTCGTAGC TTTTATTTTT 60

CTACTTATTT TATGTTTATT TTTAATCTTA CAAATGACAA ACCATTAAAA GTAGCATCCC 120 

AACATCAAAC AAAAAAACAA TTCATCAAAT AAAAATCGCT ACAAAACCAA GTCATTAAAC 180 

ACGCAATAAT TAAAATTTTC CACTCATTAT AATTCTGAAT TCCAAATGTC GAATTCCGAA 240 

AACCAAACTC CAAATTCCAA AAACGCAACT CCAAAATTAA AAGCATTTCC CTACCATTCG 300 

GGAAATGCTT TTTACATACT GgATTACTCT GTCATTAATG ATTTTACAAC GGgAAACCAT 360 

GTCGtCATGT ATGACCaAAG TAGCGTCGCT AtCaTaAgGt GGTTCGGATC TTTATTGGAT 420 

AATnAT 426 
(2) INFORMATION FOR SEQ ID NO: 645: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3241 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 645: 


nTTATTTACC CACAACATGT TGCGACATTA GGTAAATGGG TACCTTATTT ACTTGGTATT 60 
GTTATGTTAG GTATGGGATT AACAATTACA CCTAATGATT TCAAAATGGT CTTTAAAGCA 120 
CCTAGAGCAG TAATTATTGG TGTCTGTCTA CAATTCAGTA TTATGCCCAC ATTAGCATTT 180 


ATAATTGCAA AGTCTTTTCA TTTACCACCT GATATTGCTG TTGGCGTAAT ATTAGTTGGA 240 
TGTTGTCCGG GTGGGACATC AAGTAATGTA ATGAGTTATT TAGCCAAAGC TAACGTAGCA 300 
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ATATATCTAT TTGCAAATGA ATGGTTGGAA 
GTATCTTTCG TGAGTATGTT GTGGTCAGTT 420
GTTCAAGTTG TATTAATTCC AATTGCTTTA GGTATTGTTT TGCAAATTAT TAATCGTAAA 480
ATTGCTGAAA AAGCTTCTAC AGCTTTGCCA ATTATATCAG TTGTTGCTAT TTCATTAATT 540
TTA6CAATAG TTGTAGGTGG CAGTAAGCAC CAAATCTTAA CTACAGGATT ATTAATATTT 600
TTAGTAGTTA TTTTACATAA CGTATTAGGG TATACGATTG GATATTGGTT AGCTCGTCTT 660
TTAAAATTAG ATCGACAAGA TCAAAAAGCA GTCAGTATTG AAGTTGGAAT GCAGAACTCT 720
GGTTTAGCTG TGTCATTAGC aCCATTGCAT TTTAATCCAA TTGCAGCAGT ACCAGGCGCA 780
GTGTTTAGTT TCATTCATAA TATAACAGGG CCTATTTTAG CAAAGTATTG GTCAAAAAAG 840
TTATAATTGC ACTAATAGAA TGAAGTGGTC ATCGGACTAT GTTAAGCTTT GATAAAGAGA 900
AAAAATAGAG GAGTAAATAT ATGTATAGAG CAGTTATATT TGATTTCGAT GGAACAATAA 960
TAGATACX3GA ACAACATTTA TTTAATGTTA TTAATAAACA TTTAGAGATG CATAATGCCG 1020
ATCCTATAAG CATTGATTTT TATCGTTCTT CTATTGGAGG AGCAGCTACA GATTTGCATG 1080
ACCATTTAAT TAAAGCGATT GGTTCGGAAA ATAAAGATAA ACTTTATGAA GAACATCATC 1140
TTACTAGTAC AACATTACCG ATGATTGATA CGATTAnATC ATTGATGGCA TTTTTAAAGC 1200
AACGTCACAT TCCTATGGCA ATTGCCACAA G7AGTGTGAA AGCGGAAATA ATGCCCACCT 1260
TTAAAGCATT AGGTCTAGAC GATTATATAG AGGTAGTTGT TGGTAGAGAm GATGTTGAAC 1320
AAGTTAAACC TGACCCTGAA TTATATTTAT CTGCAGTACA ACAATTAAAT TATATGCCGA 1380
CACAATCTTT GGCTATTGAA GATTCTGTAA ATGGTGCAAC AGCCGCGATT GCAGCTGGAT 1440
TAGATGTTAT TGTTAATACG AATAAAATGA CAAGCGCACA GGACTTTTCT AATGTAGATT 1500
ATGTAGCAAA AGATATTGAT TACGATCAAA TTGTAGCGCG TTTCTTTACG AAATAGGAGG 1560
CGTATCATGA TGGGTTACAT TATATTGTTT TTTCTAGCTG GTCCAGTAAT TTTAGGCGTT 1620
GGAAATTTGG TGATTGGTCC TATATTTAAC AAACAGACAC CATTTCGCGT GCAAGTAAGA 1680
TCTTTTGTTG kTGGkTCoiAT GrTTTACTTA ATACTCGCAA CAATTGGCTA TTTTTTACTA 1740
TTACAAGGTA AACTTTAACG AGAAAACCAC CTTACCTCAT TAAATGGACX3 ACCATATGTA 1800
TGTGAAATGG TAGAACGTTC ATGTTTATGT ATGAGATAGG GTGGTTTAAA TAGTTACATA 1860
TATTTTAATA ATAACGTCAC GATGATAAGT ACAATTAAGA TAATATCTAT GCCTACCATA 1920
ATTGTAGCTC TTGTTGCAiT ACTTCCTTGT TCTTTTGCTG ATTTCATAGC ACGGTAGTTT 1980
GGCACAAAGC TAATAATTAG TAAGATTAAT ACAATTACAC CAATTAATGC TGTTGTCATG 2040
ATGAACGACC TCCTTTATTT TTTTCAATCA ATTCCCAAAT AAACGTAGCA ATCACACCGA 2100
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CAATAATTAA TGCAATCGGT AAAGTCGTAC CGAGTTTAAT CTTGCGCTCT GGAGAATTAA 2220 

TAATAGTAAA TACTGTAAGA CAAATGAGTA TGAAAGCAAG TGTTGCAATA ATAGTTCTTC 2280 

CAACTAAATA TAGGATGTCA GGTTTTTCCA TACCGATATA ATTTATGATG AAAAATGCTA 2340 

CAGCAAAGAG TACCGATATT TTTGTAGCAC GTAGCAGTAT TTGTTTTAAC ATTGATATAC 2400 

TCCTTTTTAA TATTATTAAA ATTATATCAT AATTACCAAG AATAGCTGAA GTTGTATGTG 2460 

ACTCAACGGT ACTTGAGCAA CTTTTTTAAT TTTTTAGAAA AATCACAAAA TAATTGTTTG 2520 

CAAAGTTGCA AAAGCCTGCT ATAGTAGTTC TGTAAACGAT TGCATGGTAT GCAAATATTA 2580 

ATGTACCAAA ATCGATAATT TATAGTATAA TTACGGCAAT AAGTTTTTTT ATGGATTTAT 2640

TTAGTATCAA TCAGAGATGG GGTAAGAAGT TATGGAGAAC AATGAACTAC AAAGGGGATT 2700 

GAGTGCCCGT CAAATTCAAA TGATTGCACT TGGTGGTACG ATTGGCGTGG GGCTTTTCAT 2760 

GGGTGCGACA AGTACAATTA AATGGACAGG CCCATCAGTT ATCCTTGCAT ATTTAATTGC 2820 

GGGTATCTTT TTATTTTTAA TCATGAGAGC AATGGGGGAA ATGATTTATT TAAACCCTAC 2880 

AACAGGATCA TTTGCAACAT TTGCAAGTGA TTATATACAT CCTGCAGCAG GTTATATGAC 2940 

AGCATGGAGT AATATATTCC AATGGATTGT AGTTGGTATG AGTGAGGTCA TCGCAGTAGG 3000

AGAATATATG AAGTTTTGGT TCCCGGAATT GCCAACTTGG ATTCCTGGTG TTATTGCTAT 3060 

TTTATTATTA ATGGCAGCGA ATTTATTCTC GGTAAAAGCG TTTGGAGAAT TTGAATTTTG 3120 

GTTTGCTTTA ATTAAAGTTG TAACAATTAT TTTAATGATT ATTGCTGGTT TTGGTCTTAT 3180

nnTCTnTGGT TTTGGAAATG GTGGCCATGC GGTAGGTATT TCTAATCTAT GGACAAATGG 3240 

C 3241 
(2) INFORMATION FOR SEQ ID NO: 646:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1311 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 646:



AGGCGTCAAC TCAGATGGTT TAATAATTGC CGTATTACCT GCTGCAATAG CACCGATTAA 60

AGGTtCGAaC ACTAGTtGAA AAGGATAGTT AAATGGTGCA ATGATCAAAA CTGTTCCATA 120

AGGTTCTTTT TTGATATAGC TTTTTGTTGG AAATAAATAT AAAGGTGTGT CTACATTkTT 180

TGTTTTAGTC CAGTTTTTAA GTTCcTTACO GGCAATTTTG ATACTTyTCA AAGTTATGCC 240

AATATCGCTC TCGTATGATT TAATAGCTTT GCTTAACTTC TTTAATTGCT CTTTTCTAAA 360

ACTAATATCT TTAGTTTGTT GTGTATTGAA AAAAGCTTTA CTGTCATAAA ATTTTTGCTC 420

AATGATATTC ATAATGAAAA GAACCTCCTT ATATGATTAT TTTGGAAAAA GCGATTAATT 480

GATTTGAATG TTGTCGCCGT TAATTTTAAA TGGTCTTTCG AATTATATAT GTTGAAAGTT 540

6AAAATAGAQ CGATGAATCG TGTACATAAT AATATTTATA ACTTTAATCA TAACGAAAAA 600

G6TAGGAAGA AAACAAAAAT TTATACTCAA CATCGCAAAT ATTTTAAGAA AATGTAAAGA 660

CAAAAGGGGA ATTGTATAGA AATCACTAAT CTGTGGGTTA GGGTAGCTAA AGGAATAAAA 720

ACTACTATTG AAAAAGGGTT GTAAATTAGT CAAACGTAAA TAAAAAACAG TTCATTGAAA 780

GTGAAATAAA TTCTACTTTA ATGAACTGTT AGTTAAATAC AACATGTCTA TAATTAGACA 840

GTAATATAGT ATTATTTTGT TAATGCTTCA GTGATTTGAG GTACGATTTO TTTTTTTCGA 900

GAAACGACAC CAGATAAGAA GGCCATGTCA TCTTCTAATT GAACATTGAA TGtTCGCCAA 960

CTTTATCTTT TTCAGCACCT ACAACTAAAA TTTTA6AATC ACTATTAATG ATGTCAGTAA 1020

CAACAAGTAC AAATAAGTCA TATTTTTCTT GTGCACTTAC AGCTAACATT TCTTTTTCTA 1080

rATCTTCriT ACGATTTAAC ACTTCGTCAA GGTCAACAGC ATTAACTTGT GCAATACXSAG 1140

TCACATAGTC ACCCATAGTA AATGATTTAG CATCCATGTT TAATAAGAAT TCAACTGATT 1200

TATCAGTTGT TGAAGCACCT GCTTTtAACA TATCTAAGCC G7ACTTTTGA ATATCAACTT 1260

TAGCAATATC TTTnAATTCT tCAGCTGCTT TAACATCTTG TTGTGTACAT G 1311


(2) INFORMATION FOR SEQ ID NO: 647:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1496 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 647:






CATATACTTT TTCATTTCTT TACGAGATAC TTTACCAGAG GATTTAGACT TCATACGCTG 60

ATCCATATGT GCTTGCGTTT CAGaATGTCC ACAAACACAA CGATATACCG CTTCTTTCCC 120

TTTACCAAAC AACGTTAATT TCTTTTTACA GTTTGGACAT CTTGCATTTG TTTTGCGCTG 180

TACATTCTTT TTCGTCTTAC AAGATGGATC TTGGCACACA AGCATCTGAC CATTTTTAGT 240

TTTAACTTTA ATCATGAATT TACCACACGT TGGGCATTCT GTGGTTGTTA AATTATCGTG 300

TTTATATTTA CGATCACTAT TTTTAATCCC ATTTACAACA TCTTTC6TAA AATCTTTCAT 360

CCATTGTGcA GTTAAAAGTG GCGACGTTAA TTCTTCTGGT GCTAATTCTA ATATTTGTTT 480

ACCTTTTGAC GTTACTTTAA TTTTACCGTC TCTTGATTCA ATGGCATTCA TATTAAATAA 540 

TTTATCGATA ATGTCGGCCC TTGTTGCAAC TGTGCCGATA CCACCTGTTT GTTTTAAAGT 600

TTGCGCATAT TTTTTATCCT TCAATTGAAT AAAGTTCTGA GGGTTCTCCA TCGCTTTTAA 660 

TAACGAACCT TCATTAAAAT ATTCTGGAGG TGTTGTTTCA TGTTCTCTAA TATTTGTTTT 720 

TGAAATCTTC ACTTCATCGC CTTCTGAAAA AGGCTGTTGC ATCTCTGTAA TAGATTCACC 780 

TTGTCTAATA GATTTAAAAC CTAAAACAGT TGTTACATTC TCTTTCAAAA CAAATGTGTG 840 

CCCTGCAACC TCTAAAGTTA CAGTTATCGC GTCATACTCG TGCGGAGGCA TTAAAGCTTC 900

TAAAAAACGC TCGACAATCA TATCGTATAA CTTTAATTCT CTATTACTTA AGTCTGACAT 960 

GACAGGTCTC ACTTCTGTAG GAATAATTGC ATGGTGATCA GATACTTTTT GATTATTAAA 1020 

TATCGACATT TTTGATGAAA ATGTTTTAGA CATTAATGGG CGTGCTTGGT CTTTATATGT 1080

TGTTGCCATC GTCACCTGAA TACGTTCTTT CATAGTATCT ACCATATCAG TTGTTAAATA 1140 

GTTTGAATCT GTTCTTGGAT AGGTTACGAC TTTATGTCTC TCATATAAGC TTTGAAGTGT 1200 

ATTCAATGTT TCTTTAGGTC CAATTTTATA ACGTCTATAC ATATCTTGrT GTAAATCTGT 1260 

TAAATTGTAC AGTGATTGCG GATACGACTT CTTATGTTTA GTAGCAACAG ATTTAATCTT 1320 

ACCATCGACA TTTTTCAAAT TATTAACCAT CTGTTCTAAA GTTTCTTTAw TGGtATATCG 1380 

CTGATTTGaw TCTAGCTGAA AATCAAACCC TTTTACCGTT AATGATAATG TAAAGTATTG 1440

TTGTGGnTTG AACTGATTAA TCTCTTGTTG GTCGTGTAAT TTACTAAATT GAAACGGn 1498 
(2) INFORMATION FOR SEQ ID NO: 648:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1044 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 648:

TAAAACAATT CGTTTTATTA ATACAGTTTG TAAAAGTATT CTCTTAGTAC TAATAGCTAA 60 

TGTAATGATA GAAAATACTA GTGTTATTAA AGATTTGAAT AAAATAAAAG AAACTGAGAA 120 

ATATTGGAAT GTATTAGATG ATTATTACAC GATTGAATTT GCACCTTATC ACGAAACAAA 180

ACAAAGTTTG ATTGATAATA TGGTGCGATC AGAACAATTA GTAAAGGCTA GTGAAGCAGA 240 

AAATAATGCG ATTTTATTCA AACCAAAGGG TGACTCCGTT GACAATGACA ACTTTTCGCC 300

TCAACCTGAT ATTCCGATAA AAAATCAAAA AAATAATGTC GAAGTAATTA TTCCACAAAA 420

GTTTCATGCA ATGCGTAATG AAATCAATfUV AGrATATrAT TCATGGTTTG AATTTGTACA 480

A/VATAAAAAT AATAAAGAGA ATAAGTTATC TATACAGTTT ATCAACAAAA ATGATTGTCG 540

AATTTTTTCA TTTGATGCAC GAGATAGTCG Cf^TTTKSTriA PGr'f*AATf*AT ^\7V>\>/mx >vT\x 600

TGTGAATGTT CAGGCATCAG ATTTATCGaA TGATTTttAT '•x X xnxvvwon xvM%x x x ivi^nx X VvCIW X VwOUlUVJ 660

CGGGTATTTa TTCaAAAATT ATGaCGCGCT AHTAAAAAAT ATTGGAAAAG TATCCATCTT 720

GATGGGGAAA TCCAGTGGAA TAArrAATTA T&AafiATAOP GTGATGGAAA TGTATCATGA 780

AAACAATTTG AAATTAACAG TACTCAACTT TTCACAAATC ATTATCGCAA TCATTTTAAT 840

AATTATTATT TTATTTGATG TGAAATATTA TTTTGAACAG CATCGAAAAT TACTCGTAAT 900

CAAAAAGC7A TATGGTTATT CAACATTAAG AGCCAATTAC CAATACTTAT TAATAAATAA 960

TATAGTTGTT ATTTTTATTG GAATATTGAC GAATGTAATT TTACATTCTC ACTATATAAT 1020

GATGTTATTT GCAACGATTC TTGT 1044



(2) INFORMATION FOR SEQ ID NO: 649:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 399 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 649:

GAACATATTG GGTTATGCAA GGnGGTCACT CTTCACACTT ATAAACAACA TTTTAATAAT 60

GTAAAGTTTA ACCAGCTAAC ACTTTTGTTA GCTGGTTTTT ATTTTCCTTC AATTTTTAAA 120 

TGGTTAAGTC CCCTTCTATA TCTTATAAGA CAATCATTAT AATCAATTCA ATTAATACAT 180 

TAACAACCAC AACTAATAAA TATAGTAACT TCAAAATCCA TATTTATGTC TAAAGATAAT 240

CTCAATGTTG TTCACGTCAA TAAAATTATC CCTAGGTTTT TAAAAATTGT ACATGTTTAA 300 

ACAATCAAAA GTGTACATTA TTAAATTATC ATTTCCAGTT AGATTTAGAA AACATTCACA 360 

CCACGCATGG ACCAACGTAT TCGTCTTcAT TcATTTTAG 399



(2) INFORMATION FOR SEQ ID NO: 650:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 747 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

TCATTATCAT AAAAATAATT TTTGACAACT TGTTGTGTAA TGGTACTACC ACCTTGCACA 660

TCTCTGTCGC TAATCGTTGA AAATAAAGCT CTAGTTGTAC CTTTCAAATC GAATCCATGA 720 

TGATTGTAGA ATCGTTCATC TTCCATTGAA ATAAAGGCAC CTTTAACATA CTCTGGCATG 780 

TTATCAGCTG ACACAAAACT ACTTTTATTT TCAATTTTTC TTAGTTCATC CACATTATCG 840 

CGTGTAGATA AAAAATACAT GATACCAATA AACAATGCGA TAATGATTAG AATGGTTAAT 900 

AATATTTTTA ATAGTATTCG TTTACTmT TTCTTTTTCG GCGGTTTGCC AACTGGTTGA 960 

TAATACGTAT TATAGTGAGG TTCGTGTTTC ATATGCTCAA AATGTTCATT TGAGTTTGAG 1020 

TACCTATCGC TTCTTTTCAT GCGTTTGCTC CTTCTTTTAA AACTCACTTA GtATATACCT 1080 

TGaGTTTACC AGTACTATCA CAAATAGGCT ACACTTTTTG GGAAAATCAG TCCAAGGGCT 1140 

TACAATCGTA TACGCCATCA TACTTACTTT TTTCTTTTTT GAAAAAATTA TAGATAAATC 1200 

ATTGCAATTT TAAATATTAA TCATGTCAAA TATTGTTATA TTTTATAAAA ATAAAAGACC 1260

ATCCCTATTA AATGCCAATA GAGACGACCT TTTATTTGTT ATTCATTTAT TAAAACTAAA 1320 

ATCCATATTT CATTTCAAAC GAAAATATAT AAATTTTAAC AATCOrTAAC CAC 1373 
(2) INFORMATION FOR SEQ ID NO: 652:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 859 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 652:

GGCAGATAAT TTAGTCATTG TTGAATCGCC TGCAAAAGCA AnAACCATTG AAAAGTATTT 60 

AGGTAAGAAA TATAAAGTTA TAGCTTCAAT GGGACACGTC AGAGACTTAC CAAGAAGTCA 120 

AATGGGTGTC GACACTGAAG ATAATTACGA ACCAAAATAT ATAACAATAC GCGGAAAAGG 180 

TCCTOTTGTA AAAGAATTGA AAAAACATGC AAAAAAAGCG AAAAACGTCT TTCTCGCAAG 240 

TGACCCCGAC CGTGAAGGTG AAGCAATTGC TTGGCATTTA TCAAAAATTT TAGAGCTTGA 300 

AGATTCTAAA GAAAATCGCG TTGTTTTCAA CGAAATAACT AAAGACGCTG TTAAAGAAAG 360

TTTTAAAAAT CCTAGAGAAA TTGAAATGAA CTTAGTCGAT GCACAACAAG CGCGTCGAAT 420 

ATTAGATAGA TTAGTTGGCT ATAACATCTC GCCAGTTCTT TGGAAAAAAG TAAAAAAAGG 480 

GTTGTCAGCG GGTCGAGTTC AATCTGTTaG CmTTCGTTTA GTCATTGACC GTGAAAATGA 540 

nATTCGAAAC TTTAAACCAG AnGAATATTG GACTATTGAA GGAGAATTTA GATACAAAAA 600 



aAAAGATGTT GAGAAAATTA CAGCTGCATT AGATGGAGAT CAATTCGAAA TTACAAACGT 720
GACTAAAAAA GAAAAAACGC GTAATCCAGC AAACCCATTT ACAACTTCTA CATTACAACA 780 
AGAGGCGGcA CGTAAATTAA AcTTTmAAGC AAGAAAAACA ATGATGGTCG CACAACAATT 840
ATATGAAGGT ATAGATTTG 859 
(2) INFORMATION FOR SEQ ID NO: 653:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 747 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 653:



TTCAkCTTQG TTTGGAAATC ATGTTCCTCA ATTGTTGGAC TTAAACGAAT TTCTTTAACA 60

TTGATAATTT TTTGTTTCTT TTTCATTTCT TTTTCTTTTT TCTGTTGTTC GAATTTGAAT 120

TTACCGTAAT CCATAATTCT TGCAACTGGT GGTTTCGCAT TCGGTGCAAC GACCACTAAG 180

TCTAAATCTA CACGTTCAGC CATTTCTAAA GCTTCACGCT TTGATTTAAC ACCAATTTGT 240

TCACCATCTT GACCGATTAA ACGTAATTCT TTTGCACGAA TTTTGTCATT GATTTGAGTT 300

TGATCTTTTG CTATGGTTGA CACCTCCAAA ATTTTTACGA AATTTGCACC AAGCAAAAAG 360

GAAGAGCAGG TATAAAATAC CCGCTCTTCC TTATACACAG TTATGTGTAA TGTGATTAAC 420

CTGCCAACTG CTTTATGCGT CGCTACAGGT GAGAAGCGGG TGCTTCTACT TGGTTCGTTT 480

CGTATTCAAC GTTATTAATC ATATCAACAA TTCACATTTA AGTCAACACT ATAACTGTAA 540

TTATTTTTAT TTTAACCTTT TATTTCATCC ATTGACACGT CTTGACGTAA ATCTACTTGT 600

TCTAATG6AA TTTTTTTOGT TTTATATCGA AGCTTATGAT AAATAAAGAA TGCTAAAAAT 660

ACTGGAATTC CCATATACGT AATTAAGAAG CGACTAAAAT TAAAATCTCC TGTnTTAATA 720

AAGTCAACAT CTTGCCCAAn AAnTACT 747



(2) INFORMATION FOR SEQ ID NO: 654:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 501 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 654:

TTGTGCTTCT TmTAGCTT CTTGAACCTC TTGTGCCTCT TGTGATGTAT CACTyAAATT 120

ATTTGCACTT GCTTCTTCTT TTATCGCTGC TTGTTGTGCT TTCAATGCCA CTGCTTTTGr 180 

TTCTTyATTT GATACAGCCA CACTTTTATC CGCTTCTGCT TGTGCTTCTC TTTTAGCTTC 240 

TTGAATCTCT TGTGCTTCTT GTGATGTATC ACTTAAATTA TTTGCACTTG CVi ' CTi ' CnT 300 

TATCGCTGCT TGTTGTGCCT TTAATGCCGC TTGCTCATTT TTAGATTTGT TTAAAAATCC 360 

TTCAACACGT TCTTTTGTAT AGGCAACCGT TTCTTCAAGT TGCGTTTTTC TTTCTTCAAA 420

CTTTTGCGAC AGTTCTTGTn CTTTGACTTT llAAATCATCT GCTTTTTGAT AAACTTTATT 480

TTAAAATACC AACCTAAAGC C 501 
(2) INFORMATION FOR SEQ ID NO: 655:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 830 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 655:

CACCTTGTCA TAATTAATTT TTTGATTTTT CATTTTACTG ATAATAGGTT CAGCATTAAT 60 

CATGATTTAA CCTCCCACAT TTAATCATTA ACTTCTATTA TATATGATTC ATATTAAATG 120 

TCAGTCAAAA AAGTTAGAAA TTCATTTTAA TGCATTTATA TTTCGAAAAT CCCTTATGTA 180 

TCTAAAAGCA TTTTTTAAAC TTGAATTTTA AAACACTAAA CAACACATAC GTCTCTGTGT 240 

CATTTTCATT TTTTGTATGT CATATATATG TTTACTTCAT TTAAATCAAT TTCATCTTAT 300

AATTTATCGT GTATTTTACA AAAGATTGAC TTCAATTCAT CGXAAAAGTT ATACTTTTGC 360 

CATTTTTTAA TGTAACATGG TGTTAGTAAT AAAAATAATA CATTGAGGTG TTTTACATGA 420 

CAGCATTATT CCCTTATATC GCTTTTGAAA ATTCAAAAGA AGCCCTTGCA TATTACGAAG 480 

AAGTATTTGG TGCAACTGAC GTTAAACGTT TAGAAGTTGG CGAAGAACAA GCGTCACATT 540 

TTGGTATGAC TAAGGAAGAA GCGCAAGAAG CAACTATGCA TGCTGAATTT GAAGTGCTTG 600 

GCGTAAAAGT GTTATGTTCT GATTCTTTTG GTCGCGCTGA CAAAATTdAT AATGGCATAT 660 

CATTATTAAT TGATTATGAT GTTAACAATA AGGAAGATGC TGATAAAGTT GAAGCATTCT 720 

ATGAGCAAAT TAAAGATCAT TCTTCAATTG AAATAGAATT ACCGTTTGCT GACCAATTCT 780 

GGGGTGGCAA AATGGGCGTC TTTACCGATA AATACGGTGT TCGTTGGATG 830 
(2) INFORMATION FOR SEQ ID NO: 656:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 539 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 656:

GTATCCATGG GCCCGTTnCG CACAACATTT GGnACAATTA GGTACACGnG TTGTCaTCGG 60 

TCGTTTCGGT ATAATTTTAT CGAATGaAGG CGGTGCGTTA CAAACAATGA AACTACCATA 120 

CGAATATTAC ATTGGTGGTA AATTAGGTrC TGGTCAACAA TGGTATTCAT GGATTCATAT 180 

CAATGATTTA ATTCAAGCTA TTTTATTTTT AATAAATAAC GAGTCAGCTA GTGGTCCGTT 240

TAATTTAACT GCACCTATAC CTGAACGTCA AAATTTATTT GGCTACACTT TAGCAAGAGC 300 

TATGCATAAG CCTCATGAAA CTTGGGCACC AAGTCTTGCA ATGCGTCTCA TACTTGGTCA 360 

AATGTCAACA GTAGTATTGG ATACTCAAAA AGTATTACCT aATAAAATTC AAGCATTGGG 420 

aTTCCAATTT AAATATAGTA ATTTAAAAAT GGnACTTGAA GATTTAATTA AAGAATAATC 480 

AATACCATTA ATGAGCATTA GAAACAACAT ATGTACTAAA TGTAATGTCT AGAGCGACT 539 

(2) INFORMATION FOR SEQ ID NO: 657:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1106 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 657:



ATCAAGTTGA ATATCATCCA TATTTAACCC AACATAAATT GArATTATAT TTGGCAGCAC 60

AACGTATCGT gATGGaATCT TGGTCACCAT TGATGAATGC ACAAATTTTA AATGATGA6A 120

CAATTAAAGA CATTGCTCAA GAATTAGGAA AGTCACCTGC CCAAGTTGTT TTAAGATGGA 180

ATGTGCAGCA TGCTGTGGTT ACAATCCCTA AATCGGTGAC ACCAAACAGA ATCTCTGAAA 240

ATTTCCAAAT ATTTGATTTC GAATTATCAG ATGAACAAAT GACGCGAATT GATGGTTTAA 300

ATCAAGATAA GAGAATTGGA CCTGATCCAA AAAAATTTGA AGGCTAGATT AAAATCGCTC 360

AACTGATGAA AAGGTTAGAT GAATTGTCAG GGCTTGGGAC ATTAAGTTCT TAGGCAATGT 420

AAAAAAGCTG ATTTCTATTA ATTATTTGAT AGAAATCAGC TTTl"rrGATA TGTATTTTAT 480

AATGTACAGC TCGTTGAGCT GCTATTTTCC TTATATTAAG TGCCATTAAT ACAAAACCTA 540

GCTCTCGTTT AACTTTATTT AtTCCTCGAA CTGACGTTCG AGTTAAACCC AAAATAGCCT 600

CTGGTTCAGA AAGCTTTTGA TTAATTTGGA CTTTAAAGTA TTCCCAATTA TAATTCTTCA 720

TGATTtTCTT ATTGGATTTT GAATTTGGTT TCATGCATTG TTGCCTCAAA GAACATGCTG 780 

AACAGTCATC GCATTCATAT AGTTTGAAGT CTCGTTTAAA ACCATATCTA TCATTACGGT 840 

ATGCATATCT TTTAAAACCT ATTCTTTTGT TATTAGGACA TATAAATTCA TCATTAAGTT 900 

CGTCATATTT CCAATTTTGA GTGTCGAAAA TGTCACTTTT AAACTTTCTA GTTTTATCtT 960 

TAATAAACAT GCCATACGTA ATAAGTGGCG TTTTATTAAA ATCATCTATA ATGGCCATAT 1020 

AGTTTTGGCT CACTACCCAT AACCTGGCAT CAGCTACCAA ATGAACCGAA GGGATTTTTG 1080 

GAATCCATTG GTTGAAAAAA TGGGAA 1106

(2) INFORMATION FOR SEQ ID NO: 658:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 400 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 658:

TTTTAACTTT ACTCTTTGAT TTAAAGAGTG ATAAATGTTT ACAGTTTAAT TAAAACTGCA 60 

TAAGAACTTC TAGCTTTTCT CTTTCGTTCA AAGAGAAGCA GCTGTTCGCA GTTTAATCAA 120 

AACCACATAA AGCTTTTAAC TTTACTCTTT GATTTAAAGA GTGATAAATG TTTACAGTTT 180 

AATTAAAACT GCATAAGAAC TTCTAGCTTT TCTCTTTCGT TCAAAGAGAA GCAGCTGTTC 240 

GCAGTTTAAT CAAAACCACA TAAAGCTTTT AACTTTACTC TTTGATTTAA AGAGTGACAA 300

ATGTTTACAG TTTAATTAAA ACTGCATAAG AACTTCTAGC TTTTCTCTTT CGTTCAAAGA 360 

GAAGTTCTAA TACCACCATA TCGTGCGATC GGGAACGGTA 400 
(2) INFORMATION FOR SEQ ID NO: 659:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1899 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 659:
ATAATTACAA TGGCTACTGC AATTATTGTT GGAATTATAT ATTTCAGTTT AGTCATCATT 60 
TATGCTCCCT TGATTTCAAA TTCATATCAT TAGTTTACCA TATTGAAGAT GATATAATAA 120
AACTTTAACT GAGCTTCATG GGGCACCAGG TTTTQAAGAA GAAGTAAAAA ATTATATGAC 240

TCAGCAAATG GCGCCGTACG TAGATGAATT TATTGAAAAT CGTATGGGTG GATTTTTTGG 300 

TGTGAAAAAA TCTAAAAATC CAAATGCAAA ACGTGTAATG ATTGCAGCAC ATATGGaTGA 360

AATCX3GATTT ATGATTACAA ATATCACTAA AAATGGAATG ATTCAATTCA CAAATTTAGG 420 

TGGTGTTGCA AATGATATTT GGCAAGGACA ACGCTTAGTA ATTAAAAATA GAAATGGCGA 480 

TAAAATTATC GGTGTTGTTT CTAATATACC TAAACATTTT CGTACTGGTA GTGAAGGTGC 540 

ACCGGAAATT AAAGATTTAA CATTAGATAT AGGTGCTCAA AATGAAGATG AGGTGCGTGA 600 

gCGCXSGAATA GATATAGGAG ATACAATTGT ACCTCACACG CCATTCACAC AGTTATCTGA 660 

ACATCGATAT AGTGCTAAAG CATGGGATAA TCGTTATGGT TGTGTCTTGG CAATTGAAAT 720 

ACTAGAATTA TTAAAAGATA TAGAATTAGA TGTAGACTTG TATGTTGGCG CAAATGTTCA 780 

AGAAGAGGTT GGATTACGAG GTGCGAAAGC ATCTGCAGAG ATGATAGACC CAGACGTTGC 840

ATTTGTAGTT GATTGTTCAC CTGCCAATGA CGTTAAAGGA AACCAACCAT TATCTGGTGA 900 

ACTTGGTAAA GGGACGTTAA TTCGCATAAA AGACGGTACA ATGATTTTAA AGCCTGTATT 960 

TAGAGACTAT TTATTAAAGT TAGTAGAAGC ACATGACATT GAACATCAAT ACTATATGTC 1020 

ACCAGGTGGA ACAGATGGTG GAGAAATTCA TAAAGCTAAT ATTGGTATTC CGACTGCAGT 1080 

TATTGGTGTA TGTGCACGAT ATATTCATAG TACAGACTCA GTATTTGATA TAAGAGACTA 1140 

TTTTGCAGCT AGATCTTTAC TTTCAGAAGC CATTTGTAAT TTAGATAATA ATCAAATAGA 1200 

AACATTACAA TATAAATAAT CGGGTAATAA CAACTATTAT CTCTAAATAG TTATATATAA 1260 

TCATTAATTA AGGAGACATA AAAATGAAAC AACTTGAATC AGAACAACAA TTTGAATCTT 1320 

TAAAACAAGG TGCTACAGTA TTTGAATTCA CTGCAGGCTG GTGTCCAGAT TGTAGAGTGA 1380 

TAGAACCAGA TTTACCGGAA TTAGAAGCGA GATATCCTAT GTTTGACTTC GTATCAGTAG 1440 

ACCGTGATAA ATTTATGGAT ATTTGTATTG AAAATGGTAT TATGGGTATT CCAAGTTTTC 1500

TAGTATATAA AAATGGAGAA CTGCTTGGAA GTTATATTGG AAAAGAACGA AAATCAATTG 1560 

AACAGATAGA TGCATTTTTA GCTCAATACG TGTAATTTAG ACTAGAGAAA AACGGGGTAA 1620 

TACTCGTTTT TCTCTGTTAC TATGTGTTGA TTTATTGTAA ACTATTATAA 6GTGCGAAAT 1680

TAGGAGTGTT ACATATGAAT ACCTTTCAAA TGAGAGATAA ATTAAAGGAA CGTTTAAGCC 1740 

ATTTAGACGT TGATTTTAAA TTTAATCGTG AAGAAGAAAC TTTGCGTATT TATCGAACAG 1800 

ATAATAACAA AGGTATCACG ATTAAACTTA ACGCTATAGT CGCAAAATAT GAAGATAAAA 1860 

AAGAAAAAAT TGTAGATGAA ATTGTTTATT ACGTTGATG 1899 

(2) INFORMATION FOR SEQ ID NO: 660:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3774 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 660:

GTATAATATC CTTTCAATCT GTTTTCATAT TTTATATATT TTTTAAATAT AAGTGCTAAA 60 

TGTTTTAACT AAAGCATAGA TTGACAAGAT GTTATACAGA ATTTCAAATT CTATCCAATA 120 

TTGTTCGAAG TGTAGTATCA CTGGATTGGT ATTAAACAAT GTAAAGGAGA GATTGCAAAT 180 

GCCGTATAAT TACAAGAAAC AAAATGGAGA GTTAATGTCT GTAATGAGCC AAGGTGAAAA 240 

GTTTATTCAT CAATCACCCG TTAAT6ATGA ACTTAGTGCA TTGATTAAGT TATTAATTTC 300 

TAAAATTAAC GGTTGTCATT ATTGTGTTQA TATCCATAAA AAAGAATTAA AGGAATTGGG 360

TGTAaCACAA ATGAAAATTG ATGAAGTCTT GAGTTTTAGA CATTTAGATT TATTTACTGA 420 

TCAAGAAAAA GTGACGCTTG AATTTGCAGA AATGTTAAAT TCAATCAAAG ACTTTAAGAA 480 

GTTTGAAATT ATTGACCGGC TAAAATCATT TTATGATGAA GAACAAATTA TTGATCTTGT 540 

CTTTGTTGTA AACCAAATTA ACGGTTGGAA CAGATTAAAT ATTATTAGTG ATAGACTATA 600 

ATTGTTCATA TAAATGCAGA GTTTCATCTC GAACGCTATA TCATAACAAA TCATGCCACT 660 

ATACAGGTCA AATCTTGTAT AGTGGCATTT TAATTTATCC CTTTGAATAC TGTTATTTAA 720 

CGAATATCGG TCCACCTGGT CCAACAGGGA TACCTAATAG GAACCAAATG ATGACAAATA 780 

CTGTCCATAC AATACTTAGC GCGATTGAAT ACGGCATTAA ACTAGAAAGT AAGGCTCCGA 840 

GTTTCATGCG TTTATCGTAT TTTTGTGCAT AAGTTAATAA TAAAGGTAAG TACGGCATCA 900 

TCGGTGTAAT TGGATTGGTA ATTGAATCGC CTACACGGTA AATGACTTGT GTGAATGCGG 960 

GATGAAAGCC GATAAGGATT AACATTGGTA CGAATATCGG TCCTAAAATA CCCCATTTAG 1020

CCGATGCGCT TCCGATTAAC ATGTTGACCA TTGCACTCAG TACAATAATA CCTAGTATCA 1080 

ATACAATACC GTTTTGATGT TCTAATAATT TGGCACCTTT AACAGCAGCG ATAATTCCTA 1140 

AATTACTCCA CTTTAAATAC GCAAGTAGCT GTGCTGCAAA AAACACAATA ACGATAAATG 1200 

TTCCCATTGA TCCTACAGCA TCGCCGAACA TTTTACCTAA GTCTTTTGTA TTTTTAATTT 1260 

CTTTGCTTAA AATCCCATAA ACTAATCCAG GTACTAAAAA TACGACAAGA ATAATTAATC 1320 

CGACACCGTT AATTAATGGC GCATCGTCTA GTAAGCTGCC TGTTTTAGCA TTTCTTAAAA 1380 

AGCTATGTTC AGGAATGGCT GTAATAATTA ATAAAATAAT TCTGACTATG AAACTGATAT 1440 

CATCATGCAT TAAACTGTCA TCATATTTTC CTAATCTAGG AATAATGAGC TTAGTTGTAA 1560

CTAGCAATAT CGTAGGAAGT AATACAACGA CACTCGCTGC GATAAAGTAC CAGTTCATAG 1620 

CAACGTTTGT TTTAATAGAA TCTGAAACGA TACGTGTTGC CGGTTCTGTA AATGAATAGA 1680

CCAAAGCATC TTGCATACCA ACAACTATAT TTGCTGCAAA TCCTCCAACA GCGGAAGCAT 1740 

ATGCCATCGT TAGTCCAGCG ATAGGGTGAT AGCCAATTTT AATAAAAAGC ATTGCTGCAA 1800 

GCGGCGGCAA GATAATTGTC GCAGCATCGC CGGCTGTACT ACCTAAAATA CCAATTAATA 1860 

TAATAGTCGG TAAAATTAAG AAACGTGGTG CGCGATTCAC AACAGAAATC ATTAACTTAT 1920 

CGAAGTATCC TGTTTTCTCT GCAACACCAA TACCAATCAT CACTGCTAGT ACTAAGCCTA 1980 

ATGCTGGGAA CTCTGAGAAA TTTTTAATCG TATCATTCAT TATCATCGTA AATCCATCAT 2040 

GGCTAATTAT ATTTTTAATA TAAATGGTTT GATGCGTACC TGGATGCTTA ACAGATACAT 2100 

TAAATAATGA GATAACCCAT GTCATAATGG CTAAGCCCAC ACACATTAAA AAGAATAAGA 2160

CGCTAGGATC TGGCAATTTA TTTCCGATTT TTTCAACACT ATTCAAGAAA CGATTGACGA 2220 

TAGACCCCTT TTGTTGATGT TTTGATGTCA TCAATTATTC CCCCCTTTGT TAAATATTTA 2280 

AAGTGTAACA AAAAATACTC TCAAAAGTAA CAATTTTCAG GAAATAAAAA AACTAATATT 2340

GTTAAATATT TTGAGTTATT CAATAGAAAG TGTATAGCAG AGTAGTTAAG ACTGCCTX3AA 2400 

GACTTATCTA TTAGGTTTAT GAAGCATCGA ACAGTGGAAA ATAAGGACTG TAAGTTTAAG 2460 

ATATGTTGTA TAGGAGTGAC TGAATGAAAC GTTTGGAAAA TAAAGTAGCA GTCGTAACAG 2520 

GAGCAAGTAC AGGTATCGGT CAAGCTTCTG CAATCGCTTT AGCTCAATAA GGTGCGTATG 2580 

TATTGGCGGT AGACATAGCT GAAGCGGTAT CGGAGACTGT CGATAAAATT AAAAGTAATG 2640 

GTGACAATGC GAAGGCGTAT AATGTGGATA ATGCAAGCGA ACAACAAGTG GTAGACTTTG 2700 

TGTCTGACAT AAAGGAACAG TTTGGAAGAA TCGATGTGTT GTTTAATAAT GCCGGTGTGG 2760

ATAATGCGGC TGATAGAATT CATGAGTATC CAATAGATGT GTATGACAAG ATTATGAATG 2820

TAGATATGCG TGGGACATTT TTAATGACGA AAATGATGTT ACCTTTAATG ATGAATCAAG 2880 

GTGGCTCTAT TGTTAATACG TCTTCATTTT CCGGACAAGC AGCAGACTTG TATCGCTCTG 2940 

GATATAATGC TGCGAAAGGT GCAGTGATTA ATTTTACAAA ATCAATCGCA ATTGAGTATG 3000

GCCGTGATAG CATTCGAGCC AATGCGATTG CACCAGGTAC AATTGAAACC CCGTTAGTAG 3060 

ATAAACTGAC AGGTACGAGT GAGGATGATG CAGGTAAAAC ATTTAGAGAA AATCAAAAAT 3120 

GGATGACTCC GCTGGGACGT TTAGGTAAAC CAGAAGAAGT TGCTAAATTA GTAGTCTTCT 3180 

TAGCATCTGA CGACAGTTCA TTCATAACTG GAGAGACGAT TCGAATTGAT GGTGGTGTGA 3240 

AATTATTTAA AATCGATATC CAGTGGAAAA GAATTTGGCA TCAAGTAGGC AATGTATGCA 3360

TATTGTCGCA ATGAAAGACA TTTAAAGCGG AGATTAATTA ATCTTTGAAA ATCACATATT 3420 

GTTCATTTGA AGTGTATTGC TTAAAGCTTG TTTTAACGTA TTGTAGGTCG TGCATCTAAT 3480 

TTATCAGAAT CTCGAGTTCA AACTTTTGGA GTATCTTTAA AATAAGCTGT ATTTTGTCTA 3540 

TTCTAATAAA TTAAGGAGAA TTTTATGTTA AAAGAAAAAG AAAGTTTTAG ATTGCTATAT 3600 

CAAGCTATAA GAGAGATTGC AGATAAAATT GGAGATAATC AGTTAGAAAC TAATTCCGTT 3660 

AGTTTATtAT tATTGGACTT TGATTTTGAA CATGAAGTAT TTGATGAATT GTATCTTGTG 3720 

ATTTtAAAAT ATTTAAATAC AGTAAGTATA GAGAACATAA GTCATAGTGA GCTT 3774

(2) INFORMATION FOR SEQ ID NO: 661: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1078 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 661: 



TACTGGTTTT GGATTTTGGA GGTAATACAG TACCTAATAG TAATnATGGT GTCGTCAnAA 60

TTATATCTTT CCGTGATGTT CTTACTCATT AGAACATCnG CCTTCAGAGG AATCATGATA 120

CGAGGAATAA GAAATTTAAA TGTGAGCGAA GTCAATATAG TATTTGCGAT TATTTTTATT 180

AACTCCATTA TTATTGTTAG TTTGATTTTT CGAGGATAAC TTCAATITTT GCATTTTGAG 240

GTTTTTTAAC ATATCTATTT GCATCAGTTG ATGGCAACCT TTTACTTAAA TCTATTGTGT 300

AGTTATTGTC TGCACCTGTT ATTTTAATTT GTCCTTTATT ATAAGAATTA TTATATAATT 360

TTTTACTTTT AATTAATGTT TGACGAATAC GAAAATCTAA TTCTTTTAAA GITAAAACAG 420

GCTTATTGCC TTCATAAACT GGAAATCCGC CAGTAAACGT TTCTGCTTTA TCTTTATATG 480

TTACATTCAG TTTATAGTGT TTATCGTTAG ATGTTGCTGC AGGAGTAACA CCACCAGTAA 540

ACGTTTCTTG AGATAATGCA AAAGaATCAA TGGTTTCTTO GTCTTTTATG CyAAAAATAT 600

CAACGCTTTT ATTTCTTAAT TGGTTGATAT TGCCCCAACT TTCAGGTCCA TAAACTTGAA 660

TATGACTATA CCAAGaAAAC TGTAACAACG TTGCATGAAT CgTACCGTTA TCTTTTTGCC 720

ATAACGTACT GTTAGAGAAG GTTAAATATT TTTGCGAGTA ATATTTAGTT AACTCATTAA 780

CGTTAGTTTC GTTTTGATTT ATATAAtAAG CTTTCGCTTC AGATGAAGAA TTGATAkGTG 840

TATTAGGAAA TTGTGTAGAT GCTGTACCTA ATAGTAACAA TGTTGTTGAT AAAATAATTT 900

ATAAAAAGGG GTTAATTAGA TAATTGAAAT TATCCGCATT TACAAAAGGT AATAGGTTAG 1020
TTAGATTTTT CGAGTATGAC TcAATTTCTG CATTACGAGG ATTTTTAACA TAACGGTT 1078 
(2) INFORMATION FOR SEQ ID NO: 662:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1398 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 662:



AACTTGCTTT TTAGTATCTA CGTTAATATT AATAGCTTTA ATTTCACTTG TATTAATTAA 60

ATCAGCTGTG TAAATACCTG CTTTCAAATC GATAACTTTC TTTGTTCCAT TTTTAAAGTA 120

AACAGTATAT TTCGCTTGCT TCGATAGTCT TAAATCTATA TCACTAATAC CTCTGTCTGA 180

TTTTAAAACT GATTTAACTC TATCCTCTAA ATCTTTATAA CTAATATTTT GATTCTTATT 240

AAATGTTAAG CTTGATAAAA TATTTTGGCT TGTACOGTTC ACAGTGaTTG CATATGGAAC 300

ATGGACTTTA GAATATCCAT GGTGTAACGA ACTTGATGAT TTATCTAATG GCTTAGCTGC 360

GGCAGACGCT TCATTATTAT TAAAGTTTGC ACCTGTTGAT GCTAAAACAC CTAATGCTAA 420

AGTTGTTGTA ATCAATGACT TAAATTTCAT AAATTATCTC TCCTTTTTTG TGTAATTCGT 480

ATTTGCAACT TAATTATAGC CAGACTTTCT CTATTTTTTG AATTAACTGA ATATTAATAA 540

TAAATTATCT TTAACAATAA TTTTTTAACA CTGTTAAAAG TTCTTTTAAT TTTGATTAAC 600

TAATTAATTT ACAATACCTA AAATGTTGTT TGGTTTTGTT TATACCAAGC TTCAAACTTA 660

AATGTCATAA CAACATTCAT TTCTTAATTC CTATTAGATT TGTCGATTAT ATTTACAGCA 720

TCTTTATACT CAAAAAACAT TTACTTAAAA ATATAAATTC GATTTAATAA TTAATTTAAA 780

TTTAGTTAAT CAATTTTGCA TCTATTTTGT TGTAAGCTAT ATAAAAGGAG TGATAATGAT 840

GGTGAAAAAA ACAAAATCCA ATTCACTAAA AAAAGTTGCA ACACTTGCAT TAGCAAATTT 900

ATTATTAGTT GGTGCACTTA CTGaCAATAG TGCCAAAGCC GAATCTAAGA AAGATGATAC 960

TGATTTGAAG TTAGTTAGTC ATAACGTTTA TATGTTATCG ACCGTTTTGT ATCCAAACTG 1020

GGGGCAATAT AAACGCGCTG ATTTAATCGG ACAATCTTCT TATATTAAAA ATAATGATGT 1080

CGTAATATTC AATGAAGCAT TTGATAATGG TGCATCAGAC AAATTATTAA GTAATGTGAA 1140

AAAAGAATAT CCTTATCAAA CACCTGTACT CGGCCGTTCT CAATCAGGGT GGGACAAAAC 1200

TGAAGGTAGC TACTCATCAA CTGTTGCAGA AQATGGTGGC GTAGCQATTG TAAGTAAATA 1260

CAACAAAGGC TTT6TTTATA CAAAAATAGA GAAAAATGGT AAGAACGTTC ACGTTATCX3G 1380

TACACATACA CAATCTCA 1398

(2) INFORMATION FOR SEQ ID NO: 663:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1200 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 663:

TTGTAATTGG AACGTACGAA CTTTTCTAGG TAAGAAACGT CGAATCTCGT CCTCATTATA 60 

ACCAACTTGT AGTCGTTTAT TATCTAAAAT AATTGGACGA CGTAATAAGC CAGGATTATC 120 

TTGAATGATT GAATATAAGT CTTGTAATGG TAGTGAATCA ATATCAACAT TTAATTTTTG 180

GTATGTTTTA GAACGTGTAG AAATGATTTC ATCAGTACCG TCTTCAGTCA TTTTTAATAT 240 

TTGCTTAATT TCATCAATTG TTAAATGTTC AGAAAAAATA TTACGCTCCG TATACGGAAT 300 

GTCcATGTTC tTGkTAACCA TGCTTTCGCT TTTACGGcAA GATGTGcAaC TTGGtGaAGT 360 

aAATAATGtX ACCATACATC TCACTCTCCT ATTTGAATGA ATAAAATTCA TTGCTTAAAA 420 

TTTAGTTATA GATCAAGAAA AAACATTTTT TTCTAAAATT CTTAATCGTT ACTATTTATT 480 

ATAACTATCT AACATTAAAA TTAAATGAGA AAAACCTAAT TTTTCAGATA AGTTTCTACA 540 

CTTATAAAAA AGATTATTAA TCCCTTTGTT AGTAGTAAGT TATACGTATA TTCTAACACA 600 

TCTTACATTT TTAAGAAATA CTGTTATAAT GATAATTATT AAAATATTAC TAAGAAAGTA 660 

GGCATTTAAA TGGAGACATT ATTTTCAGGC ATCCAACCTA GTGGAATTCC TACTATTGGA 720 

AATTATATTG GCGCACTAAA ACAATTTGTT GATGTGCAAA ATGACTATGA TTGTTATTTC 780 

TGTATCGTAG ATCAACATGC AATTACAATG CCACAAGATC GTTTAAAATT ACGTAAACAG 840 

ACCAGACAAT TAGCAGCGAT TTATTTAGCT TCTGGTATAG ATCCAGACAA AGCAACATTG 900 

TTCATACAAT CTGAAGTCcC TGCACACGTA CAAGCAGGAT GGATGTTAAC TACGATTGCT 960 

TCTGTTGGAG AATTAGAGCG TATGACGCAA TACAAAGATA AAGCTCAGAA AGCAGTTGAA 1020 

GGTATACCTG CTGGTCTATT AACATATCCA CCTTTAATGG CAGCTGATAT TGTTCTTTAC 1080 

AATACTAATA TCGTTCCAGT TGGAGATGAC CAAAAGCAGC ATATCGAATT GACTCGTAAC 1140 

CTTGTAGATA GATTTAATAG TCGCTATAAT GATGTGCTTG TGAACCTGAA ATTCGTATGC 1200 
(2) INFORMATION FOR SEQ ID NO: 664:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 787 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 664:



TGTACCTTTT GCTTTAATAA ATACTGTTTC TTTATCATAA TTAGGTTCAG TTAAGAAATT 60

AAATTGTAAG CTTTGAGTAA TATTTTTTTG ACTATCACTT GTTGTAGCTG TACGTGTATA 120

CATTTTAGTA TCACCATCAA GATTCTTCTC AGAAACTTGT TTGATTTCAG AATTAATCTT 180

TGCAAATGAA GTAGCTGGTA ATACAGTGAA AGTAGTCGAT AGTGCTAAsG wACAAATTGT 240

6ATATTTTTA CATAGTTGTT TAATCATTAG TAATCCGCCC TTTCAATATT ATCCTTCTTT 300

AAIAI 1 lAXv.. AAVAxAWXlXA A\^\jxiini\i TTTTCCAATC 360

AACTTCATAA GTGACAATTA ATCTTTGACC ATCTTTATTT TtcTCTAAAA TTGGAGGTGC 420

ATAATGTATy CCAGGTCTGT TTTTCAAAAT ATCTTGATTT CGTGTGTATG TTACTTCAAA 480

TTGCGTTTTC TCATTTGACT TTTCATTAGA TAAATAAGTT AAAAATTCTG GATTAAAGCC 540

ACTTCTTACT AATGCTGGGT ATCTATATTT TGAAGCAAAG CTTAGTTCaG GGTTTTCTAC 600

AGTAGCAATT CTCGTATTTC TATAGAATAA TAATTCATCA TTTCTATTTT TCACTTCTCC 660

ACCATACTTC AAGTCATTCG CAATAACTGA CCAGTGTACA TGgCCAGTTn ATTTATTTTT 720

ACCGCTGGCA ATTGTGTCAT AATTTGCTGC ATCATCAACT AATCGTTTTG GAAGTAGCTA 780

TTTGCTG 787



(2) INFORMATION FOR SEQ ID NO: 665: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 533 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 665:

TTATCTGTCA TTAACCACTG GTACATGTGA ACCCGGCnTn AATCAATATA TATATTTAAA 60 

AGCAAAGGCG CGCCATGTGC CTTTtTTnTA tTTTGTATAT CTGTATCAAA ATCGATTTGA 120 

TTAAAATCCG CTTTATTTTA TCATCTATTC AAATGATTTT AGTGCGCTTA TTTTACTATG 180 

GCATTATTGC CTAACCTGTT TGAGATATAT TAATATTTGA TGATTCGTAT TCAAAGTTTT 240 

CATTTAACAT GTATTTAGTA TCATGATAGC TGCTTCATTG ATGATATACT ACCTACTTTT 300 

CAGAGGCAAT ATTGAACAAT TCTTCATCAT GACTATTTTC CATCACATAA CTATGCTTAG

CGAACGCTAA CATATCTTTA TCATTATTCG CATCTCCGAA GGCCATGAGC TCTGAAGGAG 

ACATTTCCCA TTTATCTAAC AATCGTTTTA ATGCCTGnCC TTTAGTCATG TTT 

(2) INFORMATION FOR SEQ ID NO: 666: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 666: 
GACTTCTCTT GCACTAAAAA TTGCTCAAAT TGCTCGCTGT CATAACCAGG CGGTGTTTCC 60
AACCATACAT ATATGCCACC TTTAGCATGA ACAAATGGCA AATCAGCTTT TGCAAGCATG 120
GCTTCGAATC GGTCACGACG TGTTTTAAAT ACATTGCTTT GTTCTTCTAA AAAATCATCA 180
TAATGGATTC AAAGCATATA TTGCGGTCAT CTTGTAAATG CACCAAACAT CCCAOCATTT 240
GTGTGCGTTT GGTACTTTTT CAAAGCTTGA AATCATATCT TTATTACCAA CTGCCAAAAC 300
CGACTCTGAA AACCTGGACA TGTTGATGAA CCTTTnAGAC AAGGGAAGAA ATTTGCAATC 360
GCAACATCTT TGCCCATTTT nCCGAAGCAA GTGnGACTAG 400
(2) INFORMATION FOR SEQ ID NO: 667: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 412 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 667: 
TCTCCAATGC TACTCAAATT AAAAAGGTTT TAAATATTGT TAATTCTGAT CCAGAGCGAA 60
AGATTGTTAT CGTTTCTGCT CCAGGTAAAA GACATGATAA TGATATTAAA ACAACTGATT 120
TGTTAATCAG ATTATATGAA AAGGTCATTA ATCATCTTGA TTATCATGAT AAAAAAAGAG 180
AAATTATTCA GCGTTATGAT GATATTGTAA AAGAATTGCA AATGGATGAA AGTATTTTAC 240
GGACGATAGA TGTGACTTTG GAACATTATA TAAATCAATT AAAAAATGAA CCAAAGAGAC 300
TATTAGATGC ATTACTTTCT TGTGGTGAAG ATTTTAATGC GCaAyTGAtA GCyTTATATA 360
ATAAtAGTCa gGTtACCAAC AAmATwTATA TCCcCGAAGG AAGCGGGTAT TT 412
(2) INFORMATION FOR SEQ ID NO: 668:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 400 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 668:

TTACTCGTTA AAGATATTGT AAAAGATGAA GTGACAGAAT ATGACATTCA TCAAATGTTA 60 

CCGCATCCGA TTAATATGGT AAGGGTTAGA CTTTTTGGTG TGAAATTAAA AGAGATTATA 120 

GCTAAAAGTA ATAAACAAGA nTATATGTAT GAACATGCAC AAGGTTTGGG TTTCAGAGGG 180

AATATATTTG GAGGATATAT TCTTTATAAT TTAGGGTACA TTCATTCTAC AGGGCGTTAC 240 

TATCTGAATG GAGAAGAAAT CGAGGACGAC AAGGAATATG TACTAGGTAC GATAGATATG 300 

TATACGTTCG GTCnTATTnC CCAACATTGA AGGATTACCA AAQAGTATTT AATGCCAGAG 360 
TTTTTCAGAG GTATATTTAA nGAAAATTAT TGGCTATTTA 400 
(2) INFORMATION FOR SEQ ID NO: 669: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1236 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 669: 



TGTTCGATAT TTTl'AAATTT ATCTTTTAAA TACAACAACT GTTTCCGTAA TGATTTAACT 60

GTTTTATGAC TAATGCCATT GAATATTTCT AGCGTTTTAT TTAACTTATC GATAATCGCA 120

TGTAAATCCT TCAAAATGTC TTTTGTTTCA AAAGTAAATA CATTATGGAA GCGATGAATA 180

TCATCATCAT AAACATCAGA ATCATTGATA ATCGTAAATA TCGTTGAGAA CAATTGCTCA 240

TTTAACTCAT GAATCTCATT CATACTAGCC TTCAAGCCAA AAATATCAAT TGGTGCAATA 300

TCTAATTTTT CCAAAATTCG CT6CTTTTCC AGTTGATCAA TTGCCTTTAA CAATTTTTCA 360

TTTTCGTTTT TACC7UVTCAA ACCAAGCTGA TATTTAATAT CAGCATAACT CAACTCATTT 420

GTCACTTGAT TTAAGGCATA GTCTGGTAAG CGATGTGCTT CATCCACTAT ACAATCATCA 480

AACAATTGAT ATATTGAATT TTCAACATCA GAATGaATTA AATGTGCATG ATTTGTAATA 540

CCAATTTGaA TGTTCTGTGC ATTTCGCTTA ATAAAATTAT aATAATGAAC ATCGTGACGT 600

GCCGGTACAT ATGTTTCAAT TTTCTGGTcA AAATACATCT TTTGACCACC TTTTAAATTT 660

ATATTCACtT CX5TAATTACT TGTGTCATCT TTTAAAATTT OACTAATAAG CCCCAATGAA 780

ATGTAATCaC TTtTACTTTT AATCAATAGT GCATTAATTT TAAAATTCAA CGCTTCATTC 840 

ATTGCTGGAA TATCTTTTTC TAACAATTGA CTTTGCAGTA ATTTAGTATT GGTAGAAATC 900 

ATGACATGCT TCCCAGTTTC AATATTATAC ATCAAGGCCG CAAGTAAATA TGCTAATGAT 960 

TTACCACTGC CTAGTGATGC TTCAATCATT GCTTTTTCAC TATGCATGAG CTGATCTAAT 1020 

ATAGTTTCCG CTAAATATAA TTGTTGCGGT CGATATGTTA AGCCAAGTTG ATCTACAGCT 1080

TTGCTATATA AAGACTTCAA GCTGCCATTA TAATTTGTTG TCGGCTTTTT AAAATCAACT 1140 

TGCTTACGAT AGATAATCTG TTCGAACTTT TCGTACGATT TATCCAATGG CTTTGCATCA 1200

TATTGCCTAA CCATCTCAAA GAAAATATCA TACAAA 1236 
(2) INFORMATION FOR SEQ ID NO: 670: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1819 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 670: 



TGACCAATAT GACTCGTGGA CTGATATGTT TAAAGCACTA CTGCATGAAA 60

CATTTAAAGC ATATGGCGTT CTATTTATAG ATGCGCAGTT TGAGCCGTTA AGAAAAATGG 120

AAGCGCCTAT GTTTAAAAAG ATTTTGAAAA AACATCAGTT GCTTGATGAT GCTTTTAGAG 180

CAACACAACA ACGTACTCAA AATCAAGGCT TGAATGCGAT GATACAAACA GATACAAATG 240

TTCATTTAIT CTTACATGAT GAAAATATGC GTCAATTAGT tTCGTATGAT GGTAAGCAyT 300

TTAmATTAAA TAAAACAGAT AAGACATATA TAAAGGAAGA AATTATAAAT ATTGCGGAAA 360

ATCAACCkGA ATTATTTTCT aATAATGTAG TGACAAGACC ATTAATGGnA GAATGGTTAT 420

TTGAACACGG TGGCATTTGT TGGAGGACCG AGTGAAATTA AGTaCTGGGC TGAACTAAAA 480

GATGTATTTG AACTATTTGA TGTTGAAATG CCtATCGTGA TGCCAAGGCT TAGAATTACT 540

TATTTAAATG ACCGTATAGA AAAATTACTT TCGAAATACA ATATTCCATT AGAAAAAGTG 600

TTAGTCGATG GTGTTGAAGG AGAAAGAAGT AAGTTTATTA GAGAACAAGC ATCACATCAA 660

TTTATTGAAA AGGTAGAAGG TATGATTGAA CAACAGCGTC GTCTAAACAA AGACTTATTA 720

GATGAAGTGG CGGGGAATCA AAATAATATT AACCTTGTGA ATAAAAATAA TGAAATTCAT 780

ATACAACAGT ATGATTATTT GTTAAAACGT TATCTTTTAA ACATTGAAAG AGrAAACGAC 840

GAAAGAATAT GGAATCCACT TCAAATTTTG AATGATTTTG GGACAGATGT GTTCAAGCCC 960

TCCACCTATC CACCACTTTC TTACACTTTT GATCATATTA TTATAAAACC TTAATATACC 1020

AAGGGTTTAG CCCGATTTAT CTTAATGATA AATCGGGCAT TTTTTTGTTT TTTAAAATAA 1080

ATTTCACAAA TTTTGTATAA ATAGTGGTGG ATAGTGGGGA GATGTGGTAA ATTATATATA 1140

AGGTGAGGT6 ATAAAAAATG TTCATGGGAG aATACGATCA TCAATTAGAT ACAAAAGGAC 1200

GTATGATTAT ACCGTCCAAG TTTCGTTATG ACTTAAATGA GOGTTTTATT ATCACAAGAG 1260

GCCTTGATAA ATGTTTATTC GGTTACACTC TAGACGAATG GCAACAGATT GAAGAGAAAA 1320

TGAAAACCTT ACCTATGACA AAAAAA6ACG CACGTAAGTT TATGCGTATG TTCTTCTCTG 1380

GTGCTGTTGA AGTAGAACTT GATAAGCAAG GGCGTATTAA CATCCCTCAA AACTTGAGGA 1440

AATACGCTAA TTTAACTAAA GAATGTACAG TAATCGGTGT TTCAAATCGT ATTGAGATTT 1500

GGOATAGAGA AACTTGGAAT GATTTCTATG AAGAATCTGA AGAAAGTTTC GAAGATATTG 1560

CTGAAGATTT AATAGATTTT GATTTTTAAA ATGGAGGAAT TGAAGtGTTT CATCATATCA 1620

GCGTTATGTT AAACGAAACC ATTGATTATT TAAATGTAAA AGAAAATGGT GTGTACATTG 1680

ACTGTACGCT AGGTGGAGCG GGACAnGCCC TTTATTTACT AAATCAATTA AATGACGACG 1740

GAAGATTAAT AGCAATCGAT CAAGACCAAA CTGCAATTGA TAATGCTAAA nGGGTATTAA 1800

AGGATCATTT GCATAAAnG 1819



(2) INFORMATION FOR SEQ ID NO: 671:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 609 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 671:
ACCAATTATT GGACAGGAAT TAGCAAAAAA TGCAATGCTT GCATTAATCT ATGCATCGAT 60 

AGGTATCATC ATCTATGTAT CATTACGATT TGAATGGCGC ATGGGTCTTT CATCTGTATT 120
GGCATTATTA CATGATGTAT TCATTATAGT AGCGATTTTC AGTTTATTTA GAATTGAAGT 180 
AGATTTAACA TTTATCGCCG CTGTATTAAC AATTGTCGGT TATTCAATTA ATGATACAAT 240 

CGTAACGTTT GACCGTGTAC GTGAAAACTT ACAAAAGGTT AAAGTGATTA CGACAACAGA 300
ACAAATTGAT GATATCGTTA ATAGATCAAT tAGACAGACA ATGACACGTT CAATTAATAC 360 
AGTATTAACA GTTATTGTAG TAGTAGTTGC TATACTATTC TTCGGTGCTC CTACGATATT 420 

T6CCGTTCCG CTATGGGGAA TAATGAAAAA ACGTCAGTTG AAAAAATCGC CGAAACACAA 540

ATTAGTTGTA TATAAAgAAA AGAAATCGAA CGATGAAAAG ATTTTAGTTT AAAAtGaATT 600 

AAGCGGTAT 609 



(2) INFORMATION FOR SEQ ID NO: 672: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2222 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 672:

CTTAAAACAG CAATTTGAAC GTTTACAAAA TGAACAAATC TTTGTTTATG TTTGTCATGG 60

TAATCACGAT CCTTTATCAT CAAAGATTTC ATCAAACTGG CCAGATAATG TtnTTGTATT 120

TTCAAATAAA GTTGAGACGT ATGAAGCAAT TACTAAATCT GGTGAAACAA TTTATATTCA 180

CGGATTTAGT TATGAAAATA GAGCAAGTTA TGAGAACAAG ATTGATGAAT ATCCATCAAG 240

TCAAGGCCAA AAAGGCATAC ATATTGGTGT CTTGCATGGT ACGTATAGTA AATCTTCAGT 300

TAACGAAAGA TATACCGAGT TCATTTTAGA AGATTTAAAC AGTAAATTGT ATCATTATTG 360

GGCTTTAGGT CATATACATG AACGTCAACA ATTAAGTGAT ATGCCTGTAA TTAACTATTC 420

AGGTAATATT CAAGGTAGAC ATTTTAATGA GCAaGGTGAA aAAGGTTGCT TATTAATCGm 480

GGGTGACCAC TTAAAATTAA AGACTAAATT TTATCCTACA CAGTATATTA GATTTGAAGA 540

AGCAACTATT GAAACGGATA AGACATCTAA GCAAGGTTTA TACGAGGTCA TTCAAAACTT 600

TAAAGAACAA GTGAGAGAAG AAGGAAAAGC CTTTTATCGT TTAACGCTTG TTATTAATAG 660

TGAGACATTA ATTTCACCTC AAGATTTATT ACAAGTTGAA GAAATGATTA CAGATTATGA 720

AGAAAACGAA AATCAATTTG TATATATTGA TGAGTTAAAA ATACAATATG CACAAAATGA 780

TGAGTCACCT TTAGTTAATG AATTTTCAGC GGAATTATTA GTCGATCAAA CTGTTTTTGA 840

TAAAGCGATG TCAGATTTAT ATTTAAATCC AAGGGCATCT AAGTTCCTAG ACGATTATGG 900

AACATTCGAC CATACAGCAT TAGTTAATCG TGCTGAAGAA ATATTAAAAG CTGAAAT6AG 960

AGGTGAACAA AATGATAATT AAATCACTTG AAATTTATGG TTACGGTCAA TTTGTTCAaC 1020

GTAAAATTGa ATTTAATAAA AACyTCaCTG AAAITTTTGG TGAAAATGAA GCGGGTAAAT 1080

CGACGATTCA AGCATtCATC CATTCGATAT TATTTGGATT TCCAACTAAA AAGTCTAAAG 1140

AGCCAAGACT AGAACCACGT CTAGGTAACC AATACGGTGG TAAATTAGTA CTTATTCTTG 1200

TATATTTACC TAATGGTGCT GTGCGTGATG ATGCTTGGTT ACAAAAGAAA CTTAATTATA 1320

TTTCTAAAAA GACATATCAA GGTATCTTTT CATTTQATGT ACTAGGGCTT CA/WSACATTC 1380

ATAGAAATCT AAATGAAAAA CAATTGCAAG ATTATTTATT ACAAGCmGGG GCTTTAGGAT 1440

CAACTGAaTT CACGTCAATG CGCGAAGTGA TTAATCGTAA AAaAGATGAA TTATATAAAA 1500

aATCAGGTAA AAATCCGATC ATTAATCAAC AAATTGAGCA ATTAAAACAA CTAGAAAGTC 1560

AAATTCGTGA AGAAGAAGCA AAGCTAGAAA CATATCATCG CTTAGTAGAT GATCGAGATA 1620

AATCATCACG TCGATTAGAG AATTTAAAGC ATAATTTAAA TCAATTATCA AAAATGCATG 1680

AAGAAAAACA AAAAGAGGTT GCTTTACATG ATCATTCACA AGAATGGAAG TCTCTAGAAC 1740

AACAGTTAAA TATTGAGCCA ATCACATTCC CAGAAAAAGG TGTGGATCGT TACXSAAAAAG 1800

CACGAGCGCA TAAGCAATCG TTAGwAAGAG ATATTGGTTT AAGAAATGAG CGTTTAGCTC 1860

AACTTAAAGA AGAAGCGACT CAATTAGAGC CAGTTAAACA ATCTGATATT GACGCCTTCA 1920

TTAGTTTGAA TCAACAAGAA AATGAAATTA AAAATAAAGA ATTTGAACTT ACTGCAATCG 1980

AAAAGGATAT TGCGAATAAA CAACGTGATA AAGATGAATT GCAATCAAAT ATTGGTTGGT 2040

CTGAAACGCA TCATGACX5TA GATAGTTCAG AGGCAATGAA AAGTTATGTC AGTGAGCAAA 2100

TCAAGAATAA ACAAGAACAA GCTGCATACA TTAAACAATT AGAACGTAGT TTAGAAGAAA 2160

ATAAAATCGA AGATAATGCG GTTCATAGCG AACTAGATTC TGTTGAAGAA AAATAGTTCC 2220



(2) INFORMATION FOR SEQ ID NO: 673: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 406 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 673: 
TTGCAGGTAT CATTTTTTTA ATGCCATATG GATTATGTTT TCTACCGTTT TATAAGCAAA 60 

AAAAGAAAAA ACAGACATTT AAAAAATACA TGGTTTACAC TACGATTGGT TTOTCAATTT 120 
GTCTAGGCTT ATCTCTAGTT TTGGTTCACA CTACGAAAAT TTATATGGAC GAAGGTGGCG 180 
TAAGATACTA TTACGGTAGT TTTGTAATGA AACAAGCGGG CGGTTATGCT TATTTAGCTT 240
TAGCGGTACT TTCAACGTTG TTAATTGTTG CGAAAAAAGC TACAAATAAA AATAAAGAAA 300 
TCGAAACCGT CGACAATACA AATATAACGG AAAGATAATT AAGGGAGTGC TCATTCAGGA 360 

(2) INFORMATION FOR SEQ ID NO: 674:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 927 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 674:
CCACTTTGTG GATTTTCTTT ATATTCTCCA CGCTTGtACT ACAACTTCTT TCGTTTCTGT 60 
TTCATCACCG ACTGCCGCAT TCGTTAGCAC ATGTAATAAC TCTTTTGCGG TTAATACATT 120
CTCATCTATA ATCTTATCTT TTTGTTCTTG TATATATTGC TTGATGTGCG GCTTTTTCAA 180 
TAACCTACAC GCTGTCACAT GTGCGCTATT TGCGTTATAT CCTGyTTTTA TGGCACTTTG 240 
TGTTACATTC AGTGTTCTAA TATACTCATT CACAAAACGT GCTTGCrrTG CAGTTAACTC 300
ACTCATTTTA TCACCCCCAC AATTTTATCT AATATGGTTT CATACCATAA TATTACAGAT 360 
TGTTCTGAAC AATCTAAGGC ACTACTAATA TCTTGATAAC TAAGTCCTTG TATAAGGGAG 420 
TCAAAAATAT AAAACTCTTT ATCGGTCGCT AATCTOTCAA CAATCATTTC TATGTGATTC 480 
TTTATAATAT GATCATTGAC ATTATCGTCT GTCATCAATT CGTCAGAATC TTCATCACCT 540 
ATTAAAAAGA AATCATCAGT ATTTATTTCA TCATCGCCCC GTTAACTAGC TTTGAAGTCT 600 
TTAGCACACT TGCATATACC GGCTGTCGTG CTGGCkAGAT ACTAGCATTG AAGTGGTCTG 660 
ATATTGATTT TGAAAACAAC ACGATTAGTA TTACTAAAAC ATATTACAAT CCAAATAATA 720 
ACAAGAAGAA ATATCAGATA CTTCCCCCTA AAACTGAAAG TTCTATCGGT AAGATTTCCG 780
TTGACCCAAA TGTAATAAAG GtGtTGCGtG ATTATAAGAT AAACGTTCAA AATAATTGGA 840 
AAAACGAATT ATATAATGAT AACCATTGTA ATGAAAAAAC TATCAATGTG GATACAAACC 900 

ATTATGTCAC GTACCAGCAT ACTCAAA 927 
(2) INFORMATION FOR SEQ ID NO: 675: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1100 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 675: 
GATGTTAAnA TAAGTACATC ATTTTGACTC CCAAATATTG GCTTTAGACC TTGAAATGCT 60 

CTCATAATGG CATCAGGTAC cAGGGGTTGG GaCCAGGTGT TAATAACAAC GGTTGATGAT 180 

AATACATTTA TATGCCTCCT ATAAATAAGA TTTCACTATT TTAGCAAATT TTCTGAAAAT 240 

TTAAAAGCCT AAAAAgtGAT AACGCTATAT TGTCGAACAA ACGTTATCAC TTAACGATTT 300 

TATTTGGCCT GAATGATGGG ATAGATTTTT ATGTCCACAT TATTTCTTAC AGCATTTGAA 360 

ATCATGCAAT TATTATCTGC AATTGTTATC AATTTTGGTA ATCGCTTTTC TAATTGTGCT 420 

ATTTGATCAC TTGGAATTTG AATTTGAGGA TGGTGCACAA TTTTTGACAT ACTGAATTTT 480 

CCGTTATTTA AACAAGCTGT TCCAATCGAT TGTTGTkCAA TTGAAATATC TGTGAACTTT 540 

GCACX5TTCAA GAGTAGCTGC TAATGAGATG ATATAACATG ATGAAGCGGC TGATACTAAC 600 

ATTTCATCGG GATTTGTTCC TATACCAACA CCACCTAAAG AAGCAGGTAT AGAAATATTC 660 

TCTGAAAGTA TGTCGCCTTG AACGTTTCCG ACATTGTTAC GACCACCTTG CCAAGAAGTT 720 

TGGACTTTAA AGTCATGTTG ATGCAATTGC TTTAACCTCC AATATAATTG TGATAGTTTA 780

ATTTTAGAAT AGTTTATCAA AGTTTAACAG AAAGGTGACT TATCAATGAC TCTGAATAAA 840 

CTGAAAGATG AATTACAAAT TGTTTCGCAC CGTGGATTGC CGAGTGATTT TCCTGAAAAT 900 

ACAATGGTCG GTTATCGAGA GGTAATGGGG CTCAATGTTG CTATGTTAGA AATAGATGTT 960 

CATTTGACCA AAGACCAACA TTTTGTTGTG ATACATGaTG AAACAATTGa TAGAACATCG 1020

GaTGGtArGG GGCGTAwTGc TGaTTACACA TTATCGCAAT TAAAATCATT TGATTTTGGT 1080 

AGTTATAAAG ATGTTGCTTT 1100 
(2) INFORMATION FOR SEQ ID NO: 676: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 460 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 676:

ATTAATTTCA TATGGAAATA GTTGAACTAA CTGCGCTGTA TAAGCTTCAA CAGTTTGATG 60 

AATTAACGAT TGATCTTCAA TATATCCGTA GAACAAATCT TCAGTACAAA CTACTTTACC 120 

TTTATCAGGT TTAATTGCAC CTGCCAACAA TTGACCTACC AACGCTTTGG AAGATTCAGG 180 

TTCACCAATT ATACCTAATG CTTCTCCTTG ATAAATATGT AAACTAATAT TGTTTAAATC 240 

GATATCTTCA GCATCATATC CAAAAGGTAA ATACCATTTC TTATTCTGTT TATTCCTATA 300 

GTAGTGTGTT ACTTTTAGTA ACTTTAAAAC AATTGAACTT CCCATCTATT TTCATCCTTC 360 

CCCCACGCaA AAATACCTTT TAATCTTnCT ACTTTAAAAT
(2) INFORMATION FOR SEQ ID NO: 677: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1451 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 677:



GTCAAGTTTT CGGCTAGATT TTAAACGCTC ATCACTATGT AGAATGAAAC TTTCGAAAAA 60

CTGTAAATCA TCATAACCTT TTACATAAAC ATAACCTTCG CCACCAATTG CTTGAATTAA 120

ACATTGGGCG GCCATTTGAA TTTCTAAAGA TTGTTTTTCT AGCCTATTAA AGATACCTAT 180

TAGTTGTGTG TTTAAGATTT TTGACATCTT TATCCTCCAA TCTACTTATA AAATATTGTA 240

ATTAATGACT ACATATTATG CAACGGCTTA AATTGTATAA AAATGTATAC GTTTGCATTT 300

AGTATAACTA TCGCATTTTT CAAAAAATAC ACATTTAATC TGCAGTATTT CAATGCATTG 360

ACGCTATTTT TTTGATATAA TTACTTTGAA AAATACGTGC GTAAGCACTC AAGGAGGAAC 420

TTTCATGCCT TTAGTTTCAA TGAAAGAAAT GTTAATTGAT GCAAAAGAAA ATGGTTATGC 480

GGTAGGTCAA TACAATATTA ATAACCTAGA ATTCACTCAA GCAATTTTAG AAGCGTCACA 540

AGAAGAAAAT GCACCTGTAA TTTTAGGTGT TTCTGAAGGT GCTGCTCGTT ACATGAGCGG 600

TTTCTACACA ATTGTTAAAA TGGTTGAAGG GTTAATGCAT GACTTAAACA TCACTATTCC 660

TGTAGCAATC CATTTAGACC ATGGTTCAAG CTTTGAAAAA TGTAAAGAAG CTATCGATGC 720

TGGTTTCACA TCAGTAATGA TCGATGCTTC ACACAGCCCA TTCGAAGAAA ACGTAGCAAC 780

AACTAAAAAA GTTGTT6AAT ACGCTCATGA AAAAG6TGTT TCTGTAGAAG CTGAATTAGG 840

TACTGTTGGT GGACaAGAAG ATGATGTTGT AGCAGACGGC ATCATTTATG CTGATCCTAA 900

AGAATGTCAA GAACTAGTTG AAAAAACTGG TATTGaTGCA TTAGCGCCAc ATtAGGTTCA 960

GTTCATGGTC CATACAAAGG TGAACCAAAA TTAGGATTTA AAGAAATGGA AGAAATCGGT 1020

TTATCTACAG GTTTACCATT AGTATTACAC GGTGGTACTG GTATCCCGAC TAAAGATATC 1080

OVAAAAGCAA TTCCATTTGG TACAGCTAAA ATTAACGTAA ACACTGAAAA CCAAATCGCT 1140

TCAgCAAAAG CAGTTCGTGA CGTrTTAAAT AACGACAAAG AAGTTTACGA TCCTCGTAAA 1200

TACTTAGGAC CTGCACGTGa AGCCATCAAA GAAACmGTTA AAGGCAAAAT TAAAGAGTTC 1260

GGTACTTCTA ACCGCGCTAA ATAATTAATA TTTAGTCTTT AAGTTATTAA TAACGTAGGG 1320

AATAAATAAA ACAGTTTGAT TTTAAAATOA AAGCGTAAAA ATGGTAAAAT ATATCAAAAT 1440
TGATTGTGAT A 1451 
(2) INFORMATION FOR SEQ ID NO: 678:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 668 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 678:

nGTATTGAAG CGGTTAAACA AACACCTAAT GCAACTGACG AAGAAAAGCA GGCTGCTGTT 60 

AATCAAATCA ATCAACTTAA AGATCAAGCA ATTAATCAAA TTAATCAAAA CCAAACAAAT 120 

GATCAGGTAG ACACAACTAC AAATCAAGCG GTAAATGCTA TAGATAATGT TGAAGCTGAA 180

GTAGTAATTA AAACAAAGGC AATTGCAGAT ATTGAAAAAG CTGTTAAAGA AAAGCAACAG 240

CAAATTGATA ATAGTCTTGA TTCAACAGAT AATGAGAAAG AAGTTGCTTC ACAAGCATTA 300 

GCTAAAGAAA AAGAAAAAGC ACTTGCAGCT ATTGACCAAG CTCAAACGAA TAGTCAGGTG 360 

AATCAAGCAG CAACAAATGG TGTATCAGCG ATTAAAATTA TTCAACCTGA AACAAAAGTT 420 

AAACCAGCTG CACGTGAAAA AATcAATCAA AAAGCGAATG AATTACGTGc TAAGATTAAT 480 

CAGGATAAAG AAGCAACAGC AGAAGAAAGA CAAGTAGCAC TAGATAAAAT CAATGAATTT 540 

GTAAATCAAG CCATGACAGA TATTACGAAT AATAGAACAA ATCAACAAGT TGATGATACA 600 

ACAgTCAgCG CTgATAGCtT GCTTTAGTGA CGCCTGACCA TATTGTTAGA GCgCTGCTAG 660 

AGATGCGT 668 
(2) INFORMATION FOR SEQ ID NO: 679: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1906 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 679:
GATCCAAATT TAAAAGGAAA AATAGCCTTT AACGAATTl'A CGAAACAAAT TGAATGTTTA 60 

GGGAAAGTGC CATGGAATAC TAATTTTAAG ACACGTCAAT GGCAAGACGG TGATGATAGC 120 
AGTTTAAGAA GTTATATCGA AAAGATTTAT GACATACACC ATTCAGGTAA AACAAAAGAT 180 

(2) INFORMATION FOR SEQ ID NO: 680:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 948 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 680:



AATTATTTGA ATAAATTAGC AATTAAAGAG TTGATTTGTC AGTTTAAGyA TTTGAGTGCA 60

TTTGAAAAAG ATGTCATGTA TTTAATGTGT GAACAATATA AGCCGAGAGA AATTGCTCAA 120

TTGATGCATG TAAAAGAGAA AGTGATTTAT AATGCCATAC AACGATGTrA AAATAAAATA 180

AAACGTTATT TCAAAATGAT TTGAAAAGCG CCTTAGGACG TGAATTGAAT TATAACQTGT 240

TACTTACTGA TGGTTTGACA TTTGTTATAA ATTTTATGTA TAGTATACTG GTATTATAAT 300

GAATAAAGGT GAATTATTGT GAGAAAAATA CCTTTAAATT GTGAAGCTTG TGGCAATAGA 360

AATTATAATG TTCCTAAGCA AGAAGGCTCG GCAACAAGAT TAACCTTAAA GAAATATTGT 420

CCAAAATGTA ACGCGCACAC AATTCATAAA GAATCGAAAT AAATACATTC GAAATAATAC 480

TTTGATAATA TGTTCAAAGG ATTTGGAGGT TGAGCAGATG GCTAAAAAAG AAAGTTTCTT 540

TAAAGGCGTT AAGTCTGAAA TGGAAAAAAC AAGTTGGCCG ACGAAAGAAG AGCTATTTAA 600

ATATACTGTA ATTGTAGTTT CTACTGTTAT ATTCTTCTTA GTCTTTTTCT ATGCCTTAGA 660

TTTAGGAATT ACAGCATTGA AAAATTTATT ATTTGGTTAG AGGAGTGAAG ACATGTCTGA 720

AGAAGTTGGC GCAAAGCtTG GTATGCAGTG CATACATATT CTGGATATGA AAATAAAGTT 780

AAAAAGAATT TAGAAAAAAG AGTAGAATCT ATGaATATGA CTGAACAAAT CTTTAGAGTA 840

GTCATACCGG AAGGAAGAAG GAAACCTCCA GTAAnAAGnT GGCCAAGCCT AAAACCGCCT 900

GTTAAAAAAA ACCATTCCCC TGGGnTAnGG TTTTAAGTGG GAATTTAA 948



(2) INFORMATION FOR SEQ ID NO: 681:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 863 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 681: 
ACAnATAATA ACAAAGCGCT TGCTAGTACC TCTTAAAAAG ATGATGCTAG CAAGCGCTTk 60

TCTATACTAT ATATTATTTT TCTAAAATTT TAACACCCTC TTGAGTGCCT ACAATAACTT 120 
GATCTGCCAT ATCTAAGAAG TATCCTGTCT CAAACACACC TGTCAGATGA ATTAAATACT 180 

TATCAGTTAT AAATGCGACA TCTTCGTTTA CACOACGTTC TACTTTTATA TCAGCGTATG 300

ATTCAATTTT ACGTAATATG TGATACCAGT TAAATTTATC CACCTCTACT GGTAACTTAA 360 

AOGTCTCACC TAAGTATTGA ACTATTTTCG TTTCATCX3AC AACCACAACA AAACGCGATG 420

CCATTTCATC TATAACTTTC TCTCTGAACA GCGCACCACC GCCACCTTTA ATTATATTTA 480 

AAGATGGATC TACTTCATCA GCACCATCAA TTGCTAAGTC GATATGATCA ACATCATTGA 540 

TTTCACATAT TTTAATACCT AATTCTTTTG CTAAAAATGC AATTTTATTA GAAGTGCATA 600 

CACCTGTAAT ATTGTAACCA CGTTCTTTAA TTAGTTGCGC CATTTGAGGT AAGAGTAATT 660 

CCATTGTACT TCCTGTACCA ATTCCCAGCG TCATGTCACC ATTGATTkGA CTTAAAACAT 720 

CATTTAATGT CATTAACTTG AGTGCTTTGA CATCTTTCAT GAAGGTAGCC TCCCATATTT 780 

AtGTAATCTA TTCAATTCAT ATTTTACATG ACTCGTATAA ATTAACATAC CCTTATnGCT 840 

AACCATTTGT GTTAAACATA TCG 863

(2) INFORMATION FOR SEQ ID NO: 682:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 480 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 682:
TTCATTTTTG TTGTTCAGCA ATTTTGGTCC GAATTTCAAT TCATCAGATG ATAGCTCTAT 60 

TAATTCTGTA GAATATTCTG CtACAACAAT TTCATAAATA TGGCCTTTTT CTTCCATTAT 120
TATTTCATCa ATTATTTCAT AAITCAATTG TTGTAATGTT TGTCTTAAAT TTTCAGTTTG 180 
GATATTACTT TGTAAAATCA ACCTTGGATG TTGACTTAAC TTATCTTGCC CATCTTTTAA 240 

AATTTTAGCA ATAAGTGGTC CGCCCATACC ACAAATTGTG aTATTATCGA TTACGTCCTC 300
AGGTTGAATA ACACTTAAGC CATCCCCTAA ACGTACATCA ATTCTATCTA CTAATTGGTT 360 
TGCAGCTACA TTTTTCACAG CAGCTTGAAA AGGGCCTTGA ATAACTTCTC CAGCAATAcC 420 

GaTTCGCATA AATGGTTTTG AATTGCATAG ATTGGCAAAT AAGCAT6ATC TGAGCCAATA 480 

(2) INFORMATION FOR SEQ ID NO: 683: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 689 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 683:

CTGCAAAAAA TATTGGTATA ATAAGAGGGA ACAGTGTGAA CAAGTTAATA ACTTGTGGAT 60 

AACTGGAAAG TTGATAACAA TTTGGAGGAC CAAACGACAT GAAAATCACC ATTTTAGCTG 120 

TAGGGAAACT AAAAGAGAAA TATTGGAAGC AAGCCATAGC AGAATATGAA AAACGTTTAG 180 

GCCCATACAC CAAGATAGAC ATCATAGAAG TTCCAGACGA AAAAGCACCA GAAAATATGA 240 

GTGACAAAGA AATTGAGCAA GTAAAAGAaA AAGAAGGCCA ACGAATACTA GCCAAAATCa 300 

AACCACAATC CACAGTCATT ACATTAGAAA TACAAGGAAA GATGCTATCT TCCGAAGrTT 360 

gGcCCAAGAA TTGAACCAAC GCATGACCCA AGGGCAAAGC GACTTTGTTT TCGTCATTGG 420 

CGGATCAAAC GGCCTGCACA AGGACgTCTT ACAACGCaTA AcTACGCACT ATCATTCAGC 480 

AAAATGACAT TCCCACATCA AATGATGCG6 GTTGTGTTAA TTGAACAAGT GTACAGAGCA 540 

TTTAAGATTA TGCGAGGAGA GGCGTATCAT AAGTAAAACT AAAAAATTCT GTATGAGGAG 600

ATAATAATTT GGaGGGTGTT AAATGGkGGa CaTTAAATCC mCGTTCATTC mATATATAAG 660 

ATATATCACG GTAATTGCGC ATATAACTT 689 
(2) INFORMATION FOR SEQ ID NO: 684: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 858 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 684:

TTATTAAATT GGTATGTGTT CATTATACAT ATGaCAAATA TGaATGTAAA CCGATAATTT 60 

AGATTTTTTG GAATAACCTG AAAATTCAAG TTaTAGCGTT GCTTATATTT TAAAAGGTGG 120 

TGATAATGAG ACTTTTTGAA AAATAAAATT CAAAATACTT ATAGCATAAT CAATATGCAC 180

ATTAAATAAA TGTACTCTTT TAATGCGTTG ATAACTGTAT TTGTAATTTA GAGAAGGGGT 240 

GTTCACTATG CTGATGTGTT AAAAAATAAA ATAAAAAGGA CACCTCGATG CTATAAATAT 300 

TAGCATCGAG ACGCCCAGTT AATGTCTATT AAATTGAATA TAGTCTCGGA CATGAATCAA 360 

TGCCCTAGGC CCTGCAATGT TATATTGACA GTAGTTGACT GAATGAAAAT GACTTTGTAG 420 

CTAGCTTTTT TCAATCCTTG TCGGTGCAAC ACATAGAGAA ATTGGATTCC TAATTTCTAC 480

AAACAATACA AGTTGCGGAA TAAGTCCCAA TATAGAAGGT GACAGTAAGC CAACTTACAA 540

TAATGTGCAA GTTGGTCGGG CCTCAATACA GAGATTTTCG AAAAGAAATT CTACATATTA 600 



GCCACCTTGT TTATTTAAAT CGATAACACG GTTTGCGATT GTATTGATAA ATTCAAAGTC 720 

ATATGAAGTA AAGATAATAG AACCTTTGAA TGATTTAAGT CCATCATTAA CAGCAGTAAT 780 

ACTTTCTAAG TCTAAGGGTT GTGGGTCATC AAGTAAAAGA CGTTGCCCTG ATAACATCAT 840 

TTACTAGCAn CACGACTT 858
(2) INFORMATION FOR SEQ ID NO: 685: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1747 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 685:

TTAGTATTTT CAGGllTGGAA ATTGATGTTT GAAATTCACT GAAATGGGCC AGAACCTgAA 60

ATGaAACTTT ACCAACAATT TGGtCTTCAT CAATGAGGCC AAACGCACGG CTATCTTTAC 120 

TTACTTCACG ATTATCTCCA AGCACTAAAT ATTTACCTTT TGGAATGACA TTTGATTTAG 180 

GATTCGCATT CGGTAAATCT TTAACTTGGA AAGTCCCAGT AATGTAATCA CCTTGTTTAT 240 

GTTTTAAATT GTAGTTTAAA TATGGTTCAT CTTGTTTTTT ACCATTGACA TATAATGTAT 300 

CATTTTTGTA TTCTACTTTA TCACCAGGAA CACCGATGAC ACGTTTAACA TAGTCATCAT 360 

TTTTGTTTGC ATGGAAGACA ACTACATTAC CTTTTTCCAA ACCACCTGTT TTATATCCAA 420 

CAATGTTTAC AGCTACTCGC TCGCCATCTT TCAAAGTTGG ATCCATTGAT TCACCTTTAA 480

TTGTATATGG CGTAACAATA AATTTACCTA CTATAAATAA AATGACAAAA GCGACTGCAA 540 

TTGAAATAAT CCATTCCAAT ATTTCTTTTT TCAATTTTGA CACCTCTTTT TAAGATTTGA 600 

ACTGAACAGT CCATTTTGAA AAAGGATAGT ATCGTAAACt AACATTACCA ATAATAtCCT 660 

TTTTATCGAT TAAACCAAAT TGTCTTGAAT CGTGCTTGTk ATTATCTTGA TCATTTAGCA 720 

CAACAAAATT GTTTGGCGGA ATAATATCAC CATCTAATTC TTTAAAATTG CGCAAACTAA 780 

AATCTTTAAT TTTTCTGTTC TTGGCATAAG ATGCGTCAAC CGGTCGGTCA TCACGGTATA 840 

ATTGTCCCTG ACGAAACGCC ATTGATTGAC CAGGTTTGGC AATAATTCGA CTAGTATATA 900 

TCTCGTTACC ACGCCTATAT GTAATGATAT CACCATTATT CAATTGATTA AATGTAACTT 960 

TAATTTTATT TACAATAACA CGATCCCCTT TGTTAAGGGT TGGTGACATA TCATTATTCG 1020 

GAATGACATG ACCAACTATT ACAAAAGTTT GTACGAACAG TACAATGATA ATAGCAAGTA 1080 

TCAATGAAAT CAAATATTTT ACAACTTTTT TCACGATGTC ACTCCTTTTT CGATCCCATA 1140 



EP 0 786 519 A2

ATCCCTCTTA AAGGATGCGT TAAAATTGTA GTAATTTCCT TACCTAAATA ACCTAAAATA 1260 

ATTGTTGAAA CTAACTTTGA TGATGCCAAA ACAATGAAAT AATATTTAGG TCTAATATGA 1320 

GATAGACTCG CTACAAAATT TATTAATGTA TTTGGCGTAA AAGGAAAACA AAGTAAAATA 1380 

AACAATGGGA TTAATCCTTG GCGATCAATA AAACTAATCA AGCGTTGAAC AGCAGTACGT 1440 

TGITTAATTC GCTGCATCCT CTCAGTGTTC ACCAATCGTT TACAGATCAA ATAGACTGTA 1500 

AATGTTCCAG AAATTAATCC AAGCCAACTA ATCAATATAC CTAAAATAGG TCCATAAGCT 1560 

TGAATGTTAA TTAAAATATA GAGTGCTAAA GGAAATACTG GAATTATAGC TCTAATATAT 1620 

AACAATATAA ATCCAGGTAA ATAACCAAAC TGTCGAAATA TCTCAAACCA TTCTTCTACT 1680 

TGATGAAACG ACAAATCATC AATCCCTTTC TTTGGTTGAA GATAATTATT CTTACATTAT 1740 

AAAGTTA 1747 
(2) INFORMATION FOR SEQ ID NO: 686:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 645 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 686: 

CGTAAAGATT ACTATTTAGG AGGGTGACTA TGAAAAAGAA ATTAGGTATG TTACTTCTTG 60 

TACCAGCCGT AACTTTATCA TTAGCCGCAT GTGGGAATGA TGATGGAAAA GATAaAGATG 120 

GCAAGGTAAC AATTwAAcGa CaGTTaTCCm TTGCAATcAT TTgCAGAGCA AATTGGTGGA 180 

AAACACGTGA AGGTATCATC AATCTATCCA GCAGGGACAG ATTTACATAG CTATGAACCA 240 

ACACAAAAAG ATATATTAAG TGCAAGCAAG TCAGACTTGT TTATGTATAC AGGGGATAAT 300 

TTAGATCCGG TTGCTAAGAA AGTTGCATCT ACTATCAAAG ATAAAGATAA AAAACTGTCT 360

TTAGAGGATA AATTAGATAA AGCAAAGCTT TTAACTGATC AACACGAGCA TGGTGAAGAG 420 

CATGAACATG AGGGACATGA TCATGAGAAA GAAGAACATC ATCATCATCA TGGTGGATAT 480 

GATCCACACG TATGGTTAGA TCCTAAAATT AACCAAACTT TCGCTAAAGA AATTAAAGAT 540 

GAATTAGTGA AAAAAGATCC AaAACATAAA GATGACTATG AGAaAAACTA CnaAAATTAA 600 

ACGACGATCT TAAGAAAATT GATAACGATA TGAAGCAAGT TACAA 645 

(2) INFORMATION FOR SEQ ID NO: 687: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 956 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 687: 

TTTGTTACTG CTTCTAAAAT AATATCCTTT AATTGTTTAA CATGTTGGAT TGTCATATGA 60 

GGTGATGGTA CATTAAAAGG ATTTAATTCA TCTATTTGTG CATATTGATT TATGACATCT 120 

TGATGCATTG AAATAGGGTT GATATCATTT GTTACTACTT TATTAGATTG GTCTTGTGAC 180

ATACTAATGG TGCCACCAGT ATGAATAACA AGTAGATGTT TCATATATTT CCTCCTATAT 240 

TTAATTTACC TAATTATGAT AAAATATTAT TCATAAAACG ACAAGGAAGG GAAATGACX5C 300

ATGAAAGCCA TTAATATTGC ATTAGATGGT CCAGCTGCTG CCGGAAAAAG TACAATTGCG 360 

AAACGTGTAG CCAGCGAACT ATCAATGATT TATGTCGATA CAGGAGCAAT GTATCGTGCA 420 

TTAACATACA AATATTtAAA ATTAAACAAA ACTGAGGACT TTGCAAAACT AGTTGACCAA 480 

ACAACATTAG ATTTAACTTA TAAAGCAGAT AAAGGTCAAT GTGTCATTTT AGATAACGAA 540 

GATGTAACAG ACTTTTTAAG AAATAATGAT GTGACGCAAC ATGTTTCATA CGTTGCATCT 600 

AAAGAGCCAG TACGTTCATT CGCCGTTAAA AAACAAAAAG AGTTAGCTGC AGAAAAAGGT 660 

ATCGTAATGG ATGGTCGCGA TATCGGAACT GTAGTGCTAC CAGATGCAGA TTTAAAAGTA 720 

TATATGATTG CATCAGTTGA AGAGCGAGCA GAAAGT^GAT ATAAAGATAA TCAATTAAGA 780 

GGTATCGAAT CAAATTTTGA AGATTTAAAA CGTGATATTG AAGCTCGTGA TCAATATGAC 840 

ATGAACCGTG AAATATCACC ATTAAGAAAA GCAGATGATG CAGTGACATT AGATACGACm 900 

GGCmAGTCGA TTGAAGAAGT TACTGACGAn ATTTTAGCGA TGGTGAGTnC AATTnA 956
(2) INFORMATION FOR SEQ ID NO: 688: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1166 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 688: 

AAAAGGGAGG AAAGAGAACA GTTAAATATG AATACAAAAT AATTTTGTTT TCGGACAGCA 60 

GGGGTATTAG ACGCGATTGA CAATGTCIGT TrAATTAAAC GTAATGTTTA TTTAAGCGAT 120

GAATATTAGG TGAAAAGTTT TTGAATTTGA ATGTAATTGA GGTTTATTGA TTAGACATTT 180 

TATTGAATTG CGTGTTATTA TATAAATGTA AAAATAAGAC GACATGCGCG AACATGTCGT 240 

TAATGCGGAA TGGTTTTTTT ATTTTCCGCT AATTGAAATA AAAATGACGT TTTAATATAT 360

TATGGGCTAG GTGGTTTGTA AGAAAGGGTT AGTTAITAAT GTTTTATGAA TTAAGGAAAT 420

TTGAGTTTAA GGTTTAATCA ATTGTGATTT TGTTGATGAA GCGTTTAGTT AGAGTATTTT 480

CGCCACCACT AGTTACTCCT TCTCCCACTT TACCCGAGAC TGGAGAAGAg CTATCTGAAG 540

AATAAATAGA TACTTTTTGG CCATTTTGTA GTAAACCAAG ACCTTTTAAC TkCTCGGTTA 600

GAGAATTCCA TGTATTTTTA GCATCTAGCT TTTTGTTAAA GTCAyCGTAG ACATTTTCCT 660

TAGTTAAATC AATTTGTTTT AATCCTTTAA AGTCTATAGA TTGTGTTAGA TGGCCTCCAT 720

CATCATTTCc AGGAGCAGAA ACGCTGCTAG AATATCCATT GCTTAATAAA TAAGTAACGT 780

TGATTGTTTC GTACTCGTTA CTTAAAATAA TATCAGAATC ATGTAAgAAT CTTrAACTTT 840

TTTCCATAAT TGACCATCTG TCATTTTTTC TTCTGCTTTA GCCGTTTTAA CAACTTTATT 900

TGTATCTAAT CCTAAGTATG AAGAATGTAA GCCTGTTCCT AATGTTGTTA ATACTAAAGC 960

ACTTGCTACT aATGTTTTAC CTAAAAAttT TGTATTCATT TTTATTGCTC CTTtTTTTAT 1020

ATTGTAAACG TTTACAATGA AAATATAATA ATAATTTTTT AAAAGAACAA TTAACTAAAT 1080

ATCAAAAATG TATTAACTAT CTATTAACTA AAAAATAGAA TAATTTTTTA ACATAGTTTT 1140

GTTGTTTTGA ATTATAAAAA CTAAAG 1166

(2) INFORMATION FOR SEQ ID NO: 689:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1083 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 689: 



AGCACTGAAG GATGGCTAGT TGTCATAGCA TATGCGGTCA TTGGTATAGT TATAACGAGC 60

GTTTCTTATA TGTTGTCTAT TAAAATTTTT AACAAACAAG AACTAkAATT GTCGATTTTG 120

GTGATGATAG TATTTGAAAT AATATTTAGA GCAGGTGATA AATCTTTACG ATTGTCATCT 180

GTTCTTTTkG GTGTGGAATG AAATGTGGGG GATAAGTATA GGTGACATAT CTATATTGAT 240

TTATTTGTTT TGAGGTGGTT ATGTTGTGTG GGAATTATTT CCTTTTAGAT AGCGGGGATT 300

AGAGGATATA TGTTATTTAT AAGTATCATT TGATGATTGT ATAGGCTAAC GATTTCCTCG 360

GAAATATTTA AAAACCTCGA TCATGTAGCA TAACTGAAGT TTGTCACAAA AGTATAATGT 420

GAAGTTCGAC ACTTTTGGaT TCAGTTCAAA TACTTTGACC GAGGTAAATA CTATTTATTC 480

TGATACTGAG ATAATCATTA CATGGTCGTG ACCTTTAAAT AAAAGGCTGA CAATATAAGA 600

CATAACGAGT ATACCTAGTG AATATGAAAT ATACTTCGCG TTTGTCAGTT CATTATGGAA 660 

ATAAGGCGTG ATTAACCATA ATCCAATATA GAATATTAAA ACACTGATAT ACATCATATT 720

AATTTCAAAC AAGTCATTTA GTTTATTGTT ATTACTAAAA ACAATTGCAG CATTAATCAC 780 

ACCTAAAGCG ATATTGATTA ATAGATGCGT ATACGATAAA CGGAAACCGA TAGATGTTAA 840

TTTATGATTA ATATAATTTT CAGTAATGAT CCAATATACA CCGAAAAGAC TAATTAAAAT 900 

CATAAATTGG AATATATAAA TGTAACTAAA ATGATCAATG CTAAATGATG ACGAAGCTAA 960 

ACCAACCAGT ACCTCGCCAr AGwTAtAATT GTTAGTAACG AAAAACGTCT ACTAAATGCA 1020

TCATATTAAC AGGTtlTAATA CAAGTATTTC TGAAATGGAA TAAGnCTGTC GCTGCATGAT 1080 

ACG 1083 

(2) INFORMATION FOR SEQ ID NO: 690:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 627 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 690: 

TTAATCATCT GGATGTATTT AGTATTnAGA ATAATAAnAA AACGATCATG TTGTATtTGA 60

GTAGCGATTG GTTTGCGGAA TTAGGCTTTA CTTTCTTTAA TTACCACTAT ACAGCAAAGT 120

TGATTAAATC ATCCTATAAT TTGAAATGTC TACTATTAAA ATTGACATAT CGATACCTTG 180

ATAATCAGCC TCTTAATGAC GCTGATAyTA GAAAATTACA GGATATTATT AAAATCATTG 240

CAAAAGAAGC AAGTATGGAT AAAAAGATTG CACAAAATCA ATATCGATAT GCGTATTATG 300

GTGATTTGCG TGATGAGCTC GAATATATTT ATCAAAATGT AAATCAACGA TTGACATTAA 360

AAAGTGTCGC TGATAAATTA TTTGTCTCAA AGTCAAATTT GTCATCACAA TTCCACTTAC 420

TTATGGGCAT GGGTTTTAAA AAATATATTG ATACTTTGAA AATnGGTAAA TCGATTGAAA 480

TTCTACTTAC TACTGATAGT ACTATTAGCA ACATAAGTGA nCATTTAGGT TTTAGTAGTA 540

GCTCCACTTA CTCTAAAATG TTTAAAAGTT ATATGGATAT CACACCGAAT GAATATCGTA 600

ATTTATCAAA ATATAATAAn TGTTTAC 627


(2) INFORMATION FOR SEQ ID NO: 691: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 641 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 691:
TTGAACGTAA TGCTAGCAAA TGACTTTGTG CCATAAAATA TTCTTCCCAT TTGATTCTTT 60 
CCAAGATGTT CACCTTCCAT ACTTAAAATT TAGTAACATT TTCTAATAAT ATAAGATTAA 120 


TCACAAAAAA TAAAATTTGC AATTAAAATA ATCCATTATG TCGTGAAATA AGATTTCAGT 180 
TTATCAAAAG TTTTACTTCC AAAACCTTTT ACTTTTTTCA AATCGTCAAT TTCTTGAAAT 240 

GCACCTTGTT GGTTGCGATA TTCAACAATT GCATTAGCTT TAGCTTGCCC AACTCCAGGA 300

ACAGACATCA ATTCTGATAC AGATOCCGTA TTTAAATTTA CTTTAGTATT ATTTGTGTTC 360 
CCaTTTTTTT CGTGCACACT GTrTACTTCA ATTTGTGGTT CAACATTCIT TTGTCCTTTA 420 

TQAGGTATGA AAATCATTTT TTGATCTGTT AATTTTTCAG ACAAATTAAT TTGACTTACA 480

TCTGCATCCT CCAATAATTG TGCTTTATCA AGTAAATCAA CTACTCTATC CTTAGATGTC 540 
ATTTTATAAA CATTAGGATG TTTAACAGCA CCTTTTACAT CGACATATAC AGGACCCTTA 600 


TTTTTGGAAT TATCTCCATC TTTGACCTGG ACATCTTCTA C 641 

(2) INFORMATION FOR SEQ ID NO: 692: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 631 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 692: 



TATTAAAGnA CTTGATGAAC CAAATCATAA AAAGCnATAT ATGTTATTTG CAGCTGGCAT 60

TGTGTTnGCA ACTATTTTAC TTATTTCGGC ACATTTATAC AGCAGAAAGA GAGGTAACCA 120

AGTTTGAGAA TCATAAAGTA TTTAACCATT TTAGTGATAA GCGTCGTTAT CTTAACCAGC 180

CCAGTTCTCA AGAATCAACT AAATCCGGCG AATTCAGAAT CGTACCAACA 240

ACTGTTGCAT TGACAATGAC ATTGGACAAA TTGGATTTAC CAATTGTCGG CAAACCCACG 300

TCATATAAGA CATTGCCTAA TCGTTATAAA GATGTACCGG AAATTGGTCA ACCAATGGAG 360

CCGAATGTTG AAGCTGTTAA AAAGTTAAAA CCAACACATG TTTTGAGTGT GTCAACGATT 420

AAAGATGAAA TGCAACCATT TTACAAACAA TTAAATATGA AAGGCTACTT TTATGATTTT 480

GATAGTTTAA AAGGGATGCA AAAGTCGATT ACACAATTAG GTGaTCAATT TAATCGTAAA 540

GCAGCTAAAC AAAAGAAACA TCCCAAAGTA T 631
(2) INFORMATION FOR SEQ ID NO: 693: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1111 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 693:
AATTTAACTA TGTTTTCCAC ATTGTTTCAT GTCACGAAAA GGACAACGCG CGACTATAAG 60 
TATCAACTAT TTCCACAAGT TTTATTOGTG TTTTTATTAT TCATCGATAC GCTTCATTTT 120 
CATCTCTCCA ACACAAAAAA GAAGCTAAGC AACTTATGTT GCCTAACTCC TCTATACTAT 180 

CCATATTTTA CTATTATCCA TATTTCATTG AATTATCTAA TGTTGGcTTC TATTTTTTCA 240
ATATTTCTAC CGTCAATGAC GTCACTCATG CGATTTGTTT GTAATTTTTT ATTAAGTTCA 300 
AACGTATAAT AGCCGCCATC TTTCATTATC ACTTTTATCT TACTATCTTT AGGAAACTTT 360 

TTATACAGAT CAAAATTTTG AATTAAATAC TGTCTCAATT TAAAGTCGAG TTCTTTAAGT 420
GAAATCTCTT CTTTATAAAT GTAGTGTACT CTACCGTACG TAGCAATACC GTCACCTTCA 480 
TCTCTCTTGA TTTGAAATCT TGGTGCGTTT ATATAATCAT AATAAGCGTC TTGATTTTTC 540 


TTAGTGACAC CACCATATGA AAACACTGTG CCATTACGGT TTTCCGCTTC TTTAACAACA 600 
AATATGTCTA ATCCCGGATT TTTACGTGCT TTAAATCTTT CAATATCTTT ACCAAATATC 660 
TGTACTCTTG TGAATTTTCT ATTTTTATCA AAGATAAGGT AATGCTTGCC ACCTTTGCTA 720 
TAACGATAAC CAGTAACATT TTTAAGTTCC TTACTTGCGC CACTATAGTA ATCTCTTAAG 780 
TCAAAGATAT CTTTTGTCAC ATTTTCATAT TTTGCTTTAT GTTCACTCGC ATTTACAGTT 840 

TGATGCAATG ACGTTATTGT TCCTGTTGCT AAAATACCTA ATGCTAAACT TGCTTTCGCA 900
ATTGCTGTCA TTTTCATAGT TGTATGCTCC ATTCGTAATT ATTAGATTTG TTCGCTTACG 960 
TCTATTGAAT CATACAGCTT TATTATAGTT AGCGTATTTG ACCTTTCACA TTAAACCATG 1020 

TTTAATAATC ATTGAATCAT TATTAAGTAA ATTAAGGAtC TATAATGTTC GTTAAATAAA 1080
CTGAnCCCGT TGTGCTTCAC ACCCGnTnGA T 1111 
(2) INFORMATION FOR SEQ ID NO: 694: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 426 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 694:

TTATGGATGG ATTAAGAGGT CGTGTTGAAA AAATCAACGA TAACTCTGTT ATTGTTGACT 60 

TAACAATTAT GGAAAATTTT AATGACCTTG ATTTACCGGA AAAAACTGTT ATCAATCATA 120

AACGATATAA GATTGTTGAA TAAGAAGGTA AGTTATAATG AATAAAATCT CGAAGGCTTT 180 

AACTTGGTTT ATTATAAGTT TCATTATATT TCATCTCATA TTATTTATTA TGTGGGGCGA 240 

ACACCAAGAA TACTGGTATT TATATACAGG TATAATGCTA ATTGCTGGTA TCAGTTAT6T 300 

ATTTTATCAA AGAGATATTG AATCTAAGCG GTTGCTTACA TCAATTGGTG TTGGTATTAT 360 

TACGGCAATT ATTTTAATTA TGCtTCAACT TTTATkCTCA CTTATAACTT CTAATTTAAG 420 

TTATAG 426 
(2) INFORMATION FOR SEQ ID NO: 695:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 737 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 695: 

AAAAGGGGnG TAAGGTTTAG CTCAAGTACG AGAAGTCTTT GGTGATGAAG CAATTGATGA 60 

AAATGGTGAG ATGAATCGTC GTTATATGGG TGATCTAGTG TTTAATCATC CAGAAAAACG 120 

CTTAGAATTA AATGCTATCA TACATCCTAT CGTGCGAGAT ATTATGGAA6 AAGAAAAGCA 180 

AGAATATTTA AAACAAGGAT ATAATGTAAT CATGGATATT CCATTATTAT TTGAAAATGA 240

ATTGGAAAAT GCAGTAGACG AaGTGTGGGT TGTATACACT TCTGAAAGTA TACAAATGGA 300 

TCGTTTAATG CAACGTAATA ATTTGTCATT AGAAGATGCG AAAGCACGTG TCTATAGCCA 360

AATTTCTATT GATAAAAAAA GCCGAATGGC CGATCATGTT ATCGATAATT TAGGGGATAA 420 

ACTTGAATTA AAACAAAACC TTGAGAGATT GTTAGAAGAA GAAGGTTATA TTGAAAaGCC 480 

GAATTACGGA GAAGAAGATT AATATTACAC TATAAATAAG TCATTACTTT ACGTACGCGT 540 

TGATGTATGT AAGTAATGAC TATTTTTTAT AAAAAAGATA AATAAATCAA CGGAAAACGC 600 

TTTCAAATTT CATATAATAT GCTATACTAA TTCCATAAAG TATAACACAT AAAGATCAAG 660 

GGGTGCTTTT AATGTCAACG AATATTGCAA TTAATGGTAT GGGTAGAATT GGAAGAATGG 720 

TATTACGTAT TGCATTA 737 

(2) INFORMATION FOR SEQ ID NO: 696:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 432 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 696: 

AACGATAAGA ACAAAAGATA TATTACAAAG CGTTTATTTA AAACOTTATT TATTACGCGC 60 

GATGATGGCA GGATTTATTA TCGGGATTAT TACGGTCTTC GTATTATCAG TTAAAGCAAC 120 

ACACGAACCA GATTTACCGC CAGgCATTGT GAATATGGCC AGTGCCATTA CATTCAGCTT 180 

TGCGTTAGTA CTCATTTTAT TTACAAACTC CGAACTACTA ACCAGTAACT TCATGTACTT 240 

TACTGTAGGC CTGTATTmTA AAGTAATTAA ACCAACTAGA GTATTGaAAA TATTTTTATT 300 

ATGCTTTGCA GGAAATATTT TAGGTGCTGC TATTTyATTT AGTTTCATGC GTTTTTCAAA 360

TGTAATGACG CCAGATAwGt TAAAyCAGTT ATCAGCAGTT ATAGAGCATA AAACGTTGTC 420 

TACTGGTTTT GT 432 
(2) INFORMATION FOR SEQ ID NO: 697:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 782 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 697: 

CTTTTATACG AAAGTTAAnA TCAAATATAA CATTAATGTT TGATGGGGAT TTTGCGGGTA 60

GTGAmGCAAC ACTTAAAACA GGTCAAAATT TGTTACAGCA AGGGCTAAAT GTATTTGTTA 120

TACAATTGCC ATCAGGCATG GATCCGGATG AATACATTGG TAAGTATGGC AACGATGCAT 180

TTACTGCTTT TGTAAAAAAT GACAAAAAGT CATTTGCACA TTATAAAGTG AGTATATTAA 240

AAGATGAAAT TGCACATAAT GACCTTTCAT ATGAACGTTA TTTGAAAGAA CTAAGTCATG 300

ATATTTCGCT TATGAAATCA TCGATTTTGC AACAAAAGGC TTTAAATGAT GTTGCACCAT 360

TTTTCAATGT TAGTCCTGAG CAATTAGCTA ACGAAATACA ATTCAATCAA GCACCAGCCA 420

ATTATTATCC AGAAGATGAG TATGGCGGTT ACATTGAACC TGAGCCAATT GGTATGGCAC 480

AATTTGACAA TTTGAGCCGT CAAGAAAAAG CGGACnAGCA TTTTTAAAAC ATTTAATGAG 540

AGATAAAGAT ACATTTTTAA ATTATTATGA AAGTGTTGAT AAGGATAACT TCACAAATCA 600

TATCAGTGAT GCTGTGCAGT ATGTTAATTC AAATGAGTTG AGAGAAACAC TAATTAGCTT 720

AGAACAATAT AATTTGAATG ACGAACCATA TGAAAATGAA ATTGATGATT ATGTCAATGT 780 

TA 782 



(2) INFORMATION FOR SEQ ID NO: 698: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 714 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 698:

AAATCATTGG CAAAATATAC GAATTGCATT TAAATCGTCA ATTTACATAT ATTTTTGCGT 60 
AGTAATCAAT CGTTATCATT GTTATTTATC GTTACATTAT TTCGAGTATC AGTATGTATT 120

TCGGGCTTOG TTTGATAACG ACATTTCTTT GTGACATCGC TTCATCAGTG TAACAACAAA 180 

TACAATGATT TCGTGATGTT AGTTACCCAT TTTATGTGTT GCATAAAATA TGTTGTTATA 240 
AAACATTTTA AATCATTTTA TATAAACAAT CTATATATTT TTGGCATTTC CAAAATATCA 300

CTTGTTATAT TAAAAACCGA CAAGACATTT TATCTTATCG GTTGAAATTT GTTATTGTTA 360 

TTTGTAATGT TTTTAGGTTT CTTTTTAATA TAATATATTT CAGTGAAAAT ACATGATTGA 420 


TTGTGATTTC ACTGAAACAT GGTTAATTGC GTTGTTGATG AATAACTTTA GCATAAATAT 480 
AGGAAGTTAT TTTGTACATC GCCATATATA GAAACGAAAT TATAATGACA GCTAGTACGT 540 
AACTTGTTAA AAATATATGA TGGTTATTAA TACCTATCAT ATTTAGTAAC GTATATACAA 600 


TGTTACTAGA AATTAATGTG TGAATCAGTG CTACTGTTAT TGGTATtGCG AACAAGAAAG 660 
TCATTTGATT TCGTGTTATC TTTGCTATTC TTCCATTATC TAAACCAAGT TTTT 714 
(2) INFORMATION FOR SEQ ID NO: 699: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 361 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 699: 


ACTGAGAGCA ATAATTTAGT AACTTCTACT CAAGGAATTA TTAAAGAAGC ATTGCATAAA 60 

TTGGGATTTG nnTTTAATTA AAGAACCTTT AAGAATGTTA CAAGTGCGTA TCCCTGTACG 120

TGGACCAACA AAAGGGGGCG TG06TTTCCA CCCAGATGTT GATGAAGAAG AAGTAAAAGC 240

ATTATCAATG TGGATGACTT TGAAATGTGG CATTGTGAAA CTTACCATAC GGTGGTGGTT 300 

AAGGGTGGTG ATCGTTTGTG GATCCACGTC AAAnGAGCAT TCCATGAAGT TGGAACGTTT 360 

A 361 



(2) INFORMATION FOR SEQ ID NO: 700: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 943 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 700:

GTCCTTAATT GGTATCCACC TTTAGCACCG CGTACACTTC GAATTAACCC CGCATTTCTT 60

AAAGGACCTA CAAGCTGTTC TAAATATAAA TCACTCAAAT TATTTTCTTC AGCAATTGAC 120 

TTTAATGATA TACATCCTTG CCCCTCTTTT TTAGCAAGAG AAATCATCAA TGTAAGTCCA 180 

TATCTCCCTT TAGTAGAAAT TTTCATTGTA TAACCTCACT TAATTCGAAT ATTGATATTC 240

CCATTTTAGC ATTTTTTGAG TTAAGATAGT ATAAGAAAGG TGTGACAAAT GTGAGTACAG 300 

AACCATTAGC ATCGAGAATG CGCCCAAAAA ATATAGATGA AATCATTTCC CAACAACATT 360 

TAGTTGGACC AAGAGGCATT ATCAGAAGAA TQGTTGATAC AAAAAAATTA ACTTCAATGA 420 

TTTTTTATGG TCCACCTGGT ATAGGCAAAA CAAGTATTGC CAAAGCAATT TCGGGCAGTA 480 

CGCAATATAA ATTCAGACAA TTGAATGCTG TAACTAACAC TAAAAAAGAT ATGCAACTTG 540 


TTGTTGAAGA AGCTAAAATG TCTGGTCAAG TTATCTTGTT ATTAGATGAA ATACATCGAC 600 

TAGATAAAGC TAAACAAGAC TTTTTAITAC CTCATTTAGA AAATGGCAAA ATCGTCTTGA 660 

TCGGTGCTAC AACTTCAAAT CCTTATCATG CTATCAATCC AGCGATTCGT TCAAGAGCGC 720 


AAATTTyCGA GTTATATCCT TTAAATGACG AaGATGTGCG CCAAGCGTTA ACTCGTGCAA 780 

TAGAAGATGA TGAGAATGGT TTGAAAmCAT ATCaACCCAA AATTGATGAA GATGCCATGA 840 

CCTACTTTTC TACACAAAGC CAAGGTGATG TTCGTAGTGC GTTAAATGCA TTGGAATTAG 900

CTGTATTAAG CGCAGATAAT GACAAAGACG GTTATCGACA TGT 943 
(2) INFORMATION FOR SEQ ID NO: 701: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 445 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 701: 

CATTAAGTGA AGTTGTTGAT ACACCCATGC ATCAAGTCAA TTGTTCTGTT GATTTAGATA 60 

CAGAAAGCTT ATTAGGCTTT AAAACAATTA AAACAAATGC GGAAGGTCAA CAAGAAATTG 120 

TCTTTGTAGA TGGTCCAGTT ATTAAAGCTA TGAAAGAGGG GCATATTTTA TATATTGATG 180 

AAATAAATAT GGCTAAACCT GAAACATTGC CTGTATTAAA TGGGGTCTTA GATTATCGTC 240 

GTCAAATTAC GAATCCATAC ACTGGTGAAG TAATCAAAGC TGTACCAGGA TTTAACGTTA 300 

TAGCAGCGAT AAATGAAGGT TATGTTGGtA CTTTGCCAAT GAATGAAGCA CTAAAAAAaT 360 

CGCyTTGTTG TTaTTCACGT kGATTATaTT GATGGGGaCA TTTAAAAAAT GTGAnTAAGG 420

AGCAAGGTTT ATTACAAGAT GTTAA 445 
(2) INFORMATION FOR SEQ ID NO: 702: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 752 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 702: 

TGCAAGATAA ACAAATTTGT CATTGTTTTG GTT6TAAAAA AGGTGGCAAT GTTTTTCAAT 60 

TTACTCAAGA AATTAAAGAC ATATCATTTG TTGAAGCGGT TAAAGAATTA GGTGATAGas 120 

TTAATGTTGC TGTAGATATT GAGGCAACAC AATCTAACTC AAATGTTCAA ATTGCTTCkG 180 

AyGATTTACA AATGATTGAA ATGCATGAGT TAATACAAGA ATTTTATTAT TACGCTTTAA 240 

CAAAGACAGT OGAAGGCGAA CAAGCATTAA CGTACTTACA AGAACGTGGT TTTACAGATG 300 

CGCTTATTAA AGAGCGAGGC ATTGGCTTTG CACCCGATAG CTCACATTTT TGTCATGATT 360 

TTCTTCAAAA AAAGGGTTAC GATATTGAAT TAGCATATGA AGCCGGATTA TTATCACGTA 420

ACGAAGAAAA TTTCAGTTAT TACGATAGAT TTCGAAATCG TATTATGTTT CCTTTGAAAA 480 

ATGOGCAAGG AAGAATTGTT GGATATTCAG GTCGAACATA TACCGGTCAA GAACCAAAAT 540

ACTTAAATAG TCCTGAAACA CCTATCTTTC AAAAAAGAAA GTTGTTATAC AACTTAGATA 600 

AAGCGCGTAA ATCAATTAGA AAATTAGATG AAATCGTaTT ACTAGAAGGT TTTATGGATG 660 

TTATAAAATC TGATACTGCT GGCTTGAAAA ACGTTGTTGC AACAATGGGT ACACAGTTGT 720 

CAGATGAACA TATTACTTTT ATACGAAAGT TA 752 



(2) INFORMATION FOR SEQ ID NO: 703:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 830 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 703: 





nCGTTTTTGT nAAACnGCTT ATATGTATAA CTTTTCCCAA TTTATCATAA GTTTTGATAG 60

AAGGAGTTGG ATCGCTAAAG AAAATATCTC TAAAGATATC ATCTTCAATG ATAGGAATAT 120

TATGTCtTTC GCtATAAGTA ATTATATTTT TCTTTTGCTC ATTCGTTAAA GAACGACCTG 180

TCGGGTTATT AAACCTAGGT TCTATATAAA TCGCTTTATT TTTAAAATTA ATAAATCTAT 240

CAATGATGGT ATCAATTTCA TTAATTTGAT TATAAGGAAC ATCAATATGT CTAAAATTCA 300

ATTGCTCAAA AACATTTGTA GAGTGAATAT ATGATGGTGT ATTCGAAATT ATTATGGCAT 360

CTTGACCTAA AAACCCAATA GATAAAAGTT GAATGGCATG TAAAGCGCCT GAAGTGATCA 420

TTACATTTTC TCTACCTACA TTTATACCTT GCTTTGACAT TCGTTCAACG ATAATATCTC 480

TTAACTTGAT ATAACCATAG CCATTATTAT AACCAAAAGA TAAGTCTTCA ATATGACTGG 540

CTGTATTAGA CATGGCTTTT TTCAATTGAA TAT6TGGCAT TAACGATATA CCCAATTCAC 600

CTTThCTTKT ATGTATATAC GAATCATCTG TCTCAATTTT ATTAATTAAT T6CACCGTAT 660

ACTGACTTCT TTGTTGAGAG GACCATAACA TCATTTCAGA CCACTTATTT GTAATATGTG 720

CTTCATTCAA ATAGTCATTA ACATATGkTC CACTACCTAC TTTAGTATAG ATAAATCCTT 780

CAGCTTCTAA TAACTCAATA CTTTTAATAA TCGTTACTCT ATTTACGTTG 830



(2) INFORMATION FOR SEQ ID NO: 704: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 659 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 704: 



AGTGGTGTTG GAAAAGCTAT TATGAAATTA TTACGTGAAC AACAAGTTTA ATAAAAAAAG 60

AGGGGTCAAA TATGAAAGGA TTAATTATTA TTGGCAGTGC ACAAGTGAAT TCACATACAA 120

GTGCACTAGC AAGATACTTA ACTGAGCATT TTAAAACACA TGATATTGAA GCGGAAATAT 180

TCGATTTAGC AGAAAAACCG TTAAATCAAT TAGATTTTTC AGGAACAACA CCGTCTATTG 240

TATTAGGAAC GCCAAACTAT CATGGTTCAT ATTCTGGAAT ATTGAAAAAT GCATTAGATC 360

ATCTAAATAT GGATTATTTT AAAATGAAAC CTGTAGGCTT AATAGGAAAT AGTGGTGGTA 420 

TTGTTAGTTC AGAGCCATTG TCACATTTAA GAGTAATCGT CAGAAGTTTA CTAGGCATTG 480

CTGTACCAAC TCAAATAGCA ACACATGATT CTGATTTTGC TAAAAATGAA GATGGTTCAT 540 

ATTACTTAAA TGATAGTGAA TTCCAATTAC GAGCAAGATT ATTTGTCGAT CAAATTGTAT 600 

CTTTTGTGAA TAATAGTCCA TATGAACATT TAAAATAATA TTAAnAAATA TGTAAATnT 659 

(2) INFORMATION FOR SEQ ID NO: 705: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 400 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 705: 
nCACCATATA GTAACTGGCA CCAACTACAT TACCGTCTTT TAAAAAGATT TTTTTATAGT 60 

TATTATCAAC ACTATTAAAT ATTTCAATAC CTTTAATTTC TGCATTTTCT ACAATTTGAC 120
CAGCACTATA CAAGTCACAC CCAGAAACTT TTAATGACGT AAATGTTGTT GATCCCTTGT 180 
ATCCGTTCGT TTCTTTATTT GTTAAATGAT CAGCTAATAC TTTACCTTGT TCATATAGTG 240

6TGCAACGAG TCCATAAACT TTGCCGTTAT GTCTGCACAT TCACCAACTG CATATACATT 300
GCTATCACTT GTTTGCATCA CATCATTGAC AACAATACCA CGATTACATC TAGACCTGAT 360 
CTTGGCACTC CTGGGAAGGC GGAACCACTG CATACAACTA 400 

(2) INFORMATION FOR SEQ ID NO: 706:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 380 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 706: 

TTAAAAATAC AGCTACAGGT AATTTTAATG ATTTTTCATC AATATCAAAT TTGGGATTAT 60 

GGTGTGGCGC TGTAATACCT TTAACCACTC AGAAAGAATG CACCTGGTCG TACTTTCAAA 120 

TAATGTGAAA AATCTTCTCC AATCATCATT AAATCTGATT CATTAAAGCG TACATGTAAG 180 

TCATTTGTTG CTTCTTTAAT AACTTGGATA TGCTTTCTCG TTTATTATGG ACAGGCAAAT 240 



GCTTAATCCA TTTTGTCCAT ACATGATTCT GTATATCTGA AATCGAAAGT TCTnACTGTA 360

CCTTTACAAA ATGCCTTGnn 380

(2) INFORMATION FOR SEQ ID NO: 707:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 400 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 707: 






CAnAAAGGAT CAAAGTGATC GGTATAGTGA TAATTAGCTC CAAAGAAAGA ATATTCTAAA 60

TCAAATCCAT ACCAA6CAGA AAGTATTAGC GAATATCAAT TTAAATOGTA CCGATTCAAA 120

TAAAGAAACA CGACATATAG AATTTTTACT TGATGATTTT AGTGAATCAT ATGAACCAGG 180

AGATIGTATA GTAGCATTAC CGCAAAACGA CCCTGAATTG GTTGAAAAAC TAATATCCAT 240

GTTAGGTTGG GATCCGCAAT CTCCGGTGCC AATTAATGAT CATGGTGATA CAOTTCCTAT 300

TGTTGAAGCA CTAACATCAC ATTTTGAATT TACTAAATTA ACATTGCCAT TATTGAAAAA 360

TGCAGATATC TATTTTGACA ATGAAGAATT ATCTGAnCGT 400


(2) INFORMATION FOR SEQ ID NO: 708: 




(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 447 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 708: 




TAATGCTGGT ACGGGTCATG CAGCATTATG TGAGTTGAAC TACACAGTTT TACAACCTGA 60

TGGTTCTATC GACATCGAAA AAGCGAAAGT GATTAACGAA GAGTTTGAGA TTTCAAAACA 120

ATTCTGGGGT CACTTAGTGA AAAGCGGTAG CATCGAGAAC CCAAGAGAAT TTATCAATCC 180

ATTACCACAC ATCAGTTATG TTAGAGGTAA AAACAATGTT AAATTCTTAA AAGATCGTTA 240

CGAAGCGATG AAAGCTTTCC CTATGTTCGA TAATATCGAA TATACTGAAG ACAT0GAA6T 300

AATGAAAAAA TGGATTCCAT TGATGATGAA AGGCCGTGAA GATAACCCTG GTATCATGGc 360

GGCAAGTAAA ATTGACGArG GTmCAGATGt AAiTlCTyCGGT GAATTAACAC GTAAAATGGC 420

TAAAAGCATT GAAGCACATC CAAATGC 447

(2) INFORMATION FOR SEQ ID NO: 709:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1448 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 709:



GTGTCTACCT GTTTTTGTTG TTCAATTGTT AACTTTTCTT TTTGAATAGT AGTATTCGAT 60

AATTCTTTAT CGCTTATAAA ATCATCTAGT GGATGGCTTT CTTTGAAACT TTTATTTTCA 120

GTCATCAATC ATCTCTCCAA TAGTTGCTAT TTTACATCAA TCTTGATGAT GTTTTTGATG 180

TAACTTATTT AATTCCATTT CAATATCTAA ACGTTCATAA TCATCTTCGT TGAGACGCTT 240

TAAATCAGCG ATTAATGTTC GTTTGACCTC ATCCAAAGTA ATTCGTGTTT GTTCTAACTT 300

TTGCTGTTCA TTAATTGATT TTTTGGGCAT TTTTGCTAGA CGTGTATATG CATCAACCAA 360

ATTTAAAGCA TTATCAATAT GAGAATAAAA AAAGCCTTCA ACTTTATAAA ATGATGCAGG 420

TCTCTGTCTA ACTGTCGTAT AAATAGAACG TGAAATTTGG TATATATCAT TAATCTGCCT 480

AAAATCTTTA ATTGATCTTA TATTGACATA CGTTTTTAAA ATACCTCTAA GTTTTTGGTG 540

TGTATGATTT AACTGATTTC GAATATAGCG ATAGTCTTTT CTAGTCAAAC CAATTTCGTT 600

TAAATATTTG CGTGAAGTGA GTTTTTGTAT CGGTAGGTAT GTCATTAAAA AGCCAACAAT 660

ACCAATAGAC ATATCAATTA AAAAAGATAC ATCAAGTGCA ATCATCCCAA ATATGCTTGT 720

TAAAAAOGCT ACAGGAATTC CCACTAACAC CCCAAATATA TGAGAAATAT TATATCTCAC 780

TGTCATCTTC CTTTATTTAG CATTTTATAT TGATCGAAAA TCCGATAATT TTTGATTTAG 840

TTCTAACTCT TCAAGTTGAT GGCTTGTTAC ATTTGATGCT GGTGAGGCAC CTTCAATTAC 900

ACCTT6AATA AATCTCTCTA TATCTGCGTC ATCCCCTTGT GCATATATCT CTACATAGTC 960

ATCTACATTT TGAACAGTAC CGACAATGTT ATAGTTCATT GCAATGCGTT GTGTAAAATA 1020

TCTAAATCCG ACGCCTTGAA CGCGTCCGAA TACTTGTAAA TGTATATGtC TCaTTTTTAC 1080

CACCTCATAA TGTTATTATA CGTAGTTTTA CTTAAAAAAA CTAATAATTA CTATAGTTAC 1140

TACTTTGTTT GTTTCAAGTC GTCAAACTTG ATTTTCAGA6 GATAAAGGTA TAAAAATAAG 1200

TATAGAGTTT TTGAAGTATG GAAGGGGTCT TTAATAATGT GGACAGTTAC CAAAATTAGA 1260

GCCGATTATG AGGGATGGTG GTTATTCAGT GACTGGCCAG AAAACATTGT TGAAAAATAT 1320

CAATATCAAG ATTTTGATGA CATGTTTAAG CACTATCAAC AATTGATTAA TCAATGTAAA 1380

GTTCAGTTCG ATAACTATGT CACAGGCAAA TATAATATTT ATGCATTTTA TAATAATTGT 1440

(2) INFORMATION FOR SEQ ID NO: 710:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 519 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 710:
GCTGnAAAAT ATGTTAGAAG CAGATGCATG GGCAAAGCTG GGGTCCTTAT TTATTGTCAG 60 
GTAAAGATGT CTTCAATTCA ACTATTGGAA TATATGGTAT GGGAGATATT GGTAAAGCTT 120 
TTGCAAGAAG GTTGCAAGGG TTTAATACTA ATATTCTTTA TCATAATCGA TCAAGACATA 180 
AAGATGCAGA GGCGGACTTT AATGCAACAT ATGTTTCTTT TGAAACGTTG TTAGCAGAAA 240 

GTGATTTTAT CATCTGTACA GCGCCACTTA CAAAAGAAAC ACATCATAAA TTTAATGCTG 300
AAGCATTTGA ACAAATGAAA AATGATGCAA TTTTTATTAA TATCGGTAGA GGACAAATTG 360 
TAGATGAAAC AGCATTAATC GATGCACTAG ACAATAAAGA AATTTTAGCA TGTGGTTTAG 420 

ATGTATTAGC AAATGaACCG ATTGATCaTA CACATCCATT aATOGGaCGT GaTAATGtTC 480
TGaTTaCACC aCACATTGGG TAGGCGCATT CAGTTAACn 519 
(2) INFORMATION FOR SEQ ID NO: 711: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1349 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 711: 



CTATTACAGG TGGCGAACCA ATGTTTTCTA AAAAGTCTAT TAGAAATGTT GTTAAACCTC 60

TATTAAAGTA TGCACATCAT CGAGGTATAT ATACACAAAT GAATTCAAAC CTAACATTGC 120

CTCAAGATCG TTATTTAGAT ATTGCTGAAT ATATCGATGT TATGCATATC TCACACAACT 180

GGGOAACAAC TGATGAATTC GCAAATGTTG GCTTTGGCGC AATGAAGAAG CAACCACCGT 240

TAAAAGCTAA GTTAAAATTA TATGAACAAA TGATTTCGAA TGCACGTACA TTATCAGAAC 300

AAGGAATGTT TGTATCTGCG GAAACAATGC TCAATCAAAQ TACGCTACCA CATTTACGAA 360

AAATACATCA AGAAGTCGTT CATGATATGA AATGTAGCAG ACACGAGAIT CACCCTATGT 420

ATCCAGCTGA CTTTGCAAGT CAATTAAATG T6TTAACTCT AGCGGAAATG AAAAAGACAA 480

TGTTTCCATG CTTAAAGGAT GATGAAGATC AAAAGTTACT ATCACGTTTA AGAAATGCTA 600

AAAATGTAAC GACTAGAAAT GACCCGGATG GCCGTAGTCG TTTAAATGTC AATGTATTTA 660 

CAGGTAATGT AATCGTAACT GATTTCGGAG ATGAAACAGG TACAATTTCG AATATACAAA 720 

AAGATAAATT AACAGATGTA TTTGATAAAT GGTTATCCTC TGATCTTGCT AAATCATTAA 780 

ATTGTCATTG TTCCGAGTTT AGTTGTTTAG GGCCAAATGT TCTTGTTAAA AATATGTACT 840 

ATCCGAATAT GGATTTTAAA GATAATGAGC GTCATAT6CA CAAACAACCA CAAATTATAC 900 

AATTTTAAAA ACTCTTAATT ATGCGGAGAA GCACTTTATC GATAAGTAGT CTCCGCATAT 960 

TTTAATGCTA TTATAAAATA AAAAACAATT AATTGCTGGC AGTACTCTAC TTAAATAATA 1020 

AAGGGCATTT AATAGGACTA ATAGTCTATA ATAAAAGGGG TAAATTTTAA CTAAAAGCAT 1080 

AAACGTGCAT AATCAAAAAG ACAGaTTGTA GGTGGAATAT TCGAaCATAA CAGTTCAATT 1140 

CATCCTTAAC AATCTGTCTT TATATTTTTA GTCTCTAATA TGTTGCACTT GAGCTAAATA 1200 

TTCAATTGTT gTTTACTTtC AaTGOGaCXSt GCTTttCTtC GTtCAaCaCG TgwGGTGCTG 1260 

TATCATAAAA CCATTTTTCA ACATCATCTT CTGGATATAC ACCAGGTACA TGTTTAGGTT 1320 

GCCTTCATCA TCTAACGCAA CAAATGTAA 1349
(2) INFORMATION FOR SEQ ID NO: 712: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 377 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 712: 

GAAATTAAAA AAGCAATTGG ACAAGATGCA ACAGTGTCAT TGTTTGATGA ATTTGATAAA 60 

AAATTATACA CTTACGGCGA TAACTGGGGT CGTGGTGGAG AAGTATTATA TCAAGCATTT 120 

GGTTTGAAAA TGCAACCAGA ACAACAAAAG TTAACTGCCA AAAnCAGGTn GGCCTGAATG 180 

GAACCAGGAG GAAnTTGAAA ATATGCTGGG GATTACATTG TGAGTACAAG TGAAGGTAAA 240 

CCTACACCAG GATACGAATC AACAAACATG TGGAAOAATT TGAAAGCTAC TAAAGAAGGA 300

CATATTGTTA AAGTTGATGC TGGTACATAC TGGTACAACG TCCTTATACA TTAGATTTCA 360 

TCCGTAAAGA TTPAAAA 377 

(2) INFORMATION FOR SEQ ID NO: 713:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 528 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 713:
AATGAAnCGA AAAATGACTA TAAAAAGTTT AAAGT6TTTT CACTTATTTC AACACTTGTC 60 
ATTGTCATTT TAGCAATTAT AAGATTTGTT CATAAAATGA TGTAATTAGA 6TGAGACATT 120
GTTTTATGTC TCAGGATCCA GTTATTCATT ATATCTACAA TATTTACGAT TATATAAATA 180 
ACCCGAGATT TTAGTATGAT TCATTtCACT AAAATCTCGG GTTTCTATTT GATAATTTTT 240 
AATGGGATAT GGCATGTATA OGTTCTTGCC TTTTATCTCA TTTCCAATGA TTAATCTGGA 300 
TATTGTTCTA AAAATGCTTT CGCTTCTTTA TTAACTGTTT TAAAATCAAT ACCTTGTTGC 360 

ATCGCTGCAA AGACACATCC ACAATAACAC TGCCTAAAGA TATTATAGTC ATTACACATT 420 


TCTATGGaAC GCTCATAACC TTTACTTTTC yTAAAATCAC TTGGcAAATA GTTCACaTCG 480 

TATATTTTTT GGACATCCAT ACCAAGTTCA TTQATTAATT GTGCGTTC 528 

(2) INFORMATION FOR SEQ ID NO: 714: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 731 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 714: 



TTTAATGGCA TGAAGTCACA GTATATAAAG AATCTGGTGT GACAATCAGT ATGACTAAGT 60

ATATAAAAAA AGAGCAGGTG CGAAATAATG GCGAAAGAGT CGAAATCAGC TAATGAAATT 120

TCACCTGAGC AAATTAACCA ATGGATTAAA GAACACCAAG AAAATAAGAA TACAGATGCA 180

CAGGATAAGT TAGTTAAACA TTACCAAAAA CTAATTGAGT CATTGGCATA TAAATATTCT 240

AAAGGACAAT CACATCACGA AGATTTAGTT CAAGTTGGTA TGGTTGGTTT AATAGGTGCC 300

ATAAATAGAT TCGATATGTC CTTTGAACGG AAGTTTGAAG CCTTTTTAGT ACCTACTGTA 360

ATCGGTGAAA TCAAAAGATA TCTACGAGAT AAAACTTGGA GTGTACATGT TCCGAGACGT 420

ATTAAAGAAA TTGGGCCAAG AATCAAAAAA GTGAGCGATG AACTAACCGC TGaATTaGaG 480

cGTTcACyTT CTATcAGTGA AATAGCTGAT CGATTAGAA6 TCTCAGAAGA AGAAGTGTTA 540

GAAGCAATGG AAATGGGACA AAGTTATAAT GCGTTAAGTG TTGATCATTC CATTGAAGCT 600

GATAAAGATG GTTCAACTGT TACGCTATTA GATATTATGG GGCAACAAGA TGACCnTTAT 660

CGAGAAATCA T 731
(2) INFORMATION FOR SEQ ID NO: 715: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 830 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 715:



TTTTAGrGGa AAGCGaATTA GtCaTATwCg CaGCAGATwG aATGATTGaA AAmQaAATTG 60

aTTCGATTCC AATTGTAAGA AAAAAAGATA ATCAAAAGTA TGAAGTAATT GGAAGAATTT 120

•PT A T*l* ft/'*' F' Art CATTATATAA AGAATAGGTG AGAAGTAATG 180

GAAAAAATTA AAATTATCGT AGCTTCAGAT TCTATAGGTG AAACGGCAGA 6TTA6TTGCT 240

AGGGCAGGTA TTTCACAATT CAATCCTAAG CAATGTAAAA ATGAATTATT AAGATATCCA 300

TATATTGAAT CTTTTGAAGA TGTTGATGAA GTGATTCAAG TTGCAAAAGA TACAAATGCT 360

ATCATTGTTT ATACACTTAT TAAACCTGAA ATGAAGCAAT ATATGAGTGA GAAAGTAGCA 420

GAATTCCAAT TGAAGTCTGT CGATATCATG GGGCCATTAA TGGATTTATT ATCTGCTTCG 480

GTTGAAGAAA AACCTTATAA TGAGCCAGGT ATCGTTCATA GATTAGATGA TGCATATTTC 540

AAGAAAATTG ATGCGATAGA GTTTGCAGTT AAATATGATG ATGGTAAAGA TCCTAAAGGA 600

TTACCTAAAG CTGATATTGT TTTACTTGGT ATTTCGAGAA CTTCAAAGAC ACCATTATCT 660

CAGTATTTAG CGCATAAGAG TTACAAAGTT ATGAATGTAC CGATTGTACC AGAAaGTGAC 720

ACCGCCAGAT GGCTTATATg 6ATATTAATC CAAAGAAATG TATCGCACTT AAAATAAGTG 780

AAGAnAAATT AAATCGCATT AGAnAAGAGC GACTAAAACA ATTAnGACTA 830



(2) INFORMATION FOR SEQ ID NO: 716: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1159 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 716:
nGAAGAACAA GTTTAnCATC TAAATGCCCG TTTAAAACTA CAACTACTTG ACGATGTTAA 60
ATCAGTGTTT AAnTCTCAAA TGACGCAAAA TAGTGATTTT AATGAAGAAA AGAAAGTGTC 120
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5 



AGAACGTATA AAAAAATACT TTAATAAGCA ACTCACTGAG CAAATTGCAC CAATCGTTCA 240
ACAATTAGCA GATTTACATG TCATTATTAA TCCTCAGTTT AACTTTGAAT CAGCTAATAT 300
A6AGCAACCA TTATTGCACA TCGATTTCAA CGATATGCTA AATGCATTGC CTAAACAATT 360
AACAAAACGT AAAATTTTGA ATCCAAATGG GCAAAGAGUVT ATACAOXSAAT CAATTT6TCA 420
AAGTACGTTA GGATTATTAC AACCACAAAT GGGATTATTG AGGCAACAGC TTGAATTATA 480
T6TAAAQCAA ATGGCTGTAG AAGCTGAATC GCAATTTGAA AGTITTGAAG CTAATATTCA 540
AAC6CAAATA AACGATTTAT TAGCATTTGA TTTAGATACA ACACTTATCA ATCAATTGAA 600
AGATAAACAT CAACAACTGA AAACTATTTT ATATTAAGAa AGAAGGAACG TTTTAAATGC 660
CTAATAAAAT ATTACTTGTA GATGGTATGG CGCTATTATT TAGACATTTC TATGCTACAA 720
GTCTTCATAA ACAATTTATG TACAATTCAC AAQGTGAACC TACAAATGGA ATACAAGGAT 780
TTGTGCGTCA TATCTTTTCG GCAATACATG AAATACGCCC TACACATGTA GCTGTATGTT 840
GGGATATGGG ACAATCAACT TTTAGAAATG ATATGTTTGA tGGTTATAAG CAAAAkCGTT 900
CTGCACCACC AGAAGAATTG ATACCACAAT TTGATTATGT TAAAGAAATT TCAGAGCAAT 960
TTGGCTTTGT AAATATTGGC GTTAAAAACT ATGAAGCGGA TGAT6TTATA GGTACATTAG 1020
CACAACAATA TTCAACTGAT AACGATGTCT ATATTATTAC GGGCGACAAA GATTTACTGC 1080
AATGTATTAA TGACAATGTT GAAGTTGGCT AATTAAAAAA GGTITAACAT TTATAA7AGA 1140
TATnCATTAC ATCGTTTTn 1159



(2) INFORMATION FOR SEQ ID NO: 717: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 574 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 717: 
GATTTAATCA ATGCAGTTGC AGAGCAAGCT GATTTAACTA AAAAAGAAGC TGGTTCAGCA 60 
GTAGATGCTG TATTCGAATC AATCCAAAAC TCACTTGCTA AAGGTGAAAA AGTACAATTA 120 

ATTGGTTTCG GTAACTTTGA GGTACGTGAA CGTGCTGCAC GTAAAGGTCG TAACCCTCAA 180
ACTGGTAAAG AAATTGATAT CCCAGCAAGT AAAGTTCCAG CATTCAAAGC TGGTAAAGCA 240
TTAAAAGATG CTGTAAAATA ATTTTACTTA AAAAGCCCTG AATAAGGGCT TTTTATTTTG 300
CTTTTAATAC TTACAACCTG TACATAAATT GTAATGTTCT TCTAAGTTTT TAATCTTTGG 360
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AACTACAAAA TATACATATG aATATTGaGa TTAATTGTTA GCGTTGaATT TACTTAAAAO 480 

GTAACCATGT CTACTATAGT ATTTTACGTT ATTTAAAAAG ATGAATAATG TAAATGAAGT 540 

AAAGGTTATT ATGAGAATTA CAAAAGCTAC ATAA 574

(2) INFORMATION FOR SEQ ID NO: 718: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 493 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 718:
AAGaTATTGA AAmTGmAaTG GaGCACGCCC TTATTTGATA GAAGTAAAAG ACATTTAATT 60
CTTACCGATG CAGGTCAAAT TTTITATGAG AAAAGTAAAG AAATTGTtGC ACTGTATGAT 120
TATTTACCAT CTGAAATGGA ACGCTTGAAT GGACTGGAAA CAGGACATAT AAACATGGGC 180
ATGTCGGCAG TCATGAATAT GAAGATTCTT ATCAATATTC TTGGTGCATT CCATCAACAA 240
TATCCAAATG TTACATATAA TTTGATAGAA AATGGCGGTA AAACAATTGA ACAGCAAATT 300
ATCAATGATG AAGTAGATAT AGGCGTGACC AdTTGCCAG TCGATCATCA TATTTTCGAT 360 
TATACTACCC TAGATAAGGA AGATTTGCGA CTTATCGTGA GCAGAGAGCA TCGACTCGCA 420 
AAATATGAAA CTGTTAAACT CGAAGATTTA GCAGGTQAAG ACTTCATTTT ATTTAATAAA 480 
GACTTTTACT TGA 493 
(2) INFORMATION FOR SEQ ID NO: 719: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1257 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 719: 

GCATTGCCAA AACAATACGT TTATATTATT GAGGAACTAC TGTATAAAAG TAATGAATAT 60

CAAAATAAAA AATCATATTA CGAAACACTT GTTAACCAAG TAATTGAACT TAAACAGGCA 120 

GATGATTTAA TTATTGGACT CGCTTATTCC GTACAACGCT TAGTCGTCGA TCATTTACAC 180 

GTTGTCGGTG ATATTTATGA TCGTGGACCA CAACCAGATA AAATTATGGA TACACTGATT 240

AATTATCATT CCCTAGATAT TCAATGGGGT AATCATGATG TGCTTTGGGT TGGAGCCTAT 300 



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5 



GATATTATCG AAGACGCTTA TGGCATTAAT TTAAGACCAC TGCTTACTTT AGCTGAAAAA 420
TACTATGACG CAGATAATCC TGCTTTTAAG CCTAAAAAAA GACCTGACAA ACACGAACGT 480
TTAACTCAAC GTGAAGAAAG TCAAATTACT AAAATTCATC AAGCTATTGC GATGATTCAA 540
TTCAAGTTAG AAATACCAAT TATTAAACGT CGTCCAAATT TCGAAATGGA AGAACGTCTT 600
GTGCTTGAAA AGGTTAATTA TGATACAAAT GAAATTACAG TTTATGGTAA TACATACCCA 660
TTGAAAGACA CATGTTTCCA AACTGTCAAT CGTGATAATC CAGCAGAATT ACTACCTGAA 720
GAAGAAGAAG TCATGAATAA ACTATTATTG TCATTCCAAC AATCTGAAAA ATTACGTCGT 780
CATATGTCTT TCTTGATGCG TAAAGGCTCT CTTTACTTAC CATATAATG6 CAATTTACTC 840
ATTCATGGTT GTATTCCAGT TGATGAAAAT GGTGAGATGG AATCATTTGA AATTGATGGT 900
CATACTTACA GCGGCCAAGA ATTATTAGAT GTGTTTGA6T ATCATGTCCG TAAATCATTT 960
GATGAAAAAG AAAATACTGA TGACTTATCG ACGGATTTAG TTTGGTATTT ATGGACTGGG 1020
AAATATTCGT CACTATTTGG TAAACGTGCC ATGACTACGT TTGAGCGATA CTTTATTGCA 1080
6ATAAAGCTT CTCATAAAGA AGAAAAGAAT CCGTACTATC aTCTTCGTGA AGATGTGaAT 1140
ATGGkTCGTA AAATGCTCaG TGaTTTCGGA TTAAATCCAG ATGAAGGACG CATTATTAAT 1200
GGTCACACAC CAGTGAAAGA AATCAATGGC GAGATCCTAT CAAGGCTGAT GGAAAGA 1257


(2) INFORMATION FOR SEQ ID NO: 720: 








(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 








(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 720: 






TGAGCACATG ACAGTGGCTG AAAACATTAA ATTTTTTAAA TCACTTTGTA AAAATCCAAT 60
TAACGATACA ACTATCAACG AATATTTACA GCAATTAAAC TTTGATGATA CGTCTGCCAA 120
AGTATCTACA TTGTCCGGTG GGAATAAACG TAAAATTAAT ATATTAGTAG GTTTACTAGG 180
TCAACCTCGA ATTCTCATTT TAGATGAACC GACAGTTGGT ATTGATTTAA AATCTAGACA 240
TGACATCCAC CAACTACTTA ACATCATGAA ATCTAAATGT TTAATTATAT TAACTACCCA 300
TCATTTAGAT GAAGTTGAGG CACTTGCAGA TGGTTATCAA GTTAATTGGG CCAGGTnCCC 360
TTTTnTTCAA CAGTTTTTGG GGGCCAACCA TGGGCTTATA 400



(2) INFORMATION FOR SEQ ID NO: 721: 
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(A) LENGTH: 570 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 721:

CTTATTTATA TTATAATATA AATATTTTAT TTTTCAATAC TTAATTTTAA AAATCCCCTA 60

TGAAAATGTT GATGAGCTAC ATCTTTAAAT CTTATTATAC ATTGTATAAA ATTATATTGC 120 

GAGGTAGTAA ATTGATATTA TACACTTTTG GTTGTGTAT6 CATCTCACCT ATCTTTTTTG 180 

CTTTTTTCAA ATAAGAAAGT ATTAAAAATy AATGCCTATA CAGGAACAGA CATGGCTAAA 240

CTTGATTTAA ATAGTCTTGA CGACGAGCAC GTAAAATTAT TAATAAATGA ATTAAAATAT 300 

CCAGAAACTC ATATCGATGT AAATGAATTA AAAACAATAG TTGCTAGTCG AATAAATGAA 360 


AGGCAAGAAA TAATAAGTTT TAAGTTAGGA ATAAAGTACT TATTAACAAT AAAAAGAGGG 420 

AACATAGAAA AAQATAGGTT TTCAATTTCA ATCATTTTCA AAGATACCTA TCACACCCTA 480 

GTTAGAATAG ATATTAACGG TGGTACTCAC GATAATCCAG ATGGAACAAT CGCTCCGAAA 540 


AGTCATATTC ACATATATAA TGATAAGTAT 570 
(2) INFORMATION FOR SEQ ID NO: 722: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 400 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 722: 
ATTTTAGTCT TAGGTGTTGA TTGCATGATG AATGCAGAAG TTATACCTGC AGCCATATTA 60 


QCACCTTCAT TATTGTGTAT AATTTGAATA TTATTTGCTT TTAAAGTATG TCCAATTAAG 120 
TTTGAAGTCG TTGTTTTACC ATTTCTTCCA CTGATAAATA CAATATCATC AACTTGCTCT 180 
GCTAATTTTC TTAATATATC TGTATCCACT TTTCTAGCGA TTTGTCCAGG TAAATCTGTT 240 


CCTCTTTTAC CTACTGCTCT ACTTGCTTTA CGCGCCAATT TCGCTAGATG GATTGCCGTC 300 

CACTGTCTCA TGTGTTTCCT CCtCAAATTt CCACTCGCAT CATTATAACA TGACAAGGCA 360 

ACTTCAAAAA AGTTTCTCAA TCACAAATnG ATACCAGTGT 400

(2) INFORMATION FOR SEQ ID NO: 723: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1113 base pairs 
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w 



(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 723: 

GATTATATGC TGTTGGCAAA ACAATATACC CAGATGAATT TTTATTTTTA GTAATAGTAT 60 

AACCATGCTT TGTCTTACTA ACTTTGACAG CTTCTAATGA ATTTGAAGCG TCTTTAAGCG 120 

TGGTATAATC TTCGCCGTAT ATCCCTTTTA AATTTACTCG ATACTTACCT TTAGGCAATG 180 

ATAATCTAAT TCTATCTGGA GCTTTAATGC GTATCGTTAC GGGTGTTACA ACGCGTCGAT 240 

ATTTATAAGT GAGTTTATTT CTTTCTTGTG TATATTCATT CACTTTAACA TCATGAGCTT 300

TATCCGGCGA AAGTAATTCT AAATCCATTT CAAAATACAA ATCTTTAAAT TGATTAGAAA 360 

CTGATTTTGG CAACTGTACA GTTAGACCAC CATTATTTTG TTTAACTTGT AATAAATGTT 420 

TTGTAGGAGA TTGCCAGGCT GCACTATTTA ATTTAATTGT TGAATCTGAT AGTAAATTTT 480 

TATTGGCTTT AAAATGTGTA TTAACATCTT TAATATTGTT AGAAACAATC CCTTGCAACA 540 

TTGCTTGTTC TTTATCTAAT GGAGATTTTA ATTCTTTATT GGAAAAGACC TTATTTGTAA 600 

TATGTGCACT TGGATAATGG ATGGTATTTT TAGAATGAAT CCAACGAACT TTATTGTCTT 660 

TGTGTTCAGA CTTAATTTTA AATCCATATG GtAAGTTGTC ATCATGATtC ACTCTAATTC 720 

GATCATTAAC ATTCCAAAGT GATAGTaAAT TTgACGATgC CAAGTAATCT ATAAGTGCTG 780 

TTTTTATCGA TTGGCATATT AATTTGGAGT GTCTTGTCAT AATATTTTAA AATGTCTCCA 840 

TTAAAAATAC TAGAATATAA TGAAATGCCA TTATAATGAT ATATAAATGG TGAATTTAAT 900 

GCATAGTCTG ACATATAATC AATGCGATTA AATGAGCCTG TTGCATTTTG ATTTATCTTT 960

TTTATAAGCT GGTTTACATA GTTACTATGG TAATCATGTT GTTTCAACXrr TGATAATGAT 1020 

TGTTGATAAG GTTTGATTGC CATGTTTTTG TTGTTATCTA AAATGACGAT TTGTTGAATC 1080 


ATaACGATTA ATACTAATAT TGCAACGGTT AAT 1113 

(2) INFORMATION FOR SEQ ID NO: 724: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 464 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 724: 
TGCCGTTTAG CAAAATCTAG TATCGCTTGA TGATCAGATT CTGAAATTTC AGTGTGTACT 60 
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GATTGATTAA GTTTATATGC AAGTGCATGT TCTCGTCCAC CAGCACCAAT TACTAATACA IBO 

TTCATTTATG TTACGCX:CCT TTAAAATTAG TGTTTAAAAT GTCGAGTGCC TGTGACTACC 240 

ATTGCAATAC CATGTTTATT AGCCATATCA ATTGAATCTT GATCTTTAAT CX5AACCACCC 300

GGTTGGATAA TTGCCTTTAT ACCATGTTGT GCTGCAAGTr CAACTGTATC TCCCATAGGG 360 

AAAAATCCAT CAGATACTAA CGgCTACATG ATCATTGATT TCAATAGCTC TCTCTAACGC 420 


AATTTTAGCA GCACCGACAC GATTCATTGG CCAGCAnCTA TACC 464 
(2) INFORMATION FOR SEQ ID NO: 725: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1440 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 725: 



GTTAAATnCA nTATTTnAAA TaTATTACCT TATTAGAAAA AGTCGTAATA TGAGGTGTAC 60
AAATGACGCA AATTTTAATA GTAGAAGATG AACAAAACTT AGCAAGATTT CTTGAATTGG 120
AACTCACACA TGAAAATTAC AATGTGGACA CAGAGTATGA TGGACAAGAC GGTTTAGATA 180
AAGCGCTTAG CCATTACTAT GATTTAATCA TATTAGATTT AATGTTGCCG TCAATTAATG 240
GCTTAGAAAT TTGTCGCAAA ATTAGACAAC AACAATCTAC ACCTATCATT ATAATTACAG 300
CGAAAAGTGA TACGTATGAC AAAGTTGCTG GGCTTGATTA CGGTGCAGAC GATTATATAG 360
TTAAGCCGTT TGATATTGAA GAACTTTTAG CAAGAATTCG TGCAATTTTA CGTCGTCAGC 420
CACAAAAGGA TATTATCGAT GTCAACGGTA TTACAATTGA TAAGAACGCT TTTAAAGTGA 480
CGGTAAATGG CGCAGAAATT GAaTTAACAA AAACAGAGTA TGATTTACTA TATCTTCTAG 540
CTGAAAATAA AAACCATGTT ATGCAACGGG AACAAATTTT AAATCATGTA TGGGGTTATA 600
ATAGTGAAGT AGAAACAAAT GTCGTAGATG TTTATATAAG rTATTTACGA AACAAGTTAA 660
AACCATACGA TCGTGACAAA ATGATTGAAA CAGTTCGTGG CGTTGGGTAT GTGATACGAT 720
GACAAAACGT AAATTGCGCA ATAACTGGAT TATTGTTACC ACGATGATTA CGTTTGTCAC 780
GATATTTTTG TTTTGTTTAA TTATTATTTT TTTCTTGAAA GATACACTGC ATAATAGTGA 840
GCTTGATGAT GCaGAACGAA GCrCaAGCGA TATTAAT/UVt TTATTTCaTT CTAAgCCTGT 900
TAAAgATATA TCTGCaTTAG ACTTGAATGC aTCTTTAgGT AAtTTTCaAG AGATAATTAT 960
TTATGATGAG CATAATAATA AATTATTTGA GACATCGAAT GATAACACAG TGAGAGTTGA 1020
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ATATTTAATT ATTAAAGAAC CAATTACAAC GCAAGATTTC AAAGGGTATA GCTTGTTAAT 1140 

TCATTCACTA GAAAATTATG ATAACATCGT AAAATCATTG TATATCATTG CGCTGGCATT 1200 

TGGAGTGATT GCAACAATTA TAACTGCCAC AATCAGTTAT GTATTTTCAA CACAAATTAC 1260

TAAACCGCTT GTCAGTITAT CAAATAAAAT GATTGAGATT CGACGAGATG GTTTTCAAAA 1320 

TAAATTGCAA TTAAATACAA ATTATXSAAGA AATAGATAAT TTAGCAAATA CX3TTTAATGA 1380 


6ATGATGAGC CAAATTGAAG AATCATTTAA TCAACAAAGA CAATTTGTTO AAGATGCGTC 1440 
(2) INFORMATION FOR SEQ ID NO: 726: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 375 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 726: 



TGATATTCCG ACTTGCGAGG ATTATTGCGC ATTAAGGATT AACGCAGTAT GAGTCCAAAG 60 

ACGCGCGACA CCTGGAAAAA GGATTACCTA ATGCCTTATT TACAGTAACC TTGTATGATA 120

AAGATCGGTT AATTGGTATG GGTAGAGTGA TTGGCGATGG CGGAACTGTT TTTCAAATTG 180 

TTGATATTGC AGTTTTGAAA AGTTACCAAG GTCAAGGTTA CGGCATCTAA TTATGGAGCA 240 

TATTATGCAA TATATTAAAG GTGTGGCTGT TGAGAGTACA TACGTTATCT GATTGGCAGA 300 

CTACCCAGCG GGnTAAATTA TATACCAAAT TTGGGTTTnA TACCTACCGA ACCAGAnTCC 360 

AGGCGGTGAT GTATG 375 




(2) INFORMATION FOR SEQ ID NO: 727: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 471 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 727:

TAAAAAACTT AGAGATGAAA ATCATATTAT TTATCTGTTT TTGGGACAAT TTTTTACTAA 60 

AAATGAAGAT CCATGGCATC AAATACTTAA TGATTTAGAA GTTACAAATT CTGTTGATAA 120 


TTTTTTAAGG TCAATAAGTA ATAAGGCCAA AGAAACAAAA AAAAGAGCTT TTATTATTAT 180 

TGATGCGCTT AATGAAGGTG AAGGTAAAAG GTTATGGGGA AATTATTTTC AAAGCTTTAT 240 



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AGATGTTATA TTACCCAAAA ACOCAATACA AGATAACAAT ATTGTAGTAT TTCAGCATGA 360 
AGGTTTTAGT AAGGAAGAAA ACTATAATCC AATTGTATCT nTTGTGATT TTTATGGATT 420 


AGAGCTACCT AAGTTACCTA TATTAAATCC AGAATTCAAC AATCCATTAT T 471 

(2) INFORMATION FOR SEQ ID NO: 728: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1750 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 728:



TAGGATGTTC CACCCTgGCA ATTGcTCTAA GACATCtACa GCAGTAGAAT ayCCTGTTTT 60
CGTCtTTTTA ATAACAGGTA ATTGTAATGT CTCAAACAAT ACAACACCTA ATTGCTTAGG 120
AGAATTTATA TTAAAATCTT CACCAGCTGC ATCATGGATA TTTCGAATCA AGACGTCTAA 180
TTTTTCTTGA ATTTCTTTTT CCATTTCTTC TAAATCATGA ACATCTGTAA ATATACCAAT 240
TTCTTCCATT TCACTTAAAA TCTTAGCTAG CGGTAGCTCT AAATCAGCCA AGAGTTCTAC 300
CTGATTGTAT TCTTCTAATT GTTTATCCAT ATTTGGTTTC GCAAAGTAAA TTGCATCAGT 360
AATAGAAGCA ACATATGGAT TTAAAACATC ATCTTCAGGT ACCTTAAATT TCTTACCTTT 420
TCCATATATA CTCACATCGT CTTTCACAAA ACTTTGACCG TACAATGAAA CAACTGATTG 480
AACATCACTA ATCGTACGAO ATGGATCAAT TATATAACTG GCCAACATAA TATCGAAAGA 540
AATATTTTGA ATATCAATCC CCAATCTATG TGATGCTACA TATGTTTTTT TAGCATCATA 600
TACGACTTTT TTCGAATTCG GATTTTCTAA CCATGAAACT AGTTCGACAT AATTATTTAT 660
GTCATCCGCA TTAATTACAA TATGTTTCTC ACCTGtAAAT aAAGAGAATT TTAAAAtATT 720
ATTtCGCAAA TAGtTACCAC CGtCTAATTC GAAATGGATG GCCGCTTCTT TcAATGAAGT 780
AAAATCCAAT ATTATCAAAA GACGTTTCCA ATTTCAAATG TCTTtTCTAT TGCATCTTCA 840
ACGcTtGCTG ATTGaTCAAT GTCAGCCAAC AATTGTTTGA ATTCTAACTT CTTAAACAAT 900
TCGATTTTTT CTTGTTGTTC ATCTtGAkGA GTCATTAACG TATCTTCaAG TTTTACTTCa 960
ATCGGACTAT CTACATTAAt CGTTGCTAAT TCTTTACTCA TTAATGCATC TTCTTTGCTA 1020
TTTTGAAGTT TTTCTTTTAA CTTTTTACCT GAAATTTCAT CTAAATGTTC ATAGACACCT 1080
TCTACTGTGT CAAATTGGTT TAGCAATTTT ATTGCTGTTT TCTCTCCAAC ACCTGCAACA 1140
CCTGGTATAT TATCAGAAGT ATCTCCCATT AATCCTTTCA TATCAATAAT TTGATTAGGT 1200
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CCTTTTTTAG TGTAATAAAT GGTTACATTA TCCXjTTGCAA GTTGTGTTAA ATCTCGGTCT 1320 

CCCGTAATAA TAATTGTCTG AAAGCCCGCT TTATCTGCTT CTTTACTTAA AGTTCCGATA 1380 

ATATCATCTG CCTCATAGTT ATCTAATTCA TAACGTTTAA TATCATAAGC ATCTAATAAT 1440

TGGCGAATAT AAGGAAATTG CTCACTTAGT TCAGGCGGCG TTTTCTGGCQ TCCACCTTTA 1500 

TATTCACTAT ATTTTTCATG TCTGAAAGTC GTTTTACCTG CATCAAACGC TACTAAAAAA 1560 


TGATTTGGCT TTTCTTCTTT TAAAATCTTC TCTAGTAACA TTGCAAAACC ATATACTGCA 1620 

TTGGTATGAA TGCCTGCTTT GTTTGATAAC AAAGGTAATG CATAAaAAGC TCTAAAACTT 1680 

AAGCTATTAC CATCGaTTAA TACTAATTTA TTCACAATTT TAACCTCCAG AACTAATTTA 1740 


TATATnTGTG 1750 
(2) INFORMATION FOR SEQ ID NO: 729: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 439 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 729:
AAATTAATTT TAACATCCTT TCAAAATAGT TTTAACGGAT CcTCCcAAAA CGTAAACTCA 60
CATCAGTTAC TTGTAACATG CATTTTCTCC TTTTTTTCAT TCGATATTCT AACGGAAGAA 120
TTATATCATA TTATCGTCAC AGTTTCGACC TCATATAAGT TGTAATGATA GAATGACTCA 180 
CACATGTTAT AATAATAAAG AATACAAGAA TCGAAGGAGA ATAAACATGG CATTAGACAA 240 


AGATATAGTA GGGTCTATAG AATTCCTTQA AGTAGTAGGG TTACCAAGGT CAACTTACCT 300 
TTTAAAGGAC CAACGGTGAA ACGTAAAGTT AAACCAATCA GAATGAACGT GATGATTGAA 360 
TTAGAAGTAG GGGAGnATAT AGTTCCTCAT TTATCCAACC GGTCAGGTGG ATTATTGCAC 420 


TCAAATATGC nGnTATACG 439 
(2) INFORMATION FOR SEQ ID NO: 730: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 481 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 730: 
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10 






AGTTTAGGAT TGATGACAAA GCCATTACGA CCAACAAATC CTAAACCTGC ACGTTCTGCT 120 

ACTGCCCTAT CTGATAATAC ACCCGTATCT ACCATAGATT TGATTTCAAC ATCTGGAACT 180 

TTAGATTCAA TAAATGCAGC TAACATGTCT AATCGTTTAC GCATAATTGT ATGATAATCT 240 

TGACCCCACG ATGCTCTAGC AAATAAGCCT CTGCX5ATCAC CTCTAACACT CTTAGGTGCA 300 

CCTTTCAGTT TGTTAGGATA ACCAACTGCA ATTGCTATGA TTGACCTTGC TGTTGGTAAG 360 

GATAATTTAG GCTCCGTTCG TAAAGCAATA TCAGATTCTT CAAaTCCTGA GGCATAACCA 420 

TTTGCATGAT ATGCTTCTAG CTTTTGCTTC AATTCATCAA AGGGnTCGGC AGTAGTAAAT 480 

C 481 
(2) INFORMATION FOR SEQ ID NO: 731: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 828 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 731:

GTGATGAAAA ATCTTTTTCT ATTGATTGAG AAAATAGTTA TTTATATAGT TTTTAAAGAA 60 

AAAAAGTGAT AATATTTAGC TAATCTAATG AAAATTGATT TATTGGACAA AAAATACATT 120 

TTAAATGAGT AGAGGAGGCT GCAAwTGGGT TACAGAACTT TAAAAAGTAT TTTTCATGAA 180 

CACAATGAAA GTAAAATGAA GGAAGAGTAT ATTAAAAGAT TTAATTCTTT mGCTTCTTTC 240 

AATACTAATA TTAATATCAT ACCTATGGAA AATGGAAAAA AAGTTAATGA TTTGGAATAT 300 

CCTCTATTCT TTATGGTGAC TAAAAATCTA TCAAAAAAAC AAGAATTAAT ATCAATTAAT 360 

AGTAGAAAAA TTGATAGAGC ACTTAATTCT TTACCATATG CAGCTAGAGA ACAATATTTT 420 

AATGATTTAT TAATCGATGA ATTACAAAGT ACTAATGAAA TTGAAAATGT ATTTAGTACT 480

AAACAAGAGA TTGCACATGC GTTAAATAAC CAAGCATCAG AATTTCTTAA GTTCAGAGGC 540 

CTGGTGGATC AATATAAAGA GATAGAACTT AATAAAAAAA TTAAAGTTGA TAATGTAAGA 600 

GACATTAGAG CGATTTATGA TAAATTAGTT TCAAATGAAA TTAACGAACA AGATAAGTTA 660 

GATGGAGAGC TATTTCGTAA AAATTTTGTC GGTGTGCATG ATGGGTCAAC GAATAAATAT 720 

ATACATGTTG GGTTACAACC TGAAACCAAA ATTGTTGAAT TTATAGGTGA AATGCTAACA 780 

TTTTTAAAAT ATTTTGATGC GCCTCAGCCG TTCAAAATCA TGGCTAGT 828 
(2) INFORMATION FOR SEQ ID NO: 732: 

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(A) LENGTH: 1622 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 732: 



CGTATGTTTC GAATTTTATG nATTGCATTG GATAATAGTA CCAGTTAAAA GAACTAAGAT 60
TGTAATTATC CCCCCTTAAA ATTCAAGAAT ATATTTTCTT AATCTATCAG CAAGAGGTTA 120
TATTTCAAAG TGTTTATTTT TAAAACAACA TCTAAAAGCC ATTTcATAAt AAwGTTAAAT 180
CaTTAGaATG TATAAAGATT CCaATTAATA AAAATA6ATA AAATGCAATA AAGTTCCAAA 240
TACTTATTAT TATTCATCAC AAATTACGTG ATGCCCTCTA CAACACTAAA TCAAGGATGA 300
TATTTA'iTlT ATATACATTA GCATTCATTG TTTGCTTACC CACATTATCC TGTTTAC3TPCT 360
i"ri~i~iATAAC TCTAGTCTTG AATGTCATAT TTACAAAGAA AAAAACACTA AAATATTTAA 420
AACTAACACT ATTTATTCAC TTTGTTCTTT TATAGCACCT TUTATOTTTA TTTTCTTC3TA 480
TTCAATAAAT AGTATGGAGA TGATAATTTA AAGTAGATAT ATTTGAAGTA TACTATAGAG 540
TAAATTATAT TGGGGAAATA TGTAAACGAA CCGGTTGATT TTATAGTGGA TTAACACTTC 600
ATAGAGAATA TAATCAAGAA CAAAAACAGT CAATAGGTGT GAATnTTTT AAAGACGGAT 660
CTGTTGCTAC TAAAAAAAGA AGTATACAAT TCAATAGTTA AACCTTAAAA CAAGAAATAT 720
TATTCAAAAT CAATGAATTT CCTATCTTAT TAGTTTTAAC AATATTTATT CTCATAGAGT 780
TTGTCCAATT AAATATAGAT GATTCAAATA TTTATAAGCA TACAAAAGAG CAGTAAGACA 840
TTTTCTAATA GAAAATAACT TTACTGCTCT TTGTACATTA CTCATCAACT ATTGTAAATT 900
AAATAATAAA TAACTACCTA TTTTATTATT CAGCTAAAAA ATGCTCTGCT AAATATTTTG 960
AGTTAATGCT TGGATTTCAG 1020
GGCGTGCATT TTTCATAGCA ACTGTATAAT GACCAAATTC AAACATTGCT CTATCATTGT 1080
CGCTATCTCC AATAACTAAC GTTTCTTCTT GATGAATACC AAAATGTQCA ATCATTTCTT 1140
TAATGCCTGT ACCTTTATCA 6TTTGATAAG CCATTGTTTC CGCATTAAAT CTTGAT6AAT 1200
TTGArACACT AATCTGTAGT TGCACATGAT TTTGCTTTAA TTCATCTCTA AATGCTGTTA 1260
TTTTTTCTAA ATTAGAACTG AATAGATAAA TTCACCTTCA GGAAACTTAG 1320
TTACCCAATC TATCTTACCA GCAAGCGCAT CTTGTCTTGA AGACCATTCA CTATGACTTA 1380
CGCCATTAAT AGGATCTTGA CTACGAATCA TATCTCGCAT CCATGTTTCA TCTTCTTTTA 1440
AAGAAACTCT ATTACCTTCA AAAGGAAATA CCTCATAATA AATTTGTTGG CGCTTAGCTA 1500
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CTACTTCTCC AATTGTTCCA TTTCAACTAA TGATGCCATT AACCGCAAAA TCTTGAGGTA 


1620 




CA 1622


(2) INFORMATION FOR SEQ ID NO: 733: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 472 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 733:




AATGCCAnAT AAATCCAAAT TCTTTTAACG AACAAGAGTG CCCAAGTTAT TGCTGATCGT 60
TATGCCAGAT AnGCATATCA ATGATAATTA TGGTTTAGAA AGAATTTCTA AGACAAATCA 120
TGGATATAAT TATGTGTATT CCAATGATAA TTCAACTAGT AAGCAACATG TAAGTATTTC 180
AAATCAAGGC ATAATAACGA AATAATAGAT GGAACAGTGT ATTCTAATTG GATATACTGT 240
TTTTATTTTG CAATAATTTA ATTTAAAAAG GTGAATTCAA CTTATAAAAT GATGTAAATG 300
TTATGTCAAA ATCAACCAAT CCGTAATGTA TTTTAAAATG TTAATATAGT TCTGAAGAAG 360
TATAAATGAG GTGTTGAAAT GGCTAAAAAT AAGAAAACgA ACGCGATGCG TATGCTTGAT 420
CGTGCAAAAA TTAAATACGA AGTTCATAGC TTTGAGGTAC nTTAAGAACA TT 472


(2) INFORMATION FOR SEQ ID NO: 734:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 519 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 734: 




CATAGCCCCA AATTTTTTCT ATTATTTGTT CACGAGTAAA GATTTGCTTA GGACGTGCTG 60
CAAGCATAAA TAATAATTGA AATTCCTTGT TCGGTAACGT CATCGTTTTA TTAGATACTT 120
GGAGTTCCAA ATAGGATTGG TTTAGCGTTA AGTTGCCAAT AGTCATTTCT GAATTTGAAT 180
TGATATTATA TCGACGTAAT ACAGCACGAA TTCTAAAAAT AAGTTCCTTA ACCTCAAAGG 240
GTTTGGTTAC ATAATCGTCA GTACCGCTTA TAAACGCACG CTCTTTGTCA CTAAGTGCAT 300
CCCGCGCTGT TAACATAATA ACTGGTATAT CATAATCATT TTTTAATGTA TTACATAATT 360
GAAAGCCGTC CATACCATCC ATCATAATAT CTACCACTGC AATATCGaCA CGCTGTTTTT 420
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CACTATCTrr ATTTACAGTT TTGAAAATGA ATCAAGATAC TGTATTGAAA GCGCGATTGA 660 

AAATGTTTCA TTTTTCTAAA 6AGCGTTTAT TAATCATTCG TACGTTGATT ACATGGTTTT 720 

ATACTATGTT ATQGGnGTAT CGnTGGTGTA GTTTGGAATG TGGTTCAGTA TTCCGAATnA 780

(2) INFORMATION FOR SEQ ID NO: 737: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 946 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 737:



TATTTAATTT CTTnATTnTG GTTTTTAATT nTAAGATGAC CATAAATGAA TTTCTTCTGC 60
TGTCATAGAA TTAGAGCAAA ATACGAGCAT AAATTTAACT AATTTCTTGC TTGTCTTATA 120
GTCTTGATTG CCTTGTAAAA ATGTTAGTAT TAATTGCTTC ATCGTTTCGA ATTGTTGTGT 180
TGATTTAATA GCGATTGCTA AACCTATATT TTTTTCGAAA AGCTTTTCAA AACATCTATT 240
TAACAGGTTG TAATTCGTTT GGCTAATCTC AATATCGTGA ATATCGTCAA TGAAAACGAC 300
AGGTTTTGGT AAGTGCTCAA AGTTAATATC GTAATATTCA TTGAAAAtAA ATTGAAATAG 360
TTCATTGAAA TTACTAAATC GAACAAATAC TTTGGCACGC TGACTTCGAT CTTTCGGATT 420
AAAATCATCT ATATGAATAT CTGTTGTATT TGTAGCCAGT TGAGCTGTTA AATCAATTGT 480
TGAAATTAAC TCAGTGAATT GTGATACGTT ATCTTGTTG6 AAGTGTATTT GTGGTGGTTC 540
AGTAATCTTG GAAACGAGTG AACGGAACTG TTTAGGACTA AAATGCAAGT AATTCTTAAA 600
TTGATTTGCA AAATTTGTAT GACT6TTAAA ACCGGCCAAT TCAGAAACAG TTGTAATAGA 660
ATGTTTTGTA GAAAGTAATA AATTGATGGC GTTAACAAGT TTAATACTTG TAAAATAATC 720
TTTAAAATTC ATACTCAAAT ATCGAACAAA AAGATTAGAA CAATATGATT CAGAGATGTT 780
GCAATGCATC GCTATATCTT TTAAAGACAA GTGCGCATCA ATATkGTCAT GAATATAGTT 840
TACACAGTCT ATGAACACCG GATTGCTTAA A6CAATATTT GGTAAATATT CATGATCTAT 900
TCTTATAAAA GCTTCTTTAA GTAAAGTGTC TATnATACTT TGACCT 946



(2) INFORMATION FOR SEQ ID NO: 738: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 397 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 738: 

TTTATTTGAT GATGTACTAA AGCAAGATGA AAGATTTGTT ATTGTAGTTC AAGCATTAGA 60 


AGAAAAGAAC GGACAATTAT TAAAGAGAAC TTTAAGGGAA TATCCCGGTT TAAACCATAA 120 

ACAAATGAAT GATTTATTTA TGCACTTAAA GGAATTATTT TCCGAAGAAT CATTTGCTGA 180 

AAACCAATCA GCGTTCAGTA TTACAGTTTA TACAAATTTA QATTATACTG CTGACCAAAT 240 

ATATGCTCAT GTAAAACGIT TCAGAGGTAA GCATGACTGG ACACAAACAG CTAAATAAAA 300 

TCAAGATGCT TGCAGTGgAT TkTAACmATT AATGtTAGaT gTCmCyGcAT TTTTaATGCC 360 

ATTCGGTAAT TAATCCGGAT GGTCCAATTT AATTAAC 397

(2) INFORMATION FOR SEQ ID NO: 739: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 915 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 739:





CAAGGTATTT CATTTGACCC ATTCACACCA TTAATAGTGG CAGCAGCATT ATACTTTGTC 60
TTAACATTTG TACTTACACG CATTATGAAC ATGATTGAAG GGAGATTGAA TGCCAGTGAT 120
TAAAATAAAC AATCTTAATA AAGTTTTTGG AGATAATGAA GTTTTAAAAs ATATCAATCT 180
TGAAATCAAT CAAGGGGAAG TAGTAGCAAT AATAGGTCCA TCTGGTAGTG GTAAAAGTAC 240
ATTGTTAAGA TGTATGAATT TATTAGAAGT ACCTACTAAA GOTCAAGTGA TTTTTGAAGG 300
CAATGACTTA ACGGAAAAAG GGACACAAGT AGATAAACTA CGTCAAAAAA TGGGTATGGT 360
ATTTCAAAAC TTCAACCTAT TTCCACATAA AAAAGTTGTC GATAATATTA TTTTAGCTCC 420
TAAATTATTA AAGAAAGATA ATAACGATGA ATTACATAAG GAAGCATTGT CGTTATTAGA 480
TAAAGTGGGA TTAAAAGAAA AAGCAGATGT ATATCCGAAT CAATTATCAG GTGGTCAAAA 540
ACAAAGGGTA GCAATTGCAA GAGCTTTAGC AATGCATCCA GATGTTATTT TATTCGATGA 600
ACCAACTTCA GCATTAGATC CTGAGGTAGT TGGTGATGTA TTAAAAGTAA TGAAAGACCT 660
AGCCAAAGAA GGTATGACCA TGGTGGTTGT GACACATGAA ATGGGATTTG CCAAAGATGT 720
AAGTGACAAA GTCATATTTA TGGCAGATGG CGTTGTCGTA GAGTCAGGCA CnCCAGTCGA 780
AATATTTGAA CAACCGCAAC ATGAAAGAAC ACAAAATTTC TTAGCAAGAG TATTATAACA 840
ACCTAAOGAG GCTTGAATAT ATGATACGCA CCACAAAGTT ATATCATATA TTCAAGCTTT 900
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(2) INFORMATION FOR SEQ ID NO: 740: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 435 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 740: 
AGTAAATGCG ATGTTGCAT6 GAGGCTATAG CAATGAGCAA TAAAGCTTGG GGCGGTAGAT 60 
TTGAAGTACA ACCTGAAGAG TGGGTTGACG ACTTTAACGC ATCCATTACT TTTGATCAAA 120
CGCTCATAGA TCAAGATATC GAAGGCAGCA TTGCACATGC AACTATGCTT GCGAATCAAG 180 
GCATTATTAG TCAACAAGAC AGCGAACAAA TTATACAAGG ACTAAAATCT ATTCAACATG 240 


ATTATCATCA AGATCAAATT CAATTTAGTG CATCATTAGA AGATATTCAT TTAAATATTG 300 
AACATGgAAT TAATTAAACG TATCGGTGgA TGCTGGTGGT AAGTTGCATA CTGGACGCgT 360 
AGtAAACGAT CAAGTTGcAA CAGACATGCA CTTGTACACT GAAGAAACAA GTGCAAGnnn 420 


TCATCGCATT GGATG 435 
(2) INFORMATION FOR SEQ ID NO: 741: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1198 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 741: 



AATATCATAA CCAAAGCTTG TCATTACTTA TATGTAAAGA TTAGAGGCTA ATTACATAAT 60
CCTAACTTAT TTAACAGGAA GTCATGAAAA CTATATTACA ACTAAAAGTA TAACTTTCCA 120
TCACCAAAGT CAGAATTTAT TGAAACACTC AACTGATAAT ATAGAAGGCT AATTATAAAA 180
ATACCTGGCG ATTAATAAAC TGTTTCOGTT TTTTAGCACT TAATATATTT AAAAAGACAA 240
ATAGAGCCCT TTCAGGCTCT ATTTnCTGTA TCCTTAACAA CAGAAGCTGT TAAAACTACC 300
CGTCTTTTCa CAaCCAAaGc TaCAAAAaaG aTgGCTAGTT ATTCTTTCGT CACCCGCTGA 360
ATCATTAGTG TTGTTATTTC CTTTAACTTG CACGTCTAAA TCAAGAaCTT TTTCCAAaGA 420
TaAAACCCCC TATTTAAAAG TTGAAGTTAA GACCCcTTCA ATTGTCATAT TAATCATTAC 480
CACACAATAA ATCAATGAAT TTAATAATAA TTATTTGTTA TTAATTTGTA AATCCTCATT 540
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